Dark Secrets of Bert, Radioactive Data, and Vanishing Gradients
Listen now
Description
Lan presents a blog post revealing the Dark secrets of BERT. This work uses telling visualizations of self-attention patterns before and after fine-tuning to probe: what happens in the fine-tuned BERT? George brings a novel technique to the show, "radioactive data" - a marriage of data and steganography. This work from Facebook AI Research gives us the ability to know exactly who's been training models on our data. Kyle: Learning Important Features Through Propagating Activation Differences
More Episodes
This week we are back with our regular panelists! Kyle brings us a short article exploring science fiction impacting AI titled "Survey Finds Science Fiction One of Many Factors Impacting Views of AI Technology." George brings us an article about using thousands fo computers from universities,...
Published 09/16/20
We are back with other guest this week! We have NLP/ML research scientist, Fredrik Olsson joining us. He discusses the work "Why You Should Do NLP Beyond English." Lan brings us a news item, "Research News: DNA Storage." George talks about the article "Discovering Symbolic Models from Deep...
Published 09/10/20
Rachel Bittner, a research scientist at Spotify, joins us in our discussion this week! She brings us the paper "Few-Shot Sound Event Detection." Lan discusses an article about fairness of search results. George talks about a blog post from England about an algorithm grading exams and the...
Published 09/01/20