04 Digesting The Data
Listen now
Description
Dónal and Ciarán discuss the vast ocean of data that Large Language Models (LLMs) depend on for their training, covering some of the issues of access to that data and the biases reflected within it. This episode should help you better understand some aspects of the AI training process.Topics in this episodeWhat data is being used to train models like ChatGPT?What are "supervised" or "unsupervised" machine learning methods?How have the owners of copyright data, like news organisations, reacted...
More Episodes
Dónal and Ciarán discuss some of the concerns about misinformation and disinformation that have emerged with the rise of impressively capable GenAI models, and provide some detail on what their effects might be. They discuss the calls for regulation and how this has begun to take shape in the EU,...
Published 11/18/24
Published 11/18/24
Dónal and Ciarán talk you through everything that's emerged since capable Large Language Models entered the chat a few years ago - up to late 2024, at least! This episode covers the companies, models, and tools that have become central to contemporary GenAI, and will broaden your understanding of...
Published 11/04/24