109 - How Can We Align Language Models like GPT with Human Values?
Listen now
Description
In this episode of the podcast I chat to Atoosa Kasirzadeh. Atoosa is an Assistant Professor/Chancellor's fellow at the University of Edinburgh. She is also the Director of Research at the Centre for Technomoral Futures at Edinburgh. We chat about the alignment problem in AI development, roughly: how do we ensure that AI acts in a way that is consistent with human values. We focus, in particular, on the alignment problem for language models such as ChatGPT, Bard and Claude, and how some old ideas from the philosophy of language could help us to address this problem. You can download the episode here or listen below. You can also subscribe the podcast on Apple, Spotify, Google, Amazon or whatever your preferred service might be. Relevant LinksAtoosa's webpageAtoosa's paper (with Iason Gabriel) 'In Conversation with AI: Aligning Language Models with Human Values' #mc_embed_signup{background:#fff; clear:left; font:14px Helvetica,Arial,sans-serif; } /* Add your own MailChimp form style overrides in your site stylesheet or in this style block. We recommend moving this block and the preceding CSS link to the HEAD of your HTML file. */ Subscribe to the newsletter
More Episodes
In this episode, John and Sven answer questions from podcast listeners. Topics covered include: the relationships between animal ethics and AI ethics; religion and philosophy of tech; the analytic-continental divide; the debate about short vs long-term risks; getting engineers to take ethics...
Published 12/20/23
What does the future hold for humanity's relationship with technology? Will we become ever more integrated with and dependent on technology? What are the normative and axiological consequences of this? In this episode, Sven and John discuss these questions and reflect, more generally, on...
Published 12/20/23
In this episode, Sven and John talk about relationships with machines. Can you collaborate with a machine? Can robots be friends, colleagues or, perhaps, even lovers? These are common tropes in science fiction and popular culture, but is there any credibility to them? What would the ethical...
Published 12/20/23