2 - Learning Human Biases with Rohin Shah - Listen - AXRP - the AI

2 - Learning Human Biases with Rohin Shah

Listen now

Description

Link to the paper - On the Feasibility of Learning, Rather than Assuming, Human Biases for Reward Inference Link to the transcript The Alignment Newsletter Rohin's contributions to the AI alignment forum Rohin's website

More Episodes

See all »

37 - Jaime Sevilla on AI Forecasting

Published 10/04/24

36 - Adam Shai and Paul Riechers on Computational Mechanics

Published 09/29/24

New Patreon tiers + MATS applications

Patreon: https://www.patreon.com/axrpodcast MATS: https://www.matsprogram.org Note: I'm employed by MATS, but they're not paying me to make this video.

Published 09/28/24