John Schulman (OpenAI Cofounder) - Reasoning, RLHF, & Plan for 2027 AGI
Listen now
Description
Chatted with John Schulman (cofounded OpenAI and led ChatGPT creation) on how posttraining tames the shoggoth, and the nature of the progress to come... Watch on YouTube. Listen on Apple Podcasts, Spotify, or any other podcast platform. Read the full transcript here. Follow me on Twitter for updates on future episodes. Timestamps (00:00:00) - Pre-training, post-training, and future capabilities (00:16:57) - Plan for AGI 2025 (00:29:19) - Teaching models to reason (00:40:50) - The Road to ChatGPT (00:52:13) - What makes for a good RL researcher? (01:00:58) - Keeping humans in the loop (01:15:15) - State of research, plateaus, and moats Sponsors If you’re interested in advertising on the podcast, fill out this form. * Your DNA shapes everything about you. Want to know how? Take 10% off our Premium DNA kit with code DWARKESH at mynucleus.com. * CommandBar is an AI user assistant that any software product can embed to non-annoyingly assist, support, and unleash their users. Used by forward-thinking CX, product, growth, and marketing teams. Learn more at commandbar.com. Get full access to Dwarkesh Podcast at www.dwarkeshpatel.com/subscribe
More Episodes
Published 05/15/24
Mark Zuckerberg on: - Llama 3 - open sourcing towards AGI - custom silicon, synthetic data, & energy constraints on scaling - Caesar Augustus, intelligence explosion, bioweapons, $10b models, & much more Enjoy! Watch on YouTube. Listen on Apple Podcasts, Spotify, or any other podcast...
Published 04/18/24
Had so much fun chatting with my good friends Trenton Bricken and Sholto Douglas on the podcast. No way to summarize it, except:  This is the best context dump out there on how LLMs are trained, what capabilities they're likely to soon have, and what exactly is going on inside them. You would be...
Published 03/28/24