John Schulman (OpenAI Cofounder) — Reasoning, RLHF, & plan for 2027 AGI - Dwarkesh Podcast

Chatted with John Schulman (cofounded OpenAI and led ChatGPT creation) on how posttraining tames the shoggoth, and the nature of the progress to come...

Watch on YouTube. Listen on Apple Podcasts, Spotify, or any other podcast platform. Read the full transcript here. Follow me on Twitter for updates on future episodes.

Timestamps

(00:00:00) - Pre-training, post-training, and future capabilities
(00:16:55) - Plan for AGI 2025
(00:29:18) - Teaching models to reason
(00:39:45) - The Road to ChatGPT
(00:51:07) - What makes for a good RL researcher?
(00:59:53) - Keeping humans in the loop
(01:14:11) - State of research, plateaus, and moats

Sponsors

If you're interested in advertising on the podcast, fill out this form.

* CommandBar is an AI user assistant that any software product can embed to non-annoyingly assist, support, and unleash their users. Used by forward-thinking CX, product, growth, and marketing teams. Learn more at commandbar.com.

Get full access to Dwarkesh Podcast at www.dwarkesh.com/subscribe

About this episode