Issue No. 20Sunday, May 17, 202685 episodes · 312 articles
The Throughline ↓
The Podcast Summary.

40 hours of podcasts, in 5 minutes.

TechDwarkesh Podcast

What rebuilding AlphaGo teaches us about self-play, RL, and future of LLMs - Eric Jang

With Dwarkesh Patel, Eric Jang · Sunday, May 17, 2026

Eric Jang discusses his experience rebuilding AlphaGo from scratch, detailing the intricacies of Monte Carlo Tree Search (MCTS) and neural network architectures. He explores AlphaGo's unique self-play reinforcement learning approach, contrasting it with LLM training methods, and delves into the philosophical implications of AI solving NP-hard problems. The episode concludes with insights into the current capabilities and limitations of using large language models for automating AI research.