Issue No. 17Sunday, April 26, 202676 episodes · 195 articles
The Throughline ↓
The Podcast Summary.

40 hours of podcasts, in 5 minutes.

TechDwarkesh Podcast

How GPT-5, Claude, and Gemini are actually trained and served – Reiner Pope

With Dwarkesh Patel, Reiner Pope · Sunday, April 26, 2026

Reiner Pope, CEO of MatX, breaks down the intricate details of how large language models like GPT-5, Claude, and Gemini are trained and served in cluster environments. He explains the critical role of batch size, mixture of experts, and parallelism in managing latency and cost, linking these technical elements to real-world AI API pricing structures. The discussion also ventures into the physical constraints of GPU rack design and the surprising architectural parallels between cryptographic protocols and neural networks.