PodcastsTechnologyThe MAD Podcast with Matt Turck

The MAD Podcast with Matt Turck

Matt Turck
The MAD Podcast with Matt Turck
Latest episode

119 episodes

  • The MAD Podcast with Matt Turck

    OpenAI's Dan Roberts: Why AI Can Now Make Discoveries

    04/06/2026 | 49 mins.
    Are we witnessing the first real signs of AI becoming a scientist? In this episode of The MAD Podcast, Matt Turck sits down with Dan Roberts, lead of the Foundations of Reinforcement Learning team at OpenAI, to explore one of the biggest shifts happening in AI: the rise of reasoning models, test-time compute, and reinforcement learning as engines of scientific discovery. Dan brings a rare perspective - from theoretical physics, black holes, quantum information, and deep learning theory - to explain how models are learning to “think,” why language may be such a powerful foundation for intelligence, what recent AI math breakthroughs really mean, and whether we are beginning to see AI systems that can contribute to science itself.

    (00:00) Intro: AI's wild week in mathematics
    (01:21) What OpenAI's Foundations of RL team does
    (03:08) Dan's journey: from black holes and quantum gravity to frontier AI
    (07:04) Are AI systems becoming useful for real science?
    (08:21) The AI math moment: Erdős, OpenAI, DeepMind, and Anthropic
    (08:52) Why the OpenAI result was an act of exploration
    (10:25) OpenAI vs. DeepMind: informal reasoning vs. formal proof
    (12:13) RL 101: learning by doing, not just watching
    (15:10) Why reinforcement learning works
    (15:58) How RL breaks: sparse feedback and long-horizon tasks
    (17:03) RLHF: how human feedback shaped early language models
    (18:48) Move 37, self-play, and the search for novel strategies
    (22:16) Explore vs. exploit in scientific discovery
    (24:49) Why RL may now be "the cake," not the cherry on top
    (25:46) Why RL started working with large language models
    (27:29) Is RL "sucking supervision through a straw"?
    (28:47) Why language may be the grounding layer for intelligence
    (31:46) A contrarian take on the Bitter Lesson
    (32:41) What test-time compute actually is
    (34:50) How RL gives models the ability to think
    (35:40) Verifiable rewards, math, coding, and the messy real world
    (38:00) What physics can teach us about AI
    (42:08) Is there a thermodynamics of AI?
    (43:08) From Erdős problems to Einstein-level AI
    (45:16) Is AI already doing original science?
    (45:51) How far are we from AI automating AI research?
    (47:41) Why Dan is excited about the future of science
  • The MAD Podcast with Matt Turck

    State of Enterprise AI 2026: Aaron Levie on Tokenmaxxing, Rise of Headless, and AI-Proofing Your Job

    28/05/2026 | 1h 12 mins.
    Aaron Levie, co-founder and CEO of Box, returns to the MAD Podcast with the clearest read in tech on what AI is actually doing inside the world's largest enterprises right now - not the hype version, the real one. After hundreds of Fortune 500 CIO conversations this year, Aaron explains why we're still in "day one" of the agent era, why one badly written agent run can now cost $1,000 in compute, and why progress at the AI labs is paradoxically slowing enterprise deployment. We get into the token cost shock now reshaping IT budgets, why coding agents have reached escape velocity while the rest of knowledge work hasn't, the rise of headless software and what replaces per-seat pricing, the emergence of the forward-deployed engineer as the hottest job in tech, why Aaron thinks the AI doomers are wrong about jobs, and where startups can still win as the labs move up the stack.

    (00:00) Intro
    (01:18) Silicon Valley engineering vs. everyone else
    (05:35) Are enterprise CIOs actually bullish on AI?
    (08:51) Tokenmaxxing & why your AI bill is about to explode
    (11:34) The myth of falling token costs and AI spend escaping IT budgets
    (17:37) The $5B startup hiding in AI compute
    (18:14) The mosaic of models inside every enterprise
    (21:28) Why coding works and the rest of knowledge work doesn't
    (25:53) The Bob and Sally problem: access control breaks agents
    (30:31) Will enterprise AI really take 10 years to roll out?
    (32:24) The capability overhang: why faster models slow diffusion
    (34:23) Data is the bottleneck (it always was)
    (39:02) The rise of internal forward-deployed engineers
    (41:23) Why the AI doomers are wrong about jobs
    (43:43) Headless software is inevitable
    (46:14) What replaces per-seat pricing
    (47:37) How Box itself is going headless
    (49:42) How the org chart actually evolves
    (1:00:33) Future-proofing yourself as an enterprise employee
    (1:06:40) Are we all just going to work for OpenAI and Anthropic?
    (1:07:11) Where startups can still win as the labs move up
  • The MAD Podcast with Matt Turck

    OpenAI's Yann Dubois: Why AI Progress Suddenly Feels Real

    21/05/2026 | 1h 13 mins.
    AI suddenly feels like it has crossed a threshold, and Yann Dubois, co-lead of the Post-training Frontiers team at OpenAI, joins Matt Turck to explain why. Yann’s team has led the post-training behind the company's reasoning models, including the recent GPT-5.5 release. In this conversation, we go inside the shift from raw model capability to useful, reliable systems: what changed with GPT-5.5, why reinforcement learning is moving beyond math and coding competitions into messy real-world work, how reasoning models like GPT-5.5 actually work, the difference between GPT-5.5 Thinking and GPT-5.5 Pro, why post-training has become one of the most important frontiers in AI, and why evals, model-as-judge, hallucinations, agentic workflows, GDPval, and continual learning are now central to the next phase of frontier models. Yann also shares why continual learning remains one of AI's biggest unsolved problems three years after ChatGPT, and where startups still have massive room to build as frontier models race ahead.

    (00:00) - Cold open
    (00:34) - Intro
    (01:30) - Why recent AI progress feels like a step function
    (04:13) - Model reliability & the rollercoaster of shipping 5.5
    (07:33) - How OpenAI structures vertical and horizontal teams
    (09:49) - Improving model efficiency and test-time compute
    (12:32) - Yann Dubois' journey from Switzerland to OpenAI
    (15:37) - Reasoning in 2026: Real-world utility vs verifiable rewards
    (18:34) - GPT-5.5 Thinking vs Pro: Scaling test-time compute
    (20:09) - How reasoning models become more efficient
    (23:23) - Pre-training scaling and overcoming the data wall
    (27:03) - Multimodal data, synthetic data, and embodied AI
    (31:05) - Demystifying mid-training and post-training
    (37:21) - Does RL create new capabilities in AI?
    (38:53) - The challenges and frontier of scaling RL
    (43:09) - Is building AI models a craft or a strict science?
    (48:21) - How AI models generalize across different domains
    (54:18) - How reinforcement learning cures AI hallucinations
    (56:04) - Negative generalization and conflicting instructions
    (58:05) - Can RL scale to law, medicine, and the broader economy?
    (1:00:19) - The evaluation bottleneck and Model as a Judge
    (1:04:21) - Continuous AI progress & continual learning
    (1:08:49) - Will foundation models eat the agent harness?
    (1:11:23) - Why startups should focus on the last mile of AI
  • The MAD Podcast with Matt Turck

    Why AWS and Azure Cannot Run Autonomous AI – Ivan Burazin (Daytona)

    14/05/2026 | 1h 5 mins.
    If AI agents are the new digital knowledge workers, where exactly do they do their work? In this episode of the MAD Podcast, Ivan Burazin joins us to unpack the emerging infrastructure stack for AI agents and explain why every agent needs its own secure, stateful "computer." We explore the technical realities of sandboxes, dive into why legacy, stateless hyperscalers weren't built for these new workloads, and break down the mechanics of microVMs and custom schedulers alongside a contrarian prediction on an impending CPU shortage. Finally, Ivan delivers an absolute masterclass on product-led growth, community building, and go-to-market strategy for technical founders.

    (00:40) Intro
    (02:13) What is an AI agent sandbox?
    (03:17) Security risks of running agents locally
    (05:17) Stateful vs. stateless hyperscalers
    (07:04) The history of cloud IDEs and the end of localhost
    (09:45) Do all AI agents need a sandbox?
    (12:26) Sandbox use cases: RL evals & background agents
    (14:10) Unpacking the emerging AI Agent Stack
    (16:20) The unsolved problem of agent memory and learning
    (19:37) Where sandboxes fit in the agent harness
    (21:35) OpenAI, Anthropic, and agent SDKs
    (23:06) Ivan's founder journey: From CodeAnywhere to Daytona
    (26:59) GTM strategies and building developer communities
    (33:48) Why customer support is your best GTM strategy
    (35:34) Leveraging Twitter during the AI super cycle
    (40:50) The technical anatomy of a sandbox
    (41:53) Why fast spin-up speeds maximize GPU efficiency
    (46:09) Firecracker, QEMU, and isolation primitives
    (49:58) Why sandbox snapshots and state forking matter
    (51:40) Why Daytona built a custom scheduler from scratch
    (55:24) The challenge of long-running stateful sandboxes
    (58:10) The build your own sandbox trap
    (1:01:03) Why AI agents might trigger a global CPU shortage
    (1:02:46) The future of the AI Agent Stack
  • The MAD Podcast with Matt Turck

    OpenAI Board Member Zico Kolter on the Real Risks of Frontier AI

    07/05/2026 | 1h 16 mins.
    What actually happens before a frontier AI model gets released — and who decides whether it is safe enough? In this episode of The MAD Podcast, Matt Turck sits down with Zico Kolter — OpenAI board member, Head of the Machine Learning Department at Carnegie Mellon, and co-founder of Gray Swan — for a deep conversation on the real risks of frontier AI. They discuss how OpenAI’s safety oversight works before major model releases, why more powerful models do not automatically become safer, how jailbreaks and prompt injection expose real weaknesses in AI systems, why AI agents dramatically expand the attack surface, and where frontier AI is headed next. A clear, practical discussion on OpenAI, AI safety, AI security, AI agents, frontier models, red teaming, reinforcement learning, and the future of AI governance.

    (00:00) Intro
    (01:32) OpenAI board role and Safety & Security Committee
    (03:53) How OpenAI reviews major model releases
    (05:33) OpenAI’s preparedness framework explained
    (09:46) Are frontier AI models getting safer?
    (12:33) Why AI safety does not come from scale
    (15:23) The four categories of AI risk
    (19:38) Doomerism vs accelerationism in AI
    (24:11) The six-month AI pause debate
    (26:20) AI safety as a global effort
    (28:04) How Zico Kolter got into machine learning
    (31:05) OpenAI in the early days
    (34:14) Why Carnegie Mellon became an AI powerhouse
    (38:43) What Gray Swan does in AI security
    (40:44) AI safety vs AI security
    (43:15) The GCG jailbreak paper
    (49:19) How AI labs responded to jailbreak research
    (50:19) State-of-the-art AI defenses
    (52:32) State-of-the-art AI attacks
    (54:22) Why AI agents expand the attack surface
    (58:39) Are AI agents ready for production?
    (59:40) Mechanistic interpretability explained
    (1:02:31) Will AI be safer in two years?
    (1:03:46) Reinforcement learning and self-improving models
    (1:08:09) Do post-transformer architectures matter?
    (1:09:29) Best research directions in AI now
    (1:11:00) Zico Kolter’s Intro to Modern AI course
    (1:14:53) Why modern AI is simpler than people think
More Technology podcasts
About The MAD Podcast with Matt Turck
The MAD Podcast with Matt Turck, is a series of conversations with leaders from across the Machine Learning, AI, & Data landscape hosted by leading AI & data investor and Partner at FirstMark Capital, Matt Turck.
Podcast website

Listen to The MAD Podcast with Matt Turck, Darknet Diaries and many other podcasts from around the world with the radio.net app

Get the free radio.net app

  • Stations and podcasts to bookmark
  • Stream via Wi-Fi or Bluetooth
  • Supports Carplay & Android Auto
  • Many other app features
The MAD Podcast with Matt Turck: Podcasts in Family