This episode is brought to you by Hyperbolic and the MLflow team. Learn more at hyperbolic.ai and MLflow.org.
Why AI Coding Agents Are Moving to the Cloud — With Zach Lloyd, CEO of Warp
Zach Lloyd is the founder and CEO of Warp, the AI-native terminal and agentic development platform trusted by over a million developers. Before Warp, Zach was a product lead at Google on Google Docs — giving him a uniquely deep intuition for what it means to build truly collaborative developer tools at scale.
Why Agents are Driving Software Development to the Cloud // MLOps Podcast #371 with Zach Lloyd, CEO of Warp
What we cover:
🏗️ Why agents belong in the cloud, not local sandboxes — Zach breaks down why the "set up a local dev box for your agent" approach is fundamentally flawed and what cloud-native agent execution actually looks like in practice.
🚀 GitHub is losing collaborative code review — One of the episode's sharpest takes: the hero features of GitHub, like collaborative code review, are migrating into agent workbenches. Zach explains why this shift is structural, not cyclical.
📱 "Just-in-time apps" are replacing SaaS — The era of long-lived, learn-to-use-it software may be ending. Zach argues that agents will generate ephemeral, purpose-built interfaces on demand, and explains why most current app categories are at risk.
🤖 Introducing Oz — Warp's cloud orchestration platform — A first look at how Oz works, how Demetrios is already using it to automate podcast production, and what multi-agent orchestration looks like in a real team environment.
👁️ Agent observability and why it matters — Debugging, compliance, context management, and handoff/steering: Zach outlines the pillars every engineering team needs before trusting agents with production work.
🔐 Agent chaos is real — access control for AI — Why giving agents too much context is just as dangerous as giving them too little, and how Warp thinks about scoped agent permissions as you scale.
📦 SaaS for agents will look nothing like SaaS for humans — The 25-year investment in human-friendly UI is irrelevant for agents. Zach explains what the new infrastructure layer for AI workers will actually need.
⚡ Open-weight models will commoditize the coding agent space — With Nvidia investing $2B in open-weight models, Zach believes the current cost advantage that frontier labs hold is temporary, and he explains how Warp is positioning for that world.
🧩 Multi-agent orchestration patterns — Parallel agents, agent-to-agent handoffs, and why there's no single "right" pattern yet. Warp's Oz platform is being built for flexibility, not prescription.
This episode is essential for engineering leaders, platform engineers, and any developer trying to understand where their daily workflow is headed in the next 18 months.
🔗 Links & Resources:
Warp: https://www.warp.dev
Warp Oz platform: https://oz.dev
Zach Lloyd on X/Twitter: https://x.com/zachlloyd
MLOps Community: https://mlops.community
MLOps Community Slack: https://go.mlops.community/slack
⏱️ Timestamps
[00:00] Agentic Coding Review Shift
[00:29] Warp Collaboration vs Sandboxes
[05:22] Continuous Co-Creation in Teams
[07:00] Hyperbolic's GPU Cloud
[07:56] Skill Governance Framework
[14:41] Agents vs Browsers Analogy
[21:31] PR Provenance in Warp
[27:58] Agent System Commandments
[37:44] Harness vs ADE
[42:03] Adversarial Review Technique
[45:26] GitHub Limitations for Agents
[49:07] MLflow's GenAI
[50:06] Wrap up