Top stations Podcasts Live sports Near you Genres Topics

Podcasts TechnologyAI Odyssey

Listen to this podcast in the app for free:

radio.net

Sleep timer

Save favourites

Download for free in the App Store

AI Odyssey

Anlie Arnaudy, Daniel Herbera and Guillaume Fournier

Technology

Latest episode

69 episodes

🎧 AI That Rewrites Its Own Brain: Meet the HyperAgent
29/03/2026 | 24 mins.
What happens when you give an AI system the ability to modify not just its answers, but the very process it uses to improve itself?
In this episode, we explore HyperAgents, a new framework from Meta and UBC that enables AI systems to recursively improve their own learning mechanisms. Unlike previous approaches where the improvement strategy was fixed by human engineers, HyperAgents can rewrite their own self-improvement code, creating a loop where getting better at a task also means getting better at getting better. The results are striking: improvements discovered in one domain, like reviewing research papers, transfer to completely unrelated tasks like grading Olympic math solutions.
Inspired by the work of Jenny Zhang, Bingchen Zhao, Wannan Yang, Jakob Foerster, Jeff Clune, Minqi Jiang, Sam Devlin, and Tatiana Shavrina, this episode was created using Google's NotebookLM.
Read the original paper here: https://arxiv.org/abs/2603.19461
When Agents Remember Their Mistakes: The End of AI Amnesia
22/03/2026 | 21 mins.
What if an AI agent could learn from every single failure, every clumsy workaround, every brilliant recovery, and feed that experience back into its own future performance?
Today’s LLM-powered agents suffer from a fundamental flaw: amnesia. They repeat the same mistakes, miss the same shortcuts, and rediscover the same solutions over and over. A new framework from IBM Research changes that by mining agent execution trajectories for three types of actionable knowledge: strategy tips from clean successes, recovery tips from failure-and-fix sequences, and optimization tips from tasks completed inefficiently.
On the AppWorld benchmark, agents equipped with this learned memory improved scenario goal completion by up to 14.3 percentage points on unseen tasks, and by a staggering 28.5 points on complex multi-step challenges. That is a 149% relative increase, with zero model changes.
Inspired by the work of Gaodan Fang, Vatche Isahagian, K. R. Jayaram, Ritesh Kumar, Vinod Muthusamy, Punleuk Oum, and Gegi Thomas, this episode was created using Google’s NotebookLM.
Read the original paper here: https://arxiv.org/abs/2603.10600
Agents That Teach Themselves
14/03/2026 | 13 mins.
What if AI agents could diagnose their own mistakes and build the exact skills they need to fix them, with no human intervention?
In this episode, we explore EvoSkill, a self-evolving framework where coding agents automatically discover and refine reusable skills through iterative failure analysis. Instead of optimizing prompts or fine-tuning models, EvoSkill lets agents build structured skill libraries that accumulate over time, improving performance by up to 12% on challenging benchmarks. Even more striking: skills learned on one task transfer to completely different tasks without modification.
Inspired by the work of Salaheddin Alzubi, Noah Provenzano, Jaydon Bingham, Weiyuan Chen, and Tu Vu, this episode was created using Google’s NotebookLM.
Read the original paper here: https://arxiv.org/pdf/2603.02766
Your AI Agent is Flying Blind: The Skills Gap No One is Talking About
02/03/2026 | 23 mins.
What if the biggest bottleneck in AI agent performance isn’t the model itself—but what it doesn’t know how to do?
In this episode, we explore SkillsBench, the first benchmark that systematically measures how structured procedural knowledge—called Agent Skills—impacts AI agent performance across real-world tasks. The results are striking: curated Skills boost agent success rates by 16 percentage points on average, with some domains like Healthcare seeing gains above 50 points. But here’s the twist—when models try to generate their own Skills, performance actually drops. The takeaway? AI agents desperately need human expertise to unlock their full potential.
Inspired by the work of Xiangyi Li, Wenbo Chen, Yimin Liu, and colleagues, this episode was created using Google’s NotebookLM.
Read the original paper here: https://arxiv.org/pdf/2602.12670
Your AI Assistant Doesn't Know You Yet. But It's Learning.
22/02/2026 | 20 mins.
What if your AI assistant could actually remember you — not just your name, but how your preferences evolve over time?
Researchers from Meta have introduced PAHF — Personalized Agents from Human Feedback — a framework that lets AI agents learn who you are in real time, through the natural back-and-forth of interaction. Before acting, the agent asks targeted questions to avoid costly mistakes. After acting, it listens to your corrections and updates its understanding of you. No pre-collected data required. No static profiles. Just a system that gets smarter about you with every exchange.
For anyone deploying AI agents at scale — in enterprise, banking, or consumer applications — this is the missing piece: personalization that actually keeps up with people.
Inspired by the work of Kaiqu Liang, Julia Kruk, Shengyi Qian, Xianjun Yang, Shengjie Bi, Yuanshun Yao, Shaoliang Nie, Mingyang Zhang, Lijuan Liu, Jaime Fernández Fisac, Shuyan Zhou, and Saghar Hosseini, this episode was created using Google's NotebookLM.
Read the original paper here: https://arxiv.org/pdf/2602.16173

More Technology podcasts

About AI Odyssey

AI Odyssey is your journey through the vast and evolving world of artificial intelligence. Powered by AI, this podcast breaks down both the foundational concepts and the cutting-edge developments in the field. Whether you're just starting to explore the role of AI in our world or you're a seasoned expert looking for deeper insights, AI Odyssey offers something for everyone. From AI ethics to machine learning intricacies, each episode is crafted to inspire curiosity and spark discussion on how artificial intelligence is shaping our future.

Podcast website

Technology