PodcastsTechnologyAI可可AI生活

AI可可AI生活

fly51fly
AI可可AI生活
Latest episode

877 episodes

  • AI可可AI生活

    [人人能懂AI前沿] 更聪明的AI:从精准辅导、内心独白到自我陪练

    19/03/2026 | 29 mins.
    你有没有想过,我们到底该如何培养一个更聪明的AI?本期节目,我们将一起揭秘几篇最新论文,看看科学家们是如何给AI请“精准家教”,让它花十分之一的钱办成同样的事;如何窥探AI的“内心戏”,了解它什么时候是真的自信;又是如何通过一个关键的“中间态”和不知疲倦的“AI陪练”,把它从偏科生打造成全能高手,并最终教会它“懂分寸”,成为一名好裁判的。让我们一同探寻AI的成长之道。
    00:00:35 AI的“补习班”,如何花十分之一的钱,办成同样的事?
    00:06:21 AI的“内心戏”,它怎么知道自己懂不懂?
    00:12:18 你和高手的差距,可能只是一个“中间态”
    00:18:32 AI的“陪练”,高手是怎么喂出来的?
    00:24:11 如何把一个“耿直”的AI,训练得“懂分寸”?
    本期介绍的几篇论文:
    [LG] Efficient Exploration at Scale
    [Google DeepMind]
    https://arxiv.org/abs/2603.17378
    ---
    [CL] How do LLMs Compute Verbal Confidence
    [Google DeepMind]
    https://arxiv.org/abs/2603.17839
    ---
    [LG] PRISM: Demystifying Retention and Interaction in Mid-Training
    [IBM Research]
    https://arxiv.org/abs/2603.17074
    ---
    [AI] AI Scientist via Synthetic Task Scaling
    [Princeton University & Microsoft Research]
    https://arxiv.org/abs/2603.17216
    ---
    [LG] REAL: Regression-Aware Reinforcement Learning for LLM-as-a-Judge
    [University of California, Los Angeles & The University of Texas at Austin]
    https://arxiv.org/abs/2603.17145
  • AI可可AI生活

    [人人能懂AI前沿] AI的“自我”进化:从经验学习、元认知到资源管理

    18/03/2026 | 27 mins.
    你有没有想过,未来的AI不仅能回答你的问题,还能从与你的每一次互动中汲取经验,悄悄进化?它甚至还能在犯错后“自我反思”,像我们一样“长记性”。本期我们将一起探索几篇最新论文,看看AI如何学会像一个聪明的“CEO”一样管理自己的思考,如何通过精准“剪枝”在你的手机里狂飙,以及如何消灭那些你看不到的“计算成本”,变得更高效、更智慧。
    00:00:32 AI进化论,为什么你的“差评”正在喂养一个更聪明的它
    00:05:19 让AI在手机里狂飙,快,才是一切
    00:10:38 AI提速19%的秘密,你以为的计算,其实是搬运
    00:15:20 AI犯了错,能不能让它自己“长记性”?
    00:21:26 你的大脑里,缺一个聪明的“CEO”
    本期介绍的几篇论文:
    [CL] Online Experiential Learning for Language Models
    [Microsoft Research]
    https://arxiv.org/abs/2603.16856
    ---
    [LG] MobileLLM-Flash: Latency-Guided On-Device LLM Design for Industry Scale
    [Meta AI]
    https://arxiv.org/abs/2603.15954
    ---
    [LG] FlashSampling: Fast and Memory-Efficient Exact Sampling
    [LMU Munich & FlashSampling & Princeton University]
    https://arxiv.org/abs/2603.15854
    ---
    [LG] Meta-TTRL: A Metacognitive Framework for Self-Improving Test-Time Reinforcement Learning in Unified Multimodal Models
    [Tsinghua University & JD.COM]
    https://arxiv.org/abs/2603.15724
    ---
    [RO] When Should a Robot Think? Resource-Aware Reasoning via Reinforcement Learning for Embodied Robotic Decision-Making
    [CMU & Northeastern University & Harvard University]
    https://arxiv.org/abs/2603.16673
  • AI可可AI生活

    [人人能懂AI前沿] 从元认知到隐形失败:AI如何学会“学习”与“反思”?

    18/03/2026 | 29 mins.
    今天我们要聊一个特别有意思的话题:如何让聪明的AI变得更“靠谱”?我们会一起从几篇最新的论文中寻找答案,看看科学家们是如何教AI学会“自主学习”而不是死记硬背,又是如何通过给它换个“大记事本”来解决记性差的难题。更刺激的是,我们还会揭秘AI那些悄无声息的“隐形失败”,并学习一种看似很笨的管理办法,以及AI学会说“等一下,我再想想”背后的真正奥秘。准备好了吗?让我们一起潜入AI的大脑深处。
    00:00:35 你被骗了,为什么说现在的AI根本不会“学习”?
    00:06:58 AI的大脑革命,为什么“记性差”的反而更聪明?
    00:13:58 你和AI的对话,藏着多少看不见的“坑”?
    00:18:36 如何用“笨办法”,管好一个聪明的AI?
    00:23:53 AI学会了“等一下,我再想想”?
    本期介绍的几篇论文:
    [AI] Why AI systems don't learn and what to do about it: Lessons on autonomous learning from cognitive science
    [FAIR at META & NYU]
    https://arxiv.org/abs/2603.15381
    ---
    [LG] M²RNN: Non-Linear RNNs with Matrix-Valued States for Scalable Language Modeling
    [UC Berkeley & MIT-IBM Watson Lab]
    https://arxiv.org/abs/2603.14360
    ---
    [CL] Invisible failures in human-AI interactions
    [Bigspin AI]
    https://arxiv.org/abs/2603.15423
    ---
    [LG] POLCA: Stochastic Generative Optimization with LLM
    [University of Wisconsin-Madison & Google DeepMind]
    https://arxiv.org/abs/2603.14769
    ---
    [LG] Understanding Reasoning in LLMs through Strategic Information Allocation under Uncertainty
    [Microsoft Research]
    https://arxiv.org/abs/2603.15500
  • AI可可AI生活

    [人人能懂AI前沿] AI的进化心法:从刻意练习、延迟决策到自我反思

    17/03/2026 | 27 mins.
    你有没有想过,AI画画也能像我们一样进行“刻意练习”,通过精准对比找到最佳进步方向吗?面对复杂变化的世界,为什么“慢半拍”的决策反而更准确?我们还将揭示AI训练中“又快又好”的秘密课程表,探讨项目延期背后的沟通艺术,并告诉你,你对AI的每一次追问,都在如何悄悄地训练它。本期,让我们一起从几篇最新论文中,窥探AI正在学习的那些“人间智慧”。
    00:00:34 AI绘画的“刻意练习法”
    00:05:25 做对事情,只需一个“时间差”
    00:11:31 快与好,为什么不能兼得?AI训练中的“学霸心法”
    00:17:02 为什么你的项目总在延期?答案可能不在技术,在沟通
    00:22:27 你的每一次追问,都在悄悄训练AI
    本期介绍的几篇论文:
    [CV] Finite Difference Flow Optimization for RL Post-Training of Text-to-Image Models
    [NVIDIA & UC Berkeley]
    https://arxiv.org/abs/2603.12893
    ---
    [LG] A Reduction Algorithm for Markovian Contextual Linear Bandits
    [University of California, Los Angeles & Meta]
    https://arxiv.org/abs/2603.12530
    ---
    [LG] Curriculum Sampling: A Two-Phase Curriculum for Efficient Training of Flow Matching
    [Stanford University]
    https://arxiv.org/abs/2603.12517
    ---
    [LG] Optimizing Task Completion Time Updates Using POMDPs
    [Stanford University & Rensselaer Polytechnic Institute]
    https://arxiv.org/abs/2603.12340
    ---
    [CL] Aligning Language Models from User Interactions
    [ETH Zurich]
    https://arxiv.org/abs/2603.12273
  • AI可可AI生活

    [人人能懂AI前沿] 智能操作系统、AI自进化、评估陷阱与模块化机器人

    15/03/2026 | 33 mins.
    你有没有想过,有一天跟电脑交互不再需要打开一个个App?或者,一个顶尖AI为了辅导“学生”考高分,竟然学会了“作弊”?本期节目,我们将从五篇最新论文出发,聊聊这些正在发生的奇妙变革:从重塑操作系统的“智能管家”,到学会削苹果的“灵巧机械手”,再到“专业团队”如何完胜“大力出奇迹”派的机器人。让我们一起看看,AI是如何在这些意想不到的角落,悄悄改写着未来。
    00:00:36 跟App说再见,我们和电脑的相处之道正在被重写
    00:07:15 当AI开始“辅导”AI,一个关于学霸、偏科和作弊的故事
    00:13:38 真正的问题不是AI,而是我们测试它的方法
    00:18:53 让机器人给你削苹果,到底有多难?
    00:25:31 造一个聪明的机器人,是“大力出奇迹”还是“专业的人干专业的事”?
    本期介绍的几篇论文:
    [AI] AgentOS: From Application Silos to a Natural Language-Driven Data Ecosystem
    [University of Kansas]
    https://arxiv.org/abs/2603.08938
    ---
    [LG] PostTrainBench: Can LLM Agents Automate LLM Post-Training?
    [ELLIS Institute Tübingen & University of Tübingen]
    https://arxiv.org/abs/2603.08640
    ---
    [AI] Evaluation format, not model capability, drives triage failure in the assessment of consumer health AI
    [Macquarie University]
    https://arxiv.org/abs/2603.11413
    ---
    [RO] Towards Human-Like Manipulation through RL-Augmented Teleoperation and Mixture-of-Dexterous-Experts VLA
    [Shanghai Jiao Tong University & Sharpa]
    https://arxiv.org/abs/2603.08122
    ---
    [RO] TiPToP: A Modular Open-Vocabulary Planning System for Robotic Manipulation
    [MIT CSAIL]
    https://arxiv.org/abs/2603.09971

More Technology podcasts

About AI可可AI生活

来自 @爱可可-爱生活 的第一手AI快报,用最简单易懂的语言,带你直击最前沿的人工智能科研动态。无论你是科技小白,还是行业达人,这里都有你想知道的AI故事和未来趋势。跟着我们,轻松解锁人工智能的无限可能! #人工智能 #科技前沿
Podcast website

Listen to AI可可AI生活, All-In with Chamath, Jason, Sacks & Friedberg and many other podcasts from around the world with the radio.net app

Get the free radio.net app

  • Stations and podcasts to bookmark
  • Stream via Wi-Fi or Bluetooth
  • Supports Carplay & Android Auto
  • Many other app features