Powered by RND
PodcastsTechnologyAI可可AI生活

AI可可AI生活

fly51fly
AI可可AI生活
Latest episode

Available Episodes

5 of 756
  • [人人能懂] 从看见空间、探索信息到理解“不要”
    你有没有想过,能写诗作画的AI,为什么有时却像个固执的孩子?本期我们要聊的几篇最新论文,就试图教会AI一些我们习以为常、但它却难以理解的人类智慧。我们将一起看看,如何治好AI的“路痴”症,让它拥有空间感;如何让它从被动看图,变身主动破案的“侦探”;甚至,如何通过巧妙的“换个姿势”,让它终于听懂“不要”,并随心所欲地调整观察事物的“粒度”。00:00:33 人工智能的“路痴”难题00:05:24 AI侦探,如何给千米大桥做“体检”?00:09:59 从“你猜”到“你定”:AI图像分割的新玩法00:14:45 换个姿势,让AI听懂“不要”本期介绍的几篇论文:[CV] Scaling Spatial Intelligence with Multimodal Foundation Models [SenseTime Research] https://arxiv.org/abs/2511.13719 ---[CV] BridgeEQA: Virtual Embodied Agents for Real Bridge Inspections [University of Houston] https://arxiv.org/abs/2511.12676 ---[CV] UnSAMv2: Self-Supervised Learning Enables Segment Anything at Any Granularity [UC Berkeley] https://arxiv.org/abs/2511.13714 ---[CV] SpaceVLM: Sub-Space Modeling of Negation in Vision-Language Models [MIT] https://arxiv.org/abs/2511.12331
    --------  
    19:51
  • [人人能懂] 从灵感溯源、速读秘诀到诚实AI
    你有没有想过,最顶尖的AI,它的智慧可能不是体现在无所不知,而是敢于坦诚地说出“我不知道”?本期节目,我们将一起探索AI如何学会这项宝贵的品质。我们还会揭秘,如何给AI装上一双“眼睛”让它在嘈杂派对里也能跟你轻松对话,如何用一个优美的公式教会它“速读”长篇报告,甚至让一份200页的PDF自己开口说话,并在一秒内找到AI画作的灵感“祖先”。准备好了吗?让我们一起进入AI更深邃、更智慧的内心世界。00:00:39 AI画画的灵感,能秒速溯源吗?00:06:29 大模型读书慢?给它一副聪明的“速读眼镜”00:12:13 给AI一双眼睛,让它学会“察言观色”00:16:37 AI的最高智慧,是承认自己不知道00:22:56 如何让一份200页的PDF,自己开口说话?本期介绍的几篇论文:[CV] Fast Data Attribution for Text-to-Image Models[CMU & Adobe Research & UC Berkeley]https://arxiv.org/abs/2511.10721---[LG] Optimizing Mixture of Block Attention[MIT]https://arxiv.org/abs/2511.11571---[CL] AV-Dialog: Spoken Dialogue Models with Audio-Visual Input[University of Washington & Meta AI Research]https://arxiv.org/abs/2511.11124---[LG] Honesty over Accuracy: Trustworthy Language Models through Reinforced Hesitation[Toyota Technological Institute at Chicago & University of California, San Diego]https://arxiv.org/abs/2511.11500---[CL] Information Extraction From Fiscal Documents Using LLMs[Google Inc & XKDR Forum]https://arxiv.org/abs/2511.10659
    --------  
    29:00
  • [人人能懂] 从组建乐团、自我修炼到深度思考
    想让AI更聪明,答案一定是用更多数据喂出个更大的模型吗?本期我们要聊点不一样的:当AI不再单打独斗,而是组建起一支“交响乐团”;当它不再追求更大,而是学会了“反复琢磨”;当它甚至能像武林高手一样开启“自我修炼”。我们将从几篇最新论文出发,看看AI如何从理解微观世界的“集体舞步”,到为自己的想象力配上一本“物理说明书”,走上一条更聪明的进化之路。00:00:31 AI制药,也需要一个“交响乐团”?00:05:32 人工智能的“自我修炼”手册00:11:46 如何预测一群舞者的集体舞步?00:17:16 AI变聪明的捷径:不是更大,而是更深00:22:03 给AI视频配一本“物理说明书”本期介绍的几篇论文:[LG] MADD: Multi-Agent Drug Discovery Orchestra [ITMO University] https://arxiv.org/abs/2511.08217 ---[LG] AgentEvolver: Towards Efficient Self-Evolving Agent System [Tongyi Lab] https://arxiv.org/abs/2511.10395 ---[LG] Entangled Schrödinger Bridge Matching [University of Pennsylvania & Duke-NUS Medical School] https://arxiv.org/abs/2511.07406 ---[CL] Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence [University of Maryland & New York University] https://arxiv.org/abs/2511.07384 ---[RO] Robot Learning from a Physical World Model [Google DeepMind & USC] https://arxiv.org/abs/2511.07416
    --------  
    26:52
  • [人人能懂] 乐高说明书、喜剧大赛与科研空间站
    你有没有想过,AI的大脑里到底是什么样?今天我们就来一次深度探险,看看最新论文如何为我们绘制出AI的“乐高说明书”,又如何让它兼顾深思熟虑与脱口而出。我们还会把AI送上喜剧舞台,一探它那难以共情的奇特笑点,甚至把它放进一个虚拟“空间站”,看它能否成为真正的科学家。最后,我们会聊一个大趋势:AI正在悄悄地从遥远的云端,搬到你我的身边。00:00:31 AI的“乐高”说明书00:06:17 让AI既能深思熟虑,又能脱口而出00:11:07 AI的笑点,为什么我们Get不到?00:15:29 AI科学家,告别流水线00:21:19 AI大变局:从云端到你身边本期介绍的几篇论文:[LG] Weight-sparse transformers have interpretable circuits [OpenAI] https://cdn.openai.com/pdf/41df8f28-d4ef-43e9-aed2-823f9393e470/circuit-sparsity-paper.pdf---[CL] TiDAR: Think in Diffusion, Talk in Autoregression [NVIDIA] https://arxiv.org/abs/2511.08923 ---[CL] Assessing the Capabilities of LLMs in Humor: A Multi-dimensional Analysis of Oogiri Generation and Evaluation [Hitotsubashi University] https://arxiv.org/abs/2511.09133 ---[LG] The Station: An Open-World Environment for AI-Driven Discovery [Dualverse AI] https://arxiv.org/abs/2511.06309 ---[LG] Intelligence per Watt: Measuring Intelligence Efficiency of Local AI [Stanford University] https://arxiv.org/abs/2511.07885
    --------  
    26:41
  • [人人能懂] 从大师风范到听懂音乐
    如何让AI更聪明、更可靠?这期节目,我们将颠覆你的好几个固有认知。我们会发现,让小模型拥有大师风范的最佳方式,竟是引入一场“鉴赏家”参与的博弈;而AI最好的记忆方法,有时反而是那个最“笨”的。接着,我们将探讨如何用一张“考试大纲”驯服AI,又如何给它内置一个“苏格拉底”进行自我纠错。最后,我们还会揭秘,AI是如何从仅仅“听到”音乐,进化到能够“听懂”音乐背后的高级情感与故事的。00:00:37 让你的小模型,拥有宗师风范00:05:09 为什么说,最笨的方法,是AI最好的记忆方法?00:10:30 AI的“考试大纲”:我们如何让它更听话?00:15:54 如何让AI少犯错?给它一个内置的“苏格拉底”00:21:06 从“好听”到“高级”:AI如何学会聊音乐?本期介绍的几篇论文:[CL] Black-Box On-Policy Distillation of Large Language Models [Microsoft Research] https://arxiv.org/abs/2511.10643 ---[CL] Convomem Benchmark: Why Your First 150 Conversations Don't Need RAG [Salesforce AI Research] https://arxiv.org/abs/2511.10523 ---[CL] Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM Instruction Following [Meta Superintelligence Labs & Princeton University] https://arxiv.org/abs/2511.10507 ---[CL] SSR: Socratic Self-Refine for Large Language Model Reasoning [Salesforce AI Research] https://arxiv.org/abs/2511.10621 ---[AS] Music Flamingo: Scaling Music Understanding in Audio Language Models [NVIDIA & University of Maryland] https://arxiv.org/abs/2511.10289
    --------  
    27:23

More Technology podcasts

About AI可可AI生活

来自 @爱可可-爱生活 的第一手AI快报,用最简单易懂的语言,带你直击最前沿的人工智能科研动态。无论你是科技小白,还是行业达人,这里都有你想知道的AI故事和未来趋势。跟着我们,轻松解锁人工智能的无限可能! #人工智能 #科技前沿
Podcast website

Listen to AI可可AI生活, The AI Daily Brief: Artificial Intelligence News and Analysis and many other podcasts from around the world with the radio.net app

Get the free radio.net app

  • Stations and podcasts to bookmark
  • Stream via Wi-Fi or Bluetooth
  • Supports Carplay & Android Auto
  • Many other app features
Social
v7.23.12 | © 2007-2025 radio.de GmbH
Generated: 11/19/2025 - 8:16:55 AM