Description
本期的 11 篇论文如下:
[00:27] 🔍 Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders(解构SDXL Turbo:使用稀疏自编码器解释文本到图像模型)
[01:05] 🧠 What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective(LLMs训练中快速与慢速思考的层级差异:梯度视角)
[01:43] 🔍 A Pointer Network-based Approach for Joint Extraction and Detection of Multi-Label Multi-Class Intents(基于指针网络的多标签多类别意图联合提取与检测方法)
[02:23] 🔄 Constraint Back-translation Improves Complex Instruction Following of Large Language Models(约束反向翻译提升大型语言模型复杂指令遵循能力)
[02:59] 📄 Language Models can Self-Lengthen to Generate Long Texts(语言模型能够自我延长以生成长文本)
[03:35] 📊 BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays(BenchX:胸部X光片医学视觉-语言预训练统一基准框架)
[04:17] 💾 BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments(BitStack:在可变内存环境中压缩大型语言模型的细粒度大小控制)
[05:04] 🤖 Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory Tasks(探索未知:基于聊天的个性化探索任务协作界面)
[05:40] 🤖 SelfCodeAlign: Self-Alignment for Code Generation(自代码对齐:代码生成中的自对齐方法)
[06:18] 🎥 DELTA: Dense Efficient Long-range 3D Tracking for any video(DELTA:高效密集长程3D视频追踪)
[06:57] 🎥 Learning Video Representations without Natural Videos(无需自然视频即可学习视频表示)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递