Description
本期的 7 篇论文如下:
[00:33] ⚡ Continuous Speculative Decoding for Autoregressive Image Generation(自回归图像生成的连续推测解码)
[01:14] 📚 RedPajama: an Open Dataset for Training Large Language Models(红睡衣:用于训练大型语言模型的开放数据集)
[01:58] 🤖 Soft Robotic Dynamic In-Hand Pen Spinning(软体机器人动态手内笔旋转)
[02:39] 🚀 ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements(ITACLIP:通过图像、文本和架构增强提升无训练语义分割)
[03:13] 🔒 Building Trust: Foundations of Security, Safety and Transparency in AI(构建信任:人工智能中的安全、安全和透明度基础)
[03:46] 🔍 SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning(SEAGULL:通过视觉语言指令调优的无参考图像质量评估方法)
[04:24] 📊 Evaluating Tokenizer Performance of Large Language Models Across Official Indian Languages(评估大型语言模型在印度官方语言中的分词器性能)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
本期的 16 篇论文如下:
[00:25] 📱 BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices(BlueLM-V-3B:移动设备上多模态大语言模型的算法与系统协同设计)
[01:06] 🌍 Generative World Explorer(生成世界探索者)
[01:43] 🔍 Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of...
Published 11/19/24
本期的 6 篇论文如下:
[00:28] 🧠 LLaVA-o1: Let Vision Language Models Reason Step-by-Step(LLaVA-o1:让视觉语言模型逐步推理)
[01:14] 🎨 Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement(区域感知文本到图像生成:硬绑定与软优化)
[01:51] 🌐 GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D...
Published 11/18/24