2024.11.20 每日AI论文 | 图像生成加速,语言模型数据集创新
Listen now
Description
本期的 7 篇论文如下: [00:33] ⚡ Continuous Speculative Decoding for Autoregressive Image Generation(自回归图像生成的连续推测解码) [01:14] 📚 RedPajama: an Open Dataset for Training Large Language Models(红睡衣:用于训练大型语言模型的开放数据集) [01:58] 🤖 Soft Robotic Dynamic In-Hand Pen Spinning(软体机器人动态手内笔旋转) [02:39] 🚀 ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements(ITACLIP:通过图像、文本和架构增强提升无训练语义分割) [03:13] 🔒 Building Trust: Foundations of Security, Safety and Transparency in AI(构建信任:人工智能中的安全、安全和透明度基础) [03:46] 🔍 SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning(SEAGULL:通过视觉语言指令调优的无参考图像质量评估方法) [04:24] 📊 Evaluating Tokenizer Performance of Large Language Models Across Official Indian Languages(评估大型语言模型在印度官方语言中的分词器性能) 【关注我们】 您还可以在以下平台找到我们,获得播客内容以外更多信息 小红书: AI速递
More Episodes
本期的 16 篇论文如下: [00:25] 📱 BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices(BlueLM-V-3B:移动设备上多模态大语言模型的算法与系统协同设计) [01:06] 🌍 Generative World Explorer(生成世界探索者) [01:43] 🔍 Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of...
Published 11/19/24
本期的 6 篇论文如下: [00:28] 🧠 LLaVA-o1: Let Vision Language Models Reason Step-by-Step(LLaVA-o1:让视觉语言模型逐步推理) [01:14] 🎨 Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement(区域感知文本到图像生成:硬绑定与软优化) [01:51] 🌐 GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D...
Published 11/18/24