2024.11.22 每日AI论文 | 混合偏好优化提升推理，多模态自回归预训练创新。 - Listen - HuggingFace

2024.11.22 每日AI论文 | 混合偏好优化提升推理，多模态自回归预训练创新。

Listen now

Description

本期的 14 篇论文如下： [00:26] 🧠 Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization（通过混合偏好优化提升多模态大语言模型的推理能力） [01:12] 🌐 Multimodal Autoregressive Pre-training of Large Vision Encoders（大规模视觉编码器多模态自回归预训练） [01:55] 🧠 Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions（Marco-o1：面向开放式解决方案的开放推理模型） [02:40] 🧠 Hymba: A Hybrid-head Architecture for Small Language Models（Hymba：一种用于小语言模型的混合头架构） [03:22] 🚀 Ultra-Sparse Memory Network（超稀疏内存网络） [03:58] 📚 OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs（开放学者：利用检索增强型语言模型合成科学文献） [04:47] 🧠 Natural Language Reinforcement Learning（自然语言强化学习） [05:26] 🧠 Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models（Insight-V：探索多模态大语言模型的长链视觉推理） [06:08] 🤖 Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models（我了解这个实体吗？语言模型中的知识意识与幻觉） [06:46] 🌊 Stable Flow: Vital Layers for Training-Free Image Editing（稳定流：无需训练的图像编辑关键层） [07:25] 🌐 UnifiedCrawl: Aggregated Common Crawl for Affordable Adaptation of LLMs on Low-Resource Languages（统一爬取：利用Common Crawl为低资源语言的LLM提供经济适用的适应性） [08:03] 🚗 MagicDriveDiT: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control（MagicDriveDiT：基于自适应控制的高分辨率长视频生成用于自动驾驶） [08:44] 🧠 Patience Is The Key to Large Language Model Reasoning（耐心是大型语言模型推理的关键） [09:18] 🌐 Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation（将高斯散射融入扩散去噪器以实现快速且可扩展的单阶段图像到3D生成）【关注我们】您还可以在以下平台找到我们，获得播客内容以外更多信息小红书: AI速递

More Episodes

See all »

2024.11.27 每日AI论文 | ShowUI提升GUI效率，F2F改进图像编辑。

本期的 18 篇论文如下： [00:28] 🖥 ShowUI: One Vision-Language-Action Model for GUI Visual Agent（ShowUI：一种用于GUI视觉代理的视觉-语言-动作模型） [01:08] 🎥 Pathways on the Image Manifold: Image Editing via Video Generation（图像流形上的路径：通过视频生成进行图像编辑） [01:45] ⭐ Star Attention: Efficient LLM Inference over Long...

Published 11/27/24

2024.11.26 每日AI论文 | 3D材料生成自动化，零样本图像生成创新。

本期的 21 篇论文如下： [00:26] 🌐 Material Anything: Generating Materials for Any 3D Object via Diffusion（材料生成：通过扩散生成任意3D对象的材料） [01:05] 🎨 Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator（基于修复的大规模文本到图像模型：零样本主题驱动图像生成器） [01:48] 🤖 From Generation to Judgment:...

Published 11/26/24

HuggingFace 每日AI论文速递

Published 11/26/24