2024.11.25 Daily AI Papers | Style-Friendly SNR Sampler Improves Image Generation, TÜLU 3 Open-Source Model Takes the Lead on Performance.

Description

The 14 papers in this episode are:

[00:26] 🎨 Style-Friendly SNR Sampler for Style-Driven Generation
[01:08] 🚀 TÜLU 3: Pushing Frontiers in Open Language Model Post-Training
[01:53] 🌐 OminiControl: Minimal and Universal Control for Diffusion Transformer
[02:31] 🛡 A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection
[03:08] 🧠 Large Multi-modal Models Can Interpret Features in Large Multi-modal Models
[03:49] 🎥 VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection
[04:29] 🎮 BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
[05:13] 🎥 Efficient Long Video Tokenization via Coordinate-based Patch Reconstruction
[05:56] 👴 MyTimeMachine: Personalized Facial Age Transformation
[06:34] 🎥 Novel View Extrapolation with Video Diffusion Priors
[07:10] 🎥 VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement
[07:54] ☁ Adapting Vision Foundation Models for Robust Cloud Segmentation in Remote Sensing Images
[08:31] 🤖 One to rule them all: natural language to bind communication, perception and action
[09:15] 🤖 WildLMa: Long Horizon Loco-Manipulation in the Wild

Follow us: you can also find us on the following platform for more information beyond the podcast content.
Xiaohongshu: AI速递

More Episodes

2024.11.27 Daily AI Papers | ShowUI Improves GUI Efficiency, F2F Improves Image Editing.

The 18 papers in this episode are:

[00:28] 🖥 ShowUI: One Vision-Language-Action Model for GUI Visual Agent
[01:08] 🎥 Pathways on the Image Manifold: Image Editing via Video Generation
[01:45] ⭐ Star Attention: Efficient LLM Inference over Long...

Published 11/27/24

2024.11.26 Daily AI Papers | Automated 3D Material Generation, Innovation in Zero-Shot Image Generation.

The 21 papers in this episode are:

[00:26] 🌐 Material Anything: Generating Materials for Any 3D Object via Diffusion
[01:05] 🎨 Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator
[01:48] 🤖 From Generation to Judgment:...

Published 11/26/24

HuggingFace 每日AI论文速递 (HuggingFace Daily AI Paper Digest)

Published 11/26/24