Submitted by Wangchunshu 68 Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL · 30 authors 74 6
Submitted by yulunliu 41 LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos · 6 authors 102 2
Submitted by shuaishuaicdp 15 MultiRef: Controllable Image Generation with Multiple Visual References · 9 authors 2
Submitted by IffYuan 10 Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation · 9 authors 22 2
Submitted by JinyiHan 10 Mind the Generation Process: Fine-Grained Confidence Estimation During LLM Generation · 11 authors 2
Submitted by zachary-yin 10 Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer · 10 authors 2
Submitted by marcodena 10 Evaluating Podcast Recommendations with Profile-Aware LLM-as-a-Judge · 10 authors 2
Submitted by JinyiHan 9 A Stitch in Time Saves Nine: Proactive Self-Refinement for Language Models · 12 authors 2
Submitted by abhi1nandy2 8 Leveraging Large Language Models for Predictive Analysis of Human Misery · 4 authors 2
Submitted by JusperLee 8 Advances in Speech Separation: Techniques, Challenges, and Future Trends · 11 authors 803 2
Submitted by BreynaldDva 5 Copyright Protection for Large Language Models: A Survey of Methods, Challenges, and Trends · 11 authors 16 2
Submitted by sefira32 4 MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents · 24 authors 7 4
Submitted by marcodena 4 Describe What You See with Multimodal Large Language Models to Enhance Video Recommendations · 3 authors 4
Submitted by Sreyan88 3 MMAU-Pro: A Challenging and Comprehensive Benchmark for Holistic Evaluation of Audio General Intelligence · 34 authors 2
Submitted by EvanTHU 2 Motion2Motion: Cross-topology Motion Transfer with Sparse Correspondence · 8 authors 2
Submitted by seonglae 2 CorrSteer: Steering Improves Task Performance and Safety in LLMs through Correlation-based Sparse Autoencoder Feature Selection · 3 authors 2
Submitted by guinansu 2 MedSAMix: A Training-Free Model Merging Approach for Medical Image Segmentation · 6 authors 2 2
Submitted by cocolinux 2 Radiance Fields in XR: A Survey on How Radiance Fields are Envisioned and Addressed for XR Research · 4 authors 3 2
Submitted by maciejskorski 1 Beyond Human Judgment: A Bayesian Evaluation of LLMs' Moral Values Understanding · 2 authors 1 2
Submitted by ash56 1 Rapidly Adapting to New Voice Spoofing: Few-Shot Detection of Synthesized Speech Under Distribution Shifts · 8 authors 2
Submitted by Breezelled 1 ZARA: Zero-shot Motion Time-Series Analysis via Knowledge and Retrieval Driven LLM Agents · 4 authors 3 2
Submitted by dikw - Atom-Searcher: Enhancing Agentic Deep Research via Fine-Grained Atomic Thought Reward · 12 authors 2