Submitted by akhaliq 329 DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning · 200 authors 5
Submitted by akhaliq 83 VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding · 15 authors 3
Submitted by akhaliq 68 FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces · 10 authors 3
Submitted by yaful 56 Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback · 5 authors 2
Submitted by akhaliq 24 O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning · 9 authors 2
Submitted by RicardoL1u 20 Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament · 6 authors 3
Submitted by jedyang97 17 Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass · 9 authors 4
Submitted by Eladlev 13 IntellAgent: A Multi-Agent Framework for Evaluating Conversational AI Systems · 2 authors 2