Submitted by Hanyuezhuohua 66 Parallel-R1: Towards Parallel Thinking via Reinforcement Learning · 11 authors 35 3
Submitted by taesiri 54 Visual Representation Alignment for Multimodal Large Language Models · 13 authors 59 7
Submitted by taesiri 45 Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search · 6 authors 122 2
Submitted by sanaka87 31 Reconstruction Alignment Improves Unified Multimodal Models · 4 authors 62 2
Submitted by fenfan 24 UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward · 6 authors 43 2
Submitted by aopolin-lv 19 F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions · 10 authors 44 2
Submitted by ChillingDream 17 Staying in the Sweet Spot: Responsive Reasoning Evolution via Capability-Adaptive Hint Scaffolding · 11 authors 5 2
Submitted by JasperHaozhe 14 Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning · 6 authors 3 2
Submitted by taesiri 7 SimpleQA Verified: A Reliable Factuality Benchmark to Measure Parametric Knowledge · 5 authors 2
Submitted by benfielding 6 Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing · 15 authors 1
Submitted by xianbao 6 Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference · 9 authors 2
Submitted by nfrumkin 5 Q-Sched: Pushing the Boundaries of Few-Step Diffusion Models with Quantization-Aware Scheduling · 2 authors 6 2
Submitted by jfkback 3 Benchmarking Information Retrieval Models on Complex Retrieval Tasks · 2 authors 2
Submitted by PraneetNeuro 1 From Noise to Narrative: Tracing the Origins of Hallucinations in Transformers · 5 authors 2