Submitted by chujiezheng 91 The Lessons of Developing Process Reward Models in Mathematical Reasoning · 9 authors 8
Submitted by davanstrien 50 BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature · 16 authors 2
Submitted by akhaliq 45 MinMo: A Multimodal Large Language Model for Seamless Voice Interaction · 36 authors 6
Submitted by akhaliq 29 O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning · 8 authors 2
Submitted by Shiweiliuiiiiiii 15 SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training · 6 authors 2
Submitted by akhaliq 9 ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning · 12 authors 2
Submitted by mbilkhu 5 Evaluating Sample Utility for Data Selection by Mimicking Model Weights · 4 authors 2