SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines Paper • 2502.14739 • Published 3 days ago • 87
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning Paper • 2502.06781 • Published 13 days ago • 59
Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities? Paper • 2502.12215 • Published 7 days ago • 15
Soundwave: Less is More for Speech-Text Alignment in LLMs Paper • 2502.12900 • Published 5 days ago • 72
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published 7 days ago • 133
Step-Audio Collection Step-Audio model family, including Audio-Tokenizer, Audio-Chat and TTS • 3 items • Updated 6 days ago • 26
Lumina Family Collection Lumina-T2X is a unified framework for Text to Any Modality Generation • 8 items • Updated Jul 30, 2024 • 5
Ovis2 Collection Our latest advancement in multi-modal large language models (MLLMs) • 8 items • Updated 7 days ago • 51
Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control 20 days ago • 106
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 19 days ago • 190