LLMs Evaluation Collection Evaluate models on key benchmarks. Thanks @clefourrier and @VictorSanh for the recommandations. • 12 items • Updated 17 days ago • 1
An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging Paper • 2502.09056 • Published 10 days ago • 30
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning Paper • 2502.06781 • Published 13 days ago • 59
ELAICHI Collection ELAICHI: Enhancing Low-resource TTS by Addressing Infrequent and Low-frequency Character Bigrams. • 6 items • Updated Oct 24, 2024 • 6
view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control 20 days ago • 106
view article Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • 23 days ago • 36
Towards General-Purpose Model-Free Reinforcement Learning Paper • 2501.16142 • Published 27 days ago • 26
DolphinLabeled Datasets Collection Eric Hartford has added labels to help you filter datasets, for your pleasure. • 5 items • Updated Jan 6 • 13
MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation Paper • 2412.04448 • Published Dec 5, 2024 • 10