SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights Paper • 2410.09008 • Published Oct 11, 2024 • 17
agentica-org/DeepScaleR-1.5B-Preview Text Generation • Updated about 14 hours ago • 22.5k • 470
HuggingFaceTB/SmolVLM2-256M-Video-Instruct Video-Text-to-Text • Updated 2 days ago • 1.18k • 21
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning Paper • 2502.14768 • Published 3 days ago • 32