Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL Paper • 2508.07976 • Published 29 days ago • 48
Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing Paper • 2506.09965 • Published Jun 11 • 3
Agentar-Fin-R1: Enhancing Financial Intelligence through Domain Expertise, Training Efficiency, and Advanced Reasoning Paper • 2507.16802 • Published Jul 22 • 8
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens Paper • 2508.01191 • Published Aug 2 • 235
view article Article Why We Built the OpenMDW License: A Comprehensive License for ML Models By linuxfoundation • Jul 2 • 23
AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning Paper • 2505.24298 • Published May 30 • 28