Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning Paper • 2511.16043 • Published Nov 20, 2025 • 108
Running on CPU Upgrade Featured 2.77k The Smol Training Playbook 📚 2.77k The secrets to building world-class LLMs
L^2M^3OF: A Large Language Multimodal Model for Metal-Organic Frameworks Paper • 2510.20976 • Published Oct 23, 2025 • 2
L^2M^3OF: A Large Language Multimodal Model for Metal-Organic Frameworks Paper • 2510.20976 • Published Oct 23, 2025 • 2 • 2
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published Oct 6, 2025 • 501
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use Paper • 2510.05592 • Published Oct 7, 2025 • 106
Trading-R1: Financial Trading with LLM Reasoning via Reinforcement Learning Paper • 2509.11420 • Published Sep 14, 2025 • 2
Diagnose, Localize, Align: A Full-Stack Framework for Reliable LLM Multi-Agent Systems under Instruction Conflicts Paper • 2509.23188 • Published Sep 27, 2025 • 3
Trading-R1: Financial Trading with LLM Reasoning via Reinforcement Learning Paper • 2509.11420 • Published Sep 14, 2025 • 2
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search Paper • 2509.25454 • Published Sep 29, 2025 • 141