Towards a Unified View of Large Language Model Post-Training Paper • 2509.04419 • Published 2 days ago • 52
On the token distance modeling ability of higher RoPE attention dimension Paper • 2410.08703 • Published Oct 11, 2024 • 1
The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models Paper • 2505.22617 • Published May 28 • 130
Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models Paper • 2503.11224 • Published Mar 14 • 29