Paper: R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning • 2508.21113 • Published 9 days ago • 103
Article: Vision Language Models (Better, Faster, Stronger) • By merve and 4 others • May 12 • 522