-
Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning
Paper • 2508.16949 • Published • 22 -
Diffusion Language Models Know the Answer Before Decoding
Paper • 2508.19982 • Published • 22 -
ThinkDial: An Open Recipe for Controlling Reasoning Effort in Large Language Models
Paper • 2508.18773 • Published • 14 -
Intern-S1: A Scientific Multimodal Foundation Model
Paper • 2508.15763 • Published • 243