Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper • 2511.06221 • Published Nov 9 • 131
RedOne 2.0: Rethinking Domain-specific LLM Post-Training in Social Networking Services Paper • 2511.07070 • Published Nov 10 • 19
Running 3.61k The Ultra-Scale Playbook 🌌 3.61k The ultimate guide to training LLM on large GPU Clusters