Pushing on Multilingual Reasoning Models with Language-Mixed Chain-of-Thought Paper • 2510.04230 • Published Oct 5 • 26
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published Oct 13 • 176
On Epistemic Uncertainty of Visual Tokens for Object Hallucinations in Large Vision-Language Models Paper • 2510.09008 • Published Oct 10 • 15