Submitted by akhaliq 46 Teaching Large Language Models to Reason with Reinforcement Learning · 9 authors 2
Submitted by akhaliq 38 Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference · 11 authors 1
Submitted by akhaliq 24 LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error · 5 authors 1
Submitted by akhaliq 18 Common 7B Language Models Already Possess Strong Math Capabilities · 8 authors 1
Submitted by akhaliq 5 Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis · 8 authors 1