Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models Paper • 2506.06395 • Published Jun 5 • 130
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published Feb 7 • 150
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Paper • 2412.18925 • Published Dec 25, 2024 • 105
Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales? Paper • 2410.23856 • Published Oct 31, 2024 • 4
Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data Paper • 2502.14044 • Published Feb 19 • 8
SearchRAG: Can Search Engines Be Helpful for LLM-based Medical Question Answering? Paper • 2502.13233 • Published Feb 18 • 15
Usable XAI: 10 Strategies Towards Exploiting Explainability in the LLM Era Paper • 2403.08946 • Published Mar 13, 2024 • 3
Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models Paper • 2503.22165 • Published Mar 28 • 29