Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
bestsonny 's Collections
papers

papers

updated 6 days ago
Upvote
-

  • Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning

    Paper • 2508.16949 • Published 14 days ago • 22

  • Diffusion Language Models Know the Answer Before Decoding

    Paper • 2508.19982 • Published 10 days ago • 22

  • ThinkDial: An Open Recipe for Controlling Reasoning Effort in Large Language Models

    Paper • 2508.18773 • Published 11 days ago • 14

  • Intern-S1: A Scientific Multimodal Foundation Model

    Paper • 2508.15763 • Published 16 days ago • 243

  • Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

    Paper • 2508.01191 • Published Aug 2 • 234

  • Self-Rewarding Vision-Language Model via Reasoning Decomposition

    Paper • 2508.19652 • Published 10 days ago • 78
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets OCR模型免费转Markdown Pricing 模型下载攻略