Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
TsinghuaC3I 's Collections
SSRL
UltraMedical

SSRL

updated 21 days ago
Upvote
2

  • TsinghuaC3I/SSRL

    Preview • Updated Aug 5 • 65 • 2

  • TsinghuaC3I/Llama-3.1-8B-Instruct-SSRL

    Text Generation • 8B • Updated Aug 5 • 17

  • TsinghuaC3I/Llama-3.2-3B-Instruct-SSRL

    Text Generation • 4B • Updated Aug 5 • 15

  • TsinghuaC3I/Qwen2.5-7B-Instruct-SSRL

    Text Generation • 8B • Updated Aug 5 • 15

  • TsinghuaC3I/Qwen2.5-3B-Instruct-SSRL

    Text Generation • 3B • Updated Aug 5 • 11 • 1

  • SSRL: Self-Search Reinforcement Learning

    Paper • 2508.10874 • Published 24 days ago • 91
Upvote
2
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets OCR模型免费转Markdown Pricing 模型下载攻略