Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
sail 's Collections
🚀 Active PRM
🌾Oat-Zero: Understanding R1-Zero-Like Training
🔱 Sailor2 Language Models
🧬 RegMix: Data Mixture as Regression
📈 Scaling Laws with Vocabulary
💡 DICE
⚓️ Sailor Language Models

🚀 Active PRM

updated Apr 16

Efficient Process Reward Model Training via Active Learning.

Upvote
3

  • Efficient Process Reward Model Training via Active Learning

    Paper • 2504.10559 • Published Apr 14 • 13

  • sail/ActPRMData

    Viewer • Updated Apr 4 • 663k • 12 • 1

  • sail/ActPRM-X

    7B • Updated Apr 15 • 4

  • sail/ActPRM

    7B • Updated Apr 15 • 3
Upvote
3
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets OCR模型免费转Markdown Pricing 模型下载攻略