Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • 免费去水印

  • Log In
  • Sign Up
sail 's Collections
Precision-RL
🚀 Active PRM
🌾Oat-Zero: Understanding R1-Zero-Like Training
🔱 Sailor2 Language Models
🧬 RegMix: Data Mixture as Regression
📈 Scaling Laws with Vocabulary
💡 DICE
⚓️ Sailor Language Models

🚀 Active PRM

updated Apr 16

Efficient Process Reward Model Training via Active Learning.

Upvote
3

  • Efficient Process Reward Model Training via Active Learning

    Paper • 2504.10559 • Published Apr 14 • 13

  • sail/ActPRMData

    Viewer • Updated Apr 4 • 663k • 61 • 1

  • sail/ActPRM-X

    7B • Updated Apr 15 • 24

  • sail/ActPRM

    7B • Updated Apr 15 • 8
Upvote
3
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets 免费Z-image图片生成 免费去水印 Vibevoice

🎉 Free Image Generator Now Available!

Totally Free + Zero Barriers + No Login Required