SimpleRL
Updated 4 days ago
The collection for the project "Simple Reinforcement Learning for Reasoning".
Upvote
4
hkust-nlp/Qwen-2.5-Math-7B-SimpleRL-Zero
Updated about 2 hours ago • 154 • 2
hkust-nlp/Qwen-2.5-Math-7B-SimpleRL
Updated about 2 hours ago • 35 • 1