Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Hyperbolic
SambaNova
Novita
fal
Together AI
Nebius AI Studio
Fireworks
Replicate
HF Inference API
Misc
Reset Misc
Inference Endpoints
open-r1
AutoTrain Compatible
text-generation-inference
custom_code
4-bit precision
8-bit precision
Misc with no match
Eval Results
Merge
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
240
Full-text search
Edit filters
Sort: Trending
Active filters:
open-r1
Clear all
yeshsurya/Qwen2.5-7B-Math-with_50stepGRPO
Text Generation
•
Updated
11 days ago
•
24
mradermacher/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math-GGUF
Updated
19 days ago
•
1.1k
mradermacher/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-GGUF
Updated
19 days ago
•
2.53k
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math_lowlr
Text Generation
•
Updated
19 days ago
•
18
Dongwei/Qwen-2.5-7B_Math_smalllr
Text Generation
•
Updated
19 days ago
•
39
Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math_smalllr
Text Generation
•
Updated
19 days ago
•
39
Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math_smalllr
Text Generation
•
Updated
19 days ago
•
11
yh-yao/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
18 days ago
•
12
qorbanpour/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
18 days ago
•
3
Dongwei/Qwen-2.5-7B_Base_Math_smallestlr
Text Generation
•
Updated
13 days ago
•
69
Dongwei/Qwen-2.5-7B_Base_Math_smallestlr_newdata
Text Generation
•
Updated
18 days ago
•
24
schwamaths/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
18 days ago
•
7
ibndias/Qwen2.5-1.5B-Open-R1-GRPO1st
Text Generation
•
Updated
13 days ago
•
3
schwamaths/Qwen2.5-1.5B-Instruct-Open-R1-GRPO
Text Generation
•
Updated
18 days ago
•
4
weltonwang88/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
about 18 hours ago
•
56
Jiawen006/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
17 days ago
•
9
mradermacher/Qwen2.5-1.5B-Instruct-Open-R1-GRPO-GGUF
Updated
18 days ago
•
282
qgallouedec/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
15 days ago
•
8
nlxpku/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
15 days ago
•
2
saemin21/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
17 days ago
•
3
JeffP111/Qwen2.5-3B-GRPO-Countdown
Text Generation
•
Updated
16 days ago
•
11
jl1019/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
17 days ago
•
4
zwt963/Qwen2.5-1.5B-Instruct-Open-R1-GRPO
Text Generation
•
Updated
17 days ago
•
14
susumuota/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
12 days ago
•
11
susumuota/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
12 days ago
•
3
calledice666/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
13 days ago
•
2
DominicX/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
16 days ago
•
2
Loong-Ma/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
about 16 hours ago
•
13
bushou/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
14 days ago
•
6
DeeLearning/Qwen2.5-1.5B-Open-R1-Distill
Text Generation
•
Updated
15 days ago
•
51
Previous
1
2
3
4
...
8
Next