Models
45
Sort: Trending
Active filters: GRPO
alpha-ai/llama-3.2-3B-Reason-Reflect-Lite • Text Generation • Updated 14 days ago • 8
mradermacher/Cogito-R1-GGUF • Updated 11 days ago • 590
accuracy-maker/Llama-3.2-1B-GRPO-gsm8k • Text Generation • Updated 12 days ago
mradermacher/Cogito-R1-i1-GGUF • Updated 11 days ago • 1.21k
alpha-ai/Reason-With-Choice-3B-GGUF • Updated 6 days ago • 100
alpha-ai/Reason-With-Choice-3B • Text Generation • Updated 6 days ago • 10
mradermacher/Reason-With-Choice-3B-GGUF • Updated 6 days ago • 288
Daemontatox/PathFinderAI-S1 • Text Generation • Updated 4 days ago • 78
mradermacher/SmolLM2_135M_Grpo_Checkpoint-GGUF • Updated 4 days ago • 242
mradermacher/SmolLM2_135M_Grpo_Gsm8k-GGUF • Updated 5 days ago • 206
mradermacher/SmolLM2_135M_Grpo_Gsm8k-i1-GGUF • Updated 5 days ago • 418
mradermacher/PathFinderAI-S1-GGUF • Updated 4 days ago • 295
TimeLordRaps/PathFinderAI-S1-Q4_K_M-GGUF • Text Generation • Updated 4 days ago • 32
mradermacher/SmolLM2_135M_Grpo_Checkpoint-i1-GGUF • Updated 4 days ago • 410
mradermacher/PathFinderAI-S1-i1-GGUF • Updated 4 days ago • 659