Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
AaryanK
/
Qwen_2.5_3B_GRPO_Reasoning_XIOSERV
like
1
PyTorch
GGUF
doi:10.57967/hf/5366
qwen2
Reasoning
GRPO
DeepSeek
CoT
finetune
conversational
License:
cc-by-nc-2.0
Model card
Files
Files and versions
xet
Community
2
Deploy
Use this model
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (0)
Sort: Recently created
Improve language tag
#2 opened 4 months ago by
lbourdois
Adding `safetensors` variant of this model
#1 opened 4 months ago by
SFconvertbot