qgallouedec
/

Qwen2.5-0.5B-GRPO-2776-next

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

Qwen2.5-0.5B-GRPO-2776-next / vocab.json

qgallouedec's picture

qgallouedec HF staff

End of training

4c48794 verified 11 days ago

history contribute delete

2.78 MB

File too large to display, you can check the raw version instead.