Qwen2.5-3B-reasoning-medical-symptoms-GRPO-f16 / pytorch_model-00001-of-00002.bin

Commit History

Trained with Unsloth
b515bf7
verified

dumbequation commited on