DeepSeek-R1-Distill-Qwen-32B-bnb-4bit-DPO-tuned / model-00013-of-00014.safetensors

Commit History

Trained with Unsloth
7ad734a
verified

imhmdf commited on