SmolLM2-135M-SFT-DPO / model.safetensors

Commit History

End of training
78a856a
verified

hsila committed on