CriteriaPO
/

qwen2.5-3b-dpo-finegrained-40-vanilla

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

qwen2.5-3b-dpo-finegrained-40-vanilla / checkpoint-5000 /zero_to_fp32.py

Commit History

Training in progress, step 5000, checkpoint

140c47f
verified

obiwit commited on 20 days ago