sergiopaniego/Qwen2.5-VL-3B-Instruct-trl-mpo-rlaif-v

Generated from Trainer


Qwen2.5-VL-3B-Instruct-trl-mpo-rlaif-v / merges.txt

sergiopaniego HF Staff

Training in progress, step 10

03bf6f9 verified about 1 month ago

1.67 MB

File too large to display, you can check the raw version instead.