jerry128/Qwen2.5-7B-Instruct-HOTPOTQA-GRPO-STEP-CL

Text Generation

Generated from Trainer

text-generation-inference

Qwen2.5-7B-Instruct-HOTPOTQA-GRPO-STEP-CL/runs/Mar03_05-32-23_ny2g3r14hh1-lxc

1 contributor

History: 1 commit

Upload folder using huggingface_hub

56a3daf verified 6 months ago

events.out.tfevents.1740980024.ny2g3r14hh1-lxc.948420.0

789 kB (xet)
