Base Model: Qwen/DeepSeek-R1-Distill-Qwen-7B

Training Epochs: 3

Training Objective: RL only

Training Data: ReasoningEval/Huatuo-RL