Text Generation
Transformers
Safetensors
qwen3_moe
Mixture of Experts
conversational
huihui-ai committed · Commit c0606bc · verified · 1 Parent(s): 970172c

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -23,7 +23,7 @@ The model is designed for natural language processing tasks, including text gene

  ## Training

- This model is obtained through full-parameter fine-tuning. For each dataset, only the corresponding experts are fine-tuned.
+ This model is obtained through full-parameter fine-tuning. For each dataset(max_length=1024), only the corresponding experts are fine-tuned.

  - **Coding**: First 20k rows of `nvidia/OpenCodeReasoning` dataset for 1 epoch.
  - **Math**: Entire `unsloth/OpenMathReasoning-mini` dataset for 1 epoch.
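
The changed line describes two mechanics: each dataset is capped at `max_length=1024`, and only the experts assigned to that dataset are updated. The README does not show how either is implemented, so the following is only a minimal sketch under stated assumptions. The parameter-name pattern (`.mlp.experts.<i>.`), the helper names, the choice of expert index, and the reading of `max_length` as plain truncation of token ids are all hypothetical and are not taken from the author's training code.

```python
import re

# Assumption: per-expert weights have names containing ".mlp.experts.<i>."
# Verify against model.named_parameters() on the real checkpoint.
EXPERT_PARAM = re.compile(r"\.mlp\.experts\.(\d+)\.")

# Value stated in the README change; treated here as a truncation limit (assumption).
MAX_LENGTH = 1024


def expert_index(param_name):
    """Return the expert index encoded in a parameter name, or None."""
    match = EXPERT_PARAM.search(param_name)
    return int(match.group(1)) if match else None


def select_trainable(param_names, expert_ids):
    """Keep only the parameters that belong to the given experts.

    Everything else (embeddings, attention, router) is treated as frozen.
    """
    wanted = set(expert_ids)
    return [name for name in param_names if expert_index(name) in wanted]


def truncate(token_ids, max_length=MAX_LENGTH):
    """Cut a token id sequence down to at most max_length tokens."""
    return token_ids[:max_length]


# Hypothetical parameter names for illustration only.
names = [
    "model.embed_tokens.weight",
    "model.layers.0.self_attn.q_proj.weight",
    "model.layers.0.mlp.gate.weight",
    "model.layers.0.mlp.experts.0.up_proj.weight",
    "model.layers.0.mlp.experts.1.up_proj.weight",
]

# Hypothetical mapping: suppose the coding dataset is assigned expert 1.
coding_params = select_trainable(names, expert_ids=[1])
```

In a real training loop, a selection like this would typically drive `requires_grad` on the matching parameters, with the remaining weights left frozen while the selected experts receive full (non-adapter) updates.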