huihui-ai/Huihui-MoE-1B-A0.6B-SFT

Text Generation

Mixture of Experts

huihui-ai committed on Jun 13

Commit c0606bc · verified · 1 Parent(s): 970172c

Update README.md

Files changed (1)

README.md +1 -1

@@ -23,7 +23,7 @@ The model is designed for natural language processing tasks, including text gene
 ## Training
-This model is obtained through full-parameter fine-tuning. For each dataset, only the corresponding experts are fine-tuned.
+This model is obtained through full-parameter fine-tuning. For each dataset(max_length=1024), only the corresponding experts are fine-tuned.
  - **Coding**: First 20k rows of `nvidia/OpenCodeReasoning` dataset for 1 epoch.
  - **Math**: Entire `unsloth/OpenMathReasoning-mini` dataset for 1 epoch.
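
The training note in this change can be read as two steps: cap each example at `max_length=1024`, then update only the experts tied to the current dataset. The sketch below is one possible reading, not the author's training script. It assumes the cap is applied by truncation (the README does not say whether long rows are truncated or dropped), and the dataset-to-expert mapping `EXPERT_FOR_DATASET` and the `.experts.<index>.` parameter naming are hypothetical, chosen only for illustration.

```python
import re

MAX_LENGTH = 1024  # value taken from the README change

# Hypothetical mapping: the README does not state which expert serves which dataset.
EXPERT_FOR_DATASET = {"coding": 0, "math": 1}

# Hypothetical naming scheme for expert parameters: "...experts.<index>...."
_EXPERT_RE = re.compile(r"\.experts\.(\d+)\.")


def truncate(token_ids, max_length=MAX_LENGTH):
    """Keep at most max_length tokens of one example (assumes truncation, not dropping)."""
    return token_ids[:max_length]


def trainable_names(param_names, dataset):
    """Return the parameter names belonging to the expert assigned to `dataset`."""
    target = EXPERT_FOR_DATASET[dataset]
    selected = []
    for name in param_names:
        match = _EXPERT_RE.search(name)
        if match and int(match.group(1)) == target:
            selected.append(name)
    return selected
```

In a real training loop, the names returned by `trainable_names` would be the only parameters left with gradients enabled while a given dataset is being processed; everything else would stay frozen.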