Update README.md
**Luth-0.6B-Instruct** is a French fine-tuned version of [Qwen3-0.6B](https://huggingface.co/Qwen/Qwen3-0.6B), trained on the [Luth-SFT](https://huggingface.co/datasets/kurakurai/luth-sft) dataset. Fine-tuning substantially improved the model's French capabilities in instruction following, math, and general knowledge, while its English capabilities remained stable and even improved in some areas.

Our evaluation, training, and data scripts are available on [GitHub](https://github.com/kurakurai/Luth).
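
For quick testing, here is a minimal usage sketch with 🤗 Transformers; the repository id `kurakurai/Luth-0.6B-Instruct` and the example prompt are assumptions for illustration:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed repository id for this model card; adjust if the model lives under a different name.
model_id = "kurakurai/Luth-0.6B-Instruct"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")

# French instruction-following example.
messages = [{"role": "user", "content": "Explique brièvement le théorème de Pythagore."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=256)
# Decode only the newly generated continuation, not the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```
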
## Model Details
Luth was trained using full fine-tuning on the Luth-SFT dataset with [Axolotl](https://github.com/axolotl-ai-cloud/axolotl). The resulting model was then merged with the base Qwen3-0.6B model. This process successfully retained the model's English capabilities while improving its performance on nearly all selected benchmarks in both French and English.
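
The exact merge recipe is not specified here; as a rough sketch of the idea, a simple linear interpolation between the fine-tuned weights and the base weights could look like the following (the checkpoint path, output directory, and mixing ratio `alpha` are hypothetical):

```python
import torch
from transformers import AutoModelForCausalLM

# Hypothetical path to the fine-tuned checkpoint produced by the Axolotl run; the base model is Qwen3-0.6B.
base = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3-0.6B", torch_dtype=torch.float32)
tuned = AutoModelForCausalLM.from_pretrained("path/to/luth-sft-checkpoint", torch_dtype=torch.float32)

alpha = 0.5  # hypothetical mixing ratio; the actual merge settings are not stated here
base_state = base.state_dict()
merged_state = {
    name: alpha * param + (1.0 - alpha) * base_state[name]
    for name, param in tuned.state_dict().items()
}

tuned.load_state_dict(merged_state)
tuned.save_pretrained("luth-0.6b-merged")  # hypothetical output directory
```
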
## Benchmark Results
We used [LightEval](https://github.com/huggingface/lighteval) for evaluation, with custom tasks for the French benchmarks. The models were evaluated with `temperature=0`.
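
For reference, `temperature=0` corresponds to deterministic, greedy decoding; this is not the LightEval configuration itself, just a sketch of the equivalent setting in 🤗 Transformers terms:

```python
from transformers import GenerationConfig

# Greedy decoding: sampling disabled, so the temperature setting is effectively irrelevant.
eval_generation_config = GenerationConfig(do_sample=False, max_new_tokens=512)
```
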
**French Evaluation:**