Update README.md
README.md
CHANGED
@@ -11,6 +11,24 @@ base_model:
pipeline_tag: text-generation
---

# Luth-0.6B

**Luth-0.6B** is a French fine-tuned version of [Qwen3-0.6B](https://huggingface.co/Qwen/Qwen3-0.6B), trained on the [Luth-SFT](https://huggingface.co/datasets/kurakurai/luth-sft) dataset. Compared with the base model, it is substantially stronger in French instruction following, math, and general knowledge, while its English capabilities remain stable and improve in some areas.

## Model Details

Luth-0.6B was trained with full fine-tuning on the Luth-SFT dataset using [Axolotl](https://github.com/axolotl-ai-cloud/axolotl). The resulting model was then merged with the base Qwen3-0.6B model. This process preserved the model's English capabilities while improving performance on nearly all benchmarks in both French and English.
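
The README does not say which method was used to merge the fine-tuned checkpoint with the base model. As a rough illustration only, the sketch below assumes a plain linear interpolation of matching weights. The function name `linear_merge`, the `alpha` parameter, and the toy weights are hypothetical and are not taken from the Luth training setup.

```python
from typing import Dict, List

# Toy stand-in for a model state dict: parameter name -> flat list of values.
Weights = Dict[str, List[float]]


def linear_merge(base: Weights, tuned: Weights, alpha: float = 0.5) -> Weights:
    """Blend two checkpoints as (1 - alpha) * base + alpha * tuned.

    Assumption: linear interpolation is used here purely for illustration;
    the actual Luth merge method is not stated in the source.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in [0, 1]")
    if base.keys() != tuned.keys():
        raise ValueError("checkpoints must share the same parameter names")

    merged: Weights = {}
    for name, base_vals in base.items():
        tuned_vals = tuned[name]
        if len(base_vals) != len(tuned_vals):
            raise ValueError(f"shape mismatch for parameter {name!r}")
        # Interpolate each value of this parameter independently.
        merged[name] = [
            (1.0 - alpha) * b + alpha * t for b, t in zip(base_vals, tuned_vals)
        ]
    return merged


# Hypothetical two-parameter "models" to show the call shape.
base = {"layer.weight": [0.0, 2.0], "layer.bias": [1.0]}
tuned = {"layer.weight": [1.0, 4.0], "layer.bias": [3.0]}
merged = linear_merge(base, tuned, alpha=0.5)
```

With `alpha=0.0` the result equals the base weights and with `alpha=1.0` it equals the fine-tuned weights, so intermediate values trade retained base behavior against fine-tuned behavior.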

## Benchmark Results

**French Evaluation:**



**English Evaluation:**



## Citation
```bibtex