Update README.md
README.md
CHANGED
@@ -13,13 +13,13 @@ pipeline_tag: text-generation
 
 
 
-# Luth-0.6B
+# Luth-0.6B-Instruct
 
-**Luth-0.6B** is a French fine-tuned version of [Qwen3-0.6B](https://huggingface.co/Qwen/Qwen3-0.6B), trained on the [Luth-SFT](https://huggingface.co/datasets/kurakurai/luth-sft) dataset. The model has drastically improved its French capabilities in instruction following, math, and general knowledge. Additionally, its English capabilities have remained stable and have even increased in some areas.
+**Luth-0.6B-Instruct** is a French fine-tuned version of [Qwen3-0.6B](https://huggingface.co/Qwen/Qwen3-0.6B), trained on the [Luth-SFT](https://huggingface.co/datasets/kurakurai/luth-sft) dataset. The model has drastically improved its French capabilities in instruction following, math, and general knowledge. Additionally, its English capabilities have remained stable and have even increased in some areas.
 
 ## Model Details
 
-Luth
+Luth was trained using full fine-tuning on the Luth-SFT dataset with [Axolotl](https://github.com/axolotl-ai-cloud/axolotl). The resulting model was then merged with the base Qwen3-0.6B model. This process successfully retained the model's English capabilities while improving its performance on nearly all selected benchmarks in both French and English.
 
 ## Benchmark Results
 
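
The updated Model Details paragraph says the SFT checkpoint was merged with the base Qwen3-0.6B model but does not state which merging method was used. Purely as an illustration of the general idea, a minimal sketch of linear weight interpolation between the two checkpoints could look like this; the interpolation weight, the local checkpoint path, and the output directory are assumptions, not the card's actual procedure:

```python
# Illustrative sketch only: linear weight interpolation between a fine-tuned
# checkpoint and its base model. The card does not specify the merge method
# or weighting actually used for Luth-0.6B-Instruct.
import torch
from transformers import AutoModelForCausalLM

base = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3-0.6B")
tuned = AutoModelForCausalLM.from_pretrained("path/to/luth-sft-checkpoint")  # hypothetical local SFT checkpoint

alpha = 0.5  # assumed interpolation weight
base_state = base.state_dict()
merged_state = {
    name: alpha * param + (1 - alpha) * base_state[name]
    for name, param in tuned.state_dict().items()
}
base.load_state_dict(merged_state)
base.save_pretrained("luth-0.6b-instruct-merged")  # hypothetical output directory
```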
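
Since the diff only covers the card text, here is a minimal, hypothetical usage sketch with the standard `transformers` chat API. The repo id `kurakurai/Luth-0.6B-Instruct` is an assumption inferred from the dataset organization named in the card and may differ from the published model id:

```python
# Hypothetical usage sketch for the fine-tuned model; the repo id below is assumed.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "kurakurai/Luth-0.6B-Instruct"  # assumed repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# French instruction-following prompt, formatted with the chat template
# inherited from Qwen3-0.6B.
messages = [{"role": "user", "content": "Explique le théorème de Pythagore en une phrase."}]
inputs = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt")
outputs = model.generate(inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```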