MaxLSB commited on
Commit
942e2bd
·
verified ·
1 Parent(s): 8498ba5

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +18 -0
README.md CHANGED
@@ -11,6 +11,24 @@ base_model:
11
  pipeline_tag: text-generation
12
  ---
13
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
14
  ## Citation
15
 
16
  ```bibtex
 
11
  pipeline_tag: text-generation
12
  ---
13
 
14
+ # Luth-0.6B
15
+
16
+ **Luth-0.6B** is a French fine-tuned version of [Qwen3-0.6B](https://huggingface.co/Qwen/Qwen3-0.6B), trained on the [Luth-SFT](https://huggingface.co/datasets/kurakurai/luth-sft) dataset. The model has drastically improved its French capabilities in instruction following, math, and general knowledge. Additionally, its English capabilities have remained stable and have even increased in some areas.
17
+
18
+ ## Model Details
19
+
20
+ Luth-0.6B was trained using full fine-tuning on the Luth-SFT dataset with [Axolotl](https://github.com/axolotl-ai-cloud/axolotl). The resulting model was then merged with the base Qwen3-0.6B model. This process successfully retained the model's English capabilities while improving its performance on nearly all benchmarks in both French and English.
21
+
22
+ ## Benchmark Results
23
+
24
+ **French Evaluation:**
25
+
26
+ ![French Evaluation](media/french_evaluation.png)
27
+
28
+ **English Evaluation:**
29
+
30
+ ![English Evaluation](media/english_evaluation.png)
31
+
32
  ## Citation
33
 
34
  ```bibtex