Update README.md
README.md
CHANGED
@@ -15,6 +15,7 @@ pipeline_tag: text-generation
 
 DPO fine-tuned of [rombodawg/Rombos-LLM-V2.5-Qwen-32b](https://huggingface.co/rombodawg/Rombos-LLM-V2.5-Qwen-32b) based on [Qwen/Qwen2.5-32B](https://huggingface.co/Qwen/Qwen2.5-32B)
 using the [jpacifico/french-orca-dpo-pairs-revised](https://huggingface.co/datasets/jpacifico/french-orca-dpo-pairs-revised) rlhf dataset.
+Training in French also improves the model, including in English, surpassing the performance of its base model.
 
 Long-context Support up to 128K tokens and can generate up to 8K tokens.
 
@@ -23,6 +24,10 @@ Long-context Support up to 128K tokens and can generate up to 8K tokens.
 
 Coming soon.
 
+### OpenLLM French leaderboard
+
+Coming soon.
+
 ### Usage
 
 You can run Chocolatine using the following code:
@@ -60,7 +65,7 @@ print(sequences[0]['generated_text'])
 
 ### Limitations
 
-The Chocolatine model is a quick demonstration that a base model can be easily fine-tuned to achieve compelling performance.
+The Chocolatine model series is a quick demonstration that a base model can be easily fine-tuned to achieve compelling performance.
 It does not have any moderation mechanism.
 
 - **Developed by:** Jonathan Pacifico, 2024