jpacifico
/

Chocolatine-32B-Instruct-DPO-v1.2

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

jpacifico commited on Nov 28, 2024

Commit

9ba59cd

·

verified ·

1 Parent(s): 7779496

Update README.md

Files changed (1) hide show

README.md +6 -1

README.md CHANGED Viewed

@@ -15,6 +15,7 @@ pipeline_tag: text-generation
 DPO fine-tuned of [rombodawg/Rombos-LLM-V2.5-Qwen-32b](https://huggingface.co/rombodawg/Rombos-LLM-V2.5-Qwen-32b) based on [Qwen/Qwen2.5-32B](https://huggingface.co/Qwen/Qwen2.5-32B)
 using the [jpacifico/french-orca-dpo-pairs-revised](https://huggingface.co/datasets/jpacifico/french-orca-dpo-pairs-revised) rlhf dataset.
 Long-context Support up to 128K tokens and can generate up to 8K tokens.
@@ -23,6 +24,10 @@ Long-context Support up to 128K tokens and can generate up to 8K tokens.
 Coming soon.
 ### Usage
 You can run Chocolatine using the following code:
@@ -60,7 +65,7 @@ print(sequences[0]['generated_text'])
 ### Limitations
-The Chocolatine model is a quick demonstration that a base model can be easily fine-tuned to achieve compelling performance.
 It does not have any moderation mechanism.
 - **Developed by:** Jonathan Pacifico, 2024

 DPO fine-tuned of [rombodawg/Rombos-LLM-V2.5-Qwen-32b](https://huggingface.co/rombodawg/Rombos-LLM-V2.5-Qwen-32b) based on [Qwen/Qwen2.5-32B](https://huggingface.co/Qwen/Qwen2.5-32B)
 using the [jpacifico/french-orca-dpo-pairs-revised](https://huggingface.co/datasets/jpacifico/french-orca-dpo-pairs-revised) rlhf dataset.
+Training in French also improves the model, including in English, surpassing the performance of its base model.
 Long-context Support up to 128K tokens and can generate up to 8K tokens.
 Coming soon.
+### OpenLLM French leaderboard
+Coming soon.
 ### Usage
 You can run Chocolatine using the following code:
 ### Limitations
+The Chocolatine model series is a quick demonstration that a base model can be easily fine-tuned to achieve compelling performance.
 It does not have any moderation mechanism.
 - **Developed by:** Jonathan Pacifico, 2024