Commit 22f70f1 by lucyknada (verified) · 1 Parent(s): 2f9a0be

Upload ./README.md with huggingface_hub

  1. README.md +35 -0
README.md ADDED
@@ -0,0 +1,35 @@
+ ---
+ license: apache-2.0
+ language:
+ - en
+ - fr
+ - de
+ - es
+ - it
+ - pt
+ base_model:
+ - alamios/Qwenstral-Small-3.1-0.5B
+ datasets:
+ - alamios/Mistral-Small-24B-Instruct-2501-Conversations
+ pipeline_tag: text-generation
+ library_name: transformers
+ tags:
+ - qwen
+ - qwen2.5
+ - mistral
+ - mistral-small
+ - mistral-small-3.1
+ ---
+ ### exl2 quant (measurement.json in main branch)
+ ---
+ ### check revisions for quants
+ ---
+
+
+ # Mistral-Small-3.1-DRAFT-0.5B
+
+ This model is meant to be used as a draft model for speculative decoding with [mistralai/Mistral-Small-3.1-24B-Instruct-2503](https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503) or [mistralai/Mistral-Small-24B-Instruct-2501](https://huggingface.co/mistralai/Mistral-Small-24B-Instruct-2501).
+
+ # Data info
+
+ The training data consists of Mistral's outputs and covers all kinds of tasks from various datasets in English, French, German, Spanish, Italian, and Portuguese. The model was trained for 2 epochs on 20k unique examples, for a total of 12 million tokens per epoch.
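
To illustrate the role the README assigns to this model, here is a minimal toy sketch of greedy speculative decoding. It is not the loading code for this exl2 quant and does not use any real inference library: `speculative_decode`, `greedy_decode`, `toy_target`, and `toy_draft` are hypothetical names, and the "models" are plain Python functions that map a token context to the next token id. The sketch assumes greedy (deterministic) decoding, in which case the output matches what the large model would produce on its own; real engines verify all drafted tokens in a single batched forward pass rather than one call per token.

```python
from typing import Callable, List

# Hypothetical stand-in for a model: context (token ids) -> next token id (greedy).
NextToken = Callable[[List[int]], int]


def greedy_decode(model: NextToken, prompt: List[int], max_new_tokens: int) -> List[int]:
    """Plain autoregressive decoding with a single model."""
    out = list(prompt)
    for _ in range(max_new_tokens):
        out.append(model(out))
    return out


def speculative_decode(target: NextToken, draft: NextToken, prompt: List[int],
                       max_new_tokens: int, k: int = 4) -> List[int]:
    """Greedy speculative decoding: the draft proposes k tokens, the target verifies them."""
    out = list(prompt)
    limit = len(prompt) + max_new_tokens
    while len(out) < limit:
        # 1) The small draft model proposes k tokens autoregressively.
        proposal: List[int] = []
        for _ in range(k):
            proposal.append(draft(out + proposal))

        # 2) The large target model checks each proposed token in order.
        accepted: List[int] = []
        for tok in proposal:
            expected = target(out + accepted)
            accepted.append(expected)  # always keep the target's own choice
            if expected != tok:
                break  # first mismatch: discard the rest of the proposal
        else:
            # Every drafted token matched, so the target adds one extra token.
            accepted.append(target(out + accepted))

        out.extend(accepted)
    return out[:limit]


def toy_target(ctx: List[int]) -> int:
    """Toy 'large model': a fixed arithmetic rule over the last token."""
    return (ctx[-1] * 3 + 1) % 50


def toy_draft(ctx: List[int]) -> int:
    """Toy 'draft model': agrees with the target except after multiples of 5."""
    guess = toy_target(ctx)
    return guess if ctx[-1] % 5 else (guess + 1) % 50


if __name__ == "__main__":
    print(speculative_decode(toy_target, toy_draft, prompt=[7], max_new_tokens=20))
```

The more often the draft agrees with the target, the more tokens are accepted per verification step, which is why a draft model trained on the target's own outputs is a natural fit.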