mention how to load v0.1 in readme

README.md CHANGED

@@ -40,6 +40,11 @@ To build SmolLM-Instruct, we finetuned the base models on publicly available dat

v0.2 models are better at staying on topic and responding appropriately to standard prompts, such as greetings and questions about their role as AI assistants. SmolLM-360M-Instruct (v0.2) has a 63.3% win rate over SmolLM-360M-Instruct (v0.1) on AlpacaEval. You can find the details [here](https://huggingface.co/datasets/HuggingFaceTB/alpaca_eval_details/).

+You can load the v0.1 checkpoint by specifying `revision="v0.1"` in the transformers code:
+```python
+model = AutoModelForCausalLM.from_pretrained("HuggingFaceTB/SmolLM-1.7B-Instruct", revision="v0.1")
+```
+
## Usage

### Local Applications

@@ -90,6 +95,8 @@ We train the models using the [alignment-handbook](https://github.com/huggingfac
- warmup ratio 0.1
- global batch size 262k tokens

+You can find the training recipe here: https://github.com/huggingface/alignment-handbook/tree/smollm/recipes/smollm
+
# Citation
```bash
@misc{allal2024SmolLM,
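For context, a fuller sketch of what the added snippet does: `revision="v0.1"` pins the download to the `v0.1` tag of the model repo, and it can be passed to the tokenizer as well so weights and tokenizer stay consistent. The imports, prompt, and generation settings below are illustrative additions, not part of the commit:

```python
# Sketch: load the pinned v0.1 revision of SmolLM-1.7B-Instruct and run one prompt.
from transformers import AutoModelForCausalLM, AutoTokenizer

checkpoint = "HuggingFaceTB/SmolLM-1.7B-Instruct"

# revision="v0.1" pins both downloads to the v0.1 tag of the model repo.
tokenizer = AutoTokenizer.from_pretrained(checkpoint, revision="v0.1")
model = AutoModelForCausalLM.from_pretrained(checkpoint, revision="v0.1")

# Format the prompt with the instruct model's chat template, then generate.
messages = [{"role": "user", "content": "What is the capital of France?"}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
)
outputs = model.generate(input_ids, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```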