Wladastic commited on
Commit
86460e3
·
verified ·
1 Parent(s): a846975

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -11,10 +11,10 @@ tags:
11
  - llama
12
  - think
13
  ---
 
14
 
15
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/646ba0d4c7f672003c851ed2/rsr_FSCzYXN5OTf5UrvCU.png)
16
 
17
- # MiniThink-1B-base
18
 
19
  MiniThink-1B is an experiment to reproduce the "Aha!" moment in AI.
20
  Is is trained using a modified version of the method used in the [Unsloth R1 training blog](https://unsloth.ai/blog/r1-reasoning) and the [notebook provided for training LLama 3.1 8B to learn R1 reasoning ](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Llama3.1_(8B)-GRPO.ipynb).
 
11
  - llama
12
  - think
13
  ---
14
+ # MiniThink-1B-base
15
 
16
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/646ba0d4c7f672003c851ed2/rsr_FSCzYXN5OTf5UrvCU.png)
17
 
 
18
 
19
  MiniThink-1B is an experiment to reproduce the "Aha!" moment in AI.
20
  Is is trained using a modified version of the method used in the [Unsloth R1 training blog](https://unsloth.ai/blog/r1-reasoning) and the [notebook provided for training LLama 3.1 8B to learn R1 reasoning ](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Llama3.1_(8B)-GRPO.ipynb).