sijuntan committed
Commit 67660de · verified · 1 Parent(s): 50d1cf0

Update README.md

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -75,7 +75,7 @@ A more detailed description of the training recipe can be found in our [blog pos
  We report Pass@1 accuracy averaged over 16 samples for each problem.
  | Model | AIME 2024 | MATH 500 | AMC 2023 | Minerva Math | OlympiadBench | Avg. |
  |-------|-----------|-----------|-----------|--------------|---------------|------|
- | 2.5-7B-Instruct | 13.3 | 79.8 | 50.6 | 34.6 | 40.7 | 43.8 |
+ | Qwen-2.5-7B-Instruct | 13.3 | 79.8 | 50.6 | 34.6 | 40.7 | 43.8 |
  | rStar-Math-7B | 26.7 | 78.4 | 47.5 | - | 47.1 | - |
  | Eurus-2-7B-PRIME | 26.7 | 79.2 | 57.8 | 38.6 | 42.1 | 48.9 |
  | Qwen2.5-7B-SimpleRL | 26.7 | 82.4 | 62.5 | <strong>39.7</strong> | 43.3 | 50.9 |
@@ -107,7 +107,7 @@ This permissive license ensures that researchers, developers, and enthusiasts wo
  ```bibtex
  @misc{deepscaler2025,
  title={DeepScaleR: Surpassing O1-Preview with a 1.5B Model by Scaling RL},
- author={Michael Luo and Sijun Tan and Justin Wong and Xiaoxiang Shi and William Tang and Manan Roongta and Colin Cai and Jeffrey Luo and Tianjun Zhang and Erran Li and Raluca Ada Popa and Ion Stoica},
+ author={Michael Luo and Sijun Tan and Justin Wong and Xiaoxiang Shi and William Y. Tang and Manan Roongta and Colin Cai and Jeffrey Luo and Tianjun Zhang and Li Erran Li and Raluca Ada Popa and Ion Stoica},
  year={2025},
  howpublished={\url{https://pretty-radio-b75.notion.site/DeepScaleR-Surpassing-O1-Preview-with-a-1-5B-Model-by-Scaling-RL-19681902c1468005bed8ca303013a4e2}},
  note={Notion Blog}