The collection for the Project "Simple Reinforcement Learning for Reasoning"
HKUST NLP Group
university
Collections (6)
models (35)
hkust-nlp/Qwen-2.5-Math-7B-SimpleRL • Updated • 28 • 1
hkust-nlp/Qwen-2.5-Math-7B-SimpleRL-Zero • Updated • 35 • 2
hkust-nlp/preselect-fasttext-classifier • Updated
hkust-nlp/qwen2.5-7b-coder_codeio_stage1 • Updated • 18
hkust-nlp/qwen2.5-7b-coder_codeio • Updated • 24
hkust-nlp/qwen2.5-7b-coder_codeio_pp_stage1 • Updated • 28
hkust-nlp/qwen2.5-7b-coder_codeio_pp • Updated • 33 • 4
hkust-nlp/llama3.1-8b_codeio_stage1 • Updated • 19
hkust-nlp/llama3.1-8b_codeio • Updated • 22
hkust-nlp/llama3.1-8b_codeio_pp_stage1 • Updated • 18
datasets (21)
hkust-nlp/PreSelect-100B • Viewer • Updated • 54.5M • 293 • 1
hkust-nlp/CodeIO-PyEdu-Reasoning • Preview • Updated • 465 • 36
hkust-nlp/CodeIO-PyEdu-Reasoning-Raw • Updated • 64
hkust-nlp/SynCSE-partial-NLI • Viewer • Updated • 263k • 62 • 2
hkust-nlp/SynCSE-scratch-NLI • Viewer • Updated • 276k • 84 • 2
hkust-nlp/gsm8k-fix • Viewer • Updated • 7.47k • 91 • 2
hkust-nlp/dart-math-uniform • Viewer • Updated • 591k • 103 • 9
hkust-nlp/vrt-baseline • Viewer • Updated • 591k • 54 • 1
hkust-nlp/dart-math-hard • Viewer • Updated • 585k • 122 • 13
hkust-nlp/dart-math-pool-gsm8k-query-info • Viewer • Updated • 7.47k • 66 • 2