Vfrz commited on
Commit
7aca0e3
·
verified ·
1 Parent(s): 79abcaa

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +42 -0
README.md CHANGED
@@ -10,3 +10,45 @@ base_model:
10
  - Qwen/Qwen3-1.7B-Base
11
  pipeline_tag: text-generation
12
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
10
  - Qwen/Qwen3-1.7B-Base
11
  pipeline_tag: text-generation
12
  ---
13
+ # [MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning](https://arxiv.org/abs/xxx)
14
+
15
+ ## Qwen3-1.7B-MegaScience
16
+
17
+ ### Training Recipe
18
+
19
+ - **LR**: 5e-6
20
+ - **LR Schedule**: Cosine
21
+ - **Batch Size**: 512
22
+ - **Max Length**: 4,096
23
+ - **Warm Up Ratio**: 0.05
24
+ - **Epochs**: 3
25
+
26
+ ### Evaluation Results
27
+
28
+ <div style="display: flex; justify-content: left; gap: 20px;">
29
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/616bfc2b40e2f69baa1c7add/abIVZ2XB9D-o-TCyvOkDE.png" alt="Data Pipeline" style="width:80%;">
30
+ </div>
31
+
32
+ <div style="display: flex; justify-content: left; gap: 20px;">
33
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/616bfc2b40e2f69baa1c7add/xFTJ7nevc3S4UYJxUS7ue.png" alt="Data Pipeline" style="width:80%;">
34
+ </div>
35
+
36
+ ### More about MegaScience
37
+
38
+ <div style="display: flex; justify-content: left; gap: 20px;">
39
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/616bfc2b40e2f69baa1c7add/VogIpBbjfNxXFP9DfVMms.png" alt="Data Pipeline" style="width:100%;">
40
+ </div>
41
+
42
+ ## Citation
43
+
44
+ Check out our [paper](https://arxiv.org/abs/xxx) for more details. If you use our dataset or find our work useful, please cite
45
+
46
+ ```
47
+ @article{fan2025megascience,
48
+ title={MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning},
49
+ author={Fan, Run-Ze and Wang, Zengzhi and Liu, Pengfei},
50
+ year={2025},
51
+ journal={arXiv preprint arXiv:xxx},
52
+ url={https://arxiv.org/abs/xxx}
53
+ }
54
+ ```