Safetensors
qwen2
stingning commited on
Commit
2f38b94
·
verified ·
1 Parent(s): 5a63919

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -8
README.md CHANGED
@@ -88,14 +88,11 @@ After finetuning, the performance of our Eurus-2-7B-SFT is shown in the followin
88
  ## Citation
89
 
90
  ```latex
91
- @misc{cui2025processreinforcementimplicitrewards,
92
- title={Process Reinforcement through Implicit Rewards},
93
- author={Ganqu Cui and Lifan Yuan and Zefan Wang and Hanbin Wang and Wendi Li and Bingxiang He and Yuchen Fan and Tianyu Yu and Qixin Xu and Weize Chen and Jiarui Yuan and Huayu Chen and Kaiyan Zhang and Xingtai Lv and Shuo Wang and Yuan Yao and Xu Han and Hao Peng and Yu Cheng and Zhiyuan Liu and Maosong Sun and Bowen Zhou and Ning Ding},
94
- year={2025},
95
- eprint={2502.01456},
96
- archivePrefix={arXiv},
97
- primaryClass={cs.LG},
98
- url={https://arxiv.org/abs/2502.01456},
99
  }
100
  ```
101
 
 
88
  ## Citation
89
 
90
  ```latex
91
+ @article{cui2025process,
92
+ title={Process reinforcement through implicit rewards},
93
+ author={Cui, Ganqu and Yuan, Lifan and Wang, Zefan and Wang, Hanbin and Li, Wendi and He, Bingxiang and Fan, Yuchen and Yu, Tianyu and Xu, Qixin and Chen, Weize and others},
94
+ journal={arXiv preprint arXiv:2502.01456},
95
+ year={2025}
 
 
 
96
  }
97
  ```
98