Safetensors
qwen2
hanbin committed · Commit a43649a · verified · 1 parent: 7f86cf7

update cite

  1. README.md +8 -4
README.md CHANGED
@@ -205,10 +205,14 @@ To leverage the capibility of **EurusPRM** better, we add ``Step K`` (where K is
  ## Citation
 
  ```latex
- @misc{cui2024process,
- title={Process Reinforcement through Implicit Rewards},
- author={Ganqu Cui and Lifan Yuan and Zefan Wang and Hanbin Wang and Wendi Li and Bingxiang He and Yuchen Fan and Tianyu Yu and Qixin Xu and Weize Chen and Jiarui Yuan and Huayu Chen and Kaiyan Zhang and Xingtai Lv and Shuo Wang and Yuan Yao and Hao Peng and Yu Cheng and Zhiyuan Liu and Maosong Sun and Bowen Zhou and Ning Ding},
- year={2025}
+ @misc{cui2025processreinforcementimplicitrewards,
+ title={Process Reinforcement through Implicit Rewards},
+ author={Ganqu Cui and Lifan Yuan and Zefan Wang and Hanbin Wang and Wendi Li and Bingxiang He and Yuchen Fan and Tianyu Yu and Qixin Xu and Weize Chen and Jiarui Yuan and Huayu Chen and Kaiyan Zhang and Xingtai Lv and Shuo Wang and Yuan Yao and Xu Han and Hao Peng and Yu Cheng and Zhiyuan Liu and Maosong Sun and Bowen Zhou and Ning Ding},
+ year={2025},
+ eprint={2502.01456},
+ archivePrefix={arXiv},
+ primaryClass={cs.LG},
+ url={https://arxiv.org/abs/2502.01456},
  }
  ```
 