hanbin committed · Commit 7f86cf7 (verified) · 1 parent: 273c390

add paper link

Files changed (1)
  1. README.md +1 -0
README.md CHANGED
@@ -5,6 +5,7 @@ license: apache-2.0
 
 ## Links
 
+- 📜 [Paper](https://arxiv.org/abs/2502.01456)
 - 📜 [Blog](https://curvy-check-498.notion.site/Process-Reinforcement-through-Implicit-Rewards-15f4fcb9c42180f1b498cc9b2eaf896f)
 - 🤗 [PRIME Collection](https://huggingface.co/PRIME-RL)
 - 🤗 [Training Data](https://huggingface.co/datasets/PRIME-RL/EurusPRM-Stage2-Data)