Hanbin Wang

hanbin

AI & ML interests

Code Intelligence and LLM Reasoning (Code, Math)

Recent Activity

authored a paper about 1 month ago
Process Reinforcement through Implicit Rewards
updated a dataset about 1 month ago
PRIME-RL/Eurus-2-RL-Data
View all activity

Organizations

OpenBMB's profile picture PRIME's profile picture

hanbin's activity

New activity in PRIME-RL/Eurus-2-7B-PRIME 22 days ago

real usage query

1
#4 opened 22 days ago by
asidaddy
updated a Space about 1 month ago
New activity in PRIME-RL/Eurus-2-RL-Data about 1 month ago
New activity in PRIME-RL/Eurus-2-7B-PRIME 2 months ago

Evaluation

6
#1 opened 2 months ago by
tugstugi
upvoted an article 2 months ago
view article
Article

Process Reinforcement through Implicit Rewards

By ganqu and 1 other
24
published an article 2 months ago
view article
Article

Process Reinforcement through Implicit Rewards

By ganqu and 1 other
24