Hanbin Wang

hanbin

https://wanghanbinpanda.github.io/

wanghanbinpanda

AI & ML interests

Code Intelligence and LLM Reasoning (Code, Math)

Recent Activity

new activity 22 days ago

PRIME-RL/Eurus-2-7B-PRIME: real usage query

authored a paper about 1 month ago

Process Reinforcement through Implicit Rewards

updated a dataset about 1 month ago

PRIME-RL/Eurus-2-RL-Data

hanbin's activity

New activity in PRIME-RL/Eurus-2-7B-PRIME 22 days ago

real usage query

#4 opened 22 days ago by asidaddy

authored a paper about 1 month ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 55

updated 3 datasets about 1 month ago

updated 4 models about 1 month ago

PRIME-RL/EurusPRM-Stage2

Updated 21 days ago • 6.53k • 6

PRIME-RL/Eurus-2-7B-PRIME

Text Generation • Updated 21 days ago • 731 • 60

PRIME-RL/Eurus-2-7B-SFT

Updated 21 days ago • 3.28k • 2

PRIME-RL/EurusPRM-Stage1

Updated 21 days ago • 6.4k • 4

updated a Space about 1 month ago

README

🏃

upvoted a paper about 1 month ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 55

commented a paper about 1 month ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 55

New activity in PRIME-RL/Eurus-2-RL-Data about 1 month ago

some empty code ground truths (roughly 1k in train)

#3 opened about 1 month ago by rawsh

New activity in PRIME-RL/Eurus-2-7B-PRIME 2 months ago

Evaluation

#1 opened 2 months ago by tugstugi

Add library_name and pipeline_tag

#2 opened 2 months ago by nielsr

upvoted an article 2 months ago

Process Reinforcement through Implicit Rewards

Article • and 1 other • Jan 3 • 24

published an article 2 months ago

Process Reinforcement through Implicit Rewards

Article • and 1 other • Jan 3 • 24

liked a model 2 months ago

PRIME-RL/Eurus-2-7B-PRIME

Text Generation • Updated 21 days ago • 731 • 60

updated 2 datasets 2 months ago

PRIME-RL/Eurus-2-SFT-Data

Viewer • Updated 21 days ago • 230k • 247 • 11

PRIME-RL/Eurus-2-RL-Data

Viewer • Updated 21 days ago • 483k • 2.59k • 28