Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
17
4
4
Hanbin Wang
hanbin
Follow
thomwolf's profile picture
SteveSHEN's profile picture
Pent's profile picture
10 followers
·
1 following
https://wanghanbinpanda.github.io/
wanghanbinpanda
AI & ML interests
Code Intelligence and LLM Reasoning (Code, Math)
Recent Activity
new
activity
about 1 hour ago
PRIME-RL/Eurus-2-7B-PRIME:
Add library_name and pipeline_tag
new
activity
about 3 hours ago
PRIME-RL/Eurus-2-7B-PRIME:
Evaluation
upvoted
an
article
1 day ago
Process Reinforcement through Implicit Rewards
View all activity
Articles
Process Reinforcement through Implicit Rewards
1 day ago
•
5
Organizations
hanbin
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
PRIME-RL/Eurus-2-7B-PRIME
about 1 hour ago
Add library_name and pipeline_tag
#2 opened about 2 hours ago by
nielsr
New activity in
PRIME-RL/Eurus-2-7B-PRIME
about 3 hours ago
Evaluation
2
#1 opened about 17 hours ago by
tugstugi
upvoted
an
article
1 day ago
view article
Article
Process Reinforcement through Implicit Rewards
By
ganqu
•
1 day ago
•
5
liked
a model
2 days ago
PRIME-RL/Eurus-2-7B-PRIME
Text Generation
•
Updated
about 1 hour ago
•
87
•
16
updated
2 datasets
2 days ago
PRIME-RL/Eurus-2-SFT-Data
Viewer
•
Updated
2 days ago
•
230k
•
23
•
1
PRIME-RL/Eurus-2-RL-Data
Viewer
•
Updated
2 days ago
•
484k
•
26
•
7
updated
2 models
2 days ago
PRIME-RL/Eurus-2-7B-PRIME
Text Generation
•
Updated
about 1 hour ago
•
87
•
16
PRIME-RL/Eurus-2-7B-SFT
Updated
2 days ago
•
35
•
2
updated
3 models
5 days ago
PRIME-RL/EurusPRM-Stage2
Updated
about 4 hours ago
•
25
•
3
PRIME-RL/EurusPRM-Stage1
Updated
about 4 hours ago
•
34
•
1
PRIME-RL/Eurus-2-7B-SFT
Updated
2 days ago
•
35
•
2
updated
a dataset
5 days ago
PRIME-RL/Eurus-2-SFT-Data
Viewer
•
Updated
2 days ago
•
230k
•
23
•
1
upvoted
a
paper
about 1 month ago
Free Process Rewards without Process Labels
Paper
•
2412.01981
•
Published
Dec 2, 2024
•
28
updated
a dataset
about 1 month ago
hanbin/UltraInteract_sft_all_end_20240906
Viewer
•
Updated
Nov 26, 2024
•
681k
•
69
updated
a dataset
2 months ago
hanbin/UltraInteract_pair_all_20240911_v2_gt_v3
Preview
•
Updated
Nov 4, 2024
•
2
updated
a model
2 months ago
hanbin/o1_sft_all_abla_numina_oly_orca
Updated
Nov 4, 2024
•
3
liked
a dataset
3 months ago
yangweiqing/DebugEval
Updated
Aug 24, 2024
•
16
•
1
New activity in
livecodebench/code_generation_lite
3 months ago
can not load dataset
1
#2 opened 3 months ago by
hanbin
updated
a model
8 months ago
openbmb/Eurux-8x22b-kto
Text Generation
•
Updated
Apr 29, 2024
•
21
•
8
New activity in
openbmb/Eurux-8x22b-kto
9 months ago
Upload folder using huggingface_hub
#1 opened 9 months ago by
hanbin
Load more