Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
2
4
Hanning Zhang
HanningZhang
Follow
circulartext's profile picture
RogerZhuo's profile picture
2 followers
·
7 following
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 22 hours ago
HanningZhang/scalebio_distill_qwen_math_uniform
published
a dataset
about 22 hours ago
HanningZhang/scalebio_distill_qwen_math_uniform
updated
a dataset
about 22 hours ago
HanningZhang/scalebio_distill_qwen_math
View all activity
Organizations
HanningZhang
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a dataset
about 22 hours ago
HanningZhang/scalebio_distill_qwen_math_uniform
Viewer
•
Updated
about 22 hours ago
•
2k
•
8
published
a dataset
about 22 hours ago
HanningZhang/scalebio_distill_qwen_math_uniform
Viewer
•
Updated
about 22 hours ago
•
2k
•
8
updated
a dataset
about 22 hours ago
HanningZhang/scalebio_distill_qwen_math
Viewer
•
Updated
about 22 hours ago
•
2k
•
12
published
a dataset
about 22 hours ago
HanningZhang/scalebio_distill_qwen_math
Viewer
•
Updated
about 22 hours ago
•
2k
•
12
updated
a dataset
6 days ago
HanningZhang/scalebio_r1_distill_math
Viewer
•
Updated
6 days ago
•
1.15k
•
17
published
a dataset
6 days ago
HanningZhang/scalebio_r1_distill_math
Viewer
•
Updated
6 days ago
•
1.15k
•
17
updated
a model
7 days ago
HanningZhang/Llama3.1-RAG-Reward
Text Generation
•
Updated
7 days ago
•
9
published
a model
7 days ago
HanningZhang/Llama3.1-RAG-Reward
Text Generation
•
Updated
7 days ago
•
9
liked
a dataset
7 days ago
nvidia/AceMath-Instruct-Training-Data
Viewer
•
Updated
Jan 17
•
5.56M
•
1.75k
•
45
authored
a paper
7 days ago
Self-rewarding correction for mathematical reasoning
Paper
•
2502.19613
•
Published
9 days ago
•
75
upvoted
a
paper
8 days ago
Self-rewarding correction for mathematical reasoning
Paper
•
2502.19613
•
Published
9 days ago
•
75
updated
a model
12 days ago
HanningZhang/Qwen-PPO-Selfcorr-Step290-Vanilla
Updated
12 days ago
•
15
published
a model
12 days ago
HanningZhang/Qwen-PPO-Selfcorr-Step290-Vanilla
Updated
12 days ago
•
15
updated
a model
12 days ago
HanningZhang/Qwen-PPO-Selfcorr-Step280-Vanilla
Updated
12 days ago
•
14
published
a model
12 days ago
HanningZhang/Qwen-PPO-Selfcorr-Step280-Vanilla
Updated
12 days ago
•
14
updated
a model
12 days ago
HanningZhang/Qwen-PPO-Selfcorr-Step270-Vanilla
Updated
12 days ago
•
8
published
a model
12 days ago
HanningZhang/Qwen-PPO-Selfcorr-Step270-Vanilla
Updated
12 days ago
•
8
updated
a model
12 days ago
HanningZhang/Qwen-PPO-Selfcorr-Step260-Vanilla
Updated
12 days ago
•
12
published
a model
12 days ago
HanningZhang/Qwen-PPO-Selfcorr-Step260-Vanilla
Updated
12 days ago
•
12
updated
a model
12 days ago
HanningZhang/Qwen-PPO-Selfcorr-Step250-Vanilla
Updated
12 days ago
•
9
Load more