Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
2
2
24
Zhiwei He
zwhe99
Follow
Joelzhang's profile picture
TingchenFu's profile picture
Clent's profile picture
4 followers
·
2 following
https://zwhe99.github.io/
zwhe99
zwhe99
AI & ML interests
Natural Language Processing
Recent Activity
commented
on
a paper
12 days ago
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't
commented
on
a paper
13 days ago
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't
commented
on
a paper
13 days ago
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't
View all activity
Organizations
None yet
Papers
4
arxiv:
2501.18585
arxiv:
2412.21187
arxiv:
2411.18462
arxiv:
2203.08394
spaces
1
pinned
Paused
3
MAPS Mt
📚
models
9
Sort:Â Recently updated
zwhe99/Qwen2.5-Math-7B-orz
Text Generation
•
Updated
27 days ago
•
3
zwhe99/Qwen2.5-7B-orz
Text Generation
•
Updated
Mar 2
•
22
zwhe99/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation
•
Updated
Jan 24
•
6
zwhe99/rm_ptm.infolm-l_p.1
Updated
Apr 2, 2024
zwhe99/rm_ptm.xlmr-l_p.1
Updated
Apr 2, 2024
zwhe99/mm_ptm.xlmr-l_p.1
Updated
Mar 26, 2024
zwhe99/wmt21-comet-qe-mqm
Updated
Nov 10, 2023
zwhe99/wmt21-comet-qe-da
Updated
Nov 10, 2023
zwhe99/TAL-SJTU-WMT22-EnLiv
Updated
Nov 28, 2022
datasets
15
Sort:Â Recently updated
zwhe99/simplerl
Viewer
•
Updated
Feb 28
•
8.52k
•
142
zwhe99/simplerl-minerva-math
Viewer
•
Updated
Feb 10
•
272
•
396
zwhe99/simplerl-OlympiadBench
Viewer
•
Updated
Feb 10
•
675
•
260
zwhe99/aime90
Viewer
•
Updated
Jan 29
•
90
•
334
•
1
zwhe99/gsm8k
Viewer
•
Updated
Dec 17, 2024
•
8.79k
•
78
zwhe99/mathpile-text
Viewer
•
Updated
Dec 14, 2024
•
469k
•
114
zwhe99/mp-textbooks
Viewer
•
Updated
Dec 14, 2024
•
3.98k
•
37
zwhe99/MATH-DIFFIC
Viewer
•
Updated
Dec 11, 2024
•
17.5k
•
58
zwhe99/MATH
Viewer
•
Updated
Oct 31, 2024
•
17.5k
•
588
•
1
zwhe99/amc23
Viewer
•
Updated
Oct 30, 2024
•
40
•
512
•
1
Expand 15 datasets