Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
21
22
Andrew Zhao
andrewzh
Follow
Inflammable1230's profile picture
WyattTheSkid's profile picture
asmodun's profile picture
48 followers
·
3 following
https://andrewzh112.github.io/
_AndrewZhao
Andrewzh112
andrewqzhao
AI & ML interests
Reinforcement Learning, Agents
Recent Activity
upvoted
a
paper
3 days ago
A Survey of Reinforcement Learning for Large Reasoning Models
upvoted
a
paper
about 2 months ago
A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
upvoted
a
paper
2 months ago
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
View all activity
Organizations
None yet
andrewzh
's models
3
Sort: Recently updated
andrewzh/Absolute_Zero_Reasoner-Coder-14b
15B
•
Updated
May 6
•
9
•
28
andrewzh/Absolute_Zero_Reasoner-Coder-3b
3B
•
Updated
May 6
•
75
•
11
andrewzh/Absolute_Zero_Reasoner-Coder-7b
8B
•
Updated
May 5
•
2.1k
•
19