Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Andrew Zhao's picture
3 21 22

Andrew Zhao

andrewzh
Inflammable1230's profile picture WyattTheSkid's profile picture asmodun's profile picture
·
https://andrewzh112.github.io/
  • _AndrewZhao
  • Andrewzh112
  • andrewqzhao

AI & ML interests

Reinforcement Learning, Agents

Recent Activity

upvoted a paper 3 days ago
A Survey of Reinforcement Learning for Large Reasoning Models
upvoted a paper about 2 months ago
A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
upvoted a paper 2 months ago
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
View all activity

Organizations

None yet

andrewzh 's models 3

andrewzh/Absolute_Zero_Reasoner-Coder-14b

15B • Updated May 6 • 9 • 28

andrewzh/Absolute_Zero_Reasoner-Coder-3b

3B • Updated May 6 • 75 • 11

andrewzh/Absolute_Zero_Reasoner-Coder-7b

8B • Updated May 5 • 2.1k • 19
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs