Andrew Zhao's picture

Andrew Zhao

andrewzh

·

https://andrewzh112.github.io/

AI & ML interests

Reinforcement Learning, Agents

Recent Activity

upvoted a paper 3 days ago

A Survey of Reinforcement Learning for Large Reasoning Models

upvoted a paper about 2 months ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

upvoted a paper 2 months ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

View all activity

Organizations

None yet

andrewzh 's models 3

andrewzh/Absolute_Zero_Reasoner-Coder-14b

15B • Updated May 6 • 9 • 28

andrewzh/Absolute_Zero_Reasoner-Coder-3b

3B • Updated May 6 • 75 • 11

andrewzh/Absolute_Zero_Reasoner-Coder-7b

8B • Updated May 5 • 2.1k • 19