- rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking
  Paper • 2501.04519 • Published • 275
- Evolving Deeper LLM Thinking
  Paper • 2501.09891 • Published • 114
- START: Self-taught Reasoner with Tools
  Paper • 2503.04625 • Published • 105
- SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild
  Paper • 2503.18892 • Published • 28
Bozhou Li
zooblastlbz
AI & ML interests
None yet
Recent Activity
- Updated a model 4 days ago: zooblastlbz/mmpe
- Updated a collection 12 days ago: reason
- Updated a collection 12 days ago: reason