saeed abhari
galois77
AI & ML interests
None yet
Recent Activity
Updated a collection 9 days ago: RL
Updated a collection 9 days ago: Benchmarks and challenges
Updated a collection 9 days ago: Reasoning
Organizations
None yet
Collections
6
Towards General-Purpose Model-Free Reinforcement Learning
Paper • 2501.16142 • Published • 26
RL + Transformer = A General-Purpose Problem Solver
Paper • 2501.14176 • Published • 24
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Paper • 2501.17161 • Published • 106
Process-Supervised Reinforcement Learning for Code Generation
Paper • 2502.01715 • Published
Models
None public yet
Datasets
None public yet