Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Paper • 2504.13837 • Published 9 days ago • 106 • 21
360Zhinao2 Collection 360Zhinao2 language model, include both base and chat model • 7 items • Updated Mar 5 • 1