OpenCodeReasoning: Advancing Data Distillation for Competitive Coding Paper • 2504.01943 • Published 4 days ago • 5 • 2
OpenCodeReasoning Collection Reasoning data for supervised finetuning of LLMs to advance data distillation for competitive coding • 2 items • Updated 2 days ago • 1
Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model Paper • 2503.24290 • Published 6 days ago • 58
Expanding RL with Verifiable Rewards Across Diverse Domains Paper • 2503.23829 • Published 7 days ago • 17
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning Paper • 2503.19470 • Published 13 days ago • 15
Open Deep Search: Democratizing Search with Open-source Reasoning Agents Paper • 2503.20201 • Published 12 days ago • 42
Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking Paper • 2503.19855 • Published 12 days ago • 25
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published 17 days ago • 46
NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets Article • 20 days ago • 33
Thera Arbitrary-Scale Super-Resolution 🔥 Enhance image quality with real-time super-resolution • Running on L4 • 270
Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond Paper • 2503.10460 • Published 24 days ago • 27