StarVector SVG Datasets (🏆SVG-Bench) Collection Datasets for training and evaluating SVG generation models • 11 items • Updated Jan 12 • 10
Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback Paper • 2503.22230 • Published 12 days ago • 43
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published 20 days ago • 46
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 29 days ago • 379
Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond Paper • 2503.10460 • Published 27 days ago • 27
LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters! Paper • 2502.07374 • Published Feb 11 • 39
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8 • 275