Med-RLVR: Emerging Medical Reasoning from a 3B Base Model via Reinforcement Learning Paper • 2502.19655 • Published 8 days ago
MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning Paper • 2502.19634 • Published 8 days ago • 56
R1-T1: Fully Incentivizing Translation Capability in LLMs via Reasoning Learning Paper • 2502.19735 • Published 7 days ago • 7
AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO Paper • 2502.14669 • Published 14 days ago • 11