DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published 8 days ago • 104
VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search Paper • 2503.10582 • Published 13 days ago • 20
KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding Paper • 2503.02951 • Published 22 days ago • 27