LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published 14 days ago • 105
RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published 10 days ago • 47
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Paper • 2411.14405 • Published 8 days ago • 51
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning Paper • 2411.02337 • Published 25 days ago • 36
AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents Paper • 2410.24024 • Published 29 days ago • 48
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems Paper • 2411.02959 • Published 24 days ago • 64