Evaluating and Enhancing LLMs for Multi-turn Text-to-SQL with Multiple Question Types Paper β’ 2412.17867 β’ Published Dec 21, 2024 β’ 2
SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning Paper β’ 2504.08600 β’ Published 13 days ago β’ 26
Kimi-VL-A3B Collection Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking β’ 6 items β’ Updated 12 days ago β’ 61
DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance Paper β’ 2504.01724 β’ Published 22 days ago β’ 64
Open Deep Search: Democratizing Search with Open-source Reasoning Agents Paper β’ 2503.20201 β’ Published 30 days ago β’ 46
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM Mar 12 β’ 399
π©βπ» OlympicCoder Collection Reasoning datasets and models for competitive coding β’ 4 items β’ Updated Mar 11 β’ 16
view article Article LeRobot goes to driving school: Worldβs largest open-source self-driving dataset Mar 11 β’ 77
Unified Reward Model for Multimodal Understanding and Generation Paper β’ 2503.05236 β’ Published Mar 7 β’ 121
AI Engineering Collection A collection of arXiv papers from Chip Huyen's AI Engineering organized by chapter and ordered by when each appears in the book. β’ 239 items β’ Updated 26 days ago β’ 16
view article Article Hugging Face and JFrog partner to make AI Security more transparent Mar 4 β’ 21