Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling Paper β’ 2412.05271 β’ Published 5 days ago β’ 88
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents Paper β’ 2410.23218 β’ Published Oct 30 β’ 46
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents Paper β’ 2410.23218 β’ Published Oct 30 β’ 46
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents Paper β’ 2410.23218 β’ Published Oct 30 β’ 46 β’ 3
AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant Paper β’ 2410.18603 β’ Published Oct 24 β’ 30
A Controlled Study on Long Context Extension and Generalization in LLMs Paper β’ 2409.12181 β’ Published Sep 18 β’ 43
A Controlled Study on Long Context Extension and Generalization in LLMs Paper β’ 2409.12181 β’ Published Sep 18 β’ 43
MagicMan: Generative Novel View Synthesis of Humans with 3D-Aware Diffusion and Iterative Refinement Paper β’ 2408.14211 β’ Published Aug 26 β’ 10
A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond Paper β’ 2403.14734 β’ Published Mar 21 β’ 22