MobA: A Two-Level Agent System for Efficient Mobile Task Automation Paper • 2410.13757 • Published Oct 17, 2024
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments Paper • 2404.07972 • Published Apr 11, 2024
Mobile-Env: An Evaluation Platform and Benchmark for Interactive Agents in LLM Era Paper • 2305.08144 • Published May 14, 2023