LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published 8 days ago • 93
The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use Paper • 2411.10323 • Published 8 days ago • 26