ShowUI: One Vision-Language-Action Model for GUI Visual Agent • Paper • 2411.17465 • Published 2 days ago • 54
The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use • Paper • 2411.10323 • Published 13 days ago • 27
Qwen2-VL • Collection • Vision-language model series based on Qwen2 • 15 items • Updated about 11 hours ago • 161