Long-Context Autoregressive Video Modeling with Next-Frame Prediction Paper โข 2503.19325 โข Published 20 days ago โข 71
ShowUI: One Vision-Language-Action Model for GUI Visual Agent Paper โข 2411.17465 โข Published Nov 26, 2024 โข 87
WorldGUI: Dynamic Testing for Comprehensive Desktop GUI Automation Paper โข 2502.08047 โข Published Feb 12 โข 27