UI Agent Collection a collection of algorithmic agents for user interfaces/interactions, program synthesis, and robots โข 318 items โข Updated about 2 hours ago โข 47
Dataset Creation Collection Spaces and utilities for creating datasets and getting them on the Hub โข 3 items โข Updated Nov 10, 2024 โข 10
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 โข 15 items โข Updated Dec 6, 2024 โข 575
Molmo Collection Artifacts for open multimodal language models. โข 5 items โข Updated 2 days ago โข 299
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale Paper โข 2409.08264 โข Published Sep 12, 2024 โข 45
LLaVA-Video Collection Models focus on video understanding (previously known as LLaVA-NeXT-Video). โข 8 items โข Updated 22 days ago โข 60
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma โข 16 items โข Updated 4 days ago โข 145
Vision Language Models Papers ๐ผ๏ธ๐ฌ๐ Collection Papers about vision-language models, most important ones are on top of the list. โข 27 items โข Updated Apr 30, 2024 โข 36