UI Agent Collection a collection of algorithmic agents for user interfaces/interactions, program synthesis, and robots โข 312 items โข Updated about 13 hours ago โข 47
Dataset Creation Collection Spaces and utilities for creating datasets and getting them on the Hub โข 3 items โข Updated Nov 10, 2024 โข 10
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 โข 15 items โข Updated Dec 6, 2024 โข 576
Molmo Collection Artifacts for open multimodal language models. โข 5 items โข Updated about 2 hours ago โข 299
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale Paper โข 2409.08264 โข Published Sep 12, 2024 โข 45
LLaVA-Video Collection Models focus on video understanding (previously known as LLaVA-NeXT-Video). โข 8 items โข Updated 20 days ago โข 60
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma โข 16 items โข Updated 1 day ago โข 145
Vision Language Models Papers ๐ผ๏ธ๐ฌ๐ Collection Papers about vision-language models, most important ones are on top of the list. โข 27 items โข Updated Apr 30, 2024 โข 36