view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 β’ 125
ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving Paper β’ 2404.16771 β’ Published 22 days ago β’ 16
Idefics2 πΆ Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. β’ 11 items β’ Updated 12 days ago β’ 76
HF-curated models available on Workers AI Collection A collection of models curated with Hugging Face that can be run on Cloudflare's Workers AI serverless inference platform. β’ 15 items β’ Updated Apr 2 β’ 48
StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control Paper β’ 2403.09055 β’ Published Mar 14 β’ 23
Foundation Models for Vision 𧩠Collection Foundation models for computer vision. ⒠24 items ⒠Updated Mar 11 ⒠16
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers Paper β’ 2402.19479 β’ Published Feb 29 β’ 30
Gemma release Collection Groups the Gemma models released by the Google team. β’ 40 items β’ Updated 3 days ago β’ 302