What matters when building vision-language models? Paper β’ 2405.02246 β’ Published 15 days ago β’ 73
Idefics2 πΆ Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. β’ 11 items β’ Updated 12 days ago β’ 76
Zero-Shot Detection and Segmentation Collection Demos of projects focused on zero-shot detection and segmentation. β’ 4 items β’ Updated Feb 7 β’ 3