HuggingFaceM4/idefics2-8b
Image-Text-to-Text
•
Updated
•
42.6k
•
312
Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation.