Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
albertmundu
's Collections
Vision-Language Collections
Vision-Language Collections
updated
Sep 26, 2023
Some of the popular models for image-text domain
Upvote
-
Salesforce/blip-image-captioning-large
Image-to-Text
•
Updated
Dec 7, 2023
•
1.35M
•
•
1.25k
Salesforce/blip-image-captioning-base
Image-to-Text
•
Updated
Aug 1, 2023
•
1.24M
•
•
541
Salesforce/instructblip-vicuna-7b
Image-Text-to-Text
•
Updated
Nov 21, 2024
•
238k
•
86
microsoft/git-large-coco
Image-to-Text
•
Updated
Jun 26, 2023
•
7.72k
•
101
Salesforce/blip2-opt-2.7b
Image-Text-to-Text
•
Updated
Nov 21, 2024
•
277k
•
325
Salesforce/blip2-flan-t5-xxl
Image-Text-to-Text
•
Updated
Nov 21, 2024
•
6.95k
•
85
Salesforce/instructblip-flan-t5-xxl
Image-Text-to-Text
•
Updated
Nov 21, 2024
•
926
•
21
Salesforce/instructblip-flan-t5-xl
Image-Text-to-Text
•
Updated
Nov 21, 2024
•
7.09k
•
29
microsoft/trocr-base-handwritten
Image-to-Text
•
Updated
May 27, 2024
•
828k
•
•
358
microsoft/trocr-base-printed
Image-to-Text
•
Updated
May 27, 2024
•
119k
•
•
156
microsoft/trocr-large-printed
Image-to-Text
•
Updated
May 27, 2024
•
238k
•
145
microsoft/trocr-small-printed
Image-to-Text
•
Updated
May 27, 2024
•
51.7k
•
33
microsoft/trocr-large-handwritten
Image-to-Text
•
Updated
May 27, 2024
•
30.1k
•
97
google/pix2struct-large
Image-to-Text
•
Updated
Sep 6, 2023
•
27.7k
•
34
Upvote
-
Share collection
View history
Collection guide
Browse collections