Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
1
Libraries
Datasets
Languages
Licenses
Other
Reset Tasks
Multimodal
Image-Text-to-Text
Visual Question Answering
Document Question Answering
Computer Vision
Depth Estimation
Image Classification
Object Detection
Image Segmentation
Text-to-Image
Image-to-Text
Image-to-Image
Image-to-Video
Unconditional Image Generation
Video Classification
Text-to-Video
Zero-Shot Image Classification
Mask Generation
Zero-Shot Object Detection
Text-to-3D
Image-to-3D
Image Feature Extraction
Natural Language Processing
Text Classification
Token Classification
Table Question Answering
Question Answering
Zero-Shot Classification
Translation
Summarization
Feature Extraction
Text Generation
Text2Text Generation
Fill-Mask
Sentence Similarity
Audio
Text-to-Speech
Text-to-Audio
Automatic Speech Recognition
Audio-to-Audio
Audio Classification
Voice Activity Detection
Tabular
Tabular Classification
Tabular Regression
Time Series Forecasting
Reinforcement Learning
Reinforcement Learning
Robotics
Other
Graph Machine Learning
Apply filters
Models
519
Full-text search
Edit filters
Sort: Trending
Active filters:
image-to-text
Clear all
laion/mscoco_finetuned_CoCa-ViT-L-14-laion2B-s13B-b90k
Image-to-Text
•
Updated
Jan 16
•
41.3k
•
19
Abdou/vit-swin-base-224-gpt2-image-captioning
Image-to-Text
•
Updated
Apr 29, 2023
•
98
•
2
Salesforce/blip2-flan-t5-xl
Image-to-Text
•
Updated
Dec 13, 2023
•
55.3k
•
50
Salesforce/blip2-opt-6.7b
Image-to-Text
•
Updated
Mar 27
•
13.9k
•
65
Salesforce/blip2-opt-2.7b-coco
Image-to-Text
•
Updated
Mar 31
•
7.14k
•
7
Salesforce/blip2-opt-6.7b-coco
Image-to-Text
•
Updated
Mar 31
•
1.51k
•
28
Salesforce/blip2-flan-t5-xl-coco
Image-to-Text
•
Updated
Mar 27
•
846
•
11
sayakpaul/git-base-pokemon
Image-to-Text
•
Updated
Mar 26, 2023
•
5
•
1
svjack/vit-gpt-diffusion-zh
Image-to-Text
•
Updated
Feb 20, 2023
•
49
•
2
leeyunjai/img2txt
Image-to-Text
•
Updated
Sep 6, 2023
•
40
•
4
IDEA-CCNL/Taiyi-BLIP-750M-Chinese
Image-to-Text
•
Updated
Jun 6, 2023
•
10
•
14
jaimin/image_caption
Image-to-Text
•
Updated
Feb 19, 2023
•
17
•
2
google/pix2struct-textcaps-base
Image-to-Text
•
Updated
Sep 7, 2023
•
8.83k
•
27
to-be/donut-base-finetuned-invoices
Image-to-Text
•
Updated
Mar 3, 2023
•
450
•
8
ddobokki/ko-trocr
Image-to-Text
•
Updated
Sep 7, 2023
•
595
•
6
Tomatolovve/DemoTest
Image-to-Text
•
Updated
Mar 10, 2023
•
6
Shamima/Blip-finetuned-sd-1k
Image-to-Text
•
Updated
Mar 10, 2023
•
55
•
1
google/pix2struct-textcaps-large
Image-to-Text
•
Updated
May 19, 2023
•
275
•
12
google/pix2struct-base
Image-to-Text
•
Updated
Dec 24, 2023
•
12.4k
•
61
katanaml-org/invoices-donut-model-v1
Image-to-Text
•
Updated
May 11, 2023
•
685
•
35
Teklia/pylaia-home-alcar
Image-to-Text
•
Updated
Mar 13
Flova/omr_transformer
Image-to-Text
•
Updated
Oct 5, 2023
•
153
•
4
UBC-NLP/Qalam
Image-to-Text
•
Updated
Jul 5, 2023
•
196
•
3
google/pix2struct-large
Image-to-Text
•
Updated
Sep 6, 2023
•
11.7k
•
27
JB/rrg_emnlp_impression_128_rl
Image-to-Text
•
Updated
Mar 24, 2023
•
12
•
1
Maciel/Muge-Image-Caption
Image-to-Text
•
Updated
Mar 25, 2023
•
29
•
5
DunnBC22/trocr-base-printed-synthetic_dataset_ocr
Image-to-Text
•
Updated
Jun 10, 2023
•
102
•
1
nkasmanoff/sky-scribe
Image-to-Text
•
Updated
Mar 30, 2023
•
4
baseplate/vit-gpt2-image-captioning
Image-to-Text
•
Updated
Apr 5, 2023
•
35
•
1
anaghasavit/trocr-processor
Image-to-Text
•
Updated
Apr 16, 2023
•
25
•
3
Previous
1
...
3
4
5
6
7
...
18
Next