Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
1
Libraries
Datasets
Languages
Licenses
Other
Reset Tasks
Multimodal
Image-Text-to-Text
Visual Question Answering
Document Question Answering
Computer Vision
Depth Estimation
Image Classification
Object Detection
Image Segmentation
Text-to-Image
Image-to-Text
Image-to-Image
Image-to-Video
Unconditional Image Generation
Video Classification
Text-to-Video
Zero-Shot Image Classification
Mask Generation
Zero-Shot Object Detection
Text-to-3D
Image-to-3D
Image Feature Extraction
Natural Language Processing
Text Classification
Token Classification
Table Question Answering
Question Answering
Zero-Shot Classification
Translation
Summarization
Feature Extraction
Text Generation
Text2Text Generation
Fill-Mask
Sentence Similarity
Audio
Text-to-Speech
Text-to-Audio
Automatic Speech Recognition
Audio-to-Audio
Audio Classification
Voice Activity Detection
Tabular
Tabular Classification
Tabular Regression
Time Series Forecasting
Reinforcement Learning
Reinforcement Learning
Robotics
Other
Graph Machine Learning
Apply filters
Models
517
Full-text search
Edit filters
Sort: Trending
Active filters:
image-to-text
Clear all
Teklia/pylaia-norhand-v1-postprocessed
Image-to-Text
•
Updated
Mar 13
microsoft/git-large-r
Image-to-Text
•
Updated
Apr 27, 2023
•
20
•
2
microsoft/git-large-r-coco
Image-to-Text
•
Updated
Feb 8, 2023
•
924
•
9
microsoft/git-large-r-textcaps
Image-to-Text
•
Updated
Feb 8, 2023
•
1.16k
•
10
tifa-benchmark/promptcap-coco-vqa
Image-to-Text
•
Updated
Dec 11, 2023
•
1.95k
•
12
SpringAI/AiGenImg2TxtV1
Image-to-Text
•
Updated
Jan 26, 2023
•
11
•
1
artificialguybr/textcaps-teste2
Image-to-Text
•
Updated
Jun 16, 2023
•
14
•
3
shikunl/prismer
Image-to-Text
•
Updated
Apr 5, 2023
•
9
nathansutton/generate-cxr
Image-to-Text
•
Updated
Feb 23
•
359
•
6
laion/mscoco_finetuned_CoCa-ViT-L-14-laion2B-s13B-b90k
Image-to-Text
•
Updated
Jan 16
•
42k
•
19
Abdou/vit-swin-base-224-gpt2-image-captioning
Image-to-Text
•
Updated
Apr 29, 2023
•
85
•
2
Salesforce/blip2-flan-t5-xl
Image-to-Text
•
Updated
Dec 13, 2023
•
45.4k
•
50
Salesforce/blip2-opt-6.7b
Image-to-Text
•
Updated
Mar 27
•
14.2k
•
65
Salesforce/blip2-opt-2.7b-coco
Image-to-Text
•
Updated
Mar 31
•
7.34k
•
7
Salesforce/blip2-opt-6.7b-coco
Image-to-Text
•
Updated
Mar 31
•
1.4k
•
28
Salesforce/blip2-flan-t5-xl-coco
Image-to-Text
•
Updated
Mar 27
•
913
•
11
sayakpaul/git-base-pokemon
Image-to-Text
•
Updated
Mar 26, 2023
•
6
•
1
svjack/vit-gpt-diffusion-zh
Image-to-Text
•
Updated
Feb 20, 2023
•
49
•
2
leeyunjai/img2txt
Image-to-Text
•
Updated
Sep 6, 2023
•
36
•
4
IDEA-CCNL/Taiyi-BLIP-750M-Chinese
Image-to-Text
•
Updated
Jun 6, 2023
•
10
•
14
jaimin/image_caption
Image-to-Text
•
Updated
Feb 19, 2023
•
17
•
2
google/pix2struct-textcaps-base
Image-to-Text
•
Updated
Sep 7, 2023
•
9.38k
•
27
Tomatolovve/DemoTest
Image-to-Text
•
Updated
Mar 10, 2023
•
7
Shamima/Blip-finetuned-sd-1k
Image-to-Text
•
Updated
Mar 10, 2023
•
56
•
1
google/pix2struct-textcaps-large
Image-to-Text
•
Updated
May 19, 2023
•
253
•
12
google/pix2struct-base
Image-to-Text
•
Updated
Dec 24, 2023
•
12.4k
•
61
katanaml-org/invoices-donut-model-v1
Image-to-Text
•
Updated
May 11, 2023
•
629
•
35
Teklia/pylaia-home-alcar
Image-to-Text
•
Updated
Mar 13
Flova/omr_transformer
Image-to-Text
•
Updated
Oct 5, 2023
•
149
•
4
UBC-NLP/Qalam
Image-to-Text
•
Updated
Jul 5, 2023
•
212
•
3
Previous
1
...
3
4
5
6
7
...
18
Next