---
license: apache-2.0
language:
- en
library_name: transformers
pipeline_tag: image-to-text
tags:
- caption
- image caption
- captioning
- img2txt
- image-to-text
- coco
- flickr
- gan
- gpt
- image
- vision
- text
datasets:
- flickr30k
- coco2017
---

Pretrained image captioning model, trained on the coco2017 + flickr30k caption datasets / gtranslate.

![](sample1.jpg)
![](sample2.jpg)
![](sample3.jpg)
![](sample4.jpg)
![](sample5.jpg)
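
Since the metadata lists `transformers` as the library and `image-to-text` as the pipeline tag, loading the model might look roughly like the sketch below. It is not taken from the author's code. `MODEL_ID` is a hypothetical placeholder because this card does not state the repository id, `extract_captions` and `caption_image` are illustrative helper names, and the sketch assumes a `transformers` version that still provides the `image-to-text` pipeline task.

```python
from typing import Any, Dict, List

# Hypothetical placeholder: the card does not state the repository id.
MODEL_ID = "your-username/your-caption-model"


def extract_captions(outputs: List[Dict[str, Any]]) -> List[str]:
    """Pull caption strings out of image-to-text pipeline output.

    The pipeline returns a list of dicts, each with a "generated_text" key.
    """
    return [item["generated_text"].strip() for item in outputs]


def caption_image(image_path: str, model_id: str = MODEL_ID) -> List[str]:
    """Caption one image (local path or URL) with the given checkpoint."""
    # Imported lazily so extract_captions works without transformers installed.
    from transformers import pipeline

    # Assumption: the checkpoint is compatible with the image-to-text task.
    captioner = pipeline("image-to-text", model=model_id)
    return extract_captions(captioner(image_path))
```

With a real repository id substituted for the placeholder, `caption_image("sample1.jpg")` would return a list of caption strings for the first sample image.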