---
license: apache-2.0
language:
- en
pipeline_tag: image-to-text
---

A pretrained ViT encoder and GPT-2 decoder, fine-tuned on the Flickr8k dataset for image captioning.
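
The snippet below is a minimal sketch of how a checkpoint like this might be used for captioning. It assumes the weights load through the `transformers` `image-to-text` pipeline, which may not hold for every library version. The repository id `your-username/vit-gpt2-flickr8k` is a hypothetical placeholder, not the real name of this model, and the helper names `load_captioner` and `extract_captions` are illustrative only.

```python
from typing import Any

# Hypothetical placeholder: replace with the actual Hub repository id of this model.
MODEL_ID = "your-username/vit-gpt2-flickr8k"


def load_captioner(model_id: str = MODEL_ID) -> Any:
    """Build an image-to-text pipeline for the given checkpoint.

    transformers is imported lazily so this module can be imported
    without it. Assumed install: pip install transformers torch pillow
    """
    from transformers import pipeline

    return pipeline("image-to-text", model=model_id)


def extract_captions(outputs: list) -> list:
    """Collect caption strings from the pipeline output.

    The image-to-text pipeline returns a list of dicts, each holding
    the decoded caption under the "generated_text" key.
    """
    return [item["generated_text"].strip() for item in outputs]


# Example usage (downloads the checkpoint, so it is left commented out):
# captioner = load_captioner()
# print(extract_captions(captioner("path/to/image.jpg")))
```

The pipeline accepts a local path, a URL, or a `PIL.Image`. Since Flickr8k is a small dataset, captions for images far from its everyday-scene distribution may be unreliable.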