Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
arunmadhusudh
/
Vit-gpt2-flickr8k
like
0
Image-to-Text
Transformers
PyTorch
English
vision-encoder-decoder
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use in Transformers
Edit model card
A pre trained ViT and GPT2 is fine tuned on flickr8k dataset.
Downloads last month
20