bhavitvyamalik commited on
Commit
47dc27b
1 Parent(s): 0c7ca41

usage bug fix

Browse files
Files changed (1) hide show
  1. sections/intro.md +1 -1
sections/intro.md CHANGED
@@ -1,3 +1,3 @@
1
  This demo uses [CLIP-mBART50 model checkpoint](https://huggingface.co/flax-community/clip-vit-base-patch32_mbart-large-50) to predict caption for a given image in 4 languages (English, French, German, Spanish). Training was done using image encoder (CLIP-ViT) and text decoder (mBART50) with approximately 5 million image-text pairs taken from the [Conceptual 12M dataset](https://github.com/google-research-datasets/conceptual-12m) translated using [MarianMT](https://huggingface.co/transformers/model_doc/marian.html).
2
 
3
- For more details, click on `Usage` or `Article` 🤗 below.
1
  This demo uses [CLIP-mBART50 model checkpoint](https://huggingface.co/flax-community/clip-vit-base-patch32_mbart-large-50) to predict caption for a given image in 4 languages (English, French, German, Spanish). Training was done using image encoder (CLIP-ViT) and text decoder (mBART50) with approximately 5 million image-text pairs taken from the [Conceptual 12M dataset](https://github.com/google-research-datasets/conceptual-12m) translated using [MarianMT](https://huggingface.co/transformers/model_doc/marian.html).
2
 
3
+ For more details, click on `Usage` 🤗 above.