
This demo uses a CLIP-Vision-Marian model checkpoint to predict a Spanish caption for a given image. The model combines an image encoder with a text decoder and was trained on approximately 2.5 million image-text pairs from the Conceptual 12M dataset, with the captions translated from English to Spanish using MarianMT.
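The caption translation step could look roughly like the sketch below. It is an illustration, not the project's actual preprocessing script: the helper names (`batched`, `translate_captions`, `load_marian_en_es`) are hypothetical, and the checkpoint name `Helsinki-NLP/opus-mt-en-es` is an assumption, since the text above only says that MarianMT English to Spanish was used.

```python
from typing import Callable, Iterable, List


def batched(items: List[str], batch_size: int) -> Iterable[List[str]]:
    """Yield consecutive slices of `items` with at most `batch_size` entries."""
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]


def translate_captions(
    captions: List[str],
    translate_batch: Callable[[List[str]], List[str]],
    batch_size: int = 32,
) -> List[str]:
    """Translate captions batch by batch, preserving the input order."""
    translated: List[str] = []
    for batch in batched(captions, batch_size):
        out = translate_batch(batch)
        if len(out) != len(batch):
            raise ValueError("translator must return one output per input caption")
        translated.extend(out)
    return translated


def load_marian_en_es(model_name: str = "Helsinki-NLP/opus-mt-en-es"):
    """Build a batch translator from a MarianMT checkpoint (name is an assumption)."""
    # Imported lazily so the batching helpers work without transformers installed.
    from transformers import MarianMTModel, MarianTokenizer

    tokenizer = MarianTokenizer.from_pretrained(model_name)
    model = MarianMTModel.from_pretrained(model_name)

    def translate_batch(texts: List[str]) -> List[str]:
        # Tokenize the English captions as one padded batch of PyTorch tensors.
        inputs = tokenizer(texts, return_tensors="pt", padding=True, truncation=True)
        generated = model.generate(**inputs)
        return tokenizer.batch_decode(generated, skip_special_tokens=True)

    return translate_batch
```

With `transformers` and PyTorch installed, `translate_captions(english_captions, load_marian_en_es())` would return the Spanish captions in the same order as the input. Passing the translator in as a function keeps the batching logic independent of the model, so it can be checked with a stand-in translator before running millions of captions.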

For more details, click on Usage or Article 🤗 below.