gchhablani's picture
Fix typo
ab61451
  • This demo loads the FlaxCLIPVisionMarianMT present in the model directory of this repository. The checkpoint is loaded from ckpt/ckpt-23999 which is pre-trained checkpoint with 24kk steps. 100 random validation set examples are present in the references.tsv with respective images in the images directory.

  • We provide English Translation of the generated caption and reference captions for users who are not well-acquainted with Spanish. This is done using mtranslate to keep things flexible enough and needs internet connection as it uses the Google Translate API. We will also add the original captions soon.

  • The sidebar contains generation parameters such as Number of Beams, Top-P, Temperature which will be used when generating the caption.

  • Clicking on Generate Caption will generate the caption in Spanish.