Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Pricing

  • Log In
  • Sign Up

kpyu
/
video-blip-flan-t5-xl-ego4d

Image-to-Text
Transformers PyTorch English blip-2 text2text-generation vision video-to-text image-captioning video-captioning visual-question-answering Inference Endpoints
Model card Files Files and versions Community
video-blip-flan-t5-xl-ego4d
  • 1 contributor
History: 4 commits
kpyu's picture
kpyu
Update README.md
b494c30 7 months ago
  • .gitattributes
    1.48 kB
    initial commit 7 months ago
  • README.md
    1.39 kB
    Update README.md 7 months ago
  • config.json
    7.81 kB
    Upload VideoBlipForConditionalGeneration 7 months ago
  • preprocessor_config.json
    432 Bytes
    Upload processor 7 months ago
  • pytorch_model-00001-of-00002.bin
    9.44 GB
    LFS
    Upload VideoBlipForConditionalGeneration 7 months ago
  • pytorch_model-00002-of-00002.bin
    6.33 GB
    LFS
    Upload VideoBlipForConditionalGeneration 7 months ago
  • pytorch_model.bin.index.json
    128 kB
    Upload VideoBlipForConditionalGeneration 7 months ago
  • special_tokens_map.json
    2.2 kB
    Upload processor 7 months ago
  • tokenizer.json
    2.42 MB
    Upload processor 7 months ago
  • tokenizer_config.json
    2.39 kB
    Upload processor 7 months ago