15
UniVTG
π
Ask questions about YouTube video content
image captioning, VQA
BLIP2 (cutting edge image captioning) in π€transformers
Compare different visual question answering
Play with all the pix2struct variants in this d
Cutting edge open-vocabulary object detection app
Generate text responses in a chat format
Chat with GPT-4 using your API key
Generate summaries for long-form text
Generate text descriptions from images