How to do VQA?

#2 by Jetcombo

How do I provide an image as a context input?
I also followed your code here: https://github.com/ZrrSkywalker/LLaMA-Adapter/blob/main/example.py
How do I provide an image object or an image path in prompt_input?

I could do it in the Hugging Face app by just pasting the image URL inside the context box. Hopefully an image path would work the same way in the code (example.py).

Hi @Jetcombo, we do not release the training code for VQA / image captioning with LLaMA-Adapter, but you can refer to the inference code for COCO captioning here.
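
For anyone landing here, a minimal sketch of what image-conditioned inference could look like: it loads an image from a local path, preprocesses it, and passes it to the model together with the question. The `load_model` and `model.generate` names are hypothetical placeholders (assumptions), not the confirmed LLaMA-Adapter API; check the repo's COCO-captioning inference code for the actual calls.

```python
# Minimal sketch, NOT the confirmed LLaMA-Adapter API: `load_model` and
# `model.generate` below are hypothetical placeholders for whatever the
# repo's inference code actually exposes.
from PIL import Image
from torchvision import transforms

# Standard CLIP-style preprocessing; the exact resolution and normalization
# stats used by LLaMA-Adapter may differ (assumption).
preprocess = transforms.Compose([
    transforms.Resize(224),
    transforms.CenterCrop(224),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.48145466, 0.4578275, 0.40821073],
                         std=[0.26862954, 0.26130258, 0.27577711]),
])

def answer_question(model, image_path: str, question: str) -> str:
    """Load an image from a local path and ask the model about it."""
    image = Image.open(image_path).convert("RGB")
    image_tensor = preprocess(image).unsqueeze(0)  # add batch dimension

    # Hypothetical call: the real inference code may take the image tensor
    # and the text prompt in a different form.
    return model.generate(image_tensor, prompt=question)

# model = load_model("path/to/checkpoint")  # hypothetical loader
# print(answer_question(model, "cat.jpg", "What animal is in the picture?"))
```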
