Why is it the text generation pipeline?

by usamacogniz - opened

Brother, I love your work.
I have a question: why did you select the text generation pipeline for all these LLaVA models? Why didn't you select the Visual Question Answering multimodal pipeline?
Don't you think that would make these LLaVA models much easier to use with the Hugging Face API?

Thank you for the suggestion. The pipeline is selected automatically by HF (we did not choose one).
I am not very familiar with the HF API and inference endpoints, and we are working to improve the integration. Contributions are welcome! Thanks.
