KeyError: "Unknown task image-text-to-text

#9
by nktsanyal - opened

When trying to load the model using transformer's pipeline, I'm getting the following error:

from transformers import pipeline

pipe = pipeline("image-text-to-text", model="llava-hf/llava-v1.6-34b-hf")
Traceback (most recent call last):
raise KeyError(
KeyError: "Unknown task image-text-to-text, available tasks are ['audio-classification', 'automatic-speech-recognition', 'conversational', 'depth-estimation', 'document-question-answering', 'feature-extraction', 'fill-mask', 'image-classification', 'image-feature-extraction', 'image-segmentation', 'image-to-image', 'image-to-text', 'mask-generation', 'ner', 'object-detection', 'question-answering', 'sentiment-analysis', 'summarization', 'table-question-answering', 'text-classification', 'text-generation', 'text-to-audio', 'text-to-speech', 'text2text-generation', 'token-classification', 'translation', 'video-classification', 'visual-question-answering', 'vqa', 'zero-shot-audio-classification', 'zero-shot-classification', 'zero-shot-image-classification', 'zero-shot-object-detection', 'translation_XX_to_YY']"

Can you let us know which transformers and torch versions works with llava-hf/llava-v1.6-34b-hf ?

there's some commit from the past that worked, but since then, it has been broken/regressed.

nktsanyal changed discussion status to closed
Llava Hugging Face org

Hi,

The "image-text-to-text" pipeline PR is not merged yet: https://github.com/huggingface/transformers/pull/29572

Sign up or log in to comment