HuggingFaceM4/idefics2-8b · shape mismatch: value tensor of shape [2320] cannot be broadcast to indexing result of shape [2262]

May 16, 2024

I am trying the very first example given in the documentation, yet I'm kind of new to deploying LLMs and stuff.

processor = AutoProcessor.from_pretrained("HuggingFaceM4/idefics2-8b")
model = AutoModelForVision2Seq.from_pretrained(
    "HuggingFaceM4/idefics2-8b",
    # torch_dtype=torch.float16,
     # _attn_implementation="flash_attention_2",
    # quantization_config=quantization_config,
)

prompt = processor.apply_chat_template(messages, add_generation_prompt=True)
inputs = processor(text=prompt, images=[image1, image2], return_tensors="pt")

generated_ids = model.generate(**inputs, max_new_tokens=1500)
generated_texts = processor.batch_decode(generated_ids, skip_special_tokens=True)

And getting the following error
shape mismatch: value tensor of shape [2320] cannot be broadcast to indexing result of shape [2262]

VictorSanh

May 17, 2024

hi @yeargun
do you happen to have a full traceback?

sipie800

May 22, 2024

same

VictorSanh

May 22, 2024

same question @sipie800 , can you share a full traceback?

sipie800

May 24, 2024

by me the problem has been fixed somehow, but I can't recall it. It might be the transformers version issue.

chenduo

Jun 7, 2024

same issue, anyone can help me to solve this problem?😭

mdmev

Jun 8, 2024

I have a similar issue; I posted it on the chatty version. In case it helps, issue