Is the architecture for this model different from the 7b variant?

#7
by tsriharsha - opened

Should the architecture for this model be the same as the 7b in the config.json?

LlavaForConditionalGeneration?

Llava Hugging Face org

Hi @tsriharsha
yes they are the same arch, it should be already the case for this model I think no?

In the 7b model, the architecture is specified as LlavaForConditionalGeneration in the config.json; but in the 13b model, the architecture is LlavaForCausalLM.

Any update on this?

Llava Hugging Face org

Yes that's an artifact, during the addition of LLaVa to the Transformers library the model was first pushed to the hub using LlavaForCausalLM but was then renamed to LlavaForConditionalGeneration. Will update the config. Thanks for reporting

Sign up or log in to comment