Fine-tuning with LoRA?

#25
by PatchouliPatch - opened

Heya!

I'm a newbie here and I was wondering if there is a way to fine-tune this model on our own. I know how to fine-tune LLM-only models, but not multimodal models like this.

Hi @PatchouliPatch
Glad you are getting into multimodal models!
Did you see the resources linked at the end of this section: https://huggingface.co/HuggingFaceM4/idefics2-8b#uses?
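
In case it helps anyone coming from text-only fine-tuning: the LoRA idea itself is not specific to LLMs. A frozen weight matrix `W` gets a trainable low-rank update, so the effective weight is `W + (alpha / r) * B @ A`. Below is a toy, pure-Python sketch of just that arithmetic. It is not Idefics2 code and not taken from the linked resources; the function names, shapes, and the `alpha`/`r` values are made up for illustration.

```python
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def lora_effective_weight(w, a, b, alpha):
    """Return W + (alpha / r) * B @ A, where r is the number of rows of A."""
    r = len(a)
    scale = alpha / r
    delta = matmul(b, a)  # (d_out x r) @ (r x d_in) -> (d_out x d_in)
    return [
        [w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
        for w_row, d_row in zip(w, delta)
    ]


def lora_param_counts(d_out, d_in, r):
    """Return (parameters in the full matrix, trainable LoRA parameters)."""
    return d_out * d_in, r * (d_out + d_in)


# Hypothetical toy sizes, chosen only to keep the example small.
d_out, d_in, r, alpha = 4, 6, 2, 4
rng = random.Random(0)
w = [[rng.gauss(0, 1) for _ in range(d_in)] for _ in range(d_out)]     # frozen weight
a = [[rng.gauss(0, 0.01) for _ in range(d_in)] for _ in range(r)]      # trainable, small random init
b = [[0.0] * r for _ in range(d_out)]                                  # trainable, zero init

# With B initialised to zero, the adapter starts as a no-op.
w_eff = lora_effective_weight(w, a, b, alpha)

# Parameter counts for an illustrative 4096 x 4096 layer at rank 8.
full, lora = lora_param_counts(4096, 4096, 8)
```

For the illustrative 4096 x 4096 layer, the rank-8 adapter trains `8 * (4096 + 4096)` values instead of `4096 * 4096`. Which layers of a multimodal model to attach adapters to is a separate choice that this sketch does not cover; the resources linked above are the place to look for that.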

ah, my bad, I didn't see it. Thanks!

PatchouliPatch changed discussion status to closed
