Trouble disabling multi-modality

#10
by mohamedelyazidferhat - opened

Hi,
I am having trouble serving the model with vLLM for text-only workloads. I tried adding the parameter "--limit-mm-per-prompt image=0" to my vllm serve command, as suggested in the model's card, and I also tried "--limit-mm-per-prompt '{"image": 0, "audio": 0}'" from the vLLM Gemma 4 user guide, but neither seems to disable multi-modality. Any suggestions?
Best regards,

Red Hat AI org

Hi @mohamedelyazidferhat, I'm not sure what you mean. If there's no image or audio input, the model won't use those pathways, right? I believe there's a vllm serve --language-model-only flag which will prevent initialization of those components, if that's what you're asking.
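
For anyone landing here, a rough sketch of the two invocations discussed in this thread. The model ID is a hypothetical placeholder, and the second flag is taken from the reply above as-is rather than verified, so it is worth confirming with vllm serve --help on your installed version.

```shell
# Hypothetical placeholder: substitute the model ID from the model card.
MODEL="your-org/your-model"

# These are alternatives, not a sequence. vllm serve runs in the foreground,
# so run only one of them.

# Alternative 1: JSON form of the per-prompt limits, as quoted in the question.
# The single quotes keep the shell from splitting or interpreting the JSON.
vllm serve "$MODEL" --limit-mm-per-prompt '{"image": 0, "audio": 0}'

# Alternative 2: the flag mentioned in the reply (assumed to exist in your
# vLLM version; not confirmed in this thread).
vllm serve "$MODEL" --language-model-only
```

Whether the multimodal components were actually skipped should be visible in the server's startup logs and memory usage, which is the only reliable way to tell the two options apart.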
