4bit
/

Image-to-Text
Transformers
Safetensors
English
vlm
text-generation
image-captioning
visual-question-answering
Inference Endpoints