Image-Text-to-Text
Transformers
Safetensors
English
idefics2
pretraining
multimodal
vision
quantized
4-bit precision
AWQ
Inference Endpoints