Image-Text-to-Text
Transformers
Safetensors
English
idefics2
pretraining
multimodal
vision
Inference Endpoints
Reverb's picture
Update README.md
bfb53d5 verified