Image-Text-to-Text
Transformers
Safetensors
English
idefics2
pretraining
multimodal
vision
Inference Endpoints
Reverb's picture
Upload 8 files
4444407 verified
File too large to display, you can check the raw version instead.