Image-Text-to-Text
Transformers
Safetensors
vision-encoder-decoder
Inference Endpoints