--- license: mit language: - en library_name: transformers inference: False --- ## Sharded BLIP-2 Model Card This is a sharded version of the [BLIP-2 Model Card](https://huggingface.co/models/Salesforce/blip2-flan-t5-xl) which leverages [Flan T5-xl](https://huggingface.co/google/flan-t5-xl) for image-to-text tasks such as image captioning and visual question answering. Refer to the [original model card](https://huggingface.co/models/Salesforce/blip2-flan-t5-xl) for more details about the model description, intended uses, and limitations, as well as instructions for how to use the model on CPU and GPU in different precisions.