Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Salesforce
/
blip2-flan-t5-xl
like
56
Image-to-Text
Transformers
PyTorch
Safetensors
English
blip-2
visual-question-answering
vision
image-captioning
arxiv:
2301.12597
arxiv:
2210.11416
License:
mit
Model card
Files
Files and versions
Community
4
Train
Deploy
Use this model
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (2)
Checkpoints just with ViT-g Dimension (1408) for the Q-former (cross-att)?
3
#3 opened 10 months ago by
Daromog
Inference Error: Expected all tensors to be on the same device, but found at least two devices, cuda:7 and cuda:2!
#2 opened 11 months ago by
LDY