Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
IDEA-CCNL
/
Ziya-Visual-Lyrics-14B
like
3
Text2Text Generation
Transformers
PyTorch
English
blip-2
visual question answering
image captioning
visual-centric dialogue
Inference Endpoints
arxiv:
2312.05278
License:
gpl-3.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Ziya-Visual-Lyrics-14B
/
assets
2 contributors
History:
1 commit
LinhIcey
Upload 8 files
9958185
5 months ago
MQ-Former.png
181 kB
Upload 8 files
5 months ago
case.png
392 kB
Upload 8 files
5 months ago
graph.png
195 kB
Upload 8 files
5 months ago
image_caption_vqa.jpg
169 kB
Upload 8 files
5 months ago
leaderboard.png
145 kB
Upload 8 files
5 months ago
rec.png
163 kB
Upload 8 files
5 months ago
text_orient_vqa.png
45.2 kB
Upload 8 files
5 months ago
two_stage_training.png
164 kB
Upload 8 files
5 months ago