Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
ceyda
's Collections
Korean Models
Useful Tools
vid-gen
Clips
VQA (Image captioning,QA)
Color
Nice~
Fashion
Cool names
VQA (Image captioning,QA)
updated
7 days ago
Upvote
-
Running
32
📊
FuseCap
Running
on
T4
388
💻
Kosmos 2
Running
5
🚀
Vilt Nlvr
Sleeping
125
⚡
Qwen VL
Running
on
T4
338
🔥
LLaVA
Running
on
A10G
308
👁
Fuyu Multimodal
Running
on
A10G
17
📚
Chat-UniVi
Sleeping
159
🚀
MoE LLaVA
Running
on
Zero
147
🐨
IDEFICS2 Playground
Running
on
Zero
82
🐐
CuMo 7b Zero
Running
on
Zero
234
🐬
Chat with DeepSeek VL 7B
What matters when building vision-language models?
Paper
•
2405.02246
•
Published
29 days ago
•
87
Running
on
Zero
268
🌔
moondream2
a tiny vision language model
Upvote
-
Share collection
View history
Collection guide
Browse collections