Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
johannhartmann
's Collections
GUI Intelligence
Document & UI Intelligence
Multimodal Models
Medical MultiModal
Multimodal Models
updated
2 days ago
A collection of multimodal models for the gpu poor
Upvote
2
google/paligemma-3b-pt-896
Image-Text-to-Text
•
Updated
Jul 19, 2024
•
4.23k
•
116
OpenGVLab/InternVL-Chat-V1-5
Image-Text-to-Text
•
Updated
Dec 18, 2024
•
2.94k
•
407
alexshengzhili/llava-v1.5-13b-dpo
Text Generation
•
Updated
Apr 13, 2024
•
6
•
5
llava-hf/llava-v1.6-mistral-7b-hf
Image-Text-to-Text
•
Updated
3 days ago
•
401k
•
249
Qwen/Qwen-VL
Text Generation
•
Updated
Jan 25, 2024
•
36.8k
•
224
THUDM/cogvlm2-llama3-chat-19B
Text Generation
•
Updated
Sep 3, 2024
•
7.14k
•
208
BK-Lee/MoAI-7B
Image-Text-to-Text
•
Updated
Oct 2, 2024
•
331
•
45
01-ai/Yi-VL-34B
Image-Text-to-Text
•
Updated
Jun 26, 2024
•
106
•
262
mPLUG/DocOwl1.5-Omni
Updated
Apr 10, 2024
•
46
•
16
google/paligemma-3b-ft-docvqa-896
Image-Text-to-Text
•
Updated
Jul 19, 2024
•
320
•
8
Lin-Chen/open-llava-next-llama3-8b
Image-Text-to-Text
•
Updated
May 27, 2024
•
131
•
26
Mizukiluke/mplug_owl_2_1
Updated
Jan 31, 2024
•
52
•
11
HuanjinYao/DenseConnector-v1.5-8B
Image-to-Text
•
Updated
May 26, 2024
•
17
•
7
microsoft/Phi-3-vision-128k-instruct
Text Generation
•
Updated
Aug 20, 2024
•
143k
•
943
tiiuae/falcon-11B-vlm
Image-Text-to-Text
•
Updated
Jun 12, 2024
•
864
•
46
AIDC-AI/Ovis1.5-Llama3-8B
Image-Text-to-Text
•
Updated
Aug 2, 2024
•
713
•
25
HuggingFaceM4/Idefics3-8B-Llama3
Image-Text-to-Text
•
Updated
Dec 2, 2024
•
38.2k
•
264
openbmb/MiniCPM-V-2_6
Image-Text-to-Text
•
Updated
15 days ago
•
91.4k
•
921
microsoft/Florence-2-large
Image-Text-to-Text
•
Updated
Dec 8, 2024
•
421k
•
1.37k
allenai/Molmo-7B-D-0924
Image-Text-to-Text
•
Updated
Oct 10, 2024
•
594k
•
500
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text
•
Updated
Dec 4, 2024
•
2.59M
•
1.27k
BAAI/Emu3-Gen
Any-to-Any
•
Updated
Oct 23, 2024
•
2.48k
•
205
vidore/colpali-v1.2
Image Feature Extraction
•
Updated
18 days ago
•
83k
•
105
Qwen/Qwen2-VL-2B-Instruct
Image-Text-to-Text
•
Updated
19 days ago
•
944k
•
385
deepseek-ai/Janus-1.3B
Any-to-Any
•
Updated
3 days ago
•
32.8k
•
561
NexaAIDev/OmniVLM-968M
Updated
Dec 17, 2024
•
1.55k
•
499
Xkev/Llama-3.2V-11B-cot
Image-Text-to-Text
•
Updated
Dec 16, 2024
•
6.97k
•
141
alibaba-damo/mgp-str-base
Image-to-Text
•
Updated
Dec 11, 2023
•
4.24k
•
63
omkarthawakar/LlamaV-o1
Question Answering
•
Updated
17 days ago
•
8.33k
•
85
openbmb/MiniCPM-o-2_6
Any-to-Any
•
Updated
4 days ago
•
169k
•
872
deepseek-ai/Janus-Pro-7B
Any-to-Any
•
Updated
3 days ago
•
79.3k
•
2.04k
Upvote
2
Share collection
View history
Collection guide
Browse collections