Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
johannhartmann
's Collections
Multimodal Models
Medical MultiModal
Multimodal Models
updated
about 7 hours ago
A collection of multimodal models for the gpu poor
Upvote
2
google/paligemma-3b-pt-896
Image-Text-to-Text
•
Updated
Jul 19
•
99.5k
•
107
OpenGVLab/InternVL-Chat-V1-5
Image-Text-to-Text
•
Updated
8 days ago
•
10.2k
•
398
alexshengzhili/llava-v1.5-13b-dpo
Text Generation
•
Updated
Apr 13
•
6
•
5
llava-hf/llava-v1.6-mistral-7b-hf
Image-Text-to-Text
•
Updated
13 days ago
•
890k
•
224
Qwen/Qwen-VL
Text Generation
•
Updated
Jan 25
•
54.5k
•
208
THUDM/cogvlm2-llama3-chat-19B
Text Generation
•
Updated
28 days ago
•
45k
•
197
BK-Lee/MoAI-7B
Updated
Mar 12
•
714
•
45
01-ai/Yi-VL-34B
Image-Text-to-Text
•
Updated
Jun 26
•
121
•
260
mPLUG/DocOwl1.5-Omni
Updated
Apr 10
•
140
•
16
google/paligemma-3b-ft-docvqa-896
Image-Text-to-Text
•
Updated
Jul 19
•
1.48k
•
5
Lin-Chen/open-llava-next-llama3-8b
Image-Text-to-Text
•
Updated
May 27
•
1.3k
•
25
Mizukiluke/mplug_owl_2_1
Updated
Jan 31
•
23
•
11
HuanjinYao/DenseConnector-v1.5-8B
Image-to-Text
•
Updated
May 26
•
67
•
7
microsoft/Phi-3-vision-128k-instruct
Text Generation
•
Updated
Aug 20
•
112k
•
893
tiiuae/falcon-11B-vlm
Updated
Jun 12
•
2.56k
•
45
AIDC-AI/Ovis1.5-Llama3-8B
Image-Text-to-Text
•
Updated
Aug 2
•
651
•
23
HuggingFaceM4/Idefics3-8B-Llama3
Image-Text-to-Text
•
Updated
13 days ago
•
36.6k
•
223
openbmb/MiniCPM-V-2_6
Image-Text-to-Text
•
Updated
Aug 22
•
406k
•
749
microsoft/Florence-2-large
Image-Text-to-Text
•
Updated
Aug 21
•
412k
•
1.13k
allenai/Molmo-7B-D-0924
Image-Text-to-Text
•
Updated
1 day ago
•
11.5k
•
248
meta-llama/Llama-3.2-11B-Vision-Instruct
Image-Text-to-Text
•
Updated
2 days ago
•
109k
•
•
438
BAAI/Emu3-Gen
Text Generation
•
Updated
2 days ago
•
934
•
91
vidore/colpali-v1.2
Updated
5 days ago
•
48.2k
•
37
Qwen/Qwen2-VL-2B-Instruct
Image-Text-to-Text
•
Updated
11 days ago
•
179k
•
194
Upvote
2
Share collection
View history
Collection guide
Browse collections