carlizor's Collections
Image Vision
Updated 8 days ago
Salesforce/xgen-mm-phi3-mini-instruct-r-v1 • Image-Text-to-Text • Updated Feb 3 • 1.46k • 184
AIDC-AI/Ovis1.6-Gemma2-9B • Image-Text-to-Text • Updated 9 days ago • 7.11k • 269
nvidia/NVLM-D-72B • Image-Text-to-Text • Updated Jan 14 • 21.3k • 764
microsoft/OmniParser • Image-Text-to-Text • Updated Dec 2, 2024 • 2.57k • 1.63k
deepseek-ai/Janus-1.3B • Any-to-Any • Updated Jan 27 • 151k • 579
deepseek-ai/JanusFlow-1.3B • Any-to-Any • Updated Jan 27 • 4.83k • 143
NexaAIDev/OmniVLM-968M • Updated Dec 17, 2024 • 1.57k • 511
vikhyatk/moondream2 • Image-Text-to-Text • Updated Jan 9 • 131k • 1.06k
stepfun-ai/GOT-OCR2_0 • Image-Text-to-Text • Updated Feb 4 • 78.6k • 1.41k
jiuhai/florence-vl-8b-sft • Updated Dec 3, 2024 • 54 • 19
AI-Safeguard/Ivy-VL-llava • Visual Question Answering • Updated Dec 31, 2024 • 333 • 62
OpenGVLab/InternVL2_5-78B • Image-Text-to-Text • Updated about 1 month ago • 4.74k • 179
Qwen/QVQ-72B-Preview • Image-Text-to-Text • Updated Jan 12 • 185k • 561
deepseek-ai/deepseek-vl2 • Image-Text-to-Text • Updated Dec 18, 2024 • 21.3k • 297
allenai/Molmo-7B-D-0924 • Image-Text-to-Text • Updated Oct 10, 2024 • 94.4k • 513
prithivMLmods/Qwen2-VL-OCR-2B-Instruct • Image-Text-to-Text • Updated Jan 11 • 30.6k • 61
ByteDance/Sa2VA-1B • Image-Text-to-Text • Updated Jan 20 • 1.67k • 20
HuggingFaceTB/SmolVLM-500M-Instruct • Image-Text-to-Text • Updated about 24 hours ago • 26.9k • 110
Qwen/Qwen2.5-VL-72B-Instruct • Image-Text-to-Text • Updated about 3 hours ago • 272k • 357
Qwen/Qwen2.5-VL-7B-Instruct • Image-Text-to-Text • Updated 1 day ago • 3.52M • 622
OpenGVLab/InternVideo2_5_Chat_8B • Video-Text-to-Text • Updated 17 days ago • 14.8k • 44
nvidia/Eagle2-9B • Image-Text-to-Text • Updated Jan 28 • 1.77k • 44
stepfun-ai/GOT-OCR-2.0-hf • Image-Text-to-Text • Updated Jan 31 • 186k • 170
allenai/olmOCR-7B-0225-preview • Image-Text-to-Text • Updated 11 days ago • 123k • 467