Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
1
Libraries
Datasets
Languages
Licenses
Other
Reset Tasks
Multimodal
Audio-Text-to-Text
Image-Text-to-Text
Visual Question Answering
Document Question Answering
Video-Text-to-Text
Visual Document Retrieval
Any-to-Any
Computer Vision
Depth Estimation
Image Classification
Object Detection
Image Segmentation
Text-to-Image
Image-to-Text
Image-to-Image
Image-to-Video
Unconditional Image Generation
Video Classification
Text-to-Video
Zero-Shot Image Classification
Mask Generation
Zero-Shot Object Detection
Text-to-3D
Image-to-3D
Image Feature Extraction
Keypoint Detection
Natural Language Processing
Text Classification
Token Classification
Table Question Answering
Question Answering
Zero-Shot Classification
Translation
Summarization
Feature Extraction
Text Generation
Text2Text Generation
Fill-Mask
Sentence Similarity
Text Ranking
Audio
Text-to-Speech
Text-to-Audio
Automatic Speech Recognition
Audio-to-Audio
Audio Classification
Voice Activity Detection
Tabular
Tabular Classification
Tabular Regression
Time Series Forecasting
Reinforcement Learning
Reinforcement Learning
Robotics
Other
Graph Machine Learning
Apply filters
Models
9,502
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
bartowski/Tesslate_Synthia-S1-27b-GGUF
Image-Text-to-Text
•
Updated
Apr 3
•
1.84k
•
11
baichuan-inc/BaichuanMed-OCR-7B
Image-Text-to-Text
•
Updated
Apr 7
•
204
•
6
OpenGVLab/InternVL3-38B
Image-Text-to-Text
•
Updated
26 days ago
•
92.7k
•
29
OpenGVLab/InternVL3-1B
Image-Text-to-Text
•
Updated
26 days ago
•
54.7k
•
60
TIGER-Lab/VL-Rethinker-7B
Image-Text-to-Text
•
Updated
16 days ago
•
21.7k
•
12
OpenGVLab/InternVL3-14B-Instruct
Image-Text-to-Text
•
Updated
26 days ago
•
1.95k
•
8
OpenGVLab/InternVL3-78B-AWQ
Image-Text-to-Text
•
Updated
Apr 17
•
1.18k
•
4
lmstudio-community/gemma-3-12B-it-qat-GGUF
Image-Text-to-Text
•
Updated
Apr 18
•
56.5k
•
8
Skywork/Skywork-R1V2-38B
Image-Text-to-Text
•
Updated
24 days ago
•
31.3k
•
119
meta-llama/Llama-Guard-4-12B
Image-Text-to-Text
•
Updated
22 days ago
•
14.5k
•
30
leon-se/gemma-3-27b-it-qat-W4A16-G128
Image-Text-to-Text
•
Updated
23 days ago
•
1.74k
•
9
ggml-org/Mistral-Small-3.1-24B-Instruct-2503-GGUF
Image-Text-to-Text
•
Updated
20 days ago
•
643
•
4
unsloth/Qwen2.5-VL-3B-Instruct-GGUF
Image-Text-to-Text
•
Updated
9 days ago
•
4.58k
•
4
unsloth/Qwen2.5-VL-7B-Instruct-GGUF
Image-Text-to-Text
•
Updated
9 days ago
•
7.24k
•
5
turing-motors/Heron-NVILA-Lite-33B
Image-Text-to-Text
•
Updated
9 days ago
•
76
•
4
tonyli8623/Himedical-R1-Sft-Gemma-27b-Q8.GGUF
Image-Text-to-Text
•
Updated
5 days ago
•
38
•
2
shreydan/SmolVLM-256M-Detection
Image-Text-to-Text
•
Updated
4 days ago
•
16
•
2
John6666/llama-joycaption-beta-one-hf-llava-nf4
Image-Text-to-Text
•
Updated
4 days ago
•
23
•
2
moondream/moondream-2b-2025-04-14-4bit
Image-Text-to-Text
•
Updated
43 minutes ago
•
2
unsloth/medgemma-4b-it-GGUF
Image-Text-to-Text
•
Updated
about 13 hours ago
•
2
Salesforce/blip2-opt-6.7b
Image-Text-to-Text
•
Updated
Feb 3
•
6.75k
•
77
Salesforce/blip2-opt-6.7b-coco
Image-Text-to-Text
•
Updated
Feb 3
•
112k
•
34
dragonstar/image-text-captcha-v2
Image-Text-to-Text
•
Updated
Sep 11, 2023
•
145
•
4
remyxai/SpaceLLaVA
Image-Text-to-Text
•
Updated
about 1 month ago
•
582
•
24
llava-hf/llava-v1.6-34b-hf
Image-Text-to-Text
•
Updated
Jan 27
•
2.15k
•
82
llava-hf/llava-v1.6-vicuna-13b-hf
Image-Text-to-Text
•
Updated
Jan 27
•
9.59k
•
19
HuggingFaceM4/idefics2-8b
Image-Text-to-Text
•
Updated
Oct 14, 2024
•
17.5k
•
603
OpenGVLab/InternVL-Chat-V1-5
Image-Text-to-Text
•
Updated
Mar 25
•
2.51k
•
412
RaincloudAi/llava-llama-3-8b-v1_1-Q4_K_M-GGUF
Image-Text-to-Text
•
Updated
Apr 22, 2024
•
45
•
1
xtuner/llava-phi-3-mini
Image-Text-to-Text
•
Updated
Apr 25, 2024
•
85
•
26
Previous
1
...
3
4
5
6
7
...
100
Next