Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
1
Libraries
Datasets
Languages
Licenses
Other
Reset Tasks
Multimodal
Image-Text-to-Text
Visual Question Answering
Document Question Answering
Computer Vision
Depth Estimation
Image Classification
Object Detection
Image Segmentation
Text-to-Image
Image-to-Text
Image-to-Image
Image-to-Video
Unconditional Image Generation
Video Classification
Text-to-Video
Zero-Shot Image Classification
Mask Generation
Zero-Shot Object Detection
Text-to-3D
Image-to-3D
Image Feature Extraction
Natural Language Processing
Text Classification
Token Classification
Table Question Answering
Question Answering
Zero-Shot Classification
Translation
Summarization
Feature Extraction
Text Generation
Text2Text Generation
Fill-Mask
Sentence Similarity
Audio
Text-to-Speech
Text-to-Audio
Automatic Speech Recognition
Audio-to-Audio
Audio Classification
Voice Activity Detection
Tabular
Tabular Classification
Tabular Regression
Time Series Forecasting
Reinforcement Learning
Reinforcement Learning
Robotics
Other
Graph Machine Learning
Apply filters
Models
285
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
google/paligemma-3b-ft-rsvqa-hr-224
Image-Text-to-Text
•
Updated
9 days ago
•
43
•
1
google/paligemma-3b-ft-refcoco-seg-896
Image-Text-to-Text
•
Updated
9 days ago
•
115
•
1
google/paligemma-3b-ft-okvqa-224
Image-Text-to-Text
•
Updated
9 days ago
•
4
•
1
google/paligemma-3b-ft-docvqa-448
Image-Text-to-Text
•
Updated
9 days ago
•
168
•
1
google/paligemma-3b-ft-docvqa-896
Image-Text-to-Text
•
Updated
9 days ago
•
2.37k
•
2
google/paligemma-3b-ft-cococap-224
Image-Text-to-Text
•
Updated
9 days ago
•
16
•
1
google/paligemma-3b-ft-textvqa-448
Image-Text-to-Text
•
Updated
9 days ago
•
1
google/paligemma-3b-ft-cococap-448
Image-Text-to-Text
•
Updated
9 days ago
•
632
•
1
google/paligemma-3b-ft-coco35l-224
Image-Text-to-Text
•
Updated
9 days ago
•
32
•
1
google/paligemma-3b-ft-science-qa-448
Image-Text-to-Text
•
Updated
9 days ago
•
6
•
1
google/paligemma-3b-ft-textvqa-896
Image-Text-to-Text
•
Updated
9 days ago
•
7
•
1
microsoft/llava-med-v1.5-mistral-7b
Image-Text-to-Text
•
Updated
9 days ago
•
11
•
1
tinyllava/TinyLLaVA-Phi-2-SigLIP-3.1B
Image-Text-to-Text
•
Updated
5 days ago
•
166
•
1
gokaygokay/paligemma-docci-transformers
Image-Text-to-Text
•
Updated
7 days ago
•
114
•
1
leo009/paligemma-3b-mix-224
Image-Text-to-Text
•
Updated
6 days ago
•
136
•
1
rulins/blip2-t5-llava
Image-Text-to-Text
•
Updated
Apr 21
•
1
s3nh/llava-llama-2-13b-chat-lightning-preview-GGML
Image-Text-to-Text
•
Updated
Mar 6
s3nh/Chinese-LLaVA-Baichuan-GGML
Image-Text-to-Text
•
Updated
Mar 6
Lorim/The_WonderMix
Image-Text-to-Text
•
Updated
Mar 10
•
83
liuhaotian/llava-v1.5-13b-lora
Image-Text-to-Text
•
Updated
14 days ago
•
325
•
22
kuyesu22/ll-avatar
Image-Text-to-Text
•
Updated
Mar 6
Frorozcol/LLaVa-instruction-trasaleted
Image-Text-to-Text
•
Updated
Mar 10
Nagase-Kotono/LLaVA_X_KoLlama2-7B-pretrain-0.2v
Image-Text-to-Text
•
Updated
Mar 7
leonardPKU/llava1.5_data
Image-Text-to-Text
•
Updated
Mar 22
Lin-Chen/ShareGPT4V-7B
Image-Text-to-Text
•
Updated
Mar 27
•
2.74k
•
73
MaoXun/llava-lora-7-20-10-5-vicuna-7b-v1.3
Image-Text-to-Text
•
Updated
Mar 6
•
1
PsiPi/liuhaotian_llava-v1.5-13b-GGUF
Image-Text-to-Text
•
Updated
Mar 11
•
1.94k
•
31
ybelkada/test-llava-13b
Image-Text-to-Text
•
Updated
Apr 10
PsiPi/NousResearch_Nous-Hermes-2-Vision-GGUF
Image-Text-to-Text
•
Updated
Mar 11
•
2.29k
•
12
y10ab1/ggml_llava-v1.5-7b
Image-Text-to-Text
•
Updated
Mar 12
•
257
•
2
Previous
1
2
3
4
5
...
10
Next