Image-to-text models
Collection of image captioning models
Salesforce/blip-image-captioning-large • Image-to-Text • Updated Dec 7, 2023 • 2.24M • 1.18k
microsoft/git-large-coco • Image-to-Text • Updated Jun 26, 2023 • 9.7k • 98
Salesforce/instructblip-vicuna-7b • Image-Text-to-Text • Updated 7 days ago • 271k • 85
Salesforce/blip2-flan-t5-xxl • Image-Text-to-Text • Updated 7 days ago • 15.9k • 85
SigLIP release
SigLIP improves upon CLIP with a sigmoid loss. Both English-only and multilingual checkpoints are released.
Sigmoid Loss for Language Image Pre-Training • Paper • 2303.15343 • Published Mar 27, 2023 • 4
google/siglip-base-patch16-224 • Zero-Shot Image Classification • Updated Sep 26 • 266k • 25
google/siglip-base-patch16-256 • Zero-Shot Image Classification • Updated Sep 26 • 37.4k • 3
google/siglip-base-patch16-384 • Zero-Shot Image Classification • Updated Sep 26 • 10.4k • 9
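For readers unfamiliar with the sigmoid loss mentioned in the SigLIP description, here is a minimal pure-Python sketch of the pairwise objective as described in the paper (2303.15343). It is an illustration, not the released training code: the function names (`log_sigmoid`, `l2_normalize`, `sigmoid_pairwise_loss`) are hypothetical, and the default temperature and bias are assumed to match the initial values reported in the paper. In actual training both are learnable.

```python
import math


def log_sigmoid(x):
    """Numerically stable log(sigmoid(x))."""
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))


def l2_normalize(vec):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(c * c for c in vec))
    return [c / norm for c in vec]


def sigmoid_pairwise_loss(image_embs, text_embs, temperature=10.0, bias=-10.0):
    """Sigmoid loss over every image-text pair in a batch.

    Each pair (i, j) is treated as an independent binary classification:
    label +1 when i == j (a matching pair), -1 otherwise. Unlike the
    softmax-based CLIP loss, no normalization across the batch is needed.
    temperature and bias are fixed here for illustration (assumption).
    """
    images = [l2_normalize(v) for v in image_embs]
    texts = [l2_normalize(v) for v in text_embs]
    n = len(images)
    total = 0.0
    for i in range(n):
        for j in range(n):
            similarity = sum(a * b for a, b in zip(images[i], texts[j]))
            logit = temperature * similarity + bias
            label = 1.0 if i == j else -1.0
            total += log_sigmoid(label * logit)
    # Negative log-likelihood, averaged over the batch size.
    return -total / n


# Toy 2-D embeddings: correctly paired texts versus swapped texts.
images = [[1.0, 0.0], [0.0, 1.0]]
aligned = sigmoid_pairwise_loss(images, [[1.0, 0.0], [0.0, 1.0]])
swapped = sigmoid_pairwise_loss(images, [[0.0, 1.0], [1.0, 0.0]])
print(aligned < swapped)
```

The toy batch at the end only checks the expected ordering: correctly paired embeddings should give a lower loss than swapped ones.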
nielsr/gemini-results-paper-central-data-emnlp2024-sorted-by-github-stars-no-artifacts Viewer • Updated 3 days ago • 100 • 26
nielsr/paper-central-data-emnlp2024-sorted-by-github-stars-no-artifacts Viewer • Updated 10 days ago • 1.68k • 29