Collection of image captioning models
Niels Rogge
nielsr
AI & ML interests
Mainly interested in diving into complex Github repos and making AI easier and more accessible to everyone
Blog posts
Organizations
Collections
5
SigLIP improves upon CLIP with a sigmoid loss. Both English-only and multilingual checkpoints are released.
-
Sigmoid Loss for Language Image Pre-Training
Paper • 2303.15343 • Published • 3 -
google/siglip-base-patch16-224
Zero-Shot Image Classification • Updated • 27.8k • 7 -
google/siglip-base-patch16-256
Zero-Shot Image Classification • Updated • 75 -
google/siglip-base-patch16-384
Zero-Shot Image Classification • Updated • 334 • 4
spaces
20
models
162
nielsr/vit-large-patch16-v-jepa
Updated
•
1
•
1
nielsr/imagebind-huge
Updated
•
6
•
2
nielsr/gemma-2b-it
Updated
nielsr/DUSt3R_ViTLarge_BaseDecoder_512_dpt
Updated
•
14
•
1
nielsr/udop-test
Text2Text Generation
•
Updated
•
153
nielsr/RMBG-1.4
Updated
nielsr/cogvlm-tiny-random
Text Generation
•
Updated
•
23
nielsr/crossmae-small-patch16
Updated
•
2
nielsr/efficientsam-tiny
Updated
•
5
nielsr/mobilesam
Mask Generation
•
Updated
•
8
•
2
datasets
75
nielsr/test-image
Viewer
•
Updated
nielsr/test-cogvlm
Updated
nielsr/ml6-website-rag
Viewer
•
Updated
•
22
nielsr/breast-cancer
Viewer
•
Updated
•
1.31k
•
7
nielsr/test-files
Updated
nielsr/example-pdf
Viewer
•
Updated
•
1
nielsr/test-maskrcnn
Updated
nielsr/dinov2-test-batch
Updated
nielsr/test-data-nougat
Updated
nielsr/datacomp-small-with-text-embeddings
Viewer
•
Updated