PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma โข 16 items โข Updated Jul 31 โข 137
SigLIP Collection Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 โข 8 items โข Updated Jul 31 โข 34
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Paper โข 2311.06242 โข Published Nov 10, 2023 โข 79
Molmo Collection Artifacts for open multimodal language models. โข 5 items โข Updated 23 days ago โข 251
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 โข 11 items โข Updated 24 days ago โข 383
view article Article Unleash ML Power on iOS: Apple Silicon Optimization Secrets By fguzman82 โข Jul 18 โข 4
Qwen2-VL Collection Vision-language model series based on Qwen2 โข 15 items โข Updated Sep 18 โข 139
Building and better understanding vision-language models: insights and future directions Paper โข 2408.12637 โข Published Aug 22 โข 115
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models โข 11 items โข Updated 24 days ago โข 594
view article Article BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*โก By xhluca โข Jul 9 โข 35
view article Article ColPali: Efficient Document Retrieval with Vision Language Models ๐ By manu โข Jul 5 โข 136
view article Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models Jun 24 โข 176
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. โข 27 items โข Updated about 1 month ago โข 477
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 โข 163
Zeroshot Classifiers Collection These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. โข 11 items โข Updated Apr 3 โข 109
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration Paper โข 2311.04257 โข Published Nov 7, 2023 โข 20