Text-to-Image Base Models Collection All text-to-image open source base models, with their respective license β’ 28 items β’ Updated May 10, 2024 β’ 23
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. β’ 32 items β’ Updated 3 days ago β’ 146
AIMv2 Collection A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. β’ 19 items β’ Updated Nov 22, 2024 β’ 74
Vision Language Models Papers πΌοΈπ¬π Collection Papers about vision-language models, most important ones are on top of the list. β’ 27 items β’ Updated Apr 30, 2024 β’ 36
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. β’ 26 items β’ Updated Jan 8 β’ 566
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases β’ 5 items β’ Updated Dec 6, 2024 β’ 735
Gemma release Collection Groups the Gemma models released by the Google team. β’ 40 items β’ Updated 3 days ago β’ 331
Top 10% instruction tuning datasets Collection Collects datasets with 'instruction' in the name and more than 1 download and in the top 10% for the number of likes β’ 13 items β’ Updated Jul 3, 2024 β’ 8