PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma β’ 10 items β’ Updated 2 days ago β’ 80
view article Article PaliGemma β Google's Cutting-Edge Open Vision Language Model 3 days ago β’ 84
OLMo Suite Collection Artifacts for the first set of OLMo models. β’ 12 items β’ Updated 1 day ago β’ 35
Mantis Collection Mantis model family optimized for multi-image reasoning with interleaved text/image format β’ 9 items β’ Updated 10 days ago β’ 7
Granite Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. β’ 10 items β’ Updated 5 days ago β’ 116
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases β’ 5 items β’ Updated 29 days ago β’ 516
Idefics2 πΆ Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. β’ 11 items β’ Updated 11 days ago β’ 75
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 β’ 125
[lecture artifacts] aligning open language models Collection artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin β’ 63 items β’ Updated 29 days ago β’ 41
Leaderboards and benchmarks β¨ Collection Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... β’ 61 items β’ Updated 3 days ago β’ 59
Aurora-M models Collection Aurora-M models (base, biden-harris redteams and instruct) β’ 5 items β’ Updated 11 days ago β’ 15
LLM Leaderboard best models β€οΈβπ₯ Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: β’ 70 items β’ Updated about 11 hours ago β’ 303
Capybara Collection Un-aligned model for general use, leveraging Amplify-Instruct and novel quality curation techniques, made with a dataset of less than 20K examples. β’ 8 items β’ Updated Dec 3, 2023 β’ 22
Paloma Collection Dataset and baseline models for Paloma, a benchmark of language model fit to 585 textual domains β’ 8 items β’ Updated Feb 1 β’ 13
π Daily Picks in Interpretability & Analysis of LMs Collection Outstanding research in interpretability and evaluation of language models, summarized β’ 39 items β’ Updated 15 days ago β’ 53
Qwen1.5 Collection Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. β’ 55 items β’ Updated 4 days ago β’ 167
OpenChat Collection OpenChat: Advancing Open-source Language Models with Mixed-Quality Data β’ 7 items β’ Updated Jan 10 β’ 31
Tulu V2 Suite Collection The set of models associated with the paper "Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2" β’ 19 items β’ Updated Feb 1 β’ 43
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research Paper β’ 2402.00159 β’ Published Jan 31 β’ 55
β StarCoder Collection All models, datasets, and demos related to StarCoder! β’ 11 items β’ Updated Feb 27 β’ 19