Llama 3.1 Collection This collection hosts the transformers and original repos of the Meta Llama 3.1, Llama Guard 3 and Prompt Guard models β’ 11 items β’ Updated about 13 hours ago β’ 377
πͺ SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos β’ 12 items β’ Updated 10 days ago β’ 114
Tulu V2.5 Suite Collection A suite of models trained using DPO and PPO across a wide variety (up to 14) of preference datasets. See https://arxiv.org/abs/2406.09279 for more! β’ 41 items β’ Updated Jun 14 β’ 10
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. β’ 29 items β’ Updated Jun 6 β’ 266
K2 Collection K2-65B is a fully reproducible LLM outperforming Llama 2 70B using 35% less compute. β’ 7 items β’ Updated Jun 12 β’ 6
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma β’ 16 items β’ Updated 1 day ago β’ 125
OLMo Suite Collection Artifacts for the first set of OLMo models. β’ 14 items β’ Updated about 1 month ago β’ 44
Mantis Collection Mantis model family optimized for multi-image reasoning with interleaved text/image format β’ 11 items β’ Updated 24 days ago β’ 7
Granite Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. β’ 25 items β’ Updated 2 days ago β’ 147
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. β’ 23 items β’ Updated 15 days ago β’ 377
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases β’ 5 items β’ Updated about 13 hours ago β’ 658
Idefics2 πΆ Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. β’ 11 items β’ Updated May 6 β’ 88
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 β’ 149
[lecture artifacts] aligning open language models Collection artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin β’ 63 items β’ Updated Apr 17 β’ 53
Leaderboards and benchmarks β¨ Collection Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... β’ 66 items β’ Updated 7 days ago β’ 72
Aurora-M models Collection Aurora-M models (base, biden-harris redteams and instruct) β’ 5 items β’ Updated May 6 β’ 17
LLM Leaderboard best models β€οΈβπ₯ Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: β’ 264 items β’ Updated Jun 22 β’ 355
Paloma Collection Dataset and baseline models for Paloma, a benchmark of language model fit to 585 textual domains β’ 8 items β’ Updated Jun 10 β’ 13
π Daily Picks in Interpretability & Analysis of LMs Collection Outstanding research in interpretability and evaluation of language models, summarized β’ 63 items β’ Updated 3 days ago β’ 75
Qwen1.5 Collection Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. β’ 55 items β’ Updated Jun 6 β’ 202
OpenChat Collection OpenChat: Advancing Open-source Language Models with Mixed-Quality Data β’ 7 items β’ Updated Jan 10 β’ 33
Tulu V2 Suite Collection The set of models associated with the paper "Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2" β’ 19 items β’ Updated Jun 10 β’ 43
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research Paper β’ 2402.00159 β’ Published Jan 31 β’ 56
β StarCoder Collection All models, datasets, and demos related to StarCoder! β’ 11 items β’ Updated Feb 27 β’ 20