L3-8B-Helium3 Collection The culmination of my first LLM project. Hybrid storytelling and RP model, with a focus on niche fetish content. (This will be a recurring theme.) • 3 items • Updated 11 days ago • 1
SSMs Collection A collection of Mamba-2-based research models with 8B parameters trained on 3.5T tokens for comparison with Transformers. • 5 items • Updated about 12 hours ago • 23
Granite Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 23 items • Updated 28 days ago • 160
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 27 items • Updated 9 days ago • 467
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated 2 days ago • 676