💜 Kotlin ML Pack Collection A collection of datasets, fine-tuned models and benchmarks to train your models for perfect Kotlin code generation. • 9 items • Updated 2 days ago • 8
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 11 items • Updated 7 days ago • 97
Granite Code Models Collection A series of code models trained by IBM and released under the Apache 2.0 license. We release both the base pretrained and instruct models. • 14 items • Updated 2 days ago • 126
Nemotron 3 8B Collection The Nemotron 3 8B Family of models is optimized for building production-ready generative AI applications for the enterprise. • 5 items • Updated Feb 19 • 37
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated 10 days ago • 305
Code Llama Family Collection This collection hosts the transformers repos of the Code Llama release • 12 items • Updated Apr 18 • 20
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Apr 18 • 536
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 20 items • Updated 2 days ago • 270
DBRX Collection DBRX is a mixture-of-experts (MoE) large language model trained from scratch by Databricks. • 3 items • Updated Mar 27 • 89
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper • 2402.13753 • Published Feb 21 • 104
Effective Long-Context Scaling of Foundation Models Paper • 2309.16039 • Published Sep 27, 2023 • 28