OpenCodeReasoning Collection Reasoning data for supervised finetuning of LLMs to advance data distillation for competitive coding • 2 items • Updated about 10 hours ago • 4
Llama Nemotron Collection Open, Production-ready Enterprise Models • 4 items • Updated about 10 hours ago • 31
Granite 3.2 Models (GGUF) Collection GGUF-formatted versions of IBM Granite 3.2 models. Licensed under the Apache 2.0 license. • 5 items • Updated 20 days ago • 4
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 8 items • Updated 7 days ago • 110
Qwen2.5-Omni Collection End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 3 items • Updated 14 days ago • 82
C4AI Aya Vision Collection Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated Mar 4 • 68
EXAONE-3.5 Collection EXAONE 3.5 language model series including instruction-tuned models of 2.4B, 7.8B, and 32B • 10 items • Updated 24 days ago • 109
view article Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 965
Multimodal Models Collection Multimodal models with leading performance. • 17 items • Updated Jan 17 • 33
Breeze 2 Family Collection Llama-Breeze2 is a multi-modal language model family specifically intended for Traditional Chinese use. BreezyVoice is a Taiwan Mandarin TTS • 6 items • Updated Feb 26 • 18