Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves quality similar to half precision while using 3x less memory • 8 items • Updated 4 days ago • 96
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated 7 days ago • 436
EXAONE-Deep Collection EXAONE reasoning model series of 2.4B, 7.8B, and 32B, optimized for reasoning tasks including math and coding • 9 items • Updated 21 days ago • 85
steiner-preview Collection Reasoning models trained on synthetic data using reinforcement learning. • 3 items • Updated Oct 20, 2024 • 32
Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub Feb 12 • 56
Gemma 2 JPN Release Collection A Gemma 2 2B model fine-tuned on Japanese text. It supports Japanese at the same level of performance as English-only queries on Gemma 2. • 3 items • Updated 4 days ago • 28
Gemma 2 2B Release Collection The 2.6B parameter version of Gemma 2. • 6 items • Updated 4 days ago • 79
WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation Paper • 2312.14187 • Published Dec 20, 2023 • 52