Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated 5 days ago • 161
Running 168 168 Low-bit Quantized Open LLM Leaderboard 🏆 Track, rank and evaluate open LLMs and chatbots
Running on CPU Upgrade 70 70 AIR-Bench Leaderboard 🥇 Explore benchmark results for QA and long doc models