Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The checkpoints preserve quality similar to half precision while using 3x less memory • 8 items • Updated 3 days ago • 92
CoRNStack Collection State-of-the-art code retrieval and re-ranking models and datasets • 9 items • Updated 11 days ago • 15
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 20 items • Updated 5 days ago • 122
Open-RS Collection Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t" • 8 items • Updated 16 days ago • 11
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published 17 days ago • 46
Gemma 3 Collection All versions of Google's new multimodal models in 1B, 4B, 12B, and 27B sizes, available in GGUF, dynamic 4-bit, and 16-bit formats. • 29 items • Updated about 17 hours ago • 48