unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit Text Generation • Updated Feb 14 • 148k • 28
Unsloth 4-bit Dynamic Quants Collection Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit • 28 items • Updated 2 days ago • 76
🧠Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 21 items • Updated 8 days ago • 129
Scaling Laws for Downstream Task Performance of Large Language Models Paper • 2402.04177 • Published Feb 6, 2024 • 19