Llama 3.3 (All Versions) Collection Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions. • 3 items • Updated 1 day ago • 37
Load 4bit models 4x faster Collection Native bitsandbytes 4bit pre quantized models • 25 items • Updated 1 day ago • 56
Mistral Small 3 (All Versions) Collection A collection of Mistral's new Small 3.1 and 3 models including GGUF, 4-bit and more! • 14 items • Updated 1 day ago • 7
Phi-4 (All Versions) Collection Microsoft's new Phi-4 models including mini in all formats. Includes GGUF, 4-bit bnb and original versions. Includes Unsloth's bug fixes. • 8 items • Updated 1 day ago • 48
DeepSeek R1 (All Versions) Collection DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated 1 day ago • 214
Deepseek V3 (All Versions) Collection Deepseek-V3-0324 and V3 - available in original, and Dynamic GGUF formats, with support for 2-8-bit quantized versions. • 5 items • Updated 1 day ago • 34
Gemma 3 Collection All versions of Google's new multimodal models in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 29 items • Updated 1 day ago • 46
Unsloth 4-bit Dynamic Quants Collection Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit • 27 items • Updated 1 day ago • 68
Llama 2: Open Foundation and Fine-Tuned Chat Models Paper • 2307.09288 • Published Jul 18, 2023 • 244