distill-1.7B-4bit-GGUF

4-bit GGUF (Q4_K_M) of distill-1.7B โ€” an Expert Language Model for CLI output compression. Zero accuracy loss.

Format Size Accuracy
GGUF fp16 4.1 GB 95%
GGUF Q4_K_M 1.2 GB 95%

Use with LM Studio, Ollama, or llama.cpp. Built for distill.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for samuelfaj/distill-1.7B-GGUF

Finetuned
Qwen/Qwen3-1.7B
Finetuned
(2)
this model

Collection including samuelfaj/distill-1.7B-GGUF