ZeroWw/Mistral-7B-Instruct-v0.3-GGUF


ZeroWw committed 6 days ago

Commit a7a786f • 1 Parent(s): b645964

Update README.md

Files changed (1)

README.md +2 -3

README.md CHANGED

@@ -4,10 +4,9 @@ language:
 - en
 ---
-My own quantizations.
-output and embed tesnors quantized to f16.
+My own (ZeroWw) quantizations.
+output and embed tensors quantized to f16.
 all other tensors quantized to q5_k or q6_k.
-the q8_0 version is pure (all tensors quantized to Q8_0 just for reference)
 Result:
 both f16.q6 and f16.q5 are smaller than q8_0 standard quantization
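
The "Result" claim in the README can be sanity-checked with a rough size estimate. The sketch below is not part of the commit. It assumes the commonly cited shape of Mistral-7B-Instruct-v0.3 (vocabulary 32,768, hidden size 4,096, about 7.25 billion parameters) and the effective bits per weight of the GGML block formats, and it ignores file metadata. The name `estimate_size_bytes` and all constants are illustrative.

```python
# Rough size estimate for the quantization mixes described in the README.
# All constants below are assumptions, not values taken from the commit.

# Effective bits per weight of each format (block payload plus scales).
BITS_PER_WEIGHT = {
    "f16": 16.0,
    "q8_0": 8.5,      # 34 bytes per block of 32 weights
    "q6_k": 6.5625,   # 210 bytes per super-block of 256 weights
    "q5_k": 5.5,      # 176 bytes per super-block of 256 weights
}

# Assumed shape of Mistral-7B-Instruct-v0.3.
VOCAB_SIZE = 32_768
HIDDEN_SIZE = 4_096
TOTAL_PARAMS = 7_248_023_552

# Token embedding matrix plus output matrix, both kept at f16 in the mixes.
EMBED_AND_OUTPUT_PARAMS = 2 * VOCAB_SIZE * HIDDEN_SIZE
OTHER_PARAMS = TOTAL_PARAMS - EMBED_AND_OUTPUT_PARAMS


def estimate_size_bytes(embed_output_type: str, other_type: str) -> float:
    """Estimate file size, treating every non-embedding tensor as quantized."""
    bits = (EMBED_AND_OUTPUT_PARAMS * BITS_PER_WEIGHT[embed_output_type]
            + OTHER_PARAMS * BITS_PER_WEIGHT[other_type])
    return bits / 8


sizes = {
    "f16.q6": estimate_size_bytes("f16", "q6_k"),
    "f16.q5": estimate_size_bytes("f16", "q5_k"),
    "q8_0": estimate_size_bytes("q8_0", "q8_0"),
}

for name, size in sizes.items():
    print(f"{name}: {size / 1e9:.2f} GB")
```

Under these assumptions both mixes come out smaller than a pure q8_0 file, because the f16 embedding and output tensors account for under 4% of the weights while the remaining tensors drop from 8.5 to 6.5625 or 5.5 bits each. Real files will differ somewhat, since some small tensors are stored unquantized.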