BramVanroy
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -8,7 +8,7 @@ tags:
|
|
8 |
|
9 |
This repository contains quantized versions of [BramVanroy/fietje-2b-instruct](https://huggingface.co/BramVanroy/fietje-2b-instruct):
|
10 |
|
11 |
-
- `-f16` (5.6GB): best quality, but largest and slowest
|
12 |
- `-q8_0` (3.0GB): minimal quality loss, smaller
|
13 |
- `-q5_k_m` (2.0GB): users have reported considerable quality loss in the chat `q5_k_m` version so you may want to avoid it
|
14 |
|
|
|
8 |
|
9 |
This repository contains quantized versions of [BramVanroy/fietje-2b-instruct](https://huggingface.co/BramVanroy/fietje-2b-instruct):
|
10 |
|
11 |
+
- `-f16` (5.6GB): best quality, but largest and slowest (recommended if you have the capacity, otherwise q8_0)
|
12 |
- `-q8_0` (3.0GB): minimal quality loss, smaller
|
13 |
- `-q5_k_m` (2.0GB): users have reported considerable quality loss in the chat `q5_k_m` version so you may want to avoid it
|
14 |
|