Update README.md
Browse files
README.md
CHANGED
@@ -107,7 +107,7 @@ and makes it harder for the compiler to parallelize instructions. You
|
|
107 |
want to ideally use the simplest smallest floating point format that's
|
108 |
natively implemented by your hardware. In most cases, that's BF16 or
|
109 |
FP16. However, llamafile is able to still offer respectable tinyBLAS
|
110 |
-
speedups for llama.cpp's
|
111 |
|
112 |
## Hardware Choices (Mixtral 8x22B Specific)
|
113 |
|
|
|
107 |
want to ideally use the simplest smallest floating point format that's
|
108 |
natively implemented by your hardware. In most cases, that's BF16 or
|
109 |
FP16. However, llamafile is able to still offer respectable tinyBLAS
|
110 |
+
speedups for llama.cpp's simplest quants: Q8\_0 and Q4\_0.
|
111 |
|
112 |
## Hardware Choices (Mixtral 8x22B Specific)
|
113 |
|