Mozilla
/

Mixtral-8x22B-Instruct-v0.1-llamafile

Model card Files Files and versions Community

jartine commited on Apr 25, 2024

Commit

e0a3324

·

verified ·

1 Parent(s): c7da924

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -107,7 +107,7 @@ and makes it harder for the compiler to parallelize instructions. You
 want to ideally use the simplest smallest floating point format that's
 natively implemented by your hardware. In most cases, that's BF16 or
 FP16. However, llamafile is able to still offer respectable tinyBLAS
-speedups for llama.cpp's oldest and simplest quants: Q8\_0 and Q4\_0.
 ## Hardware Choices (Mixtral 8x22B Specific)

 want to ideally use the simplest smallest floating point format that's
 natively implemented by your hardware. In most cases, that's BF16 or
 FP16. However, llamafile is able to still offer respectable tinyBLAS
+speedups for llama.cpp's simplest quants: Q8\_0 and Q4\_0.
 ## Hardware Choices (Mixtral 8x22B Specific)