fbaldassarri
committed on
Update README.md
README.md
CHANGED
@@ -38,7 +38,7 @@ Fast and low memory, 2-3X speedup (slight accuracy drop at W4G128)
 
 Quantization framework: [Intel AutoRound](https://github.com/intel/auto-round) v0.4.3
 
-Note: this INT4 version of
+Note: this INT4 version of Mistral-7B-v0.3 has been quantized to run inference through CPU.
 
 ## Replication Recipe
 