etemiz commited on
Commit
a07d189
1 Parent(s): 0fe9680

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -2,10 +2,10 @@
2
  license: llama3.1
3
  ---
4
  Llama 3.1 405B Quants and llama.cpp versions that is used for quantization
5
- - IQ1_S: 86.8 GB b3459
6
- - IQ1_M: 95.1 GB b3459
7
- - IQ2_XXS: 109.0 GB b3459
8
- - IQ3_XXS: 157.7 GB b3484
9
 
10
  Quantization from BF16 here:
11
  https://huggingface.co/nisten/meta-405b-instruct-cpu-optimized-gguf/
 
2
  license: llama3.1
3
  ---
4
  Llama 3.1 405B Quants and llama.cpp versions that is used for quantization
5
+ - IQ1_S: 86.8 GB - b3459
6
+ - IQ1_M: 95.1 GB - b3459
7
+ - IQ2_XXS: 109.0 GB - b3459
8
+ - IQ3_XXS: 157.7 GB - b3484
9
 
10
  Quantization from BF16 here:
11
  https://huggingface.co/nisten/meta-405b-instruct-cpu-optimized-gguf/