hfl-rc commited on
Commit
1093147
1 Parent(s): 0fab619

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +23 -15
README.md CHANGED
@@ -22,21 +22,29 @@ This repository contains the GGUF-v3 models (llama.cpp compatible) for **Chinese
22
 
23
  Metric: PPL, lower is better
24
 
25
- | Quant | PPL |
26
- | ----- | ---- |
27
- | IQ1_S | 27.7911 +/- 0.27400 |
28
- | IQ2_XXS | 6.7233 +/- 0.06118 |
29
- | IQ2_XS | 7.4175 +/- 0.08420 |
30
- | Q2_K | 4.5758 +/- 0.03959 |
31
- | IQ3_XXS | 4.0389 +/- 0.03489 |
32
- | Q3_K | 4.5563 +/- 0.04126 |
33
- | Q4_0 | 3.9757 +/- 0.03455 |
34
- | Q4_K | 3.9265 +/- 0.03407 |
35
- | Q5_0 | 3.9167 +/- 0.03399 |
36
- | Q5_K | 3.9232 +/- 0.03403 |
37
- | Q6_K | 3.9242 +/- 0.03415 |
38
- | Q8_0 | 3.9159 +/- 0.03402 |
39
- | F16 | x |
 
 
 
 
 
 
 
 
40
 
41
  Due to the file size limitation, for F16 model, please use `cat` command to concatenate all parts into a single file. **You must concatenate these parts in order.**
42
 
 
22
 
23
  Metric: PPL, lower is better
24
 
25
+ | Quant | Size ↓ | PPL |
26
+ | ------- | ------- | ------------------ |
27
+ | IQ1_S | 9.8 GB | 9.5782 +/- 0.08909 |
28
+ | IQ1_M | 10.8 GB | 7.4666 +/- 0.06741 |
29
+ | IQ2_XXS | 12.3 GB | 6.3923 +/- 0.05674 |
30
+ | IQ2_XS | 13.7 GB | 6.0606 +/- 0.05834 |
31
+ | IQ2_S | 14.1 GB | 4.7617 +/- 0.04177 |
32
+ | IQ2_M | 15.5 GB | 4.5911 +/- 0.04054 |
33
+ | Q2_K | 17.3 GB | 4.8592 +/- 0.04303 |
34
+ | IQ3_XXS | 18.3 GB | 4.3557 +/- 0.03846 |
35
+ | IQ3_XS | 19.3 GB | 4.3328 +/- 0.03779 |
36
+ | IQ3_S | 20.4 GB | 4.3138 +/- 0.03785 |
37
+ | IQ3_M | 21.4 GB | 4.3024 +/- 0.03775 |
38
+ | Q3_K | 22.5 GB | 4.4334 +/- 0.03937 |
39
+ | IQ4_XS | 25.1 GB | 4.2324 +/- 0.03757 |
40
+ | Q4_0 | 26.4 GB | 4.2688 +/- 0.03787 |
41
+ | IQ4_NL | 26.5 GB | 4.2384 +/- 0.03763 |
42
+ | Q4_K | 28.4 GB | 4.2433 +/- 0.03768 |
43
+ | Q5_0 | 32.2 GB | 4.2142 +/- 0.03733 |
44
+ | Q5_K | 33.2 GB | 4.2177 +/- 0.03743 |
45
+ | Q6_K | 38.4 GB | 4.2184 +/- 0.03754 |
46
+ | Q8_0 | 49.6 GB | 4.2053 +/- 0.03732 |
47
+ | F16 | 93.5 GB | x |
48
 
49
  Due to the file size limitation, for F16 model, please use `cat` command to concatenate all parts into a single file. **You must concatenate these parts in order.**
50