GGUF
English
exbert
Eval Results
Inference Endpoints
Crataco commited on
Commit
6b7642c
·
verified ·
1 Parent(s): 70c8caa

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +25 -0
README.md CHANGED
@@ -27,6 +27,31 @@ distilgpt2, available in as many GGUF quantization levels as possible as of Marc
27
 
28
  Importance matrix was used with context set to 1,024, created with [Kalomaze's "groups_merged.txt"](https://github.com/ggerganov/llama.cpp/discussions/5263#discussioncomment-8395384).
29
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
30
  Original model card below.
31
  ***
32
  # DistilGPT2
 
27
 
28
  Importance matrix was used with context set to 1,024, created with [Kalomaze's "groups_merged.txt"](https://github.com/ggerganov/llama.cpp/discussions/5263#discussioncomment-8395384).
29
 
30
+ |Quantization|HellaSwag|
31
+ |--------|--------|
32
+ |IQ1_S |25.3% |
33
+ |IQ2_XXS |27.3% |
34
+ |IQ2_XS |27.5% |
35
+ |IQ2_S |26.6% |
36
+ |IQ2_M |27.4% |
37
+ |Q2_K_S |24.4% |
38
+ |Q2_K |27.3% |
39
+ |IQ3_XXS |27.1% |
40
+ |IQ3_XS |26.6% |
41
+ |IQ3_S |26.2% |
42
+ |IQ3_M |26.2% |
43
+ |Q3_K_S |27.8% |
44
+ |Q3_K_M |26.2% |
45
+ |IQ4_XS |27.1% |
46
+ |IQ4_NL |27.1% |
47
+ |Q4_0 |26.3% |
48
+ |Q4_K_M |26.8% |
49
+ |Q5_0 |27.2% |
50
+ |Q5_1 |26.7% |
51
+ |Q5_K_M |26.8% |
52
+ |Q6_K |26.7% |
53
+ |Q8_K |26.5% |
54
+
55
  Original model card below.
56
  ***
57
  # DistilGPT2