QMB15 commited on
Commit
4b9924d
1 Parent(s): 66dd1be

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -0
README.md CHANGED
@@ -1,3 +1,10 @@
 
 
 
 
 
 
 
1
  ---
2
  license: llama2
3
  language:
 
1
+ This is a exllama V2 quantization of https://huggingface.co/TheBloke/Stheno-L2-13B-GPTQ
2
+ Uses a target bpw of 8, intended for best quality on cards like a 3090 or similar.
3
+ Includes measurement.json for convenience of quantizing to other sizes.
4
+ Calibration data: https://huggingface.co/datasets/wikitext/resolve/refs%2Fconvert%2Fparquet/wikitext-2-v1/test/0000.parquet
5
+
6
+
7
+
8
  ---
9
  license: llama2
10
  language: