cgus committed on
Commit
229a8ed
1 Parent(s): 85f3628

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -53,7 +53,7 @@ tags:
 
 Experimental exl2 quantization for CausalLM-14B for Exllamav2.
 I had some issues during quantization process, so I suspect it might have quality issues.
-3.5bpw version barely fits 12GB VRAM but has unusually high perplexity for wikitext dataset.
+3.5bpw version barely fits my 12GB VRAM but has unusually high perplexity for wikitext dataset.
 I couldn't measure perplexity for 4bpw version and to compare it with TheBloke's GPTQ, so I have no idea if my quantization has issues or it supposed to be like this.
 
 You could try this exl2 version but I'd recommend to use [TheBloke's GPTQ](https://huggingface.co/TheBloke/CausalLM-14B-GPTQ) version instead.
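For context on the perplexity comparison the note above mentions: perplexity is the exponential of the average negative log-likelihood a model assigns to the tokens of an evaluation corpus (such as wikitext), so a quantization that damages the weights shows up as a higher number. The snippet below is only an illustrative sketch of that definition using made-up token log-probabilities, not the actual exl2/GPTQ evaluation pipeline used for this model.

```python
import math

def perplexity(token_logprobs):
    """Perplexity = exp of the mean negative log-likelihood per token.

    token_logprobs: natural-log probabilities the model assigned to each
    token in the evaluation text (hypothetical values here, not real
    measurements from this quantization).
    """
    avg_nll = -sum(token_logprobs) / len(token_logprobs)
    return math.exp(avg_nll)

# A model that assigns every token probability 0.25 has perplexity 4:
# exp(-mean(log 0.25)) = exp(log 4) = 4.
print(perplexity([math.log(0.25)] * 10))
```

Lower is better; comparing the same corpus and tokenizer across the 3.5bpw exl2, 4bpw exl2, and GPTQ versions is what would reveal whether the quantization introduced quality loss.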