Panchovix/WizardLM-33B-V1.0-Uncensored-SuperHOT-8k-4bit-32g

Panchovix committed on Jun 26, 2023

Commit ca5ff77 • 1 Parent(s): c47c96e

Update README.md

Files changed (1)

README.md +5 -0

@@ -1,3 +1,8 @@
 ---
 license: other
 ---
+[WizardLM-33B-V1.0-Uncensored](https://huggingface.co/ehartford/WizardLM-33B-V1.0-Uncensored) merged with kaiokendev's [33b SuperHOT 8k LoRA](https://huggingface.co/kaiokendev/superhot-30b-8k-no-rlhf-test), quantized to 4-bit.
+It was created with GPTQ-for-LLaMA using group size 32 and act-order enabled, to keep perplexity as close as possible to the FP16 model.
+I HIGHLY suggest using exllama to avoid some VRAM issues.
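
As a rough illustration of the group-size trade-off mentioned in the README, the sketch below estimates the storage cost per weight for group-wise 4-bit quantization. It is not taken from the commit. It assumes each group stores one 16-bit scale and one 4-bit zero point, and it assumes a parameter count of about 32.5 billion for the "33B" model. The function names are hypothetical. Under those assumptions, group size 32 costs 4.625 bits per weight, which is more than a larger group size such as 128 would, and that extra footprint may be part of why VRAM gets tight.

```python
def gptq_bits_per_weight(wbits=4, group_size=32, scale_bits=16, zero_bits=4):
    """Approximate bits stored per weight for group-wise quantization.

    Assumption: every group of `group_size` weights shares one scale
    (`scale_bits`) and one zero point (`zero_bits`).
    """
    return wbits + (scale_bits + zero_bits) / group_size


def approx_weight_gib(n_params, bits_per_weight):
    """Convert a parameter count and bits-per-weight figure to GiB."""
    return n_params * bits_per_weight / 8 / 2**30


# Assumed parameter count for a "33B" LLaMA-family model (not from the README).
N_PARAMS = 32.5e9

for g in (32, 128):
    bpw = gptq_bits_per_weight(group_size=g)
    size = approx_weight_gib(N_PARAMS, bpw)
    print(f"group size {g:>3}: {bpw:.3f} bits/weight, ~{size:.1f} GiB of weights")
```

The estimate covers quantized weights only. Activations, the key/value cache for an 8k context, and any layers left unquantized add to the real VRAM requirement.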