---
license: other
---

WizardLM-33B-V1.0-Uncensored merged with kaiokendev's 33B SuperHOT 8k LoRA, quantized to 4-bit.

It was quantized with GPTQ-for-LLaMa using group size 32 and act-order enabled, to keep perplexity as close as possible to the FP16 model.
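
For reference, a minimal sketch of the same quantization settings (4-bit, group size 32, act-order), shown here with the AutoGPTQ library rather than GPTQ-for-LLaMa, which is what was actually used; the model path, output directory, and calibration text are placeholders:

```python
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig

base_model = "/path/to/WizardLM-33B-V1.0-Uncensored-SuperHOT-8k-fp16"  # placeholder
out_dir = "wizardlm-33b-superhot-8k-4bit-32g"                          # placeholder

tokenizer = AutoTokenizer.from_pretrained(base_model, use_fast=True)

quantize_config = BaseQuantizeConfig(
    bits=4,         # 4-bit quantization
    group_size=32,  # group size 32
    desc_act=True,  # act-order ("act order true")
)

# Load the FP16 model, quantize it with a calibration set, then save.
# A real run should use a proper calibration dataset (e.g. C4), not one sentence.
model = AutoGPTQForCausalLM.from_pretrained(base_model, quantize_config)
examples = [tokenizer("The quick brown fox jumps over the lazy dog.")]
model.quantize(examples)
model.save_quantized(out_dir, use_safetensors=True)
```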

I HIGHLY suggest using ExLlama to avoid VRAM issues.
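
As a rough sketch, loading the model with ExLlama's Python API (https://github.com/turboderp/exllama) could look like the following, assuming the repo is installed and `model_directory` (a placeholder) points at the folder containing `config.json`, `tokenizer.model` and the quantized `.safetensors`; the 8192 context length and 4x positional compression match what the SuperHOT 8k LoRA was trained for:

```python
import os, glob

from model import ExLlama, ExLlamaCache, ExLlamaConfig
from tokenizer import ExLlamaTokenizer
from generator import ExLlamaGenerator

model_directory = "/path/to/WizardLM-33B-V1.0-Uncensored-SuperHOT-8k-GPTQ"  # placeholder

tokenizer_path = os.path.join(model_directory, "tokenizer.model")
model_config_path = os.path.join(model_directory, "config.json")
model_path = glob.glob(os.path.join(model_directory, "*.safetensors"))[0]

config = ExLlamaConfig(model_config_path)  # read config.json
config.model_path = model_path             # point at the 4-bit weights
config.max_seq_len = 8192                  # SuperHOT extended context
config.compress_pos_emb = 4.0              # 8192 / 2048 positional compression

model = ExLlama(config)
tokenizer = ExLlamaTokenizer(tokenizer_path)
cache = ExLlamaCache(model)
generator = ExLlamaGenerator(model, tokenizer, cache)

print(generator.generate_simple("Once upon a time,", max_new_tokens=128))
```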