Update README.md

This is a GPTQ 4-bit quant of ChanSung's Elina 33b.

This is a LLaMA-based model; the LoRA was merged with the latest transformers conversion.

Quantized with GPTQ: --wbits 4 --act-order --true-sequential --save_safetensors, using the c4 calibration set.

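The flags listed above correspond to the GPTQ-for-LLaMa quantization script. A minimal sketch of what the full invocation would look like, assuming the qwopqwop200/GPTQ-for-LLaMa repo is checked out; the model directory and output filename here are placeholders, not the actual paths used:

```shell
# Sketch only: assumes GPTQ-for-LLaMa's llama.py and a merged fp16 model
# in ./elina-33b-merged (hypothetical path). "c4" is the calibration dataset.
python llama.py ./elina-33b-merged c4 \
    --wbits 4 \
    --act-order \
    --true-sequential \
    --save_safetensors elina-33b-4bit.safetensors
```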
A group size of 128 was not used, so those running this on a consumer GPU with 24 GB of VRAM can run it at full context (2048) without any risk of OOM.

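The back-of-envelope arithmetic behind the 24 GB claim can be sketched as follows (a rough estimate: the 33B parameter count is nominal, and quantization metadata and runtime overhead are ignored):

```python
# Rough VRAM estimate for a 33B-parameter model quantized to 4 bits.
params = 33e9                # ~33 billion weights (nominal)
bits_per_weight = 4          # GPTQ --wbits 4

# Weight memory in GiB: bits -> bytes -> GiB.
weight_gib = params * bits_per_weight / 8 / 2**30
print(f"weights alone: ~{weight_gib:.1f} GiB")   # ~15.4 GiB

# What remains on a 24 GiB card for activations and the KV cache
# at the full 2048-token context.
headroom = 24 - weight_gib
print(f"headroom on 24 GiB: ~{headroom:.1f} GiB")
```

With roughly 8-9 GiB left over for activations and the key/value cache, a full 2048-token context fits comfortably, which is why skipping the 128 group size (which adds per-group scale/zero-point overhead) keeps the model safely inside 24 GB.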
Original LoRA:
https://huggingface.co/LLMs/Alpaca-LoRA-30B-elina

Repo:
https://huggingface.co/LLMs

Likely Author:
https://huggingface.co/chansung