TheBloke commited on
Commit
b2d0b56
1 Parent(s): 7d845a2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -39,7 +39,7 @@ In the `main` branch you will find `Guanaco-13B-GPTQ-4bit-128g.no-act-order.safe
39
 
40
  This will work with all versions of GPTQ-for-LLaMa. It has maximum compatibility.
41
 
42
- It was created without groupsize to minimise VRAM requirements, to keep it under 24GB VRAM. It was created with the `--act-order` parameter to maximise accuracy.
43
 
44
  * `Guanaco-13B-GPTQ-4bit-128g.no-act-order.safetensors`
45
  * Works with all versions of GPTQ-for-LLaMa code, both Triton and CUDA branches
 
39
 
40
  This will work with all versions of GPTQ-for-LLaMa. It has maximum compatibility.
41
 
42
+ It was created with groupsize 128 to ensure higher quality inference, and without `--act-order` to maximise compatibility.
43
 
44
  * `Guanaco-13B-GPTQ-4bit-128g.no-act-order.safetensors`
45
  * Works with all versions of GPTQ-for-LLaMa code, both Triton and CUDA branches