Panchovix commited on
Commit
137e2d4
1 Parent(s): 807e81c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -17,7 +17,7 @@ tags:
17
 
18
  These files are GPTQ model files for [Meta's Llama 2 70B](https://huggingface.co/meta-llama/Llama-2-70b-hf/tree/main) but with new FP16 files, made with the last transformers version. (transformers-4.32.0.dev0)
19
 
20
- This is a direct quant from the model, and since there was no remote_code in the files, it is unkown if GQA works or not.
21
 
22
  ## Quant parameters
23
 
 
17
 
18
  These files are GPTQ model files for [Meta's Llama 2 70B](https://huggingface.co/meta-llama/Llama-2-70b-hf/tree/main) but with new FP16 files, made with the last transformers version. (transformers-4.32.0.dev0)
19
 
20
+ GQA Works with exllama, but not GPTQ for LLaMA/AutoGPTQ.
21
 
22
  ## Quant parameters
23