elinas commited on
Commit
ac5ec49
1 Parent(s): a9d016c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -7,7 +7,8 @@ tags:
7
  ---
8
 
9
  # vicuna-13b-4bit
10
- Converted `vicuna-13b` to GPTQ 4bit using `true-sequentual` and `groupsize 128` in `safetensors` for best possible model performance.
 
11
 
12
  Vicuna is a high coherence model based on Llama that is comparable to ChatGPT. Read more here https://vicuna.lmsys.org/
13
 
 
7
  ---
8
 
9
  # vicuna-13b-4bit
10
+ Converted `vicuna-13b` to GPTQ 4bit using `true-sequentual` and `groupsize 128` in `safetensors` for best possible model performance.
11
+ This does **not** support `llama.cpp` or any other cpp implemetations, only `cuda` or `triton`.
12
 
13
  Vicuna is a high coherence model based on Llama that is comparable to ChatGPT. Read more here https://vicuna.lmsys.org/
14