TheBloke commited on
Commit
67e539e
1 Parent(s): 3a6f4ab

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -0
README.md CHANGED
@@ -16,6 +16,7 @@ It is the result of quantising to 4bit and 5bit GGML for CPU inference using [ll
16
 
17
  * [4bit GPTQ models for GPU inference](https://huggingface.co/TheBloke/wizard-vicuna-13B-GPTQ).
18
  * [4bit and 5bit GGML models for CPU inference](https://huggingface.co/TheBloke/wizard-vicuna-13B-GGML).
 
19
 
20
  ## Provided files
21
  | Name | Quant method | Bits | Size | RAM required | Use case |
 
16
 
17
  * [4bit GPTQ models for GPU inference](https://huggingface.co/TheBloke/wizard-vicuna-13B-GPTQ).
18
  * [4bit and 5bit GGML models for CPU inference](https://huggingface.co/TheBloke/wizard-vicuna-13B-GGML).
19
+ * [float16 HF format model for GPU inference](https://huggingface.co/TheBloke/wizard-vicuna-13B-HF).
20
 
21
  ## Provided files
22
  | Name | Quant method | Bits | Size | RAM required | Use case |