TheBloke/Project-Baize-v2-13B-GPTQ

Text Generation

text-generation-inference

4-bit precision


TheBloke committed on May 24, 2023

Commit a5e5605 · 1 parent: 5534bc6

Update README.md

Files changed (1)

README.md +2 -2


@@ -11,8 +11,8 @@ It is the result of quantising to 4bit using [GPTQ-for-LLaMa](https://github.com
 ## Other repositories available
-* [4bit GPTQ models for GPU inference](https://huggingface.co/TheBloke/Project-Baize-v2-13B-GPTQ)
-* [4bit and 5bit GGML models for CPU inference](https://huggingface.co/TheBloke/Project-Baize-v2-13B-GGML)
+* [4-bit GPTQ models for GPU inference](https://huggingface.co/TheBloke/Project-Baize-v2-13B-GPTQ)
+* [4-bit, 5-bit and 8-bit GGML models for CPU(+GPU) inference](https://huggingface.co/TheBloke/Project-Baize-v2-13B-GGML)
 * [Original unquantised fp16 model in HF format](https://huggingface.co/project-baize/baize-v2-13b)
 ## How to easily download and use this model in text-generation-webui