Update README.md
README.md CHANGED
@@ -4,7 +4,7 @@ train: false
 inference: false
 pipeline_tag: text-generation
 ---
-## Mixtral-8x7B-v0.1-hf-4bit_g64-HQQ
+## Mixtral-8x7B-v0.1-hf-4bit_g64-HQQ
 This is a version of the Mixtral-8x7B-v0.1 model (https://huggingface.co/mistralai/Mixtral-8x7B-v0.1) quantized to 4-bit via Half-Quadratic Quantization (HQQ): https://mobiusml.github.io/hqq_blog/
 ### Basic Usage
 To run the model, install the HQQ library from https://github.com/mobiusml/hqq and use it as follows: