casperhansen
/

mixtral-instruct-awq

Text Generation

text-generation-inference

4-bit precision

Model card Files Files and versions Community

This is a working version of Mixtral Instruct that is AWQ quantized. As of 11-02-2024, https://huggingface.co/TheBloke/Mixtral-8x7B-Instruct-v0.1-AWQ is not working, so please use this repository instead.

Downloads last month: 2,792

Safetensors

Model size

6.48B params

Tensor type

I32

·

FP16

·

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Space using casperhansen/mixtral-instruct-awq 1