Edit model card
Downloads last month
13
GGUF
Model size
46.7B params
Architecture
llama

4-bit

6-bit

8-bit

Inference API (serverless) has been turned off for this model.

Quantized from

Collection including martyn/mixtral-megamerge-dare-8x7b-v2-GGUF