README.md · dranger003/Smaug-Mixtral-v0.1-iMat.GGUF at 8c1389fbb3105ff4c3e0786b1e149d9107acb88d

---
license: apache-2.0
pipeline_tag: text-generation
library_name: gguf
---

GGUF importance matrix (imatrix) quants for https://huggingface.co/abacusai/Smaug-Mixtral-v0.1
The importance matrix was computed over ~50K tokens (105 batches of 512 tokens) using a general-purpose imatrix calibration dataset.
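
As a quick check on the "~50K" figure, the batch count and batch size quoted above can simply be multiplied out. The variable names here are illustrative only and do not come from llama.cpp.

```python
# Token count implied by the figures quoted above (105 batches of 512 tokens).
n_batches = 105
batch_size = 512

total_tokens = n_batches * batch_size
print(total_tokens)  # 53760
```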

NOTE: The new IQ3_M/IQ3_S/Q3_K_XS quants are currently causing a segfault during quantization, so I'll upload them once llama.cpp gets fixed. The imatrix is being used on the K-quants as well.

| Layers | Context | Template |
| --- | --- | --- |
| 32 | 32768 | `<s>[INST] {prompt} [/INST] {response}` |
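
A minimal sketch of filling in the template above before sending text to the model. The helper name `format_prompt` and its `add_bos` parameter are hypothetical and not part of any library. It assumes the model's response is generated right after `[/INST]`, and that some runtimes prepend the `<s>` (BOS) token themselves, in which case it should be left out of the string.

```python
def format_prompt(prompt: str, add_bos: bool = True) -> str:
    """Build the instruction prompt from the template in the table above.

    add_bos is a hypothetical switch: set it to False when the runtime
    already prepends the <s> (BOS) token during tokenization.
    """
    bos = "<s>" if add_bos else ""
    # The model's response is expected to follow the closing [/INST] tag.
    return f"{bos}[INST] {prompt} [/INST]"


print(format_prompt("Summarize GGUF in one sentence."))
```

Whether to pass `add_bos=False` depends on the inference stack in use, so it is worth checking how your runtime tokenizes the prompt before relying on either setting.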