README.md · dranger003/Smaug-Mixtral-v0.1-iMat.GGUF at fe837155e62121f272c6406f2efc8261b57a5747

metadata

license: apache-2.0
pipeline_tag: text-generation
library_name: gguf

GGUF importance matrix (imatrix) quants for https://huggingface.co/abacusai/Smaug-Mixtral-v0.1

The importance matrix was trained on ~50K tokens (105 batches of 512 tokens, 53,760 tokens in total) using a general-purpose imatrix calibration dataset.

NOTE: The new IQ3_M/IQ3_S/Q3_K_XS quants are currently causing a segfault during quantization, so I'll upload them once llama.cpp gets fixed.

| Layers | Context | Template |
| --- | --- | --- |
| 32 | 32768 | `<s>[INST] {prompt} [/INST] {response}` |
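
The template in the table can be applied with a few lines of Python. This is a minimal sketch, not part of the original model card: `build_prompt` and its `include_bos` parameter are hypothetical names chosen for illustration. Many runtimes insert the `<s>` (BOS) token themselves, so whether to include it literally in the string is an assumption you should check against your own runtime.

```python
# Instruct template from the table above, split so the BOS token is optional.
BOS = "<s>"
TEMPLATE = "[INST] {prompt} [/INST]"


def build_prompt(user_message: str, include_bos: bool = True) -> str:
    """Wrap a single user message in the instruct template.

    The {response} slot is left open: the model's generation fills it in.
    Set include_bos=False if your runtime already prepends the BOS token.
    """
    body = TEMPLATE.format(prompt=user_message.strip())
    return (BOS + body) if include_bos else body


if __name__ == "__main__":
    print(build_prompt("Summarize what an importance matrix is used for."))
```

The returned string is what you would pass as the raw prompt. The model's output then takes the place of `{response}`.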