Mixtral-8x7B-Instruct-2bit / quantization_config.json
keyfan's picture
Upload folder using huggingface_hub
029a854
{
"quant_method": "QUiP",
"rescale_WH": false,
"use_rand": true,
"codebook": "E8P12",
"codesz": 8,
"idx_dtype": "torch.int16",
"modules_to_not_convert": [
"gate"
]
}