keyfan/Mixtral-8x7B-Instruct-2bit

Mixtral-8x7B-Instruct-2bit / quantization_config.json

Upload folder using huggingface_hub

029a854 6 months ago

185 Bytes

	{
	  "quant_method": "QUiP",
	  "rescale_WH": false,
	  "use_rand": true,
	  "codebook": "E8P12",
	  "codesz": 8,
	  "idx_dtype": "torch.int16",
	  "modules_to_not_convert": [
	    "gate"
	  ]
	}
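
The fields above can be sanity-checked with a small script. This is a minimal sketch, not the loader that actually consumes this file. It assumes that each index of type `idx_dtype` encodes one group of `codesz` weights, and that a module is left unquantized when a dotted component of its name matches an entry in `modules_to_not_convert`. The helpers `bits_per_weight` and `is_quantized`, the `DTYPE_BITS` table, and the module names in the usage lines are hypothetical and chosen only for illustration.

```python
import json

# The configuration exactly as it appears in quantization_config.json.
CONFIG_TEXT = """
{
  "quant_method": "QUiP",
  "rescale_WH": false,
  "use_rand": true,
  "codebook": "E8P12",
  "codesz": 8,
  "idx_dtype": "torch.int16",
  "modules_to_not_convert": ["gate"]
}
"""

# Hypothetical lookup table: bit width of each dtype string we expect to see.
DTYPE_BITS = {
    "torch.int8": 8,
    "torch.int16": 16,
    "torch.int32": 32,
    "torch.int64": 64,
}


def bits_per_weight(config):
    """Estimate storage per weight.

    Assumption: one index of `idx_dtype` encodes `codesz` weights.
    """
    return DTYPE_BITS[config["idx_dtype"]] / config["codesz"]


def is_quantized(module_name, config):
    """Return False if the module is listed in `modules_to_not_convert`.

    Assumption: matching is done on dotted components of the module name.
    """
    skipped = set(config["modules_to_not_convert"])
    return not any(part in skipped for part in module_name.split("."))


config = json.loads(CONFIG_TEXT)

print(bits_per_weight(config))  # 16 bits / 8 weights = 2.0
# Illustrative module names; the real names depend on the model implementation.
print(is_quantized("model.layers.0.block_sparse_moe.gate", config))
print(is_quantized("model.layers.0.block_sparse_moe.experts.0.w1", config))
```

Under these assumptions, the result of 2.0 bits per weight is consistent with the "2bit" suffix in the repository name, and the `gate` entry would keep the router layers in their original precision.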