quantize_config.json · study-hjt/Meta-Llama-3-70B-Instruct-GPTQ-Int8 at main

Meta-Llama-3-70B-Instruct-GPTQ-Int8 / quantize_config.json

Jintao Huang

first commit

cab4783 7 months ago

history blame contribute delete

223 Bytes

	{
	"bits": 8,
	"dataset": "sharegpt-gpt4-mini",
	"group_size": 128,
	"damp_percent": 0.1,
	"desc_act": false,
	"sym": true,
	"true_sequential": true,
	"quant_method": "gptq",
	"modules_in_block_to_quantize": null
	}