Meta-Llama-3-8B-Instruct-GPTQ-4bit / quantize_config.json

Commit History

AutoGPTQ model for meta-llama/Meta-Llama-3-8B-Instruct: 4bits, gr128, desc_act=True
502546c
verified

HRuiii commited on