Llama-3-8B-Instruct-OmniQuant / quant_config.json
Vasily Alexeev
add asymm quantized model, add two eos in code sample
6758e8a
{"wbits": 4, "abits": 16, "group_size": 128, "symmetric": false}