The official prequantized EfficientQAT models.
- ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w4g128 (Text Generation)
- ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g64 (Text Generation)
- ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g128 (Text Generation)
- ChenMnZ/Llama-3-8b-EfficientQAT-w4g128 (Text Generation)
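
The repository names appear to encode the quantization setting in their suffix. The sketch below assumes `wN` is the weight bit-width and `gM` the quantization group size; this reading is inferred from the names and is not stated in the list itself. `QuantSpec` and `parse_repo_id` are hypothetical helpers written for illustration, not part of any EfficientQAT tooling.

```python
import re
from typing import NamedTuple


class QuantSpec(NamedTuple):
    """Quantization setting read from a repo name (hypothetical helper)."""
    base: str         # base model name, e.g. "Llama-3-8b-instruct"
    weight_bits: int  # assumed meaning of the "w" field
    group_size: int   # assumed meaning of the "g" field


# Matches names of the form "<base>-EfficientQAT-w<bits>g<group>".
_NAME = re.compile(r"^(?P<base>.+)-EfficientQAT-w(?P<bits>\d+)g(?P<group>\d+)$")


def parse_repo_id(repo_id: str) -> QuantSpec:
    """Split 'owner/name' and extract the assumed bit-width and group size."""
    name = repo_id.split("/", 1)[-1]  # drop the owner prefix if present
    match = _NAME.match(name)
    if match is None:
        raise ValueError(f"unrecognized repo name: {repo_id!r}")
    return QuantSpec(match["base"], int(match["bits"]), int(match["group"]))


if __name__ == "__main__":
    for repo in (
        "ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w4g128",
        "ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g64",
        "ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g128",
        "ChenMnZ/Llama-3-8b-EfficientQAT-w4g128",
    ):
        print(repo, parse_repo_id(repo))
```

A helper like this can be handy for filtering the list, for example to pick only the entries parsed as 2-bit, before deciding which checkpoint to download.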