hugging-quants/Meta-Llama-3.1-70B-Instruct-AWQ-INT4 • Text Generation • Updated Aug 7, 2024 • 81.2k • 100
neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w8a16 • Text Generation • Updated Oct 9, 2024 • 95 • 5
neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w8a16 • Text Generation • Updated Oct 23, 2024 • 12.8k • 10
QuantFactory/Meta-Llama-3-70B-Instruct-GGUF-v2 • Text Generation • Updated May 6, 2024 • 1.24k • 16
Llama 3.1 GPTQ, AWQ, and BNB Quants • Collection • Optimised Quants for high-throughput deployments! Compatible with Transformers, TGI & vLLM 🤗 • 9 items • Updated Sep 26, 2024 • 56