neuralmagic/Sparse-Llama-3.1-8B-gsm8k-2of4-FP8-dynamic Text Generation • Updated 22 days ago • 123 • 1
neuralmagic/Meta-Llama-3.1-405B-Instruct-quantized.w8a8 Text Generation • Updated Dec 3, 2024 • 411 • 2