"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization Paper • 2411.02355 • Published 30 days ago • 46
"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization Paper • 2411.02355 • Published 30 days ago • 46
neuralmagic/Llama-2-7b-cnn-daily-mail-pruned_70-quantized-deepsparse Text Generation • Updated May 17 • 10
neuralmagic/Llama-2-7b-cnn-daily-mail-pruned_50-quantized-deepsparse Text Generation • Updated May 17 • 9
Sparse Foundational Llama 2 Models Collection Sparse pre-trained and fine-tuned Llama models made by Neural Magic + Cerebras • 27 items • Updated Sep 26 • 8
neuralmagic/Llama-2-7b-dolphin-open_platypus-pruned_70-quantized-deepsparse Text Generation • Updated May 16 • 14 • 1
neuralmagic/Llama-2-7b-dolphin-open_platypus-pruned_50-quantized-deepsparse Text Generation • Updated May 16 • 31
Sparse Foundational Llama 2 Models Collection Sparse pre-trained and fine-tuned Llama models made by Neural Magic + Cerebras • 27 items • Updated Sep 26 • 8