EfficientQAT: Efficient Quantization-Aware Training for Large Language Models Paper • 2407.11062 • Published 12 days ago • 3
Spectra: A Comprehensive Study of Ternary, Quantized, and FP16 Language Models Paper • 2407.12327 • Published 5 days ago • 61