SliceGPT: Compress Large Language Models by Deleting Rows and Columns Paper • 2401.15024 • Published Jan 26 • 67
SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning Paper • 2407.07523 • Published Jul 10 • 4
Spectra: A Comprehensive Study of Ternary, Quantized, and FP16 Language Models Paper • 2407.12327 • Published 28 days ago • 72
Compact Language Models via Pruning and Knowledge Distillation Paper • 2407.14679 • Published 25 days ago • 33
DDK: Distilling Domain Knowledge for Efficient Large Language Models Paper • 2407.16154 • Published 22 days ago • 20
Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning Paper • 2408.00690 • Published 12 days ago • 20