Compact Language Models via Pruning and Knowledge Distillation Paper • 2407.14679 • Published 26 days ago • 33