EfficientLLM: Pruning-Aware Pretraining

xrxing's Collections

updated 14 days ago

This collection contains the models from our paper "EfficientLLM: Scalable Pruning-Aware Pretraining for Architecture-Agnostic Edge Language Models".