ProSparse-LLaMA-2-13B-Predictor
- Model Creator: THUNLP, ModelBest, and PowerInfer
This repository provides a group of sparsity predictors serving for SparseLLM/ProSparse-LLaMA-2-13B.
Citation
Please kindly cite using the following BibTeX:
@article{song2024prosparse,
title={{ProSparse}: Introducing and Enhancing Intrinsic Activation Sparsity within Large Language Models},
author={Song, Chenyang and Han, Xu and Zhang, Zhengyan and Hu, Shengding and Shi, Xiyu and Li, Kuai and Chen, Chen and Liu, Zhiyuan and Li, Guangli and Yang, Tao and Sun, Maosong},
year={2024},
journal={arXiv preprint arXiv:2402.13516},
url={https://arxiv.org/pdf/2402.13516.pdf}
}
- Downloads last month
- 5
Inference API (serverless) does not yet support model repos that contain custom code.