SLAB-Llama-350M / README.md
OpenEfficientAI's picture
Update README.md
e330118 verified
metadata
license: mit

An unofficial reproduced PRepBN-Llama-350M checkpoints for SLAB.

Model Sources [optional]

Evaluation

https://github.com/xinghaochen/SLAB/tree/main/llama

python evaluation.py --ckpt <checkpoint-path>

Results

BibTeX:

@inproceedings{guo2024slab,
  title={SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization},
  author={Guo, Jialong and Chen, Xinghao and Tang, Yehui  and Wang, Yunhe},
  booktitle={International Conference on Machine Learning},
  year={2024}
}