File size: 501 Bytes
95882b0 1c9bc4e 95882b0 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 |
---
license: apache-2.0
---
The Acceptance Prediction Head for Llama-2-chat 7B and 70B model pair. See [arxiv: 2405.19715](https://arxiv.org/abs/2405.19715) for more details.
Usage: [GitHub](https://github.com/Kaffaljidhmah2/SpecDec_pp)
### Citation
```bibtex
@article{huang2024specdec++,
title={SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths},
author={Huang, Kaixuan and Guo, Xudong and Wang, Mengdi},
journal={arXiv preprint arXiv:2405.19715},
year={2024}
}
``` |