hacky's picture
Update README.md
1c9bc4e verified
|
raw
history blame contribute delete
No virus
501 Bytes
metadata
license: apache-2.0

The Acceptance Prediction Head for Llama-2-chat 7B and 70B model pair. See arxiv: 2405.19715 for more details.

Usage: GitHub

Citation

@article{huang2024specdec++,
  title={SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths},
  author={Huang, Kaixuan and Guo, Xudong and Wang, Mengdi},
  journal={arXiv preprint arXiv:2405.19715},
  year={2024}
}