---
license: apache-2.0
---

The acceptance prediction head for the Llama-2-chat 7B and 70B model pair. See [arXiv:2405.19715](https://arxiv.org/abs/2405.19715) for more details.

Usage: see the [GitHub repository](https://github.com/Kaffaljidhmah2/SpecDec_pp).

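For orientation, here is a minimal sketch of how an acceptance prediction head could drive an adaptive candidate length. It assumes the stopping rule summarized in the paper's abstract: stop drafting once the predicted probability that at least one candidate token gets rejected exceeds a threshold. The function names, the list of per-token acceptance probabilities, and the threshold values are hypothetical illustrations, not the repository's API; see the GitHub link above for the actual usage.

```python
from typing import List


def should_stop(accept_probs: List[float], threshold: float) -> bool:
    """Return True when the estimated P(at least one rejection) exceeds threshold.

    accept_probs holds hypothetical per-token conditional acceptance
    probabilities, standing in for the outputs of a prediction head.
    """
    p_all_accepted = 1.0
    for p in accept_probs:
        p_all_accepted *= p
    return (1.0 - p_all_accepted) > threshold


def choose_candidate_length(
    accept_probs: List[float], threshold: float, max_len: int
) -> int:
    """Number of candidate tokens drafted before handing off to verification.

    Assumption: drafting stops after the first token at which should_stop()
    fires, or at max_len if it never fires.
    """
    limit = min(len(accept_probs), max_len)
    for k in range(1, limit + 1):
        if should_stop(accept_probs[:k], threshold):
            return k
    return limit


if __name__ == "__main__":
    # Illustrative probabilities only; real values would come from the head.
    probs = [0.9, 0.8, 0.5, 0.9]
    print(choose_candidate_length(probs, threshold=0.5, max_len=10))
```

A lower threshold makes the draft model hand off to verification sooner, while a higher one lets it speculate longer; the values here are placeholders, not tuned settings.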

### Citation

```bibtex
@article{huang2024specdec++,
  title={SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths},
  author={Huang, Kaixuan and Guo, Xudong and Wang, Mengdi},
  journal={arXiv preprint arXiv:2405.19715},
  year={2024}
}
```