Sparse Logit Sampling: Accelerating Knowledge Distillation in LLMs Paper • 2503.16870 • Published Mar 21 • 5