Better & Faster Large Language Models via Multi-token Prediction (Paper • arXiv:2404.19737 • Published 23 days ago)