A Primer on the Inner Workings of Transformer-based Language Models Paper • 2405.00208 • Published 29 days ago • 7