Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models • Paper • 2401.04658 • Published Jan 9
TransNormerLLM Collection • TransNormerLLM: A Faster and Better Large Language Model with Improved TransNormer • 10 items • Updated Apr 11