Why Does the Effective Context Length of LLMs Fall Short? Paper • 2410.18745 • Published Oct 24 • 16 • 3
Training-Free Long-Context Scaling of Large Language Models Paper • 2402.17463 • Published Feb 27 • 19 • 3