Why Does the Effective Context Length of LLMs Fall Short? Paper • 2410.18745 • Published 28 days ago • 16
Law of the Weakest Link: Cross Capabilities of Large Language Models Paper • 2409.19951 • Published Sep 30 • 53