A Simple and Effective $L_2$ Norm-Based Strategy for KV Cache Compression Paper • 2406.11430 • Published Jun 17 • 22 • 3
WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences Paper • 2406.11069 • Published Jun 16 • 12 • 4
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27 • 581 • 140