Conic10K: A Challenging Math Problem Understanding and Reasoning Dataset Paper • 2311.05113 • Published Nov 9, 2023 • 1
Probabilistic Transformer: A Probabilistic Dependency Model for Contextual Word Representation Paper • 2311.15211 • Published Nov 26, 2023
Layer-Condensed KV Cache for Efficient Inference of Large Language Models Paper • 2405.10637 • Published May 17 • 19
Layer-Condensed KV Cache for Efficient Inference of Large Language Models Paper • 2405.10637 • Published May 17 • 19