Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions Paper • 2310.18780 • Published Oct 28, 2023 • 3
How to Train Your HiPPO: State Space Models with Generalized Orthogonal Basis Projections Paper • 2206.12037 • Published Jun 24, 2022
Zoology: Measuring and Improving Recall in Efficient Language Models Paper • 2312.04927 • Published Dec 8, 2023 • 2
Simple linear attention language models balance the recall-throughput tradeoff Paper • 2402.18668 • Published Feb 28, 2024 • 21