Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Paper • 2406.07522 • Published Jun 11 • 35
Sparse Modular Activation for Efficient Sequence Modeling Paper • 2306.11197 • Published Jun 19, 2023 • 1