MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection Paper • 2403.19888 • Published Mar 29, 2024 • 9
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding Paper • 1804.07461 • Published Apr 20, 2018 • 4