Releasing the largest multilingual open pretraining dataset By Pclanglais • Nov 13, 2024 • 98
A failed experiment: Infini-Attention, and why we should keep trying? Aug 14, 2024 • 54
Welcome FalconMamba: The first strong attention-free 7B model Aug 12, 2024 • 108