Facebert
pbelcak/UltraFastBERT-1x11-long • Updated Nov 22, 2023 • 245 • 75
Exponentially Faster Language Modelling • Paper 2311.10770 • Published Nov 15, 2023 • 119
Sparse MoE
mistralai/Mixtral-8x7B-Instruct-v0.1 • Text Generation • Updated Aug 19, 2024 • 515k • 4.38k