Facebert pbelcak/UltraFastBERT-1x11-long Updated Nov 22, 2023 • 35 • 72 Exponentially Faster Language Modelling Paper • 2311.10770 • Published Nov 15, 2023 • 117
Sparse MoE mistralai/Mixtral-8x7B-Instruct-v0.1 Text Generation • Updated 19 days ago • 787k • • 3.99k