Facebert
pbelcak/UltraFastBERT-1x11-long • Updated Nov 22, 2023 • 11 • 75
Exponentially Faster Language Modelling • Paper • 2311.10770 • Published Nov 15, 2023 • 118
Sparse MoE
mistralai/Mixtral-8x7B-Instruct-v0.1 • Text Generation • Updated Aug 19, 2024 • 484k • 4.29k