mistralai/Mixtral-8x7B-Instruct-v0.1
Experts pruned from Mixtral-8x7B-Instruct-v0.1, following the method of the paper "A Provably Effective Method for Pruning Experts in Fine-tuned Sparse MoEs".
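To give a rough feel for what expert pruning in a single MoE layer can look like, here is a minimal sketch. It assumes the selection criterion is the change in the l2 norm of each expert's router weights relative to the pretrained model, with the least-changed experts pruned; that is my reading of the paper and is not confirmed by this card. The function name `experts_to_keep`, the helper `l2_norm`, and the toy router weights are all hypothetical and are not the code used to produce this checkpoint.

```python
import math


def l2_norm(vec):
    """Euclidean norm of a flat list of floats."""
    return math.sqrt(sum(x * x for x in vec))


def experts_to_keep(pretrained_router, finetuned_router, num_keep):
    """Return the indices of the experts to keep in one MoE layer.

    Each argument is a list with one router weight vector per expert.
    ASSUMPTION: experts are ranked by the absolute change in the l2 norm
    of their router vector; the ones with the smallest change are pruned.
    """
    changes = [
        abs(l2_norm(ft) - l2_norm(pre))
        for pre, ft in zip(pretrained_router, finetuned_router)
    ]
    # Sort expert indices from largest to smallest norm change.
    ranked = sorted(range(len(changes)), key=lambda i: changes[i], reverse=True)
    return sorted(ranked[:num_keep])


# Toy data: 4 experts with 2-dimensional router vectors (illustrative only).
pretrained = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.5]]
finetuned = [[3.0, 0.0], [0.0, 1.1], [1.0, 1.0], [2.0, 2.0]]

kept = experts_to_keep(pretrained, finetuned, num_keep=2)
print(kept)
```

In a real model the same ranking would be applied per layer to the router (gate) weight matrix, and the rows for the pruned experts would be dropped together with the corresponding expert feed-forward weights.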