mistralai/Mixtral-8x7B-Instruct-v0.1
Text Generation
•
Updated
•
3.16M
•
•
4.24k
Pruned experts from Mixtral-8x7B-Instruct-v0.1 with respect to the paper "A Provably Effective Method for Pruning Experts in Fine-tuned Sparse MoEs"