Edit model card

2024-03-14

This is a 3-bit exl2 (Exllamav2) conversion of MistralAI's Mixtral-8x7B-v0.1

This takes about 17GB on disk, so it should load on consumer cards with 24GB of VRAM.

Downloads last month
1
Model is too large to load in Inference API (serverless). To try the model, launch it on Inference Endpoints (dedicated) instead.