# Mixtral 7b 8 Expert


This is a preliminary HuggingFace implementation of the newly released mixture-of-experts (MoE) model by Mistral AI. Make sure to load it with `trust_remote_code=True`.
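
For example, a minimal loading sketch (the repo id below is assumed from this page's path and may differ for your copy; `device_map="auto"` additionally requires the `accelerate` package):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed repo id, inferred from this page's path; adjust if needed.
model_id = "DiscoResearch/mixtral-7b-8expert"

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)

# trust_remote_code=True is required so the custom MoE modeling code
# shipped with this repo is used instead of a stock architecture.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    trust_remote_code=True,
    device_map="auto",    # spread the model across available GPUs
    torch_dtype="auto",   # use the checkpoint's native precision
)

inputs = tokenizer("The Mixtral architecture is", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```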

Thanks to @dzhulgakov for his early implementation (https://github.com/dzhulgakov/llama-mistral), which helped me find a working setup.

Come chat about this in our Disco(rd)! :)