Noromaid-v0.4-Mixtral-Instruct-8x7b-Zloss 3.5bpw

Exllama quant of NeverSleep/Noromaid-v0.4-Mixtral-Instruct-8x7b-Zloss

You will need 24gb of vram to run this model at about half context (16k, you can probably go a bit higher too)

Prompt format: Chatml

<|im_start|>system
{sysprompt}<|im_end|>
<|im_start|>user
{input}<|im_end|>
<|im_start|>assistant
{output}<|im_end|>

Contact

Kooten on discord.

Downloads last month
11
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Collection including Kooten/Noromaid-v0.4-Mixtral-Instruct-8x7b-Zloss-3.5bpw-exl2