intervitens committed
Commit 6935025
1 Parent(s): d38c266

Update README.md

Files changed (1)
  1. README.md +12 -0
README.md CHANGED
@@ -8,6 +8,18 @@ language:
  - en
  inference: false
  ---
+
+
+ Quantized using 200 samples of 8192 tokens from the RP-oriented [PIPPA](https://huggingface.co/datasets/royallab/PIPPA-cleaned) dataset. For purposes other than RP, use quantizations done on a more general dataset, like [these](https://huggingface.co/turboderp/Mixtral-8x7B-instruct-exl2).
+
+ Requires ExLlamaV2 version 0.0.11 or later.
+
+ Original model link: [Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1)
+
+ Original model README below.
+
+ ***
+
  # Model Card for Mixtral-8x7B
  The Mixtral-8x7B Large Language Model (LLM) is a pretrained generative Sparse Mixture of Experts. The Mixtral-8x7B outperforms Llama 2 70B on most benchmarks we tested.
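
For readers new to EXL2 quants, here is a minimal sketch of loading a quant like this one with the exllamav2 Python API (version 0.0.11 or later, per the requirement added above). The local model path, prompt, and sampler settings are illustrative assumptions, not values recorded in this commit.

```python
# Minimal sketch: loading and sampling an EXL2 quant with ExLlamaV2 (>= 0.0.11).
# The model directory and sampler settings below are assumptions for illustration.
from exllamav2 import ExLlamaV2, ExLlamaV2Config, ExLlamaV2Cache, ExLlamaV2Tokenizer
from exllamav2.generator import ExLlamaV2BaseGenerator, ExLlamaV2Sampler

config = ExLlamaV2Config()
config.model_dir = "/models/Mixtral-8x7B-instruct-exl2"  # hypothetical local download of this repo
config.prepare()

model = ExLlamaV2(config)
cache = ExLlamaV2Cache(model, lazy=True)   # allocate the KV cache as layers load
model.load_autosplit(cache)                # split weights across available GPUs

tokenizer = ExLlamaV2Tokenizer(config)
generator = ExLlamaV2BaseGenerator(model, cache, tokenizer)

settings = ExLlamaV2Sampler.Settings()
settings.temperature = 0.8                 # illustrative sampling choice

# Mixtral-Instruct uses the [INST] ... [/INST] prompt format.
print(generator.generate_simple("[INST] Write a short greeting. [/INST]",
                                settings, num_tokens=128))
```

The PIPPA calibration itself (200 rows of 8192 tokens) would have been applied at conversion time with ExLlamaV2's convert.py; the exact command used is not part of this commit.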