intervitens
/

Mixtral-8x7B-Instruct-limarp-v0.1-5.5bpw-h6-exl2-rpcal

Text Generation

text-generation-inference

Model card Files Files and versions Community

intervitens commited on Dec 25, 2023

Commit

91fa22b

•

1 Parent(s): 9f8998e

Update README.md

Files changed (1) hide show

README.md +11 -0

README.md CHANGED Viewed

@@ -9,6 +9,17 @@ tags:
 license: apache-2.0
 ---
 # Mixtral-8x7B-Instruct-limarp-v0.1
 Experimental model, using a limarp qlora trained at 10k ctx length (greater than size of the longest limarp sample when tokenized via mistral's tokenizer) on [mistralai/Mixtral-8x7B-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-v0.1) and then fused to [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) at 0.5 weight.

 license: apache-2.0
 ---
+Quantized using 200 samples of 8192 tokens from an RP-oriented [PIPPA](https://huggingface.co/datasets/royallab/PIPPA-cleaned) dataset.
+Requires ExllamaV2 version 0.0.11 and up.
+Original model link: [Doctor-Shotgun/Mixtral-8x7B-Instruct-limarp-v0.1](https://huggingface.co/Doctor-Shotgun/Mixtral-8x7B-Instruct-limarp-v0.1)
+Original model README below.
+***
 # Mixtral-8x7B-Instruct-limarp-v0.1
 Experimental model, using a limarp qlora trained at 10k ctx length (greater than size of the longest limarp sample when tokenized via mistral's tokenizer) on [mistralai/Mixtral-8x7B-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-v0.1) and then fused to [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) at 0.5 weight.