Artefact2
/

Mixtral-8x7B-v0.1-GGUF

Inference Endpoints

Model card Files Files and versions Community

Mixtral-8x7B-v0.1-GGUF / README.md

Artefact2's picture

Create README.md

7b0359c verified 9 months ago

|

596 Bytes

	---
	language:
	- en
	license: apache-2.0
	---

	These are GGUF quantized versions of [mistralai/Mixtral-8x7B-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-v0.1).

	The importance matrix was trained for 100K tokens (200 batches of 512 tokens) using `wiki.train.raw`.

	Some model files above 50GB are split into smaller files. To concatenate them, use the `cat` command (on Windows, use PowerShell): `cat foo-Q6_K.gguf.* > foo-Q6_K.gguf`

	* What quant do I need? See https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9
	* Quant requests? Just open a discussion in the community tabs.