bartowski
/

Yi-9B-Coder-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

Yi-9B-Coder-GGUF / README.md

bartowski's picture

Update README.md

8e6e8a6 verified 8 months ago

|

2.84 kB

	---
	tags:
	- code
	- llama
	library_name: transformers
	pipeline_tag: text-generation
	license: other
	license_name: yi-license
	license_link: https://huggingface.co/01-ai/Yi-9B/blob/main/LICENSE
	quantized_by: bartowski
	---

	## Llamacpp Quantizations of Yi-9B-Coder

	Using <a href="https://github.com/ggerganov/llama.cpp/">llama.cpp</a> release <a href="https://github.com/ggerganov/llama.cpp/releases/tag/b2405">b2405</a> for quantization.

	Original model: https://huggingface.co/TechxGenus/Yi-9B-Coder

	Download a file (not the whole branch) from below:

	\| Filename \| Quant type \| File Size \| Description \|
	\| -------- \| ---------- \| --------- \| ----------- \|
	\| [Yi-9B-Coder-Q8_0.gguf](https://huggingface.co/bartowski/Yi-9B-Coder-GGUF/blob/main/Yi-9B-Coder-Q8_0.gguf) \| Q8_0 \| 9.38GB \| Extremely high quality, generally unneeded but max available quant. \|
	\| [Yi-9B-Coder-Q6_K.gguf](https://huggingface.co/bartowski/Yi-9B-Coder-GGUF/blob/main/Yi-9B-Coder-Q6_K.gguf) \| Q6_K \| 7.24GB \| Very high quality, near perfect, recommended. \|
	\| [Yi-9B-Coder-Q5_K_M.gguf](https://huggingface.co/bartowski/Yi-9B-Coder-GGUF/blob/main/Yi-9B-Coder-Q5_K_M.gguf) \| Q5_K_M \| 6.25GB \| High quality, very usable. \|
	\| [Yi-9B-Coder-Q5_K_S.gguf](https://huggingface.co/bartowski/Yi-9B-Coder-GGUF/blob/main/Yi-9B-Coder-Q5_K_S.gguf) \| Q5_K_S \| 6.10GB \| High quality, very usable. \|
	\| [Yi-9B-Coder-Q5_0.gguf](https://huggingface.co/bartowski/Yi-9B-Coder-GGUF/blob/main/Yi-9B-Coder-Q5_0.gguf) \| Q5_0 \| 6.10GB \| High quality, older format, generally not recommended. \|
	\| [Yi-9B-Coder-Q4_K_M.gguf](https://huggingface.co/bartowski/Yi-9B-Coder-GGUF/blob/main/Yi-9B-Coder-Q4_K_M.gguf) \| Q4_K_M \| 5.32GB \| Good quality, similar to 4.25 bpw. \|
	\| [Yi-9B-Coder-Q4_K_S.gguf](https://huggingface.co/bartowski/Yi-9B-Coder-GGUF/blob/main/Yi-9B-Coder-Q4_K_S.gguf) \| Q4_K_S \| 5.07GB \| Slightly lower quality with small space savings. \|
	\| [Yi-9B-Coder-Q4_0.gguf](https://huggingface.co/bartowski/Yi-9B-Coder-GGUF/blob/main/Yi-9B-Coder-Q4_0.gguf) \| Q4_0 \| 5.03GB \| Decent quality, older format, generally not recommended. \|
	\| [Yi-9B-Coder-Q3_K_L.gguf](https://huggingface.co/bartowski/Yi-9B-Coder-GGUF/blob/main/Yi-9B-Coder-Q3_K_L.gguf) \| Q3_K_L \| 4.69GB \| Lower quality but usable, good for low RAM availability. \|
	\| [Yi-9B-Coder-Q3_K_M.gguf](https://huggingface.co/bartowski/Yi-9B-Coder-GGUF/blob/main/Yi-9B-Coder-Q3_K_M.gguf) \| Q3_K_M \| 4.32GB \| Even lower quality. \|
	\| [Yi-9B-Coder-Q3_K_S.gguf](https://huggingface.co/bartowski/Yi-9B-Coder-GGUF/blob/main/Yi-9B-Coder-Q3_K_S.gguf) \| Q3_K_S \| 3.89GB \| Low quality, not recommended. \|
	\| [Yi-9B-Coder-Q2_K.gguf](https://huggingface.co/bartowski/Yi-9B-Coder-GGUF/blob/main/Yi-9B-Coder-Q2_K.gguf) \| Q2_K \| 3.35GB \| Extremely low quality, not recommended.

	Want to support my work? Visit my ko-fi page here: https://ko-fi.com/bartowski