Llamacpp quants

Browse files

Files changed (14) hide show

.gitattributes +12 -0
README.md +30 -0
mistral-7b-v0.1-layla-v4-chatml-Q2_K.gguf +3 -0
mistral-7b-v0.1-layla-v4-chatml-Q3_K_L.gguf +3 -0
mistral-7b-v0.1-layla-v4-chatml-Q3_K_M.gguf +3 -0
mistral-7b-v0.1-layla-v4-chatml-Q3_K_S.gguf +3 -0
mistral-7b-v0.1-layla-v4-chatml-Q4_0.gguf +3 -0
mistral-7b-v0.1-layla-v4-chatml-Q4_K_M.gguf +3 -0
mistral-7b-v0.1-layla-v4-chatml-Q4_K_S.gguf +3 -0
mistral-7b-v0.1-layla-v4-chatml-Q5_0.gguf +3 -0
mistral-7b-v0.1-layla-v4-chatml-Q5_K_M.gguf +3 -0
mistral-7b-v0.1-layla-v4-chatml-Q5_K_S.gguf +3 -0
mistral-7b-v0.1-layla-v4-chatml-Q6_K.gguf +3 -0
mistral-7b-v0.1-layla-v4-chatml-Q8_0.gguf +3 -0

.gitattributes CHANGED Viewed

@@ -33,3 +33,15 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text

README.md ADDED Viewed

	@@ -0,0 +1,30 @@

+---
+license: apache-2.0
+quantized_by: bartowski
+pipeline_tag: text-generation
+---
+## Llamacpp Quantizations of mistral-7b-v0.1-layla-v4-chatml
+Using <a href="https://github.com/ggerganov/llama.cpp/">llama.cpp</a> release <a href="https://github.com/ggerganov/llama.cpp/releases/tag/b2405">b2405</a> for quantization.
+Original model: https://huggingface.co/l3utterfly/mistral-7b-v0.1-layla-v4-chatml
+Download a file (not the whole branch) from below:
+| Filename | Quant type | File Size | Description |
+| -------- | ---------- | --------- | ----------- |
+| [mistral-7b-v0.1-layla-v4-chatml-Q8_0.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q8_0.gguf) | Q8_0 | 7.69GB | Extremely high quality, generally unneeded but max available quant. |
+| [mistral-7b-v0.1-layla-v4-chatml-Q6_K.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q6_K.gguf) | Q6_K | 5.94GB | Very high quality, near perfect, *recommended*. |
+| [mistral-7b-v0.1-layla-v4-chatml-Q5_K_M.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q5_K_M.gguf) | Q5_K_M | 5.13GB | High quality, very usable. |
+| [mistral-7b-v0.1-layla-v4-chatml-Q5_K_S.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q5_K_S.gguf) | Q5_K_S | 4.99GB | High quality, very usable. |
+| [mistral-7b-v0.1-layla-v4-chatml-Q5_0.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q5_0.gguf) | Q5_0 | 4.99GB | High quality, older format, generally not recommended. |
+| [mistral-7b-v0.1-layla-v4-chatml-Q4_K_M.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q4_K_M.gguf) | Q4_K_M | 4.36GB | Good quality, similar to 4.25 bpw. |
+| [mistral-7b-v0.1-layla-v4-chatml-Q4_K_S.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q4_K_S.gguf) | Q4_K_S | 4.14GB | Slightly lower quality with small space savings. |
+| [mistral-7b-v0.1-layla-v4-chatml-Q4_0.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q4_0.gguf) | Q4_0 | 4.10GB | Decent quality, older format, generally not recommended. |
+| [mistral-7b-v0.1-layla-v4-chatml-Q3_K_L.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q3_K_L.gguf) | Q3_K_L | 3.82GB | Lower quality but usable, good for low RAM availability. |
+| [mistral-7b-v0.1-layla-v4-chatml-Q3_K_M.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q3_K_M.gguf) | Q3_K_M | 3.51GB | Even lower quality. |
+| [mistral-7b-v0.1-layla-v4-chatml-Q3_K_S.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q3_K_S.gguf) | Q3_K_S | 3.16GB | Low quality, not recommended. |
+| [mistral-7b-v0.1-layla-v4-chatml-Q2_K.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q2_K.gguf) | Q2_K | 2.71GB | Extremely low quality, *not* recommended.
+Want to support my work? Visit my ko-fi page here: https://ko-fi.com/bartowski

mistral-7b-v0.1-layla-v4-chatml-Q2_K.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:eb1f9ae7da4b1de76c8a5f782c8bb2554f672e777223f534bfc08a5531dff406
+size 2719252064

mistral-7b-v0.1-layla-v4-chatml-Q3_K_L.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:b89a0467e47380675509bf4f2086e8d5a4a34a2d51112792c5f00930cd03845c
+size 3822035360

mistral-7b-v0.1-layla-v4-chatml-Q3_K_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e79c1df90dc1ee3aef5a663322ec4810090b38b57dd0d14baa106441c047f59b
+size 3518996896

mistral-7b-v0.1-layla-v4-chatml-Q3_K_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:d68c76facd1ffdd8cab33017acc32dfe2509891329ca05af075cc0039acbd825
+size 3164578208

mistral-7b-v0.1-layla-v4-chatml-Q4_0.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:60e28544880c5552ce3f19c0e3a8ce62120f6d539f1b335fa6b9d38ae8b7961b
+size 4108928480

mistral-7b-v0.1-layla-v4-chatml-Q4_K_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:10f372adbd2bcefbf16f7b6db0bdf59e0f348eb5c47c06224981ba8c361540c9
+size 4368451040

mistral-7b-v0.1-layla-v4-chatml-Q4_K_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c11694868c11985e66f12e2b144700b9f8c32c11348278720b11cc137752bbfe
+size 4140385760

mistral-7b-v0.1-layla-v4-chatml-Q5_0.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:22694b4507715572976ccd777717fdf4a7d3dd918f186e1a12f5c644d28539e1
+size 4997728736

mistral-7b-v0.1-layla-v4-chatml-Q5_K_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:8e1a10f9377e39ed4cdb4f954974ab3d9d98e79f36170979fa62353c92d7eca0
+size 5131422176

mistral-7b-v0.1-layla-v4-chatml-Q5_K_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:21d9ca76bd47e94d8b931d6a4a6b6bc65e2d7527d2ebe01fa8d7e8e82486d71b
+size 4997728736

mistral-7b-v0.1-layla-v4-chatml-Q6_K.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:99ffb85064e97723d125df74384231eb31f948012c9581b4312e1ef5ed6b2be5
+size 5942079008

mistral-7b-v0.1-layla-v4-chatml-Q8_0.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a25b45791a4da50e5b3f7553aa47d3934208d70b070f94be69ea6704e9e96ad4
+size 7695875488