Llamacpp quants
- .gitattributes +12 -0
- README.md +30 -0
- mistral-7b-v0.1-layla-v4-chatml-Q2_K.gguf +3 -0
- mistral-7b-v0.1-layla-v4-chatml-Q3_K_L.gguf +3 -0
- mistral-7b-v0.1-layla-v4-chatml-Q3_K_M.gguf +3 -0
- mistral-7b-v0.1-layla-v4-chatml-Q3_K_S.gguf +3 -0
- mistral-7b-v0.1-layla-v4-chatml-Q4_0.gguf +3 -0
- mistral-7b-v0.1-layla-v4-chatml-Q4_K_M.gguf +3 -0
- mistral-7b-v0.1-layla-v4-chatml-Q4_K_S.gguf +3 -0
- mistral-7b-v0.1-layla-v4-chatml-Q5_0.gguf +3 -0
- mistral-7b-v0.1-layla-v4-chatml-Q5_K_M.gguf +3 -0
- mistral-7b-v0.1-layla-v4-chatml-Q5_K_S.gguf +3 -0
- mistral-7b-v0.1-layla-v4-chatml-Q6_K.gguf +3 -0
- mistral-7b-v0.1-layla-v4-chatml-Q8_0.gguf +3 -0
.gitattributes
CHANGED
@@ -33,3 +33,15 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
+mistral-7b-v0.1-layla-v4-chatml-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
README.md
ADDED
@@ -0,0 +1,30 @@
+---
+license: apache-2.0
+quantized_by: bartowski
+pipeline_tag: text-generation
+---
+
+## Llamacpp Quantizations of mistral-7b-v0.1-layla-v4-chatml
+
+Using <a href="https://github.com/ggerganov/llama.cpp/">llama.cpp</a> release <a href="https://github.com/ggerganov/llama.cpp/releases/tag/b2405">b2405</a> for quantization.
+
+Original model: https://huggingface.co/l3utterfly/mistral-7b-v0.1-layla-v4-chatml
+
+Download a file (not the whole branch) from below:
+
+| Filename | Quant type | File Size | Description |
+| -------- | ---------- | --------- | ----------- |
+| [mistral-7b-v0.1-layla-v4-chatml-Q8_0.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q8_0.gguf) | Q8_0 | 7.69GB | Extremely high quality, generally unneeded but max available quant. |
+| [mistral-7b-v0.1-layla-v4-chatml-Q6_K.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q6_K.gguf) | Q6_K | 5.94GB | Very high quality, near perfect, *recommended*. |
+| [mistral-7b-v0.1-layla-v4-chatml-Q5_K_M.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q5_K_M.gguf) | Q5_K_M | 5.13GB | High quality, very usable. |
+| [mistral-7b-v0.1-layla-v4-chatml-Q5_K_S.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q5_K_S.gguf) | Q5_K_S | 4.99GB | High quality, very usable. |
+| [mistral-7b-v0.1-layla-v4-chatml-Q5_0.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q5_0.gguf) | Q5_0 | 4.99GB | High quality, older format, generally not recommended. |
+| [mistral-7b-v0.1-layla-v4-chatml-Q4_K_M.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q4_K_M.gguf) | Q4_K_M | 4.36GB | Good quality, similar to 4.25 bpw. |
+| [mistral-7b-v0.1-layla-v4-chatml-Q4_K_S.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q4_K_S.gguf) | Q4_K_S | 4.14GB | Slightly lower quality with small space savings. |
+| [mistral-7b-v0.1-layla-v4-chatml-Q4_0.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q4_0.gguf) | Q4_0 | 4.10GB | Decent quality, older format, generally not recommended. |
+| [mistral-7b-v0.1-layla-v4-chatml-Q3_K_L.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q3_K_L.gguf) | Q3_K_L | 3.82GB | Lower quality but usable, good for low RAM availability. |
+| [mistral-7b-v0.1-layla-v4-chatml-Q3_K_M.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q3_K_M.gguf) | Q3_K_M | 3.51GB | Even lower quality. |
+| [mistral-7b-v0.1-layla-v4-chatml-Q3_K_S.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q3_K_S.gguf) | Q3_K_S | 3.16GB | Low quality, not recommended. |
+| [mistral-7b-v0.1-layla-v4-chatml-Q2_K.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q2_K.gguf) | Q2_K | 2.71GB | Extremely low quality, *not* recommended.
+
+Want to support my work? Visit my ko-fi page here: https://ko-fi.com/bartowski
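The README asks readers to download a single file rather than the whole branch, and describes the smaller quants as suited to low RAM. A minimal sketch of that choice follows, assuming the byte sizes recorded in this commit's Git LFS pointers and the Hugging Face convention of serving raw files under `/resolve/<revision>/` (the README links use `/blob/`, which opens the file page instead). The names `QUANT_SIZES`, `download_url`, and `largest_quant_within` are hypothetical helpers, not part of the repository. File size is only a lower bound on memory use; context and runtime overhead are not modelled here.

```python
from typing import Optional

# Sizes in bytes, copied from the Git LFS pointers added in this commit.
QUANT_SIZES = {
    "Q8_0": 7695875488,
    "Q6_K": 5942079008,
    "Q5_K_M": 5131422176,
    "Q5_K_S": 4997728736,
    "Q5_0": 4997728736,
    "Q4_K_M": 4368451040,
    "Q4_K_S": 4140385760,
    "Q4_0": 4108928480,
    "Q3_K_L": 3822035360,
    "Q3_K_M": 3518996896,
    "Q3_K_S": 3164578208,
    "Q2_K": 2719252064,
}

REPO = "bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF"
MODEL = "mistral-7b-v0.1-layla-v4-chatml"


def download_url(quant: str, revision: str = "main") -> str:
    """Build the direct-download URL for one quant file (assumed /resolve/ form)."""
    if quant not in QUANT_SIZES:
        raise KeyError(f"unknown quant type: {quant}")
    return f"https://huggingface.co/{REPO}/resolve/{revision}/{MODEL}-{quant}.gguf"


def largest_quant_within(budget_bytes: int) -> Optional[str]:
    """Return the largest quant whose file size fits the budget, or None."""
    fitting = {q: s for q, s in QUANT_SIZES.items() if s <= budget_bytes}
    if not fitting:
        return None
    # On a size tie (Q5_K_S and Q5_0), max() keeps the first entry in the dict.
    return max(fitting, key=fitting.get)
```

For example, `largest_quant_within(5_000_000_000)` picks `Q5_K_S`, and `download_url("Q5_K_S")` gives the URL to fetch just that file.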
mistral-7b-v0.1-layla-v4-chatml-Q2_K.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:eb1f9ae7da4b1de76c8a5f782c8bb2554f672e777223f534bfc08a5531dff406
+size 2719252064
mistral-7b-v0.1-layla-v4-chatml-Q3_K_L.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b89a0467e47380675509bf4f2086e8d5a4a34a2d51112792c5f00930cd03845c
+size 3822035360
mistral-7b-v0.1-layla-v4-chatml-Q3_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e79c1df90dc1ee3aef5a663322ec4810090b38b57dd0d14baa106441c047f59b
+size 3518996896
mistral-7b-v0.1-layla-v4-chatml-Q3_K_S.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:d68c76facd1ffdd8cab33017acc32dfe2509891329ca05af075cc0039acbd825
+size 3164578208
mistral-7b-v0.1-layla-v4-chatml-Q4_0.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:60e28544880c5552ce3f19c0e3a8ce62120f6d539f1b335fa6b9d38ae8b7961b
+size 4108928480
mistral-7b-v0.1-layla-v4-chatml-Q4_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:10f372adbd2bcefbf16f7b6db0bdf59e0f348eb5c47c06224981ba8c361540c9
+size 4368451040
mistral-7b-v0.1-layla-v4-chatml-Q4_K_S.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c11694868c11985e66f12e2b144700b9f8c32c11348278720b11cc137752bbfe
+size 4140385760
mistral-7b-v0.1-layla-v4-chatml-Q5_0.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:22694b4507715572976ccd777717fdf4a7d3dd918f186e1a12f5c644d28539e1
+size 4997728736
mistral-7b-v0.1-layla-v4-chatml-Q5_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8e1a10f9377e39ed4cdb4f954974ab3d9d98e79f36170979fa62353c92d7eca0
+size 5131422176
mistral-7b-v0.1-layla-v4-chatml-Q5_K_S.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:21d9ca76bd47e94d8b931d6a4a6b6bc65e2d7527d2ebe01fa8d7e8e82486d71b
+size 4997728736
mistral-7b-v0.1-layla-v4-chatml-Q6_K.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:99ffb85064e97723d125df74384231eb31f948012c9581b4312e1ef5ed6b2be5
+size 5942079008
mistral-7b-v0.1-layla-v4-chatml-Q8_0.gguf
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:a25b45791a4da50e5b3f7553aa47d3934208d70b070f94be69ea6704e9e96ad4
+size 7695875488
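Each `.gguf` entry in this commit is a Git LFS pointer: three `key value` lines giving the spec version, a `sha256` object id, and the size in bytes. A downloaded file can be checked against its pointer, as in the sketch below. It uses only the Python standard library; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, and the parser assumes the simple three-line pointer layout shown in this diff.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    # The oid has the form '<algorithm>:<hex digest>', e.g. 'sha256:eb1f...'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Check a local file's size and hash against a parsed pointer."""
    path = Path(path)
    # Cheap check first: a size mismatch avoids hashing a multi-GB file.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Read in chunks so the whole file never has to sit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]
```

Usage would look like `matches_pointer("mistral-7b-v0.1-layla-v4-chatml-Q2_K.gguf", parse_lfs_pointer(pointer_text))`, where `pointer_text` holds the three pointer lines for that file.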