bartowski committed on
Commit
7f22514
1 Parent(s): 1dd5f4a

Llamacpp quants

.gitattributes CHANGED
@@ -33,3 +33,15 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
  *.zip filter=lfs diff=lfs merge=lfs -text
  *.zst filter=lfs diff=lfs merge=lfs -text
  *tfevents* filter=lfs diff=lfs merge=lfs -text
+ mistral-7b-v0.1-layla-v4-chatml-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
+ mistral-7b-v0.1-layla-v4-chatml-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
+ mistral-7b-v0.1-layla-v4-chatml-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+ mistral-7b-v0.1-layla-v4-chatml-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+ mistral-7b-v0.1-layla-v4-chatml-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
+ mistral-7b-v0.1-layla-v4-chatml-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+ mistral-7b-v0.1-layla-v4-chatml-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+ mistral-7b-v0.1-layla-v4-chatml-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
+ mistral-7b-v0.1-layla-v4-chatml-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+ mistral-7b-v0.1-layla-v4-chatml-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+ mistral-7b-v0.1-layla-v4-chatml-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
+ mistral-7b-v0.1-layla-v4-chatml-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,30 @@
+ ---
+ license: apache-2.0
+ quantized_by: bartowski
+ pipeline_tag: text-generation
+ ---
+
+ ## Llamacpp Quantizations of mistral-7b-v0.1-layla-v4-chatml
+
+ Using <a href="https://github.com/ggerganov/llama.cpp/">llama.cpp</a> release <a href="https://github.com/ggerganov/llama.cpp/releases/tag/b2405">b2405</a> for quantization.
+
+ Original model: https://huggingface.co/l3utterfly/mistral-7b-v0.1-layla-v4-chatml
+
+ Download a file (not the whole branch) from below:
+
+ | Filename | Quant type | File Size | Description |
+ | -------- | ---------- | --------- | ----------- |
+ | [mistral-7b-v0.1-layla-v4-chatml-Q8_0.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q8_0.gguf) | Q8_0 | 7.69GB | Extremely high quality, generally unneeded but max available quant. |
+ | [mistral-7b-v0.1-layla-v4-chatml-Q6_K.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q6_K.gguf) | Q6_K | 5.94GB | Very high quality, near perfect, *recommended*. |
+ | [mistral-7b-v0.1-layla-v4-chatml-Q5_K_M.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q5_K_M.gguf) | Q5_K_M | 5.13GB | High quality, very usable. |
+ | [mistral-7b-v0.1-layla-v4-chatml-Q5_K_S.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q5_K_S.gguf) | Q5_K_S | 4.99GB | High quality, very usable. |
+ | [mistral-7b-v0.1-layla-v4-chatml-Q5_0.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q5_0.gguf) | Q5_0 | 4.99GB | High quality, older format, generally not recommended. |
+ | [mistral-7b-v0.1-layla-v4-chatml-Q4_K_M.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q4_K_M.gguf) | Q4_K_M | 4.36GB | Good quality, similar to 4.25 bpw. |
+ | [mistral-7b-v0.1-layla-v4-chatml-Q4_K_S.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q4_K_S.gguf) | Q4_K_S | 4.14GB | Slightly lower quality with small space savings. |
+ | [mistral-7b-v0.1-layla-v4-chatml-Q4_0.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q4_0.gguf) | Q4_0 | 4.10GB | Decent quality, older format, generally not recommended. |
+ | [mistral-7b-v0.1-layla-v4-chatml-Q3_K_L.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q3_K_L.gguf) | Q3_K_L | 3.82GB | Lower quality but usable, good for low RAM availability. |
+ | [mistral-7b-v0.1-layla-v4-chatml-Q3_K_M.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q3_K_M.gguf) | Q3_K_M | 3.51GB | Even lower quality. |
+ | [mistral-7b-v0.1-layla-v4-chatml-Q3_K_S.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q3_K_S.gguf) | Q3_K_S | 3.16GB | Low quality, not recommended. |
+ | [mistral-7b-v0.1-layla-v4-chatml-Q2_K.gguf](https://huggingface.co/bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF/blob/main/mistral-7b-v0.1-layla-v4-chatml-Q2_K.gguf) | Q2_K | 2.71GB | Extremely low quality, *not* recommended. |
+
+ Want to support my work? Visit my ko-fi page here: https://ko-fi.com/bartowski
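The README asks readers to download a single file rather than the whole branch. A minimal sketch of one way to do that with only the Python standard library follows. It assumes the Hub's `resolve` endpoint as the direct-download URL (the table links use `blob`, which opens the file page instead); the helper names `resolve_url` and `download_quant` are hypothetical and not part of this commit.

```python
from typing import Optional
from urllib.parse import quote
from urllib.request import urlretrieve

# Repository named in the README table links.
REPO_ID = "bartowski/mistral-7b-v0.1-layla-v4-chatml-GGUF"


def resolve_url(filename: str, repo_id: str = REPO_ID, revision: str = "main") -> str:
    """Build a direct-download URL for one file (assumes the /resolve/ endpoint)."""
    return f"https://huggingface.co/{repo_id}/resolve/{revision}/{quote(filename)}"


def download_quant(filename: str, dest: Optional[str] = None) -> str:
    """Download a single quant file and return the local path it was saved to."""
    path, _headers = urlretrieve(resolve_url(filename), dest or filename)
    return path


if __name__ == "__main__":
    # Only prints the URL; call download_quant(...) to fetch the multi-GB file.
    print(resolve_url("mistral-7b-v0.1-layla-v4-chatml-Q4_K_M.gguf"))
```

Calling `download_quant("mistral-7b-v0.1-layla-v4-chatml-Q4_K_M.gguf")` would fetch only that quant (about 4.36GB per the table) into the current directory.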
mistral-7b-v0.1-layla-v4-chatml-Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:eb1f9ae7da4b1de76c8a5f782c8bb2554f672e777223f534bfc08a5531dff406
+ size 2719252064
mistral-7b-v0.1-layla-v4-chatml-Q3_K_L.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:b89a0467e47380675509bf4f2086e8d5a4a34a2d51112792c5f00930cd03845c
+ size 3822035360
mistral-7b-v0.1-layla-v4-chatml-Q3_K_M.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:e79c1df90dc1ee3aef5a663322ec4810090b38b57dd0d14baa106441c047f59b
+ size 3518996896
mistral-7b-v0.1-layla-v4-chatml-Q3_K_S.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:d68c76facd1ffdd8cab33017acc32dfe2509891329ca05af075cc0039acbd825
+ size 3164578208
mistral-7b-v0.1-layla-v4-chatml-Q4_0.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:60e28544880c5552ce3f19c0e3a8ce62120f6d539f1b335fa6b9d38ae8b7961b
+ size 4108928480
mistral-7b-v0.1-layla-v4-chatml-Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:10f372adbd2bcefbf16f7b6db0bdf59e0f348eb5c47c06224981ba8c361540c9
+ size 4368451040
mistral-7b-v0.1-layla-v4-chatml-Q4_K_S.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:c11694868c11985e66f12e2b144700b9f8c32c11348278720b11cc137752bbfe
+ size 4140385760
mistral-7b-v0.1-layla-v4-chatml-Q5_0.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:22694b4507715572976ccd777717fdf4a7d3dd918f186e1a12f5c644d28539e1
+ size 4997728736
mistral-7b-v0.1-layla-v4-chatml-Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:8e1a10f9377e39ed4cdb4f954974ab3d9d98e79f36170979fa62353c92d7eca0
+ size 5131422176
mistral-7b-v0.1-layla-v4-chatml-Q5_K_S.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:21d9ca76bd47e94d8b931d6a4a6b6bc65e2d7527d2ebe01fa8d7e8e82486d71b
+ size 4997728736
mistral-7b-v0.1-layla-v4-chatml-Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:99ffb85064e97723d125df74384231eb31f948012c9581b4312e1ef5ed6b2be5
+ size 5942079008
mistral-7b-v0.1-layla-v4-chatml-Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:a25b45791a4da50e5b3f7553aa47d3934208d70b070f94be69ea6704e9e96ad4
+ size 7695875488
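Each `.gguf` entry above is a Git LFS pointer: three `key value` lines giving the spec version, a SHA-256 object id, and the size in bytes. As a rough sketch (not something this commit ships), a downloaded file could be checked against its pointer as shown below. The function names `parse_lfs_pointer` and `verify_file` are hypothetical; the pointer text is copied from the Q2_K entry.

```python
import hashlib
from pathlib import Path

# Pointer text copied from the Q2_K entry in this commit.
POINTER_Q2_K = """\
version https://git-lfs.github.com/spec/v1
oid sha256:eb1f9ae7da4b1de76c8a5f782c8bb2554f672e777223f534bfc08a5531dff406
size 2719252064
"""


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def verify_file(path, pointer: dict) -> bool:
    """Return True if the file's byte size and hash both match the pointer."""
    p = Path(path)
    if p.stat().st_size != pointer["size"]:
        return False  # cheap check first; skips hashing a truncated download
    h = hashlib.new(pointer["algo"])
    with p.open("rb") as f:
        # Hash in 1 MiB chunks so multi-GB files are never loaded whole.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    print(parse_lfs_pointer(POINTER_Q2_K))
```

The `size` field is in bytes: 2719252064 bytes is roughly 2.72 GB in decimal units, which the README table lists as 2.71GB.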