Llamacpp quants
Browse files- .gitattributes +12 -0
- README.md +41 -0
- Yi-9B-200K-Q2_K.gguf +3 -0
- Yi-9B-200K-Q3_K_L.gguf +3 -0
- Yi-9B-200K-Q3_K_M.gguf +3 -0
- Yi-9B-200K-Q3_K_S.gguf +3 -0
- Yi-9B-200K-Q4_0.gguf +3 -0
- Yi-9B-200K-Q4_K_M.gguf +3 -0
- Yi-9B-200K-Q4_K_S.gguf +3 -0
- Yi-9B-200K-Q5_0.gguf +3 -0
- Yi-9B-200K-Q5_K_M.gguf +3 -0
- Yi-9B-200K-Q5_K_S.gguf +3 -0
- Yi-9B-200K-Q6_K.gguf +3 -0
- Yi-9B-200K-Q8_0.gguf +3 -0
.gitattributes
CHANGED
@@ -33,3 +33,15 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
+
Yi-9B-200K-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
|
37 |
+
Yi-9B-200K-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
|
38 |
+
Yi-9B-200K-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
39 |
+
Yi-9B-200K-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
40 |
+
Yi-9B-200K-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
|
41 |
+
Yi-9B-200K-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
42 |
+
Yi-9B-200K-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
43 |
+
Yi-9B-200K-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
|
44 |
+
Yi-9B-200K-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
45 |
+
Yi-9B-200K-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
46 |
+
Yi-9B-200K-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
|
47 |
+
Yi-9B-200K-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
|
README.md
ADDED
@@ -0,0 +1,41 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: other
|
3 |
+
license_name: yi-license
|
4 |
+
license_link: LICENSE
|
5 |
+
widget:
|
6 |
+
- example_title: "Yi-34B-Chat"
|
7 |
+
text: "hi"
|
8 |
+
output:
|
9 |
+
text: " Hello! How can I assist you today?"
|
10 |
+
- example_title: "Yi-34B"
|
11 |
+
text: "There's a place where time stands still. A place of breath taking wonder, but also"
|
12 |
+
output:
|
13 |
+
text: " an eerie sense that something is just not right…\nBetween the two worlds lies The Forgotten Kingdom - home to creatures long since thought extinct and ancient magic so strong it defies belief! Only here can you find what has been lost for centuries: An Elixir Of Life which will restore youth and vitality if only those who seek its power are brave enough to face up against all manner of dangers lurking in this mysterious land! But beware; some say there may even exist powerful entities beyond our comprehension whose intentions towards humanity remain unclear at best ---- they might want nothing more than destruction itself rather then anything else from their quest after immortality (and maybe someone should tell them about modern medicine)? In any event though – one thing remains true regardless : whether or not success comes easy depends entirely upon how much effort we put into conquering whatever challenges lie ahead along with having faith deep down inside ourselves too ;) So let’s get started now shall We?"
|
14 |
+
pipeline_tag: text-generation
|
15 |
+
quantized_by: bartowski
|
16 |
+
---
|
17 |
+
|
18 |
+
## Llamacpp Quantizations of Yi-9B-200K
|
19 |
+
|
20 |
+
Using <a href="https://github.com/ggerganov/llama.cpp/">llama.cpp</a> release <a href="https://github.com/ggerganov/llama.cpp/releases/tag/b2440">b2440</a> for quantization.
|
21 |
+
|
22 |
+
Original model: https://huggingface.co/01-ai/Yi-9B-200K
|
23 |
+
|
24 |
+
Download a file (not the whole branch) from below:
|
25 |
+
|
26 |
+
| Filename | Quant type | File Size | Description |
|
27 |
+
| -------- | ---------- | --------- | ----------- |
|
28 |
+
| [Yi-9B-200K-Q8_0.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q8_0.gguf) | Q8_0 | 9.38GB | Extremely high quality, generally unneeded but max available quant. |
|
29 |
+
| [Yi-9B-200K-Q6_K.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q6_K.gguf) | Q6_K | 7.24GB | Very high quality, near perfect, *recommended*. |
|
30 |
+
| [Yi-9B-200K-Q5_K_M.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q5_K_M.gguf) | Q5_K_M | 6.25GB | High quality, very usable. |
|
31 |
+
| [Yi-9B-200K-Q5_K_S.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q5_K_S.gguf) | Q5_K_S | 6.10GB | High quality, very usable. |
|
32 |
+
| [Yi-9B-200K-Q5_0.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q5_0.gguf) | Q5_0 | 6.10GB | High quality, older format, generally not recommended. |
|
33 |
+
| [Yi-9B-200K-Q4_K_M.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q4_K_M.gguf) | Q4_K_M | 5.32GB | Good quality, similar to 4.25 bpw. |
|
34 |
+
| [Yi-9B-200K-Q4_K_S.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q4_K_S.gguf) | Q4_K_S | 5.07GB | Slightly lower quality with small space savings. |
|
35 |
+
| [Yi-9B-200K-Q4_0.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q4_0.gguf) | Q4_0 | 5.03GB | Decent quality, older format, generally not recommended. |
|
36 |
+
| [Yi-9B-200K-Q3_K_L.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q3_K_L.gguf) | Q3_K_L | 4.69GB | Lower quality but usable, good for low RAM availability. |
|
37 |
+
| [Yi-9B-200K-Q3_K_M.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q3_K_M.gguf) | Q3_K_M | 4.32GB | Even lower quality. |
|
38 |
+
| [Yi-9B-200K-Q3_K_S.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q3_K_S.gguf) | Q3_K_S | 3.89GB | Low quality, not recommended. |
|
39 |
+
| [Yi-9B-200K-Q2_K.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q2_K.gguf) | Q2_K | 3.35GB | Extremely low quality, *not* recommended.
|
40 |
+
|
41 |
+
Want to support my work? Visit my ko-fi page here: https://ko-fi.com/bartowski
|
Yi-9B-200K-Q2_K.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:90b1dd55e31646153071705bea62f2c6c9499b15cf5454e6bda16813769465a7
|
3 |
+
size 3354324768
|
Yi-9B-200K-Q3_K_L.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:6b12456d303a1b8a1910a0e2bf04584f372ea448bfd8ed306264d7967e7226eb
|
3 |
+
size 4690751264
|
Yi-9B-200K-Q3_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5b67544f789eaa3006e2e9ac59429fd5196a50bb8102a51f0c815c4735a267f5
|
3 |
+
size 4324405024
|
Yi-9B-200K-Q3_K_S.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:71c7ce6ae7d4beb6c0dc8712f06ff67eacc602821cf1e16c06b72097f76afc0b
|
3 |
+
size 3899207456
|
Yi-9B-200K-Q4_0.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:7926fa8a5bf89cd25ada86581e435fc49870ec40240850a6de334b6d99f671b8
|
3 |
+
size 5036994336
|
Yi-9B-200K-Q4_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9505d9cae76933f788c8271a8a608640fecf1bf1ee4819a9ffca2b9b04374a08
|
3 |
+
size 5328957216
|
Yi-9B-200K-Q4_K_S.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:8a748923eafa20bb701907e6e6208fe29766a9ba1fb6301cec6ebe2eda6c8384
|
3 |
+
size 5071859488
|
Yi-9B-200K-Q5_0.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:09814a06d1bea6f95d329b4b2b28a87566727bcda43a3e026afd38f68edb5778
|
3 |
+
size 6107852576
|
Yi-9B-200K-Q5_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f7d7bcef5e28483b5825ca03db213601317df5a5d7698497b487773a4e65f9b7
|
3 |
+
size 6258257696
|
Yi-9B-200K-Q5_K_S.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:0c350b46c98efa70e9b0f5b3caa3eb30d3193c72b2ce361800db95036dba2ca4
|
3 |
+
size 6107852576
|
Yi-9B-200K-Q6_K.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:16162007818fa8a7f0c381eddca3383e641268c6e0dc3453085772b82e7315dc
|
3 |
+
size 7245639456
|
Yi-9B-200K-Q8_0.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:6a00e8fda65a86e8d44cb0eae3f769c47192fb3cc5ce58c1c99b62f9a86363b3
|
3 |
+
size 9383915296
|