bartowski commited on
Commit
5b750d1
1 Parent(s): 0cab50b

Llamacpp quants

Browse files
.gitattributes CHANGED
@@ -33,3 +33,15 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
 
 
 
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ Yi-9B-200K-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
37
+ Yi-9B-200K-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
38
+ Yi-9B-200K-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
39
+ Yi-9B-200K-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
40
+ Yi-9B-200K-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
41
+ Yi-9B-200K-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
42
+ Yi-9B-200K-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
43
+ Yi-9B-200K-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
44
+ Yi-9B-200K-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
45
+ Yi-9B-200K-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
46
+ Yi-9B-200K-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
47
+ Yi-9B-200K-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,41 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ license_name: yi-license
4
+ license_link: LICENSE
5
+ widget:
6
+ - example_title: "Yi-34B-Chat"
7
+ text: "hi"
8
+ output:
9
+ text: " Hello! How can I assist you today?"
10
+ - example_title: "Yi-34B"
11
+ text: "There's a place where time stands still. A place of breath taking wonder, but also"
12
+ output:
13
+ text: " an eerie sense that something is just not right…\nBetween the two worlds lies The Forgotten Kingdom - home to creatures long since thought extinct and ancient magic so strong it defies belief! Only here can you find what has been lost for centuries: An Elixir Of Life which will restore youth and vitality if only those who seek its power are brave enough to face up against all manner of dangers lurking in this mysterious land! But beware; some say there may even exist powerful entities beyond our comprehension whose intentions towards humanity remain unclear at best ---- they might want nothing more than destruction itself rather then anything else from their quest after immortality (and maybe someone should tell them about modern medicine)? In any event though – one thing remains true regardless : whether or not success comes easy depends entirely upon how much effort we put into conquering whatever challenges lie ahead along with having faith deep down inside ourselves too ;) So let’s get started now shall We?"
14
+ pipeline_tag: text-generation
15
+ quantized_by: bartowski
16
+ ---
17
+
18
+ ## Llamacpp Quantizations of Yi-9B-200K
19
+
20
+ Using <a href="https://github.com/ggerganov/llama.cpp/">llama.cpp</a> release <a href="https://github.com/ggerganov/llama.cpp/releases/tag/b2440">b2440</a> for quantization.
21
+
22
+ Original model: https://huggingface.co/01-ai/Yi-9B-200K
23
+
24
+ Download a file (not the whole branch) from below:
25
+
26
+ | Filename | Quant type | File Size | Description |
27
+ | -------- | ---------- | --------- | ----------- |
28
+ | [Yi-9B-200K-Q8_0.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q8_0.gguf) | Q8_0 | 9.38GB | Extremely high quality, generally unneeded but max available quant. |
29
+ | [Yi-9B-200K-Q6_K.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q6_K.gguf) | Q6_K | 7.24GB | Very high quality, near perfect, *recommended*. |
30
+ | [Yi-9B-200K-Q5_K_M.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q5_K_M.gguf) | Q5_K_M | 6.25GB | High quality, very usable. |
31
+ | [Yi-9B-200K-Q5_K_S.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q5_K_S.gguf) | Q5_K_S | 6.10GB | High quality, very usable. |
32
+ | [Yi-9B-200K-Q5_0.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q5_0.gguf) | Q5_0 | 6.10GB | High quality, older format, generally not recommended. |
33
+ | [Yi-9B-200K-Q4_K_M.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q4_K_M.gguf) | Q4_K_M | 5.32GB | Good quality, similar to 4.25 bpw. |
34
+ | [Yi-9B-200K-Q4_K_S.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q4_K_S.gguf) | Q4_K_S | 5.07GB | Slightly lower quality with small space savings. |
35
+ | [Yi-9B-200K-Q4_0.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q4_0.gguf) | Q4_0 | 5.03GB | Decent quality, older format, generally not recommended. |
36
+ | [Yi-9B-200K-Q3_K_L.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q3_K_L.gguf) | Q3_K_L | 4.69GB | Lower quality but usable, good for low RAM availability. |
37
+ | [Yi-9B-200K-Q3_K_M.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q3_K_M.gguf) | Q3_K_M | 4.32GB | Even lower quality. |
38
+ | [Yi-9B-200K-Q3_K_S.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q3_K_S.gguf) | Q3_K_S | 3.89GB | Low quality, not recommended. |
39
+ | [Yi-9B-200K-Q2_K.gguf](https://huggingface.co/bartowski/Yi-9B-200K-GGUF/blob/main/Yi-9B-200K-Q2_K.gguf) | Q2_K | 3.35GB | Extremely low quality, *not* recommended.
40
+
41
+ Want to support my work? Visit my ko-fi page here: https://ko-fi.com/bartowski
Yi-9B-200K-Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:90b1dd55e31646153071705bea62f2c6c9499b15cf5454e6bda16813769465a7
3
+ size 3354324768
Yi-9B-200K-Q3_K_L.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6b12456d303a1b8a1910a0e2bf04584f372ea448bfd8ed306264d7967e7226eb
3
+ size 4690751264
Yi-9B-200K-Q3_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5b67544f789eaa3006e2e9ac59429fd5196a50bb8102a51f0c815c4735a267f5
3
+ size 4324405024
Yi-9B-200K-Q3_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:71c7ce6ae7d4beb6c0dc8712f06ff67eacc602821cf1e16c06b72097f76afc0b
3
+ size 3899207456
Yi-9B-200K-Q4_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7926fa8a5bf89cd25ada86581e435fc49870ec40240850a6de334b6d99f671b8
3
+ size 5036994336
Yi-9B-200K-Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9505d9cae76933f788c8271a8a608640fecf1bf1ee4819a9ffca2b9b04374a08
3
+ size 5328957216
Yi-9B-200K-Q4_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8a748923eafa20bb701907e6e6208fe29766a9ba1fb6301cec6ebe2eda6c8384
3
+ size 5071859488
Yi-9B-200K-Q5_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:09814a06d1bea6f95d329b4b2b28a87566727bcda43a3e026afd38f68edb5778
3
+ size 6107852576
Yi-9B-200K-Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f7d7bcef5e28483b5825ca03db213601317df5a5d7698497b487773a4e65f9b7
3
+ size 6258257696
Yi-9B-200K-Q5_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0c350b46c98efa70e9b0f5b3caa3eb30d3193c72b2ce361800db95036dba2ca4
3
+ size 6107852576
Yi-9B-200K-Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:16162007818fa8a7f0c381eddca3383e641268c6e0dc3453085772b82e7315dc
3
+ size 7245639456
Yi-9B-200K-Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6a00e8fda65a86e8d44cb0eae3f769c47192fb3cc5ce58c1c99b62f9a86363b3
3
+ size 9383915296