Xe Iaso
commited on
Commit
•
19b36de
1
Parent(s):
d720850
initial commit
Browse filesSigned-off-by: Xe Iaso <me@xeiaso.net>
- .gitattributes +1 -0
- README.md +66 -0
- config.json +3 -0
- yi-chat-6b.Q2_K.gguf +3 -0
- yi-chat-6b.Q3_K_L.gguf +3 -0
- yi-chat-6b.Q3_K_M.gguf +3 -0
- yi-chat-6b.Q3_K_S.gguf +3 -0
- yi-chat-6b.Q4_0.gguf +3 -0
- yi-chat-6b.Q4_K_M.gguf +3 -0
- yi-chat-6b.Q4_K_S.gguf +3 -0
- yi-chat-6b.Q5_0.gguf +3 -0
- yi-chat-6b.Q5_K_M.gguf +3 -0
- yi-chat-6b.Q5_K_S.gguf +3 -0
- yi-chat-6b.Q6_K.gguf +3 -0
- yi-chat-6b.Q8_0.gguf +3 -0
- yi-chat-6b.f16.gguf +3 -0
.gitattributes
CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
+
*.gguf filter=lfs diff=lfs merge=lfs -text
|
README.md
ADDED
@@ -0,0 +1,66 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
base_model: 01-ai/yi-chat-6b-Chat
|
3 |
+
inference: false
|
4 |
+
license: other
|
5 |
+
license_link: LICENSE
|
6 |
+
license_name: yi-license
|
7 |
+
model_creator: 01-ai
|
8 |
+
model_name: Yi 34B Chat
|
9 |
+
model_type: yi
|
10 |
+
pipeline_tag: text-generation
|
11 |
+
prompt_template: '<|im_start|>system
|
12 |
+
{system_message}<|im_end|>
|
13 |
+
<|im_start|>user
|
14 |
+
{prompt}<|im_end|>
|
15 |
+
<|im_start|>assistant
|
16 |
+
'
|
17 |
+
quantized_by: XeIaso
|
18 |
+
widget:
|
19 |
+
- example_title: yi-chat-6b-Chat
|
20 |
+
output:
|
21 |
+
text: 'Hello! How can I assist you today?'
|
22 |
+
text: hi
|
23 |
+
---
|
24 |
+
|
25 |
+
# Yi 6B Chat - GGUF
|
26 |
+
|
27 |
+
- Model creator: [01-ai](https://huggingface.co/01-ai)
|
28 |
+
- Original model: [Yi 6B
|
29 |
+
Chat](https://huggingface.co/01-ai/yi-chat-6b-Chat)
|
30 |
+
|
31 |
+
<!-- prompt-template start -->
|
32 |
+
## Prompt template: ChatML
|
33 |
+
|
34 |
+
```
|
35 |
+
<|im_start|>system
|
36 |
+
{system_message}<|im_end|>
|
37 |
+
<|im_start|>user
|
38 |
+
{prompt}<|im_end|>
|
39 |
+
<|im_start|>assistant
|
40 |
+
```
|
41 |
+
<!-- prompt-template end -->
|
42 |
+
|
43 |
+
<!-- README_GGUF.md-provided-files start -->
|
44 |
+
## Provided files
|
45 |
+
|
46 |
+
| Name | Quant method | Bits | Size | Max RAM required | Use case |
|
47 |
+
| ---- | ---- | ---- | ---- | ---- | ----- |
|
48 |
+
| [yi-chat-6b.Q2_K.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q2_K.gguf) | Q2_K | 2 | 2.62 GB| 5.12 GB | smallest, significant quality loss - not recommended for most purposes |
|
49 |
+
| [yi-chat-6b.Q3_K_S.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q3_K_S.gguf) | Q3_K_S | 3 | 2.71 GB| 5.21 GB | very small, high quality loss |
|
50 |
+
| [yi-chat-6b.Q3_K_M.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q3_K_M.gguf) | Q3_K_M | 3 | 2.99 GB| 5.49 GB | very small, high quality loss |
|
51 |
+
| [yi-chat-6b.Q3_K_L.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q3_K_L.gguf) | Q3_K_L | 3 | 3.24 GB| 5.74 GB | small, substantial quality loss |
|
52 |
+
| [yi-chat-6b.Q4_0.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q4_0.gguf) | Q4_0 | 4 | 3.48 GB| 5.98 GB | legacy; small, very high quality loss - prefer using Q3_K_M |
|
53 |
+
| [yi-chat-6b.Q4_K_S.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q4_K_S.gguf) | Q4_K_S | 4 | 3.50 GB| 6.00 GB | small, greater quality loss |
|
54 |
+
| [yi-chat-6b.Q4_K_M.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q4_K_M.gguf) | Q4_K_M | 4 | 3.67 GB| 6.17 GB | medium, balanced quality - recommended |
|
55 |
+
| [yi-chat-6b.Q5_0.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q5_0.gguf) | Q5_0 | 5 | 4.20 GB| 6.70 GB | legacy; medium, balanced quality - prefer using Q4_K_M |
|
56 |
+
| [yi-chat-6b.Q5_K_S.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q5_K_S.gguf) | Q5_K_S | 5 | 4.20 GB| 6.70 GB | large, low quality loss - recommended |
|
57 |
+
| [yi-chat-6b.Q5_K_M.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q5_K_M.gguf) | Q5_K_M | 5 | 4.30 GB| 6.80 GB | large, very low quality loss - recommended |
|
58 |
+
| [yi-chat-6b.Q6_K.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q6_K.gguf) | Q6_K | 6 | 4.97 GB| 7.47 GB | very large, extremely low quality loss |
|
59 |
+
| [yi-chat-6b.Q8_0.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q8_0.gguf) | Q8_0 | 8 | 6.44 GB| 8.94 GB | very large, extremely low quality loss - not recommended |
|
60 |
+
| [yi-chat-6b.f16.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.f16.gguf) | f16 | 16 | 12.2 GB | 14 GB | extremely large, minimal quality loss |
|
61 |
+
|
62 |
+
**Note**: the above RAM figures assume no GPU offloading. If layers are offloaded to the GPU, this will reduce RAM usage and use VRAM instead.
|
63 |
+
|
64 |
+
<!-- README_GGUF.md-provided-files end -->
|
65 |
+
|
66 |
+
If you want to support my efforts, check out my [Patreon](https://patreon.com/cadey).
|
config.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
{
|
2 |
+
"model_type": "yi"
|
3 |
+
}
|
yi-chat-6b.Q2_K.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:abac18b4508ea281884ed547d0e20edd983c663d536598143b36c05ea58a52fd
|
3 |
+
size 2621230656
|
yi-chat-6b.Q3_K_L.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:60055831e231d6fbc767340d487edcc64196820824ce678ea6e7f760cab96487
|
3 |
+
size 3236892224
|
yi-chat-6b.Q3_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5b84a6d51fbf4ed60889c65cbac646998f54c865e954d0e40ed8d8fb45a37eeb
|
3 |
+
size 2992836160
|
yi-chat-6b.Q3_K_S.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:df7c60d3020b734160021493cf449f07455b8d651441fd1fb1346e6a5240b525
|
3 |
+
size 2709196352
|
yi-chat-6b.Q4_0.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9a1a647d93f0686687bc776e37331fec00b2b0236584a95a9c8d0db6a3784ea3
|
3 |
+
size 3479326272
|
yi-chat-6b.Q4_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:704cded3bd2af243be46a6f0bd904c8d5418804186bcfc8edbaea912ddd6d06f
|
3 |
+
size 3673968192
|
yi-chat-6b.Q4_K_S.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:36cf0d946abbacd8a7d563be1165e8b4c80a41c4b8ce6d213042e697f3b7ad5a
|
3 |
+
size 3502919232
|
yi-chat-6b.Q5_0.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:de1a19c734c0ef393ad2d55377c042c91195ef240eb7e8150241e21bfaf4b269
|
3 |
+
size 4204154432
|
yi-chat-6b.Q5_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:844ae9b449d4a1cdb985ae87b77e22cbfc6ec09d632bc33e57aada364fff5036
|
3 |
+
size 4304424512
|
yi-chat-6b.Q5_K_S.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:0707bf14e073092545973fdfd8c2de1eafee2d1e0c662103185596f03eb06b16
|
3 |
+
size 4204154432
|
yi-chat-6b.Q6_K.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:098a0e5b819cfef7ed9ff46cc1d6dcbf9ad7e0dcd610206b0d14b993d50179ab
|
3 |
+
size 4974284352
|
yi-chat-6b.Q8_0.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:e98959bfa3bddc221eb45ba9d870174844e7de9e9b8f05e0b953e100e5ebe343
|
3 |
+
size 6442126912
|
yi-chat-6b.f16.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:44f7c77c1b3d5f71721cb918add07b8be3074f6402acbebf62595682b8c50c77
|
3 |
+
size 12124098080
|