Xe Iaso commited on
Commit
19b36de
1 Parent(s): d720850

initial commit

Browse files

Signed-off-by: Xe Iaso <me@xeiaso.net>

.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ *.gguf filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,66 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: 01-ai/yi-chat-6b-Chat
3
+ inference: false
4
+ license: other
5
+ license_link: LICENSE
6
+ license_name: yi-license
7
+ model_creator: 01-ai
8
+ model_name: Yi 34B Chat
9
+ model_type: yi
10
+ pipeline_tag: text-generation
11
+ prompt_template: '<|im_start|>system
12
+ {system_message}<|im_end|>
13
+ <|im_start|>user
14
+ {prompt}<|im_end|>
15
+ <|im_start|>assistant
16
+ '
17
+ quantized_by: XeIaso
18
+ widget:
19
+ - example_title: yi-chat-6b-Chat
20
+ output:
21
+ text: 'Hello! How can I assist you today?'
22
+ text: hi
23
+ ---
24
+
25
+ # Yi 6B Chat - GGUF
26
+
27
+ - Model creator: [01-ai](https://huggingface.co/01-ai)
28
+ - Original model: [Yi 6B
29
+ Chat](https://huggingface.co/01-ai/yi-chat-6b-Chat)
30
+
31
+ <!-- prompt-template start -->
32
+ ## Prompt template: ChatML
33
+
34
+ ```
35
+ <|im_start|>system
36
+ {system_message}<|im_end|>
37
+ <|im_start|>user
38
+ {prompt}<|im_end|>
39
+ <|im_start|>assistant
40
+ ```
41
+ <!-- prompt-template end -->
42
+
43
+ <!-- README_GGUF.md-provided-files start -->
44
+ ## Provided files
45
+
46
+ | Name | Quant method | Bits | Size | Max RAM required | Use case |
47
+ | ---- | ---- | ---- | ---- | ---- | ----- |
48
+ | [yi-chat-6b.Q2_K.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q2_K.gguf) | Q2_K | 2 | 2.62 GB| 5.12 GB | smallest, significant quality loss - not recommended for most purposes |
49
+ | [yi-chat-6b.Q3_K_S.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q3_K_S.gguf) | Q3_K_S | 3 | 2.71 GB| 5.21 GB | very small, high quality loss |
50
+ | [yi-chat-6b.Q3_K_M.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q3_K_M.gguf) | Q3_K_M | 3 | 2.99 GB| 5.49 GB | very small, high quality loss |
51
+ | [yi-chat-6b.Q3_K_L.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q3_K_L.gguf) | Q3_K_L | 3 | 3.24 GB| 5.74 GB | small, substantial quality loss |
52
+ | [yi-chat-6b.Q4_0.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q4_0.gguf) | Q4_0 | 4 | 3.48 GB| 5.98 GB | legacy; small, very high quality loss - prefer using Q3_K_M |
53
+ | [yi-chat-6b.Q4_K_S.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q4_K_S.gguf) | Q4_K_S | 4 | 3.50 GB| 6.00 GB | small, greater quality loss |
54
+ | [yi-chat-6b.Q4_K_M.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q4_K_M.gguf) | Q4_K_M | 4 | 3.67 GB| 6.17 GB | medium, balanced quality - recommended |
55
+ | [yi-chat-6b.Q5_0.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q5_0.gguf) | Q5_0 | 5 | 4.20 GB| 6.70 GB | legacy; medium, balanced quality - prefer using Q4_K_M |
56
+ | [yi-chat-6b.Q5_K_S.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q5_K_S.gguf) | Q5_K_S | 5 | 4.20 GB| 6.70 GB | large, low quality loss - recommended |
57
+ | [yi-chat-6b.Q5_K_M.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q5_K_M.gguf) | Q5_K_M | 5 | 4.30 GB| 6.80 GB | large, very low quality loss - recommended |
58
+ | [yi-chat-6b.Q6_K.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q6_K.gguf) | Q6_K | 6 | 4.97 GB| 7.47 GB | very large, extremely low quality loss |
59
+ | [yi-chat-6b.Q8_0.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q8_0.gguf) | Q8_0 | 8 | 6.44 GB| 8.94 GB | very large, extremely low quality loss - not recommended |
60
+ | [yi-chat-6b.f16.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.f16.gguf) | f16 | 16 | 12.2 GB | 14 GB | extremely large, minimal quality loss |
61
+
62
+ **Note**: the above RAM figures assume no GPU offloading. If layers are offloaded to the GPU, this will reduce RAM usage and use VRAM instead.
63
+
64
+ <!-- README_GGUF.md-provided-files end -->
65
+
66
+ If you want to support my efforts, check out my [Patreon](https://patreon.com/cadey).
config.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ {
2
+ "model_type": "yi"
3
+ }
yi-chat-6b.Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:abac18b4508ea281884ed547d0e20edd983c663d536598143b36c05ea58a52fd
3
+ size 2621230656
yi-chat-6b.Q3_K_L.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:60055831e231d6fbc767340d487edcc64196820824ce678ea6e7f760cab96487
3
+ size 3236892224
yi-chat-6b.Q3_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5b84a6d51fbf4ed60889c65cbac646998f54c865e954d0e40ed8d8fb45a37eeb
3
+ size 2992836160
yi-chat-6b.Q3_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:df7c60d3020b734160021493cf449f07455b8d651441fd1fb1346e6a5240b525
3
+ size 2709196352
yi-chat-6b.Q4_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9a1a647d93f0686687bc776e37331fec00b2b0236584a95a9c8d0db6a3784ea3
3
+ size 3479326272
yi-chat-6b.Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:704cded3bd2af243be46a6f0bd904c8d5418804186bcfc8edbaea912ddd6d06f
3
+ size 3673968192
yi-chat-6b.Q4_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:36cf0d946abbacd8a7d563be1165e8b4c80a41c4b8ce6d213042e697f3b7ad5a
3
+ size 3502919232
yi-chat-6b.Q5_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:de1a19c734c0ef393ad2d55377c042c91195ef240eb7e8150241e21bfaf4b269
3
+ size 4204154432
yi-chat-6b.Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:844ae9b449d4a1cdb985ae87b77e22cbfc6ec09d632bc33e57aada364fff5036
3
+ size 4304424512
yi-chat-6b.Q5_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0707bf14e073092545973fdfd8c2de1eafee2d1e0c662103185596f03eb06b16
3
+ size 4204154432
yi-chat-6b.Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:098a0e5b819cfef7ed9ff46cc1d6dcbf9ad7e0dcd610206b0d14b993d50179ab
3
+ size 4974284352
yi-chat-6b.Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e98959bfa3bddc221eb45ba9d870174844e7de9e9b8f05e0b953e100e5ebe343
3
+ size 6442126912
yi-chat-6b.f16.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:44f7c77c1b3d5f71721cb918add07b8be3074f6402acbebf62595682b8c50c77
3
+ size 12124098080