Xe Iaso commited on Nov 24, 2023

Commit

19b36de

•

1 Parent(s): d720850

initial commit

Browse files

Signed-off-by: Xe Iaso <me@xeiaso.net>

Files changed (16) hide show

.gitattributes +1 -0
README.md +66 -0
config.json +3 -0
yi-chat-6b.Q2_K.gguf +3 -0
yi-chat-6b.Q3_K_L.gguf +3 -0
yi-chat-6b.Q3_K_M.gguf +3 -0
yi-chat-6b.Q3_K_S.gguf +3 -0
yi-chat-6b.Q4_0.gguf +3 -0
yi-chat-6b.Q4_K_M.gguf +3 -0
yi-chat-6b.Q4_K_S.gguf +3 -0
yi-chat-6b.Q5_0.gguf +3 -0
yi-chat-6b.Q5_K_M.gguf +3 -0
yi-chat-6b.Q5_K_S.gguf +3 -0
yi-chat-6b.Q6_K.gguf +3 -0
yi-chat-6b.Q8_0.gguf +3 -0
yi-chat-6b.f16.gguf +3 -0

.gitattributes CHANGED Viewed

@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+*.gguf filter=lfs diff=lfs merge=lfs -text

README.md ADDED Viewed

	@@ -0,0 +1,66 @@

+---
+base_model: 01-ai/yi-chat-6b-Chat
+inference: false
+license: other
+license_link: LICENSE
+license_name: yi-license
+model_creator: 01-ai
+model_name: Yi 34B Chat
+model_type: yi
+pipeline_tag: text-generation
+prompt_template: '<|im_start|>system
+  {system_message}<|im_end|>
+  <|im_start|>user
+  {prompt}<|im_end|>
+  <|im_start|>assistant
+  '
+quantized_by: XeIaso
+widget:
+- example_title: yi-chat-6b-Chat
+  output:
+    text: 'Hello! How can I assist you today?'
+  text: hi
+---
+# Yi 6B Chat - GGUF
+- Model creator: [01-ai](https://huggingface.co/01-ai)
+- Original model: [Yi 6B
+  Chat](https://huggingface.co/01-ai/yi-chat-6b-Chat)
+<!-- prompt-template start -->
+## Prompt template: ChatML
+```
+<|im_start|>system
+{system_message}<|im_end|>
+<|im_start|>user
+{prompt}<|im_end|>
+<|im_start|>assistant
+```
+<!-- prompt-template end -->
+<!-- README_GGUF.md-provided-files start -->
+## Provided files
+| Name | Quant method | Bits | Size | Max RAM required | Use case |
+| ---- | ---- | ---- | ---- | ---- | ----- |
+| [yi-chat-6b.Q2_K.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q2_K.gguf) | Q2_K | 2 | 2.62 GB| 5.12 GB | smallest, significant quality loss - not recommended for most purposes |
+| [yi-chat-6b.Q3_K_S.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q3_K_S.gguf) | Q3_K_S | 3 | 2.71 GB| 5.21 GB | very small, high quality loss |
+| [yi-chat-6b.Q3_K_M.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q3_K_M.gguf) | Q3_K_M | 3 | 2.99 GB| 5.49 GB | very small, high quality loss |
+| [yi-chat-6b.Q3_K_L.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q3_K_L.gguf) | Q3_K_L | 3 | 3.24 GB| 5.74 GB | small, substantial quality loss |
+| [yi-chat-6b.Q4_0.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q4_0.gguf) | Q4_0 | 4 | 3.48 GB| 5.98 GB | legacy; small, very high quality loss - prefer using Q3_K_M |
+| [yi-chat-6b.Q4_K_S.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q4_K_S.gguf) | Q4_K_S | 4 | 3.50 GB| 6.00 GB | small, greater quality loss |
+| [yi-chat-6b.Q4_K_M.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q4_K_M.gguf) | Q4_K_M | 4 | 3.67 GB| 6.17 GB | medium, balanced quality - recommended |
+| [yi-chat-6b.Q5_0.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q5_0.gguf) | Q5_0 | 5 | 4.20 GB| 6.70 GB | legacy; medium, balanced quality - prefer using Q4_K_M |
+| [yi-chat-6b.Q5_K_S.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q5_K_S.gguf) | Q5_K_S | 5 | 4.20 GB| 6.70 GB | large, low quality loss - recommended |
+| [yi-chat-6b.Q5_K_M.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q5_K_M.gguf) | Q5_K_M | 5 | 4.30 GB| 6.80 GB | large, very low quality loss - recommended |
+| [yi-chat-6b.Q6_K.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q6_K.gguf) | Q6_K | 6 | 4.97 GB| 7.47 GB | very large, extremely low quality loss |
+| [yi-chat-6b.Q8_0.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.Q8_0.gguf) | Q8_0 | 8 | 6.44 GB| 8.94 GB | very large, extremely low quality loss - not recommended |
+| [yi-chat-6b.f16.gguf](https://huggingface.co/XeIaso/yi-chat-6b-GGUF/blob/main/yi-chat-6b.f16.gguf) | f16 | 16 | 12.2 GB | 14 GB | extremely large, minimal quality loss |
+**Note**: the above RAM figures assume no GPU offloading. If layers are offloaded to the GPU, this will reduce RAM usage and use VRAM instead.
+<!-- README_GGUF.md-provided-files end -->
+If you want to support my efforts, check out my [Patreon](https://patreon.com/cadey).

config.json ADDED Viewed

	@@ -0,0 +1,3 @@

+{
+  "model_type": "yi"
+}

yi-chat-6b.Q2_K.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:abac18b4508ea281884ed547d0e20edd983c663d536598143b36c05ea58a52fd
+size 2621230656

yi-chat-6b.Q3_K_L.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:60055831e231d6fbc767340d487edcc64196820824ce678ea6e7f760cab96487
+size 3236892224

yi-chat-6b.Q3_K_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5b84a6d51fbf4ed60889c65cbac646998f54c865e954d0e40ed8d8fb45a37eeb
+size 2992836160

yi-chat-6b.Q3_K_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:df7c60d3020b734160021493cf449f07455b8d651441fd1fb1346e6a5240b525
+size 2709196352

yi-chat-6b.Q4_0.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:9a1a647d93f0686687bc776e37331fec00b2b0236584a95a9c8d0db6a3784ea3
+size 3479326272

yi-chat-6b.Q4_K_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:704cded3bd2af243be46a6f0bd904c8d5418804186bcfc8edbaea912ddd6d06f
+size 3673968192

yi-chat-6b.Q4_K_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:36cf0d946abbacd8a7d563be1165e8b4c80a41c4b8ce6d213042e697f3b7ad5a
+size 3502919232

yi-chat-6b.Q5_0.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:de1a19c734c0ef393ad2d55377c042c91195ef240eb7e8150241e21bfaf4b269
+size 4204154432

yi-chat-6b.Q5_K_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:844ae9b449d4a1cdb985ae87b77e22cbfc6ec09d632bc33e57aada364fff5036
+size 4304424512

yi-chat-6b.Q5_K_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:0707bf14e073092545973fdfd8c2de1eafee2d1e0c662103185596f03eb06b16
+size 4204154432

yi-chat-6b.Q6_K.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:098a0e5b819cfef7ed9ff46cc1d6dcbf9ad7e0dcd610206b0d14b993d50179ab
+size 4974284352

yi-chat-6b.Q8_0.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e98959bfa3bddc221eb45ba9d870174844e7de9e9b8f05e0b953e100e5ebe343
+size 6442126912

yi-chat-6b.f16.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:44f7c77c1b3d5f71721cb918add07b8be3074f6402acbebf62595682b8c50c77
+size 12124098080