morriszms commited on
Commit
3846e67
·
verified ·
1 Parent(s): 89a8440

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -33,3 +33,15 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
 
 
 
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ gemma-ko-v01-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
37
+ gemma-ko-v01-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
38
+ gemma-ko-v01-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
39
+ gemma-ko-v01-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
40
+ gemma-ko-v01-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
41
+ gemma-ko-v01-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
42
+ gemma-ko-v01-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
43
+ gemma-ko-v01-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
44
+ gemma-ko-v01-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
45
+ gemma-ko-v01-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
46
+ gemma-ko-v01-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
47
+ gemma-ko-v01-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,82 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ license: apache-2.0
4
+ language:
5
+ - ko
6
+ pipeline_tag: text-generation
7
+ base_model: cpm-ai/gemma-ko-v01
8
+ tags:
9
+ - TensorBlock
10
+ - GGUF
11
+ ---
12
+
13
+ <div style="width: auto; margin-left: auto; margin-right: auto">
14
+ <img src="https://i.imgur.com/jC7kdl8.jpeg" alt="TensorBlock" style="width: 100%; min-width: 400px; display: block; margin: auto;">
15
+ </div>
16
+ <div style="display: flex; justify-content: space-between; width: 100%;">
17
+ <div style="display: flex; flex-direction: column; align-items: flex-start;">
18
+ <p style="margin-top: 0.5em; margin-bottom: 0em;">
19
+ Feedback and support: TensorBlock's <a href="https://x.com/tensorblock_aoi">Twitter/X</a>, <a href="https://t.me/TensorBlock">Telegram Group</a> and <a href="https://x.com/tensorblock_aoi">Discord server</a>
20
+ </p>
21
+ </div>
22
+ </div>
23
+
24
+ ## cpm-ai/gemma-ko-v01 - GGUF
25
+
26
+ This repo contains GGUF format model files for [cpm-ai/gemma-ko-v01](https://huggingface.co/cpm-ai/gemma-ko-v01).
27
+
28
+ The files were quantized using machines provided by [TensorBlock](https://tensorblock.co/), and they are compatible with llama.cpp as of [commit b4011](https://github.com/ggerganov/llama.cpp/commit/a6744e43e80f4be6398fc7733a01642c846dce1d).
29
+
30
+ <div style="text-align: left; margin: 20px 0;">
31
+ <a href="https://tensorblock.co/waitlist/client" style="display: inline-block; padding: 10px 20px; background-color: #007bff; color: white; text-decoration: none; border-radius: 5px; font-weight: bold;">
32
+ Run them on the TensorBlock client using your local machine ↗
33
+ </a>
34
+ </div>
35
+
36
+ ## Prompt template
37
+
38
+ ```
39
+ <bos><start_of_turn>user
40
+ {prompt}<end_of_turn>
41
+ <start_of_turn>model
42
+ ```
43
+
44
+ ## Model file specification
45
+
46
+ | Filename | Quant type | File Size | Description |
47
+ | -------- | ---------- | --------- | ----------- |
48
+ | [gemma-ko-v01-Q2_K.gguf](https://huggingface.co/tensorblock/gemma-ko-v01-GGUF/blob/main/gemma-ko-v01-Q2_K.gguf) | Q2_K | 1.078 GB | smallest, significant quality loss - not recommended for most purposes |
49
+ | [gemma-ko-v01-Q3_K_S.gguf](https://huggingface.co/tensorblock/gemma-ko-v01-GGUF/blob/main/gemma-ko-v01-Q3_K_S.gguf) | Q3_K_S | 1.200 GB | very small, high quality loss |
50
+ | [gemma-ko-v01-Q3_K_M.gguf](https://huggingface.co/tensorblock/gemma-ko-v01-GGUF/blob/main/gemma-ko-v01-Q3_K_M.gguf) | Q3_K_M | 1.289 GB | very small, high quality loss |
51
+ | [gemma-ko-v01-Q3_K_L.gguf](https://huggingface.co/tensorblock/gemma-ko-v01-GGUF/blob/main/gemma-ko-v01-Q3_K_L.gguf) | Q3_K_L | 1.365 GB | small, substantial quality loss |
52
+ | [gemma-ko-v01-Q4_0.gguf](https://huggingface.co/tensorblock/gemma-ko-v01-GGUF/blob/main/gemma-ko-v01-Q4_0.gguf) | Q4_0 | 1.445 GB | legacy; small, very high quality loss - prefer using Q3_K_M |
53
+ | [gemma-ko-v01-Q4_K_S.gguf](https://huggingface.co/tensorblock/gemma-ko-v01-GGUF/blob/main/gemma-ko-v01-Q4_K_S.gguf) | Q4_K_S | 1.453 GB | small, greater quality loss |
54
+ | [gemma-ko-v01-Q4_K_M.gguf](https://huggingface.co/tensorblock/gemma-ko-v01-GGUF/blob/main/gemma-ko-v01-Q4_K_M.gguf) | Q4_K_M | 1.518 GB | medium, balanced quality - recommended |
55
+ | [gemma-ko-v01-Q5_0.gguf](https://huggingface.co/tensorblock/gemma-ko-v01-GGUF/blob/main/gemma-ko-v01-Q5_0.gguf) | Q5_0 | 1.675 GB | legacy; medium, balanced quality - prefer using Q4_K_M |
56
+ | [gemma-ko-v01-Q5_K_S.gguf](https://huggingface.co/tensorblock/gemma-ko-v01-GGUF/blob/main/gemma-ko-v01-Q5_K_S.gguf) | Q5_K_S | 1.675 GB | large, low quality loss - recommended |
57
+ | [gemma-ko-v01-Q5_K_M.gguf](https://huggingface.co/tensorblock/gemma-ko-v01-GGUF/blob/main/gemma-ko-v01-Q5_K_M.gguf) | Q5_K_M | 1.713 GB | large, very low quality loss - recommended |
58
+ | [gemma-ko-v01-Q6_K.gguf](https://huggingface.co/tensorblock/gemma-ko-v01-GGUF/blob/main/gemma-ko-v01-Q6_K.gguf) | Q6_K | 1.921 GB | very large, extremely low quality loss |
59
+ | [gemma-ko-v01-Q8_0.gguf](https://huggingface.co/tensorblock/gemma-ko-v01-GGUF/blob/main/gemma-ko-v01-Q8_0.gguf) | Q8_0 | 2.486 GB | very large, extremely low quality loss - not recommended |
60
+
61
+
62
+ ## Downloading instruction
63
+
64
+ ### Command line
65
+
66
+ Firstly, install Huggingface Client
67
+
68
+ ```shell
69
+ pip install -U "huggingface_hub[cli]"
70
+ ```
71
+
72
+ Then, downoad the individual model file the a local directory
73
+
74
+ ```shell
75
+ huggingface-cli download tensorblock/gemma-ko-v01-GGUF --include "gemma-ko-v01-Q2_K.gguf" --local-dir MY_LOCAL_DIR
76
+ ```
77
+
78
+ If you wanna download multiple model files with a pattern (e.g., `*Q4_K*gguf`), you can try:
79
+
80
+ ```shell
81
+ huggingface-cli download tensorblock/gemma-ko-v01-GGUF --local-dir MY_LOCAL_DIR --local-dir-use-symlinks False --include='*Q4_K*gguf'
82
+ ```
gemma-ko-v01-Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d10fd42765f57fdaab9cc1cebe52bd21927299db1d60125dc033d9e673649c35
3
+ size 1157925120
gemma-ko-v01-Q3_K_L.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2936b7a66474cf74ab47418cb0b5c547b2526d5b6d7b4fa987a3e7bfde90a3a4
3
+ size 1465592064
gemma-ko-v01-Q3_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ac5e7a20edf69c24453f88e88f48ec14c4da9d2de9432b8f942eea89bc89304a
3
+ size 1383803136
gemma-ko-v01-Q3_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a7a7aa8065c6cd4bb1029699fba9e0aed77fc5d4cac96fd9f3f32b0155d73e84
3
+ size 1287981312
gemma-ko-v01-Q4_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:056223011adce573a1a366780d59a90d6de4811474a28c75bf22d86b06cdce78
3
+ size 1551190272
gemma-ko-v01-Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:817f3bf926a1d4d61ac5b7393a2cf2840cdf753ce6a3fe24d08136ff2753e681
3
+ size 1630263552
gemma-ko-v01-Q4_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:386de6574b2ad9be75d58c69daa4402a063867498d36bf1f0b8548494c3d641b
3
+ size 1559841024
gemma-ko-v01-Q5_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:daf07da104daa8789601e6ce14b9be7707fe1936f40e6067d6c57210e6c06e80
3
+ size 1798916352
gemma-ko-v01-Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:dfed9c2b0ae4e5aa2f9c2d47e15a30d4df3fa0cb18d14255f53782b4a8bffc1a
3
+ size 1839651072
gemma-ko-v01-Q5_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f76ec03d1895b0cd8cd3bfbb300777d23f6a23c91eeefc448f7a19506f246b6f
3
+ size 1798916352
gemma-ko-v01-Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0c3ed3addf8237f97c4c72d593350274e5ed5c933201642061fa998b7cd253eb
3
+ size 2062125312
gemma-ko-v01-Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:41d36d73c5ce002d7bc54d14234a08b19bac561a2819c231bd33b06cf19f0e4a
3
+ size 2669070592