GGUF
Japanese
English
TensorBlock
GGUF
Eval Results
morriszms commited on
Commit
d31b6ba
·
verified ·
1 Parent(s): 6dd0710

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -33,3 +33,15 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
 
 
 
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ youri-7b-chat-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
37
+ youri-7b-chat-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
38
+ youri-7b-chat-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
39
+ youri-7b-chat-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
40
+ youri-7b-chat-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
41
+ youri-7b-chat-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
42
+ youri-7b-chat-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
43
+ youri-7b-chat-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
44
+ youri-7b-chat-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
45
+ youri-7b-chat-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
46
+ youri-7b-chat-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
47
+ youri-7b-chat-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,188 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - ja
4
+ - en
5
+ license: llama2
6
+ datasets:
7
+ - databricks/databricks-dolly-15k
8
+ - kunishou/databricks-dolly-15k-ja
9
+ - izumi-lab/llm-japanese-dataset
10
+ thumbnail: https://github.com/rinnakk/japanese-pretrained-models/blob/master/rinna.png
11
+ inference: false
12
+ base_model: rinna/youri-7b-chat
13
+ tags:
14
+ - TensorBlock
15
+ - GGUF
16
+ model-index:
17
+ - name: youri-7b-chat
18
+ results:
19
+ - task:
20
+ type: text-generation
21
+ name: Text Generation
22
+ dataset:
23
+ name: AI2 Reasoning Challenge (25-Shot)
24
+ type: ai2_arc
25
+ config: ARC-Challenge
26
+ split: test
27
+ args:
28
+ num_few_shot: 25
29
+ metrics:
30
+ - type: acc_norm
31
+ value: 51.19
32
+ name: normalized accuracy
33
+ source:
34
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=rinna/youri-7b-chat
35
+ name: Open LLM Leaderboard
36
+ - task:
37
+ type: text-generation
38
+ name: Text Generation
39
+ dataset:
40
+ name: HellaSwag (10-Shot)
41
+ type: hellaswag
42
+ split: validation
43
+ args:
44
+ num_few_shot: 10
45
+ metrics:
46
+ - type: acc_norm
47
+ value: 76.09
48
+ name: normalized accuracy
49
+ source:
50
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=rinna/youri-7b-chat
51
+ name: Open LLM Leaderboard
52
+ - task:
53
+ type: text-generation
54
+ name: Text Generation
55
+ dataset:
56
+ name: MMLU (5-Shot)
57
+ type: cais/mmlu
58
+ config: all
59
+ split: test
60
+ args:
61
+ num_few_shot: 5
62
+ metrics:
63
+ - type: acc
64
+ value: 46.06
65
+ name: accuracy
66
+ source:
67
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=rinna/youri-7b-chat
68
+ name: Open LLM Leaderboard
69
+ - task:
70
+ type: text-generation
71
+ name: Text Generation
72
+ dataset:
73
+ name: TruthfulQA (0-shot)
74
+ type: truthful_qa
75
+ config: multiple_choice
76
+ split: validation
77
+ args:
78
+ num_few_shot: 0
79
+ metrics:
80
+ - type: mc2
81
+ value: 41.17
82
+ source:
83
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=rinna/youri-7b-chat
84
+ name: Open LLM Leaderboard
85
+ - task:
86
+ type: text-generation
87
+ name: Text Generation
88
+ dataset:
89
+ name: Winogrande (5-shot)
90
+ type: winogrande
91
+ config: winogrande_xl
92
+ split: validation
93
+ args:
94
+ num_few_shot: 5
95
+ metrics:
96
+ - type: acc
97
+ value: 75.06
98
+ name: accuracy
99
+ source:
100
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=rinna/youri-7b-chat
101
+ name: Open LLM Leaderboard
102
+ - task:
103
+ type: text-generation
104
+ name: Text Generation
105
+ dataset:
106
+ name: GSM8k (5-shot)
107
+ type: gsm8k
108
+ config: main
109
+ split: test
110
+ args:
111
+ num_few_shot: 5
112
+ metrics:
113
+ - type: acc
114
+ value: 1.52
115
+ name: accuracy
116
+ source:
117
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=rinna/youri-7b-chat
118
+ name: Open LLM Leaderboard
119
+ ---
120
+
121
+ <div style="width: auto; margin-left: auto; margin-right: auto">
122
+ <img src="https://i.imgur.com/jC7kdl8.jpeg" alt="TensorBlock" style="width: 100%; min-width: 400px; display: block; margin: auto;">
123
+ </div>
124
+ <div style="display: flex; justify-content: space-between; width: 100%;">
125
+ <div style="display: flex; flex-direction: column; align-items: flex-start;">
126
+ <p style="margin-top: 0.5em; margin-bottom: 0em;">
127
+ Feedback and support: TensorBlock's <a href="https://x.com/tensorblock_aoi">Twitter/X</a>, <a href="https://t.me/TensorBlock">Telegram Group</a> and <a href="https://x.com/tensorblock_aoi">Discord server</a>
128
+ </p>
129
+ </div>
130
+ </div>
131
+
132
+ ## rinna/youri-7b-chat - GGUF
133
+
134
+ This repo contains GGUF format model files for [rinna/youri-7b-chat](https://huggingface.co/rinna/youri-7b-chat).
135
+
136
+ The files were quantized using machines provided by [TensorBlock](https://tensorblock.co/), and they are compatible with llama.cpp as of [commit b4242](https://github.com/ggerganov/llama.cpp/commit/a6744e43e80f4be6398fc7733a01642c846dce1d).
137
+
138
+ <div style="text-align: left; margin: 20px 0;">
139
+ <a href="https://tensorblock.co/waitlist/client" style="display: inline-block; padding: 10px 20px; background-color: #007bff; color: white; text-decoration: none; border-radius: 5px; font-weight: bold;">
140
+ Run them on the TensorBlock client using your local machine ↗
141
+ </a>
142
+ </div>
143
+
144
+ ## Prompt template
145
+
146
+ ```
147
+
148
+ ```
149
+
150
+ ## Model file specification
151
+
152
+ | Filename | Quant type | File Size | Description |
153
+ | -------- | ---------- | --------- | ----------- |
154
+ | [youri-7b-chat-Q2_K.gguf](https://huggingface.co/tensorblock/youri-7b-chat-GGUF/blob/main/youri-7b-chat-Q2_K.gguf) | Q2_K | 2.533 GB | smallest, significant quality loss - not recommended for most purposes |
155
+ | [youri-7b-chat-Q3_K_S.gguf](https://huggingface.co/tensorblock/youri-7b-chat-GGUF/blob/main/youri-7b-chat-Q3_K_S.gguf) | Q3_K_S | 2.948 GB | very small, high quality loss |
156
+ | [youri-7b-chat-Q3_K_M.gguf](https://huggingface.co/tensorblock/youri-7b-chat-GGUF/blob/main/youri-7b-chat-Q3_K_M.gguf) | Q3_K_M | 3.298 GB | very small, high quality loss |
157
+ | [youri-7b-chat-Q3_K_L.gguf](https://huggingface.co/tensorblock/youri-7b-chat-GGUF/blob/main/youri-7b-chat-Q3_K_L.gguf) | Q3_K_L | 3.597 GB | small, substantial quality loss |
158
+ | [youri-7b-chat-Q4_0.gguf](https://huggingface.co/tensorblock/youri-7b-chat-GGUF/blob/main/youri-7b-chat-Q4_0.gguf) | Q4_0 | 3.826 GB | legacy; small, very high quality loss - prefer using Q3_K_M |
159
+ | [youri-7b-chat-Q4_K_S.gguf](https://huggingface.co/tensorblock/youri-7b-chat-GGUF/blob/main/youri-7b-chat-Q4_K_S.gguf) | Q4_K_S | 3.857 GB | small, greater quality loss |
160
+ | [youri-7b-chat-Q4_K_M.gguf](https://huggingface.co/tensorblock/youri-7b-chat-GGUF/blob/main/youri-7b-chat-Q4_K_M.gguf) | Q4_K_M | 4.081 GB | medium, balanced quality - recommended |
161
+ | [youri-7b-chat-Q5_0.gguf](https://huggingface.co/tensorblock/youri-7b-chat-GGUF/blob/main/youri-7b-chat-Q5_0.gguf) | Q5_0 | 4.652 GB | legacy; medium, balanced quality - prefer using Q4_K_M |
162
+ | [youri-7b-chat-Q5_K_S.gguf](https://huggingface.co/tensorblock/youri-7b-chat-GGUF/blob/main/youri-7b-chat-Q5_K_S.gguf) | Q5_K_S | 4.652 GB | large, low quality loss - recommended |
163
+ | [youri-7b-chat-Q5_K_M.gguf](https://huggingface.co/tensorblock/youri-7b-chat-GGUF/blob/main/youri-7b-chat-Q5_K_M.gguf) | Q5_K_M | 4.783 GB | large, very low quality loss - recommended |
164
+ | [youri-7b-chat-Q6_K.gguf](https://huggingface.co/tensorblock/youri-7b-chat-GGUF/blob/main/youri-7b-chat-Q6_K.gguf) | Q6_K | 5.529 GB | very large, extremely low quality loss |
165
+ | [youri-7b-chat-Q8_0.gguf](https://huggingface.co/tensorblock/youri-7b-chat-GGUF/blob/main/youri-7b-chat-Q8_0.gguf) | Q8_0 | 7.161 GB | very large, extremely low quality loss - not recommended |
166
+
167
+
168
+ ## Downloading instruction
169
+
170
+ ### Command line
171
+
172
+ Firstly, install Huggingface Client
173
+
174
+ ```shell
175
+ pip install -U "huggingface_hub[cli]"
176
+ ```
177
+
178
+ Then, downoad the individual model file the a local directory
179
+
180
+ ```shell
181
+ huggingface-cli download tensorblock/youri-7b-chat-GGUF --include "youri-7b-chat-Q2_K.gguf" --local-dir MY_LOCAL_DIR
182
+ ```
183
+
184
+ If you wanna download multiple model files with a pattern (e.g., `*Q4_K*gguf`), you can try:
185
+
186
+ ```shell
187
+ huggingface-cli download tensorblock/youri-7b-chat-GGUF --local-dir MY_LOCAL_DIR --local-dir-use-symlinks False --include='*Q4_K*gguf'
188
+ ```
youri-7b-chat-Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2d65279c5d98f16f4365f16a7e111c32cb20c1ff06f59bd791c43dd6f03ee0a5
3
+ size 2532865152
youri-7b-chat-Q3_K_L.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0e93405814ebab8b01b43868793286a869d4b8f067d586a4f4450ecea803be0b
3
+ size 3597112448
youri-7b-chat-Q3_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c347fc6b26392fb161adf01dd42e4424a1b2e1396efcc92dd6c1f3cd3a3b3942
3
+ size 3298006144
youri-7b-chat-Q3_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b9dd59ec85b3fe0adb09e7232ba2992e19f9b5b66043288ba309e9a843d1f75e
3
+ size 2948306048
youri-7b-chat-Q4_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a23773c8a92ddb0a853fee0d50c0c3258c3567bc13fdc9eb35e60a56391dcd1f
3
+ size 3825808512
youri-7b-chat-Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ba51a54a3ccb3dc79b95e82d12fc61bf5e632652fd94fa267594754378272eaf
3
+ size 4081005696
youri-7b-chat-Q4_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6603bbbc26ae0a0a3bee85b1208092135a2e0afcc28b51420af8dc53dd29b29f
3
+ size 3856741504
youri-7b-chat-Q5_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fbe903d0977b9d8a556375628473276c087937ada9125074134ed55862c51028
3
+ size 4651693184
youri-7b-chat-Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4ec060c4179b833cd480fa3a530ddc766b39810d4efc15a57630f58ccdb84deb
3
+ size 4783158400
youri-7b-chat-Q5_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:46a8f7bc632a9450f8a2ba688085bf64ff52e9c90f0638daa7a19a728433db44
3
+ size 4651693184
youri-7b-chat-Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:43a6bd0a928e510fd6e067963489c3dc8eee0497bd17fdc4bb89c4053d1e2077
3
+ size 5529195648
youri-7b-chat-Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1c659869174943992c5cfb5e135286fa164c8f59a3b805f4f3965706760344b0
3
+ size 7161091200