noopSD commited on
Commit
5098e61
โ€ข
1 Parent(s): d23f6e6

Upload 4 files

Browse files
Files changed (5) hide show
  1. .gitattributes +1 -0
  2. README.md +93 -0
  3. config.json +5 -0
  4. imatrix-20k_random_data.dat +3 -0
  5. kiqu.webp +0 -0
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ imatrix-20k_random_data.dat filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,3 +1,96 @@
1
  ---
2
  license: cc-by-sa-4.0
 
 
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: cc-by-sa-4.0
3
+ language:
4
+ - ko
5
+ - en
6
+ model_creator: maywell
7
+ model_name: kiqu-70b
8
+ model_type: mistral
9
+ prompt_template: '[INST] {prompt} [/INST]
10
+
11
+ '
12
+ quantized_by: noopSD
13
  ---
14
+
15
+ > This repo contains quantized large language model(LLM) weight files in GGUF format for [maywell/kiqu-70b](https://huggingface.co/maywell/kiqu-70b). The quantized model files are calibrated with [20k_random_data.txt](https://github.com/ggerganov/llama.cpp/files/13970111/20k_random_data.txt)
16
+
17
+ # **kiqu-70b** [(Arena Leaderboard)](https://huggingface.co/spaces/instructkr/ko-chatbot-arena-leaderboard)
18
+
19
+ <img src="./kiqu.webp" alt="kiqu-70B" width="390"/>
20
+
21
+ **kiqu-70b** is a SFT+DPO trained model based on Miqu-70B-Alpaca-DPO using **Korean** datasets.
22
+
23
+ Since this model is finetune of miqu-1-70b using it on commercial purposes is at your own risk. โ€” leaked early version Mistral-Medium
24
+
25
+ ๋ณธ ๋ชจ๋ธ **kiqu-70b**๋Š” Miqu-70B-Alpaca-DPO ๋ชจ๋ธ์„ ๊ธฐ๋ฐ˜์œผ๋กœ **ํ•œ๊ตญ์–ด** ๋ฐ์ดํ„ฐ์…‹์„ ์‚ฌ์šฉํ•˜์—ฌ SFT+DPO ํ›ˆ๋ จ์„ ์ง„ํ–‰ํ•˜์—ฌ ์ œ์ž‘๋˜์—ˆ์Šต๋‹ˆ๋‹ค.
26
+
27
+ ๋ฒ ์ด์Šค ๋ชจ๋ธ์ธ miqu-1-70b ๋ชจ๋ธ์ด ๋ฏธ์ŠคํŠธ๋ž„-๋ฏธ๋””์›€์˜ ์ดˆ๊ธฐ ์œ ์ถœ ๋ฒ„์ „์ด๊ธฐ์— ์ƒ์—…์  ์‚ฌ์šฉ์— ๋Œ€ํ•œ risk๋Š” ๋ณธ์ธ์—๊ฒŒ ์žˆ์Šต๋‹ˆ๋‹ค.
28
+
29
+ Beside that this model follows **cc-by-sa-4.0**
30
+
31
+ ๋ณธ ๋ชจ๋ธ ์ž์ฒด๋กœ์„œ๋Š” **cc-by-sa-4.0**์„ ๋”ฐ๋ฆ…๋‹ˆ๋‹ค.
32
+
33
+ # **Model Details**
34
+
35
+ **Base Model**
36
+ miqu-1-70b (Early Mistral-Medium)
37
+
38
+ **Instruction format**
39
+
40
+ It follows **Mistral** format.
41
+ Giving few-shots to model is highly recommended
42
+
43
+ ๋ณธ ๋ชจ๋ธ์€ ๋ฏธ์ŠคํŠธ๋ž„ ํฌ๋งท์„ ๋”ฐ๋ฆ…๋‹ˆ๋‹ค.
44
+ few-shot ์‚ฌ์šฉ์„ ์ ๊ทน ๊ถŒ์žฅํ•ฉ๋‹ˆ๋‹ค.
45
+
46
+ ```
47
+ [INST] {instruction}
48
+ [/INST] {output}
49
+ ```
50
+
51
+ Multi-shot
52
+
53
+ ```
54
+ [INST] {instruction}
55
+ [/INST] {output}
56
+
57
+ [INST] {instruction}
58
+ [/INST] {output}
59
+
60
+ [INST] {instruction}
61
+ [/INST] {output}
62
+ .
63
+ .
64
+ .
65
+ ```
66
+
67
+ **Recommended Template** - 1-shot with system prompt
68
+
69
+ ```
70
+ ๋„ˆ๋Š” kiqu-70B๋ผ๋Š” ํ•œ๊ตญ์–ด์— ํŠนํ™”๋œ ์–ธ์–ด๋ชจ๋ธ์ด์•ผ. ๊น”๋”ํ•˜๊ณ  ์ž์—ฐ์Šค๋Ÿฝ๊ฒŒ ๋Œ€๋‹ตํ•ด์ค˜!
71
+ [INST] ์•ˆ๋…•?
72
+ [/INST] ์•ˆ๋…•ํ•˜์„ธ์š”! ๋ฌด์—‡์„ ๋„์™€๋“œ๋ฆด๊นŒ์š”? ์งˆ๋ฌธ์ด๋‚˜ ๊ถ๊ธˆํ•œ ์ ์ด ์žˆ๋‹ค๋ฉด ์–ธ์ œ๋“ ์ง€ ๋ง์”€ํ•ด์ฃผ์„ธ์š”.
73
+
74
+ [INST] {instruction}
75
+ [/INST]
76
+ ```
77
+
78
+ Trailing space after [/INST] can affect models performance in significant margin. So, when doing inference it is recommended to not include trailing space in chat template.
79
+
80
+ [/INST] ๋’ค์— ๋„์–ด์“ฐ๊ธฐ๋Š” ๋ชจ๋ธ ์„ฑ๋Šฅ์— ์œ ์˜๋ฏธํ•œ ์˜ํ–ฅ์„ ๋ฏธ์นฉ๋‹ˆ๋‹ค. ๋”ฐ๋ผ์„œ, ์ธํผ๋Ÿฐ์Šค(์ถ”๋ก )๊ณผ์ •์—์„œ๋Š” ์ฑ— ํ…œํ”Œ๋ฆฟ์— ๋„์–ด์“ฐ๊ธฐ๋ฅผ ์ œ์™ธํ•˜๋Š” ๊ฒƒ์„ ์ ๊ทน ๊ถŒ์žฅํ•ฉ๋‹ˆ๋‹ค.
81
+
82
+ # **Model Benchmark**
83
+
84
+ TBD
85
+
86
+ # **Author's Message**
87
+
88
+ This model's training got sponsered by no one but support from people around Earth.
89
+
90
+ [Support Me](https://www.buymeacoffee.com/mwell)
91
+
92
+ [Discord Server](https://discord.gg/MrBt3PXdXc)
93
+
94
+ Contact Me on Discord - is.maywell
95
+
96
+ Follow me on twitter - https://twitter.com/stablefluffy
config.json ADDED
@@ -0,0 +1,5 @@
 
 
 
 
 
 
1
+ {
2
+ "architectures": [
3
+ "LlamaForCausalLM"
4
+ ]
5
+ }
imatrix-20k_random_data.dat ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:751debf93409471426055211d47bc7386dce4c95d7c4274bb45ce7d7635b3845
3
+ size 24922254
kiqu.webp ADDED