Upload 4 files
- .gitattributes +1 -0
- README.md +93 -0
- config.json +5 -0
- imatrix-20k_random_data.dat +3 -0
- kiqu.webp +0 -0
.gitattributes
CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*.zst filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
+imatrix-20k_random_data.dat filter=lfs diff=lfs merge=lfs -text
README.md
CHANGED
@@ -1,3 +1,96 @@
---
license: cc-by-sa-4.0
language:
- ko
- en
model_creator: maywell
model_name: kiqu-70b
model_type: mistral
prompt_template: '[INST] {prompt} [/INST]

'
quantized_by: noopSD
---

> This repo contains quantized large language model (LLM) weight files in GGUF format for [maywell/kiqu-70b](https://huggingface.co/maywell/kiqu-70b). The quantization was calibrated with an importance matrix computed from [20k_random_data.txt](https://github.com/ggerganov/llama.cpp/files/13970111/20k_random_data.txt).
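
For reference, a minimal sketch of loading one of these GGUF files with `llama-cpp-python`; the repo id, quant filename, and runtime settings are illustrative assumptions, not taken from this repo's file list:

```python
# Sketch only: download a GGUF quant and load it with llama-cpp-python.
# Repo id and filename are hypothetical -- substitute the actual files in this repo.
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

gguf_path = hf_hub_download(
    repo_id="noopSD/kiqu-70b-GGUF",      # assumed repo id
    filename="kiqu-70b.Q4_K_M.gguf",     # hypothetical quant filename
)

llm = Llama(
    model_path=gguf_path,
    n_ctx=4096,        # context window; adjust to your hardware
    n_gpu_layers=-1,   # offload all layers to GPU if memory allows
)
```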

# **kiqu-70b** [(Arena Leaderboard)](https://huggingface.co/spaces/instructkr/ko-chatbot-arena-leaderboard)

<img src="./kiqu.webp" alt="kiqu-70B" width="390"/>

**kiqu-70b** is an SFT+DPO trained model based on Miqu-70B-Alpaca-DPO using **Korean** datasets.

Since this model is a finetune of miqu-1-70b, a leaked early version of Mistral-Medium, using it for commercial purposes is at your own risk.

Apart from that, the model itself follows **cc-by-sa-4.0**.

# **Model Details**

**Base Model**
miqu-1-70b (Early Mistral-Medium)

**Instruction format**

It follows the **Mistral** format.
Giving the model few-shot examples is highly recommended.

```
[INST] {instruction}
[/INST] {output}
```

Multi-shot

```
[INST] {instruction}
[/INST] {output}

[INST] {instruction}
[/INST] {output}

[INST] {instruction}
[/INST] {output}
.
.
.
```
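
The multi-shot layout above is plain string concatenation; the helper below is an illustrative sketch of assembling it (the function is not part of this repo):

```python
# Illustrative only: build a Mistral-style multi-shot prompt in the layout shown above.
def build_prompt(shots: list[tuple[str, str]], instruction: str, system: str = "") -> str:
    parts = []
    if system:
        parts.append(system)  # optional system line placed before the first [INST]
    for shot_instruction, shot_output in shots:  # few-shot examples are highly recommended
        parts.append(f"[INST] {shot_instruction}\n[/INST] {shot_output}\n")
    # Final turn: generation starts right after [/INST], with no trailing space.
    parts.append(f"[INST] {instruction}\n[/INST]")
    return "\n".join(parts)

# Example: zero-shot prompt
prompt = build_prompt([], "Introduce yourself in one sentence.")
```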

**Recommended Template** - 1-shot with system prompt

```
너는 kiqu-70B라는 한국어에 특화된 언어모델이야. 깔끔하고 자연스럽게 대답해줘!
[INST] 안녕?
[/INST] 안녕하세요! 무엇을 도와드릴까요? 질문이나 궁금한 점이 있다면 언제든지 말씀해주세요.

[INST] {instruction}
[/INST]
```

(The Korean system prompt tells the model it is kiqu-70B, a language model specialized in Korean, and asks it to answer cleanly and naturally; the 1-shot exchange is a short greeting and reply.)

A trailing space after [/INST] can affect the model's performance by a significant margin, so it is strongly recommended not to include a trailing space after [/INST] in the chat template when running inference.
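
As a minimal illustration (the model path and generation settings are placeholders, not files from this repo), the prompt string passed to `llama-cpp-python` should end exactly at `[/INST]`, with no space after it:

```python
from llama_cpp import Llama

llm = Llama(model_path="kiqu-70b.Q4_K_M.gguf", n_ctx=4096)  # placeholder path

# Prompt ends immediately after [/INST] -- no trailing space.
prompt = "[INST] 안녕?\n[/INST]"

out = llm(
    prompt,
    max_tokens=256,
    stop=["[INST]", "</s>"],  # stop before the model opens a new turn
)
print(out["choices"][0]["text"])
```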

# **Model Benchmark**

TBD

# **Author's Message**

This model's training was sponsored by no one, only by the support of people around the Earth.

[Support Me](https://www.buymeacoffee.com/mwell)

[Discord Server](https://discord.gg/MrBt3PXdXc)

Contact me on Discord - is.maywell

Follow me on Twitter - https://twitter.com/stablefluffy
config.json
ADDED
@@ -0,0 +1,5 @@
{
  "architectures": [
    "LlamaForCausalLM"
  ]
}
imatrix-20k_random_data.dat
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:751debf93409471426055211d47bc7386dce4c95d7c4274bb45ce7d7635b3845
size 24922254
kiqu.webp
ADDED