Upload folder using huggingface_hub

Browse files

Files changed (4) hide show

.gitattributes +1 -0
README.md +64 -0
nekomata-7b.Q4_K_M.gguf +3 -0
rinna.png +0 -0

.gitattributes CHANGED Viewed

@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+nekomata-7b.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text

README.md ADDED Viewed

	@@ -0,0 +1,64 @@

+---
+thumbnail: https://github.com/rinnakk/japanese-pretrained-models/blob/master/rinna.png
+language:
+- ja
+- en
+tags:
+- qwen
+inference: false
+---
+# `rinna/nekomata-7b-gguf`
+![rinna-icon](./rinna.png)
+# Overview
+The model is the GGUF version of [`rinna/nekomata-7b`](https://huggingface.co/rinna/nekomata-7b). It can be used with [llama.cpp](https://github.com/ggerganov/llama.cpp) for lightweight inference.
+Quantization of this model may cause stability issue in GPTQ, AWQ and GGUF q4_0. We recommend **GGUF q4_K_M** for 4-bit quantization.
+See [`rinna/nekomata-7b`](https://huggingface.co/rinna/nekomata-7b) for details about model architecture and data.
+* **Authors**
+    - [Toshiaki Wakatsuki](https://huggingface.co/t-w)
+    - [Tianyu Zhao](https://huggingface.co/tianyuz)
+    - [Kei Sawada](https://huggingface.co/keisawada)
+---
+# How to use the model
+See [llama.cpp](https://github.com/ggerganov/llama.cpp) for more usage details.
+~~~~bash
+git clone https://github.com/ggerganov/llama.cpp
+cd llama.cpp
+make
+MODEL_PATH=/path/to/nekomata-7b-gguf/nekomata-7b.Q4_K_M.gguf
+MAX_N_TOKENS=128
+PROMPT="西田幾多郎は、"
+./main -m ${MODEL_PATH} -n ${MAX_N_TOKENS} -p "${PROMPT}"
+~~~~
+---
+# Tokenization
+Please refer to [`rinna/nekomata-7b`](https://huggingface.co/rinna/nekomata-7b) for tokenization details.
+---
+# How to cite
+~~~
+@misc{RinnaNekomata7bGGUF,
+    url={https://huggingface.co/rinna/nekomata-7b-gguf},
+    title={rinna/nekomata-7b-gguf},
+    author={Wakatsuki, Toshiaki and Zhao, Tianyu and Sawada, Kei}
+}
+~~~
+---
+# License
+[Tongyi Qianwen LICENSE AGREEMENT](https://github.com/QwenLM/Qwen/blob/main/Tongyi%20Qianwen%20LICENSE%20AGREEMENT)

nekomata-7b.Q4_K_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c9fda77da34e1093449299c7d3cf56ac71feaf3c80f264b7acf4b5846590987b
+size 4899217600

rinna.png ADDED Viewed