shaowenchen committed
Commit 368d708
1 Parent(s): e9ad72e
README.md CHANGED
@@ -17,42 +17,40 @@ tags:
  - chinese
  ---

- > English | [中文](README_zh.md)
-
  ## Provided files

- | Name | Quant method | Size |
- | ------------------------------ | ------------ | ------ |
- | [chinese-llama-2-7b.Q2_K.gguf](https://huggingface.co/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q2_K.gguf) | Q2_K | 2.7 GB |
- | [chinese-llama-2-7b.Q3_K.gguf](https://huggingface.co/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q3_K.gguf) | Q3_K | 3.2 GB |
- | [chinese-llama-2-7b.Q3_K_L.gguf](https://huggingface.co/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q3_K_L.gguf) | Q3_K_L | 3.5 GB |
- | [chinese-llama-2-7b.Q3_K_S.gguf](https://huggingface.co/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q3_K_S.gguf) | Q3_K_S | 2.9 GB |
- | [chinese-llama-2-7b.Q4_0.gguf](https://huggingface.co/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q4_0.gguf) | Q4_0 | 3.7 GB |
- | [chinese-llama-2-7b.Q4_1.gguf](https://huggingface.co/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q4_1.gguf) | Q4_1 | 4.1 GB |
- | [chinese-llama-2-7b.Q4_K.gguf](https://huggingface.co/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q4_K.gguf) | Q4_K | 3.9 GB |
- | [chinese-llama-2-7b.Q4_K_S.gguf](https://huggingface.co/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q4_K_S.gguf) | Q4_K_S | 3.7 GB |
- | [chinese-llama-2-7b.Q5_0.gguf](https://huggingface.co/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q5_0.gguf) | Q5_0 | 4.5 GB |
- | [chinese-llama-2-7b.Q5_1.gguf](https://huggingface.co/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q5_1.gguf) | Q5_1 | 4.9 GB |
- | [chinese-llama-2-7b.Q5_K.gguf](https://huggingface.co/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q5_K.gguf) | Q5_K | 4.6 GB |
- | [chinese-llama-2-7b.Q5_K_S.gguf](https://huggingface.co/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q5_K_S.gguf) | Q5_K_S | 4.5 GB |
- | [chinese-llama-2-7b.Q6_K.gguf](https://huggingface.co/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q6_K.gguf) | Q6_K | 5.3 GB |
- | [chinese-llama-2-7b.Q8_0.gguf](https://huggingface.co/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q8_0.gguf) | Q8_0 | 6.9 GB |
- | [chinese-llama-2-7b.gguf](https://huggingface.co/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.gguf) | full | 13 GB |
+ | Name | Quant method | Size |
+ | ------------------------------ | ------------ | ------ |
+ | chinese-llama-2-7b.Q2_K.gguf | Q2_K | 2.7 GB |
+ | chinese-llama-2-7b.Q3_K.gguf | Q3_K | 3.2 GB |
+ | chinese-llama-2-7b.Q3_K_L.gguf | Q3_K_L | 3.5 GB |
+ | chinese-llama-2-7b.Q3_K_S.gguf | Q3_K_S | 2.9 GB |
+ | chinese-llama-2-7b.Q4_0.gguf | Q4_0 | 3.7 GB |
+ | chinese-llama-2-7b.Q4_1.gguf | Q4_1 | 4.1 GB |
+ | chinese-llama-2-7b.Q4_K.gguf | Q4_K | 3.9 GB |
+ | chinese-llama-2-7b.Q4_K_S.gguf | Q4_K_S | 3.7 GB |
+ | chinese-llama-2-7b.Q5_0.gguf | Q5_0 | 4.5 GB |
+ | chinese-llama-2-7b.Q5_1.gguf | Q5_1 | 4.9 GB |
+ | chinese-llama-2-7b.Q5_K.gguf | Q5_K | 4.6 GB |
+ | chinese-llama-2-7b.Q5_K_S.gguf | Q5_K_S | 4.5 GB |
+ | chinese-llama-2-7b.Q6_K.gguf | Q6_K | 5.3 GB |
+ | chinese-llama-2-7b.Q8_0.gguf | Q8_0 | 6.9 GB |
+ | chinese-llama-2-7b.gguf | full | 13 GB |

  ## Provided images

- | Name | Quant method | Size |
- | -------------------------------------------- | ------------ | ------- |
- | [shaowenchen/chinese-llama-2-7b-gguf:Q2_K](https://hub.docker.com/repository/docker/shaowenchen/chinese-llama-2-7b-gguf/general) | Q2_K | 3.68 GB |
- | [shaowenchen/chinese-llama-2-7b-gguf:Q3_K](https://hub.docker.com/repository/docker/shaowenchen/chinese-llama-2-7b-gguf/general) | Q3_K | 4.16 GB |
- | [shaowenchen/chinese-llama-2-7b-gguf:Q3_K_L](https://hub.docker.com/repository/docker/shaowenchen/chinese-llama-2-7b-gguf/general) | Q3_K_L | 4.46 GB |
- | [shaowenchen/chinese-llama-2-7b-gguf:Q3_K_S](https://hub.docker.com/repository/docker/shaowenchen/chinese-llama-2-7b-gguf/general) | Q3_K_S | 3.81 GB |
- | [shaowenchen/chinese-llama-2-7b-gguf:Q4_0](https://hub.docker.com/repository/docker/shaowenchen/chinese-llama-2-7b-gguf/general) | Q4_0 | 4.7 GB |
- | [shaowenchen/chinese-llama-2-7b-gguf:Q4_K](https://hub.docker.com/repository/docker/shaowenchen/chinese-llama-2-7b-gguf/general) | Q4_K | 4.95 GB |
- | [shaowenchen/chinese-llama-2-7b-gguf:Q4_K_S](https://hub.docker.com/repository/docker/shaowenchen/chinese-llama-2-7b-gguf/general) | Q4_K_S | 4.73 GB |
+ | Name | Quant method | Size |
+ | -------------------------------------------- | ------------ | ------- |
+ | `shaowenchen/chinese-llama-2-7b-gguf:Q2_K` | Q2_K | 3.68 GB |
+ | `shaowenchen/chinese-llama-2-7b-gguf:Q3_K` | Q3_K | 4.16 GB |
+ | `shaowenchen/chinese-llama-2-7b-gguf:Q3_K_L` | Q3_K_L | 4.46 GB |
+ | `shaowenchen/chinese-llama-2-7b-gguf:Q3_K_S` | Q3_K_S | 3.81 GB |
+ | `shaowenchen/chinese-llama-2-7b-gguf:Q4_0` | Q4_0 | 4.7 GB |
+ | `shaowenchen/chinese-llama-2-7b-gguf:Q4_K` | Q4_K | 4.95 GB |
+ | `shaowenchen/chinese-llama-2-7b-gguf:Q4_K_S` | Q4_K_S | 4.73 GB |

  ```
  docker run --rm -p 8000:8000 shaowenchen/chinese-llama-2-7b-gguf:Q2_K
  ```

- and open http://localhost:8000/docs to view the API documentation.
+ and open http://localhost:8000/docs to see the Swagger UI.
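For anyone following the updated README, below is a minimal sketch of exercising the containerized server once the `docker run` command above is running. It assumes the image exposes a llama-cpp-python-style, OpenAI-compatible HTTP API on port 8000; the `/v1/completions` route and request fields are assumptions rather than something this commit confirms, so check the `/docs` page for the actual routes.

```
# With the container from the README running on port 8000, and assuming an
# OpenAI-compatible completions endpoint is exposed:
curl -s http://localhost:8000/v1/completions \
  -H "Content-Type: application/json" \
  -d '{"prompt": "请用一句话介绍大语言模型。", "max_tokens": 128}'
```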
README_zh.md DELETED
@@ -1,58 +0,0 @@
- ---
- inference: false
- language:
-   - zh
- license: apache-2.0
- model_creator: ziqingyang
- model_link: https://www.modelscope.cn/ziqingyang/chinese-llama-2-7b
- model_name: chinese-llama-2-7b
- model_type: llama
- pipeline_tag: text-generation
- quantized_by: shaowenchen
- tags:
-   - meta
-   - gguf
-   - llama
-   - llama-2
-   - chinese
- ---
-
- > [English](README.md) | 中文
-
- ## Provided files
-
- | Name | Quant method | Size |
- | ---- | ------------ | ------ |
- | [chinese-llama-2-7b.Q2_K.gguf](https://www.modelscope.cn/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q2_K.gguf) | Q2_K | 2.7 GB |
- | [chinese-llama-2-7b.Q3_K.gguf](https://www.modelscope.cn/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q3_K.gguf) | Q3_K | 3.2 GB |
- | [chinese-llama-2-7b.Q3_K_L.gguf](https://www.modelscope.cn/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q3_K_L.gguf) | Q3_K_L | 3.5 GB |
- | [chinese-llama-2-7b.Q3_K_S.gguf](https://www.modelscope.cn/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q3_K_S.gguf) | Q3_K_S | 2.9 GB |
- | [chinese-llama-2-7b.Q4_0.gguf](https://www.modelscope.cn/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q4_0.gguf) | Q4_0 | 3.7 GB |
- | [chinese-llama-2-7b.Q4_1.gguf](https://www.modelscope.cn/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q4_1.gguf) | Q4_1 | 4.1 GB |
- | [chinese-llama-2-7b.Q4_K.gguf](https://www.modelscope.cn/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q4_K.gguf) | Q4_K | 3.9 GB |
- | [chinese-llama-2-7b.Q4_K_S.gguf](https://www.modelscope.cn/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q4_K_S.gguf) | Q4_K_S | 3.7 GB |
- | [chinese-llama-2-7b.Q5_0.gguf](https://www.modelscope.cn/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q5_0.gguf) | Q5_0 | 4.5 GB |
- | [chinese-llama-2-7b.Q5_1.gguf](https://www.modelscope.cn/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q5_1.gguf) | Q5_1 | 4.9 GB |
- | [chinese-llama-2-7b.Q5_K.gguf](https://www.modelscope.cn/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q5_K.gguf) | Q5_K | 4.6 GB |
- | [chinese-llama-2-7b.Q5_K_S.gguf](https://www.modelscope.cn/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q5_K_S.gguf) | Q5_K_S | 4.5 GB |
- | [chinese-llama-2-7b.Q6_K.gguf](https://www.modelscope.cn/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q6_K.gguf) | Q6_K | 5.3 GB |
- | [chinese-llama-2-7b.Q8_0.gguf](https://www.modelscope.cn/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.Q8_0.gguf) | Q8_0 | 6.9 GB |
- | [chinese-llama-2-7b.gguf](https://www.modelscope.cn/shaowenchen/chinese-llama-2-7b-GGUF/blob/main/chinese-llama-2-7b.gguf) | full | 13 GB |
-
- ## Provided images
-
- | Name | Quant method | Size |
- | ---- | ------------ | ------- |
- | [shaowenchen/chinese-llama-2-7b-gguf:Q2_K](https://hub.docker.com/repository/docker/shaowenchen/chinese-llama-2-7b-gguf/general) | Q2_K | 3.68 GB |
- | [shaowenchen/chinese-llama-2-7b-gguf:Q3_K](https://hub.docker.com/repository/docker/shaowenchen/chinese-llama-2-7b-gguf/general) | Q3_K | 4.16 GB |
- | [shaowenchen/chinese-llama-2-7b-gguf:Q3_K_L](https://hub.docker.com/repository/docker/shaowenchen/chinese-llama-2-7b-gguf/general) | Q3_K_L | 4.46 GB |
- | [shaowenchen/chinese-llama-2-7b-gguf:Q3_K_S](https://hub.docker.com/repository/docker/shaowenchen/chinese-llama-2-7b-gguf/general) | Q3_K_S | 3.81 GB |
- | [shaowenchen/chinese-llama-2-7b-gguf:Q4_0](https://hub.docker.com/repository/docker/shaowenchen/chinese-llama-2-7b-gguf/general) | Q4_0 | 4.7 GB |
- | [shaowenchen/chinese-llama-2-7b-gguf:Q4_K](https://hub.docker.com/repository/docker/shaowenchen/chinese-llama-2-7b-gguf/general) | Q4_K | 4.95 GB |
- | [shaowenchen/chinese-llama-2-7b-gguf:Q4_K_S](https://hub.docker.com/repository/docker/shaowenchen/chinese-llama-2-7b-gguf/general) | Q4_K_S | 4.73 GB |
-
- ```
- docker run --rm -p 8000:8000 shaowenchen/chinese-llama-2-7b-gguf:Q2_K
- ```
-
- and open http://localhost:8000/docs to view the API documentation.
chinese-llama-2-7b.Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:beb5c78a461819fae3d8c51c72293c70237ae0af3bd7e0a69826932825122c6b
+ size 2936032800
chinese-llama-2-7b.Q3_K.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:7cf785014e4ff2c6f18aed9467939ba41457bf2899abf6e9b95f74ce53307dd3
+ size 3417787936
chinese-llama-2-7b.Q3_K_L.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:c24abe12dce1a3981772647686f668c218190217aeef800358c4c0e43ce9e51c
+ size 3716894240
chinese-llama-2-7b.Q3_K_S.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:fd9e7a77814ca9acca0a85de0fd41973fd8a6f59c8bbbf3529699f450d8a322a
+ size 3068087840
chinese-llama-2-7b.Q4_0.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:b28b7c388426594e6cf7fb5b4629b3008d7f16154838aff6cb3a9f3be8bed5a5
+ size 3958263328
chinese-llama-2-7b.Q4_1.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:01bacb9ed904eb89b48d8f85a7f201c85f40f91f50099df5c5fcbbe999d52d65
+ size 4377169440
chinese-llama-2-7b.Q4_K.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:cafc3f11b81e3a3b2d6ccc90d02385c09cdc221a437edb57380939537f26ed85
+ size 4213460512
chinese-llama-2-7b.Q4_K_S.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:63824393c46d4756d41c46751cdaf764523c3d10209695c880ee947789618448
+ size 3989196320
chinese-llama-2-7b.Q5_0.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:4a93664a3ac9ec07abfc1cbbf6c239dac355869330780a92978a8ff1c0484930
+ size 4796075552
chinese-llama-2-7b.Q5_1.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:75776288868e502f52110fdc251f93fcc7d53aaf1cd458db353c2603d4f9292a
+ size 5214981664
chinese-llama-2-7b.Q5_K.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:67b365ba50f6d94e9aef066ffe090be1fedfeb0787c4f677e3f648b8531b2e19
+ size 4927540768
chinese-llama-2-7b.Q5_K_S.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:32539dad763566a452a9efd50e1c98ecf8a5ff9d3ef03ed9bf80675e70b0ae4e
+ size 4796075552
chinese-llama-2-7b.Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:7d6bc82e9a0a69e96d7865008ba98dc4a38adc1dceef3fd2c64e0719d2ce4fc7
+ size 5686251040
chinese-llama-2-7b.Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:0182d4188160f6c283a4b1e624f9b97e80b271bddc9ecbd00498ddb318d57ebf
+ size 7364365856
chinese-llama-2-7b.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:acbd95212ffc6215b91308065ca49bcb030c8f8b87395864e9c06449324c4c39
+ size 13860294144
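The blobs added above are Git LFS pointer files: only the `oid` and `size` live in the repository, so the actual weights have to be pulled before use. A minimal sketch, assuming a recent `huggingface_hub` CLI and a llama.cpp build that ships `llama-cli` (older builds name the binary `main`):

```
# Download a single quantized file from the Hub
huggingface-cli download shaowenchen/chinese-llama-2-7b-GGUF \
  chinese-llama-2-7b.Q4_K.gguf --local-dir .

# Or clone the repo and materialize just one LFS object
git lfs install
git clone https://huggingface.co/shaowenchen/chinese-llama-2-7b-GGUF
cd chinese-llama-2-7b-GGUF && git lfs pull --include="chinese-llama-2-7b.Q4_K.gguf"

# Quick local smoke test with llama.cpp
llama-cli -m chinese-llama-2-7b.Q4_K.gguf -p "你好，请介绍一下你自己。" -n 128
```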
configuration.json CHANGED
@@ -3,5 +3,8 @@
   "task": "text-generation",
   "model": {
     "type": "llama2"
-  }
+  },
+  "pipeline": {
+    "type": "chinese-llama-2-7b-text-generation-pipe"
+  }
   }
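The `pipeline` block added here points the repository at a named ModelScope text-generation pipeline. As a hedged sketch of fetching the ModelScope copy of this repo (the `.git` clone URL follows ModelScope's usual pattern and is an assumption; the weights again come down through Git LFS):

```
# Clone the ModelScope mirror of the repo; large files arrive via Git LFS
git lfs install
git clone https://www.modelscope.cn/shaowenchen/chinese-llama-2-7b-GGUF.git
```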