npc0
/

chatglm3-6b-32k-fp16

Model card Files Files and versions Community

npc0 commited on Nov 27, 2023

Commit

76c03a4

•

1 Parent(s): 96f99bc

Create README.md

Files changed (1) hide show

README.md +40 -0

README.md ADDED Viewed

	@@ -0,0 +1,40 @@

+---
+language:
+- zh
+- en
+tags:
+- glm
+- chatglm
+- ggml
+---
+# ChatGLM3-6B-32k-fp16
+介绍 (Introduction)
+ChatGLM3-6B-32k 是 ChatGLM 系列最新一代的开源模型，[THUDM/chatglm3-6b](https://github.com/THUDM/ChatGLM3)
+用 [ChatGLM.CPP](https://github.com/li-plus/chatglm.cpp) 基於 GGML quantize 生成 f16 權重 weights 儲存於此倉庫。
+## Performance
+|Model                     |GGML quantize method| HDD size |
+|--------------------------|--------------------|----------|
+|chatglm3-32k-ggml-q4_0.bin|        f16         |  ?.?? GB |
+## Getting Started
+1. Install dependency
+  ```sh
+  pip install chatglm-cpp transformers
+  ```
+2. Download weight
+  ```sh
+  wget https://huggingface.co/npc0/chatglm3-6b-32k-f16/resolve/main/chatglm3-32k-ggml-f16.bin
+  ```
+3. Code
+  ```py
+  import chatglm_cpp
+  pipeline = chatglm_cpp.Pipeline("./chatglm3-32k-ggml-f16.bin")
+  pipeline.chat(["你好"])
+  # Output: 你好👋！我是人工智能助手 ChatGLM3-6B，很高兴见到你，欢迎问我任何问题。
+  ```