---
language:
  - zh
  - en
tags:
  - glm
  - chatglm
  - thudm
---

# ChatGLM2 6b int4 g32 quantized model

See [K024/chatglm-q](https://github.com/K024/chatglm-q) for more details.

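```python
import torch
from chatglm_q.decoder import ChatGLMDecoder, chat_template

# Load the quantized weights onto the GPU.
device = torch.device("cuda")
decoder = ChatGLMDecoder.from_pretrained("K024/chatglm2-6b-int4g32", device=device)

# Build a prompt from an empty chat history and one user message ("Who am I?").
prompt = chat_template([], "我是谁?")
# Print each piece of output the generator yields.
for text in decoder.generate(prompt):
    print(text)
```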
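
The `int4g32` suffix in the model name presumably stands for 4-bit integer weights quantized in groups of 32, each group with its own scale. That reading is an assumption based on the name, and the sketch below is a generic illustration of symmetric group-wise quantization in plain Python, not the scheme chatglm-q actually implements. The function names and the sample weight row are hypothetical.

```python
def quantize_group(values, bits=4):
    """Symmetric quantization of one group of floats to signed integers."""
    qmax = 2 ** (bits - 1) - 1      # 7 for 4 bits
    qmin = -(2 ** (bits - 1))       # -8 for 4 bits
    max_abs = max(abs(v) for v in values)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    ints = [max(qmin, min(qmax, round(v / scale))) for v in values]
    return ints, scale


def quantize_row(row, group_size=32, bits=4):
    """Split a weight row into groups and quantize each with its own scale."""
    groups = [row[i:i + group_size] for i in range(0, len(row), group_size)]
    return [quantize_group(group, bits) for group in groups]


def dequantize_row(quantized):
    """Rebuild approximate float weights from (ints, scale) pairs."""
    return [q * scale for ints, scale in quantized for q in ints]


# Hypothetical 64-element weight row, i.e. two groups of 32 (assumed group size).
row = [((i * 37) % 64 - 32) / 100.0 for i in range(64)]
packed = quantize_row(row)
restored = dequantize_row(packed)
max_error = max(abs(a - b) for a, b in zip(row, restored))
print(f"groups: {len(packed)}, max abs error: {max_error:.4f}")
```

Smaller groups mean each scale covers fewer weights, which lowers rounding error at the cost of storing more scales.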

Model weights are released under the same license as ChatGLM2-6b; see [MODEL LICENSE](https://huggingface.co/THUDM/chatglm2-6b/blob/main/MODEL_LICENSE).