File size: 973 Bytes
2f480cf
 
 
aed7914
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
---
license: bsd
---

Converted INT8/INT4 files for [fastllm](https://github.com/ztxz16/fastllm) with [baichuan2-13b-chat](https://huggingface.co/baichuan-inc/Baichuan2-13B-Chat)

Directly download from Baidu Netdisk:

Link:https://pan.baidu.com/s/1Xsiif_1VzDyWFei1u5oJcA 
Code:wxbo

Updated time: 2023/09/11


```

baichuan2-13b-chat-int8.flm:

|===============================+======================+======================|
|   0  NVIDIA GeForce ...  Off  | 00000000:05:00.0 Off |                  N/A |
| 31%   36C    P8    28W / 250W |  15420MiB / 22528MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
        
```

```python
from fastllm_pytools import llm
model = llm.model("baichuan2-13b-chat-int8.flm")
for response in model.stream_response("介绍一下南京"):
    print(response, flush = True, end = "")
```
> Note: please use the lastest version of FastLLM (no later than 2023/09/11 main branch)