QuantPanda
commited on
Commit
•
18ab389
1
Parent(s):
d8d98ce
Update README.md
Browse files
README.md
CHANGED
@@ -1,5 +1,30 @@
|
|
1 |
-
---
|
2 |
-
license: other
|
3 |
-
license_name: glm-4-9b-license
|
4 |
-
license_link: https://huggingface.co/THUDM/glm-4-9b-chat/blob/main/LICENSE
|
5 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: other
|
3 |
+
license_name: glm-4-9b-license
|
4 |
+
license_link: https://huggingface.co/THUDM/glm-4-9b-chat/blob/main/LICENSE
|
5 |
+
datasets:
|
6 |
+
- THUDM/LongWriter-6k
|
7 |
+
language:
|
8 |
+
- en
|
9 |
+
pipeline_tag: text-generation
|
10 |
+
---
|
11 |
+
|
12 |
+
# LongWriter-glm4-9b
|
13 |
+
|
14 |
+
Original model link: https://huggingface.co/THUDM/LongWriter-glm4-9b
|
15 |
+
|
16 |
+
Model by: **THUDM**
|
17 |
+
|
18 |
+
Quants by: **QuantPanda**
|
19 |
+
|
20 |
+
GGUF quantization for [llama.cpp](https://github.com/ggerganov/llama.cpp) and similar applications.
|
21 |
+
|
22 |
+
**Example:**
|
23 |
+
|
24 |
+
``./llama-cli -m LongWriter-glm4-9B-Q5_K_M.gguf -p "You are a helpful AI assistant." --conversation``
|
25 |
+
|
26 |
+
If the model takes too long to load you can reduce the context size with ```--ctx-size```
|
27 |
+
|
28 |
+
**Example with smaller context size:**
|
29 |
+
|
30 |
+
``./llama-cli -m LongWriter-glm4-9B-Q5_K_M.gguf -p "You are a helpful AI assistant." --conversation --ctx-size 4096``
|