QuantPanda committed on
Commit
18ab389
1 Parent(s): d8d98ce

Update README.md

Files changed (1)
  1. README.md +30 -5
README.md CHANGED
@@ -1,5 +1,30 @@
- ---
- license: other
- license_name: glm-4-9b-license
- license_link: https://huggingface.co/THUDM/glm-4-9b-chat/blob/main/LICENSE
- ---
+ ---
+ license: other
+ license_name: glm-4-9b-license
+ license_link: https://huggingface.co/THUDM/glm-4-9b-chat/blob/main/LICENSE
+ datasets:
+ - THUDM/LongWriter-6k
+ language:
+ - en
+ pipeline_tag: text-generation
+ ---
+
+ # LongWriter-glm4-9b
+
+ Original model link: https://huggingface.co/THUDM/LongWriter-glm4-9b
+
+ Model by: **THUDM**
+
+ Quants by: **QuantPanda**
+
+ GGUF quantization for [llama.cpp](https://github.com/ggerganov/llama.cpp) and similar applications.
+
+ **Example:**
+
+ ``./llama-cli -m LongWriter-glm4-9B-Q5_K_M.gguf -p "You are a helpful AI assistant." --conversation``
+
+ If the model takes too long to load, you can reduce the context size with `--ctx-size`.
+
+ **Example with smaller context size:**
+
+ ``./llama-cli -m LongWriter-glm4-9B-Q5_K_M.gguf -p "You are a helpful AI assistant." --conversation --ctx-size 4096``