TheBloke
/

zephyr_7b_norobots-GGUF

Model card Files Files and versions Community

TheBloke commited on Nov 20, 2023

Commit

3dfaa85

•

1 Parent(s): 40ada5b

Upload README.md

Files changed (1) hide show

README.md +5 -16

README.md CHANGED Viewed

@@ -8,15 +8,8 @@ license: apache-2.0
 model_creator: MonsterAPI
 model_name: Zephyr 7B Norobots
 model_type: mistral
-prompt_template: '<|im_start|>system
-  {system_message}<|im_end|>
-  <|im_start|>user
-  {prompt}<|im_end|>
-  <|im_start|>assistant
   '
 quantized_by: TheBloke
@@ -85,14 +78,10 @@ Here is an incomplete list of clients and libraries that are known to support GG
 <!-- repositories-available end -->
 <!-- prompt-template start -->
-## Prompt template: ChatML
 ```
-<|im_start|>system
-{system_message}<|im_end|>
-<|im_start|>user
-{prompt}<|im_end|>
-<|im_start|>assistant
 ```
@@ -211,7 +200,7 @@ Windows Command Line users: You can set the environment variable by running `set
 Make sure you are using `llama.cpp` from commit [d0cee0d](https://github.com/ggerganov/llama.cpp/commit/d0cee0d36d5be95a0d9088b674dbb27354107221) or later.
 ```shell
-./main -ngl 32 -m zephyr_7b_norobots.Q4_K_M.gguf --color -c 2048 --temp 0.7 --repeat_penalty 1.1 -n -1 -p "<|im_start|>system\n{system_message}<|im_end|>\n<|im_start|>user\n{prompt}<|im_end|>\n<|im_start|>assistant"
 ```
 Change `-ngl 32` to the number of layers to offload to GPU. Remove it if you don't have GPU acceleration.

 model_creator: MonsterAPI
 model_name: Zephyr 7B Norobots
 model_type: mistral
+prompt_template: '<|system|> </s> <|user|> {prompt} </s> <|assistant|> {{response}}
+  </s>
   '
 quantized_by: TheBloke
 <!-- repositories-available end -->
 <!-- prompt-template start -->
+## Prompt template: NoRobots
 ```
+<|system|> </s> <|user|> {prompt} </s> <|assistant|> {{response}} </s>
 ```
 Make sure you are using `llama.cpp` from commit [d0cee0d](https://github.com/ggerganov/llama.cpp/commit/d0cee0d36d5be95a0d9088b674dbb27354107221) or later.
 ```shell
+./main -ngl 32 -m zephyr_7b_norobots.Q4_K_M.gguf --color -c 2048 --temp 0.7 --repeat_penalty 1.1 -n -1 -p "<|system|> </s> <|user|> {prompt} </s> <|assistant|> {{response}} </s>"
 ```
 Change `-ngl 32` to the number of layers to offload to GPU. Remove it if you don't have GPU acceleration.