WizardLM committed on
Commit d5856bb
1 Parent(s): 5a58ba3

Update README.md

Files changed (1)
  1. README.md +10 -5
README.md CHANGED
@@ -44,17 +44,22 @@ GPU acceleration is now available for Llama 2 70B GGML files, with both CUDA (NV
 * [2, 3, 4, 5, 6 and 8-bit GGML models for CPU+GPU inference](https://huggingface.co/TheBloke/WizardMath-70B-V1.0-GGML)
 * [WizardLM's original unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/WizardLM/WizardMath-70B-V1.0)
 
-## Prompt template: Alpaca-CoT
-
-```
-Below is an instruction that describes a task. Write a response that appropriately completes the request.
-
-
-### Instruction:
-{prompt}
-
-
-### Response: Let's think step by step.
-```
+## Prompt template:
+
+❗<b>Note for model system prompts usage:</b>
+
+**Default version:**
+
+```
+"Below is an instruction that describes a task. Write a response that appropriately completes the request.\n\n### Instruction:\n{instruction}\n\n### Response:"
+```
+
+
+**CoT Version:** (❗For the **simple** math questions, we do NOT recommend to use the CoT prompt.)
+
+
+
+```
+"Below is an instruction that describes a task. Write a response that appropriately completes the request.\n\n### Instruction:\n{instruction}\n\n### Response: Let's think step by step."
+```
 
 <!-- compatibility_ggml start -->
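
The two templates added in this commit can be filled in with plain string formatting. A minimal sketch follows; the template strings are copied verbatim from the diff, while the `build_prompt` helper and the sample instructions are illustrative only and not part of the repository:

```python
# Template strings taken verbatim from the updated README above.
DEFAULT_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request."
    "\n\n### Instruction:\n{instruction}\n\n### Response:"
)

# The CoT version only appends the step-by-step cue after "### Response:".
COT_TEMPLATE = DEFAULT_TEMPLATE + " Let's think step by step."


def build_prompt(instruction: str, cot: bool = False) -> str:
    """Return the full prompt for one instruction.

    The README advises against the CoT template for simple math
    questions, so cot defaults to False.
    """
    template = COT_TEMPLATE if cot else DEFAULT_TEMPLATE
    return template.format(instruction=instruction)


print(build_prompt("What is 7 * 6?"))
print(build_prompt("Prove that the square root of 2 is irrational.", cot=True))
```

The same strings can be passed straight to a text-generation pipeline; the only moving part is the `{instruction}` placeholder.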