Commit f31023b (parent 776a4eb) by TheBloke

Update README.md

Files changed (1): README.md +4 -4
README.md CHANGED

@@ -36,7 +36,7 @@ It is the result of quantising to 4bit using [GPTQ-for-LLaMa](https://github.com
 * [2, 3, 4, 5, 6 and 8-bit GGML models for CPU+GPU inference](https://huggingface.co/TheBloke/orca_mini_7B-GGML)
 * [Unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/psmathur/orca_mini_7b)
 
-## Prompt template: Alpaca with system message
+## Prompt template:
 
 ```
 ### System:
@@ -45,7 +45,7 @@ You are an AI assistant that follows instruction extremely well. Help as much as
 ### User:
 prompt
 
-### Response
+### Response:
 ```
 or
 ```
@@ -55,10 +55,10 @@ You are an AI assistant that follows instruction extremely well. Help as much as
 ### User:
 prompt
 
-### Input
+### Input:
 input
 
-### Response
+### Response:
 ```
 
 ## How to easily download and use this model in text-generation-webui
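The template being corrected above can be assembled programmatically. The sketch below is illustrative only: the `build_prompt` helper and its parameter names are our own, not part of any library, and the system message is the one shown in the diff context. It covers both variants from the README (with and without an `### Input:` section), using the colon-terminated headers this commit introduces.

```python
# Hypothetical helper for building the Alpaca-style prompt from the README.
# The system message text is taken from the diff context above.
SYSTEM = ("You are an AI assistant that follows instruction extremely well. "
          "Help as much as you can.")

def build_prompt(user_prompt, input_text=None):
    """Join the template sections with blank lines, ending at '### Response:'
    so the model's generation continues from there."""
    parts = [f"### System:\n{SYSTEM}", f"### User:\n{user_prompt}"]
    if input_text is not None:
        # Second template variant: an extra ### Input: section.
        parts.append(f"### Input:\n{input_text}")
    parts.append("### Response:")
    return "\n\n".join(parts)

print(build_prompt("Summarise this text", "Some long passage"))
```

The exact amount of blank-line spacing between sections is an assumption read off the rendered template; models quantised from the same base are usually tolerant of minor whitespace differences.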