TheBloke/OpenOrcaxOpenChat-Preview2-13B-GPTQ

Text Generation

text-generation-inference

4-bit precision


TheBloke committed on Aug 3, 2023

Commit b744d51 • 1 Parent(s): 6666239

Initial GPTQ model commit

Files changed (1)

README.md +3 -3


@@ -44,10 +44,10 @@ Multiple GPTQ parameter permutations are provided; see Provided Files below for
 * [2, 3, 4, 5, 6 and 8-bit GGML models for CPU+GPU inference](https://huggingface.co/TheBloke/OpenOrcaxOpenChat-Preview2-13B-GGML)
 * [Open-Orca's original unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/Open-Orca/OpenOrcaxOpenChat-Preview2-13B)
-## Prompt template: TBC
+## Prompt template: OpenChat Llama2 V1
 ```
-Info on prompt template will be added shortly.
+User: {prompt}<|end_of_turn|>Assistant:
 ```
 ## Provided files
@@ -141,7 +141,7 @@ model = AutoGPTQForCausalLM.from_quantized(model_name_or_path,
 """
 prompt = "Tell me about AI"
-prompt_template=f'''Info on prompt template will be added shortly.
+prompt_template=f'''User: {prompt}<|end_of_turn|>Assistant:
 '''
 print("\n\n*** Generate:")
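
For reference, the single-turn template introduced in this commit can be applied with a small helper. This is a minimal sketch, not part of the model card: `build_prompt` and `END_OF_TURN` are hypothetical names chosen for illustration, and only the template string itself comes from the commit. The README's triple-quoted string also carries a trailing newline after `Assistant:`, which this sketch omits.

```python
# Hypothetical helper; only the template text is taken from the commit.
END_OF_TURN = "<|end_of_turn|>"


def build_prompt(user_message: str) -> str:
    """Wrap one user message in the 'OpenChat Llama2 V1' template."""
    return f"User: {user_message}{END_OF_TURN}Assistant:"


if __name__ == "__main__":
    # Same example prompt as the README snippet.
    print(build_prompt("Tell me about AI"))
```

Keeping the end-of-turn marker in one constant makes it harder to mistype the token when the template is reused in several places.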