TheBloke
/

tulu-30B-fp16

Text Generation

text-generation-inference

Model card Files Files and versions Community

TheBloke commited on Jun 10, 2023

Commit

b1bff7c

•

1 Parent(s): 452eb83

Update README.md

Files changed (1) hide show

README.md +24 -0

README.md CHANGED Viewed

@@ -29,6 +29,30 @@ It is the result of merging and/or converting the source repository to float16.
 * [2, 3, 4, 5, 6 and 8-bit GGML models for CPU+GPU inference](https://huggingface.co/TheBloke/tulu-30B-GGML)
 * [Unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/TheBloke/tulu-30B-fp16)
 <!-- footer start -->
 ## Discord

 * [2, 3, 4, 5, 6 and 8-bit GGML models for CPU+GPU inference](https://huggingface.co/TheBloke/tulu-30B-GGML)
 * [Unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/TheBloke/tulu-30B-fp16)
+## Prompt template
+According to the original model's README, the following template should be used:
+```
+<|user|>
+prompt goes here
+<|assistant|>
+```
+However in my own testing, this seems to return no response at all.  But I do get good responses using:
+```
+### Instruction: prompt goes here
+### Response:
+```
+and
+```
+USER: prompt goes here
+ASSISTANT:
+```
 <!-- footer start -->
 ## Discord