TheBloke
/

tulu-30B-fp16

Text Generation

text-generation-inference

Model card Files Files and versions Community

TheBloke commited on Jun 10, 2023

Commit

266771b

•

1 Parent(s): 81c7196

Initial FP16 model commit

Files changed (1) hide show

README.md +0 -24

README.md CHANGED Viewed

@@ -29,30 +29,6 @@ It is the result of merging and/or converting the source repository to float16.
 * [2, 3, 4, 5, 6 and 8-bit GGML models for CPU+GPU inference](https://huggingface.co/TheBloke/tulu-30B-GGML)
 * [Unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/TheBloke/tulu-30B-fp16)
-## Prompt template
-According to the original model's README, the following template should be used:
-```
-<|user|>
-prompt goes here
-<|assistant|>
-```
-However in my own testing, this seems to return no response at all.  But I do get good responses using:
-```
-### Instruction: prompt goes here
-### Response:
-```
-and
-```
-USER: prompt goes here
-ASSISTANT:
-```
 <!-- footer start -->
 ## Discord

 * [2, 3, 4, 5, 6 and 8-bit GGML models for CPU+GPU inference](https://huggingface.co/TheBloke/tulu-30B-GGML)
 * [Unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/TheBloke/tulu-30B-fp16)
 <!-- footer start -->
 ## Discord