Text Generation
Transformers
PyTorch
English
llama
text-generation-inference
TheBloke committed
Commit 266771b
1 Parent(s): 81c7196

Initial FP16 model commit

Files changed (1)
  1. README.md +0 -24
README.md CHANGED
@@ -29,30 +29,6 @@ It is the result of merging and/or converting the source repository to float16.
  * [2, 3, 4, 5, 6 and 8-bit GGML models for CPU+GPU inference](https://huggingface.co/TheBloke/tulu-30B-GGML)
  * [Unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/TheBloke/tulu-30B-fp16)
 
- ## Prompt template
-
- According to the original model's README, the following template should be used:
-
- ```
- <|user|>
- prompt goes here
- <|assistant|>
- ```
-
- However in my own testing, this seems to return no response at all. But I do get good responses using:
-
- ```
- ### Instruction: prompt goes here
- ### Response:
- ```
-
- and
-
- ```
- USER: prompt goes here
- ASSISTANT:
- ```
-
  <!-- footer start -->
  ## Discord
 
 
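For reference, the three prompt formats in the removed "Prompt template" section can be sketched as plain string templates. This is a minimal illustration, not part of the commit: the names `TEMPLATES` and `build_prompt` are hypothetical, and the exact whitespace (in particular the trailing newline after `<|assistant|>`) is an assumption, since the README text does not spell it out.

```python
# Hypothetical helper, not taken from the repository.
# Template wording is copied from the removed README section;
# newline placement is an assumption.
TEMPLATES = {
    # Format recommended by the original model's README
    "tulu": "<|user|>\n{prompt}\n<|assistant|>\n",
    # Alternatives the README author reported good responses with
    "instruction": "### Instruction: {prompt}\n### Response:",
    "user_assistant": "USER: {prompt}\nASSISTANT:",
}


def build_prompt(prompt: str, style: str = "tulu") -> str:
    """Wrap a user prompt in one of the README's prompt formats."""
    if style not in TEMPLATES:
        raise ValueError(f"unknown prompt style: {style!r}")
    return TEMPLATES[style].format(prompt=prompt.strip())


if __name__ == "__main__":
    for name in TEMPLATES:
        print(f"--- {name} ---")
        print(build_prompt("prompt goes here", name))
```

Keeping the formats in one dictionary makes it easy to try each in turn against the same model, which matches how the removed section describes comparing them.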