Special tokens support?

#3
by gsaivinay - opened

Hello,

Thanks for this awesome model.

Does this iteration support special tokens?

<|system|>
<|assistant|>
<|prefix_begin|>
<|prefix_end|>
<|prompter|>

If so, could we have some examples of how to use them?

Yes, it does seem to support those special tokens, although according to MESSAGE_AND_TOKEN_FORMAT not all of them are officially supported. I have used all of them in my projects so far and they have been working well. This is what I've done:
<|system|> - Used to let the model know what it is supposed to do. I don't know if you are meant to include the <|endoftext|> token afterwards, but it seems to be working well for me so far.
Ex: <|system|>You are an AI that summarizes a conversation in as few words as possible.<|endoftext|>

<|assistant|> - The basic token letting the model know it needs to start acting like the assistant.
Ex: <|prompter|>What is a meme, and what's the history behind this word?<|endoftext|><|assistant|>

<|prefix_begin|> & <|prefix_end|> - According to the GitHub repo, these aren't implemented yet, but I've been using them and they seem to be working well for me so far.
Ex: <|prefix_begin|>"Rice is a common grain found in various cuisines worldwide." "There exist multiple types of rices including long grain, medium grain, etcetera." "There are numerous recipes one could make from rice depending on their preferences."<|prefix_end|><|prompter|>What is the subject of the text in 1 word?<|endoftext|><|assistant|> -> Food and drink<|endoftext|>

<|prompter|> - Marks the start of the user's message. I think you can infer the usage from the previous examples.
Ex: <|prompter|>What is a meme, and what's the history behind this word?<|endoftext|><|assistant|>
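
If it helps, here is a minimal Python sketch of how I'd put those pieces together into one prompt string. The helper name `build_prompt` and its parameters are made up for illustration, and the order (system, then prefix, then prompter) is only my assumption; the examples above never combine a system message with a prefix, and I can't confirm this is the official format.

```python
# Hypothetical helper: assembles a prompt from the special tokens discussed above.
# The token order and the <|endoftext|> after the system message are assumptions
# based on what has worked in practice, not on confirmed documentation.
EOT = "<|endoftext|>"


def build_prompt(user_message, system=None, prefix=None):
    parts = []
    if system is not None:
        # System instruction, terminated with <|endoftext|> (unconfirmed).
        parts.append(f"<|system|>{system}{EOT}")
    if prefix is not None:
        # Context text wrapped in the prefix tokens (reportedly not implemented yet).
        parts.append(f"<|prefix_begin|>{prefix}<|prefix_end|>")
    # The user's message, then the token that cues the model to answer.
    parts.append(f"<|prompter|>{user_message}{EOT}")
    parts.append("<|assistant|>")
    return "".join(parts)


if __name__ == "__main__":
    print(build_prompt(
        "What is a meme, and what's the history behind this word?",
        system="You are an AI that summarizes a conversation in as few words as possible.",
    ))
```

The returned string is what you would pass to the tokenizer; generation should then continue after the trailing <|assistant|> token.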
