Regarding Tokeniser

#19
by ashesblue - opened

Did you guys use EOT TOKEN while training the model ?

Salesforce org

Yes, <|endoftext|> is used as the document boundary during training.

rooa changed discussion status to closed

Sign up or log in to comment