Pieter Delobelle committed on
1 Parent(s): 599308b

Updated vocab.json to include unused tokens.

The fast tokenizer complains when not all tokens from merges.txt are
included in vocab.json. To fix this, the unused tokens are appended to
the end of the vocabulary.
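
A minimal sketch of how such a fix could be produced; this is an illustration, not the script used for this commit. It assumes the usual BPE layout: vocab.json maps each token to an integer id, and each merges.txt line holds two space-separated tokens, with an optional "#version" header. The function name add_missing_merge_tokens is hypothetical. Missing tokens get fresh ids after the current maximum, so existing ids stay unchanged.

```python
import json


def add_missing_merge_tokens(vocab, merge_lines):
    """Return a copy of vocab with tokens implied by the merges appended at the end.

    Hypothetical helper: for every merge "left right", make sure that
    left, right and the merged token left+right all have an id.
    """
    updated = dict(vocab)  # leave the caller's dict untouched
    next_id = max(updated.values(), default=-1) + 1
    for line in merge_lines:
        line = line.rstrip("\n")
        # Skip blank lines and the optional "#version: ..." header.
        if not line or line.startswith("#version"):
            continue
        parts = line.split(" ")
        if len(parts) != 2:
            continue  # not a well-formed merge rule
        left, right = parts
        for token in (left, right, left + right):
            if token not in updated:
                updated[token] = next_id  # append at the back
                next_id += 1
    return updated


# Toy data standing in for the real vocab.json and merges.txt.
vocab = {"<s>": 0, "a": 1, "b": 2}
merges = ["#version: 0.2", "a b", "ab c"]
new_vocab = add_missing_merge_tokens(vocab, merges)
print(json.dumps(new_vocab))
```

For the real files, the same function could be fed json.load() of vocab.json and the lines of merges.txt, with the result written back using json.dump(..., ensure_ascii=False).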


