baby_llama / tokenizer_config.json
bbunzeck's picture
Upload 8 files
314d211 verified
raw
history blame
234 Bytes
{
"bos_token": "<|endoftext|>",
"clean_up_tokenization_spaces": true,
"eos_token": "<|endoftext|>",
"model_max_length": 1000000000000000019884624838656,
"pad_token": "<pad>",
"tokenizer_class": "PreTrainedTokenizerFast"
}