IDMGSP-Galactica-TRAIN / tokenizer_config.json
qwenzoo's picture
Update tokenizer_config.json
a1dec3e
raw
history blame
222 Bytes
{
"clean_up_tokenization_spaces": true,
"eos_token": "</s>",
"model_max_length": 2048,
"pad_token": "<pad>",
"tokenizer_class": "PreTrainedTokenizerFast",
"model_input_names": ["input_ids", "attention_mask"]
}