rembert / tokenizer_config.json
iwontbecreative
Update tokenizer config to match latest bugfixes, add tokenizer.json
e33b2da
263 Bytes
{"do_lower_case": false, "remove_space": true, "keep_accents": true, "bos_token": "[CLS]", "eos_token": "[SEP]", "unk_token": "[UNK]", "sep_token": "[SEP]", "pad_token": "[PAD]", "cls_token": "[CLS]", "mask_token": "[MASK]", "tokenizer_class": "RemBertTokenizer"}
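
As a rough illustration of how this config can be read, the sketch below parses the JSON text with Python's standard `json` module and collects the special-token entries. The helper name `load_special_tokens` and the constant `CONFIG_TEXT` are hypothetical and exist only for this example. It does not use the `transformers` library, which is presumably what consumes the `tokenizer_class` field.

```python
import json

# Config text copied verbatim from the file above.
CONFIG_TEXT = (
    '{"do_lower_case": false, "remove_space": true, "keep_accents": true, '
    '"bos_token": "[CLS]", "eos_token": "[SEP]", "unk_token": "[UNK]", '
    '"sep_token": "[SEP]", "pad_token": "[PAD]", "cls_token": "[CLS]", '
    '"mask_token": "[MASK]", "tokenizer_class": "RemBertTokenizer"}'
)


def load_special_tokens(config_text):
    """Hypothetical helper: return only the keys that end in '_token'."""
    config = json.loads(config_text)
    return {key: value for key, value in config.items() if key.endswith("_token")}


config = json.loads(CONFIG_TEXT)
special_tokens = load_special_tokens(CONFIG_TEXT)

# The config reuses BERT-style markers: bos/cls share one string, eos/sep another.
print(special_tokens["bos_token"] == special_tokens["cls_token"])
print(special_tokens["eos_token"] == special_tokens["sep_token"])
print(config["tokenizer_class"])
```

Reading the file this way shows that the seven `*_token` keys map onto only five distinct strings, because `bos_token` and `eos_token` duplicate `cls_token` and `sep_token`.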