telugu_bertu / tokenizer_config.json
Update tokenizer_config.json
d84c59e
191 Bytes
{"unk_token": "[UNK]", "sep_token": "[SEP]", "pad_token": "[PAD]", "cls_token": "[CLS]", "mask_token": "[MASK]", "clean_text": false, "handle_chinese_chars": false, "wordpieces_prefix": "##"}
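A minimal sketch of how these settings might be read and used, using only the Python standard library. The key names come from the file above; the helper `join_wordpieces` is hypothetical and written only for illustration. The reading of `wordpieces_prefix` as the marker for a word-continuation piece follows the usual WordPiece convention, and is an assumption about how this particular model consumes it.

```python
import json

# The configuration exactly as it appears in tokenizer_config.json above.
CONFIG_JSON = (
    '{"unk_token": "[UNK]", "sep_token": "[SEP]", "pad_token": "[PAD]", '
    '"cls_token": "[CLS]", "mask_token": "[MASK]", "clean_text": false, '
    '"handle_chinese_chars": false, "wordpieces_prefix": "##"}'
)

config = json.loads(CONFIG_JSON)

# Collect the special tokens: every key ending in "_token".
special_tokens = {k: v for k, v in config.items() if k.endswith("_token")}


def join_wordpieces(pieces, prefix=config["wordpieces_prefix"]):
    """Hypothetical helper: merge WordPiece fragments back into whole words.

    Assumes the usual convention that a piece starting with the prefix
    continues the previous word rather than starting a new one.
    """
    words = []
    for piece in pieces:
        if piece.startswith(prefix) and words:
            # Continuation piece: strip the prefix and append to the last word.
            words[-1] += piece[len(prefix):]
        else:
            words.append(piece)
    return words


if __name__ == "__main__":
    print(special_tokens)
    print(join_wordpieces(["tok", "##en", "##izer", "config"]))
```

Both boolean flags are `false` here, so under this reading the tokenizer would neither clean the input text nor apply special handling to Chinese characters before splitting. That is plausible for a Telugu model, though the file itself does not state the reason.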