stefan-it
tokenizer add config (no accent stripping) and vocab
2c19cbc
{"do_lower_case": true, "max_len": 512, "init_inputs": [], "strip_accents": false}
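
A minimal sketch of what the combination `"do_lower_case": true` and `"strip_accents": false` is presumably meant to achieve: text is lowercased, but accented characters and umlauts are kept. It uses only the Python standard library. The `normalize` helper and the sample string are hypothetical illustrations, not the tokenizer's actual code, and the accent-stripping step (NFD decomposition followed by dropping combining marks) is an assumption about how such stripping is commonly done.

```python
import json
import unicodedata

# Config values copied from the file above.
CONFIG = json.loads(
    '{"do_lower_case": true, "max_len": 512, "init_inputs": [], "strip_accents": false}'
)


def normalize(text, do_lower_case, strip_accents):
    """Hypothetical helper: lowercase and optionally strip accents."""
    if do_lower_case:
        text = text.lower()
    if strip_accents:
        # Assumption: decompose characters (NFD), then drop combining marks (category Mn).
        text = unicodedata.normalize("NFD", text)
        text = "".join(ch for ch in text if unicodedata.category(ch) != "Mn")
    return text


# With this config: lowercased, umlauts preserved.
kept = normalize("Über Äpfel", CONFIG["do_lower_case"], CONFIG["strip_accents"])

# For comparison only: the same input with accent stripping forced on.
stripped = normalize("Über Äpfel", CONFIG["do_lower_case"], True)

print(kept)
print(stripped)
```

Keeping accents matters for languages where they distinguish words; with stripping enabled, distinct vocabulary entries could collapse into one.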