stefan-it's picture
tokenizer: add config (no accent stripping) and vocab
6cd52d3
File too large to display, you can check the raw version instead.