Younes Belkada
change readme
2c7a271
|
raw
history blame
No virus
217 Bytes

Japanese Dummy Tokenizer

Repository containing a dummy Japanese Tokenizer trained on snow_simplified_japanese_corpus dataset. The tokenizer has been trained using Hugging Face datasets in a streaming manner.