Experiment with different tokenizers and compare them at the character level and the byte level.
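
As a starting point for the comparison, here is a minimal sketch in Python. It assumes that "char-level" means one token per Unicode code point and "byte-level" means one token per UTF-8 byte. The function names (`char_tokenize`, `byte_tokenize`, `compare`) and the sample string are hypothetical and chosen only for illustration.

```python
def char_tokenize(text: str) -> list[int]:
    """Character-level: one token per Unicode code point (assumed definition)."""
    return [ord(ch) for ch in text]


def byte_tokenize(text: str) -> list[int]:
    """Byte-level: one token per UTF-8 byte, so token ids stay within 0..255."""
    return list(text.encode("utf-8"))


def compare(text: str) -> dict:
    """Report sequence length under each scheme for the same input."""
    chars = char_tokenize(text)
    byts = byte_tokenize(text)
    return {
        "char_tokens": len(chars),
        "byte_tokens": len(byts),
        "bytes_per_char": len(byts) / len(chars) if chars else 0.0,
    }


# Illustrative sample mixing ASCII, a Latin accent, and CJK characters.
sample = "h\u00e9llo \u4e16\u754c"
stats = compare(sample)
print(stats)
```

For pure ASCII input the two schemes produce identical sequences. They diverge on non-ASCII text, where the byte-level sequence is longer but the set of possible token ids is fixed at 256, while the character-level set of ids grows with the number of distinct characters seen.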