add tokenizer
e3884d4
{"a": 1, "b": 2, "c": 3, "d": 4, "e": 5, "f": 6, "g": 7, "h": 8, "i": 9, "k": 10, "l": 11, "m": 12, "n": 13, "o": 14, "p": 15, "q": 16, "r": 17, "s": 18, "t": 19, "u": 20, "x": 21, "y": 22, "z": 23, "|": 0, "[UNK]": 24, "[PAD]": 25}