mirari's picture
add tokenizer
0428e24
raw
history blame
268 Bytes
{"v": 0, "j": 1, "w": 3, "h": 4, "p": 5, "i": 6, "x": 7, "c": 8, "b": 9, "m": 10, "z": 11, "d": 12, "u": 13, "e": 14, "y": 15, "f": 16, "s": 17, "r": 18, "q": 19, "t": 20, "g": 21, "k": 22, "'": 23, "n": 24, "a": 25, "l": 26, "o": 27, "|": 2, "[UNK]": 28, "[PAD]": 29}