large_base_M07 / vocab.json
Commit 96c000c by mprzibilla: add tokenizer
250 Bytes
{"t": 0, "d": 1, "k": 2, "b": 3, "a": 4, "u": 5, "e": 6, "q": 7, "i": 8, "g": 9, "v": 10, "r": 11, "o": 12, "m": 13, "n": 14, "h": 15, "c": 16, "p": 17, "w": 19, "y": 20, "l": 21, "x": 22, "j": 23, "f": 24, "s": 25, "|": 18, "[UNK]": 26, "[PAD]": 27}