tokenizer-arena/stats/compress_rate/chinese_llama.en.json
update compress rate
988921c
81 Bytes
{"vocab_size": 49953, "n_bytes": 1124813, "n_tokens": 291514, "n_chars": 1121360}