tokenizer-arena / stats /compress_rate /bert_base_chinese.en.json
xu-song's picture
update compress rate
988921c
raw
history blame
81 Bytes
{"vocab_size": 21128, "n_bytes": 1124813, "n_tokens": 377068, "n_chars": 1121360}