tokenizer-arena / stats /compress_rate /bert_base_uncased.zh-Hans.json
xu-song's picture
update compress rate
988921c
raw
history blame
80 Bytes
{"vocab_size": 30522, "n_bytes": 2633047, "n_tokens": 898554, "n_chars": 927311}