tokenizer-arena/stats/compress_rate/bert_base_chinese.zh-Hans.json
{"vocab_size": 21128, "n_bytes": 2633047, "n_tokens": 896599, "n_chars": 927311}
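
The counts above can be turned into per-token ratios with simple division. The following is a minimal sketch, assuming the fields mean what their names suggest (total UTF-8 bytes, tokens, and characters of the zh-Hans sample). The function name `compression_ratios` and the output keys are hypothetical and are not taken from the tokenizer-arena code, which may define its compression rate differently.

```python
import json

# Stats copied verbatim from the JSON record above.
RAW_STATS = (
    '{"vocab_size": 21128, "n_bytes": 2633047, '
    '"n_tokens": 896599, "n_chars": 927311}'
)


def compression_ratios(stats: dict) -> dict:
    """Derive simple ratios from raw counts (hypothetical helper).

    Assumes n_bytes, n_tokens and n_chars all describe the same corpus.
    """
    n_bytes = stats["n_bytes"]
    n_tokens = stats["n_tokens"]
    n_chars = stats["n_chars"]
    if n_tokens <= 0 or n_chars <= 0:
        raise ValueError("n_tokens and n_chars must be positive")
    return {
        # Average number of UTF-8 bytes covered by one token.
        "bytes_per_token": n_bytes / n_tokens,
        # Average number of characters covered by one token.
        "chars_per_token": n_chars / n_tokens,
        # Average UTF-8 width of a character in the corpus.
        "bytes_per_char": n_bytes / n_chars,
    }


ratios = compression_ratios(json.loads(RAW_STATS))
for name, value in ratios.items():
    print(f"{name}: {value:.4f}")
```

With these counts, each token covers roughly one character and just under three bytes, which is what one would expect from a mostly character-level vocabulary applied to text where most characters occupy three bytes in UTF-8.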