Spaces: xu-song/tokenizer-arena
988921c
tokenizer-arena/vocab/gpt_nexo_20b
2 contributors
History: 4 commits
Latest commit: xu-song, "add compress rate" (814ee6b), 7 months ago
Name                     Scan  Size       Last commit        Updated
tokenizer                -     -          update             about 1 year ago
20B_tokenizer.json       Safe  2.47 MB    update             about 1 year ago
20B_tokenizer.zh.json    Safe  2.11 MB    update             about 1 year ago
README.md                Safe  1.69 kB    add compress rate  7 months ago
__init__.py              Safe  717 Bytes  update             about 1 year ago
convert_vocab_to_txt.py  Safe  459 Bytes  update             about 1 year ago
test_gpt_neox_20b.py     Safe  2.96 kB    update             about 1 year ago
test_hf_gpt_neox.py      Safe  487 Bytes  update             about 1 year ago
test_oov.py              Safe  505 Bytes  update             about 1 year ago
test_special_token.py    Safe  340 Bytes  update             about 1 year ago
test_tokenizer.py        Safe  3.39 kB    add compress rate  7 months ago
test_tokenizer_HF.py     Safe  1.3 kB     update             about 1 year ago
test_zh_coding_len.py    Safe  447 Bytes  update             about 1 year ago
vocab.zh.txt             Safe  9.34 kB    update             about 1 year ago