Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Spaces:
eson
/
tokenizer-arena
like
53
Running
App
Files
Files
Community
1
a173fe5
tokenizer-arena
/
vocab
/
glm_chinese
1 contributor
History:
1 commit
eson
update
751936e
12 months ago
chinese_sentencepiece
update
12 months ago
README.md
487 Bytes
update
12 months ago
__init__.py
827 Bytes
update
12 months ago
convert_vocab_to_txt.py
689 Bytes
update
12 months ago
file_utils.py
8.38 kB
update
12 months ago
glm_chinese.vocab.txt
659 kB
update
12 months ago
sp_tokenizer.py
4.67 kB
update
12 months ago
test.py
65 Bytes
update
12 months ago
test_glm.py
2.5 kB
update
12 months ago
tokenization.py
51.9 kB
update
12 months ago
tokenization_gpt2.py
13.5 kB
update
12 months ago
utils.py
213 Bytes
update
12 months ago
wordpiece.py
15.5 kB
update
12 months ago