Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Spaces:
xu-song
/
tokenizer-arena
like
56
Running
App
Files
Files
Community
1
bce41d0
tokenizer-arena
2 contributors
History:
58 commits
xu-song
fix unicode error: 'unicodeescape' codec can't decode bytes in position 602-608: unknown Unicode character name
bce41d0
7 months ago
css
update
about 1 year ago
images
update
about 1 year ago
js
fix chatglm; new feature about add_special_tokens;
8 months ago
tokenizer
fix unicode error: 'unicodeescape' codec can't decode bytes in position 602-608: unknown Unicode character name
7 months ago
utils
add more tokenizer
8 months ago
vocab
fix unicode error: 'unicodeescape' codec can't decode bytes in position 602-608: unknown Unicode character name
7 months ago
.gitattributes
1.83 kB
add gemma_7b
8 months ago
.gitignore
171 Bytes
update
about 1 year ago
README.md
330 Bytes
update
8 months ago
app.py
7.08 kB
update
7 months ago
config.py
45 Bytes
fix chatglm; new feature about add_special_tokens;
8 months ago
evaluation.md
58 Bytes
update
about 1 year ago
examples.py
2.93 kB
update
7 months ago
requirements.txt
72 Bytes
add olmo tokenizer
7 months ago
util.py
6.22 kB
fix tiktoken
7 months ago