Spaces: xu-song/tokenizer-arena
a37f943
tokenizer-arena/vocab/chinese_llama2/demo.py
xu-song: add more tokenizers (f4973d4, 11 months ago)
132 Bytes
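from vocab.chinese_llama2 import tokenizer

# Encode a string that mixes Chinese text with special-token literals
# (<s>, </s>, <pad>, <unk>) and print the resulting encoding.
encoding = tokenizer.encode("<s>开始</s>站位符<pad>试试<unk>")
print(encoding)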