zirui3 committed on
Commit 492f8a3 · 1 Parent(s): 84c361f

Upload README.md

Files changed (1)
  1. README.md +2 -0
README.md CHANGED
@@ -2,3 +2,5 @@
 
  # summary
  multilingual tokenizer trained on multilingual data by using the SentencePiece library and the BPE algorithm.
+
+ * vocab size: 10k
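
For readers who want to reproduce a tokenizer like the one this README describes, here is a minimal sketch of a SentencePiece BPE training call. It assumes that "10k" means a vocabulary of 10,000 pieces. The corpus path, model prefix, and `character_coverage` value are hypothetical; the commit does not state them, and this is not necessarily how the author trained the model.

```python
from pathlib import Path

# Assumption: "10k" in the README means a vocabulary of 10,000 pieces.
VOCAB_SIZE = 10_000


def build_training_args(corpus_path: str, model_prefix: str) -> dict:
    """Collect keyword arguments for SentencePieceTrainer.train."""
    return {
        "input": corpus_path,          # plain-text file, one sentence per line
        "model_prefix": model_prefix,  # writes <prefix>.model and <prefix>.vocab
        "vocab_size": VOCAB_SIZE,
        "model_type": "bpe",           # the README names the BPE algorithm
        # Hypothetical choice; the README does not state a coverage value.
        "character_coverage": 0.9995,
    }


def train_tokenizer(corpus_path: str, model_prefix: str) -> Path:
    """Train a BPE model and return the path of the written .model file."""
    # Imported lazily so the argument builder works without the package;
    # training itself requires `pip install sentencepiece`.
    import sentencepiece as spm

    spm.SentencePieceTrainer.train(**build_training_args(corpus_path, model_prefix))
    return Path(f"{model_prefix}.model")
```

Calling `train_tokenizer("corpus.txt", "multilingual_bpe")` on a sufficiently large multilingual corpus would write `multilingual_bpe.model`, which can then be loaded with `sentencepiece.SentencePieceProcessor(model_file=...)`.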