File size: 145 Bytes
47d7788
1
The trained BPE tokenziers with various vocabulary sizes, which uses to study how the vocabulary size affects the performance of language models.