mathemakitten committed on
Commit
5dca946
1 Parent(s): afe2e6f

Update vocab size

Per https://huggingface.co/bigscience/bloom-560m/blob/main/config.json, the vocab size is 250880, not 250680.
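
A minimal sketch of how the cited value could be checked, assuming the config is a JSON document with an integer `vocab_size` field. The helper name `read_vocab_size` and the one-key `EXCERPT` are hypothetical stand-ins for illustration; the real `config.json` has more keys and is not reproduced here.

```python
import json


def read_vocab_size(config_text: str) -> int:
    """Return the integer `vocab_size` field from a config.json document."""
    config = json.loads(config_text)
    return int(config["vocab_size"])


# Hypothetical excerpt: only the field cited in this commit is shown.
EXCERPT = '{"vocab_size": 250880}'

size = read_vocab_size(EXCERPT)

# Format the number with thousands separators, as the README bullet does.
readme_line = f"- A vocabulary size of {size:,}"
print(readme_line)
```

To check against the live file, the same function could be given the downloaded text of `config.json` instead of the excerpt.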

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -191,7 +191,7 @@ The BLOOM tokenizer ([link](https://huggingface.co/bigscience/tokenizer)) is a l
 
  - A simple pre-tokenization rule, no normalization
 
- - A vocabulary size of 250,680
+ - A vocabulary size of 250,880
 
  It was trained on a subset of a preliminary version of the corpus using alpha-weighting per language.
 