yjernite HF staff cakiki commited on
Commit
34aa644
1 Parent(s): eb49b9c

Correct number of languages (#24)

Browse files

- Correct number of languages (69e78440cdbd55e8588883927a460cd8aa222c24)
- Update README.md (0de6009ab9827a696a1d2fb1fcdedeb21088c6c0)


Co-authored-by: Christopher Akiki <cakiki@users.noreply.huggingface.co>

Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -179,11 +179,11 @@ Details for each dataset are provided in individual [Data Cards](https://hugging
179
 
180
  Training data includes:
181
 
182
- - 45 natural languages
183
 
184
- - 12 programming languages
185
 
186
- - In 1.5TB of pre-processed text, converted into 350B unique tokens (see [the tokenizer section](#tokenization) for more.)
187
 
188
  ### Languages
189
 
179
 
180
  Training data includes:
181
 
182
+ - 46 natural languages
183
 
184
+ - 13 programming languages
185
 
186
+ - In 1.6TB of pre-processed text, converted into 350B unique tokens (see [the tokenizer section](#tokenization) for more.)
187
 
188
  ### Languages
189