reach-vb HF staff commited on
Commit
3fc028f
1 Parent(s): 677b9a7

Update README.md (#6)

Browse files

- Update README.md (87f5c8a59a2346abd425b2b5cdfdb8ba82718f69)

Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -274,8 +274,8 @@ Data used for model training and how the data was processed.
274
 
275
  ### Training Dataset
276
 
277
- These models were trained on a dataset of text data that includes a wide variety
278
- of sources, totaling 15 trillion tokens. Here are the key components:
279
 
280
  * Web Documents: A diverse collection of web text ensures the model is exposed
281
  to a broad range of linguistic styles, topics, and vocabulary. Primarily
 
274
 
275
  ### Training Dataset
276
 
277
+ These models were trained on a dataset of text data that includes a wide variety of sources. The 27B model was trained with 13 trillion tokens and the 9B model was trained with 8 trillion tokens.
278
+ Here are the key components:
279
 
280
  * Web Documents: A diverse collection of web text ensures the model is exposed
281
  to a broad range of linguistic styles, topics, and vocabulary. Primarily