yhavinga commited on
Commit
073e227
1 Parent(s): 043ba5a

Autoupdate README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -4
README.md CHANGED
@@ -115,10 +115,7 @@ Therefore, the model can have biased predictions. This bias will also affect all
115
  The `ul2-large-dutch` T5 model was pre-trained simultaneously on a combination of several datasets,
116
  including the full version of the "mc4_nl_cleaned" dataset, which is a cleaned version of Common Crawl's web
117
  crawl corpus, Dutch books, the Dutch subset of Wikipedia (2022-03-20), and a subset of "mc4_nl_cleaned"
118
- containing only texts from Dutch and Belgian newspapers. This last dataset is oversampled to bias the model
119
- towards descriptions of events in the Netherlands and Belgium.
120
-
121
-
122
 
123
  ## Training procedure
124
 
115
  The `ul2-large-dutch` T5 model was pre-trained simultaneously on a combination of several datasets,
116
  including the full version of the "mc4_nl_cleaned" dataset, which is a cleaned version of Common Crawl's web
117
  crawl corpus, Dutch books, the Dutch subset of Wikipedia (2022-03-20), and a subset of "mc4_nl_cleaned"
118
+ containing only texts from Dutch newspapers.
 
 
 
119
 
120
  ## Training procedure
121