Commit
•
0140768
1
Parent(s):
5556040
Add link to interactive corpus treemap (#26)
Browse files- Add link to interactive corpus treemap (26e50ada4d39c875fde0c486eb860e4aa4a65c4c)
Co-authored-by: Yacine Jernite <yjernite@users.noreply.huggingface.co>
README.md
CHANGED
@@ -175,7 +175,7 @@ Jean Zay Public Supercomputer, provided by the French government (see [announcem
|
|
175 |
## Training Data
|
176 |
*This section provides a high-level overview of the training data. It is relevant for anyone who wants to know the basics of what the model is learning.*
|
177 |
|
178 |
-
Details for each dataset are provided in individual [Data Cards](https://huggingface.co/spaces/bigscience/BigScienceCorpus).
|
179 |
|
180 |
Training data includes:
|
181 |
|
|
|
175 |
## Training Data
|
176 |
*This section provides a high-level overview of the training data. It is relevant for anyone who wants to know the basics of what the model is learning.*
|
177 |
|
178 |
+
Details for each dataset are provided in individual [Data Cards](https://huggingface.co/spaces/bigscience/BigScienceCorpus), and the sizes of each of their contributions to the aggregated training data are presented in an [Interactive Corpus Map](https://huggingface.co/spaces/bigscience-catalogue-lm-data/corpus-map).
|
179 |
|
180 |
Training data includes:
|
181 |
|