cakiki yjernite HF staff committed on
Commit
0140768
1 Parent(s): 5556040

Add link to interactive corpus treemap (#26)

- Add link to interactive corpus treemap (26e50ada4d39c875fde0c486eb860e4aa4a65c4c)


Co-authored-by: Yacine Jernite <yjernite@users.noreply.huggingface.co>

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -175,7 +175,7 @@ Jean Zay Public Supercomputer, provided by the French government (see [announcem
  ## Training Data
  *This section provides a high-level overview of the training data. It is relevant for anyone who wants to know the basics of what the model is learning.*
 
- Details for each dataset are provided in individual [Data Cards](https://huggingface.co/spaces/bigscience/BigScienceCorpus).
+ Details for each dataset are provided in individual [Data Cards](https://huggingface.co/spaces/bigscience/BigScienceCorpus), and the sizes of each of their contributions to the aggregated training data are presented in an [Interactive Corpus Map](https://huggingface.co/spaces/bigscience-catalogue-lm-data/corpus-map).
 
  Training data includes:
 