Commit 9618dbc by stas (1 parent: 36b1f8a)

Update README.md

Files changed (1): README.md (+6 -2)
@@ -1,4 +1,8 @@
- - indices - these are shuffled indices that the training was using using. they were generated the first time the training started. So the order is the same if one replays them via the dataloader
- the corresponding dataset is oscar-en that's on JZ at `$six_ALL_CCFRWORK/datasets-custom/oscar-en`
+ This data is from [13B-en training](https://github.com/bigscience-workshop/bigscience/tree/master/train/tr1-13B-base)

+
+
+ - indices - these are shuffled indices that the training was using. They were generated the first time the training started. So the order is the same if one replays them via the dataloader w/o actually doing the training steps.
+
+ - the corresponding dataset is oscar-en that's on JZ at `$six_ALL_CCFRWORK/datasets-custom/oscar-en`
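
The replay property described in the updated README (the sample order is fixed once the shuffled indices exist, so it can be walked again without running any training steps) can be illustrated with a minimal sketch. This is not the training code from the linked repository: the helper names `build_shuffled_indices` and `replay_batches`, the seed, and the use of a NumPy permutation are all assumptions made for illustration, and the actual on-disk format of the indices is not specified in this commit.

```python
import numpy as np


def build_shuffled_indices(num_samples: int, seed: int) -> np.ndarray:
    """Hypothetical stand-in: create the shuffled sample order once."""
    rng = np.random.default_rng(seed)
    return rng.permutation(num_samples)


def replay_batches(indices: np.ndarray, batch_size: int):
    """Yield batches of sample indices in the stored order.

    No model, optimizer, or training step is involved; this only
    walks the index array the way a dataloader sampler would.
    """
    for start in range(0, len(indices), batch_size):
        yield indices[start:start + batch_size]


# Assumed toy sizes; the real dataset is far larger.
indices = build_shuffled_indices(num_samples=10, seed=1234)

# Replaying twice visits the samples in exactly the same order.
first_run = [batch.tolist() for batch in replay_batches(indices, batch_size=4)]
second_run = [batch.tolist() for batch in replay_batches(indices, batch_size=4)]
```

Because the order lives in the stored array rather than in any training state, one can look up which samples a given batch contained by slicing the array directly.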