stas commited on
Commit
f7b3de8
1 Parent(s): 8ad08c2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -6,7 +6,8 @@ records on each checkpoint saving.
6
  The training had 168000 iterations. Therefore multiply the reported data by 67. This would be quite approximate since we were using 16 nodes when doing
7
  the ramp up, then 64 and only the last 3 weeks 128 nodes.
8
 
9
- Caveat emptor: I'm not sure whether CC-reports overlap since each report is per gpu and I think they may be measuring the same thing, other than the gpu itself. So this requires research.
 
10
 
11
  Each csv file contains a report for a single gpu.
12
 
6
  The training had 168000 iterations. Therefore multiply the reported data by 67. This would be quite approximate since we were using 16 nodes when doing
7
  the ramp up, then 64 and only the last 3 weeks 128 nodes.
8
 
9
+ Caveat emptor: I'm not sure whether CC-reports overlap since each report is per gpu and I think they may be measuring the same thing, other than the gpu itself.
10
+ So this requires research.
11
 
12
  Each csv file contains a report for a single gpu.
13