bigscience-bot committed on
Commit
f3719aa
1 Parent(s): a4eb31b
Files changed (1)
  1. logs/main_log.txt +9 -0
logs/main_log.txt CHANGED
@@ -16163,3 +16163,12 @@ time (ms) | save-checkpoint: 1490.10
  time (ms)
  iteration 42400/ 152972 | consumed samples: 16629184 | elapsed time per iteration (ms): 6179.2 | learning rate: 1.773E-04 | global batch size: 512 | lm loss: 2.945879E+00 | loss scale: 262144.0 | grad norm: 27153.375 | num zeros: 0.0 | number of skipped iterations: 0 | number of nan iterations: 0 |
  time (ms)
+ iteration 42600/ 152972 | consumed samples: 16731584 | elapsed time per iteration (ms): 6185.8 | learning rate: 1.770E-04 | global batch size: 512 | lm loss: 2.938939E+00 | loss scale: 262144.0 | grad norm: 25700.885 | num zeros: 0.0 | number of skipped iterations: 0 | number of nan iterations: 0 |
+ time (ms)
+ iteration 42800/ 152972 | consumed samples: 16833984 | elapsed time per iteration (ms): 6163.8 | learning rate: 1.768E-04 | global batch size: 512 | lm loss: 2.940046E+00 | loss scale: 524288.0 | grad norm: 49709.805 | num zeros: 0.0 | number of skipped iterations: 0 | number of nan iterations: 0 |
+ time (ms)
+ iteration 43000/ 152972 | consumed samples: 16936384 | elapsed time per iteration (ms): 6177.1 | learning rate: 1.765E-04 | global batch size: 512 | lm loss: 2.939341E+00 | loss scale: 524288.0 | grad norm: 47217.024 | num zeros: 0.0 | number of skipped iterations: 0 | number of nan iterations: 0 |
+ time (ms)
+ -------------------------------------------------------------------------------------------------
+ validation loss at iteration 43000 | lm loss value: 2.885082E+00 | lm loss PPL: 1.790504E+01 |
+ -------------------------------------------------------------------------------------------------
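
The fields in the log lines above are pipe-delimited `name: value` pairs, so they can be cross-checked mechanically. The following is a minimal sketch, not part of the training code: `parse_fields` and `parse_iteration` are hypothetical helper names introduced here for illustration. It assumes the reported perplexity is exp(lm loss) and that consumed samples advance by the global batch size on every iteration. Both assumptions are consistent with the values logged in this diff.

```python
import math
import re


def parse_fields(line):
    """Split a pipe-delimited log line into a {name: value-string} dict.

    Hypothetical helper: segments without a colon (such as the leading
    "iteration 42600/ 152972" part) are skipped.
    """
    fields = {}
    for part in line.split("|"):
        if ":" in part:
            key, value = part.split(":", 1)
            fields[key.strip()] = value.strip()
    return fields


def parse_iteration(line):
    """Return (current, total) from an 'iteration N/ M' prefix, or None."""
    match = re.search(r"iteration\s+(\d+)/\s*(\d+)", line)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


# Lines copied from the diff above, shortened to the fields that are used.
prev_line = " iteration 42400/ 152972 | consumed samples: 16629184 | global batch size: 512 |"
next_line = " iteration 42600/ 152972 | consumed samples: 16731584 | global batch size: 512 |"
val_line = " validation loss at iteration 43000 | lm loss value: 2.885082E+00 | lm loss PPL: 1.790504E+01 |"

# Assumed relation 1: perplexity is exp(loss).
val = parse_fields(val_line)
loss = float(val["lm loss value"])
ppl = float(val["lm loss PPL"])
assert math.isclose(math.exp(loss), ppl, rel_tol=1e-4)

# Assumed relation 2: samples consumed = iterations elapsed * global batch size.
prev_fields = parse_fields(prev_line)
next_fields = parse_fields(next_line)
iters_elapsed = parse_iteration(next_line)[0] - parse_iteration(prev_line)[0]
samples_elapsed = int(next_fields["consumed samples"]) - int(prev_fields["consumed samples"])
assert samples_elapsed == iters_elapsed * int(next_fields["global batch size"])
```

Values are kept as strings until a specific field is converted, since the log mixes integers, decimals, and scientific notation.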