bigscience-bot committed on
Commit
d0c1ccd
1 Parent(s): b73d700
Files changed (1)
  1. logs/main_log.txt +7 -0
logs/main_log.txt CHANGED
@@ -40628,3 +40628,10 @@ time (ms)
40628
  steps: 110000 loss: 2.7673 iter time (s): 0.003 samples/sec: 172747.511
40629
  iteration 110000/ 152972 | consumed samples: 51240384 | elapsed time per iteration (ms): 5934.5 | learning rate: 4.982E-05 | global batch size: 512 | lm loss: 2.781886E+00 | loss scale: 524288.0 | grad norm: 51011.958 | num zeros: 0.0 | number of skipped iterations: 0 | number of nan iterations: 0 |
40630
  time (ms)
40631
+ --------------------------------------------------------------------------------------------------
40632
+ validation loss at iteration 110000 | lm loss value: 2.730394E+00 | lm loss PPL: 1.533892E+01 |
40633
+ --------------------------------------------------------------------------------------------------
40634
+ iteration 110200/ 152972 | consumed samples: 51342784 | elapsed time per iteration (ms): 6804.5 | learning rate: 4.948E-05 | global batch size: 512 | lm loss: 2.781904E+00 | loss scale: 1048576.0 | grad norm: 123221.355 | num zeros: 0.0 | number of skipped iterations: 0 | number of nan iterations: 0 |
40635
+ time (ms)
40636
+ iteration 110400/ 152972 | consumed samples: 51445184 | elapsed time per iteration (ms): 5943.6 | learning rate: 4.914E-05 | global batch size: 512 | lm loss: 2.781725E+00 | loss scale: 524288.0 | grad norm: 51531.358 | num zeros: 0.0 | number of skipped iterations: 0 | number of nan iterations: 0 |
40637
+ time (ms)
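
The logged values in this hunk can be sanity-checked against each other. The sketch below is not part of the training code: `parse_fields` is a hypothetical helper written only for this illustration, and it assumes the `key: value | key: value |` layout seen in the lines above. It checks that the reported validation perplexity equals `exp(lm loss)` and that `consumed samples` advances by 200 iterations times the global batch size of 512 between consecutive log lines.

```python
import math


def parse_fields(line):
    """Hypothetical helper: split a 'key: value | key: value |' log line into a dict.

    Segments without a colon (e.g. 'iteration 110200/ 152972') are skipped.
    """
    fields = {}
    for part in line.split("|"):
        key, sep, value = part.partition(":")
        if sep:
            fields[key.strip()] = value.strip()
    return fields


# Lines copied from the diff above (train lines truncated to the fields used here).
val_line = (
    "validation loss at iteration 110000 | lm loss value: 2.730394E+00 "
    "| lm loss PPL: 1.533892E+01 |"
)
train_lines = [
    "iteration 110000/ 152972 | consumed samples: 51240384 | global batch size: 512 |",
    "iteration 110200/ 152972 | consumed samples: 51342784 | global batch size: 512 |",
    "iteration 110400/ 152972 | consumed samples: 51445184 | global batch size: 512 |",
]

# Perplexity is the exponential of the mean token-level loss.
val = parse_fields(val_line)
val_loss = float(val["lm loss value"])
val_ppl = float(val["lm loss PPL"])
ppl_matches = math.isclose(math.exp(val_loss), val_ppl, rel_tol=1e-5)

# Each logging interval spans 200 iterations at a global batch size of 512.
consumed = [int(parse_fields(l)["consumed samples"]) for l in train_lines]
batch_size = int(parse_fields(train_lines[0])["global batch size"])
deltas = [b - a for a, b in zip(consumed, consumed[1:])]
samples_match = all(d == 200 * batch_size for d in deltas)

print("PPL consistent with loss:", ppl_matches)
print("consumed-sample deltas:", deltas, "consistent:", samples_match)
```

The same parser works on the full log lines, since every field other than the leading `iteration N/ M` segment follows the `key: value` pattern.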