bigscience-bot
commited on
Commit
•
f3719aa
1
Parent(s):
a4eb31b
new data
Browse files- logs/main_log.txt +9 -0
logs/main_log.txt
CHANGED
@@ -16163,3 +16163,12 @@ time (ms) | save-checkpoint: 1490.10
|
|
16163 |
time (ms)
|
16164 |
iteration 42400/ 152972 | consumed samples: 16629184 | elapsed time per iteration (ms): 6179.2 | learning rate: 1.773E-04 | global batch size: 512 | lm loss: 2.945879E+00 | loss scale: 262144.0 | grad norm: 27153.375 | num zeros: 0.0 | number of skipped iterations: 0 | number of nan iterations: 0 |
|
16165 |
time (ms)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
16163 |
time (ms)
|
16164 |
iteration 42400/ 152972 | consumed samples: 16629184 | elapsed time per iteration (ms): 6179.2 | learning rate: 1.773E-04 | global batch size: 512 | lm loss: 2.945879E+00 | loss scale: 262144.0 | grad norm: 27153.375 | num zeros: 0.0 | number of skipped iterations: 0 | number of nan iterations: 0 |
|
16165 |
time (ms)
|
16166 |
+
iteration 42600/ 152972 | consumed samples: 16731584 | elapsed time per iteration (ms): 6185.8 | learning rate: 1.770E-04 | global batch size: 512 | lm loss: 2.938939E+00 | loss scale: 262144.0 | grad norm: 25700.885 | num zeros: 0.0 | number of skipped iterations: 0 | number of nan iterations: 0 |
|
16167 |
+
time (ms)
|
16168 |
+
iteration 42800/ 152972 | consumed samples: 16833984 | elapsed time per iteration (ms): 6163.8 | learning rate: 1.768E-04 | global batch size: 512 | lm loss: 2.940046E+00 | loss scale: 524288.0 | grad norm: 49709.805 | num zeros: 0.0 | number of skipped iterations: 0 | number of nan iterations: 0 |
|
16169 |
+
time (ms)
|
16170 |
+
iteration 43000/ 152972 | consumed samples: 16936384 | elapsed time per iteration (ms): 6177.1 | learning rate: 1.765E-04 | global batch size: 512 | lm loss: 2.939341E+00 | loss scale: 524288.0 | grad norm: 47217.024 | num zeros: 0.0 | number of skipped iterations: 0 | number of nan iterations: 0 |
|
16171 |
+
time (ms)
|
16172 |
+
-------------------------------------------------------------------------------------------------
|
16173 |
+
validation loss at iteration 43000 | lm loss value: 2.885082E+00 | lm loss PPL: 1.790504E+01 |
|
16174 |
+
-------------------------------------------------------------------------------------------------
|