Dmitry Chaplinsky commited on
Commit
ec4ce97
1 Parent(s): c742047

Updated model: 752 splits, 26.86 epochs, min_loss: 1.0411, min_ppl: 2.8325

Browse files
Files changed (1) hide show
  1. loss.txt +17 -0
loss.txt CHANGED
@@ -733,3 +733,20 @@ TEST: valid loss 1.0419 | valid ppl 2.8346
733
  | end of split 200 / 28 | epoch 20 | time: 3790.88s | valid loss 1.0411 | valid ppl 2.8324 | learning rate 0.0781
734
  | end of split 201 / 28 | epoch 20 | time: 3786.15s | valid loss 1.0411 | valid ppl 2.8324 | learning rate 0.0781
735
  | end of split 202 / 28 | epoch 20 | time: 3786.84s | valid loss 1.0411 | valid ppl 2.8324 | learning rate 0.0195
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
733
  | end of split 200 / 28 | epoch 20 | time: 3790.88s | valid loss 1.0411 | valid ppl 2.8324 | learning rate 0.0781
734
  | end of split 201 / 28 | epoch 20 | time: 3786.15s | valid loss 1.0411 | valid ppl 2.8324 | learning rate 0.0781
735
  | end of split 202 / 28 | epoch 20 | time: 3786.84s | valid loss 1.0411 | valid ppl 2.8324 | learning rate 0.0195
736
+ | end of split 203 / 28 | epoch 20 | time: 3774.92s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0195
737
+ | end of split 204 / 28 | epoch 20 | time: 3774.54s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0195
738
+ | end of split 205 / 28 | epoch 20 | time: 3773.10s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0195
739
+ | end of split 206 / 28 | epoch 20 | time: 3776.61s | valid loss 1.0411 | valid ppl 2.8324 | learning rate 0.0195
740
+ | end of split 207 / 28 | epoch 20 | time: 3777.30s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0195
741
+ | end of split 208 / 28 | epoch 20 | time: 5363.05s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0195
742
+ | end of split 209 / 28 | epoch 20 | time: 3770.27s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0195
743
+ | end of split 210 / 28 | epoch 20 | time: 3776.92s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0195
744
+ | end of split 211 / 28 | epoch 20 | time: 3775.37s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0195
745
+ | end of split 212 / 28 | epoch 20 | time: 3777.34s | valid loss 1.0411 | valid ppl 2.8324 | learning rate 0.0195
746
+ | end of split 213 / 28 | epoch 20 | time: 3776.31s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0195
747
+ | end of split 214 / 28 | epoch 20 | time: 3777.03s | valid loss 1.0411 | valid ppl 2.8324 | learning rate 0.0195
748
+ | end of split 215 / 28 | epoch 20 | time: 3775.57s | valid loss 1.0411 | valid ppl 2.8324 | learning rate 0.0049
749
+ | end of split 216 / 28 | epoch 20 | time: 3776.52s | valid loss 1.0411 | valid ppl 2.8324 | learning rate 0.0049
750
+ | end of split 217 / 28 | epoch 20 | time: 3778.71s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0049
751
+ | end of split 218 / 28 | epoch 20 | time: 3776.20s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0049
752
+ | end of split 219 / 28 | epoch 20 | time: 1090.35s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0049