Dmitry Chaplinsky
commited on
Commit
•
ec4ce97
1
Parent(s):
c742047
Updated model: 752 splits, 26.86 epochs, min_loss: 1.0411, min_ppl: 2.8325
Browse files
loss.txt
CHANGED
@@ -733,3 +733,20 @@ TEST: valid loss 1.0419 | valid ppl 2.8346
|
|
733 |
| end of split 200 / 28 | epoch 20 | time: 3790.88s | valid loss 1.0411 | valid ppl 2.8324 | learning rate 0.0781
|
734 |
| end of split 201 / 28 | epoch 20 | time: 3786.15s | valid loss 1.0411 | valid ppl 2.8324 | learning rate 0.0781
|
735 |
| end of split 202 / 28 | epoch 20 | time: 3786.84s | valid loss 1.0411 | valid ppl 2.8324 | learning rate 0.0195
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
733 |
| end of split 200 / 28 | epoch 20 | time: 3790.88s | valid loss 1.0411 | valid ppl 2.8324 | learning rate 0.0781
|
734 |
| end of split 201 / 28 | epoch 20 | time: 3786.15s | valid loss 1.0411 | valid ppl 2.8324 | learning rate 0.0781
|
735 |
| end of split 202 / 28 | epoch 20 | time: 3786.84s | valid loss 1.0411 | valid ppl 2.8324 | learning rate 0.0195
|
736 |
+
| end of split 203 / 28 | epoch 20 | time: 3774.92s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0195
|
737 |
+
| end of split 204 / 28 | epoch 20 | time: 3774.54s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0195
|
738 |
+
| end of split 205 / 28 | epoch 20 | time: 3773.10s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0195
|
739 |
+
| end of split 206 / 28 | epoch 20 | time: 3776.61s | valid loss 1.0411 | valid ppl 2.8324 | learning rate 0.0195
|
740 |
+
| end of split 207 / 28 | epoch 20 | time: 3777.30s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0195
|
741 |
+
| end of split 208 / 28 | epoch 20 | time: 5363.05s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0195
|
742 |
+
| end of split 209 / 28 | epoch 20 | time: 3770.27s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0195
|
743 |
+
| end of split 210 / 28 | epoch 20 | time: 3776.92s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0195
|
744 |
+
| end of split 211 / 28 | epoch 20 | time: 3775.37s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0195
|
745 |
+
| end of split 212 / 28 | epoch 20 | time: 3777.34s | valid loss 1.0411 | valid ppl 2.8324 | learning rate 0.0195
|
746 |
+
| end of split 213 / 28 | epoch 20 | time: 3776.31s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0195
|
747 |
+
| end of split 214 / 28 | epoch 20 | time: 3777.03s | valid loss 1.0411 | valid ppl 2.8324 | learning rate 0.0195
|
748 |
+
| end of split 215 / 28 | epoch 20 | time: 3775.57s | valid loss 1.0411 | valid ppl 2.8324 | learning rate 0.0049
|
749 |
+
| end of split 216 / 28 | epoch 20 | time: 3776.52s | valid loss 1.0411 | valid ppl 2.8324 | learning rate 0.0049
|
750 |
+
| end of split 217 / 28 | epoch 20 | time: 3778.71s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0049
|
751 |
+
| end of split 218 / 28 | epoch 20 | time: 3776.20s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0049
|
752 |
+
| end of split 219 / 28 | epoch 20 | time: 1090.35s | valid loss 1.0411 | valid ppl 2.8325 | learning rate 0.0049
|