Dmitry Chaplinsky
committed on
Commit: 25f502d
1 Parent(s): eacde4b
Updated model: 514 splits, 18.36 epochs, min_loss: 1.0164, min_ppl: 2.7632
- best-lm.pt +1 -1
- loss.txt +16 -0
best-lm.pt
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:43a896302f954ed5900cb4ac3eefd96189d368a32a3eda428cbce25990d4b54d
 size 22791455
loss.txt
CHANGED
@@ -496,3 +496,19 @@
 | end of split 76 / 28 | epoch 16 | time: 3246.27s | valid loss 1.0175 | valid ppl 2.7663 | learning rate 5.0000
 | end of split 77 / 28 | epoch 16 | time: 3262.21s | valid loss 1.0167 | valid ppl 2.7641 | learning rate 1.2500
 | end of split 78 / 28 | epoch 16 | time: 3268.11s | valid loss 1.0166 | valid ppl 2.7639 | learning rate 1.2500
+| end of split 79 / 28 | epoch 16 | time: 3269.38s | valid loss 1.0167 | valid ppl 2.7639 | learning rate 1.2500
+| end of split 80 / 28 | epoch 16 | time: 3269.47s | valid loss 1.0166 | valid ppl 2.7638 | learning rate 1.2500
+| end of split 81 / 28 | epoch 16 | time: 3258.11s | valid loss 1.0166 | valid ppl 2.7637 | learning rate 1.2500
+| end of split 82 / 28 | epoch 16 | time: 3255.05s | valid loss 1.0165 | valid ppl 2.7636 | learning rate 1.2500
+| end of split 83 / 28 | epoch 16 | time: 3267.27s | valid loss 1.0165 | valid ppl 2.7635 | learning rate 1.2500
+| end of split 84 / 28 | epoch 16 | time: 3267.39s | valid loss 1.0165 | valid ppl 2.7634 | learning rate 1.2500
+| end of split 85 / 28 | epoch 16 | time: 3268.88s | valid loss 1.0165 | valid ppl 2.7635 | learning rate 1.2500
+| end of split 86 / 28 | epoch 16 | time: 3271.95s | valid loss 1.0165 | valid ppl 2.7634 | learning rate 1.2500
+| end of split 87 / 28 | epoch 16 | time: 3211.88s | valid loss 1.0165 | valid ppl 2.7635 | learning rate 1.2500
+| end of split 60 / 28 | epoch 17 | time: 3260.00s | valid loss 1.0164 | valid ppl 2.7632 | learning rate 1.2500
+| end of split 61 / 28 | epoch 17 | time: 3331.83s | valid loss 1.0164 | valid ppl 2.7632 | learning rate 1.2500
+| end of split 62 / 28 | epoch 17 | time: 3266.91s | valid loss 1.0164 | valid ppl 2.7632 | learning rate 1.2500
+| end of split 63 / 28 | epoch 17 | time: 3267.75s | valid loss 1.0164 | valid ppl 2.7633 | learning rate 1.2500
+| end of split 64 / 28 | epoch 17 | time: 3265.47s | valid loss 1.0164 | valid ppl 2.7633 | learning rate 1.2500
+| end of split 65 / 28 | epoch 17 | time: 3253.35s | valid loss 1.0164 | valid ppl 2.7632 | learning rate 1.2500
+| end of split 66 / 28 | epoch 17 | time: 3263.04s | valid loss 1.0164 | valid ppl 2.7633 | learning rate 1.2500
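The summary figures in the commit message (514 splits, 18.36 epochs, min_loss, min_ppl) can plausibly be recomputed from loss.txt. The sketch below is not the author's tooling: the line pattern is inferred only from the log lines in this diff, the helper names `parse_line` and `summarize` are hypothetical, and it assumes that the "28" in each line is the number of splits per epoch, so that epochs = logged splits / 28 (514 / 28 ≈ 18.36). It also relies on the usual relation ppl = exp(loss), which the logged values are consistent with.

```python
import math
import re

# Line pattern inferred from the loss.txt lines in this commit (an assumption,
# not a documented format).
LINE_RE = re.compile(
    r"\|\s*end of split\s+(?P<split>\d+)\s*/\s*(?P<total>\d+)\s*"
    r"\|\s*epoch\s+(?P<epoch>\d+)\s*"
    r"\|\s*time:\s*(?P<time>[\d.]+)s\s*"
    r"\|\s*valid loss\s+(?P<loss>[\d.]+)\s*"
    r"\|\s*valid ppl\s+(?P<ppl>[\d.]+)\s*"
    r"\|\s*learning rate\s+(?P<lr>[\d.]+)"
)


def parse_line(line):
    """Parse one log line into a dict, or return None if it does not match."""
    m = LINE_RE.search(line)
    if m is None:
        return None
    return {
        "split": int(m["split"]),
        "total": int(m["total"]),
        "epoch": int(m["epoch"]),
        "time_s": float(m["time"]),
        "loss": float(m["loss"]),
        "ppl": float(m["ppl"]),
        "lr": float(m["lr"]),
    }


def summarize(lines, splits_per_epoch=28):
    """Recompute commit-message style statistics from log lines.

    splits_per_epoch=28 is an assumption read off the "/ 28" field.
    """
    records = [r for r in map(parse_line, lines) if r is not None]
    return {
        "splits": len(records),
        "epochs": round(len(records) / splits_per_epoch, 2),
        "min_loss": min(r["loss"] for r in records),
        "min_ppl": min(r["ppl"] for r in records),
    }


if __name__ == "__main__":
    # Hypothetical local path; adjust to wherever loss.txt is checked out.
    with open("loss.txt", encoding="utf-8") as fh:
        print(summarize(fh))
```

Because the logged loss is rounded to four decimals, `math.exp(loss)` only matches the logged perplexity to about three decimals, which is why the sketch takes the minimum perplexity from the log rather than deriving it.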