ner-french / training.log
Update model for torch 2.0
27fb2ba
2023-04-05 22:27:04,698 ----------------------------------------------------------------------------------------------------
2023-04-05 22:27:04,698 Model: "SequenceTagger(
  (embeddings): StackedEmbeddings(
    (list_embedding_0): WordEmbeddings(
      'fr'
      (embedding): Embedding(1000000, 300)
    )
    (list_embedding_1): FlairEmbeddings(
      (lm): LanguageModel(
        (drop): Dropout(p=0.5, inplace=False)
        (encoder): Embedding(275, 100)
        (rnn): LSTM(100, 1024)
      )
    )
    (list_embedding_2): FlairEmbeddings(
      (lm): LanguageModel(
        (drop): Dropout(p=0.5, inplace=False)
        (encoder): Embedding(275, 100)
        (rnn): LSTM(100, 1024)
      )
    )
  )
  (word_dropout): WordDropout(p=0.05)
  (locked_dropout): LockedDropout(p=0.5)
  (embedding2nn): Linear(in_features=2348, out_features=2348, bias=True)
  (rnn): LSTM(2348, 256, batch_first=True, bidirectional=True)
  (linear): Linear(in_features=512, out_features=19, bias=True)
  (loss_function): ViterbiLoss()
  (crf): CRF()
)"
2023-04-05 22:27:04,698 ----------------------------------------------------------------------------------------------------
2023-04-05 22:27:04,698 Corpus: "MultiCorpus: 107128 train + 11903 dev + 13226 test sentences
- ColumnCorpus Corpus: 107128 train + 11903 dev + 13226 test sentences - /vol/home-vol2/ml/akbikala/.flair/datasets/ner_multi_wikiner/fr"
2023-04-05 22:27:04,698 ----------------------------------------------------------------------------------------------------
2023-04-05 22:27:04,698 Parameters:
2023-04-05 22:27:04,698 - learning_rate: "0.100000"
2023-04-05 22:27:04,698 - mini_batch_size: "32"
2023-04-05 22:27:04,698 - patience: "3"
2023-04-05 22:27:04,698 - anneal_factor: "0.5"
2023-04-05 22:27:04,699 - max_epochs: "150"
2023-04-05 22:27:04,699 - shuffle: "True"
2023-04-05 22:27:04,699 - train_with_dev: "True"
2023-04-05 22:27:04,699 - batch_growth_annealing: "False"
2023-04-05 22:27:04,699 ----------------------------------------------------------------------------------------------------
2023-04-05 22:27:04,699 Model training base path: "resources/taggers/release-fr-ner-0"
2023-04-05 22:27:04,699 ----------------------------------------------------------------------------------------------------
2023-04-05 22:27:04,699 Device: cuda:1
2023-04-05 22:27:04,699 ----------------------------------------------------------------------------------------------------
2023-04-05 22:27:04,699 Embeddings storage mode: cpu
2023-04-05 22:27:04,699 ----------------------------------------------------------------------------------------------------
2023-04-05 22:28:33,936 epoch 1 - iter 372/3720 - loss 0.26668558 - time (sec): 89.24 - samples/sec: 3617.17 - lr: 0.100000
2023-04-05 22:30:02,154 epoch 1 - iter 744/3720 - loss 0.18821172 - time (sec): 177.45 - samples/sec: 3623.68 - lr: 0.100000
2023-04-05 22:31:33,098 epoch 1 - iter 1116/3720 - loss 0.15484475 - time (sec): 268.40 - samples/sec: 3598.78 - lr: 0.100000
2023-04-05 22:32:59,874 epoch 1 - iter 1488/3720 - loss 0.13700614 - time (sec): 355.18 - samples/sec: 3593.73 - lr: 0.100000
2023-04-05 22:34:26,533 epoch 1 - iter 1860/3720 - loss 0.12470905 - time (sec): 441.83 - samples/sec: 3593.17 - lr: 0.100000
2023-04-05 22:35:52,865 epoch 1 - iter 2232/3720 - loss 0.11525106 - time (sec): 528.17 - samples/sec: 3587.74 - lr: 0.100000
2023-04-05 22:37:20,753 epoch 1 - iter 2604/3720 - loss 0.10793162 - time (sec): 616.05 - samples/sec: 3579.76 - lr: 0.100000
2023-04-05 22:38:49,774 epoch 1 - iter 2976/3720 - loss 0.10238419 - time (sec): 705.08 - samples/sec: 3570.50 - lr: 0.100000
2023-04-05 22:39:54,061 epoch 1 - iter 3348/3720 - loss 0.09811850 - time (sec): 769.36 - samples/sec: 3686.03 - lr: 0.100000
2023-04-05 22:40:56,945 epoch 1 - iter 3720/3720 - loss 0.09450245 - time (sec): 832.25 - samples/sec: 3787.00 - lr: 0.100000
2023-04-05 22:40:56,945 ----------------------------------------------------------------------------------------------------
2023-04-05 22:40:56,945 EPOCH 1 done: loss 0.0945 - lr 0.100000
2023-04-05 22:40:56,945 BAD EPOCHS (no improvement): 0
2023-04-05 22:40:56,948 ----------------------------------------------------------------------------------------------------
2023-04-05 22:42:00,729 epoch 2 - iter 372/3720 - loss 0.06065240 - time (sec): 63.78 - samples/sec: 4924.62 - lr: 0.100000
2023-04-05 22:43:05,463 epoch 2 - iter 744/3720 - loss 0.05924859 - time (sec): 128.52 - samples/sec: 4904.42 - lr: 0.100000
2023-04-05 22:44:09,545 epoch 2 - iter 1116/3720 - loss 0.05885594 - time (sec): 192.60 - samples/sec: 4908.57 - lr: 0.100000
2023-04-05 22:45:13,493 epoch 2 - iter 1488/3720 - loss 0.05818312 - time (sec): 256.55 - samples/sec: 4910.54 - lr: 0.100000
2023-04-05 22:46:18,228 epoch 2 - iter 1860/3720 - loss 0.05786242 - time (sec): 321.28 - samples/sec: 4903.04 - lr: 0.100000
2023-04-05 22:47:22,311 epoch 2 - iter 2232/3720 - loss 0.05702811 - time (sec): 385.36 - samples/sec: 4911.93 - lr: 0.100000
2023-04-05 22:48:27,905 epoch 2 - iter 2604/3720 - loss 0.05644926 - time (sec): 450.96 - samples/sec: 4896.02 - lr: 0.100000
2023-04-05 22:49:32,658 epoch 2 - iter 2976/3720 - loss 0.05594932 - time (sec): 515.71 - samples/sec: 4890.45 - lr: 0.100000
2023-04-05 22:50:37,124 epoch 2 - iter 3348/3720 - loss 0.05562567 - time (sec): 580.18 - samples/sec: 4891.12 - lr: 0.100000
2023-04-05 22:51:41,572 epoch 2 - iter 3720/3720 - loss 0.05521556 - time (sec): 644.62 - samples/sec: 4889.22 - lr: 0.100000
2023-04-05 22:51:41,572 ----------------------------------------------------------------------------------------------------
2023-04-05 22:51:41,572 EPOCH 2 done: loss 0.0552 - lr 0.100000
2023-04-05 22:51:41,572 BAD EPOCHS (no improvement): 0
2023-04-05 22:51:41,575 ----------------------------------------------------------------------------------------------------
2023-04-05 22:52:45,931 epoch 3 - iter 372/3720 - loss 0.05046567 - time (sec): 64.36 - samples/sec: 4895.71 - lr: 0.100000
2023-04-05 22:53:49,969 epoch 3 - iter 744/3720 - loss 0.05024878 - time (sec): 128.39 - samples/sec: 4895.63 - lr: 0.100000
2023-04-05 22:54:55,129 epoch 3 - iter 1116/3720 - loss 0.05024194 - time (sec): 193.55 - samples/sec: 4876.90 - lr: 0.100000
2023-04-05 22:55:59,149 epoch 3 - iter 1488/3720 - loss 0.05028535 - time (sec): 257.57 - samples/sec: 4890.33 - lr: 0.100000
2023-04-05 22:57:04,010 epoch 3 - iter 1860/3720 - loss 0.05030635 - time (sec): 322.43 - samples/sec: 4888.14 - lr: 0.100000
2023-04-05 22:58:07,595 epoch 3 - iter 2232/3720 - loss 0.05013778 - time (sec): 386.02 - samples/sec: 4902.42 - lr: 0.100000
2023-04-05 22:59:11,614 epoch 3 - iter 2604/3720 - loss 0.05015459 - time (sec): 450.04 - samples/sec: 4901.46 - lr: 0.100000
2023-04-05 23:00:15,174 epoch 3 - iter 2976/3720 - loss 0.04980648 - time (sec): 513.60 - samples/sec: 4911.18 - lr: 0.100000
2023-04-05 23:01:19,651 epoch 3 - iter 3348/3720 - loss 0.04956168 - time (sec): 578.08 - samples/sec: 4910.97 - lr: 0.100000
2023-04-05 23:02:23,440 epoch 3 - iter 3720/3720 - loss 0.04952181 - time (sec): 641.86 - samples/sec: 4910.24 - lr: 0.100000
2023-04-05 23:02:23,440 ----------------------------------------------------------------------------------------------------
2023-04-05 23:02:23,440 EPOCH 3 done: loss 0.0495 - lr 0.100000
2023-04-05 23:02:23,441 BAD EPOCHS (no improvement): 0
2023-04-05 23:02:23,444 ----------------------------------------------------------------------------------------------------
2023-04-05 23:03:28,321 epoch 4 - iter 372/3720 - loss 0.04613959 - time (sec): 64.88 - samples/sec: 4888.98 - lr: 0.100000
2023-04-05 23:04:32,814 epoch 4 - iter 744/3720 - loss 0.04723903 - time (sec): 129.37 - samples/sec: 4875.64 - lr: 0.100000
2023-04-05 23:05:37,556 epoch 4 - iter 1116/3720 - loss 0.04692235 - time (sec): 194.11 - samples/sec: 4880.06 - lr: 0.100000
2023-04-05 23:06:40,836 epoch 4 - iter 1488/3720 - loss 0.04683749 - time (sec): 257.39 - samples/sec: 4905.46 - lr: 0.100000
2023-04-05 23:07:45,092 epoch 4 - iter 1860/3720 - loss 0.04668699 - time (sec): 321.65 - samples/sec: 4915.44 - lr: 0.100000
2023-04-05 23:08:49,644 epoch 4 - iter 2232/3720 - loss 0.04665987 - time (sec): 386.20 - samples/sec: 4907.91 - lr: 0.100000
2023-04-05 23:09:52,884 epoch 4 - iter 2604/3720 - loss 0.04664835 - time (sec): 449.44 - samples/sec: 4911.95 - lr: 0.100000
2023-04-05 23:10:57,031 epoch 4 - iter 2976/3720 - loss 0.04652260 - time (sec): 513.59 - samples/sec: 4912.37 - lr: 0.100000
2023-04-05 23:12:00,247 epoch 4 - iter 3348/3720 - loss 0.04657826 - time (sec): 576.80 - samples/sec: 4918.85 - lr: 0.100000
2023-04-05 23:13:03,773 epoch 4 - iter 3720/3720 - loss 0.04654629 - time (sec): 640.33 - samples/sec: 4922.02 - lr: 0.100000
2023-04-05 23:13:03,773 ----------------------------------------------------------------------------------------------------
2023-04-05 23:13:03,773 EPOCH 4 done: loss 0.0465 - lr 0.100000
2023-04-05 23:13:03,774 BAD EPOCHS (no improvement): 0
2023-04-05 23:13:03,776 ----------------------------------------------------------------------------------------------------
2023-04-05 23:14:07,173 epoch 5 - iter 372/3720 - loss 0.04491548 - time (sec): 63.40 - samples/sec: 4971.42 - lr: 0.100000
2023-04-05 23:15:09,761 epoch 5 - iter 744/3720 - loss 0.04494219 - time (sec): 125.98 - samples/sec: 4985.63 - lr: 0.100000
2023-04-05 23:16:15,361 epoch 5 - iter 1116/3720 - loss 0.04485739 - time (sec): 191.58 - samples/sec: 4925.37 - lr: 0.100000
2023-04-05 23:17:20,769 epoch 5 - iter 1488/3720 - loss 0.04486438 - time (sec): 256.99 - samples/sec: 4901.50 - lr: 0.100000
2023-04-05 23:18:25,883 epoch 5 - iter 1860/3720 - loss 0.04477035 - time (sec): 322.11 - samples/sec: 4894.20 - lr: 0.100000
2023-04-05 23:19:29,636 epoch 5 - iter 2232/3720 - loss 0.04462821 - time (sec): 385.86 - samples/sec: 4893.28 - lr: 0.100000
2023-04-05 23:20:34,474 epoch 5 - iter 2604/3720 - loss 0.04470493 - time (sec): 450.70 - samples/sec: 4884.85 - lr: 0.100000
2023-04-05 23:21:39,925 epoch 5 - iter 2976/3720 - loss 0.04477533 - time (sec): 516.15 - samples/sec: 4879.53 - lr: 0.100000
2023-04-05 23:22:44,993 epoch 5 - iter 3348/3720 - loss 0.04475228 - time (sec): 581.22 - samples/sec: 4877.68 - lr: 0.100000
2023-04-05 23:23:50,164 epoch 5 - iter 3720/3720 - loss 0.04467442 - time (sec): 646.39 - samples/sec: 4875.88 - lr: 0.100000
2023-04-05 23:23:50,165 ----------------------------------------------------------------------------------------------------
2023-04-05 23:23:50,165 EPOCH 5 done: loss 0.0447 - lr 0.100000
2023-04-05 23:23:50,165 BAD EPOCHS (no improvement): 0
2023-04-05 23:23:50,168 ----------------------------------------------------------------------------------------------------
2023-04-05 23:24:56,275 epoch 6 - iter 372/3720 - loss 0.04236946 - time (sec): 66.11 - samples/sec: 4800.02 - lr: 0.100000
2023-04-05 23:26:00,676 epoch 6 - iter 744/3720 - loss 0.04346961 - time (sec): 130.51 - samples/sec: 4833.73 - lr: 0.100000
2023-04-05 23:27:05,725 epoch 6 - iter 1116/3720 - loss 0.04341193 - time (sec): 195.56 - samples/sec: 4834.31 - lr: 0.100000
2023-04-05 23:28:10,557 epoch 6 - iter 1488/3720 - loss 0.04353977 - time (sec): 260.39 - samples/sec: 4843.95 - lr: 0.100000
2023-04-05 23:29:14,854 epoch 6 - iter 1860/3720 - loss 0.04356128 - time (sec): 324.69 - samples/sec: 4852.90 - lr: 0.100000
2023-04-05 23:30:19,014 epoch 6 - iter 2232/3720 - loss 0.04363767 - time (sec): 388.85 - samples/sec: 4858.71 - lr: 0.100000
2023-04-05 23:31:23,740 epoch 6 - iter 2604/3720 - loss 0.04364881 - time (sec): 453.57 - samples/sec: 4852.89 - lr: 0.100000
2023-04-05 23:32:28,669 epoch 6 - iter 2976/3720 - loss 0.04361488 - time (sec): 518.50 - samples/sec: 4851.02 - lr: 0.100000
2023-04-05 23:33:35,290 epoch 6 - iter 3348/3720 - loss 0.04342392 - time (sec): 585.12 - samples/sec: 4844.92 - lr: 0.100000
2023-04-05 23:34:40,827 epoch 6 - iter 3720/3720 - loss 0.04337900 - time (sec): 650.66 - samples/sec: 4843.88 - lr: 0.100000
2023-04-05 23:34:40,827 ----------------------------------------------------------------------------------------------------
2023-04-05 23:34:40,827 EPOCH 6 done: loss 0.0434 - lr 0.100000
2023-04-05 23:34:40,827 BAD EPOCHS (no improvement): 0
2023-04-05 23:34:40,830 ----------------------------------------------------------------------------------------------------
2023-04-05 23:35:46,776 epoch 7 - iter 372/3720 - loss 0.04216405 - time (sec): 65.95 - samples/sec: 4799.64 - lr: 0.100000
2023-04-05 23:36:51,460 epoch 7 - iter 744/3720 - loss 0.04195712 - time (sec): 130.63 - samples/sec: 4816.10 - lr: 0.100000
2023-04-05 23:37:55,121 epoch 7 - iter 1116/3720 - loss 0.04172125 - time (sec): 194.29 - samples/sec: 4864.28 - lr: 0.100000
2023-04-05 23:39:00,044 epoch 7 - iter 1488/3720 - loss 0.04181797 - time (sec): 259.21 - samples/sec: 4867.10 - lr: 0.100000
2023-04-05 23:40:05,144 epoch 7 - iter 1860/3720 - loss 0.04193096 - time (sec): 324.31 - samples/sec: 4859.06 - lr: 0.100000
2023-04-05 23:41:10,633 epoch 7 - iter 2232/3720 - loss 0.04198594 - time (sec): 389.80 - samples/sec: 4849.47 - lr: 0.100000
2023-04-05 23:42:14,072 epoch 7 - iter 2604/3720 - loss 0.04209685 - time (sec): 453.24 - samples/sec: 4862.50 - lr: 0.100000
2023-04-05 23:43:18,823 epoch 7 - iter 2976/3720 - loss 0.04213716 - time (sec): 517.99 - samples/sec: 4863.88 - lr: 0.100000
2023-04-05 23:44:23,416 epoch 7 - iter 3348/3720 - loss 0.04224028 - time (sec): 582.59 - samples/sec: 4868.17 - lr: 0.100000
2023-04-05 23:45:29,396 epoch 7 - iter 3720/3720 - loss 0.04226558 - time (sec): 648.57 - samples/sec: 4859.51 - lr: 0.100000
2023-04-05 23:45:29,396 ----------------------------------------------------------------------------------------------------
2023-04-05 23:45:29,396 EPOCH 7 done: loss 0.0423 - lr 0.100000
2023-04-05 23:45:29,396 BAD EPOCHS (no improvement): 0
2023-04-05 23:45:29,399 ----------------------------------------------------------------------------------------------------
2023-04-05 23:46:33,584 epoch 8 - iter 372/3720 - loss 0.04172150 - time (sec): 64.18 - samples/sec: 4911.49 - lr: 0.100000
2023-04-05 23:47:37,938 epoch 8 - iter 744/3720 - loss 0.04180014 - time (sec): 128.54 - samples/sec: 4896.63 - lr: 0.100000
2023-04-05 23:48:42,007 epoch 8 - iter 1116/3720 - loss 0.04179824 - time (sec): 192.61 - samples/sec: 4911.23 - lr: 0.100000
2023-04-05 23:49:45,242 epoch 8 - iter 1488/3720 - loss 0.04151445 - time (sec): 255.84 - samples/sec: 4931.13 - lr: 0.100000
2023-04-05 23:50:49,675 epoch 8 - iter 1860/3720 - loss 0.04166393 - time (sec): 320.28 - samples/sec: 4923.25 - lr: 0.100000
2023-04-05 23:51:55,508 epoch 8 - iter 2232/3720 - loss 0.04150733 - time (sec): 386.11 - samples/sec: 4903.37 - lr: 0.100000
2023-04-05 23:53:01,769 epoch 8 - iter 2604/3720 - loss 0.04146176 - time (sec): 452.37 - samples/sec: 4876.82 - lr: 0.100000
2023-04-05 23:54:06,502 epoch 8 - iter 2976/3720 - loss 0.04150560 - time (sec): 517.10 - samples/sec: 4872.69 - lr: 0.100000
2023-04-05 23:55:12,693 epoch 8 - iter 3348/3720 - loss 0.04151976 - time (sec): 583.29 - samples/sec: 4864.86 - lr: 0.100000
2023-04-05 23:56:17,542 epoch 8 - iter 3720/3720 - loss 0.04164803 - time (sec): 648.14 - samples/sec: 4862.68 - lr: 0.100000
2023-04-05 23:56:17,543 ----------------------------------------------------------------------------------------------------
2023-04-05 23:56:17,543 EPOCH 8 done: loss 0.0416 - lr 0.100000
2023-04-05 23:56:17,543 BAD EPOCHS (no improvement): 0
2023-04-05 23:56:17,545 ----------------------------------------------------------------------------------------------------
2023-04-05 23:57:22,891 epoch 9 - iter 372/3720 - loss 0.04059165 - time (sec): 65.35 - samples/sec: 4821.72 - lr: 0.100000
2023-04-05 23:58:28,040 epoch 9 - iter 744/3720 - loss 0.04076265 - time (sec): 130.49 - samples/sec: 4826.48 - lr: 0.100000
2023-04-05 23:59:32,267 epoch 9 - iter 1116/3720 - loss 0.04079673 - time (sec): 194.72 - samples/sec: 4845.58 - lr: 0.100000
2023-04-06 00:00:37,866 epoch 9 - iter 1488/3720 - loss 0.04100264 - time (sec): 260.32 - samples/sec: 4838.91 - lr: 0.100000
2023-04-06 00:01:42,188 epoch 9 - iter 1860/3720 - loss 0.04102605 - time (sec): 324.64 - samples/sec: 4848.46 - lr: 0.100000
2023-04-06 00:02:46,538 epoch 9 - iter 2232/3720 - loss 0.04095108 - time (sec): 388.99 - samples/sec: 4859.31 - lr: 0.100000
2023-04-06 00:03:51,285 epoch 9 - iter 2604/3720 - loss 0.04087753 - time (sec): 453.74 - samples/sec: 4866.60 - lr: 0.100000
2023-04-06 00:04:55,279 epoch 9 - iter 2976/3720 - loss 0.04081089 - time (sec): 517.73 - samples/sec: 4872.21 - lr: 0.100000
2023-04-06 00:06:01,561 epoch 9 - iter 3348/3720 - loss 0.04085946 - time (sec): 584.02 - samples/sec: 4859.40 - lr: 0.100000
2023-04-06 00:07:06,309 epoch 9 - iter 3720/3720 - loss 0.04084458 - time (sec): 648.76 - samples/sec: 4858.03 - lr: 0.100000
2023-04-06 00:07:06,309 ----------------------------------------------------------------------------------------------------
2023-04-06 00:07:06,309 EPOCH 9 done: loss 0.0408 - lr 0.100000
2023-04-06 00:07:06,309 BAD EPOCHS (no improvement): 0
2023-04-06 00:07:06,312 ----------------------------------------------------------------------------------------------------
2023-04-06 00:08:11,202 epoch 10 - iter 372/3720 - loss 0.03977085 - time (sec): 64.89 - samples/sec: 4857.10 - lr: 0.100000
2023-04-06 00:09:16,581 epoch 10 - iter 744/3720 - loss 0.04042448 - time (sec): 130.27 - samples/sec: 4828.72 - lr: 0.100000
2023-04-06 00:10:22,000 epoch 10 - iter 1116/3720 - loss 0.04021729 - time (sec): 195.69 - samples/sec: 4825.54 - lr: 0.100000
2023-04-06 00:11:27,331 epoch 10 - iter 1488/3720 - loss 0.04006120 - time (sec): 261.02 - samples/sec: 4826.13 - lr: 0.100000
2023-04-06 00:12:30,914 epoch 10 - iter 1860/3720 - loss 0.04018432 - time (sec): 324.60 - samples/sec: 4849.77 - lr: 0.100000
2023-04-06 00:13:34,453 epoch 10 - iter 2232/3720 - loss 0.04038774 - time (sec): 388.14 - samples/sec: 4866.26 - lr: 0.100000
2023-04-06 00:14:39,329 epoch 10 - iter 2604/3720 - loss 0.04035558 - time (sec): 453.02 - samples/sec: 4865.27 - lr: 0.100000
2023-04-06 00:15:44,245 epoch 10 - iter 2976/3720 - loss 0.04031628 - time (sec): 517.93 - samples/sec: 4859.06 - lr: 0.100000
2023-04-06 00:16:50,500 epoch 10 - iter 3348/3720 - loss 0.04030915 - time (sec): 584.19 - samples/sec: 4851.80 - lr: 0.100000
2023-04-06 00:17:56,482 epoch 10 - iter 3720/3720 - loss 0.04028057 - time (sec): 650.17 - samples/sec: 4847.52 - lr: 0.100000
2023-04-06 00:17:56,482 ----------------------------------------------------------------------------------------------------
2023-04-06 00:17:56,482 EPOCH 10 done: loss 0.0403 - lr 0.100000
2023-04-06 00:17:56,482 BAD EPOCHS (no improvement): 0
2023-04-06 00:17:56,485 ----------------------------------------------------------------------------------------------------
2023-04-06 00:19:02,260 epoch 11 - iter 372/3720 - loss 0.03881162 - time (sec): 65.78 - samples/sec: 4790.89 - lr: 0.100000
2023-04-06 00:20:07,864 epoch 11 - iter 744/3720 - loss 0.03934850 - time (sec): 131.38 - samples/sec: 4801.79 - lr: 0.100000
2023-04-06 00:21:12,756 epoch 11 - iter 1116/3720 - loss 0.03941690 - time (sec): 196.27 - samples/sec: 4817.33 - lr: 0.100000
2023-04-06 00:22:17,917 epoch 11 - iter 1488/3720 - loss 0.03964001 - time (sec): 261.43 - samples/sec: 4819.91 - lr: 0.100000
2023-04-06 00:23:23,428 epoch 11 - iter 1860/3720 - loss 0.03969471 - time (sec): 326.94 - samples/sec: 4821.40 - lr: 0.100000
2023-04-06 00:24:28,856 epoch 11 - iter 2232/3720 - loss 0.03980826 - time (sec): 392.37 - samples/sec: 4822.24 - lr: 0.100000
2023-04-06 00:25:34,439 epoch 11 - iter 2604/3720 - loss 0.03980274 - time (sec): 457.95 - samples/sec: 4817.42 - lr: 0.100000
2023-04-06 00:26:40,245 epoch 11 - iter 2976/3720 - loss 0.03986743 - time (sec): 523.76 - samples/sec: 4815.16 - lr: 0.100000
2023-04-06 00:27:44,010 epoch 11 - iter 3348/3720 - loss 0.03985353 - time (sec): 587.53 - samples/sec: 4829.76 - lr: 0.100000
2023-04-06 00:28:49,148 epoch 11 - iter 3720/3720 - loss 0.03979586 - time (sec): 652.66 - samples/sec: 4829.00 - lr: 0.100000
2023-04-06 00:28:49,148 ----------------------------------------------------------------------------------------------------
2023-04-06 00:28:49,148 EPOCH 11 done: loss 0.0398 - lr 0.100000
2023-04-06 00:28:49,148 BAD EPOCHS (no improvement): 0
2023-04-06 00:28:49,152 ----------------------------------------------------------------------------------------------------
2023-04-06 00:29:55,927 epoch 12 - iter 372/3720 - loss 0.03948977 - time (sec): 66.78 - samples/sec: 4753.05 - lr: 0.100000
2023-04-06 00:31:00,559 epoch 12 - iter 744/3720 - loss 0.03857903 - time (sec): 131.41 - samples/sec: 4815.13 - lr: 0.100000
2023-04-06 00:32:06,155 epoch 12 - iter 1116/3720 - loss 0.03863680 - time (sec): 197.00 - samples/sec: 4817.46 - lr: 0.100000
2023-04-06 00:33:10,411 epoch 12 - iter 1488/3720 - loss 0.03888190 - time (sec): 261.26 - samples/sec: 4835.92 - lr: 0.100000
2023-04-06 00:34:14,880 epoch 12 - iter 1860/3720 - loss 0.03894279 - time (sec): 325.73 - samples/sec: 4840.66 - lr: 0.100000
2023-04-06 00:35:19,705 epoch 12 - iter 2232/3720 - loss 0.03904878 - time (sec): 390.55 - samples/sec: 4846.41 - lr: 0.100000
2023-04-06 00:36:23,790 epoch 12 - iter 2604/3720 - loss 0.03926250 - time (sec): 454.64 - samples/sec: 4852.32 - lr: 0.100000
2023-04-06 00:37:29,351 epoch 12 - iter 2976/3720 - loss 0.03935777 - time (sec): 520.20 - samples/sec: 4849.10 - lr: 0.100000
2023-04-06 00:38:34,450 epoch 12 - iter 3348/3720 - loss 0.03927677 - time (sec): 585.30 - samples/sec: 4849.29 - lr: 0.100000
2023-04-06 00:39:37,391 epoch 12 - iter 3720/3720 - loss 0.03929491 - time (sec): 648.24 - samples/sec: 4861.96 - lr: 0.100000
2023-04-06 00:39:37,391 ----------------------------------------------------------------------------------------------------
2023-04-06 00:39:37,391 EPOCH 12 done: loss 0.0393 - lr 0.100000
2023-04-06 00:39:37,391 BAD EPOCHS (no improvement): 0
2023-04-06 00:39:37,395 ----------------------------------------------------------------------------------------------------
2023-04-06 00:40:42,023 epoch 13 - iter 372/3720 - loss 0.03793697 - time (sec): 64.63 - samples/sec: 4907.18 - lr: 0.100000
2023-04-06 00:41:47,485 epoch 13 - iter 744/3720 - loss 0.03851898 - time (sec): 130.09 - samples/sec: 4872.84 - lr: 0.100000
2023-04-06 00:42:53,288 epoch 13 - iter 1116/3720 - loss 0.03867673 - time (sec): 195.89 - samples/sec: 4836.38 - lr: 0.100000
2023-04-06 00:43:58,372 epoch 13 - iter 1488/3720 - loss 0.03867665 - time (sec): 260.98 - samples/sec: 4841.98 - lr: 0.100000
2023-04-06 00:45:04,638 epoch 13 - iter 1860/3720 - loss 0.03897579 - time (sec): 327.24 - samples/sec: 4830.30 - lr: 0.100000
2023-04-06 00:46:09,680 epoch 13 - iter 2232/3720 - loss 0.03911655 - time (sec): 392.29 - samples/sec: 4833.13 - lr: 0.100000
2023-04-06 00:47:14,113 epoch 13 - iter 2604/3720 - loss 0.03913999 - time (sec): 456.72 - samples/sec: 4839.97 - lr: 0.100000
2023-04-06 00:48:19,561 epoch 13 - iter 2976/3720 - loss 0.03915125 - time (sec): 522.17 - samples/sec: 4833.02 - lr: 0.100000
2023-04-06 00:49:25,258 epoch 13 - iter 3348/3720 - loss 0.03928003 - time (sec): 587.86 - samples/sec: 4827.79 - lr: 0.100000
2023-04-06 00:50:30,379 epoch 13 - iter 3720/3720 - loss 0.03920813 - time (sec): 652.98 - samples/sec: 4826.63 - lr: 0.100000
2023-04-06 00:50:30,379 ----------------------------------------------------------------------------------------------------
2023-04-06 00:50:30,379 EPOCH 13 done: loss 0.0392 - lr 0.100000
2023-04-06 00:50:30,379 BAD EPOCHS (no improvement): 0
2023-04-06 00:50:30,383 ----------------------------------------------------------------------------------------------------
2023-04-06 00:51:35,122 epoch 14 - iter 372/3720 - loss 0.03718396 - time (sec): 64.74 - samples/sec: 4874.25 - lr: 0.100000
2023-04-06 00:52:39,958 epoch 14 - iter 744/3720 - loss 0.03795859 - time (sec): 129.58 - samples/sec: 4864.62 - lr: 0.100000
2023-04-06 00:53:44,221 epoch 14 - iter 1116/3720 - loss 0.03814956 - time (sec): 193.84 - samples/sec: 4879.49 - lr: 0.100000
2023-04-06 00:54:48,326 epoch 14 - iter 1488/3720 - loss 0.03824906 - time (sec): 257.94 - samples/sec: 4887.69 - lr: 0.100000
2023-04-06 00:55:53,342 epoch 14 - iter 1860/3720 - loss 0.03841750 - time (sec): 322.96 - samples/sec: 4886.92 - lr: 0.100000
2023-04-06 00:56:56,998 epoch 14 - iter 2232/3720 - loss 0.03869255 - time (sec): 386.62 - samples/sec: 4893.93 - lr: 0.100000
2023-04-06 00:58:01,416 epoch 14 - iter 2604/3720 - loss 0.03867406 - time (sec): 451.03 - samples/sec: 4891.72 - lr: 0.100000
2023-04-06 00:59:06,253 epoch 14 - iter 2976/3720 - loss 0.03879602 - time (sec): 515.87 - samples/sec: 4886.24 - lr: 0.100000
2023-04-06 01:00:11,096 epoch 14 - iter 3348/3720 - loss 0.03868007 - time (sec): 580.71 - samples/sec: 4884.80 - lr: 0.100000
2023-04-06 01:01:16,460 epoch 14 - iter 3720/3720 - loss 0.03875447 - time (sec): 646.08 - samples/sec: 4878.23 - lr: 0.100000
2023-04-06 01:01:16,460 ----------------------------------------------------------------------------------------------------
2023-04-06 01:01:16,460 EPOCH 14 done: loss 0.0388 - lr 0.100000
2023-04-06 01:01:16,460 BAD EPOCHS (no improvement): 0
2023-04-06 01:01:16,464 ----------------------------------------------------------------------------------------------------
2023-04-06 01:02:21,589 epoch 15 - iter 372/3720 - loss 0.03884488 - time (sec): 65.12 - samples/sec: 4859.08 - lr: 0.100000
2023-04-06 01:03:26,238 epoch 15 - iter 744/3720 - loss 0.03831782 - time (sec): 129.77 - samples/sec: 4871.54 - lr: 0.100000
2023-04-06 01:04:29,717 epoch 15 - iter 1116/3720 - loss 0.03824842 - time (sec): 193.25 - samples/sec: 4896.28 - lr: 0.100000
2023-04-06 01:05:33,584 epoch 15 - iter 1488/3720 - loss 0.03824843 - time (sec): 257.12 - samples/sec: 4903.13 - lr: 0.100000
2023-04-06 01:06:38,784 epoch 15 - iter 1860/3720 - loss 0.03847771 - time (sec): 322.32 - samples/sec: 4896.35 - lr: 0.100000
2023-04-06 01:07:43,045 epoch 15 - iter 2232/3720 - loss 0.03837810 - time (sec): 386.58 - samples/sec: 4895.60 - lr: 0.100000
2023-04-06 01:08:47,591 epoch 15 - iter 2604/3720 - loss 0.03846540 - time (sec): 451.13 - samples/sec: 4887.26 - lr: 0.100000
2023-04-06 01:09:52,730 epoch 15 - iter 2976/3720 - loss 0.03853469 - time (sec): 516.27 - samples/sec: 4884.77 - lr: 0.100000
2023-04-06 01:10:58,427 epoch 15 - iter 3348/3720 - loss 0.03841999 - time (sec): 581.96 - samples/sec: 4877.72 - lr: 0.100000
2023-04-06 01:12:02,191 epoch 15 - iter 3720/3720 - loss 0.03840150 - time (sec): 645.73 - samples/sec: 4880.88 - lr: 0.100000
2023-04-06 01:12:02,191 ----------------------------------------------------------------------------------------------------
2023-04-06 01:12:02,191 EPOCH 15 done: loss 0.0384 - lr 0.100000
2023-04-06 01:12:02,191 BAD EPOCHS (no improvement): 0
2023-04-06 01:12:02,194 ----------------------------------------------------------------------------------------------------
2023-04-06 01:13:07,605 epoch 16 - iter 372/3720 - loss 0.03656534 - time (sec): 65.41 - samples/sec: 4811.92 - lr: 0.100000
2023-04-06 01:14:13,165 epoch 16 - iter 744/3720 - loss 0.03768461 - time (sec): 130.97 - samples/sec: 4816.79 - lr: 0.100000
2023-04-06 01:15:18,025 epoch 16 - iter 1116/3720 - loss 0.03769401 - time (sec): 195.83 - samples/sec: 4827.59 - lr: 0.100000
2023-04-06 01:16:22,286 epoch 16 - iter 1488/3720 - loss 0.03801334 - time (sec): 260.09 - samples/sec: 4847.08 - lr: 0.100000
2023-04-06 01:17:27,638 epoch 16 - iter 1860/3720 - loss 0.03814107 - time (sec): 325.44 - samples/sec: 4842.98 - lr: 0.100000
2023-04-06 01:18:32,828 epoch 16 - iter 2232/3720 - loss 0.03806304 - time (sec): 390.63 - samples/sec: 4848.90 - lr: 0.100000
2023-04-06 01:19:36,777 epoch 16 - iter 2604/3720 - loss 0.03810921 - time (sec): 454.58 - samples/sec: 4856.32 - lr: 0.100000
2023-04-06 01:20:39,643 epoch 16 - iter 2976/3720 - loss 0.03800707 - time (sec): 517.45 - samples/sec: 4872.65 - lr: 0.100000
2023-04-06 01:21:43,342 epoch 16 - iter 3348/3720 - loss 0.03808011 - time (sec): 581.15 - samples/sec: 4881.02 - lr: 0.100000
2023-04-06 01:22:47,752 epoch 16 - iter 3720/3720 - loss 0.03817140 - time (sec): 645.56 - samples/sec: 4882.15 - lr: 0.100000
2023-04-06 01:22:47,753 ----------------------------------------------------------------------------------------------------
2023-04-06 01:22:47,753 EPOCH 16 done: loss 0.0382 - lr 0.100000
2023-04-06 01:22:47,753 BAD EPOCHS (no improvement): 0
2023-04-06 01:22:47,756 ----------------------------------------------------------------------------------------------------
2023-04-06 01:23:52,545 epoch 17 - iter 372/3720 - loss 0.03777496 - time (sec): 64.79 - samples/sec: 4873.10 - lr: 0.100000
2023-04-06 01:24:57,014 epoch 17 - iter 744/3720 - loss 0.03762120 - time (sec): 129.26 - samples/sec: 4875.10 - lr: 0.100000
2023-04-06 01:26:02,115 epoch 17 - iter 1116/3720 - loss 0.03794625 - time (sec): 194.36 - samples/sec: 4871.71 - lr: 0.100000
2023-04-06 01:27:07,547 epoch 17 - iter 1488/3720 - loss 0.03782393 - time (sec): 259.79 - samples/sec: 4864.06 - lr: 0.100000
2023-04-06 01:28:11,650 epoch 17 - iter 1860/3720 - loss 0.03782723 - time (sec): 323.89 - samples/sec: 4874.57 - lr: 0.100000
2023-04-06 01:29:15,043 epoch 17 - iter 2232/3720 - loss 0.03774844 - time (sec): 387.29 - samples/sec: 4893.80 - lr: 0.100000
2023-04-06 01:30:19,164 epoch 17 - iter 2604/3720 - loss 0.03766911 - time (sec): 451.41 - samples/sec: 4902.54 - lr: 0.100000
2023-04-06 01:31:22,179 epoch 17 - iter 2976/3720 - loss 0.03783737 - time (sec): 514.42 - samples/sec: 4911.87 - lr: 0.100000
2023-04-06 01:32:24,708 epoch 17 - iter 3348/3720 - loss 0.03791902 - time (sec): 576.95 - samples/sec: 4920.71 - lr: 0.100000
2023-04-06 01:33:29,291 epoch 17 - iter 3720/3720 - loss 0.03798204 - time (sec): 641.53 - samples/sec: 4912.77 - lr: 0.100000
2023-04-06 01:33:29,291 ----------------------------------------------------------------------------------------------------
2023-04-06 01:33:29,291 EPOCH 17 done: loss 0.0380 - lr 0.100000
2023-04-06 01:33:29,291 BAD EPOCHS (no improvement): 0
2023-04-06 01:33:29,295 ----------------------------------------------------------------------------------------------------
2023-04-06 01:34:33,127 epoch 18 - iter 372/3720 - loss 0.03717692 - time (sec): 63.83 - samples/sec: 4943.40 - lr: 0.100000
2023-04-06 01:35:36,821 epoch 18 - iter 744/3720 - loss 0.03806882 - time (sec): 127.53 - samples/sec: 4946.33 - lr: 0.100000
2023-04-06 01:36:40,369 epoch 18 - iter 1116/3720 - loss 0.03788397 - time (sec): 191.07 - samples/sec: 4950.24 - lr: 0.100000
2023-04-06 01:37:45,086 epoch 18 - iter 1488/3720 - loss 0.03811433 - time (sec): 255.79 - samples/sec: 4941.13 - lr: 0.100000
2023-04-06 01:38:50,149 epoch 18 - iter 1860/3720 - loss 0.03789941 - time (sec): 320.85 - samples/sec: 4923.11 - lr: 0.100000
2023-04-06 01:39:53,735 epoch 18 - iter 2232/3720 - loss 0.03799931 - time (sec): 384.44 - samples/sec: 4923.81 - lr: 0.100000
2023-04-06 01:40:57,148 epoch 18 - iter 2604/3720 - loss 0.03787706 - time (sec): 447.85 - samples/sec: 4928.18 - lr: 0.100000
2023-04-06 01:42:01,906 epoch 18 - iter 2976/3720 - loss 0.03800637 - time (sec): 512.61 - samples/sec: 4925.40 - lr: 0.100000
2023-04-06 01:43:05,441 epoch 18 - iter 3348/3720 - loss 0.03806865 - time (sec): 576.15 - samples/sec: 4928.27 - lr: 0.100000
2023-04-06 01:44:09,375 epoch 18 - iter 3720/3720 - loss 0.03801343 - time (sec): 640.08 - samples/sec: 4923.93 - lr: 0.100000
2023-04-06 01:44:09,376 ----------------------------------------------------------------------------------------------------
2023-04-06 01:44:09,376 EPOCH 18 done: loss 0.0380 - lr 0.100000
2023-04-06 01:44:09,376 BAD EPOCHS (no improvement): 1
2023-04-06 01:44:09,379 ----------------------------------------------------------------------------------------------------
2023-04-06 01:45:13,657 epoch 19 - iter 372/3720 - loss 0.03779553 - time (sec): 64.28 - samples/sec: 4898.67 - lr: 0.100000
2023-04-06 01:46:16,968 epoch 19 - iter 744/3720 - loss 0.03734848 - time (sec): 127.59 - samples/sec: 4932.26 - lr: 0.100000
2023-04-06 01:47:20,913 epoch 19 - iter 1116/3720 - loss 0.03732153 - time (sec): 191.53 - samples/sec: 4933.50 - lr: 0.100000
2023-04-06 01:48:24,318 epoch 19 - iter 1488/3720 - loss 0.03741383 - time (sec): 254.94 - samples/sec: 4928.21 - lr: 0.100000
2023-04-06 01:49:28,405 epoch 19 - iter 1860/3720 - loss 0.03746305 - time (sec): 319.03 - samples/sec: 4928.28 - lr: 0.100000
2023-04-06 01:50:33,078 epoch 19 - iter 2232/3720 - loss 0.03737705 - time (sec): 383.70 - samples/sec: 4924.75 - lr: 0.100000
2023-04-06 01:51:36,689 epoch 19 - iter 2604/3720 - loss 0.03751033 - time (sec): 447.31 - samples/sec: 4933.01 - lr: 0.100000
2023-04-06 01:52:39,422 epoch 19 - iter 2976/3720 - loss 0.03767277 - time (sec): 510.04 - samples/sec: 4945.22 - lr: 0.100000
2023-04-06 01:53:41,842 epoch 19 - iter 3348/3720 - loss 0.03771265 - time (sec): 572.46 - samples/sec: 4956.63 - lr: 0.100000
2023-04-06 01:54:44,403 epoch 19 - iter 3720/3720 - loss 0.03780869 - time (sec): 635.02 - samples/sec: 4963.14 - lr: 0.100000
2023-04-06 01:54:44,404 ----------------------------------------------------------------------------------------------------
2023-04-06 01:54:44,404 EPOCH 19 done: loss 0.0378 - lr 0.100000
2023-04-06 01:54:44,404 BAD EPOCHS (no improvement): 0
2023-04-06 01:54:44,406 ----------------------------------------------------------------------------------------------------
2023-04-06 01:55:47,875 epoch 20 - iter 372/3720 - loss 0.03785798 - time (sec): 63.47 - samples/sec: 4947.77 - lr: 0.100000
2023-04-06 01:56:51,003 epoch 20 - iter 744/3720 - loss 0.03783281 - time (sec): 126.60 - samples/sec: 4952.04 - lr: 0.100000
2023-04-06 01:57:53,716 epoch 20 - iter 1116/3720 - loss 0.03755687 - time (sec): 189.31 - samples/sec: 4981.33 - lr: 0.100000
2023-04-06 01:58:56,752 epoch 20 - iter 1488/3720 - loss 0.03759708 - time (sec): 252.35 - samples/sec: 4986.73 - lr: 0.100000
2023-04-06 02:00:01,439 epoch 20 - iter 1860/3720 - loss 0.03753506 - time (sec): 317.03 - samples/sec: 4970.13 - lr: 0.100000
2023-04-06 02:01:05,828 epoch 20 - iter 2232/3720 - loss 0.03742842 - time (sec): 381.42 - samples/sec: 4959.00 - lr: 0.100000
2023-04-06 02:02:08,245 epoch 20 - iter 2604/3720 - loss 0.03733267 - time (sec): 443.84 - samples/sec: 4973.81 - lr: 0.100000
2023-04-06 02:03:11,107 epoch 20 - iter 2976/3720 - loss 0.03736629 - time (sec): 506.70 - samples/sec: 4976.22 - lr: 0.100000
2023-04-06 02:04:13,467 epoch 20 - iter 3348/3720 - loss 0.03749929 - time (sec): 569.06 - samples/sec: 4982.46 - lr: 0.100000
2023-04-06 02:05:16,497 epoch 20 - iter 3720/3720 - loss 0.03755917 - time (sec): 632.09 - samples/sec: 4986.17 - lr: 0.100000
2023-04-06 02:05:16,498 ----------------------------------------------------------------------------------------------------
2023-04-06 02:05:16,498 EPOCH 20 done: loss 0.0376 - lr 0.100000
2023-04-06 02:05:16,498 BAD EPOCHS (no improvement): 0
2023-04-06 02:05:16,501 ----------------------------------------------------------------------------------------------------
2023-04-06 02:06:19,401 epoch 21 - iter 372/3720 - loss 0.03719930 - time (sec): 62.90 - samples/sec: 5020.08 - lr: 0.100000
2023-04-06 02:07:21,749 epoch 21 - iter 744/3720 - loss 0.03706269 - time (sec): 125.25 - samples/sec: 5043.64 - lr: 0.100000
2023-04-06 02:08:23,831 epoch 21 - iter 1116/3720 - loss 0.03711430 - time (sec): 187.33 - samples/sec: 5051.85 - lr: 0.100000
2023-04-06 02:09:26,831 epoch 21 - iter 1488/3720 - loss 0.03725920 - time (sec): 250.33 - samples/sec: 5039.63 - lr: 0.100000
2023-04-06 02:10:29,874 epoch 21 - iter 1860/3720 - loss 0.03730278 - time (sec): 313.37 - samples/sec: 5027.99 - lr: 0.100000
2023-04-06 02:11:32,149 epoch 21 - iter 2232/3720 - loss 0.03721740 - time (sec): 375.65 - samples/sec: 5032.88 - lr: 0.100000
2023-04-06 02:12:34,707 epoch 21 - iter 2604/3720 - loss 0.03732940 - time (sec): 438.21 - samples/sec: 5035.33 - lr: 0.100000
2023-04-06 02:13:37,977 epoch 21 - iter 2976/3720 - loss 0.03738477 - time (sec): 501.48 - samples/sec: 5033.70 - lr: 0.100000
2023-04-06 02:14:41,524 epoch 21 - iter 3348/3720 - loss 0.03738079 - time (sec): 565.02 - samples/sec: 5023.15 - lr: 0.100000
2023-04-06 02:15:44,618 epoch 21 - iter 3720/3720 - loss 0.03743202 - time (sec): 628.12 - samples/sec: 5017.71 - lr: 0.100000
2023-04-06 02:15:44,618 ----------------------------------------------------------------------------------------------------
2023-04-06 02:15:44,618 EPOCH 21 done: loss 0.0374 - lr 0.100000
2023-04-06 02:15:44,618 BAD EPOCHS (no improvement): 0
2023-04-06 02:15:44,621 ----------------------------------------------------------------------------------------------------
2023-04-06 02:16:47,511 epoch 22 - iter 372/3720 - loss 0.03606527 - time (sec): 62.89 - samples/sec: 4997.69 - lr: 0.100000
2023-04-06 02:17:49,824 epoch 22 - iter 744/3720 - loss 0.03654979 - time (sec): 125.20 - samples/sec: 5013.71 - lr: 0.100000
2023-04-06 02:18:53,656 epoch 22 - iter 1116/3720 - loss 0.03663016 - time (sec): 189.03 - samples/sec: 4987.13 - lr: 0.100000
2023-04-06 02:19:57,426 epoch 22 - iter 1488/3720 - loss 0.03682940 - time (sec): 252.80 - samples/sec: 4978.60 - lr: 0.100000
2023-04-06 02:21:00,182 epoch 22 - iter 1860/3720 - loss 0.03683460 - time (sec): 315.56 - samples/sec: 4992.24 - lr: 0.100000
2023-04-06 02:22:04,607 epoch 22 - iter 2232/3720 - loss 0.03700043 - time (sec): 379.99 - samples/sec: 4981.35 - lr: 0.100000
2023-04-06 02:23:08,231 epoch 22 - iter 2604/3720 - loss 0.03707551 - time (sec): 443.61 - samples/sec: 4973.72 - lr: 0.100000
2023-04-06 02:24:11,920 epoch 22 - iter 2976/3720 - loss 0.03710649 - time (sec): 507.30 - samples/sec: 4968.95 - lr: 0.100000
2023-04-06 02:25:15,697 epoch 22 - iter 3348/3720 - loss 0.03725444 - time (sec): 571.07 - samples/sec: 4965.36 - lr: 0.100000
2023-04-06 02:26:19,110 epoch 22 - iter 3720/3720 - loss 0.03728125 - time (sec): 634.49 - samples/sec: 4967.33 - lr: 0.100000
2023-04-06 02:26:19,110 ----------------------------------------------------------------------------------------------------
2023-04-06 02:26:19,110 EPOCH 22 done: loss 0.0373 - lr 0.100000
2023-04-06 02:26:19,110 BAD EPOCHS (no improvement): 0
2023-04-06 02:26:19,113 ----------------------------------------------------------------------------------------------------
2023-04-06 02:27:22,264 epoch 23 - iter 372/3720 - loss 0.03708948 - time (sec): 63.15 - samples/sec: 4991.98 - lr: 0.100000
2023-04-06 02:28:24,744 epoch 23 - iter 744/3720 - loss 0.03701560 - time (sec): 125.63 - samples/sec: 5019.13 - lr: 0.100000
2023-04-06 02:29:27,416 epoch 23 - iter 1116/3720 - loss 0.03711337 - time (sec): 188.30 - samples/sec: 5006.78 - lr: 0.100000
2023-04-06 02:30:30,607 epoch 23 - iter 1488/3720 - loss 0.03713993 - time (sec): 251.49 - samples/sec: 4995.53 - lr: 0.100000
2023-04-06 02:31:31,911 epoch 23 - iter 1860/3720 - loss 0.03718061 - time (sec): 312.80 - samples/sec: 5019.09 - lr: 0.100000
2023-04-06 02:32:36,140 epoch 23 - iter 2232/3720 - loss 0.03707327 - time (sec): 377.03 - samples/sec: 5010.33 - lr: 0.100000
2023-04-06 02:33:39,471 epoch 23 - iter 2604/3720 - loss 0.03709395 - time (sec): 440.36 - samples/sec: 5009.17 - lr: 0.100000
2023-04-06 02:34:43,117 epoch 23 - iter 2976/3720 - loss 0.03709204 - time (sec): 504.00 - samples/sec: 5002.85 - lr: 0.100000
2023-04-06 02:35:45,820 epoch 23 - iter 3348/3720 - loss 0.03713828 - time (sec): 566.71 - samples/sec: 5004.31 - lr: 0.100000
2023-04-06 02:36:48,575 epoch 23 - iter 3720/3720 - loss 0.03713727 - time (sec): 629.46 - samples/sec: 5006.99 - lr: 0.100000
2023-04-06 02:36:48,576 ----------------------------------------------------------------------------------------------------
2023-04-06 02:36:48,576 EPOCH 23 done: loss 0.0371 - lr 0.100000
2023-04-06 02:36:48,576 BAD EPOCHS (no improvement): 0
2023-04-06 02:36:48,579 ----------------------------------------------------------------------------------------------------
2023-04-06 02:37:51,265 epoch 24 - iter 372/3720 - loss 0.03669695 - time (sec): 62.69 - samples/sec: 5040.67 - lr: 0.100000
2023-04-06 02:38:53,921 epoch 24 - iter 744/3720 - loss 0.03662117 - time (sec): 125.34 - samples/sec: 5035.38 - lr: 0.100000
2023-04-06 02:39:57,107 epoch 24 - iter 1116/3720 - loss 0.03658693 - time (sec): 188.53 - samples/sec: 5012.98 - lr: 0.100000
2023-04-06 02:40:59,524 epoch 24 - iter 1488/3720 - loss 0.03647600 - time (sec): 250.94 - samples/sec: 5008.04 - lr: 0.100000
2023-04-06 02:42:02,608 epoch 24 - iter 1860/3720 - loss 0.03640431 - time (sec): 314.03 - samples/sec: 5009.09 - lr: 0.100000
2023-04-06 02:43:05,716 epoch 24 - iter 2232/3720 - loss 0.03666376 - time (sec): 377.14 - samples/sec: 5004.81 - lr: 0.100000
2023-04-06 02:44:09,570 epoch 24 - iter 2604/3720 - loss 0.03676543 - time (sec): 440.99 - samples/sec: 5000.25 - lr: 0.100000
2023-04-06 02:45:13,636 epoch 24 - iter 2976/3720 - loss 0.03679277 - time (sec): 505.06 - samples/sec: 4994.02 - lr: 0.100000
2023-04-06 02:46:16,146 epoch 24 - iter 3348/3720 - loss 0.03692790 - time (sec): 567.57 - samples/sec: 5000.04 - lr: 0.100000
2023-04-06 02:47:18,346 epoch 24 - iter 3720/3720 - loss 0.03693011 - time (sec): 629.77 - samples/sec: 5004.57 - lr: 0.100000
2023-04-06 02:47:18,346 ----------------------------------------------------------------------------------------------------
2023-04-06 02:47:18,346 EPOCH 24 done: loss 0.0369 - lr 0.100000
2023-04-06 02:47:18,346 BAD EPOCHS (no improvement): 0
2023-04-06 02:47:18,350 ----------------------------------------------------------------------------------------------------
2023-04-06 02:48:22,056 epoch 25 - iter 372/3720 - loss 0.03576623 - time (sec): 63.71 - samples/sec: 4970.15 - lr: 0.100000
2023-04-06 02:49:26,057 epoch 25 - iter 744/3720 - loss 0.03617317 - time (sec): 127.71 - samples/sec: 4948.50 - lr: 0.100000
2023-04-06 02:50:28,898 epoch 25 - iter 1116/3720 - loss 0.03635723 - time (sec): 190.55 - samples/sec: 4975.37 - lr: 0.100000
2023-04-06 02:51:32,665 epoch 25 - iter 1488/3720 - loss 0.03677751 - time (sec): 254.32 - samples/sec: 4972.70 - lr: 0.100000
2023-04-06 02:52:35,999 epoch 25 - iter 1860/3720 - loss 0.03665019 - time (sec): 317.65 - samples/sec: 4971.71 - lr: 0.100000
2023-04-06 02:53:38,392 epoch 25 - iter 2232/3720 - loss 0.03671172 - time (sec): 380.04 - samples/sec: 4980.30 - lr: 0.100000
2023-04-06 02:54:41,734 epoch 25 - iter 2604/3720 - loss 0.03675279 - time (sec): 443.38 - samples/sec: 4982.06 - lr: 0.100000
2023-04-06 02:55:45,716 epoch 25 - iter 2976/3720 - loss 0.03678682 - time (sec): 507.37 - samples/sec: 4976.74 - lr: 0.100000
2023-04-06 02:56:48,279 epoch 25 - iter 3348/3720 - loss 0.03686545 - time (sec): 569.93 - samples/sec: 4980.01 - lr: 0.100000
2023-04-06 02:57:50,665 epoch 25 - iter 3720/3720 - loss 0.03688441 - time (sec): 632.31 - samples/sec: 4984.40 - lr: 0.100000
2023-04-06 02:57:50,665 ----------------------------------------------------------------------------------------------------
2023-04-06 02:57:50,666 EPOCH 25 done: loss 0.0369 - lr 0.100000
2023-04-06 02:57:50,666 BAD EPOCHS (no improvement): 0
2023-04-06 02:57:50,669 ----------------------------------------------------------------------------------------------------
2023-04-06 02:58:52,529 epoch 26 - iter 372/3720 - loss 0.03575018 - time (sec): 61.86 - samples/sec: 5080.71 - lr: 0.100000
2023-04-06 02:59:54,967 epoch 26 - iter 744/3720 - loss 0.03589772 - time (sec): 124.30 - samples/sec: 5054.73 - lr: 0.100000
2023-04-06 03:00:57,363 epoch 26 - iter 1116/3720 - loss 0.03617339 - time (sec): 186.69 - samples/sec: 5062.93 - lr: 0.100000
2023-04-06 03:02:00,007 epoch 26 - iter 1488/3720 - loss 0.03617761 - time (sec): 249.34 - samples/sec: 5061.16 - lr: 0.100000
2023-04-06 03:03:03,023 epoch 26 - iter 1860/3720 - loss 0.03626125 - time (sec): 312.35 - samples/sec: 5041.33 - lr: 0.100000
2023-04-06 03:04:06,741 epoch 26 - iter 2232/3720 - loss 0.03657375 - time (sec): 376.07 - samples/sec: 5021.13 - lr: 0.100000
2023-04-06 03:05:09,733 epoch 26 - iter 2604/3720 - loss 0.03664309 - time (sec): 439.06 - samples/sec: 5016.93 - lr: 0.100000
2023-04-06 03:06:12,591 epoch 26 - iter 2976/3720 - loss 0.03668327 - time (sec): 501.92 - samples/sec: 5022.02 - lr: 0.100000
2023-04-06 03:07:14,762 epoch 26 - iter 3348/3720 - loss 0.03666879 - time (sec): 564.09 - samples/sec: 5025.53 - lr: 0.100000
2023-04-06 03:08:19,046 epoch 26 - iter 3720/3720 - loss 0.03682200 - time (sec): 628.38 - samples/sec: 5015.64 - lr: 0.100000
2023-04-06 03:08:19,046 ----------------------------------------------------------------------------------------------------
2023-04-06 03:08:19,046 EPOCH 26 done: loss 0.0368 - lr 0.100000
2023-04-06 03:08:19,046 BAD EPOCHS (no improvement): 0
2023-04-06 03:08:19,049 ----------------------------------------------------------------------------------------------------
2023-04-06 03:09:22,636 epoch 27 - iter 372/3720 - loss 0.03591577 - time (sec): 63.59 - samples/sec: 4994.14 - lr: 0.100000
2023-04-06 03:10:25,528 epoch 27 - iter 744/3720 - loss 0.03638401 - time (sec): 126.48 - samples/sec: 5006.03 - lr: 0.100000
2023-04-06 03:11:27,613 epoch 27 - iter 1116/3720 - loss 0.03619447 - time (sec): 188.56 - samples/sec: 5019.48 - lr: 0.100000
2023-04-06 03:12:30,143 epoch 27 - iter 1488/3720 - loss 0.03647944 - time (sec): 251.09 - samples/sec: 5027.02 - lr: 0.100000
2023-04-06 03:13:32,505 epoch 27 - iter 1860/3720 - loss 0.03656158 - time (sec): 313.46 - samples/sec: 5030.90 - lr: 0.100000
2023-04-06 03:14:36,285 epoch 27 - iter 2232/3720 - loss 0.03654736 - time (sec): 377.24 - samples/sec: 5014.81 - lr: 0.100000
2023-04-06 03:15:39,734 epoch 27 - iter 2604/3720 - loss 0.03661380 - time (sec): 440.68 - samples/sec: 5008.20 - lr: 0.100000
2023-04-06 03:16:44,611 epoch 27 - iter 2976/3720 - loss 0.03667561 - time (sec): 505.56 - samples/sec: 4991.99 - lr: 0.100000
2023-04-06 03:17:47,683 epoch 27 - iter 3348/3720 - loss 0.03672948 - time (sec): 568.63 - samples/sec: 4992.51 - lr: 0.100000
2023-04-06 03:18:49,995 epoch 27 - iter 3720/3720 - loss 0.03675441 - time (sec): 630.95 - samples/sec: 4995.22 - lr: 0.100000
2023-04-06 03:18:49,995 ----------------------------------------------------------------------------------------------------
2023-04-06 03:18:49,995 EPOCH 27 done: loss 0.0368 - lr 0.100000
2023-04-06 03:18:49,995 BAD EPOCHS (no improvement): 0
2023-04-06 03:18:49,998 ----------------------------------------------------------------------------------------------------
2023-04-06 03:19:53,813 epoch 28 - iter 372/3720 - loss 0.03629755 - time (sec): 63.82 - samples/sec: 4956.29 - lr: 0.100000
2023-04-06 03:20:56,823 epoch 28 - iter 744/3720 - loss 0.03569959 - time (sec): 126.82 - samples/sec: 4979.95 - lr: 0.100000
2023-04-06 03:21:59,289 epoch 28 - iter 1116/3720 - loss 0.03576099 - time (sec): 189.29 - samples/sec: 4991.02 - lr: 0.100000
2023-04-06 03:23:02,352 epoch 28 - iter 1488/3720 - loss 0.03573978 - time (sec): 252.35 - samples/sec: 4987.32 - lr: 0.100000
2023-04-06 03:24:06,746 epoch 28 - iter 1860/3720 - loss 0.03585206 - time (sec): 316.75 - samples/sec: 4969.49 - lr: 0.100000
2023-04-06 03:25:09,793 epoch 28 - iter 2232/3720 - loss 0.03610828 - time (sec): 379.79 - samples/sec: 4975.70 - lr: 0.100000
2023-04-06 03:26:12,683 epoch 28 - iter 2604/3720 - loss 0.03606630 - time (sec): 442.69 - samples/sec: 4979.70 - lr: 0.100000
2023-04-06 03:27:15,150 epoch 28 - iter 2976/3720 - loss 0.03629244 - time (sec): 505.15 - samples/sec: 4987.77 - lr: 0.100000
2023-04-06 03:28:17,959 epoch 28 - iter 3348/3720 - loss 0.03635761 - time (sec): 567.96 - samples/sec: 4990.24 - lr: 0.100000
2023-04-06 03:29:22,429 epoch 28 - iter 3720/3720 - loss 0.03647368 - time (sec): 632.43 - samples/sec: 4983.49 - lr: 0.100000
2023-04-06 03:29:22,429 ----------------------------------------------------------------------------------------------------
2023-04-06 03:29:22,429 EPOCH 28 done: loss 0.0365 - lr 0.100000
2023-04-06 03:29:22,429 BAD EPOCHS (no improvement): 0
2023-04-06 03:29:22,432 ----------------------------------------------------------------------------------------------------
2023-04-06 03:30:26,512 epoch 29 - iter 372/3720 - loss 0.03580840 - time (sec): 64.08 - samples/sec: 4913.90 - lr: 0.100000
2023-04-06 03:31:29,880 epoch 29 - iter 744/3720 - loss 0.03558584 - time (sec): 127.45 - samples/sec: 4946.21 - lr: 0.100000
2023-04-06 03:32:33,002 epoch 29 - iter 1116/3720 - loss 0.03595706 - time (sec): 190.57 - samples/sec: 4968.49 - lr: 0.100000
2023-04-06 03:33:35,317 epoch 29 - iter 1488/3720 - loss 0.03607640 - time (sec): 252.89 - samples/sec: 4996.88 - lr: 0.100000
2023-04-06 03:34:37,513 epoch 29 - iter 1860/3720 - loss 0.03631701 - time (sec): 315.08 - samples/sec: 5010.68 - lr: 0.100000
2023-04-06 03:35:40,490 epoch 29 - iter 2232/3720 - loss 0.03634836 - time (sec): 378.06 - samples/sec: 5019.47 - lr: 0.100000
2023-04-06 03:36:42,442 epoch 29 - iter 2604/3720 - loss 0.03643398 - time (sec): 440.01 - samples/sec: 5023.07 - lr: 0.100000
2023-04-06 03:37:44,939 epoch 29 - iter 2976/3720 - loss 0.03647132 - time (sec): 502.51 - samples/sec: 5025.27 - lr: 0.100000
2023-04-06 03:38:48,775 epoch 29 - iter 3348/3720 - loss 0.03648527 - time (sec): 566.34 - samples/sec: 5013.01 - lr: 0.100000
2023-04-06 03:39:51,756 epoch 29 - iter 3720/3720 - loss 0.03656561 - time (sec): 629.32 - samples/sec: 5008.09 - lr: 0.100000
2023-04-06 03:39:51,757 ----------------------------------------------------------------------------------------------------
2023-04-06 03:39:51,757 EPOCH 29 done: loss 0.0366 - lr 0.100000
2023-04-06 03:39:51,757 BAD EPOCHS (no improvement): 1
2023-04-06 03:39:51,759 ----------------------------------------------------------------------------------------------------
2023-04-06 03:40:54,782 epoch 30 - iter 372/3720 - loss 0.03622456 - time (sec): 63.02 - samples/sec: 5011.96 - lr: 0.100000
2023-04-06 03:41:58,477 epoch 30 - iter 744/3720 - loss 0.03596512 - time (sec): 126.72 - samples/sec: 4993.34 - lr: 0.100000
2023-04-06 03:43:00,990 epoch 30 - iter 1116/3720 - loss 0.03581400 - time (sec): 189.23 - samples/sec: 4991.87 - lr: 0.100000
2023-04-06 03:44:03,955 epoch 30 - iter 1488/3720 - loss 0.03595400 - time (sec): 252.20 - samples/sec: 4997.03 - lr: 0.100000
2023-04-06 03:45:05,948 epoch 30 - iter 1860/3720 - loss 0.03586498 - time (sec): 314.19 - samples/sec: 5012.81 - lr: 0.100000
2023-04-06 03:46:08,866 epoch 30 - iter 2232/3720 - loss 0.03587263 - time (sec): 377.11 - samples/sec: 5013.25 - lr: 0.100000
2023-04-06 03:47:12,458 epoch 30 - iter 2604/3720 - loss 0.03601574 - time (sec): 440.70 - samples/sec: 5004.52 - lr: 0.100000
2023-04-06 03:48:15,915 epoch 30 - iter 2976/3720 - loss 0.03621407 - time (sec): 504.16 - samples/sec: 5002.41 - lr: 0.100000
2023-04-06 03:49:19,611 epoch 30 - iter 3348/3720 - loss 0.03629892 - time (sec): 567.85 - samples/sec: 4995.75 - lr: 0.100000
2023-04-06 03:50:23,857 epoch 30 - iter 3720/3720 - loss 0.03638563 - time (sec): 632.10 - samples/sec: 4986.12 - lr: 0.100000
2023-04-06 03:50:23,857 ----------------------------------------------------------------------------------------------------
2023-04-06 03:50:23,857 EPOCH 30 done: loss 0.0364 - lr 0.100000
2023-04-06 03:50:23,857 BAD EPOCHS (no improvement): 0
2023-04-06 03:50:23,861 ----------------------------------------------------------------------------------------------------
2023-04-06 03:51:27,597 epoch 31 - iter 372/3720 - loss 0.03589866 - time (sec): 63.74 - samples/sec: 4935.55 - lr: 0.100000
2023-04-06 03:52:30,594 epoch 31 - iter 744/3720 - loss 0.03591985 - time (sec): 126.73 - samples/sec: 4980.31 - lr: 0.100000
2023-04-06 03:53:34,478 epoch 31 - iter 1116/3720 - loss 0.03608791 - time (sec): 190.62 - samples/sec: 4969.68 - lr: 0.100000
2023-04-06 03:54:37,550 epoch 31 - iter 1488/3720 - loss 0.03605652 - time (sec): 253.69 - samples/sec: 4976.81 - lr: 0.100000
2023-04-06 03:55:40,506 epoch 31 - iter 1860/3720 - loss 0.03627127 - time (sec): 316.65 - samples/sec: 4975.40 - lr: 0.100000
2023-04-06 03:56:45,133 epoch 31 - iter 2232/3720 - loss 0.03623942 - time (sec): 381.27 - samples/sec: 4961.76 - lr: 0.100000
2023-04-06 03:57:48,120 epoch 31 - iter 2604/3720 - loss 0.03643650 - time (sec): 444.26 - samples/sec: 4962.42 - lr: 0.100000
2023-04-06 03:58:51,606 epoch 31 - iter 2976/3720 - loss 0.03644714 - time (sec): 507.74 - samples/sec: 4962.08 - lr: 0.100000
2023-04-06 03:59:55,671 epoch 31 - iter 3348/3720 - loss 0.03651080 - time (sec): 571.81 - samples/sec: 4959.66 - lr: 0.100000
2023-04-06 04:00:59,712 epoch 31 - iter 3720/3720 - loss 0.03653519 - time (sec): 635.85 - samples/sec: 4956.68 - lr: 0.100000
2023-04-06 04:00:59,713 ----------------------------------------------------------------------------------------------------
2023-04-06 04:00:59,713 EPOCH 31 done: loss 0.0365 - lr 0.100000
2023-04-06 04:00:59,713 BAD EPOCHS (no improvement): 1
2023-04-06 04:00:59,716 ----------------------------------------------------------------------------------------------------
2023-04-06 04:02:03,637 epoch 32 - iter 372/3720 - loss 0.03422510 - time (sec): 63.92 - samples/sec: 4951.84 - lr: 0.100000
2023-04-06 04:03:05,658 epoch 32 - iter 744/3720 - loss 0.03547466 - time (sec): 125.94 - samples/sec: 5019.51 - lr: 0.100000
2023-04-06 04:04:08,997 epoch 32 - iter 1116/3720 - loss 0.03580223 - time (sec): 189.28 - samples/sec: 4997.24 - lr: 0.100000
2023-04-06 04:05:13,508 epoch 32 - iter 1488/3720 - loss 0.03599954 - time (sec): 253.79 - samples/sec: 4977.40 - lr: 0.100000
2023-04-06 04:06:16,227 epoch 32 - iter 1860/3720 - loss 0.03621192 - time (sec): 316.51 - samples/sec: 4983.23 - lr: 0.100000
2023-04-06 04:07:18,991 epoch 32 - iter 2232/3720 - loss 0.03622746 - time (sec): 379.28 - samples/sec: 4987.48 - lr: 0.100000
2023-04-06 04:08:22,113 epoch 32 - iter 2604/3720 - loss 0.03620039 - time (sec): 442.40 - samples/sec: 4988.51 - lr: 0.100000
2023-04-06 04:09:25,150 epoch 32 - iter 2976/3720 - loss 0.03629937 - time (sec): 505.43 - samples/sec: 4990.05 - lr: 0.100000
2023-04-06 04:10:28,052 epoch 32 - iter 3348/3720 - loss 0.03632866 - time (sec): 568.34 - samples/sec: 4988.58 - lr: 0.100000
2023-04-06 04:11:32,068 epoch 32 - iter 3720/3720 - loss 0.03640496 - time (sec): 632.35 - samples/sec: 4984.11 - lr: 0.100000
2023-04-06 04:11:32,068 ----------------------------------------------------------------------------------------------------
2023-04-06 04:11:32,069 EPOCH 32 done: loss 0.0364 - lr 0.100000
2023-04-06 04:11:32,069 BAD EPOCHS (no improvement): 2
2023-04-06 04:11:32,072 ----------------------------------------------------------------------------------------------------
2023-04-06 04:12:34,566 epoch 33 - iter 372/3720 - loss 0.03615619 - time (sec): 62.49 - samples/sec: 5045.81 - lr: 0.100000
2023-04-06 04:13:39,034 epoch 33 - iter 744/3720 - loss 0.03617709 - time (sec): 126.96 - samples/sec: 4972.09 - lr: 0.100000
2023-04-06 04:14:42,381 epoch 33 - iter 1116/3720 - loss 0.03629354 - time (sec): 190.31 - samples/sec: 4975.96 - lr: 0.100000
2023-04-06 04:15:45,322 epoch 33 - iter 1488/3720 - loss 0.03613818 - time (sec): 253.25 - samples/sec: 4977.67 - lr: 0.100000
2023-04-06 04:16:47,594 epoch 33 - iter 1860/3720 - loss 0.03617935 - time (sec): 315.52 - samples/sec: 4996.51 - lr: 0.100000
2023-04-06 04:17:51,516 epoch 33 - iter 2232/3720 - loss 0.03633921 - time (sec): 379.44 - samples/sec: 4979.41 - lr: 0.100000
2023-04-06 04:18:55,881 epoch 33 - iter 2604/3720 - loss 0.03629361 - time (sec): 443.81 - samples/sec: 4968.91 - lr: 0.100000
2023-04-06 04:19:59,049 epoch 33 - iter 2976/3720 - loss 0.03635164 - time (sec): 506.98 - samples/sec: 4971.74 - lr: 0.100000
2023-04-06 04:21:03,289 epoch 33 - iter 3348/3720 - loss 0.03624671 - time (sec): 571.22 - samples/sec: 4965.33 - lr: 0.100000
2023-04-06 04:22:06,801 epoch 33 - iter 3720/3720 - loss 0.03622238 - time (sec): 634.73 - samples/sec: 4965.45 - lr: 0.100000
2023-04-06 04:22:06,801 ----------------------------------------------------------------------------------------------------
2023-04-06 04:22:06,802 EPOCH 33 done: loss 0.0362 - lr 0.100000
2023-04-06 04:22:06,802 BAD EPOCHS (no improvement): 0
2023-04-06 04:22:06,805 ----------------------------------------------------------------------------------------------------
2023-04-06 04:23:10,915 epoch 34 - iter 372/3720 - loss 0.03661503 - time (sec): 64.11 - samples/sec: 4942.61 - lr: 0.100000
2023-04-06 04:24:13,440 epoch 34 - iter 744/3720 - loss 0.03657963 - time (sec): 126.64 - samples/sec: 4990.14 - lr: 0.100000
2023-04-06 04:25:16,466 epoch 34 - iter 1116/3720 - loss 0.03647274 - time (sec): 189.66 - samples/sec: 4981.26 - lr: 0.100000
2023-04-06 04:26:17,810 epoch 34 - iter 1488/3720 - loss 0.03660441 - time (sec): 251.01 - samples/sec: 5009.35 - lr: 0.100000
2023-04-06 04:27:21,461 epoch 34 - iter 1860/3720 - loss 0.03655032 - time (sec): 314.66 - samples/sec: 5000.17 - lr: 0.100000
2023-04-06 04:28:25,783 epoch 34 - iter 2232/3720 - loss 0.03654797 - time (sec): 378.98 - samples/sec: 4988.08 - lr: 0.100000
2023-04-06 04:29:30,360 epoch 34 - iter 2604/3720 - loss 0.03664691 - time (sec): 443.55 - samples/sec: 4975.96 - lr: 0.100000
2023-04-06 04:30:33,471 epoch 34 - iter 2976/3720 - loss 0.03655137 - time (sec): 506.67 - samples/sec: 4977.32 - lr: 0.100000
2023-04-06 04:31:35,798 epoch 34 - iter 3348/3720 - loss 0.03650225 - time (sec): 568.99 - samples/sec: 4981.06 - lr: 0.100000
2023-04-06 04:32:38,474 epoch 34 - iter 3720/3720 - loss 0.03654818 - time (sec): 631.67 - samples/sec: 4989.50 - lr: 0.100000
2023-04-06 04:32:38,474 ----------------------------------------------------------------------------------------------------
2023-04-06 04:32:38,474 EPOCH 34 done: loss 0.0365 - lr 0.100000
2023-04-06 04:32:38,474 BAD EPOCHS (no improvement): 1
2023-04-06 04:32:38,477 ----------------------------------------------------------------------------------------------------
2023-04-06 04:33:40,730 epoch 35 - iter 372/3720 - loss 0.03554340 - time (sec): 62.25 - samples/sec: 5068.43 - lr: 0.100000
2023-04-06 04:34:44,385 epoch 35 - iter 744/3720 - loss 0.03577875 - time (sec): 125.91 - samples/sec: 5018.61 - lr: 0.100000
2023-04-06 04:35:47,589 epoch 35 - iter 1116/3720 - loss 0.03598251 - time (sec): 189.11 - samples/sec: 5003.48 - lr: 0.100000
2023-04-06 04:36:51,082 epoch 35 - iter 1488/3720 - loss 0.03586585 - time (sec): 252.60 - samples/sec: 4989.90 - lr: 0.100000
2023-04-06 04:37:53,233 epoch 35 - iter 1860/3720 - loss 0.03602181 - time (sec): 314.76 - samples/sec: 5006.38 - lr: 0.100000
2023-04-06 04:38:56,781 epoch 35 - iter 2232/3720 - loss 0.03594249 - time (sec): 378.30 - samples/sec: 4997.93 - lr: 0.100000
2023-04-06 04:40:01,588 epoch 35 - iter 2604/3720 - loss 0.03596076 - time (sec): 443.11 - samples/sec: 4982.08 - lr: 0.100000
2023-04-06 04:41:05,095 epoch 35 - iter 2976/3720 - loss 0.03609682 - time (sec): 506.62 - samples/sec: 4975.33 - lr: 0.100000
2023-04-06 04:42:08,145 epoch 35 - iter 3348/3720 - loss 0.03609989 - time (sec): 569.67 - samples/sec: 4978.05 - lr: 0.100000
2023-04-06 04:43:10,774 epoch 35 - iter 3720/3720 - loss 0.03625924 - time (sec): 632.30 - samples/sec: 4984.55 - lr: 0.100000
2023-04-06 04:43:10,775 ----------------------------------------------------------------------------------------------------
2023-04-06 04:43:10,775 EPOCH 35 done: loss 0.0363 - lr 0.100000
2023-04-06 04:43:10,775 BAD EPOCHS (no improvement): 2
2023-04-06 04:43:10,777 ----------------------------------------------------------------------------------------------------
2023-04-06 04:44:13,548 epoch 36 - iter 372/3720 - loss 0.03547997 - time (sec): 62.77 - samples/sec: 5023.36 - lr: 0.100000
2023-04-06 04:45:17,165 epoch 36 - iter 744/3720 - loss 0.03556230 - time (sec): 126.39 - samples/sec: 4987.40 - lr: 0.100000
2023-04-06 04:46:19,825 epoch 36 - iter 1116/3720 - loss 0.03557631 - time (sec): 189.05 - samples/sec: 5008.21 - lr: 0.100000
2023-04-06 04:47:21,810 epoch 36 - iter 1488/3720 - loss 0.03593285 - time (sec): 251.03 - samples/sec: 5024.33 - lr: 0.100000
2023-04-06 04:48:24,759 epoch 36 - iter 1860/3720 - loss 0.03604706 - time (sec): 313.98 - samples/sec: 5021.56 - lr: 0.100000
2023-04-06 04:49:28,723 epoch 36 - iter 2232/3720 - loss 0.03585887 - time (sec): 377.95 - samples/sec: 5012.08 - lr: 0.100000
2023-04-06 04:50:31,251 epoch 36 - iter 2604/3720 - loss 0.03595801 - time (sec): 440.47 - samples/sec: 5015.64 - lr: 0.100000
2023-04-06 04:51:33,011 epoch 36 - iter 2976/3720 - loss 0.03602714 - time (sec): 502.23 - samples/sec: 5023.84 - lr: 0.100000
2023-04-06 04:52:36,148 epoch 36 - iter 3348/3720 - loss 0.03612962 - time (sec): 565.37 - samples/sec: 5022.99 - lr: 0.100000
2023-04-06 04:53:39,499 epoch 36 - iter 3720/3720 - loss 0.03613866 - time (sec): 628.72 - samples/sec: 5012.89 - lr: 0.100000
2023-04-06 04:53:39,499 ----------------------------------------------------------------------------------------------------
2023-04-06 04:53:39,499 EPOCH 36 done: loss 0.0361 - lr 0.100000
2023-04-06 04:53:39,499 BAD EPOCHS (no improvement): 0
2023-04-06 04:53:39,506 ----------------------------------------------------------------------------------------------------
2023-04-06 04:54:43,630 epoch 37 - iter 372/3720 - loss 0.03588182 - time (sec): 64.12 - samples/sec: 4935.21 - lr: 0.100000
2023-04-06 04:55:46,708 epoch 37 - iter 744/3720 - loss 0.03648884 - time (sec): 127.20 - samples/sec: 4972.26 - lr: 0.100000
2023-04-06 04:56:49,734 epoch 37 - iter 1116/3720 - loss 0.03638007 - time (sec): 190.23 - samples/sec: 4976.04 - lr: 0.100000
2023-04-06 04:57:53,201 epoch 37 - iter 1488/3720 - loss 0.03629532 - time (sec): 253.69 - samples/sec: 4976.37 - lr: 0.100000
2023-04-06 04:58:56,851 epoch 37 - iter 1860/3720 - loss 0.03628672 - time (sec): 317.34 - samples/sec: 4974.04 - lr: 0.100000
2023-04-06 05:00:00,942 epoch 37 - iter 2232/3720 - loss 0.03624483 - time (sec): 381.44 - samples/sec: 4967.68 - lr: 0.100000
2023-04-06 05:01:02,835 epoch 37 - iter 2604/3720 - loss 0.03630287 - time (sec): 443.33 - samples/sec: 4979.32 - lr: 0.100000
2023-04-06 05:02:05,858 epoch 37 - iter 2976/3720 - loss 0.03618145 - time (sec): 506.35 - samples/sec: 4982.41 - lr: 0.100000
2023-04-06 05:03:08,739 epoch 37 - iter 3348/3720 - loss 0.03627277 - time (sec): 569.23 - samples/sec: 4983.45 - lr: 0.100000
2023-04-06 05:04:12,014 epoch 37 - iter 3720/3720 - loss 0.03621617 - time (sec): 632.51 - samples/sec: 4982.88 - lr: 0.100000
2023-04-06 05:04:12,015 ----------------------------------------------------------------------------------------------------
2023-04-06 05:04:12,015 EPOCH 37 done: loss 0.0362 - lr 0.100000
2023-04-06 05:04:12,015 BAD EPOCHS (no improvement): 1
2023-04-06 05:04:12,022 ----------------------------------------------------------------------------------------------------
2023-04-06 05:05:15,649 epoch 38 - iter 372/3720 - loss 0.03581758 - time (sec): 63.63 - samples/sec: 4951.67 - lr: 0.100000
2023-04-06 05:06:18,889 epoch 38 - iter 744/3720 - loss 0.03530465 - time (sec): 126.87 - samples/sec: 4966.09 - lr: 0.100000
2023-04-06 05:07:22,777 epoch 38 - iter 1116/3720 - loss 0.03588757 - time (sec): 190.75 - samples/sec: 4956.01 - lr: 0.100000
2023-04-06 05:08:25,250 epoch 38 - iter 1488/3720 - loss 0.03595468 - time (sec): 253.23 - samples/sec: 4976.78 - lr: 0.100000
2023-04-06 05:09:30,429 epoch 38 - iter 1860/3720 - loss 0.03604632 - time (sec): 318.41 - samples/sec: 4957.88 - lr: 0.100000
2023-04-06 05:10:33,993 epoch 38 - iter 2232/3720 - loss 0.03612242 - time (sec): 381.97 - samples/sec: 4957.49 - lr: 0.100000
2023-04-06 05:11:36,193 epoch 38 - iter 2604/3720 - loss 0.03620231 - time (sec): 444.17 - samples/sec: 4971.41 - lr: 0.100000
2023-04-06 05:12:38,101 epoch 38 - iter 2976/3720 - loss 0.03629202 - time (sec): 506.08 - samples/sec: 4986.44 - lr: 0.100000
2023-04-06 05:13:40,885 epoch 38 - iter 3348/3720 - loss 0.03627942 - time (sec): 568.86 - samples/sec: 4990.85 - lr: 0.100000
2023-04-06 05:14:42,225 epoch 38 - iter 3720/3720 - loss 0.03628093 - time (sec): 630.20 - samples/sec: 5001.11 - lr: 0.100000
2023-04-06 05:14:42,225 ----------------------------------------------------------------------------------------------------
2023-04-06 05:14:42,225 EPOCH 38 done: loss 0.0363 - lr 0.100000
2023-04-06 05:14:42,225 BAD EPOCHS (no improvement): 2
2023-04-06 05:14:42,232 ----------------------------------------------------------------------------------------------------
2023-04-06 05:15:45,821 epoch 39 - iter 372/3720 - loss 0.03618891 - time (sec): 63.59 - samples/sec: 4972.35 - lr: 0.100000
2023-04-06 05:16:48,199 epoch 39 - iter 744/3720 - loss 0.03589596 - time (sec): 125.97 - samples/sec: 5003.42 - lr: 0.100000
2023-04-06 05:17:52,287 epoch 39 - iter 1116/3720 - loss 0.03577781 - time (sec): 190.06 - samples/sec: 4987.77 - lr: 0.100000
2023-04-06 05:18:56,385 epoch 39 - iter 1488/3720 - loss 0.03605000 - time (sec): 254.15 - samples/sec: 4963.34 - lr: 0.100000
2023-04-06 05:19:59,898 epoch 39 - iter 1860/3720 - loss 0.03619938 - time (sec): 317.67 - samples/sec: 4963.72 - lr: 0.100000
2023-04-06 05:21:02,956 epoch 39 - iter 2232/3720 - loss 0.03616054 - time (sec): 380.72 - samples/sec: 4972.71 - lr: 0.100000
2023-04-06 05:22:05,380 epoch 39 - iter 2604/3720 - loss 0.03631325 - time (sec): 443.15 - samples/sec: 4981.75 - lr: 0.100000
2023-04-06 05:23:09,479 epoch 39 - iter 2976/3720 - loss 0.03629390 - time (sec): 507.25 - samples/sec: 4974.63 - lr: 0.100000
2023-04-06 05:24:13,514 epoch 39 - iter 3348/3720 - loss 0.03627535 - time (sec): 571.28 - samples/sec: 4971.41 - lr: 0.100000
2023-04-06 05:25:15,345 epoch 39 - iter 3720/3720 - loss 0.03633764 - time (sec): 633.11 - samples/sec: 4978.12 - lr: 0.100000
2023-04-06 05:25:15,345 ----------------------------------------------------------------------------------------------------
2023-04-06 05:25:15,345 EPOCH 39 done: loss 0.0363 - lr 0.100000
2023-04-06 05:25:15,345 BAD EPOCHS (no improvement): 3
2023-04-06 05:25:15,352 ----------------------------------------------------------------------------------------------------
2023-04-06 05:26:18,619 epoch 40 - iter 372/3720 - loss 0.03507705 - time (sec): 63.27 - samples/sec: 4954.03 - lr: 0.100000
2023-04-06 05:27:22,411 epoch 40 - iter 744/3720 - loss 0.03569404 - time (sec): 127.06 - samples/sec: 4954.66 - lr: 0.100000
2023-04-06 05:28:25,575 epoch 40 - iter 1116/3720 - loss 0.03568844 - time (sec): 190.22 - samples/sec: 4965.57 - lr: 0.100000
2023-04-06 05:29:28,086 epoch 40 - iter 1488/3720 - loss 0.03588809 - time (sec): 252.73 - samples/sec: 4973.71 - lr: 0.100000
2023-04-06 05:30:31,282 epoch 40 - iter 1860/3720 - loss 0.03564235 - time (sec): 315.93 - samples/sec: 4980.29 - lr: 0.100000
2023-04-06 05:31:33,392 epoch 40 - iter 2232/3720 - loss 0.03576474 - time (sec): 378.04 - samples/sec: 4997.67 - lr: 0.100000
2023-04-06 05:32:35,991 epoch 40 - iter 2604/3720 - loss 0.03580354 - time (sec): 440.64 - samples/sec: 5006.35 - lr: 0.100000
2023-04-06 05:33:37,763 epoch 40 - iter 2976/3720 - loss 0.03594209 - time (sec): 502.41 - samples/sec: 5014.35 - lr: 0.100000
2023-04-06 05:34:40,953 epoch 40 - iter 3348/3720 - loss 0.03600140 - time (sec): 565.60 - samples/sec: 5014.46 - lr: 0.100000
2023-04-06 05:35:45,443 epoch 40 - iter 3720/3720 - loss 0.03603039 - time (sec): 630.09 - samples/sec: 5002.00 - lr: 0.100000
2023-04-06 05:35:45,443 ----------------------------------------------------------------------------------------------------
2023-04-06 05:35:45,443 EPOCH 40 done: loss 0.0360 - lr 0.100000
2023-04-06 05:35:45,443 BAD EPOCHS (no improvement): 0
2023-04-06 05:35:45,450 ----------------------------------------------------------------------------------------------------
2023-04-06 05:36:47,700 epoch 41 - iter 372/3720 - loss 0.03518475 - time (sec): 62.25 - samples/sec: 5053.76 - lr: 0.100000
2023-04-06 05:37:51,556 epoch 41 - iter 744/3720 - loss 0.03588227 - time (sec): 126.11 - samples/sec: 4992.75 - lr: 0.100000
2023-04-06 05:38:55,132 epoch 41 - iter 1116/3720 - loss 0.03591225 - time (sec): 189.68 - samples/sec: 4978.53 - lr: 0.100000
2023-04-06 05:39:57,749 epoch 41 - iter 1488/3720 - loss 0.03580877 - time (sec): 252.30 - samples/sec: 4992.88 - lr: 0.100000
2023-04-06 05:41:01,212 epoch 41 - iter 1860/3720 - loss 0.03578007 - time (sec): 315.76 - samples/sec: 4998.42 - lr: 0.100000
2023-04-06 05:42:03,400 epoch 41 - iter 2232/3720 - loss 0.03604466 - time (sec): 377.95 - samples/sec: 5002.20 - lr: 0.100000
2023-04-06 05:43:07,825 epoch 41 - iter 2604/3720 - loss 0.03592712 - time (sec): 442.38 - samples/sec: 4987.97 - lr: 0.100000
2023-04-06 05:44:10,741 epoch 41 - iter 2976/3720 - loss 0.03602655 - time (sec): 505.29 - samples/sec: 4987.93 - lr: 0.100000
2023-04-06 05:45:14,930 epoch 41 - iter 3348/3720 - loss 0.03605697 - time (sec): 569.48 - samples/sec: 4982.72 - lr: 0.100000
2023-04-06 05:46:18,175 epoch 41 - iter 3720/3720 - loss 0.03612727 - time (sec): 632.73 - samples/sec: 4981.17 - lr: 0.100000
2023-04-06 05:46:18,175 ----------------------------------------------------------------------------------------------------
2023-04-06 05:46:18,176 EPOCH 41 done: loss 0.0361 - lr 0.100000
2023-04-06 05:46:18,176 BAD EPOCHS (no improvement): 1
2023-04-06 05:46:18,179 ----------------------------------------------------------------------------------------------------
2023-04-06 05:47:21,888 epoch 42 - iter 372/3720 - loss 0.03501414 - time (sec): 63.71 - samples/sec: 4942.95 - lr: 0.100000
2023-04-06 05:48:25,521 epoch 42 - iter 744/3720 - loss 0.03538114 - time (sec): 127.34 - samples/sec: 4951.75 - lr: 0.100000
2023-04-06 05:49:28,818 epoch 42 - iter 1116/3720 - loss 0.03564249 - time (sec): 190.64 - samples/sec: 4964.53 - lr: 0.100000
2023-04-06 05:50:31,722 epoch 42 - iter 1488/3720 - loss 0.03576905 - time (sec): 253.54 - samples/sec: 4973.25 - lr: 0.100000
2023-04-06 05:51:35,060 epoch 42 - iter 1860/3720 - loss 0.03575105 - time (sec): 316.88 - samples/sec: 4973.78 - lr: 0.100000
2023-04-06 05:52:37,857 epoch 42 - iter 2232/3720 - loss 0.03583953 - time (sec): 379.68 - samples/sec: 4976.50 - lr: 0.100000
2023-04-06 05:53:41,974 epoch 42 - iter 2604/3720 - loss 0.03599862 - time (sec): 443.80 - samples/sec: 4967.95 - lr: 0.100000
2023-04-06 05:54:44,560 epoch 42 - iter 2976/3720 - loss 0.03603327 - time (sec): 506.38 - samples/sec: 4976.54 - lr: 0.100000
2023-04-06 05:55:48,409 epoch 42 - iter 3348/3720 - loss 0.03596564 - time (sec): 570.23 - samples/sec: 4975.43 - lr: 0.100000
2023-04-06 05:56:52,312 epoch 42 - iter 3720/3720 - loss 0.03602839 - time (sec): 634.13 - samples/sec: 4970.11 - lr: 0.100000
2023-04-06 05:56:52,313 ----------------------------------------------------------------------------------------------------
2023-04-06 05:56:52,313 EPOCH 42 done: loss 0.0360 - lr 0.100000
2023-04-06 05:56:52,313 BAD EPOCHS (no improvement): 0
2023-04-06 05:56:52,316 ----------------------------------------------------------------------------------------------------
2023-04-06 05:57:55,724 epoch 43 - iter 372/3720 - loss 0.03555848 - time (sec): 63.41 - samples/sec: 5039.09 - lr: 0.100000
2023-04-06 05:58:58,889 epoch 43 - iter 744/3720 - loss 0.03560942 - time (sec): 126.57 - samples/sec: 5011.99 - lr: 0.100000
2023-04-06 06:00:02,000 epoch 43 - iter 1116/3720 - loss 0.03581274 - time (sec): 189.68 - samples/sec: 5002.83 - lr: 0.100000
2023-04-06 06:01:04,999 epoch 43 - iter 1488/3720 - loss 0.03581777 - time (sec): 252.68 - samples/sec: 4999.78 - lr: 0.100000
2023-04-06 06:02:09,423 epoch 43 - iter 1860/3720 - loss 0.03587032 - time (sec): 317.11 - samples/sec: 4983.13 - lr: 0.100000
2023-04-06 06:03:13,413 epoch 43 - iter 2232/3720 - loss 0.03602507 - time (sec): 381.10 - samples/sec: 4968.34 - lr: 0.100000
2023-04-06 06:04:16,662 epoch 43 - iter 2604/3720 - loss 0.03608633 - time (sec): 444.35 - samples/sec: 4969.57 - lr: 0.100000
2023-04-06 06:05:20,951 epoch 43 - iter 2976/3720 - loss 0.03607723 - time (sec): 508.63 - samples/sec: 4960.06 - lr: 0.100000
2023-04-06 06:06:24,950 epoch 43 - iter 3348/3720 - loss 0.03619460 - time (sec): 572.63 - samples/sec: 4954.04 - lr: 0.100000
2023-04-06 06:07:28,532 epoch 43 - iter 3720/3720 - loss 0.03611081 - time (sec): 636.22 - samples/sec: 4953.84 - lr: 0.100000
2023-04-06 06:07:28,533 ----------------------------------------------------------------------------------------------------
2023-04-06 06:07:28,533 EPOCH 43 done: loss 0.0361 - lr 0.100000
2023-04-06 06:07:28,533 BAD EPOCHS (no improvement): 1
2023-04-06 06:07:28,539 ----------------------------------------------------------------------------------------------------
2023-04-06 06:08:31,206 epoch 44 - iter 372/3720 - loss 0.03589108 - time (sec): 62.67 - samples/sec: 5020.94 - lr: 0.100000
2023-04-06 06:09:34,097 epoch 44 - iter 744/3720 - loss 0.03586768 - time (sec): 125.56 - samples/sec: 5028.57 - lr: 0.100000
2023-04-06 06:10:37,457 epoch 44 - iter 1116/3720 - loss 0.03612357 - time (sec): 188.92 - samples/sec: 5009.90 - lr: 0.100000
2023-04-06 06:11:40,707 epoch 44 - iter 1488/3720 - loss 0.03610499 - time (sec): 252.17 - samples/sec: 5001.98 - lr: 0.100000
2023-04-06 06:12:43,165 epoch 44 - iter 1860/3720 - loss 0.03594915 - time (sec): 314.63 - samples/sec: 5011.35 - lr: 0.100000
2023-04-06 06:13:45,140 epoch 44 - iter 2232/3720 - loss 0.03609889 - time (sec): 376.60 - samples/sec: 5020.84 - lr: 0.100000
2023-04-06 06:14:48,197 epoch 44 - iter 2604/3720 - loss 0.03615081 - time (sec): 439.66 - samples/sec: 5015.06 - lr: 0.100000
2023-04-06 06:15:51,641 epoch 44 - iter 2976/3720 - loss 0.03602430 - time (sec): 503.10 - samples/sec: 5007.21 - lr: 0.100000
2023-04-06 06:16:56,633 epoch 44 - iter 3348/3720 - loss 0.03602352 - time (sec): 568.09 - samples/sec: 4990.70 - lr: 0.100000
2023-04-06 06:18:01,163 epoch 44 - iter 3720/3720 - loss 0.03604926 - time (sec): 632.62 - samples/sec: 4981.97 - lr: 0.100000
2023-04-06 06:18:01,164 ----------------------------------------------------------------------------------------------------
2023-04-06 06:18:01,164 EPOCH 44 done: loss 0.0360 - lr 0.100000
2023-04-06 06:18:01,164 BAD EPOCHS (no improvement): 2
2023-04-06 06:18:01,170 ----------------------------------------------------------------------------------------------------
2023-04-06 06:19:05,090 epoch 45 - iter 372/3720 - loss 0.03615835 - time (sec): 63.92 - samples/sec: 4940.37 - lr: 0.100000
2023-04-06 06:20:08,176 epoch 45 - iter 744/3720 - loss 0.03552771 - time (sec): 127.01 - samples/sec: 4975.27 - lr: 0.100000
2023-04-06 06:21:12,225 epoch 45 - iter 1116/3720 - loss 0.03555194 - time (sec): 191.06 - samples/sec: 4949.78 - lr: 0.100000
2023-04-06 06:22:13,586 epoch 45 - iter 1488/3720 - loss 0.03556429 - time (sec): 252.42 - samples/sec: 4983.47 - lr: 0.100000
2023-04-06 06:23:16,041 epoch 45 - iter 1860/3720 - loss 0.03557876 - time (sec): 314.87 - samples/sec: 4997.62 - lr: 0.100000
2023-04-06 06:24:18,043 epoch 45 - iter 2232/3720 - loss 0.03578220 - time (sec): 376.87 - samples/sec: 5008.33 - lr: 0.100000
2023-04-06 06:25:21,526 epoch 45 - iter 2604/3720 - loss 0.03584423 - time (sec): 440.36 - samples/sec: 5003.44 - lr: 0.100000
2023-04-06 06:26:25,628 epoch 45 - iter 2976/3720 - loss 0.03575174 - time (sec): 504.46 - samples/sec: 4997.42 - lr: 0.100000
2023-04-06 06:27:29,722 epoch 45 - iter 3348/3720 - loss 0.03583934 - time (sec): 568.55 - samples/sec: 4990.60 - lr: 0.100000
2023-04-06 06:28:33,643 epoch 45 - iter 3720/3720 - loss 0.03588281 - time (sec): 632.47 - samples/sec: 4983.16 - lr: 0.100000
2023-04-06 06:28:33,643 ----------------------------------------------------------------------------------------------------
2023-04-06 06:28:33,643 EPOCH 45 done: loss 0.0359 - lr 0.100000
2023-04-06 06:28:33,643 BAD EPOCHS (no improvement): 0
2023-04-06 06:28:33,648 ----------------------------------------------------------------------------------------------------
2023-04-06 06:29:36,550 epoch 46 - iter 372/3720 - loss 0.03630327 - time (sec): 62.90 - samples/sec: 4974.87 - lr: 0.100000
2023-04-06 06:30:40,187 epoch 46 - iter 744/3720 - loss 0.03559881 - time (sec): 126.54 - samples/sec: 4984.12 - lr: 0.100000
2023-04-06 06:31:44,689 epoch 46 - iter 1116/3720 - loss 0.03595745 - time (sec): 191.04 - samples/sec: 4963.69 - lr: 0.100000
2023-04-06 06:32:48,098 epoch 46 - iter 1488/3720 - loss 0.03574153 - time (sec): 254.45 - samples/sec: 4961.50 - lr: 0.100000
2023-04-06 06:33:51,369 epoch 46 - iter 1860/3720 - loss 0.03583519 - time (sec): 317.72 - samples/sec: 4969.38 - lr: 0.100000
2023-04-06 06:34:53,314 epoch 46 - iter 2232/3720 - loss 0.03585869 - time (sec): 379.67 - samples/sec: 4982.85 - lr: 0.100000
2023-04-06 06:35:55,870 epoch 46 - iter 2604/3720 - loss 0.03613151 - time (sec): 442.22 - samples/sec: 4991.54 - lr: 0.100000
2023-04-06 06:36:57,688 epoch 46 - iter 2976/3720 - loss 0.03613827 - time (sec): 504.04 - samples/sec: 5000.94 - lr: 0.100000
2023-04-06 06:38:01,506 epoch 46 - iter 3348/3720 - loss 0.03615893 - time (sec): 567.86 - samples/sec: 4994.24 - lr: 0.100000
2023-04-06 06:39:06,195 epoch 46 - iter 3720/3720 - loss 0.03607414 - time (sec): 632.55 - samples/sec: 4982.58 - lr: 0.100000
2023-04-06 06:39:06,195 ----------------------------------------------------------------------------------------------------
2023-04-06 06:39:06,195 EPOCH 46 done: loss 0.0361 - lr 0.100000
2023-04-06 06:39:06,196 BAD EPOCHS (no improvement): 1
2023-04-06 06:39:06,199 ----------------------------------------------------------------------------------------------------
2023-04-06 06:40:10,485 epoch 47 - iter 372/3720 - loss 0.03617819 - time (sec): 64.29 - samples/sec: 4897.36 - lr: 0.100000
2023-04-06 06:41:12,875 epoch 47 - iter 744/3720 - loss 0.03616579 - time (sec): 126.68 - samples/sec: 4974.47 - lr: 0.100000
2023-04-06 06:42:15,327 epoch 47 - iter 1116/3720 - loss 0.03586013 - time (sec): 189.13 - samples/sec: 4991.27 - lr: 0.100000
2023-04-06 06:43:19,618 epoch 47 - iter 1488/3720 - loss 0.03567191 - time (sec): 253.42 - samples/sec: 4975.30 - lr: 0.100000
2023-04-06 06:44:22,746 epoch 47 - iter 1860/3720 - loss 0.03560712 - time (sec): 316.55 - samples/sec: 4979.93 - lr: 0.100000
2023-04-06 06:45:24,829 epoch 47 - iter 2232/3720 - loss 0.03572832 - time (sec): 378.63 - samples/sec: 4993.69 - lr: 0.100000
2023-04-06 06:46:28,144 epoch 47 - iter 2604/3720 - loss 0.03585095 - time (sec): 441.95 - samples/sec: 4988.66 - lr: 0.100000
2023-04-06 06:47:32,397 epoch 47 - iter 2976/3720 - loss 0.03587577 - time (sec): 506.20 - samples/sec: 4978.63 - lr: 0.100000
2023-04-06 06:48:37,163 epoch 47 - iter 3348/3720 - loss 0.03588128 - time (sec): 570.96 - samples/sec: 4968.15 - lr: 0.100000
2023-04-06 06:49:40,834 epoch 47 - iter 3720/3720 - loss 0.03594796 - time (sec): 634.64 - samples/sec: 4966.18 - lr: 0.100000
2023-04-06 06:49:40,834 ----------------------------------------------------------------------------------------------------
2023-04-06 06:49:40,834 EPOCH 47 done: loss 0.0359 - lr 0.100000
2023-04-06 06:49:40,834 BAD EPOCHS (no improvement): 2
2023-04-06 06:49:40,841 ----------------------------------------------------------------------------------------------------
2023-04-06 06:50:44,233 epoch 48 - iter 372/3720 - loss 0.03554580 - time (sec): 63.39 - samples/sec: 4933.11 - lr: 0.100000
2023-04-06 06:51:47,269 epoch 48 - iter 744/3720 - loss 0.03555960 - time (sec): 126.43 - samples/sec: 4996.62 - lr: 0.100000
2023-04-06 06:52:49,964 epoch 48 - iter 1116/3720 - loss 0.03584024 - time (sec): 189.12 - samples/sec: 4999.42 - lr: 0.100000
2023-04-06 06:53:53,479 epoch 48 - iter 1488/3720 - loss 0.03588851 - time (sec): 252.64 - samples/sec: 4988.01 - lr: 0.100000
2023-04-06 06:54:56,070 epoch 48 - iter 1860/3720 - loss 0.03575096 - time (sec): 315.23 - samples/sec: 4997.40 - lr: 0.100000
2023-04-06 06:55:58,504 epoch 48 - iter 2232/3720 - loss 0.03572593 - time (sec): 377.66 - samples/sec: 5006.22 - lr: 0.100000
2023-04-06 06:57:01,545 epoch 48 - iter 2604/3720 - loss 0.03571595 - time (sec): 440.70 - samples/sec: 5007.16 - lr: 0.100000
2023-04-06 06:58:05,462 epoch 48 - iter 2976/3720 - loss 0.03582259 - time (sec): 504.62 - samples/sec: 4997.59 - lr: 0.100000
2023-04-06 06:59:10,299 epoch 48 - iter 3348/3720 - loss 0.03577259 - time (sec): 569.46 - samples/sec: 4985.30 - lr: 0.100000
2023-04-06 07:00:13,789 epoch 48 - iter 3720/3720 - loss 0.03577572 - time (sec): 632.95 - samples/sec: 4979.42 - lr: 0.100000
2023-04-06 07:00:13,789 ----------------------------------------------------------------------------------------------------
2023-04-06 07:00:13,789 EPOCH 48 done: loss 0.0358 - lr 0.100000
2023-04-06 07:00:13,789 BAD EPOCHS (no improvement): 0
2023-04-06 07:00:13,796 ----------------------------------------------------------------------------------------------------
2023-04-06 07:01:17,203 epoch 49 - iter 372/3720 - loss 0.03554267 - time (sec): 63.41 - samples/sec: 4962.01 - lr: 0.100000
2023-04-06 07:02:21,293 epoch 49 - iter 744/3720 - loss 0.03562803 - time (sec): 127.50 - samples/sec: 4939.84 - lr: 0.100000
2023-04-06 07:03:25,257 epoch 49 - iter 1116/3720 - loss 0.03544781 - time (sec): 191.46 - samples/sec: 4933.92 - lr: 0.100000
2023-04-06 07:04:30,417 epoch 49 - iter 1488/3720 - loss 0.03547625 - time (sec): 256.62 - samples/sec: 4925.15 - lr: 0.100000
2023-04-06 07:05:33,971 epoch 49 - iter 1860/3720 - loss 0.03561882 - time (sec): 320.17 - samples/sec: 4931.75 - lr: 0.100000
2023-04-06 07:06:38,334 epoch 49 - iter 2232/3720 - loss 0.03562794 - time (sec): 384.54 - samples/sec: 4924.24 - lr: 0.100000
2023-04-06 07:07:41,637 epoch 49 - iter 2604/3720 - loss 0.03558202 - time (sec): 447.84 - samples/sec: 4925.04 - lr: 0.100000
2023-04-06 07:08:46,559 epoch 49 - iter 2976/3720 - loss 0.03573090 - time (sec): 512.76 - samples/sec: 4922.24 - lr: 0.100000
2023-04-06 07:09:49,950 epoch 49 - iter 3348/3720 - loss 0.03567211 - time (sec): 576.15 - samples/sec: 4926.28 - lr: 0.100000
2023-04-06 07:10:53,102 epoch 49 - iter 3720/3720 - loss 0.03593851 - time (sec): 639.31 - samples/sec: 4929.90 - lr: 0.100000
2023-04-06 07:10:53,102 ----------------------------------------------------------------------------------------------------
2023-04-06 07:10:53,103 EPOCH 49 done: loss 0.0359 - lr 0.100000
2023-04-06 07:10:53,103 BAD EPOCHS (no improvement): 1
2023-04-06 07:10:53,106 ----------------------------------------------------------------------------------------------------
2023-04-06 07:11:57,194 epoch 50 - iter 372/3720 - loss 0.03592104 - time (sec): 64.09 - samples/sec: 4918.69 - lr: 0.100000
2023-04-06 07:13:01,310 epoch 50 - iter 744/3720 - loss 0.03541568 - time (sec): 128.20 - samples/sec: 4905.25 - lr: 0.100000
2023-04-06 07:14:05,523 epoch 50 - iter 1116/3720 - loss 0.03570905 - time (sec): 192.42 - samples/sec: 4907.23 - lr: 0.100000
2023-04-06 07:15:09,667 epoch 50 - iter 1488/3720 - loss 0.03571792 - time (sec): 256.56 - samples/sec: 4905.33 - lr: 0.100000
2023-04-06 07:16:13,919 epoch 50 - iter 1860/3720 - loss 0.03584356 - time (sec): 320.81 - samples/sec: 4905.96 - lr: 0.100000
2023-04-06 07:17:17,370 epoch 50 - iter 2232/3720 - loss 0.03600621 - time (sec): 384.26 - samples/sec: 4919.86 - lr: 0.100000
2023-04-06 07:18:21,980 epoch 50 - iter 2604/3720 - loss 0.03591886 - time (sec): 448.87 - samples/sec: 4915.19 - lr: 0.100000
2023-04-06 07:19:24,867 epoch 50 - iter 2976/3720 - loss 0.03588801 - time (sec): 511.76 - samples/sec: 4924.26 - lr: 0.100000
2023-04-06 07:20:27,929 epoch 50 - iter 3348/3720 - loss 0.03591384 - time (sec): 574.82 - samples/sec: 4931.97 - lr: 0.100000
2023-04-06 07:21:31,603 epoch 50 - iter 3720/3720 - loss 0.03593144 - time (sec): 638.50 - samples/sec: 4936.14 - lr: 0.100000
2023-04-06 07:21:31,604 ----------------------------------------------------------------------------------------------------
2023-04-06 07:21:31,604 EPOCH 50 done: loss 0.0359 - lr 0.100000
2023-04-06 07:21:31,604 BAD EPOCHS (no improvement): 2
2023-04-06 07:21:31,607 ----------------------------------------------------------------------------------------------------
2023-04-06 07:22:35,309 epoch 51 - iter 372/3720 - loss 0.03611297 - time (sec): 63.70 - samples/sec: 4940.79 - lr: 0.100000
2023-04-06 07:23:39,113 epoch 51 - iter 744/3720 - loss 0.03558047 - time (sec): 127.51 - samples/sec: 4945.55 - lr: 0.100000
2023-04-06 07:24:42,508 epoch 51 - iter 1116/3720 - loss 0.03554481 - time (sec): 190.90 - samples/sec: 4959.02 - lr: 0.100000
2023-04-06 07:25:45,777 epoch 51 - iter 1488/3720 - loss 0.03589186 - time (sec): 254.17 - samples/sec: 4954.67 - lr: 0.100000
2023-04-06 07:26:48,204 epoch 51 - iter 1860/3720 - loss 0.03598627 - time (sec): 316.60 - samples/sec: 4969.12 - lr: 0.100000
2023-04-06 07:27:52,011 epoch 51 - iter 2232/3720 - loss 0.03592064 - time (sec): 380.40 - samples/sec: 4964.39 - lr: 0.100000
2023-04-06 07:28:56,520 epoch 51 - iter 2604/3720 - loss 0.03586564 - time (sec): 444.91 - samples/sec: 4955.78 - lr: 0.100000
2023-04-06 07:30:01,001 epoch 51 - iter 2976/3720 - loss 0.03586686 - time (sec): 509.39 - samples/sec: 4945.34 - lr: 0.100000
2023-04-06 07:31:05,031 epoch 51 - iter 3348/3720 - loss 0.03590195 - time (sec): 573.42 - samples/sec: 4948.82 - lr: 0.100000
2023-04-06 07:32:08,022 epoch 51 - iter 3720/3720 - loss 0.03604278 - time (sec): 636.42 - samples/sec: 4952.29 - lr: 0.100000
2023-04-06 07:32:08,023 ----------------------------------------------------------------------------------------------------
2023-04-06 07:32:08,023 EPOCH 51 done: loss 0.0360 - lr 0.100000
2023-04-06 07:32:08,023 BAD EPOCHS (no improvement): 3
2023-04-06 07:32:08,026 ----------------------------------------------------------------------------------------------------
2023-04-06 07:33:12,509 epoch 52 - iter 372/3720 - loss 0.03469880 - time (sec): 64.48 - samples/sec: 4932.91 - lr: 0.100000
2023-04-06 07:34:16,091 epoch 52 - iter 744/3720 - loss 0.03495443 - time (sec): 128.06 - samples/sec: 4943.58 - lr: 0.100000
2023-04-06 07:35:19,627 epoch 52 - iter 1116/3720 - loss 0.03503903 - time (sec): 191.60 - samples/sec: 4943.40 - lr: 0.100000
2023-04-06 07:36:22,813 epoch 52 - iter 1488/3720 - loss 0.03533135 - time (sec): 254.79 - samples/sec: 4952.14 - lr: 0.100000
2023-04-06 07:37:26,237 epoch 52 - iter 1860/3720 - loss 0.03548857 - time (sec): 318.21 - samples/sec: 4955.48 - lr: 0.100000
2023-04-06 07:38:29,896 epoch 52 - iter 2232/3720 - loss 0.03561160 - time (sec): 381.87 - samples/sec: 4953.14 - lr: 0.100000
2023-04-06 07:39:34,068 epoch 52 - iter 2604/3720 - loss 0.03563686 - time (sec): 446.04 - samples/sec: 4945.79 - lr: 0.100000
2023-04-06 07:40:36,586 epoch 52 - iter 2976/3720 - loss 0.03573569 - time (sec): 508.56 - samples/sec: 4960.11 - lr: 0.100000
2023-04-06 07:41:39,467 epoch 52 - iter 3348/3720 - loss 0.03569256 - time (sec): 571.44 - samples/sec: 4966.97 - lr: 0.100000
2023-04-06 07:42:41,221 epoch 52 - iter 3720/3720 - loss 0.03576243 - time (sec): 633.20 - samples/sec: 4977.47 - lr: 0.100000
2023-04-06 07:42:41,222 ----------------------------------------------------------------------------------------------------
2023-04-06 07:42:41,222 EPOCH 52 done: loss 0.0358 - lr 0.100000
2023-04-06 07:42:41,222 BAD EPOCHS (no improvement): 0
2023-04-06 07:42:41,225 ----------------------------------------------------------------------------------------------------
2023-04-06 07:43:44,567 epoch 53 - iter 372/3720 - loss 0.03521698 - time (sec): 63.34 - samples/sec: 5001.75 - lr: 0.100000
2023-04-06 07:44:47,871 epoch 53 - iter 744/3720 - loss 0.03535323 - time (sec): 126.65 - samples/sec: 4997.61 - lr: 0.100000
2023-04-06 07:45:50,809 epoch 53 - iter 1116/3720 - loss 0.03581383 - time (sec): 189.58 - samples/sec: 4994.56 - lr: 0.100000
2023-04-06 07:46:54,008 epoch 53 - iter 1488/3720 - loss 0.03585699 - time (sec): 252.78 - samples/sec: 4993.35 - lr: 0.100000
2023-04-06 07:47:57,315 epoch 53 - iter 1860/3720 - loss 0.03546709 - time (sec): 316.09 - samples/sec: 4991.47 - lr: 0.100000
2023-04-06 07:48:59,339 epoch 53 - iter 2232/3720 - loss 0.03559660 - time (sec): 378.11 - samples/sec: 5004.44 - lr: 0.100000
2023-04-06 07:50:03,686 epoch 53 - iter 2604/3720 - loss 0.03560430 - time (sec): 442.46 - samples/sec: 4989.10 - lr: 0.100000
2023-04-06 07:51:06,555 epoch 53 - iter 2976/3720 - loss 0.03555950 - time (sec): 505.33 - samples/sec: 4991.29 - lr: 0.100000
2023-04-06 07:52:09,988 epoch 53 - iter 3348/3720 - loss 0.03568930 - time (sec): 568.76 - samples/sec: 4988.07 - lr: 0.100000
2023-04-06 07:53:13,140 epoch 53 - iter 3720/3720 - loss 0.03572553 - time (sec): 631.92 - samples/sec: 4987.55 - lr: 0.100000
2023-04-06 07:53:13,140 ----------------------------------------------------------------------------------------------------
2023-04-06 07:53:13,140 EPOCH 53 done: loss 0.0357 - lr 0.100000
2023-04-06 07:53:13,140 BAD EPOCHS (no improvement): 0
2023-04-06 07:53:13,143 ----------------------------------------------------------------------------------------------------
2023-04-06 07:54:17,019 epoch 54 - iter 372/3720 - loss 0.03507077 - time (sec): 63.88 - samples/sec: 4987.30 - lr: 0.100000
2023-04-06 07:55:20,628 epoch 54 - iter 744/3720 - loss 0.03503947 - time (sec): 127.48 - samples/sec: 4965.15 - lr: 0.100000
2023-04-06 07:56:23,731 epoch 54 - iter 1116/3720 - loss 0.03511696 - time (sec): 190.59 - samples/sec: 4967.04 - lr: 0.100000
2023-04-06 07:57:27,250 epoch 54 - iter 1488/3720 - loss 0.03537984 - time (sec): 254.11 - samples/sec: 4964.39 - lr: 0.100000
2023-04-06 07:58:30,162 epoch 54 - iter 1860/3720 - loss 0.03556180 - time (sec): 317.02 - samples/sec: 4972.03 - lr: 0.100000
2023-04-06 07:59:33,372 epoch 54 - iter 2232/3720 - loss 0.03557394 - time (sec): 380.23 - samples/sec: 4973.89 - lr: 0.100000
2023-04-06 08:00:36,612 epoch 54 - iter 2604/3720 - loss 0.03562157 - time (sec): 443.47 - samples/sec: 4973.17 - lr: 0.100000
2023-04-06 08:01:39,834 epoch 54 - iter 2976/3720 - loss 0.03558331 - time (sec): 506.69 - samples/sec: 4975.68 - lr: 0.100000
2023-04-06 08:02:43,251 epoch 54 - iter 3348/3720 - loss 0.03570854 - time (sec): 570.11 - samples/sec: 4975.42 - lr: 0.100000
2023-04-06 08:03:47,595 epoch 54 - iter 3720/3720 - loss 0.03576077 - time (sec): 634.45 - samples/sec: 4967.62 - lr: 0.100000
2023-04-06 08:03:47,595 ----------------------------------------------------------------------------------------------------
2023-04-06 08:03:47,595 EPOCH 54 done: loss 0.0358 - lr 0.100000
2023-04-06 08:03:47,595 BAD EPOCHS (no improvement): 1
2023-04-06 08:03:47,599 ----------------------------------------------------------------------------------------------------
2023-04-06 08:04:50,499 epoch 55 - iter 372/3720 - loss 0.03556611 - time (sec): 62.90 - samples/sec: 4994.47 - lr: 0.100000
2023-04-06 08:05:55,305 epoch 55 - iter 744/3720 - loss 0.03582456 - time (sec): 127.71 - samples/sec: 4965.91 - lr: 0.100000
2023-04-06 08:06:59,007 epoch 55 - iter 1116/3720 - loss 0.03548060 - time (sec): 191.41 - samples/sec: 4957.25 - lr: 0.100000
2023-04-06 08:08:01,743 epoch 55 - iter 1488/3720 - loss 0.03558508 - time (sec): 254.14 - samples/sec: 4963.79 - lr: 0.100000
2023-04-06 08:09:05,576 epoch 55 - iter 1860/3720 - loss 0.03554312 - time (sec): 317.98 - samples/sec: 4962.03 - lr: 0.100000
2023-04-06 08:10:08,908 epoch 55 - iter 2232/3720 - loss 0.03566213 - time (sec): 381.31 - samples/sec: 4960.97 - lr: 0.100000
2023-04-06 08:11:12,955 epoch 55 - iter 2604/3720 - loss 0.03578027 - time (sec): 445.36 - samples/sec: 4957.57 - lr: 0.100000
2023-04-06 08:12:16,798 epoch 55 - iter 2976/3720 - loss 0.03576108 - time (sec): 509.20 - samples/sec: 4956.91 - lr: 0.100000
2023-04-06 08:13:19,582 epoch 55 - iter 3348/3720 - loss 0.03580154 - time (sec): 571.98 - samples/sec: 4963.63 - lr: 0.100000
2023-04-06 08:14:22,205 epoch 55 - iter 3720/3720 - loss 0.03582617 - time (sec): 634.61 - samples/sec: 4966.41 - lr: 0.100000
2023-04-06 08:14:22,205 ----------------------------------------------------------------------------------------------------
2023-04-06 08:14:22,205 EPOCH 55 done: loss 0.0358 - lr 0.100000
2023-04-06 08:14:22,205 BAD EPOCHS (no improvement): 2
2023-04-06 08:14:22,208 ----------------------------------------------------------------------------------------------------
2023-04-06 08:15:24,411 epoch 56 - iter 372/3720 - loss 0.03443706 - time (sec): 62.20 - samples/sec: 5036.04 - lr: 0.100000
2023-04-06 08:16:26,636 epoch 56 - iter 744/3720 - loss 0.03516020 - time (sec): 124.43 - samples/sec: 5058.82 - lr: 0.100000
2023-04-06 08:17:30,105 epoch 56 - iter 1116/3720 - loss 0.03535747 - time (sec): 187.90 - samples/sec: 5032.65 - lr: 0.100000
2023-04-06 08:18:34,643 epoch 56 - iter 1488/3720 - loss 0.03554717 - time (sec): 252.43 - samples/sec: 4997.00 - lr: 0.100000
2023-04-06 08:19:38,306 epoch 56 - iter 1860/3720 - loss 0.03543249 - time (sec): 316.10 - samples/sec: 4988.13 - lr: 0.100000
2023-04-06 08:20:41,104 epoch 56 - iter 2232/3720 - loss 0.03532255 - time (sec): 378.90 - samples/sec: 4989.87 - lr: 0.100000
2023-04-06 08:21:44,361 epoch 56 - iter 2604/3720 - loss 0.03531136 - time (sec): 442.15 - samples/sec: 4991.51 - lr: 0.100000
2023-04-06 08:22:47,628 epoch 56 - iter 2976/3720 - loss 0.03552810 - time (sec): 505.42 - samples/sec: 4987.43 - lr: 0.100000
2023-04-06 08:23:50,303 epoch 56 - iter 3348/3720 - loss 0.03562046 - time (sec): 568.09 - samples/sec: 4993.31 - lr: 0.100000
2023-04-06 08:24:54,359 epoch 56 - iter 3720/3720 - loss 0.03579107 - time (sec): 632.15 - samples/sec: 4985.70 - lr: 0.100000
2023-04-06 08:24:54,359 ----------------------------------------------------------------------------------------------------
2023-04-06 08:24:54,359 EPOCH 56 done: loss 0.0358 - lr 0.100000
2023-04-06 08:24:54,360 BAD EPOCHS (no improvement): 3
2023-04-06 08:24:54,365 ----------------------------------------------------------------------------------------------------
2023-04-06 08:25:58,233 epoch 57 - iter 372/3720 - loss 0.03522513 - time (sec): 63.87 - samples/sec: 4903.71 - lr: 0.100000
2023-04-06 08:27:02,754 epoch 57 - iter 744/3720 - loss 0.03543756 - time (sec): 128.39 - samples/sec: 4900.92 - lr: 0.100000
2023-04-06 08:28:06,253 epoch 57 - iter 1116/3720 - loss 0.03571462 - time (sec): 191.89 - samples/sec: 4914.12 - lr: 0.100000
2023-04-06 08:29:10,700 epoch 57 - iter 1488/3720 - loss 0.03566904 - time (sec): 256.33 - samples/sec: 4913.95 - lr: 0.100000
2023-04-06 08:30:14,899 epoch 57 - iter 1860/3720 - loss 0.03573546 - time (sec): 320.53 - samples/sec: 4908.38 - lr: 0.100000
2023-04-06 08:31:19,151 epoch 57 - iter 2232/3720 - loss 0.03548699 - time (sec): 384.79 - samples/sec: 4914.70 - lr: 0.100000
2023-04-06 08:32:23,916 epoch 57 - iter 2604/3720 - loss 0.03546777 - time (sec): 449.55 - samples/sec: 4909.01 - lr: 0.100000
2023-04-06 08:33:27,136 epoch 57 - iter 2976/3720 - loss 0.03550752 - time (sec): 512.77 - samples/sec: 4915.96 - lr: 0.100000
2023-04-06 08:34:29,964 epoch 57 - iter 3348/3720 - loss 0.03561924 - time (sec): 575.60 - samples/sec: 4927.39 - lr: 0.100000
2023-04-06 08:35:32,600 epoch 57 - iter 3720/3720 - loss 0.03564062 - time (sec): 638.23 - samples/sec: 4938.17 - lr: 0.100000
2023-04-06 08:35:32,600 ----------------------------------------------------------------------------------------------------
2023-04-06 08:35:32,600 EPOCH 57 done: loss 0.0356 - lr 0.100000
2023-04-06 08:35:32,600 BAD EPOCHS (no improvement): 0
2023-04-06 08:35:32,603 ----------------------------------------------------------------------------------------------------
2023-04-06 08:36:36,646 epoch 58 - iter 372/3720 - loss 0.03509077 - time (sec): 64.04 - samples/sec: 4929.77 - lr: 0.100000
2023-04-06 08:37:39,835 epoch 58 - iter 744/3720 - loss 0.03566188 - time (sec): 127.23 - samples/sec: 4947.50 - lr: 0.100000
2023-04-06 08:38:43,340 epoch 58 - iter 1116/3720 - loss 0.03548289 - time (sec): 190.74 - samples/sec: 4958.98 - lr: 0.100000
2023-04-06 08:39:47,722 epoch 58 - iter 1488/3720 - loss 0.03546883 - time (sec): 255.12 - samples/sec: 4945.43 - lr: 0.100000
2023-04-06 08:40:51,456 epoch 58 - iter 1860/3720 - loss 0.03546374 - time (sec): 318.85 - samples/sec: 4948.67 - lr: 0.100000
2023-04-06 08:41:55,028 epoch 58 - iter 2232/3720 - loss 0.03558942 - time (sec): 382.42 - samples/sec: 4948.86 - lr: 0.100000
2023-04-06 08:42:59,081 epoch 58 - iter 2604/3720 - loss 0.03561752 - time (sec): 446.48 - samples/sec: 4949.39 - lr: 0.100000
2023-04-06 08:44:01,965 epoch 58 - iter 2976/3720 - loss 0.03565205 - time (sec): 509.36 - samples/sec: 4956.64 - lr: 0.100000
2023-04-06 08:45:04,932 epoch 58 - iter 3348/3720 - loss 0.03555554 - time (sec): 572.33 - samples/sec: 4955.43 - lr: 0.100000
2023-04-06 08:46:08,504 epoch 58 - iter 3720/3720 - loss 0.03565709 - time (sec): 635.90 - samples/sec: 4956.29 - lr: 0.100000
2023-04-06 08:46:08,505 ----------------------------------------------------------------------------------------------------
2023-04-06 08:46:08,505 EPOCH 58 done: loss 0.0357 - lr 0.100000
2023-04-06 08:46:08,505 BAD EPOCHS (no improvement): 1
2023-04-06 08:46:08,507 ----------------------------------------------------------------------------------------------------
2023-04-06 08:47:10,580 epoch 59 - iter 372/3720 - loss 0.03472784 - time (sec): 62.07 - samples/sec: 5052.42 - lr: 0.100000
2023-04-06 08:48:14,755 epoch 59 - iter 744/3720 - loss 0.03524033 - time (sec): 126.25 - samples/sec: 4975.16 - lr: 0.100000
2023-04-06 08:49:18,120 epoch 59 - iter 1116/3720 - loss 0.03561557 - time (sec): 189.61 - samples/sec: 4966.12 - lr: 0.100000
2023-04-06 08:50:20,947 epoch 59 - iter 1488/3720 - loss 0.03555656 - time (sec): 252.44 - samples/sec: 4975.37 - lr: 0.100000
2023-04-06 08:51:25,738 epoch 59 - iter 1860/3720 - loss 0.03548465 - time (sec): 317.23 - samples/sec: 4959.79 - lr: 0.100000
2023-04-06 08:52:30,049 epoch 59 - iter 2232/3720 - loss 0.03548318 - time (sec): 381.54 - samples/sec: 4950.57 - lr: 0.100000
2023-04-06 08:53:34,190 epoch 59 - iter 2604/3720 - loss 0.03569103 - time (sec): 445.68 - samples/sec: 4948.31 - lr: 0.100000
2023-04-06 08:54:37,949 epoch 59 - iter 2976/3720 - loss 0.03575223 - time (sec): 509.44 - samples/sec: 4951.91 - lr: 0.100000
2023-04-06 08:55:41,386 epoch 59 - iter 3348/3720 - loss 0.03581366 - time (sec): 572.88 - samples/sec: 4952.70 - lr: 0.100000
2023-04-06 08:56:45,986 epoch 59 - iter 3720/3720 - loss 0.03587479 - time (sec): 637.48 - samples/sec: 4944.03 - lr: 0.100000
2023-04-06 08:56:45,987 ----------------------------------------------------------------------------------------------------
2023-04-06 08:56:45,987 EPOCH 59 done: loss 0.0359 - lr 0.100000
2023-04-06 08:56:45,987 BAD EPOCHS (no improvement): 2
2023-04-06 08:56:45,989 ----------------------------------------------------------------------------------------------------
2023-04-06 08:57:48,730 epoch 60 - iter 372/3720 - loss 0.03499679 - time (sec): 62.74 - samples/sec: 5000.59 - lr: 0.100000
2023-04-06 08:58:52,636 epoch 60 - iter 744/3720 - loss 0.03512788 - time (sec): 126.65 - samples/sec: 4981.76 - lr: 0.100000
2023-04-06 08:59:56,347 epoch 60 - iter 1116/3720 - loss 0.03529976 - time (sec): 190.36 - samples/sec: 4968.31 - lr: 0.100000
2023-04-06 09:00:59,103 epoch 60 - iter 1488/3720 - loss 0.03560345 - time (sec): 253.11 - samples/sec: 4971.13 - lr: 0.100000
2023-04-06 09:02:01,837 epoch 60 - iter 1860/3720 - loss 0.03565454 - time (sec): 315.85 - samples/sec: 4981.76 - lr: 0.100000
2023-04-06 09:03:04,547 epoch 60 - iter 2232/3720 - loss 0.03556429 - time (sec): 378.56 - samples/sec: 4994.56 - lr: 0.100000
2023-04-06 09:04:07,043 epoch 60 - iter 2604/3720 - loss 0.03559014 - time (sec): 441.05 - samples/sec: 5001.07 - lr: 0.100000
2023-04-06 09:05:09,498 epoch 60 - iter 2976/3720 - loss 0.03561401 - time (sec): 503.51 - samples/sec: 5006.08 - lr: 0.100000
2023-04-06 09:06:12,947 epoch 60 - iter 3348/3720 - loss 0.03585984 - time (sec): 566.96 - samples/sec: 5004.68 - lr: 0.100000
2023-04-06 09:07:15,771 epoch 60 - iter 3720/3720 - loss 0.03574776 - time (sec): 629.78 - samples/sec: 5004.45 - lr: 0.100000
2023-04-06 09:07:15,771 ----------------------------------------------------------------------------------------------------
2023-04-06 09:07:15,771 EPOCH 60 done: loss 0.0357 - lr 0.100000
2023-04-06 09:07:15,771 BAD EPOCHS (no improvement): 3
2023-04-06 09:07:15,774 ----------------------------------------------------------------------------------------------------
2023-04-06 09:08:19,663 epoch 61 - iter 372/3720 - loss 0.03401490 - time (sec): 63.89 - samples/sec: 4961.70 - lr: 0.100000
2023-04-06 09:09:23,952 epoch 61 - iter 744/3720 - loss 0.03514973 - time (sec): 128.18 - samples/sec: 4943.16 - lr: 0.100000
2023-04-06 09:10:28,106 epoch 61 - iter 1116/3720 - loss 0.03528165 - time (sec): 192.33 - samples/sec: 4931.53 - lr: 0.100000
2023-04-06 09:11:31,644 epoch 61 - iter 1488/3720 - loss 0.03550286 - time (sec): 255.87 - samples/sec: 4940.67 - lr: 0.100000
2023-04-06 09:12:36,190 epoch 61 - iter 1860/3720 - loss 0.03553016 - time (sec): 320.42 - samples/sec: 4925.97 - lr: 0.100000
2023-04-06 09:13:40,262 epoch 61 - iter 2232/3720 - loss 0.03560856 - time (sec): 384.49 - samples/sec: 4923.30 - lr: 0.100000
2023-04-06 09:14:41,795 epoch 61 - iter 2604/3720 - loss 0.03554887 - time (sec): 446.02 - samples/sec: 4945.73 - lr: 0.100000
2023-04-06 09:15:45,261 epoch 61 - iter 2976/3720 - loss 0.03569112 - time (sec): 509.49 - samples/sec: 4945.71 - lr: 0.100000
2023-04-06 09:16:49,312 epoch 61 - iter 3348/3720 - loss 0.03566798 - time (sec): 573.54 - samples/sec: 4945.05 - lr: 0.100000
2023-04-06 09:17:53,366 epoch 61 - iter 3720/3720 - loss 0.03574582 - time (sec): 637.59 - samples/sec: 4943.15 - lr: 0.100000
2023-04-06 09:17:53,366 ----------------------------------------------------------------------------------------------------
2023-04-06 09:17:53,367 EPOCH 61 done: loss 0.0357 - lr 0.100000
2023-04-06 09:17:53,367 Epoch 61: reducing learning rate of group 0 to 5.0000e-02.
2023-04-06 09:17:53,367 BAD EPOCHS (no improvement): 4
2023-04-06 09:17:53,370 ----------------------------------------------------------------------------------------------------
2023-04-06 09:18:57,578 epoch 62 - iter 372/3720 - loss 0.03497989 - time (sec): 64.21 - samples/sec: 4906.84 - lr: 0.050000
2023-04-06 09:20:00,256 epoch 62 - iter 744/3720 - loss 0.03456611 - time (sec): 126.89 - samples/sec: 4962.01 - lr: 0.050000
2023-04-06 09:21:04,208 epoch 62 - iter 1116/3720 - loss 0.03419942 - time (sec): 190.84 - samples/sec: 4956.90 - lr: 0.050000
2023-04-06 09:22:08,550 epoch 62 - iter 1488/3720 - loss 0.03429432 - time (sec): 255.18 - samples/sec: 4947.79 - lr: 0.050000
2023-04-06 09:23:12,615 epoch 62 - iter 1860/3720 - loss 0.03407697 - time (sec): 319.24 - samples/sec: 4942.36 - lr: 0.050000
2023-04-06 09:24:16,266 epoch 62 - iter 2232/3720 - loss 0.03392393 - time (sec): 382.90 - samples/sec: 4943.70 - lr: 0.050000
2023-04-06 09:25:19,468 epoch 62 - iter 2604/3720 - loss 0.03382584 - time (sec): 446.10 - samples/sec: 4945.27 - lr: 0.050000
2023-04-06 09:26:23,186 epoch 62 - iter 2976/3720 - loss 0.03374750 - time (sec): 509.82 - samples/sec: 4948.55 - lr: 0.050000
2023-04-06 09:27:26,185 epoch 62 - iter 3348/3720 - loss 0.03366302 - time (sec): 572.82 - samples/sec: 4952.68 - lr: 0.050000
2023-04-06 09:28:30,000 epoch 62 - iter 3720/3720 - loss 0.03361755 - time (sec): 636.63 - samples/sec: 4950.62 - lr: 0.050000
2023-04-06 09:28:30,000 ----------------------------------------------------------------------------------------------------
2023-04-06 09:28:30,000 EPOCH 62 done: loss 0.0336 - lr 0.050000
2023-04-06 09:28:30,000 BAD EPOCHS (no improvement): 0
2023-04-06 09:28:30,003 ----------------------------------------------------------------------------------------------------
2023-04-06 09:29:33,070 epoch 63 - iter 372/3720 - loss 0.03225239 - time (sec): 63.07 - samples/sec: 4993.12 - lr: 0.050000
2023-04-06 09:30:36,787 epoch 63 - iter 744/3720 - loss 0.03250590 - time (sec): 126.78 - samples/sec: 4976.52 - lr: 0.050000
2023-04-06 09:31:39,314 epoch 63 - iter 1116/3720 - loss 0.03295037 - time (sec): 189.31 - samples/sec: 4989.73 - lr: 0.050000
2023-04-06 09:32:43,392 epoch 63 - iter 1488/3720 - loss 0.03312255 - time (sec): 253.39 - samples/sec: 4966.39 - lr: 0.050000
2023-04-06 09:33:45,454 epoch 63 - iter 1860/3720 - loss 0.03310235 - time (sec): 315.45 - samples/sec: 4983.17 - lr: 0.050000
2023-04-06 09:34:49,567 epoch 63 - iter 2232/3720 - loss 0.03292022 - time (sec): 379.56 - samples/sec: 4972.51 - lr: 0.050000
2023-04-06 09:35:51,932 epoch 63 - iter 2604/3720 - loss 0.03302965 - time (sec): 441.93 - samples/sec: 4979.96 - lr: 0.050000
2023-04-06 09:36:56,519 epoch 63 - iter 2976/3720 - loss 0.03279925 - time (sec): 506.52 - samples/sec: 4971.89 - lr: 0.050000
2023-04-06 09:38:00,666 epoch 63 - iter 3348/3720 - loss 0.03265644 - time (sec): 570.66 - samples/sec: 4967.93 - lr: 0.050000
2023-04-06 09:39:04,150 epoch 63 - iter 3720/3720 - loss 0.03264588 - time (sec): 634.15 - samples/sec: 4970.00 - lr: 0.050000
2023-04-06 09:39:04,150 ----------------------------------------------------------------------------------------------------
2023-04-06 09:39:04,150 EPOCH 63 done: loss 0.0326 - lr 0.050000
2023-04-06 09:39:04,150 BAD EPOCHS (no improvement): 0
2023-04-06 09:39:04,154 ----------------------------------------------------------------------------------------------------
2023-04-06 09:40:07,911 epoch 64 - iter 372/3720 - loss 0.03273274 - time (sec): 63.76 - samples/sec: 4956.71 - lr: 0.050000
2023-04-06 09:41:10,461 epoch 64 - iter 744/3720 - loss 0.03207882 - time (sec): 126.31 - samples/sec: 4991.07 - lr: 0.050000
2023-04-06 09:42:14,418 epoch 64 - iter 1116/3720 - loss 0.03240069 - time (sec): 190.26 - samples/sec: 4966.45 - lr: 0.050000
2023-04-06 09:43:17,666 epoch 64 - iter 1488/3720 - loss 0.03244033 - time (sec): 253.51 - samples/sec: 4966.85 - lr: 0.050000
2023-04-06 09:44:20,296 epoch 64 - iter 1860/3720 - loss 0.03236251 - time (sec): 316.14 - samples/sec: 4974.35 - lr: 0.050000
2023-04-06 09:45:25,031 epoch 64 - iter 2232/3720 - loss 0.03243121 - time (sec): 380.88 - samples/sec: 4963.60 - lr: 0.050000
2023-04-06 09:46:28,885 epoch 64 - iter 2604/3720 - loss 0.03231516 - time (sec): 444.73 - samples/sec: 4960.12 - lr: 0.050000
2023-04-06 09:47:32,235 epoch 64 - iter 2976/3720 - loss 0.03241879 - time (sec): 508.08 - samples/sec: 4964.27 - lr: 0.050000
2023-04-06 09:48:36,050 epoch 64 - iter 3348/3720 - loss 0.03239816 - time (sec): 571.90 - samples/sec: 4961.55 - lr: 0.050000
2023-04-06 09:49:39,630 epoch 64 - iter 3720/3720 - loss 0.03239631 - time (sec): 635.48 - samples/sec: 4959.61 - lr: 0.050000
2023-04-06 09:49:39,630 ----------------------------------------------------------------------------------------------------
2023-04-06 09:49:39,630 EPOCH 64 done: loss 0.0324 - lr 0.050000
2023-04-06 09:49:39,630 BAD EPOCHS (no improvement): 0
2023-04-06 09:49:39,633 ----------------------------------------------------------------------------------------------------
2023-04-06 09:50:43,116 epoch 65 - iter 372/3720 - loss 0.03289786 - time (sec): 63.48 - samples/sec: 4957.24 - lr: 0.050000
2023-04-06 09:51:46,500 epoch 65 - iter 744/3720 - loss 0.03221444 - time (sec): 126.87 - samples/sec: 4953.63 - lr: 0.050000
2023-04-06 09:52:50,737 epoch 65 - iter 1116/3720 - loss 0.03234440 - time (sec): 191.10 - samples/sec: 4937.75 - lr: 0.050000
2023-04-06 09:53:53,592 epoch 65 - iter 1488/3720 - loss 0.03225068 - time (sec): 253.96 - samples/sec: 4947.51 - lr: 0.050000
2023-04-06 09:54:58,174 epoch 65 - iter 1860/3720 - loss 0.03220238 - time (sec): 318.54 - samples/sec: 4937.14 - lr: 0.050000
2023-04-06 09:56:02,536 epoch 65 - iter 2232/3720 - loss 0.03211585 - time (sec): 382.90 - samples/sec: 4930.14 - lr: 0.050000
2023-04-06 09:57:06,298 epoch 65 - iter 2604/3720 - loss 0.03212059 - time (sec): 446.67 - samples/sec: 4929.46 - lr: 0.050000
2023-04-06 09:58:10,404 epoch 65 - iter 2976/3720 - loss 0.03209122 - time (sec): 510.77 - samples/sec: 4930.02 - lr: 0.050000
2023-04-06 09:59:13,868 epoch 65 - iter 3348/3720 - loss 0.03201975 - time (sec): 574.24 - samples/sec: 4933.73 - lr: 0.050000
2023-04-06 10:00:17,763 epoch 65 - iter 3720/3720 - loss 0.03191218 - time (sec): 638.13 - samples/sec: 4938.98 - lr: 0.050000
2023-04-06 10:00:17,763 ----------------------------------------------------------------------------------------------------
2023-04-06 10:00:17,763 EPOCH 65 done: loss 0.0319 - lr 0.050000
2023-04-06 10:00:17,763 BAD EPOCHS (no improvement): 0
2023-04-06 10:00:17,766 ----------------------------------------------------------------------------------------------------
2023-04-06 10:01:20,365 epoch 66 - iter 372/3720 - loss 0.03180213 - time (sec): 62.60 - samples/sec: 4986.99 - lr: 0.050000
2023-04-06 10:02:24,776 epoch 66 - iter 744/3720 - loss 0.03214387 - time (sec): 127.01 - samples/sec: 4974.10 - lr: 0.050000
2023-04-06 10:03:26,943 epoch 66 - iter 1116/3720 - loss 0.03205532 - time (sec): 189.18 - samples/sec: 5006.82 - lr: 0.050000
2023-04-06 10:04:30,066 epoch 66 - iter 1488/3720 - loss 0.03196585 - time (sec): 252.30 - samples/sec: 5001.92 - lr: 0.050000
2023-04-06 10:05:33,207 epoch 66 - iter 1860/3720 - loss 0.03187393 - time (sec): 315.44 - samples/sec: 4998.37 - lr: 0.050000
2023-04-06 10:06:36,121 epoch 66 - iter 2232/3720 - loss 0.03190732 - time (sec): 378.36 - samples/sec: 5000.72 - lr: 0.050000
2023-04-06 10:07:39,422 epoch 66 - iter 2604/3720 - loss 0.03194692 - time (sec): 441.66 - samples/sec: 5000.81 - lr: 0.050000
2023-04-06 10:08:42,308 epoch 66 - iter 2976/3720 - loss 0.03192703 - time (sec): 504.54 - samples/sec: 4998.84 - lr: 0.050000
2023-04-06 10:09:45,643 epoch 66 - iter 3348/3720 - loss 0.03190535 - time (sec): 567.88 - samples/sec: 4995.70 - lr: 0.050000
2023-04-06 10:10:48,607 epoch 66 - iter 3720/3720 - loss 0.03205002 - time (sec): 630.84 - samples/sec: 4996.05 - lr: 0.050000
2023-04-06 10:10:48,608 ----------------------------------------------------------------------------------------------------
2023-04-06 10:10:48,608 EPOCH 66 done: loss 0.0321 - lr 0.050000
2023-04-06 10:10:48,608 BAD EPOCHS (no improvement): 1
2023-04-06 10:10:48,611 ----------------------------------------------------------------------------------------------------
2023-04-06 10:11:51,553 epoch 67 - iter 372/3720 - loss 0.03197003 - time (sec): 62.94 - samples/sec: 5006.72 - lr: 0.050000
2023-04-06 10:12:55,384 epoch 67 - iter 744/3720 - loss 0.03167444 - time (sec): 126.77 - samples/sec: 4979.36 - lr: 0.050000
2023-04-06 10:13:59,031 epoch 67 - iter 1116/3720 - loss 0.03179666 - time (sec): 190.42 - samples/sec: 4965.30 - lr: 0.050000
2023-04-06 10:15:02,813 epoch 67 - iter 1488/3720 - loss 0.03166737 - time (sec): 254.20 - samples/sec: 4961.16 - lr: 0.050000
2023-04-06 10:16:06,300 epoch 67 - iter 1860/3720 - loss 0.03170742 - time (sec): 317.69 - samples/sec: 4969.39 - lr: 0.050000
2023-04-06 10:17:09,727 epoch 67 - iter 2232/3720 - loss 0.03181418 - time (sec): 381.12 - samples/sec: 4966.40 - lr: 0.050000
2023-04-06 10:18:14,122 epoch 67 - iter 2604/3720 - loss 0.03163692 - time (sec): 445.51 - samples/sec: 4958.38 - lr: 0.050000
2023-04-06 10:19:17,530 epoch 67 - iter 2976/3720 - loss 0.03167661 - time (sec): 508.92 - samples/sec: 4955.14 - lr: 0.050000
2023-04-06 10:20:20,171 epoch 67 - iter 3348/3720 - loss 0.03166178 - time (sec): 571.56 - samples/sec: 4963.69 - lr: 0.050000
2023-04-06 10:21:22,597 epoch 67 - iter 3720/3720 - loss 0.03171928 - time (sec): 633.99 - samples/sec: 4971.27 - lr: 0.050000
2023-04-06 10:21:22,597 ----------------------------------------------------------------------------------------------------
2023-04-06 10:21:22,597 EPOCH 67 done: loss 0.0317 - lr 0.050000
2023-04-06 10:21:22,597 BAD EPOCHS (no improvement): 0
2023-04-06 10:21:22,600 ----------------------------------------------------------------------------------------------------
2023-04-06 10:22:26,310 epoch 68 - iter 372/3720 - loss 0.03161777 - time (sec): 63.71 - samples/sec: 4940.03 - lr: 0.050000
2023-04-06 10:23:30,377 epoch 68 - iter 744/3720 - loss 0.03148526 - time (sec): 127.78 - samples/sec: 4928.76 - lr: 0.050000
2023-04-06 10:24:33,195 epoch 68 - iter 1116/3720 - loss 0.03186291 - time (sec): 190.59 - samples/sec: 4956.30 - lr: 0.050000
2023-04-06 10:25:36,342 epoch 68 - iter 1488/3720 - loss 0.03175793 - time (sec): 253.74 - samples/sec: 4966.81 - lr: 0.050000
2023-04-06 10:26:41,348 epoch 68 - iter 1860/3720 - loss 0.03156923 - time (sec): 318.75 - samples/sec: 4945.81 - lr: 0.050000
2023-04-06 10:27:45,571 epoch 68 - iter 2232/3720 - loss 0.03161897 - time (sec): 382.97 - samples/sec: 4940.10 - lr: 0.050000
2023-04-06 10:28:49,262 epoch 68 - iter 2604/3720 - loss 0.03165787 - time (sec): 446.66 - samples/sec: 4942.58 - lr: 0.050000
2023-04-06 10:29:53,924 epoch 68 - iter 2976/3720 - loss 0.03164836 - time (sec): 511.32 - samples/sec: 4934.19 - lr: 0.050000
2023-04-06 10:30:57,633 epoch 68 - iter 3348/3720 - loss 0.03158081 - time (sec): 575.03 - samples/sec: 4935.44 - lr: 0.050000
2023-04-06 10:32:00,786 epoch 68 - iter 3720/3720 - loss 0.03160046 - time (sec): 638.19 - samples/sec: 4938.55 - lr: 0.050000
2023-04-06 10:32:00,786 ----------------------------------------------------------------------------------------------------
2023-04-06 10:32:00,786 EPOCH 68 done: loss 0.0316 - lr 0.050000
2023-04-06 10:32:00,786 BAD EPOCHS (no improvement): 0
2023-04-06 10:32:00,789 ----------------------------------------------------------------------------------------------------
2023-04-06 10:33:04,937 epoch 69 - iter 372/3720 - loss 0.03135092 - time (sec): 64.15 - samples/sec: 4939.21 - lr: 0.050000
2023-04-06 10:34:07,030 epoch 69 - iter 744/3720 - loss 0.03094274 - time (sec): 126.24 - samples/sec: 4971.53 - lr: 0.050000
2023-04-06 10:35:11,016 epoch 69 - iter 1116/3720 - loss 0.03116561 - time (sec): 190.23 - samples/sec: 4959.15 - lr: 0.050000
2023-04-06 10:36:15,342 epoch 69 - iter 1488/3720 - loss 0.03126021 - time (sec): 254.55 - samples/sec: 4945.28 - lr: 0.050000
2023-04-06 10:37:18,675 epoch 69 - iter 1860/3720 - loss 0.03134390 - time (sec): 317.89 - samples/sec: 4949.61 - lr: 0.050000
2023-04-06 10:38:22,341 epoch 69 - iter 2232/3720 - loss 0.03134850 - time (sec): 381.55 - samples/sec: 4949.88 - lr: 0.050000
2023-04-06 10:39:25,106 epoch 69 - iter 2604/3720 - loss 0.03142628 - time (sec): 444.32 - samples/sec: 4958.11 - lr: 0.050000
2023-04-06 10:40:28,241 epoch 69 - iter 2976/3720 - loss 0.03136588 - time (sec): 507.45 - samples/sec: 4960.83 - lr: 0.050000
2023-04-06 10:41:32,561 epoch 69 - iter 3348/3720 - loss 0.03130388 - time (sec): 571.77 - samples/sec: 4957.56 - lr: 0.050000
2023-04-06 10:42:36,606 epoch 69 - iter 3720/3720 - loss 0.03130843 - time (sec): 635.82 - samples/sec: 4956.95 - lr: 0.050000
2023-04-06 10:42:36,607 ----------------------------------------------------------------------------------------------------
2023-04-06 10:42:36,607 EPOCH 69 done: loss 0.0313 - lr 0.050000
2023-04-06 10:42:36,607 BAD EPOCHS (no improvement): 0
2023-04-06 10:42:36,609 ----------------------------------------------------------------------------------------------------
2023-04-06 10:43:40,981 epoch 70 - iter 372/3720 - loss 0.03116412 - time (sec): 64.37 - samples/sec: 4901.70 - lr: 0.050000
2023-04-06 10:44:44,319 epoch 70 - iter 744/3720 - loss 0.03118096 - time (sec): 127.71 - samples/sec: 4940.06 - lr: 0.050000
2023-04-06 10:45:48,485 epoch 70 - iter 1116/3720 - loss 0.03090943 - time (sec): 191.88 - samples/sec: 4929.28 - lr: 0.050000
2023-04-06 10:46:52,519 epoch 70 - iter 1488/3720 - loss 0.03093912 - time (sec): 255.91 - samples/sec: 4927.35 - lr: 0.050000
2023-04-06 10:47:55,740 epoch 70 - iter 1860/3720 - loss 0.03096841 - time (sec): 319.13 - samples/sec: 4924.15 - lr: 0.050000
2023-04-06 10:48:59,477 epoch 70 - iter 2232/3720 - loss 0.03105662 - time (sec): 382.87 - samples/sec: 4924.60 - lr: 0.050000
2023-04-06 10:50:02,531 epoch 70 - iter 2604/3720 - loss 0.03118693 - time (sec): 445.92 - samples/sec: 4940.22 - lr: 0.050000
2023-04-06 10:51:06,113 epoch 70 - iter 2976/3720 - loss 0.03112537 - time (sec): 509.50 - samples/sec: 4946.77 - lr: 0.050000
2023-04-06 10:52:09,431 epoch 70 - iter 3348/3720 - loss 0.03118119 - time (sec): 572.82 - samples/sec: 4951.62 - lr: 0.050000
2023-04-06 10:53:14,212 epoch 70 - iter 3720/3720 - loss 0.03123392 - time (sec): 637.60 - samples/sec: 4943.06 - lr: 0.050000
2023-04-06 10:53:14,213 ----------------------------------------------------------------------------------------------------
2023-04-06 10:53:14,213 EPOCH 70 done: loss 0.0312 - lr 0.050000
2023-04-06 10:53:14,213 BAD EPOCHS (no improvement): 0
2023-04-06 10:53:14,216 ----------------------------------------------------------------------------------------------------
2023-04-06 10:54:18,318 epoch 71 - iter 372/3720 - loss 0.03113377 - time (sec): 64.10 - samples/sec: 4917.96 - lr: 0.050000
2023-04-06 10:55:20,822 epoch 71 - iter 744/3720 - loss 0.03099367 - time (sec): 126.61 - samples/sec: 4957.95 - lr: 0.050000
2023-04-06 10:56:24,775 epoch 71 - iter 1116/3720 - loss 0.03123187 - time (sec): 190.56 - samples/sec: 4951.11 - lr: 0.050000
2023-04-06 10:57:28,392 epoch 71 - iter 1488/3720 - loss 0.03104929 - time (sec): 254.18 - samples/sec: 4961.57 - lr: 0.050000
2023-04-06 10:58:31,773 epoch 71 - iter 1860/3720 - loss 0.03090814 - time (sec): 317.56 - samples/sec: 4967.05 - lr: 0.050000
2023-04-06 10:59:33,384 epoch 71 - iter 2232/3720 - loss 0.03101887 - time (sec): 379.17 - samples/sec: 4986.04 - lr: 0.050000
2023-04-06 11:00:38,316 epoch 71 - iter 2604/3720 - loss 0.03107583 - time (sec): 444.10 - samples/sec: 4968.86 - lr: 0.050000
2023-04-06 11:01:41,235 epoch 71 - iter 2976/3720 - loss 0.03110385 - time (sec): 507.02 - samples/sec: 4969.69 - lr: 0.050000
2023-04-06 11:02:44,951 epoch 71 - iter 3348/3720 - loss 0.03109594 - time (sec): 570.74 - samples/sec: 4969.12 - lr: 0.050000
2023-04-06 11:03:48,078 epoch 71 - iter 3720/3720 - loss 0.03099105 - time (sec): 633.86 - samples/sec: 4972.24 - lr: 0.050000
2023-04-06 11:03:48,078 ----------------------------------------------------------------------------------------------------
2023-04-06 11:03:48,078 EPOCH 71 done: loss 0.0310 - lr 0.050000
2023-04-06 11:03:48,078 BAD EPOCHS (no improvement): 0
2023-04-06 11:03:48,082 ----------------------------------------------------------------------------------------------------
2023-04-06 11:04:51,421 epoch 72 - iter 372/3720 - loss 0.03013338 - time (sec): 63.34 - samples/sec: 4976.42 - lr: 0.050000
2023-04-06 11:05:54,665 epoch 72 - iter 744/3720 - loss 0.03050364 - time (sec): 126.58 - samples/sec: 4975.09 - lr: 0.050000
2023-04-06 11:06:56,685 epoch 72 - iter 1116/3720 - loss 0.03046667 - time (sec): 188.60 - samples/sec: 5008.99 - lr: 0.050000
2023-04-06 11:07:59,410 epoch 72 - iter 1488/3720 - loss 0.03057755 - time (sec): 251.33 - samples/sec: 5009.60 - lr: 0.050000
2023-04-06 11:09:00,391 epoch 72 - iter 1860/3720 - loss 0.03058173 - time (sec): 312.31 - samples/sec: 5038.96 - lr: 0.050000
2023-04-06 11:10:04,729 epoch 72 - iter 2232/3720 - loss 0.03055907 - time (sec): 376.65 - samples/sec: 5020.39 - lr: 0.050000
2023-04-06 11:11:08,809 epoch 72 - iter 2604/3720 - loss 0.03064763 - time (sec): 440.73 - samples/sec: 5003.90 - lr: 0.050000
2023-04-06 11:12:12,462 epoch 72 - iter 2976/3720 - loss 0.03068538 - time (sec): 504.38 - samples/sec: 4997.32 - lr: 0.050000
2023-04-06 11:13:16,757 epoch 72 - iter 3348/3720 - loss 0.03079806 - time (sec): 568.68 - samples/sec: 4985.31 - lr: 0.050000
2023-04-06 11:14:20,977 epoch 72 - iter 3720/3720 - loss 0.03080137 - time (sec): 632.90 - samples/sec: 4979.83 - lr: 0.050000
2023-04-06 11:14:20,978 ----------------------------------------------------------------------------------------------------
2023-04-06 11:14:20,978 EPOCH 72 done: loss 0.0308 - lr 0.050000
2023-04-06 11:14:20,978 BAD EPOCHS (no improvement): 0
2023-04-06 11:14:20,982 ----------------------------------------------------------------------------------------------------
2023-04-06 11:15:25,313 epoch 73 - iter 372/3720 - loss 0.03065767 - time (sec): 64.33 - samples/sec: 4924.92 - lr: 0.050000
2023-04-06 11:16:29,963 epoch 73 - iter 744/3720 - loss 0.03055942 - time (sec): 128.98 - samples/sec: 4906.21 - lr: 0.050000
2023-04-06 11:17:33,123 epoch 73 - iter 1116/3720 - loss 0.03072176 - time (sec): 192.14 - samples/sec: 4923.46 - lr: 0.050000
2023-04-06 11:18:36,847 epoch 73 - iter 1488/3720 - loss 0.03048773 - time (sec): 255.87 - samples/sec: 4933.71 - lr: 0.050000
2023-04-06 11:19:39,856 epoch 73 - iter 1860/3720 - loss 0.03055045 - time (sec): 318.87 - samples/sec: 4944.90 - lr: 0.050000
2023-04-06 11:20:44,634 epoch 73 - iter 2232/3720 - loss 0.03045798 - time (sec): 383.65 - samples/sec: 4936.56 - lr: 0.050000
2023-04-06 11:21:46,138 epoch 73 - iter 2604/3720 - loss 0.03051092 - time (sec): 445.16 - samples/sec: 4958.72 - lr: 0.050000
2023-04-06 11:22:49,617 epoch 73 - iter 2976/3720 - loss 0.03058982 - time (sec): 508.63 - samples/sec: 4959.01 - lr: 0.050000
2023-04-06 11:23:53,461 epoch 73 - iter 3348/3720 - loss 0.03067650 - time (sec): 572.48 - samples/sec: 4955.73 - lr: 0.050000
2023-04-06 11:24:57,226 epoch 73 - iter 3720/3720 - loss 0.03066288 - time (sec): 636.24 - samples/sec: 4953.62 - lr: 0.050000
2023-04-06 11:24:57,226 ----------------------------------------------------------------------------------------------------
2023-04-06 11:24:57,226 EPOCH 73 done: loss 0.0307 - lr 0.050000
2023-04-06 11:24:57,226 BAD EPOCHS (no improvement): 0
2023-04-06 11:24:57,230 ----------------------------------------------------------------------------------------------------
2023-04-06 11:25:59,629 epoch 74 - iter 372/3720 - loss 0.03113534 - time (sec): 62.40 - samples/sec: 5008.36 - lr: 0.050000
2023-04-06 11:27:03,370 epoch 74 - iter 744/3720 - loss 0.03054291 - time (sec): 126.14 - samples/sec: 4994.88 - lr: 0.050000
2023-04-06 11:28:07,338 epoch 74 - iter 1116/3720 - loss 0.03034973 - time (sec): 190.11 - samples/sec: 4982.07 - lr: 0.050000
2023-04-06 11:29:10,508 epoch 74 - iter 1488/3720 - loss 0.03051028 - time (sec): 253.28 - samples/sec: 4981.10 - lr: 0.050000
2023-04-06 11:30:14,732 epoch 74 - iter 1860/3720 - loss 0.03053525 - time (sec): 317.50 - samples/sec: 4962.14 - lr: 0.050000
2023-04-06 11:31:18,305 epoch 74 - iter 2232/3720 - loss 0.03054710 - time (sec): 381.08 - samples/sec: 4954.37 - lr: 0.050000
2023-04-06 11:32:22,559 epoch 74 - iter 2604/3720 - loss 0.03053230 - time (sec): 445.33 - samples/sec: 4952.66 - lr: 0.050000
2023-04-06 11:33:25,938 epoch 74 - iter 2976/3720 - loss 0.03060414 - time (sec): 508.71 - samples/sec: 4952.37 - lr: 0.050000
2023-04-06 11:34:28,978 epoch 74 - iter 3348/3720 - loss 0.03073909 - time (sec): 571.75 - samples/sec: 4956.25 - lr: 0.050000
2023-04-06 11:35:33,510 epoch 74 - iter 3720/3720 - loss 0.03073551 - time (sec): 636.28 - samples/sec: 4953.34 - lr: 0.050000
2023-04-06 11:35:33,510 ----------------------------------------------------------------------------------------------------
2023-04-06 11:35:33,510 EPOCH 74 done: loss 0.0307 - lr 0.050000
2023-04-06 11:35:33,510 BAD EPOCHS (no improvement): 1
2023-04-06 11:35:33,514 ----------------------------------------------------------------------------------------------------
2023-04-06 11:36:37,175 epoch 75 - iter 372/3720 - loss 0.02962953 - time (sec): 63.66 - samples/sec: 4984.36 - lr: 0.050000
2023-04-06 11:37:41,521 epoch 75 - iter 744/3720 - loss 0.02998085 - time (sec): 128.01 - samples/sec: 4947.37 - lr: 0.050000
2023-04-06 11:38:45,839 epoch 75 - iter 1116/3720 - loss 0.03047575 - time (sec): 192.33 - samples/sec: 4941.67 - lr: 0.050000
2023-04-06 11:39:49,011 epoch 75 - iter 1488/3720 - loss 0.03015728 - time (sec): 255.50 - samples/sec: 4956.57 - lr: 0.050000
2023-04-06 11:40:52,219 epoch 75 - iter 1860/3720 - loss 0.03017682 - time (sec): 318.70 - samples/sec: 4959.20 - lr: 0.050000
2023-04-06 11:41:54,500 epoch 75 - iter 2232/3720 - loss 0.03046283 - time (sec): 380.99 - samples/sec: 4973.68 - lr: 0.050000
2023-04-06 11:42:56,958 epoch 75 - iter 2604/3720 - loss 0.03051771 - time (sec): 443.44 - samples/sec: 4976.67 - lr: 0.050000
2023-04-06 11:44:00,143 epoch 75 - iter 2976/3720 - loss 0.03053466 - time (sec): 506.63 - samples/sec: 4980.69 - lr: 0.050000
2023-04-06 11:45:03,004 epoch 75 - iter 3348/3720 - loss 0.03053358 - time (sec): 569.49 - samples/sec: 4982.58 - lr: 0.050000
2023-04-06 11:46:06,889 epoch 75 - iter 3720/3720 - loss 0.03052326 - time (sec): 633.37 - samples/sec: 4976.06 - lr: 0.050000
2023-04-06 11:46:06,889 ----------------------------------------------------------------------------------------------------
2023-04-06 11:46:06,889 EPOCH 75 done: loss 0.0305 - lr 0.050000
2023-04-06 11:46:06,889 BAD EPOCHS (no improvement): 0
2023-04-06 11:46:06,892 ----------------------------------------------------------------------------------------------------
2023-04-06 11:47:10,468 epoch 76 - iter 372/3720 - loss 0.03043636 - time (sec): 63.58 - samples/sec: 4976.19 - lr: 0.050000
2023-04-06 11:48:14,365 epoch 76 - iter 744/3720 - loss 0.03054553 - time (sec): 127.47 - samples/sec: 4949.69 - lr: 0.050000
2023-04-06 11:49:18,478 epoch 76 - iter 1116/3720 - loss 0.03041469 - time (sec): 191.59 - samples/sec: 4946.42 - lr: 0.050000
2023-04-06 11:50:20,803 epoch 76 - iter 1488/3720 - loss 0.03043815 - time (sec): 253.91 - samples/sec: 4961.30 - lr: 0.050000
2023-04-06 11:51:24,769 epoch 76 - iter 1860/3720 - loss 0.03060184 - time (sec): 317.88 - samples/sec: 4949.65 - lr: 0.050000
2023-04-06 11:52:28,905 epoch 76 - iter 2232/3720 - loss 0.03052971 - time (sec): 382.01 - samples/sec: 4945.22 - lr: 0.050000
2023-04-06 11:53:32,705 epoch 76 - iter 2604/3720 - loss 0.03047486 - time (sec): 445.81 - samples/sec: 4944.69 - lr: 0.050000
2023-04-06 11:54:36,541 epoch 76 - iter 2976/3720 - loss 0.03056677 - time (sec): 509.65 - samples/sec: 4945.04 - lr: 0.050000
2023-04-06 11:55:41,564 epoch 76 - iter 3348/3720 - loss 0.03062374 - time (sec): 574.67 - samples/sec: 4938.42 - lr: 0.050000
2023-04-06 11:56:45,199 epoch 76 - iter 3720/3720 - loss 0.03076034 - time (sec): 638.31 - samples/sec: 4937.61 - lr: 0.050000
2023-04-06 11:56:45,199 ----------------------------------------------------------------------------------------------------
2023-04-06 11:56:45,199 EPOCH 76 done: loss 0.0308 - lr 0.050000
2023-04-06 11:56:45,199 BAD EPOCHS (no improvement): 1
2023-04-06 11:56:45,203 ----------------------------------------------------------------------------------------------------
2023-04-06 11:57:48,768 epoch 77 - iter 372/3720 - loss 0.02994766 - time (sec): 63.57 - samples/sec: 4970.58 - lr: 0.050000
2023-04-06 11:58:52,262 epoch 77 - iter 744/3720 - loss 0.02974964 - time (sec): 127.06 - samples/sec: 4941.68 - lr: 0.050000
2023-04-06 11:59:56,463 epoch 77 - iter 1116/3720 - loss 0.03004014 - time (sec): 191.26 - samples/sec: 4924.04 - lr: 0.050000
2023-04-06 12:01:00,778 epoch 77 - iter 1488/3720 - loss 0.02998906 - time (sec): 255.57 - samples/sec: 4923.95 - lr: 0.050000
2023-04-06 12:02:04,092 epoch 77 - iter 1860/3720 - loss 0.03021740 - time (sec): 318.89 - samples/sec: 4934.46 - lr: 0.050000
2023-04-06 12:03:07,272 epoch 77 - iter 2232/3720 - loss 0.03039703 - time (sec): 382.07 - samples/sec: 4941.79 - lr: 0.050000
2023-04-06 12:04:11,226 epoch 77 - iter 2604/3720 - loss 0.03051115 - time (sec): 446.02 - samples/sec: 4941.60 - lr: 0.050000
2023-04-06 12:05:15,619 epoch 77 - iter 2976/3720 - loss 0.03050436 - time (sec): 510.42 - samples/sec: 4940.33 - lr: 0.050000
2023-04-06 12:06:18,391 epoch 77 - iter 3348/3720 - loss 0.03054092 - time (sec): 573.19 - samples/sec: 4949.83 - lr: 0.050000
2023-04-06 12:07:20,984 epoch 77 - iter 3720/3720 - loss 0.03059546 - time (sec): 635.78 - samples/sec: 4957.23 - lr: 0.050000
2023-04-06 12:07:20,985 ----------------------------------------------------------------------------------------------------
2023-04-06 12:07:20,985 EPOCH 77 done: loss 0.0306 - lr 0.050000
2023-04-06 12:07:20,985 BAD EPOCHS (no improvement): 2
2023-04-06 12:07:20,988 ----------------------------------------------------------------------------------------------------
2023-04-06 12:08:24,842 epoch 78 - iter 372/3720 - loss 0.03085487 - time (sec): 63.85 - samples/sec: 4952.46 - lr: 0.050000
2023-04-06 12:09:28,667 epoch 78 - iter 744/3720 - loss 0.03053833 - time (sec): 127.68 - samples/sec: 4940.54 - lr: 0.050000
2023-04-06 12:10:31,054 epoch 78 - iter 1116/3720 - loss 0.03042577 - time (sec): 190.07 - samples/sec: 4975.15 - lr: 0.050000
2023-04-06 12:11:35,363 epoch 78 - iter 1488/3720 - loss 0.03042125 - time (sec): 254.38 - samples/sec: 4961.07 - lr: 0.050000
2023-04-06 12:12:37,842 epoch 78 - iter 1860/3720 - loss 0.03051081 - time (sec): 316.85 - samples/sec: 4978.06 - lr: 0.050000
2023-04-06 12:13:40,485 epoch 78 - iter 2232/3720 - loss 0.03049864 - time (sec): 379.50 - samples/sec: 4987.83 - lr: 0.050000
2023-04-06 12:14:43,634 epoch 78 - iter 2604/3720 - loss 0.03051224 - time (sec): 442.65 - samples/sec: 4983.42 - lr: 0.050000
2023-04-06 12:15:45,797 epoch 78 - iter 2976/3720 - loss 0.03058829 - time (sec): 504.81 - samples/sec: 4991.41 - lr: 0.050000
2023-04-06 12:16:48,842 epoch 78 - iter 3348/3720 - loss 0.03046737 - time (sec): 567.85 - samples/sec: 4992.96 - lr: 0.050000
2023-04-06 12:17:52,755 epoch 78 - iter 3720/3720 - loss 0.03050550 - time (sec): 631.77 - samples/sec: 4988.72 - lr: 0.050000
2023-04-06 12:17:52,756 ----------------------------------------------------------------------------------------------------
2023-04-06 12:17:52,756 EPOCH 78 done: loss 0.0305 - lr 0.050000
2023-04-06 12:17:52,756 BAD EPOCHS (no improvement): 0
2023-04-06 12:17:52,759 ----------------------------------------------------------------------------------------------------
2023-04-06 12:18:57,352 epoch 79 - iter 372/3720 - loss 0.02989969 - time (sec): 64.59 - samples/sec: 4938.78 - lr: 0.050000
2023-04-06 12:20:00,863 epoch 79 - iter 744/3720 - loss 0.03037559 - time (sec): 128.10 - samples/sec: 4955.85 - lr: 0.050000
2023-04-06 12:21:04,399 epoch 79 - iter 1116/3720 - loss 0.03013356 - time (sec): 191.64 - samples/sec: 4959.13 - lr: 0.050000
2023-04-06 12:22:06,878 epoch 79 - iter 1488/3720 - loss 0.03009245 - time (sec): 254.12 - samples/sec: 4978.84 - lr: 0.050000
2023-04-06 12:23:08,817 epoch 79 - iter 1860/3720 - loss 0.03002431 - time (sec): 316.06 - samples/sec: 4996.97 - lr: 0.050000
2023-04-06 12:24:11,867 epoch 79 - iter 2232/3720 - loss 0.03013416 - time (sec): 379.11 - samples/sec: 4992.95 - lr: 0.050000
2023-04-06 12:25:15,535 epoch 79 - iter 2604/3720 - loss 0.03017423 - time (sec): 442.78 - samples/sec: 4986.62 - lr: 0.050000
2023-04-06 12:26:19,252 epoch 79 - iter 2976/3720 - loss 0.03027454 - time (sec): 506.49 - samples/sec: 4980.98 - lr: 0.050000
2023-04-06 12:27:22,216 epoch 79 - iter 3348/3720 - loss 0.03035162 - time (sec): 569.46 - samples/sec: 4983.40 - lr: 0.050000
2023-04-06 12:28:25,719 epoch 79 - iter 3720/3720 - loss 0.03040132 - time (sec): 632.96 - samples/sec: 4979.32 - lr: 0.050000
2023-04-06 12:28:25,720 ----------------------------------------------------------------------------------------------------
2023-04-06 12:28:25,720 EPOCH 79 done: loss 0.0304 - lr 0.050000
2023-04-06 12:28:25,720 BAD EPOCHS (no improvement): 0
2023-04-06 12:28:25,723 ----------------------------------------------------------------------------------------------------
2023-04-06 12:29:29,741 epoch 80 - iter 372/3720 - loss 0.02967585 - time (sec): 64.02 - samples/sec: 4965.95 - lr: 0.050000
2023-04-06 12:30:32,266 epoch 80 - iter 744/3720 - loss 0.02980327 - time (sec): 126.54 - samples/sec: 4992.19 - lr: 0.050000
2023-04-06 12:31:35,831 epoch 80 - iter 1116/3720 - loss 0.02997426 - time (sec): 190.11 - samples/sec: 4983.89 - lr: 0.050000
2023-04-06 12:32:39,623 epoch 80 - iter 1488/3720 - loss 0.03019761 - time (sec): 253.90 - samples/sec: 4973.92 - lr: 0.050000
2023-04-06 12:33:42,193 epoch 80 - iter 1860/3720 - loss 0.03017472 - time (sec): 316.47 - samples/sec: 4988.34 - lr: 0.050000
2023-04-06 12:34:45,880 epoch 80 - iter 2232/3720 - loss 0.03006042 - time (sec): 380.16 - samples/sec: 4982.57 - lr: 0.050000
2023-04-06 12:35:50,163 epoch 80 - iter 2604/3720 - loss 0.03008109 - time (sec): 444.44 - samples/sec: 4974.44 - lr: 0.050000
2023-04-06 12:36:53,578 epoch 80 - iter 2976/3720 - loss 0.03009367 - time (sec): 507.85 - samples/sec: 4971.11 - lr: 0.050000
2023-04-06 12:37:56,594 epoch 80 - iter 3348/3720 - loss 0.03025149 - time (sec): 570.87 - samples/sec: 4973.91 - lr: 0.050000
2023-04-06 12:38:58,959 epoch 80 - iter 3720/3720 - loss 0.03029803 - time (sec): 633.24 - samples/sec: 4977.15 - lr: 0.050000
2023-04-06 12:38:58,959 ----------------------------------------------------------------------------------------------------
2023-04-06 12:38:58,959 EPOCH 80 done: loss 0.0303 - lr 0.050000
2023-04-06 12:38:58,959 BAD EPOCHS (no improvement): 0
2023-04-06 12:38:58,962 ----------------------------------------------------------------------------------------------------
2023-04-06 12:40:01,930 epoch 81 - iter 372/3720 - loss 0.02975027 - time (sec): 62.97 - samples/sec: 4999.89 - lr: 0.050000
2023-04-06 12:41:04,302 epoch 81 - iter 744/3720 - loss 0.03051976 - time (sec): 125.34 - samples/sec: 5016.68 - lr: 0.050000
2023-04-06 12:42:07,250 epoch 81 - iter 1116/3720 - loss 0.03076626 - time (sec): 188.29 - samples/sec: 5016.89 - lr: 0.050000
2023-04-06 12:43:10,955 epoch 81 - iter 1488/3720 - loss 0.03052334 - time (sec): 251.99 - samples/sec: 5003.81 - lr: 0.050000
2023-04-06 12:44:15,673 epoch 81 - iter 1860/3720 - loss 0.03042447 - time (sec): 316.71 - samples/sec: 4987.43 - lr: 0.050000
2023-04-06 12:45:19,950 epoch 81 - iter 2232/3720 - loss 0.03036129 - time (sec): 380.99 - samples/sec: 4971.63 - lr: 0.050000
2023-04-06 12:46:23,964 epoch 81 - iter 2604/3720 - loss 0.03039412 - time (sec): 445.00 - samples/sec: 4966.96 - lr: 0.050000
2023-04-06 12:47:26,745 epoch 81 - iter 2976/3720 - loss 0.03030867 - time (sec): 507.78 - samples/sec: 4968.93 - lr: 0.050000
2023-04-06 12:48:30,741 epoch 81 - iter 3348/3720 - loss 0.03025832 - time (sec): 571.78 - samples/sec: 4962.95 - lr: 0.050000
2023-04-06 12:49:34,322 epoch 81 - iter 3720/3720 - loss 0.03027550 - time (sec): 635.36 - samples/sec: 4960.51 - lr: 0.050000
2023-04-06 12:49:34,322 ----------------------------------------------------------------------------------------------------
2023-04-06 12:49:34,322 EPOCH 81 done: loss 0.0303 - lr 0.050000
2023-04-06 12:49:34,322 BAD EPOCHS (no improvement): 0
2023-04-06 12:49:34,325 ----------------------------------------------------------------------------------------------------
2023-04-06 12:50:37,764 epoch 82 - iter 372/3720 - loss 0.02977248 - time (sec): 63.44 - samples/sec: 4980.45 - lr: 0.050000
2023-04-06 12:51:40,812 epoch 82 - iter 744/3720 - loss 0.02977672 - time (sec): 126.49 - samples/sec: 4972.33 - lr: 0.050000
2023-04-06 12:52:43,768 epoch 82 - iter 1116/3720 - loss 0.02981920 - time (sec): 189.44 - samples/sec: 4986.93 - lr: 0.050000
2023-04-06 12:53:47,696 epoch 82 - iter 1488/3720 - loss 0.02974237 - time (sec): 253.37 - samples/sec: 4987.24 - lr: 0.050000
2023-04-06 12:54:51,401 epoch 82 - iter 1860/3720 - loss 0.02983702 - time (sec): 317.08 - samples/sec: 4971.99 - lr: 0.050000
2023-04-06 12:55:55,682 epoch 82 - iter 2232/3720 - loss 0.02997370 - time (sec): 381.36 - samples/sec: 4963.87 - lr: 0.050000
2023-04-06 12:56:58,728 epoch 82 - iter 2604/3720 - loss 0.03010291 - time (sec): 444.40 - samples/sec: 4967.19 - lr: 0.050000
2023-04-06 12:58:01,531 epoch 82 - iter 2976/3720 - loss 0.03005695 - time (sec): 507.21 - samples/sec: 4974.33 - lr: 0.050000
2023-04-06 12:59:05,659 epoch 82 - iter 3348/3720 - loss 0.03006183 - time (sec): 571.33 - samples/sec: 4965.83 - lr: 0.050000
2023-04-06 13:00:08,499 epoch 82 - iter 3720/3720 - loss 0.03017237 - time (sec): 634.17 - samples/sec: 4969.80 - lr: 0.050000
2023-04-06 13:00:08,499 ----------------------------------------------------------------------------------------------------
2023-04-06 13:00:08,499 EPOCH 82 done: loss 0.0302 - lr 0.050000
2023-04-06 13:00:08,499 BAD EPOCHS (no improvement): 0
2023-04-06 13:00:08,502 ----------------------------------------------------------------------------------------------------
2023-04-06 13:01:12,221 epoch 83 - iter 372/3720 - loss 0.03043908 - time (sec): 63.72 - samples/sec: 4951.87 - lr: 0.050000
2023-04-06 13:02:16,000 epoch 83 - iter 744/3720 - loss 0.03032200 - time (sec): 127.50 - samples/sec: 4941.02 - lr: 0.050000
2023-04-06 13:03:20,241 epoch 83 - iter 1116/3720 - loss 0.03014522 - time (sec): 191.74 - samples/sec: 4933.39 - lr: 0.050000
2023-04-06 13:04:24,273 epoch 83 - iter 1488/3720 - loss 0.03036548 - time (sec): 255.77 - samples/sec: 4933.74 - lr: 0.050000
2023-04-06 13:05:27,755 epoch 83 - iter 1860/3720 - loss 0.03029812 - time (sec): 319.25 - samples/sec: 4932.26 - lr: 0.050000
2023-04-06 13:06:31,221 epoch 83 - iter 2232/3720 - loss 0.03030823 - time (sec): 382.72 - samples/sec: 4940.43 - lr: 0.050000
2023-04-06 13:07:33,936 epoch 83 - iter 2604/3720 - loss 0.03029005 - time (sec): 445.43 - samples/sec: 4954.28 - lr: 0.050000
2023-04-06 13:08:38,059 epoch 83 - iter 2976/3720 - loss 0.03027530 - time (sec): 509.56 - samples/sec: 4948.93 - lr: 0.050000
2023-04-06 13:09:41,850 epoch 83 - iter 3348/3720 - loss 0.03022734 - time (sec): 573.35 - samples/sec: 4947.83 - lr: 0.050000
2023-04-06 13:10:45,251 epoch 83 - iter 3720/3720 - loss 0.03015810 - time (sec): 636.75 - samples/sec: 4949.70 - lr: 0.050000
2023-04-06 13:10:45,251 ----------------------------------------------------------------------------------------------------
2023-04-06 13:10:45,251 EPOCH 83 done: loss 0.0302 - lr 0.050000
2023-04-06 13:10:45,251 BAD EPOCHS (no improvement): 0
2023-04-06 13:10:45,254 ----------------------------------------------------------------------------------------------------
2023-04-06 13:11:48,427 epoch 84 - iter 372/3720 - loss 0.02908372 - time (sec): 63.17 - samples/sec: 4960.66 - lr: 0.050000
2023-04-06 13:12:52,193 epoch 84 - iter 744/3720 - loss 0.02956119 - time (sec): 126.94 - samples/sec: 4942.69 - lr: 0.050000
2023-04-06 13:13:56,791 epoch 84 - iter 1116/3720 - loss 0.02940072 - time (sec): 191.54 - samples/sec: 4934.39 - lr: 0.050000
2023-04-06 13:14:59,569 epoch 84 - iter 1488/3720 - loss 0.02954165 - time (sec): 254.31 - samples/sec: 4952.96 - lr: 0.050000
2023-04-06 13:16:01,106 epoch 84 - iter 1860/3720 - loss 0.02981238 - time (sec): 315.85 - samples/sec: 4978.97 - lr: 0.050000
2023-04-06 13:17:04,334 epoch 84 - iter 2232/3720 - loss 0.02981780 - time (sec): 379.08 - samples/sec: 4984.31 - lr: 0.050000
2023-04-06 13:18:08,919 epoch 84 - iter 2604/3720 - loss 0.02981041 - time (sec): 443.67 - samples/sec: 4975.37 - lr: 0.050000
2023-04-06 13:19:12,215 epoch 84 - iter 2976/3720 - loss 0.02984617 - time (sec): 506.96 - samples/sec: 4976.52 - lr: 0.050000
2023-04-06 13:20:14,762 epoch 84 - iter 3348/3720 - loss 0.02999180 - time (sec): 569.51 - samples/sec: 4981.50 - lr: 0.050000
2023-04-06 13:21:18,532 epoch 84 - iter 3720/3720 - loss 0.03004575 - time (sec): 633.28 - samples/sec: 4976.83 - lr: 0.050000
2023-04-06 13:21:18,532 ----------------------------------------------------------------------------------------------------
2023-04-06 13:21:18,532 EPOCH 84 done: loss 0.0300 - lr 0.050000
2023-04-06 13:21:18,532 BAD EPOCHS (no improvement): 0
2023-04-06 13:21:18,535 ----------------------------------------------------------------------------------------------------
2023-04-06 13:22:22,664 epoch 85 - iter 372/3720 - loss 0.02902546 - time (sec): 64.13 - samples/sec: 4936.46 - lr: 0.050000
2023-04-06 13:23:26,348 epoch 85 - iter 744/3720 - loss 0.02949886 - time (sec): 127.81 - samples/sec: 4943.93 - lr: 0.050000
2023-04-06 13:24:31,398 epoch 85 - iter 1116/3720 - loss 0.02952757 - time (sec): 192.86 - samples/sec: 4916.08 - lr: 0.050000
2023-04-06 13:25:35,221 epoch 85 - iter 1488/3720 - loss 0.02998131 - time (sec): 256.69 - samples/sec: 4911.89 - lr: 0.050000
2023-04-06 13:26:39,389 epoch 85 - iter 1860/3720 - loss 0.02996414 - time (sec): 320.85 - samples/sec: 4908.02 - lr: 0.050000
2023-04-06 13:27:43,083 epoch 85 - iter 2232/3720 - loss 0.03001694 - time (sec): 384.55 - samples/sec: 4915.96 - lr: 0.050000
2023-04-06 13:28:47,258 epoch 85 - iter 2604/3720 - loss 0.02993371 - time (sec): 448.72 - samples/sec: 4918.72 - lr: 0.050000
2023-04-06 13:29:51,792 epoch 85 - iter 2976/3720 - loss 0.02987828 - time (sec): 513.26 - samples/sec: 4912.85 - lr: 0.050000
2023-04-06 13:30:55,747 epoch 85 - iter 3348/3720 - loss 0.02983418 - time (sec): 577.21 - samples/sec: 4912.32 - lr: 0.050000
2023-04-06 13:31:59,736 epoch 85 - iter 3720/3720 - loss 0.02986079 - time (sec): 641.20 - samples/sec: 4915.33 - lr: 0.050000
2023-04-06 13:31:59,737 ----------------------------------------------------------------------------------------------------
2023-04-06 13:31:59,737 EPOCH 85 done: loss 0.0299 - lr 0.050000
2023-04-06 13:31:59,737 BAD EPOCHS (no improvement): 0
2023-04-06 13:31:59,740 ----------------------------------------------------------------------------------------------------
2023-04-06 13:33:03,757 epoch 86 - iter 372/3720 - loss 0.02923677 - time (sec): 64.02 - samples/sec: 4934.86 - lr: 0.050000
2023-04-06 13:34:06,935 epoch 86 - iter 744/3720 - loss 0.02950851 - time (sec): 127.20 - samples/sec: 4968.11 - lr: 0.050000
2023-04-06 13:35:09,973 epoch 86 - iter 1116/3720 - loss 0.02994130 - time (sec): 190.23 - samples/sec: 4981.61 - lr: 0.050000
2023-04-06 13:36:12,083 epoch 86 - iter 1488/3720 - loss 0.03002814 - time (sec): 252.34 - samples/sec: 5004.61 - lr: 0.050000
2023-04-06 13:37:14,901 epoch 86 - iter 1860/3720 - loss 0.02999422 - time (sec): 315.16 - samples/sec: 5009.75 - lr: 0.050000
2023-04-06 13:38:17,771 epoch 86 - iter 2232/3720 - loss 0.02999098 - time (sec): 378.03 - samples/sec: 5010.54 - lr: 0.050000
2023-04-06 13:39:21,085 epoch 86 - iter 2604/3720 - loss 0.02992430 - time (sec): 441.34 - samples/sec: 5001.78 - lr: 0.050000
2023-04-06 13:40:25,116 epoch 86 - iter 2976/3720 - loss 0.02990109 - time (sec): 505.38 - samples/sec: 4993.15 - lr: 0.050000
2023-04-06 13:41:27,680 epoch 86 - iter 3348/3720 - loss 0.02995638 - time (sec): 567.94 - samples/sec: 4993.61 - lr: 0.050000
2023-04-06 13:42:30,937 epoch 86 - iter 3720/3720 - loss 0.02997383 - time (sec): 631.20 - samples/sec: 4993.23 - lr: 0.050000
2023-04-06 13:42:30,937 ----------------------------------------------------------------------------------------------------
2023-04-06 13:42:30,937 EPOCH 86 done: loss 0.0300 - lr 0.050000
2023-04-06 13:42:30,937 BAD EPOCHS (no improvement): 1
2023-04-06 13:42:30,940 ----------------------------------------------------------------------------------------------------
2023-04-06 13:43:34,861 epoch 87 - iter 372/3720 - loss 0.02955950 - time (sec): 63.92 - samples/sec: 4928.74 - lr: 0.050000
2023-04-06 13:44:39,653 epoch 87 - iter 744/3720 - loss 0.02946349 - time (sec): 128.71 - samples/sec: 4932.74 - lr: 0.050000
2023-04-06 13:45:43,726 epoch 87 - iter 1116/3720 - loss 0.02977305 - time (sec): 192.79 - samples/sec: 4932.02 - lr: 0.050000
2023-04-06 13:46:47,650 epoch 87 - iter 1488/3720 - loss 0.02979529 - time (sec): 256.71 - samples/sec: 4923.84 - lr: 0.050000
2023-04-06 13:47:51,153 epoch 87 - iter 1860/3720 - loss 0.02988215 - time (sec): 320.21 - samples/sec: 4927.51 - lr: 0.050000
2023-04-06 13:48:55,054 epoch 87 - iter 2232/3720 - loss 0.02994461 - time (sec): 384.11 - samples/sec: 4926.85 - lr: 0.050000
2023-04-06 13:50:00,227 epoch 87 - iter 2604/3720 - loss 0.02983321 - time (sec): 449.29 - samples/sec: 4915.60 - lr: 0.050000
2023-04-06 13:51:03,744 epoch 87 - iter 2976/3720 - loss 0.02987213 - time (sec): 512.80 - samples/sec: 4916.68 - lr: 0.050000
2023-04-06 13:52:07,890 epoch 87 - iter 3348/3720 - loss 0.02979281 - time (sec): 576.95 - samples/sec: 4916.04 - lr: 0.050000
2023-04-06 13:53:12,302 epoch 87 - iter 3720/3720 - loss 0.02979169 - time (sec): 641.36 - samples/sec: 4914.10 - lr: 0.050000
2023-04-06 13:53:12,302 ----------------------------------------------------------------------------------------------------
2023-04-06 13:53:12,302 EPOCH 87 done: loss 0.0298 - lr 0.050000
2023-04-06 13:53:12,302 BAD EPOCHS (no improvement): 0
2023-04-06 13:53:12,305 ----------------------------------------------------------------------------------------------------
2023-04-06 13:54:16,223 epoch 88 - iter 372/3720 - loss 0.02878423 - time (sec): 63.92 - samples/sec: 4925.16 - lr: 0.050000
2023-04-06 13:55:19,407 epoch 88 - iter 744/3720 - loss 0.02900817 - time (sec): 127.10 - samples/sec: 4931.56 - lr: 0.050000
2023-04-06 13:56:22,766 epoch 88 - iter 1116/3720 - loss 0.02934331 - time (sec): 190.46 - samples/sec: 4943.01 - lr: 0.050000
2023-04-06 13:57:25,382 epoch 88 - iter 1488/3720 - loss 0.02947376 - time (sec): 253.08 - samples/sec: 4964.91 - lr: 0.050000
2023-04-06 13:58:29,104 epoch 88 - iter 1860/3720 - loss 0.02954726 - time (sec): 316.80 - samples/sec: 4968.14 - lr: 0.050000
2023-04-06 13:59:31,746 epoch 88 - iter 2232/3720 - loss 0.02971221 - time (sec): 379.44 - samples/sec: 4981.73 - lr: 0.050000
2023-04-06 14:00:35,645 epoch 88 - iter 2604/3720 - loss 0.02971017 - time (sec): 443.34 - samples/sec: 4973.40 - lr: 0.050000
2023-04-06 14:01:39,863 epoch 88 - iter 2976/3720 - loss 0.02969125 - time (sec): 507.56 - samples/sec: 4969.18 - lr: 0.050000
2023-04-06 14:02:43,882 epoch 88 - iter 3348/3720 - loss 0.02979260 - time (sec): 571.58 - samples/sec: 4965.68 - lr: 0.050000
2023-04-06 14:03:46,446 epoch 88 - iter 3720/3720 - loss 0.02979054 - time (sec): 634.14 - samples/sec: 4970.05 - lr: 0.050000
2023-04-06 14:03:46,447 ----------------------------------------------------------------------------------------------------
2023-04-06 14:03:46,447 EPOCH 88 done: loss 0.0298 - lr 0.050000
2023-04-06 14:03:46,447 BAD EPOCHS (no improvement): 0
2023-04-06 14:03:46,451 ----------------------------------------------------------------------------------------------------
2023-04-06 14:04:49,702 epoch 89 - iter 372/3720 - loss 0.02946297 - time (sec): 63.25 - samples/sec: 4971.61 - lr: 0.050000
2023-04-06 14:05:54,264 epoch 89 - iter 744/3720 - loss 0.03008024 - time (sec): 127.81 - samples/sec: 4928.31 - lr: 0.050000
2023-04-06 14:06:59,024 epoch 89 - iter 1116/3720 - loss 0.02983486 - time (sec): 192.57 - samples/sec: 4923.71 - lr: 0.050000
2023-04-06 14:08:03,409 epoch 89 - iter 1488/3720 - loss 0.02967858 - time (sec): 256.96 - samples/sec: 4917.98 - lr: 0.050000
2023-04-06 14:09:06,664 epoch 89 - iter 1860/3720 - loss 0.02955772 - time (sec): 320.21 - samples/sec: 4928.70 - lr: 0.050000
2023-04-06 14:10:09,000 epoch 89 - iter 2232/3720 - loss 0.02962135 - time (sec): 382.55 - samples/sec: 4948.27 - lr: 0.050000
2023-04-06 14:11:11,921 epoch 89 - iter 2604/3720 - loss 0.02959664 - time (sec): 445.47 - samples/sec: 4951.07 - lr: 0.050000
2023-04-06 14:12:16,345 epoch 89 - iter 2976/3720 - loss 0.02970072 - time (sec): 509.89 - samples/sec: 4944.29 - lr: 0.050000
2023-04-06 14:13:21,194 epoch 89 - iter 3348/3720 - loss 0.02971199 - time (sec): 574.74 - samples/sec: 4938.18 - lr: 0.050000
2023-04-06 14:14:25,101 epoch 89 - iter 3720/3720 - loss 0.02974746 - time (sec): 638.65 - samples/sec: 4934.96 - lr: 0.050000
2023-04-06 14:14:25,101 ----------------------------------------------------------------------------------------------------
2023-04-06 14:14:25,101 EPOCH 89 done: loss 0.0297 - lr 0.050000
2023-04-06 14:14:25,101 BAD EPOCHS (no improvement): 0
2023-04-06 14:14:25,104 ----------------------------------------------------------------------------------------------------
2023-04-06 14:15:29,312 epoch 90 - iter 372/3720 - loss 0.02967709 - time (sec): 64.21 - samples/sec: 4935.27 - lr: 0.050000
2023-04-06 14:16:33,686 epoch 90 - iter 744/3720 - loss 0.02956470 - time (sec): 128.58 - samples/sec: 4924.76 - lr: 0.050000
2023-04-06 14:17:37,387 epoch 90 - iter 1116/3720 - loss 0.02956805 - time (sec): 192.28 - samples/sec: 4929.26 - lr: 0.050000
2023-04-06 14:18:40,952 epoch 90 - iter 1488/3720 - loss 0.02963412 - time (sec): 255.85 - samples/sec: 4934.68 - lr: 0.050000
2023-04-06 14:19:45,152 epoch 90 - iter 1860/3720 - loss 0.02984104 - time (sec): 320.05 - samples/sec: 4927.35 - lr: 0.050000
2023-04-06 14:20:49,368 epoch 90 - iter 2232/3720 - loss 0.02978023 - time (sec): 384.26 - samples/sec: 4924.98 - lr: 0.050000
2023-04-06 14:21:53,220 epoch 90 - iter 2604/3720 - loss 0.02980348 - time (sec): 448.12 - samples/sec: 4929.15 - lr: 0.050000
2023-04-06 14:22:57,003 epoch 90 - iter 2976/3720 - loss 0.02980065 - time (sec): 511.90 - samples/sec: 4926.88 - lr: 0.050000
2023-04-06 14:24:00,692 epoch 90 - iter 3348/3720 - loss 0.02970333 - time (sec): 575.59 - samples/sec: 4929.03 - lr: 0.050000
2023-04-06 14:25:05,523 epoch 90 - iter 3720/3720 - loss 0.02968234 - time (sec): 640.42 - samples/sec: 4921.34 - lr: 0.050000
2023-04-06 14:25:05,523 ----------------------------------------------------------------------------------------------------
2023-04-06 14:25:05,523 EPOCH 90 done: loss 0.0297 - lr 0.050000
2023-04-06 14:25:05,523 BAD EPOCHS (no improvement): 0
2023-04-06 14:25:05,527 ----------------------------------------------------------------------------------------------------
2023-04-06 14:26:09,538 epoch 91 - iter 372/3720 - loss 0.02855779 - time (sec): 64.01 - samples/sec: 4950.08 - lr: 0.050000
2023-04-06 14:27:12,559 epoch 91 - iter 744/3720 - loss 0.02911246 - time (sec): 127.03 - samples/sec: 4970.09 - lr: 0.050000
2023-04-06 14:28:15,052 epoch 91 - iter 1116/3720 - loss 0.02935122 - time (sec): 189.52 - samples/sec: 4994.77 - lr: 0.050000
2023-04-06 14:29:18,974 epoch 91 - iter 1488/3720 - loss 0.02941200 - time (sec): 253.45 - samples/sec: 4983.38 - lr: 0.050000
2023-04-06 14:30:23,126 epoch 91 - iter 1860/3720 - loss 0.02947597 - time (sec): 317.60 - samples/sec: 4966.61 - lr: 0.050000
2023-04-06 14:31:25,350 epoch 91 - iter 2232/3720 - loss 0.02953269 - time (sec): 379.82 - samples/sec: 4975.14 - lr: 0.050000
2023-04-06 14:32:28,560 epoch 91 - iter 2604/3720 - loss 0.02945525 - time (sec): 443.03 - samples/sec: 4977.74 - lr: 0.050000
2023-04-06 14:33:32,497 epoch 91 - iter 2976/3720 - loss 0.02944464 - time (sec): 506.97 - samples/sec: 4974.91 - lr: 0.050000
2023-04-06 14:34:35,915 epoch 91 - iter 3348/3720 - loss 0.02957418 - time (sec): 570.39 - samples/sec: 4974.90 - lr: 0.050000
2023-04-06 14:35:39,277 epoch 91 - iter 3720/3720 - loss 0.02955067 - time (sec): 633.75 - samples/sec: 4973.12 - lr: 0.050000
2023-04-06 14:35:39,278 ----------------------------------------------------------------------------------------------------
2023-04-06 14:35:39,278 EPOCH 91 done: loss 0.0296 - lr 0.050000
2023-04-06 14:35:39,278 BAD EPOCHS (no improvement): 0
2023-04-06 14:35:39,281 ----------------------------------------------------------------------------------------------------
2023-04-06 14:36:42,207 epoch 92 - iter 372/3720 - loss 0.03000054 - time (sec): 62.93 - samples/sec: 5009.41 - lr: 0.050000
2023-04-06 14:37:46,192 epoch 92 - iter 744/3720 - loss 0.03046749 - time (sec): 126.91 - samples/sec: 4947.57 - lr: 0.050000
2023-04-06 14:38:50,601 epoch 92 - iter 1116/3720 - loss 0.03014072 - time (sec): 191.32 - samples/sec: 4938.82 - lr: 0.050000
2023-04-06 14:39:54,783 epoch 92 - iter 1488/3720 - loss 0.03012977 - time (sec): 255.50 - samples/sec: 4934.32 - lr: 0.050000
2023-04-06 14:40:58,658 epoch 92 - iter 1860/3720 - loss 0.03005794 - time (sec): 319.38 - samples/sec: 4933.31 - lr: 0.050000
2023-04-06 14:42:01,560 epoch 92 - iter 2232/3720 - loss 0.03000857 - time (sec): 382.28 - samples/sec: 4937.17 - lr: 0.050000
2023-04-06 14:43:05,835 epoch 92 - iter 2604/3720 - loss 0.02985341 - time (sec): 446.55 - samples/sec: 4938.38 - lr: 0.050000
2023-04-06 14:44:11,044 epoch 92 - iter 2976/3720 - loss 0.02979932 - time (sec): 511.76 - samples/sec: 4931.23 - lr: 0.050000
2023-04-06 14:45:14,862 epoch 92 - iter 3348/3720 - loss 0.02979041 - time (sec): 575.58 - samples/sec: 4931.62 - lr: 0.050000
2023-04-06 14:46:18,753 epoch 92 - iter 3720/3720 - loss 0.02961658 - time (sec): 639.47 - samples/sec: 4928.61 - lr: 0.050000
2023-04-06 14:46:18,754 ----------------------------------------------------------------------------------------------------
2023-04-06 14:46:18,754 EPOCH 92 done: loss 0.0296 - lr 0.050000
2023-04-06 14:46:18,754 BAD EPOCHS (no improvement): 1
2023-04-06 14:46:18,757 ----------------------------------------------------------------------------------------------------
2023-04-06 14:47:22,362 epoch 93 - iter 372/3720 - loss 0.02949013 - time (sec): 63.61 - samples/sec: 4934.02 - lr: 0.050000
2023-04-06 14:48:27,004 epoch 93 - iter 744/3720 - loss 0.02954477 - time (sec): 128.25 - samples/sec: 4908.90 - lr: 0.050000
2023-04-06 14:49:31,369 epoch 93 - iter 1116/3720 - loss 0.02954577 - time (sec): 192.61 - samples/sec: 4900.16 - lr: 0.050000
2023-04-06 14:50:32,931 epoch 93 - iter 1488/3720 - loss 0.02958463 - time (sec): 254.17 - samples/sec: 4945.72 - lr: 0.050000
2023-04-06 14:51:35,509 epoch 93 - iter 1860/3720 - loss 0.02962799 - time (sec): 316.75 - samples/sec: 4967.73 - lr: 0.050000
2023-04-06 14:52:38,620 epoch 93 - iter 2232/3720 - loss 0.02969660 - time (sec): 379.86 - samples/sec: 4974.26 - lr: 0.050000
2023-04-06 14:53:41,162 epoch 93 - iter 2604/3720 - loss 0.02972034 - time (sec): 442.41 - samples/sec: 4985.21 - lr: 0.050000
2023-04-06 14:54:44,347 epoch 93 - iter 2976/3720 - loss 0.02969130 - time (sec): 505.59 - samples/sec: 4988.07 - lr: 0.050000
2023-04-06 14:55:47,388 epoch 93 - iter 3348/3720 - loss 0.02971463 - time (sec): 568.63 - samples/sec: 4989.28 - lr: 0.050000
2023-04-06 14:56:49,677 epoch 93 - iter 3720/3720 - loss 0.02968347 - time (sec): 630.92 - samples/sec: 4995.42 - lr: 0.050000
2023-04-06 14:56:49,678 ----------------------------------------------------------------------------------------------------
2023-04-06 14:56:49,678 EPOCH 93 done: loss 0.0297 - lr 0.050000
2023-04-06 14:56:49,678 BAD EPOCHS (no improvement): 2
2023-04-06 14:56:49,681 ----------------------------------------------------------------------------------------------------
2023-04-06 14:57:51,883 epoch 94 - iter 372/3720 - loss 0.02924225 - time (sec): 62.20 - samples/sec: 5024.92 - lr: 0.050000
2023-04-06 14:58:55,243 epoch 94 - iter 744/3720 - loss 0.02917742 - time (sec): 125.56 - samples/sec: 5018.20 - lr: 0.050000
2023-04-06 14:59:58,387 epoch 94 - iter 1116/3720 - loss 0.02940144 - time (sec): 188.71 - samples/sec: 5015.68 - lr: 0.050000
2023-04-06 15:01:02,165 epoch 94 - iter 1488/3720 - loss 0.02928628 - time (sec): 252.48 - samples/sec: 4995.52 - lr: 0.050000
2023-04-06 15:02:06,720 epoch 94 - iter 1860/3720 - loss 0.02954478 - time (sec): 317.04 - samples/sec: 4973.97 - lr: 0.050000
2023-04-06 15:03:11,366 epoch 94 - iter 2232/3720 - loss 0.02944295 - time (sec): 381.68 - samples/sec: 4957.13 - lr: 0.050000
2023-04-06 15:04:14,372 epoch 94 - iter 2604/3720 - loss 0.02932206 - time (sec): 444.69 - samples/sec: 4961.61 - lr: 0.050000
2023-04-06 15:05:17,643 epoch 94 - iter 2976/3720 - loss 0.02934208 - time (sec): 507.96 - samples/sec: 4962.62 - lr: 0.050000
2023-04-06 15:06:20,427 epoch 94 - iter 3348/3720 - loss 0.02945435 - time (sec): 570.75 - samples/sec: 4965.75 - lr: 0.050000
2023-04-06 15:07:24,884 epoch 94 - iter 3720/3720 - loss 0.02951776 - time (sec): 635.20 - samples/sec: 4961.74 - lr: 0.050000
2023-04-06 15:07:24,884 ----------------------------------------------------------------------------------------------------
2023-04-06 15:07:24,884 EPOCH 94 done: loss 0.0295 - lr 0.050000
2023-04-06 15:07:24,884 BAD EPOCHS (no improvement): 0
2023-04-06 15:07:24,888 ----------------------------------------------------------------------------------------------------
2023-04-06 15:08:27,171 epoch 95 - iter 372/3720 - loss 0.02936498 - time (sec): 62.28 - samples/sec: 5039.08 - lr: 0.050000
2023-04-06 15:09:30,536 epoch 95 - iter 744/3720 - loss 0.02887280 - time (sec): 125.65 - samples/sec: 5008.77 - lr: 0.050000
2023-04-06 15:10:33,863 epoch 95 - iter 1116/3720 - loss 0.02894426 - time (sec): 188.97 - samples/sec: 5005.54 - lr: 0.050000
2023-04-06 15:11:38,073 epoch 95 - iter 1488/3720 - loss 0.02927178 - time (sec): 253.19 - samples/sec: 4972.78 - lr: 0.050000
2023-04-06 15:12:42,544 epoch 95 - iter 1860/3720 - loss 0.02919982 - time (sec): 317.66 - samples/sec: 4950.72 - lr: 0.050000
2023-04-06 15:13:47,007 epoch 95 - iter 2232/3720 - loss 0.02912063 - time (sec): 382.12 - samples/sec: 4946.09 - lr: 0.050000
2023-04-06 15:14:50,719 epoch 95 - iter 2604/3720 - loss 0.02914057 - time (sec): 445.83 - samples/sec: 4950.06 - lr: 0.050000
2023-04-06 15:15:54,115 epoch 95 - iter 2976/3720 - loss 0.02914541 - time (sec): 509.23 - samples/sec: 4951.91 - lr: 0.050000
2023-04-06 15:16:57,568 epoch 95 - iter 3348/3720 - loss 0.02916951 - time (sec): 572.68 - samples/sec: 4954.18 - lr: 0.050000
2023-04-06 15:18:01,117 epoch 95 - iter 3720/3720 - loss 0.02926558 - time (sec): 636.23 - samples/sec: 4953.74 - lr: 0.050000
2023-04-06 15:18:01,117 ----------------------------------------------------------------------------------------------------
2023-04-06 15:18:01,117 EPOCH 95 done: loss 0.0293 - lr 0.050000
2023-04-06 15:18:01,117 BAD EPOCHS (no improvement): 0
2023-04-06 15:18:01,124 ----------------------------------------------------------------------------------------------------
2023-04-06 15:19:05,315 epoch 96 - iter 372/3720 - loss 0.02967119 - time (sec): 64.19 - samples/sec: 4927.13 - lr: 0.050000
2023-04-06 15:20:09,775 epoch 96 - iter 744/3720 - loss 0.02946338 - time (sec): 128.65 - samples/sec: 4914.75 - lr: 0.050000
2023-04-06 15:21:12,527 epoch 96 - iter 1116/3720 - loss 0.02952667 - time (sec): 191.40 - samples/sec: 4943.59 - lr: 0.050000
2023-04-06 15:22:15,935 epoch 96 - iter 1488/3720 - loss 0.02925585 - time (sec): 254.81 - samples/sec: 4957.78 - lr: 0.050000
2023-04-06 15:23:19,925 epoch 96 - iter 1860/3720 - loss 0.02927701 - time (sec): 318.80 - samples/sec: 4946.03 - lr: 0.050000
2023-04-06 15:24:23,982 epoch 96 - iter 2232/3720 - loss 0.02911695 - time (sec): 382.86 - samples/sec: 4940.62 - lr: 0.050000
2023-04-06 15:25:27,385 epoch 96 - iter 2604/3720 - loss 0.02923169 - time (sec): 446.26 - samples/sec: 4939.68 - lr: 0.050000
2023-04-06 15:26:31,221 epoch 96 - iter 2976/3720 - loss 0.02936122 - time (sec): 510.10 - samples/sec: 4939.62 - lr: 0.050000
2023-04-06 15:27:34,688 epoch 96 - iter 3348/3720 - loss 0.02930831 - time (sec): 573.56 - samples/sec: 4941.02 - lr: 0.050000
2023-04-06 15:28:38,474 epoch 96 - iter 3720/3720 - loss 0.02929948 - time (sec): 637.35 - samples/sec: 4945.03 - lr: 0.050000
2023-04-06 15:28:38,474 ----------------------------------------------------------------------------------------------------
2023-04-06 15:28:38,474 EPOCH 96 done: loss 0.0293 - lr 0.050000
2023-04-06 15:28:38,474 BAD EPOCHS (no improvement): 1
2023-04-06 15:28:38,477 ----------------------------------------------------------------------------------------------------
2023-04-06 15:29:41,896 epoch 97 - iter 372/3720 - loss 0.02846987 - time (sec): 63.42 - samples/sec: 4964.38 - lr: 0.050000
2023-04-06 15:30:45,520 epoch 97 - iter 744/3720 - loss 0.02902019 - time (sec): 127.04 - samples/sec: 4956.62 - lr: 0.050000
2023-04-06 15:31:48,172 epoch 97 - iter 1116/3720 - loss 0.02921452 - time (sec): 189.69 - samples/sec: 4971.69 - lr: 0.050000
2023-04-06 15:32:51,629 epoch 97 - iter 1488/3720 - loss 0.02924840 - time (sec): 253.15 - samples/sec: 4974.23 - lr: 0.050000
2023-04-06 15:33:53,890 epoch 97 - iter 1860/3720 - loss 0.02927003 - time (sec): 315.41 - samples/sec: 4985.37 - lr: 0.050000
2023-04-06 15:34:57,913 epoch 97 - iter 2232/3720 - loss 0.02925115 - time (sec): 379.44 - samples/sec: 4981.77 - lr: 0.050000
2023-04-06 15:36:01,517 epoch 97 - iter 2604/3720 - loss 0.02926837 - time (sec): 443.04 - samples/sec: 4979.62 - lr: 0.050000
2023-04-06 15:37:05,820 epoch 97 - iter 2976/3720 - loss 0.02929307 - time (sec): 507.34 - samples/sec: 4967.57 - lr: 0.050000
2023-04-06 15:38:09,978 epoch 97 - iter 3348/3720 - loss 0.02928203 - time (sec): 571.50 - samples/sec: 4964.43 - lr: 0.050000
2023-04-06 15:39:13,882 epoch 97 - iter 3720/3720 - loss 0.02929182 - time (sec): 635.41 - samples/sec: 4960.16 - lr: 0.050000
2023-04-06 15:39:13,883 ----------------------------------------------------------------------------------------------------
2023-04-06 15:39:13,883 EPOCH 97 done: loss 0.0293 - lr 0.050000
2023-04-06 15:39:13,883 BAD EPOCHS (no improvement): 2
2023-04-06 15:39:13,886 ----------------------------------------------------------------------------------------------------
2023-04-06 15:40:18,041 epoch 98 - iter 372/3720 - loss 0.02915454 - time (sec): 64.16 - samples/sec: 4892.46 - lr: 0.050000
2023-04-06 15:41:21,546 epoch 98 - iter 744/3720 - loss 0.02918277 - time (sec): 127.66 - samples/sec: 4924.41 - lr: 0.050000
2023-04-06 15:42:25,001 epoch 98 - iter 1116/3720 - loss 0.02909367 - time (sec): 191.11 - samples/sec: 4932.07 - lr: 0.050000
2023-04-06 15:43:27,908 epoch 98 - iter 1488/3720 - loss 0.02927396 - time (sec): 254.02 - samples/sec: 4949.97 - lr: 0.050000
2023-04-06 15:44:31,820 epoch 98 - iter 1860/3720 - loss 0.02931938 - time (sec): 317.93 - samples/sec: 4954.28 - lr: 0.050000
2023-04-06 15:45:35,712 epoch 98 - iter 2232/3720 - loss 0.02927944 - time (sec): 381.83 - samples/sec: 4948.43 - lr: 0.050000
2023-04-06 15:46:39,824 epoch 98 - iter 2604/3720 - loss 0.02925248 - time (sec): 445.94 - samples/sec: 4939.69 - lr: 0.050000
2023-04-06 15:47:43,650 epoch 98 - iter 2976/3720 - loss 0.02905909 - time (sec): 509.76 - samples/sec: 4942.76 - lr: 0.050000
2023-04-06 15:48:47,900 epoch 98 - iter 3348/3720 - loss 0.02912807 - time (sec): 574.01 - samples/sec: 4940.40 - lr: 0.050000
2023-04-06 15:49:52,162 epoch 98 - iter 3720/3720 - loss 0.02928766 - time (sec): 638.28 - samples/sec: 4937.85 - lr: 0.050000
2023-04-06 15:49:52,163 ----------------------------------------------------------------------------------------------------
2023-04-06 15:49:52,163 EPOCH 98 done: loss 0.0293 - lr 0.050000
2023-04-06 15:49:52,163 BAD EPOCHS (no improvement): 3
2023-04-06 15:49:52,167 ----------------------------------------------------------------------------------------------------
2023-04-06 15:50:57,014 epoch 99 - iter 372/3720 - loss 0.02846481 - time (sec): 64.85 - samples/sec: 4883.70 - lr: 0.050000
2023-04-06 15:52:01,104 epoch 99 - iter 744/3720 - loss 0.02860222 - time (sec): 128.94 - samples/sec: 4905.84 - lr: 0.050000
2023-04-06 15:53:04,755 epoch 99 - iter 1116/3720 - loss 0.02893728 - time (sec): 192.59 - samples/sec: 4921.85 - lr: 0.050000
2023-04-06 15:54:08,776 epoch 99 - iter 1488/3720 - loss 0.02911058 - time (sec): 256.61 - samples/sec: 4929.96 - lr: 0.050000
2023-04-06 15:55:13,006 epoch 99 - iter 1860/3720 - loss 0.02900851 - time (sec): 320.84 - samples/sec: 4925.42 - lr: 0.050000
2023-04-06 15:56:15,908 epoch 99 - iter 2232/3720 - loss 0.02918873 - time (sec): 383.74 - samples/sec: 4933.64 - lr: 0.050000
2023-04-06 15:57:19,760 epoch 99 - iter 2604/3720 - loss 0.02922697 - time (sec): 447.59 - samples/sec: 4935.96 - lr: 0.050000
2023-04-06 15:58:23,155 epoch 99 - iter 2976/3720 - loss 0.02929975 - time (sec): 510.99 - samples/sec: 4937.80 - lr: 0.050000
2023-04-06 15:59:27,525 epoch 99 - iter 3348/3720 - loss 0.02925750 - time (sec): 575.36 - samples/sec: 4931.22 - lr: 0.050000
2023-04-06 16:00:31,230 epoch 99 - iter 3720/3720 - loss 0.02926185 - time (sec): 639.06 - samples/sec: 4931.77 - lr: 0.050000
2023-04-06 16:00:31,230 ----------------------------------------------------------------------------------------------------
2023-04-06 16:00:31,230 EPOCH 99 done: loss 0.0293 - lr 0.050000
2023-04-06 16:00:31,231 BAD EPOCHS (no improvement): 0
2023-04-06 16:00:31,234 ----------------------------------------------------------------------------------------------------
2023-04-06 16:01:35,409 epoch 100 - iter 372/3720 - loss 0.02926741 - time (sec): 64.17 - samples/sec: 4891.41 - lr: 0.050000
2023-04-06 16:02:38,622 epoch 100 - iter 744/3720 - loss 0.02886811 - time (sec): 127.39 - samples/sec: 4938.32 - lr: 0.050000
2023-04-06 16:03:42,243 epoch 100 - iter 1116/3720 - loss 0.02903872 - time (sec): 191.01 - samples/sec: 4928.43 - lr: 0.050000
2023-04-06 16:04:47,341 epoch 100 - iter 1488/3720 - loss 0.02897199 - time (sec): 256.11 - samples/sec: 4910.73 - lr: 0.050000
2023-04-06 16:05:52,173 epoch 100 - iter 1860/3720 - loss 0.02898875 - time (sec): 320.94 - samples/sec: 4911.30 - lr: 0.050000
2023-04-06 16:06:57,196 epoch 100 - iter 2232/3720 - loss 0.02908536 - time (sec): 385.96 - samples/sec: 4902.11 - lr: 0.050000
2023-04-06 16:08:01,001 epoch 100 - iter 2604/3720 - loss 0.02905924 - time (sec): 449.77 - samples/sec: 4908.45 - lr: 0.050000
2023-04-06 16:09:04,671 epoch 100 - iter 2976/3720 - loss 0.02908040 - time (sec): 513.44 - samples/sec: 4909.42 - lr: 0.050000
2023-04-06 16:10:08,680 epoch 100 - iter 3348/3720 - loss 0.02912874 - time (sec): 577.45 - samples/sec: 4909.75 - lr: 0.050000
2023-04-06 16:11:13,261 epoch 100 - iter 3720/3720 - loss 0.02917659 - time (sec): 642.03 - samples/sec: 4909.01 - lr: 0.050000
2023-04-06 16:11:13,261 ----------------------------------------------------------------------------------------------------
2023-04-06 16:11:13,261 EPOCH 100 done: loss 0.0292 - lr 0.050000
2023-04-06 16:11:13,261 BAD EPOCHS (no improvement): 0
2023-04-06 16:11:13,264 ----------------------------------------------------------------------------------------------------
2023-04-06 16:12:16,908 epoch 101 - iter 372/3720 - loss 0.02868550 - time (sec): 63.64 - samples/sec: 4939.62 - lr: 0.050000
2023-04-06 16:13:19,781 epoch 101 - iter 744/3720 - loss 0.02869803 - time (sec): 126.52 - samples/sec: 4977.98 - lr: 0.050000
2023-04-06 16:14:23,407 epoch 101 - iter 1116/3720 - loss 0.02879305 - time (sec): 190.14 - samples/sec: 4968.58 - lr: 0.050000
2023-04-06 16:15:26,705 epoch 101 - iter 1488/3720 - loss 0.02877063 - time (sec): 253.44 - samples/sec: 4962.77 - lr: 0.050000
2023-04-06 16:16:31,576 epoch 101 - iter 1860/3720 - loss 0.02879079 - time (sec): 318.31 - samples/sec: 4948.32 - lr: 0.050000
2023-04-06 16:17:34,756 epoch 101 - iter 2232/3720 - loss 0.02890731 - time (sec): 381.49 - samples/sec: 4952.85 - lr: 0.050000
2023-04-06 16:18:38,963 epoch 101 - iter 2604/3720 - loss 0.02894713 - time (sec): 445.70 - samples/sec: 4949.01 - lr: 0.050000
2023-04-06 16:19:42,548 epoch 101 - iter 2976/3720 - loss 0.02882801 - time (sec): 509.28 - samples/sec: 4951.83 - lr: 0.050000
2023-04-06 16:20:45,613 epoch 101 - iter 3348/3720 - loss 0.02892971 - time (sec): 572.35 - samples/sec: 4956.44 - lr: 0.050000
2023-04-06 16:21:48,377 epoch 101 - iter 3720/3720 - loss 0.02896954 - time (sec): 635.11 - samples/sec: 4962.44 - lr: 0.050000
2023-04-06 16:21:48,378 ----------------------------------------------------------------------------------------------------
2023-04-06 16:21:48,378 EPOCH 101 done: loss 0.0290 - lr 0.050000
2023-04-06 16:21:48,378 BAD EPOCHS (no improvement): 0
2023-04-06 16:21:48,381 ----------------------------------------------------------------------------------------------------
2023-04-06 16:22:50,215 epoch 102 - iter 372/3720 - loss 0.02881203 - time (sec): 61.83 - samples/sec: 5072.57 - lr: 0.050000
2023-04-06 16:23:52,864 epoch 102 - iter 744/3720 - loss 0.02944004 - time (sec): 124.48 - samples/sec: 5049.06 - lr: 0.050000
2023-04-06 16:24:55,607 epoch 102 - iter 1116/3720 - loss 0.02928611 - time (sec): 187.23 - samples/sec: 5042.76 - lr: 0.050000
2023-04-06 16:25:58,472 epoch 102 - iter 1488/3720 - loss 0.02928647 - time (sec): 250.09 - samples/sec: 5047.97 - lr: 0.050000
2023-04-06 16:27:00,669 epoch 102 - iter 1860/3720 - loss 0.02933067 - time (sec): 312.29 - samples/sec: 5047.54 - lr: 0.050000
2023-04-06 16:28:03,582 epoch 102 - iter 2232/3720 - loss 0.02924074 - time (sec): 375.20 - samples/sec: 5040.48 - lr: 0.050000
2023-04-06 16:29:06,368 epoch 102 - iter 2604/3720 - loss 0.02923811 - time (sec): 437.99 - samples/sec: 5040.36 - lr: 0.050000
2023-04-06 16:30:09,324 epoch 102 - iter 2976/3720 - loss 0.02921632 - time (sec): 500.94 - samples/sec: 5031.45 - lr: 0.050000
2023-04-06 16:31:12,993 epoch 102 - iter 3348/3720 - loss 0.02929971 - time (sec): 564.61 - samples/sec: 5020.45 - lr: 0.050000
2023-04-06 16:32:17,716 epoch 102 - iter 3720/3720 - loss 0.02929986 - time (sec): 629.34 - samples/sec: 5008.00 - lr: 0.050000
2023-04-06 16:32:17,717 ----------------------------------------------------------------------------------------------------
2023-04-06 16:32:17,717 EPOCH 102 done: loss 0.0293 - lr 0.050000
2023-04-06 16:32:17,717 BAD EPOCHS (no improvement): 1
2023-04-06 16:32:17,720 ----------------------------------------------------------------------------------------------------
2023-04-06 16:33:21,203 epoch 103 - iter 372/3720 - loss 0.02976637 - time (sec): 63.48 - samples/sec: 4940.11 - lr: 0.050000
2023-04-06 16:34:26,238 epoch 103 - iter 744/3720 - loss 0.02902686 - time (sec): 128.52 - samples/sec: 4916.39 - lr: 0.050000
2023-04-06 16:35:30,502 epoch 103 - iter 1116/3720 - loss 0.02884190 - time (sec): 192.78 - samples/sec: 4923.55 - lr: 0.050000
2023-04-06 16:36:34,348 epoch 103 - iter 1488/3720 - loss 0.02859953 - time (sec): 256.63 - samples/sec: 4923.70 - lr: 0.050000
2023-04-06 16:37:39,158 epoch 103 - iter 1860/3720 - loss 0.02871555 - time (sec): 321.44 - samples/sec: 4916.14 - lr: 0.050000
2023-04-06 16:38:43,747 epoch 103 - iter 2232/3720 - loss 0.02871976 - time (sec): 386.03 - samples/sec: 4915.45 - lr: 0.050000
2023-04-06 16:39:46,948 epoch 103 - iter 2604/3720 - loss 0.02895213 - time (sec): 449.23 - samples/sec: 4918.80 - lr: 0.050000
2023-04-06 16:40:51,473 epoch 103 - iter 2976/3720 - loss 0.02903915 - time (sec): 513.75 - samples/sec: 4910.60 - lr: 0.050000
2023-04-06 16:41:56,334 epoch 103 - iter 3348/3720 - loss 0.02908869 - time (sec): 578.61 - samples/sec: 4905.48 - lr: 0.050000
2023-04-06 16:42:59,655 epoch 103 - iter 3720/3720 - loss 0.02912004 - time (sec): 641.93 - samples/sec: 4909.71 - lr: 0.050000
2023-04-06 16:42:59,655 ----------------------------------------------------------------------------------------------------
2023-04-06 16:42:59,655 EPOCH 103 done: loss 0.0291 - lr 0.050000
2023-04-06 16:42:59,655 BAD EPOCHS (no improvement): 2
2023-04-06 16:42:59,659 ----------------------------------------------------------------------------------------------------
2023-04-06 16:44:03,164 epoch 104 - iter 372/3720 - loss 0.02794056 - time (sec): 63.51 - samples/sec: 4940.25 - lr: 0.050000
2023-04-06 16:45:07,071 epoch 104 - iter 744/3720 - loss 0.02836150 - time (sec): 127.41 - samples/sec: 4938.79 - lr: 0.050000
2023-04-06 16:46:11,702 epoch 104 - iter 1116/3720 - loss 0.02825259 - time (sec): 192.04 - samples/sec: 4924.08 - lr: 0.050000
2023-04-06 16:47:16,459 epoch 104 - iter 1488/3720 - loss 0.02828719 - time (sec): 256.80 - samples/sec: 4919.51 - lr: 0.050000
2023-04-06 16:48:20,799 epoch 104 - iter 1860/3720 - loss 0.02850305 - time (sec): 321.14 - samples/sec: 4920.31 - lr: 0.050000
2023-04-06 16:49:23,570 epoch 104 - iter 2232/3720 - loss 0.02876834 - time (sec): 383.91 - samples/sec: 4932.68 - lr: 0.050000
2023-04-06 16:50:26,233 epoch 104 - iter 2604/3720 - loss 0.02885298 - time (sec): 446.57 - samples/sec: 4941.14 - lr: 0.050000
2023-04-06 16:51:28,962 epoch 104 - iter 2976/3720 - loss 0.02894524 - time (sec): 509.30 - samples/sec: 4952.59 - lr: 0.050000
2023-04-06 16:52:31,609 epoch 104 - iter 3348/3720 - loss 0.02896460 - time (sec): 571.95 - samples/sec: 4960.15 - lr: 0.050000
2023-04-06 16:53:34,519 epoch 104 - iter 3720/3720 - loss 0.02897805 - time (sec): 634.86 - samples/sec: 4964.42 - lr: 0.050000
2023-04-06 16:53:34,519 ----------------------------------------------------------------------------------------------------
2023-04-06 16:53:34,519 EPOCH 104 done: loss 0.0290 - lr 0.050000
2023-04-06 16:53:34,519 BAD EPOCHS (no improvement): 3
2023-04-06 16:53:34,522 ----------------------------------------------------------------------------------------------------
2023-04-06 16:54:37,908 epoch 105 - iter 372/3720 - loss 0.02848474 - time (sec): 63.39 - samples/sec: 5004.69 - lr: 0.050000
2023-04-06 16:55:41,910 epoch 105 - iter 744/3720 - loss 0.02891930 - time (sec): 127.39 - samples/sec: 4959.30 - lr: 0.050000
2023-04-06 16:56:45,614 epoch 105 - iter 1116/3720 - loss 0.02886648 - time (sec): 191.09 - samples/sec: 4952.09 - lr: 0.050000
2023-04-06 16:57:49,847 epoch 105 - iter 1488/3720 - loss 0.02888613 - time (sec): 255.32 - samples/sec: 4943.68 - lr: 0.050000
2023-04-06 16:58:53,006 epoch 105 - iter 1860/3720 - loss 0.02877241 - time (sec): 318.48 - samples/sec: 4952.26 - lr: 0.050000
2023-04-06 16:59:55,569 epoch 105 - iter 2232/3720 - loss 0.02866175 - time (sec): 381.05 - samples/sec: 4960.30 - lr: 0.050000
2023-04-06 17:01:00,023 epoch 105 - iter 2604/3720 - loss 0.02864414 - time (sec): 445.50 - samples/sec: 4957.09 - lr: 0.050000
2023-04-06 17:02:03,781 epoch 105 - iter 2976/3720 - loss 0.02863500 - time (sec): 509.26 - samples/sec: 4953.44 - lr: 0.050000
2023-04-06 17:03:08,218 epoch 105 - iter 3348/3720 - loss 0.02883981 - time (sec): 573.70 - samples/sec: 4948.87 - lr: 0.050000
2023-04-06 17:04:11,931 epoch 105 - iter 3720/3720 - loss 0.02885578 - time (sec): 637.41 - samples/sec: 4944.57 - lr: 0.050000
2023-04-06 17:04:11,931 ----------------------------------------------------------------------------------------------------
2023-04-06 17:04:11,931 EPOCH 105 done: loss 0.0289 - lr 0.050000
2023-04-06 17:04:11,931 BAD EPOCHS (no improvement): 0
2023-04-06 17:04:11,934 ----------------------------------------------------------------------------------------------------
2023-04-06 17:05:16,709 epoch 106 - iter 372/3720 - loss 0.02956074 - time (sec): 64.77 - samples/sec: 4910.20 - lr: 0.050000
2023-04-06 17:06:20,970 epoch 106 - iter 744/3720 - loss 0.02964768 - time (sec): 129.04 - samples/sec: 4902.56 - lr: 0.050000
2023-04-06 17:07:24,282 epoch 106 - iter 1116/3720 - loss 0.02933871 - time (sec): 192.35 - samples/sec: 4921.90 - lr: 0.050000
2023-04-06 17:08:27,831 epoch 106 - iter 1488/3720 - loss 0.02917584 - time (sec): 255.90 - samples/sec: 4936.52 - lr: 0.050000
2023-04-06 17:09:31,094 epoch 106 - iter 1860/3720 - loss 0.02921854 - time (sec): 319.16 - samples/sec: 4944.81 - lr: 0.050000
2023-04-06 17:10:35,149 epoch 106 - iter 2232/3720 - loss 0.02913248 - time (sec): 383.21 - samples/sec: 4944.33 - lr: 0.050000
2023-04-06 17:11:38,271 epoch 106 - iter 2604/3720 - loss 0.02899441 - time (sec): 446.34 - samples/sec: 4948.91 - lr: 0.050000
2023-04-06 17:12:41,927 epoch 106 - iter 2976/3720 - loss 0.02900669 - time (sec): 509.99 - samples/sec: 4948.51 - lr: 0.050000
2023-04-06 17:13:45,552 epoch 106 - iter 3348/3720 - loss 0.02900425 - time (sec): 573.62 - samples/sec: 4948.92 - lr: 0.050000
2023-04-06 17:14:49,765 epoch 106 - iter 3720/3720 - loss 0.02890924 - time (sec): 637.83 - samples/sec: 4941.30 - lr: 0.050000
2023-04-06 17:14:49,765 ----------------------------------------------------------------------------------------------------
2023-04-06 17:14:49,765 EPOCH 106 done: loss 0.0289 - lr 0.050000
2023-04-06 17:14:49,765 BAD EPOCHS (no improvement): 1
2023-04-06 17:14:49,768 ----------------------------------------------------------------------------------------------------
2023-04-06 17:15:53,842 epoch 107 - iter 372/3720 - loss 0.02800491 - time (sec): 64.07 - samples/sec: 4922.91 - lr: 0.050000
2023-04-06 17:16:57,412 epoch 107 - iter 744/3720 - loss 0.02823637 - time (sec): 127.64 - samples/sec: 4953.94 - lr: 0.050000
2023-04-06 17:18:01,167 epoch 107 - iter 1116/3720 - loss 0.02861287 - time (sec): 191.40 - samples/sec: 4941.07 - lr: 0.050000
2023-04-06 17:19:04,321 epoch 107 - iter 1488/3720 - loss 0.02881320 - time (sec): 254.55 - samples/sec: 4950.35 - lr: 0.050000
2023-04-06 17:20:07,174 epoch 107 - iter 1860/3720 - loss 0.02872406 - time (sec): 317.41 - samples/sec: 4970.37 - lr: 0.050000
2023-04-06 17:21:08,908 epoch 107 - iter 2232/3720 - loss 0.02882268 - time (sec): 379.14 - samples/sec: 4983.42 - lr: 0.050000
2023-04-06 17:22:12,336 epoch 107 - iter 2604/3720 - loss 0.02883334 - time (sec): 442.57 - samples/sec: 4990.61 - lr: 0.050000
2023-04-06 17:23:14,935 epoch 107 - iter 2976/3720 - loss 0.02889807 - time (sec): 505.17 - samples/sec: 4997.02 - lr: 0.050000
2023-04-06 17:24:18,198 epoch 107 - iter 3348/3720 - loss 0.02887444 - time (sec): 568.43 - samples/sec: 4996.30 - lr: 0.050000
2023-04-06 17:25:20,301 epoch 107 - iter 3720/3720 - loss 0.02885701 - time (sec): 630.53 - samples/sec: 4998.50 - lr: 0.050000
2023-04-06 17:25:20,301 ----------------------------------------------------------------------------------------------------
2023-04-06 17:25:20,301 EPOCH 107 done: loss 0.0289 - lr 0.050000
2023-04-06 17:25:20,301 BAD EPOCHS (no improvement): 2
2023-04-06 17:25:20,304 ----------------------------------------------------------------------------------------------------
2023-04-06 17:26:24,279 epoch 108 - iter 372/3720 - loss 0.02834232 - time (sec): 63.97 - samples/sec: 4943.86 - lr: 0.050000
2023-04-06 17:27:28,148 epoch 108 - iter 744/3720 - loss 0.02839086 - time (sec): 127.84 - samples/sec: 4948.53 - lr: 0.050000
2023-04-06 17:28:30,736 epoch 108 - iter 1116/3720 - loss 0.02874352 - time (sec): 190.43 - samples/sec: 4975.68 - lr: 0.050000
2023-04-06 17:29:33,820 epoch 108 - iter 1488/3720 - loss 0.02865885 - time (sec): 253.52 - samples/sec: 4973.58 - lr: 0.050000
2023-04-06 17:30:37,432 epoch 108 - iter 1860/3720 - loss 0.02861584 - time (sec): 317.13 - samples/sec: 4974.93 - lr: 0.050000
2023-04-06 17:31:40,947 epoch 108 - iter 2232/3720 - loss 0.02873260 - time (sec): 380.64 - samples/sec: 4970.76 - lr: 0.050000
2023-04-06 17:32:45,352 epoch 108 - iter 2604/3720 - loss 0.02887913 - time (sec): 445.05 - samples/sec: 4959.96 - lr: 0.050000
2023-04-06 17:33:48,857 epoch 108 - iter 2976/3720 - loss 0.02884463 - time (sec): 508.55 - samples/sec: 4962.67 - lr: 0.050000
2023-04-06 17:34:52,541 epoch 108 - iter 3348/3720 - loss 0.02882512 - time (sec): 572.24 - samples/sec: 4960.26 - lr: 0.050000
2023-04-06 17:35:56,493 epoch 108 - iter 3720/3720 - loss 0.02876946 - time (sec): 636.19 - samples/sec: 4954.05 - lr: 0.050000
2023-04-06 17:35:56,493 ----------------------------------------------------------------------------------------------------
2023-04-06 17:35:56,493 EPOCH 108 done: loss 0.0288 - lr 0.050000
2023-04-06 17:35:56,493 BAD EPOCHS (no improvement): 0
2023-04-06 17:35:56,497 ----------------------------------------------------------------------------------------------------
2023-04-06 17:37:00,572 epoch 109 - iter 372/3720 - loss 0.02841398 - time (sec): 64.08 - samples/sec: 4929.69 - lr: 0.050000
2023-04-06 17:38:04,019 epoch 109 - iter 744/3720 - loss 0.02850382 - time (sec): 127.52 - samples/sec: 4930.02 - lr: 0.050000
2023-04-06 17:39:07,991 epoch 109 - iter 1116/3720 - loss 0.02852396 - time (sec): 191.49 - samples/sec: 4924.26 - lr: 0.050000
2023-04-06 17:40:11,467 epoch 109 - iter 1488/3720 - loss 0.02871293 - time (sec): 254.97 - samples/sec: 4932.80 - lr: 0.050000
2023-04-06 17:41:15,598 epoch 109 - iter 1860/3720 - loss 0.02880669 - time (sec): 319.10 - samples/sec: 4924.26 - lr: 0.050000
2023-04-06 17:42:19,664 epoch 109 - iter 2232/3720 - loss 0.02881068 - time (sec): 383.17 - samples/sec: 4923.89 - lr: 0.050000
2023-04-06 17:43:24,228 epoch 109 - iter 2604/3720 - loss 0.02874266 - time (sec): 447.73 - samples/sec: 4923.06 - lr: 0.050000
2023-04-06 17:44:28,013 epoch 109 - iter 2976/3720 - loss 0.02866993 - time (sec): 511.52 - samples/sec: 4926.17 - lr: 0.050000
2023-04-06 17:45:32,387 epoch 109 - iter 3348/3720 - loss 0.02868024 - time (sec): 575.89 - samples/sec: 4924.73 - lr: 0.050000
2023-04-06 17:46:35,281 epoch 109 - iter 3720/3720 - loss 0.02876157 - time (sec): 638.78 - samples/sec: 4933.93 - lr: 0.050000
2023-04-06 17:46:35,281 ----------------------------------------------------------------------------------------------------
2023-04-06 17:46:35,281 EPOCH 109 done: loss 0.0288 - lr 0.050000
2023-04-06 17:46:35,281 BAD EPOCHS (no improvement): 0
2023-04-06 17:46:35,284 ----------------------------------------------------------------------------------------------------
2023-04-06 17:47:38,550 epoch 110 - iter 372/3720 - loss 0.02880920 - time (sec): 63.27 - samples/sec: 4986.03 - lr: 0.050000
2023-04-06 17:48:41,087 epoch 110 - iter 744/3720 - loss 0.02851942 - time (sec): 125.80 - samples/sec: 5023.90 - lr: 0.050000
2023-04-06 17:49:43,649 epoch 110 - iter 1116/3720 - loss 0.02866770 - time (sec): 188.37 - samples/sec: 5031.15 - lr: 0.050000
2023-04-06 17:50:47,594 epoch 110 - iter 1488/3720 - loss 0.02865069 - time (sec): 252.31 - samples/sec: 5007.43 - lr: 0.050000
2023-04-06 17:51:51,188 epoch 110 - iter 1860/3720 - loss 0.02875658 - time (sec): 315.90 - samples/sec: 4995.56 - lr: 0.050000
2023-04-06 17:52:54,789 epoch 110 - iter 2232/3720 - loss 0.02890267 - time (sec): 379.51 - samples/sec: 4987.81 - lr: 0.050000
2023-04-06 17:53:58,537 epoch 110 - iter 2604/3720 - loss 0.02886673 - time (sec): 443.25 - samples/sec: 4982.66 - lr: 0.050000
2023-04-06 17:55:01,522 epoch 110 - iter 2976/3720 - loss 0.02893921 - time (sec): 506.24 - samples/sec: 4979.73 - lr: 0.050000
2023-04-06 17:56:04,715 epoch 110 - iter 3348/3720 - loss 0.02887197 - time (sec): 569.43 - samples/sec: 4978.46 - lr: 0.050000
2023-04-06 17:57:08,428 epoch 110 - iter 3720/3720 - loss 0.02879770 - time (sec): 633.14 - samples/sec: 4977.88 - lr: 0.050000
2023-04-06 17:57:08,429 ----------------------------------------------------------------------------------------------------
2023-04-06 17:57:08,429 EPOCH 110 done: loss 0.0288 - lr 0.050000
2023-04-06 17:57:08,429 BAD EPOCHS (no improvement): 1
2023-04-06 17:57:08,432 ----------------------------------------------------------------------------------------------------
2023-04-06 17:58:12,516 epoch 111 - iter 372/3720 - loss 0.02936568 - time (sec): 64.08 - samples/sec: 4901.69 - lr: 0.050000
2023-04-06 17:59:16,303 epoch 111 - iter 744/3720 - loss 0.02922799 - time (sec): 127.87 - samples/sec: 4923.93 - lr: 0.050000
2023-04-06 18:00:19,984 epoch 111 - iter 1116/3720 - loss 0.02903980 - time (sec): 191.55 - samples/sec: 4933.88 - lr: 0.050000
2023-04-06 18:01:23,853 epoch 111 - iter 1488/3720 - loss 0.02886305 - time (sec): 255.42 - samples/sec: 4937.56 - lr: 0.050000
2023-04-06 18:02:26,911 epoch 111 - iter 1860/3720 - loss 0.02869059 - time (sec): 318.48 - samples/sec: 4949.74 - lr: 0.050000
2023-04-06 18:03:30,898 epoch 111 - iter 2232/3720 - loss 0.02861122 - time (sec): 382.47 - samples/sec: 4949.67 - lr: 0.050000
2023-04-06 18:04:34,183 epoch 111 - iter 2604/3720 - loss 0.02857775 - time (sec): 445.75 - samples/sec: 4957.08 - lr: 0.050000
2023-04-06 18:05:37,359 epoch 111 - iter 2976/3720 - loss 0.02851697 - time (sec): 508.93 - samples/sec: 4959.08 - lr: 0.050000
2023-04-06 18:06:40,417 epoch 111 - iter 3348/3720 - loss 0.02854522 - time (sec): 571.99 - samples/sec: 4962.24 - lr: 0.050000
2023-04-06 18:07:43,144 epoch 111 - iter 3720/3720 - loss 0.02853782 - time (sec): 634.71 - samples/sec: 4965.58 - lr: 0.050000
2023-04-06 18:07:43,144 ----------------------------------------------------------------------------------------------------
2023-04-06 18:07:43,144 EPOCH 111 done: loss 0.0285 - lr 0.050000
2023-04-06 18:07:43,144 BAD EPOCHS (no improvement): 0
2023-04-06 18:07:43,148 ----------------------------------------------------------------------------------------------------
2023-04-06 18:08:46,599 epoch 112 - iter 372/3720 - loss 0.02813126 - time (sec): 63.45 - samples/sec: 4954.53 - lr: 0.050000
2023-04-06 18:09:50,216 epoch 112 - iter 744/3720 - loss 0.02845559 - time (sec): 127.07 - samples/sec: 4945.79 - lr: 0.050000
2023-04-06 18:10:53,807 epoch 112 - iter 1116/3720 - loss 0.02883468 - time (sec): 190.66 - samples/sec: 4958.64 - lr: 0.050000
2023-04-06 18:11:56,054 epoch 112 - iter 1488/3720 - loss 0.02885031 - time (sec): 252.91 - samples/sec: 4981.53 - lr: 0.050000
2023-04-06 18:12:59,597 epoch 112 - iter 1860/3720 - loss 0.02889557 - time (sec): 316.45 - samples/sec: 4978.80 - lr: 0.050000
2023-04-06 18:14:02,666 epoch 112 - iter 2232/3720 - loss 0.02903618 - time (sec): 379.52 - samples/sec: 4981.14 - lr: 0.050000
2023-04-06 18:15:06,074 epoch 112 - iter 2604/3720 - loss 0.02892502 - time (sec): 442.93 - samples/sec: 4979.15 - lr: 0.050000
2023-04-06 18:16:10,092 epoch 112 - iter 2976/3720 - loss 0.02892389 - time (sec): 506.94 - samples/sec: 4973.78 - lr: 0.050000
2023-04-06 18:17:12,681 epoch 112 - iter 3348/3720 - loss 0.02894698 - time (sec): 569.53 - samples/sec: 4977.99 - lr: 0.050000
2023-04-06 18:18:15,880 epoch 112 - iter 3720/3720 - loss 0.02887480 - time (sec): 632.73 - samples/sec: 4981.11 - lr: 0.050000
2023-04-06 18:18:15,880 ----------------------------------------------------------------------------------------------------
2023-04-06 18:18:15,880 EPOCH 112 done: loss 0.0289 - lr 0.050000
2023-04-06 18:18:15,881 BAD EPOCHS (no improvement): 1
2023-04-06 18:18:15,884 ----------------------------------------------------------------------------------------------------
2023-04-06 18:19:18,187 epoch 113 - iter 372/3720 - loss 0.02880723 - time (sec): 62.30 - samples/sec: 5060.44 - lr: 0.050000
2023-04-06 18:20:20,784 epoch 113 - iter 744/3720 - loss 0.02876553 - time (sec): 124.90 - samples/sec: 5052.58 - lr: 0.050000
2023-04-06 18:21:23,750 epoch 113 - iter 1116/3720 - loss 0.02897645 - time (sec): 187.87 - samples/sec: 5034.04 - lr: 0.050000
2023-04-06 18:22:27,003 epoch 113 - iter 1488/3720 - loss 0.02887117 - time (sec): 251.12 - samples/sec: 5011.98 - lr: 0.050000
2023-04-06 18:23:31,298 epoch 113 - iter 1860/3720 - loss 0.02877149 - time (sec): 315.41 - samples/sec: 4997.95 - lr: 0.050000
2023-04-06 18:24:35,282 epoch 113 - iter 2232/3720 - loss 0.02885075 - time (sec): 379.40 - samples/sec: 4984.55 - lr: 0.050000
2023-04-06 18:25:39,092 epoch 113 - iter 2604/3720 - loss 0.02885398 - time (sec): 443.21 - samples/sec: 4976.09 - lr: 0.050000
2023-04-06 18:26:43,751 epoch 113 - iter 2976/3720 - loss 0.02884556 - time (sec): 507.87 - samples/sec: 4962.23 - lr: 0.050000
2023-04-06 18:27:47,317 epoch 113 - iter 3348/3720 - loss 0.02883566 - time (sec): 571.43 - samples/sec: 4964.05 - lr: 0.050000
2023-04-06 18:28:51,157 epoch 113 - iter 3720/3720 - loss 0.02883942 - time (sec): 635.27 - samples/sec: 4961.20 - lr: 0.050000
2023-04-06 18:28:51,157 ----------------------------------------------------------------------------------------------------
2023-04-06 18:28:51,157 EPOCH 113 done: loss 0.0288 - lr 0.050000
2023-04-06 18:28:51,157 BAD EPOCHS (no improvement): 2
2023-04-06 18:28:51,160 ----------------------------------------------------------------------------------------------------
2023-04-06 18:29:54,191 epoch 114 - iter 372/3720 - loss 0.02825922 - time (sec): 63.03 - samples/sec: 4952.07 - lr: 0.050000
2023-04-06 18:30:57,550 epoch 114 - iter 744/3720 - loss 0.02827768 - time (sec): 126.39 - samples/sec: 4955.43 - lr: 0.050000
2023-04-06 18:32:02,440 epoch 114 - iter 1116/3720 - loss 0.02832827 - time (sec): 191.28 - samples/sec: 4940.26 - lr: 0.050000
2023-04-06 18:33:07,304 epoch 114 - iter 1488/3720 - loss 0.02853831 - time (sec): 256.14 - samples/sec: 4926.31 - lr: 0.050000
2023-04-06 18:34:11,156 epoch 114 - iter 1860/3720 - loss 0.02855166 - time (sec): 320.00 - samples/sec: 4933.45 - lr: 0.050000
2023-04-06 18:35:13,342 epoch 114 - iter 2232/3720 - loss 0.02859716 - time (sec): 382.18 - samples/sec: 4954.20 - lr: 0.050000
2023-04-06 18:36:16,232 epoch 114 - iter 2604/3720 - loss 0.02862590 - time (sec): 445.07 - samples/sec: 4962.55 - lr: 0.050000
2023-04-06 18:37:19,883 epoch 114 - iter 2976/3720 - loss 0.02860909 - time (sec): 508.72 - samples/sec: 4962.97 - lr: 0.050000
2023-04-06 18:38:23,770 epoch 114 - iter 3348/3720 - loss 0.02871705 - time (sec): 572.61 - samples/sec: 4956.31 - lr: 0.050000
2023-04-06 18:39:26,389 epoch 114 - iter 3720/3720 - loss 0.02871430 - time (sec): 635.23 - samples/sec: 4961.54 - lr: 0.050000
2023-04-06 18:39:26,389 ----------------------------------------------------------------------------------------------------
2023-04-06 18:39:26,389 EPOCH 114 done: loss 0.0287 - lr 0.050000
2023-04-06 18:39:26,389 BAD EPOCHS (no improvement): 3
2023-04-06 18:39:26,392 ----------------------------------------------------------------------------------------------------
2023-04-06 18:40:29,860 epoch 115 - iter 372/3720 - loss 0.02911821 - time (sec): 63.47 - samples/sec: 4927.30 - lr: 0.050000
2023-04-06 18:41:33,450 epoch 115 - iter 744/3720 - loss 0.02882231 - time (sec): 127.06 - samples/sec: 4941.74 - lr: 0.050000
2023-04-06 18:42:37,355 epoch 115 - iter 1116/3720 - loss 0.02867095 - time (sec): 190.96 - samples/sec: 4947.90 - lr: 0.050000
2023-04-06 18:43:41,391 epoch 115 - iter 1488/3720 - loss 0.02861315 - time (sec): 255.00 - samples/sec: 4942.20 - lr: 0.050000
2023-04-06 18:44:44,772 epoch 115 - iter 1860/3720 - loss 0.02848779 - time (sec): 318.38 - samples/sec: 4944.27 - lr: 0.050000
2023-04-06 18:45:49,487 epoch 115 - iter 2232/3720 - loss 0.02855081 - time (sec): 383.09 - samples/sec: 4935.84 - lr: 0.050000
2023-04-06 18:46:53,533 epoch 115 - iter 2604/3720 - loss 0.02856081 - time (sec): 447.14 - samples/sec: 4935.65 - lr: 0.050000
2023-04-06 18:47:56,484 epoch 115 - iter 2976/3720 - loss 0.02863237 - time (sec): 510.09 - samples/sec: 4944.03 - lr: 0.050000
2023-04-06 18:49:00,712 epoch 115 - iter 3348/3720 - loss 0.02869631 - time (sec): 574.32 - samples/sec: 4939.07 - lr: 0.050000
2023-04-06 18:50:04,554 epoch 115 - iter 3720/3720 - loss 0.02869026 - time (sec): 638.16 - samples/sec: 4938.74 - lr: 0.050000
2023-04-06 18:50:04,554 ----------------------------------------------------------------------------------------------------
2023-04-06 18:50:04,554 EPOCH 115 done: loss 0.0287 - lr 0.050000
2023-04-06 18:50:04,554 Epoch 115: reducing learning rate of group 0 to 2.5000e-02.
2023-04-06 18:50:04,554 BAD EPOCHS (no improvement): 4
2023-04-06 18:50:04,557 ----------------------------------------------------------------------------------------------------
2023-04-06 18:51:07,957 epoch 116 - iter 372/3720 - loss 0.02821545 - time (sec): 63.40 - samples/sec: 4960.45 - lr: 0.025000
2023-04-06 18:52:12,265 epoch 116 - iter 744/3720 - loss 0.02796080 - time (sec): 127.71 - samples/sec: 4930.76 - lr: 0.025000
2023-04-06 18:53:16,249 epoch 116 - iter 1116/3720 - loss 0.02784636 - time (sec): 191.69 - samples/sec: 4923.98 - lr: 0.025000
2023-04-06 18:54:20,407 epoch 116 - iter 1488/3720 - loss 0.02793204 - time (sec): 255.85 - samples/sec: 4934.67 - lr: 0.025000
2023-04-06 18:55:24,127 epoch 116 - iter 1860/3720 - loss 0.02790313 - time (sec): 319.57 - samples/sec: 4934.35 - lr: 0.025000
2023-04-06 18:56:26,817 epoch 116 - iter 2232/3720 - loss 0.02773606 - time (sec): 382.26 - samples/sec: 4950.67 - lr: 0.025000
2023-04-06 18:57:29,417 epoch 116 - iter 2604/3720 - loss 0.02764451 - time (sec): 444.86 - samples/sec: 4963.13 - lr: 0.025000
2023-04-06 18:58:31,527 epoch 116 - iter 2976/3720 - loss 0.02765926 - time (sec): 506.97 - samples/sec: 4974.90 - lr: 0.025000
2023-04-06 18:59:34,328 epoch 116 - iter 3348/3720 - loss 0.02768845 - time (sec): 569.77 - samples/sec: 4980.06 - lr: 0.025000
2023-04-06 19:00:36,591 epoch 116 - iter 3720/3720 - loss 0.02772749 - time (sec): 632.03 - samples/sec: 4986.62 - lr: 0.025000
2023-04-06 19:00:36,592 ----------------------------------------------------------------------------------------------------
2023-04-06 19:00:36,592 EPOCH 116 done: loss 0.0277 - lr 0.025000
2023-04-06 19:00:36,592 BAD EPOCHS (no improvement): 0
2023-04-06 19:00:36,595 ----------------------------------------------------------------------------------------------------
2023-04-06 19:01:38,677 epoch 117 - iter 372/3720 - loss 0.02675697 - time (sec): 62.08 - samples/sec: 5051.78 - lr: 0.025000
2023-04-06 19:02:41,867 epoch 117 - iter 744/3720 - loss 0.02663327 - time (sec): 125.27 - samples/sec: 5015.39 - lr: 0.025000
2023-04-06 19:03:45,084 epoch 117 - iter 1116/3720 - loss 0.02706868 - time (sec): 188.49 - samples/sec: 5008.26 - lr: 0.025000
2023-04-06 19:04:48,764 epoch 117 - iter 1488/3720 - loss 0.02706415 - time (sec): 252.17 - samples/sec: 5003.54 - lr: 0.025000
2023-04-06 19:05:50,662 epoch 117 - iter 1860/3720 - loss 0.02719994 - time (sec): 314.07 - samples/sec: 5013.20 - lr: 0.025000
2023-04-06 19:06:53,396 epoch 117 - iter 2232/3720 - loss 0.02713182 - time (sec): 376.80 - samples/sec: 5012.63 - lr: 0.025000
2023-04-06 19:07:57,352 epoch 117 - iter 2604/3720 - loss 0.02707887 - time (sec): 440.76 - samples/sec: 5006.54 - lr: 0.025000
2023-04-06 19:08:59,908 epoch 117 - iter 2976/3720 - loss 0.02710851 - time (sec): 503.31 - samples/sec: 5005.35 - lr: 0.025000
2023-04-06 19:10:02,971 epoch 117 - iter 3348/3720 - loss 0.02721646 - time (sec): 566.38 - samples/sec: 5004.31 - lr: 0.025000
2023-04-06 19:11:07,259 epoch 117 - iter 3720/3720 - loss 0.02713624 - time (sec): 630.66 - samples/sec: 4997.45 - lr: 0.025000
2023-04-06 19:11:07,259 ----------------------------------------------------------------------------------------------------
2023-04-06 19:11:07,259 EPOCH 117 done: loss 0.0271 - lr 0.025000
2023-04-06 19:11:07,259 BAD EPOCHS (no improvement): 0
2023-04-06 19:11:07,263 ----------------------------------------------------------------------------------------------------
2023-04-06 19:12:10,966 epoch 118 - iter 372/3720 - loss 0.02669724 - time (sec): 63.70 - samples/sec: 4937.62 - lr: 0.025000
2023-04-06 19:13:14,278 epoch 118 - iter 744/3720 - loss 0.02721880 - time (sec): 127.02 - samples/sec: 4945.51 - lr: 0.025000
2023-04-06 19:14:19,061 epoch 118 - iter 1116/3720 - loss 0.02706087 - time (sec): 191.80 - samples/sec: 4920.94 - lr: 0.025000
2023-04-06 19:15:21,193 epoch 118 - iter 1488/3720 - loss 0.02716530 - time (sec): 253.93 - samples/sec: 4945.84 - lr: 0.025000
2023-04-06 19:16:25,556 epoch 118 - iter 1860/3720 - loss 0.02716783 - time (sec): 318.29 - samples/sec: 4945.42 - lr: 0.025000
2023-04-06 19:17:29,322 epoch 118 - iter 2232/3720 - loss 0.02706879 - time (sec): 382.06 - samples/sec: 4947.43 - lr: 0.025000
2023-04-06 19:18:34,509 epoch 118 - iter 2604/3720 - loss 0.02708168 - time (sec): 447.25 - samples/sec: 4934.76 - lr: 0.025000
2023-04-06 19:19:38,213 epoch 118 - iter 2976/3720 - loss 0.02703912 - time (sec): 510.95 - samples/sec: 4937.91 - lr: 0.025000
2023-04-06 19:20:41,819 epoch 118 - iter 3348/3720 - loss 0.02707908 - time (sec): 574.56 - samples/sec: 4938.99 - lr: 0.025000
2023-04-06 19:21:45,197 epoch 118 - iter 3720/3720 - loss 0.02702579 - time (sec): 637.93 - samples/sec: 4940.50 - lr: 0.025000
2023-04-06 19:21:45,197 ----------------------------------------------------------------------------------------------------
2023-04-06 19:21:45,197 EPOCH 118 done: loss 0.0270 - lr 0.025000
2023-04-06 19:21:45,197 BAD EPOCHS (no improvement): 0
2023-04-06 19:21:45,200 ----------------------------------------------------------------------------------------------------
2023-04-06 19:22:48,238 epoch 119 - iter 372/3720 - loss 0.02718127 - time (sec): 63.04 - samples/sec: 5003.14 - lr: 0.025000
2023-04-06 19:23:50,912 epoch 119 - iter 744/3720 - loss 0.02726518 - time (sec): 125.71 - samples/sec: 4994.01 - lr: 0.025000
2023-04-06 19:24:54,097 epoch 119 - iter 1116/3720 - loss 0.02701256 - time (sec): 188.90 - samples/sec: 4997.19 - lr: 0.025000
2023-04-06 19:25:56,821 epoch 119 - iter 1488/3720 - loss 0.02697041 - time (sec): 251.62 - samples/sec: 4998.53 - lr: 0.025000
2023-04-06 19:26:58,741 epoch 119 - iter 1860/3720 - loss 0.02697449 - time (sec): 313.54 - samples/sec: 5009.34 - lr: 0.025000
2023-04-06 19:28:02,318 epoch 119 - iter 2232/3720 - loss 0.02707016 - time (sec): 377.12 - samples/sec: 5001.18 - lr: 0.025000
2023-04-06 19:29:05,727 epoch 119 - iter 2604/3720 - loss 0.02701127 - time (sec): 440.53 - samples/sec: 4996.73 - lr: 0.025000
2023-04-06 19:30:10,360 epoch 119 - iter 2976/3720 - loss 0.02697199 - time (sec): 505.16 - samples/sec: 4983.04 - lr: 0.025000
2023-04-06 19:31:14,922 epoch 119 - iter 3348/3720 - loss 0.02700429 - time (sec): 569.72 - samples/sec: 4975.14 - lr: 0.025000
2023-04-06 19:32:19,639 epoch 119 - iter 3720/3720 - loss 0.02696087 - time (sec): 634.44 - samples/sec: 4967.72 - lr: 0.025000
2023-04-06 19:32:19,639 ----------------------------------------------------------------------------------------------------
2023-04-06 19:32:19,639 EPOCH 119 done: loss 0.0270 - lr 0.025000
2023-04-06 19:32:19,639 BAD EPOCHS (no improvement): 0
2023-04-06 19:32:19,642 ----------------------------------------------------------------------------------------------------
2023-04-06 19:33:23,815 epoch 120 - iter 372/3720 - loss 0.02680386 - time (sec): 64.17 - samples/sec: 4922.23 - lr: 0.025000
2023-04-06 19:34:27,141 epoch 120 - iter 744/3720 - loss 0.02695026 - time (sec): 127.50 - samples/sec: 4940.91 - lr: 0.025000
2023-04-06 19:35:29,995 epoch 120 - iter 1116/3720 - loss 0.02687993 - time (sec): 190.35 - samples/sec: 4961.80 - lr: 0.025000
2023-04-06 19:36:34,191 epoch 120 - iter 1488/3720 - loss 0.02712338 - time (sec): 254.55 - samples/sec: 4956.47 - lr: 0.025000
2023-04-06 19:37:38,294 epoch 120 - iter 1860/3720 - loss 0.02702479 - time (sec): 318.65 - samples/sec: 4944.56 - lr: 0.025000
2023-04-06 19:38:41,655 epoch 120 - iter 2232/3720 - loss 0.02690621 - time (sec): 382.01 - samples/sec: 4949.55 - lr: 0.025000
2023-04-06 19:39:45,334 epoch 120 - iter 2604/3720 - loss 0.02694464 - time (sec): 445.69 - samples/sec: 4947.86 - lr: 0.025000
2023-04-06 19:40:48,440 epoch 120 - iter 2976/3720 - loss 0.02695253 - time (sec): 508.80 - samples/sec: 4949.23 - lr: 0.025000
2023-04-06 19:41:52,934 epoch 120 - iter 3348/3720 - loss 0.02695106 - time (sec): 573.29 - samples/sec: 4946.11 - lr: 0.025000
2023-04-06 19:42:56,744 epoch 120 - iter 3720/3720 - loss 0.02691680 - time (sec): 637.10 - samples/sec: 4946.96 - lr: 0.025000
2023-04-06 19:42:56,744 ----------------------------------------------------------------------------------------------------
2023-04-06 19:42:56,744 EPOCH 120 done: loss 0.0269 - lr 0.025000
2023-04-06 19:42:56,744 BAD EPOCHS (no improvement): 0
2023-04-06 19:42:56,747 ----------------------------------------------------------------------------------------------------
2023-04-06 19:44:00,181 epoch 121 - iter 372/3720 - loss 0.02731040 - time (sec): 63.43 - samples/sec: 4975.67 - lr: 0.025000
2023-04-06 19:45:03,502 epoch 121 - iter 744/3720 - loss 0.02731590 - time (sec): 126.76 - samples/sec: 4993.30 - lr: 0.025000
2023-04-06 19:46:08,061 epoch 121 - iter 1116/3720 - loss 0.02689033 - time (sec): 191.31 - samples/sec: 4955.16 - lr: 0.025000
2023-04-06 19:47:12,064 epoch 121 - iter 1488/3720 - loss 0.02672188 - time (sec): 255.32 - samples/sec: 4944.92 - lr: 0.025000
2023-04-06 19:48:15,567 epoch 121 - iter 1860/3720 - loss 0.02666091 - time (sec): 318.82 - samples/sec: 4945.28 - lr: 0.025000
2023-04-06 19:49:19,854 epoch 121 - iter 2232/3720 - loss 0.02663124 - time (sec): 383.11 - samples/sec: 4937.49 - lr: 0.025000
2023-04-06 19:50:23,225 epoch 121 - iter 2604/3720 - loss 0.02671111 - time (sec): 446.48 - samples/sec: 4940.58 - lr: 0.025000
2023-04-06 19:51:26,327 epoch 121 - iter 2976/3720 - loss 0.02669765 - time (sec): 509.58 - samples/sec: 4947.35 - lr: 0.025000
2023-04-06 19:52:29,926 epoch 121 - iter 3348/3720 - loss 0.02670859 - time (sec): 573.18 - samples/sec: 4948.39 - lr: 0.025000
2023-04-06 19:53:33,415 epoch 121 - iter 3720/3720 - loss 0.02670706 - time (sec): 636.67 - samples/sec: 4950.33 - lr: 0.025000
2023-04-06 19:53:33,415 ----------------------------------------------------------------------------------------------------
2023-04-06 19:53:33,415 EPOCH 121 done: loss 0.0267 - lr 0.025000
2023-04-06 19:53:33,415 BAD EPOCHS (no improvement): 0
2023-04-06 19:53:33,419 ----------------------------------------------------------------------------------------------------
2023-04-06 19:54:37,754 epoch 122 - iter 372/3720 - loss 0.02598740 - time (sec): 64.34 - samples/sec: 4905.80 - lr: 0.025000
2023-04-06 19:55:42,681 epoch 122 - iter 744/3720 - loss 0.02656373 - time (sec): 129.26 - samples/sec: 4897.77 - lr: 0.025000
2023-04-06 19:56:45,612 epoch 122 - iter 1116/3720 - loss 0.02656848 - time (sec): 192.19 - samples/sec: 4931.86 - lr: 0.025000
2023-04-06 19:57:48,664 epoch 122 - iter 1488/3720 - loss 0.02669494 - time (sec): 255.25 - samples/sec: 4950.96 - lr: 0.025000
2023-04-06 19:58:52,574 epoch 122 - iter 1860/3720 - loss 0.02665255 - time (sec): 319.16 - samples/sec: 4939.30 - lr: 0.025000
2023-04-06 19:59:55,997 epoch 122 - iter 2232/3720 - loss 0.02664812 - time (sec): 382.58 - samples/sec: 4940.88 - lr: 0.025000
2023-04-06 20:00:59,794 epoch 122 - iter 2604/3720 - loss 0.02669736 - time (sec): 446.38 - samples/sec: 4942.45 - lr: 0.025000
2023-04-06 20:02:03,761 epoch 122 - iter 2976/3720 - loss 0.02676921 - time (sec): 510.34 - samples/sec: 4943.11 - lr: 0.025000
2023-04-06 20:03:07,085 epoch 122 - iter 3348/3720 - loss 0.02673274 - time (sec): 573.67 - samples/sec: 4945.82 - lr: 0.025000
2023-04-06 20:04:10,240 epoch 122 - iter 3720/3720 - loss 0.02670126 - time (sec): 636.82 - samples/sec: 4949.13 - lr: 0.025000
2023-04-06 20:04:10,240 ----------------------------------------------------------------------------------------------------
2023-04-06 20:04:10,240 EPOCH 122 done: loss 0.0267 - lr 0.025000
2023-04-06 20:04:10,240 BAD EPOCHS (no improvement): 0
2023-04-06 20:04:10,245 ----------------------------------------------------------------------------------------------------
2023-04-06 20:05:12,932 epoch 123 - iter 372/3720 - loss 0.02653286 - time (sec): 62.69 - samples/sec: 5036.53 - lr: 0.025000
2023-04-06 20:06:15,120 epoch 123 - iter 744/3720 - loss 0.02689535 - time (sec): 124.88 - samples/sec: 5041.14 - lr: 0.025000
2023-04-06 20:07:18,745 epoch 123 - iter 1116/3720 - loss 0.02664607 - time (sec): 188.50 - samples/sec: 5013.66 - lr: 0.025000
2023-04-06 20:08:22,419 epoch 123 - iter 1488/3720 - loss 0.02660883 - time (sec): 252.17 - samples/sec: 5008.11 - lr: 0.025000
2023-04-06 20:09:25,423 epoch 123 - iter 1860/3720 - loss 0.02656737 - time (sec): 315.18 - samples/sec: 5001.03 - lr: 0.025000
2023-04-06 20:10:29,678 epoch 123 - iter 2232/3720 - loss 0.02659796 - time (sec): 379.43 - samples/sec: 4988.49 - lr: 0.025000
2023-04-06 20:11:32,277 epoch 123 - iter 2604/3720 - loss 0.02662924 - time (sec): 442.03 - samples/sec: 4996.37 - lr: 0.025000
2023-04-06 20:12:35,036 epoch 123 - iter 2976/3720 - loss 0.02662029 - time (sec): 504.79 - samples/sec: 4998.47 - lr: 0.025000
2023-04-06 20:13:38,977 epoch 123 - iter 3348/3720 - loss 0.02656031 - time (sec): 568.73 - samples/sec: 4991.21 - lr: 0.025000
2023-04-06 20:14:42,417 epoch 123 - iter 3720/3720 - loss 0.02662917 - time (sec): 632.17 - samples/sec: 4985.53 - lr: 0.025000
2023-04-06 20:14:42,417 ----------------------------------------------------------------------------------------------------
2023-04-06 20:14:42,417 EPOCH 123 done: loss 0.0266 - lr 0.025000
2023-04-06 20:14:42,417 BAD EPOCHS (no improvement): 0
2023-04-06 20:14:42,420 ----------------------------------------------------------------------------------------------------
2023-04-06 20:15:45,805 epoch 124 - iter 372/3720 - loss 0.02646290 - time (sec): 63.38 - samples/sec: 4951.31 - lr: 0.025000
2023-04-06 20:16:49,805 epoch 124 - iter 744/3720 - loss 0.02640494 - time (sec): 127.38 - samples/sec: 4934.70 - lr: 0.025000
2023-04-06 20:17:53,126 epoch 124 - iter 1116/3720 - loss 0.02628842 - time (sec): 190.71 - samples/sec: 4954.98 - lr: 0.025000
2023-04-06 20:18:55,135 epoch 124 - iter 1488/3720 - loss 0.02636300 - time (sec): 252.71 - samples/sec: 4989.83 - lr: 0.025000
2023-04-06 20:19:56,619 epoch 124 - iter 1860/3720 - loss 0.02630184 - time (sec): 314.20 - samples/sec: 5008.32 - lr: 0.025000
2023-04-06 20:20:59,211 epoch 124 - iter 2232/3720 - loss 0.02626940 - time (sec): 376.79 - samples/sec: 5016.94 - lr: 0.025000
2023-04-06 20:22:01,999 epoch 124 - iter 2604/3720 - loss 0.02638424 - time (sec): 439.58 - samples/sec: 5015.96 - lr: 0.025000
2023-04-06 20:23:04,829 epoch 124 - iter 2976/3720 - loss 0.02649910 - time (sec): 502.41 - samples/sec: 5018.66 - lr: 0.025000
2023-04-06 20:24:07,211 epoch 124 - iter 3348/3720 - loss 0.02653918 - time (sec): 564.79 - samples/sec: 5023.01 - lr: 0.025000
2023-04-06 20:25:11,504 epoch 124 - iter 3720/3720 - loss 0.02663640 - time (sec): 629.08 - samples/sec: 5010.00 - lr: 0.025000
2023-04-06 20:25:11,505 ----------------------------------------------------------------------------------------------------
2023-04-06 20:25:11,505 EPOCH 124 done: loss 0.0266 - lr 0.025000
2023-04-06 20:25:11,505 BAD EPOCHS (no improvement): 1
2023-04-06 20:25:11,508 ----------------------------------------------------------------------------------------------------
2023-04-06 20:26:15,531 epoch 125 - iter 372/3720 - loss 0.02534478 - time (sec): 64.02 - samples/sec: 4932.33 - lr: 0.025000
2023-04-06 20:27:19,395 epoch 125 - iter 744/3720 - loss 0.02581998 - time (sec): 127.89 - samples/sec: 4926.68 - lr: 0.025000
2023-04-06 20:28:23,262 epoch 125 - iter 1116/3720 - loss 0.02624280 - time (sec): 191.75 - samples/sec: 4932.38 - lr: 0.025000
2023-04-06 20:29:26,429 epoch 125 - iter 1488/3720 - loss 0.02628706 - time (sec): 254.92 - samples/sec: 4947.10 - lr: 0.025000
2023-04-06 20:30:30,237 epoch 125 - iter 1860/3720 - loss 0.02624753 - time (sec): 318.73 - samples/sec: 4945.07 - lr: 0.025000
2023-04-06 20:31:34,859 epoch 125 - iter 2232/3720 - loss 0.02628655 - time (sec): 383.35 - samples/sec: 4939.29 - lr: 0.025000
2023-04-06 20:32:38,569 epoch 125 - iter 2604/3720 - loss 0.02633956 - time (sec): 447.06 - samples/sec: 4933.77 - lr: 0.025000
2023-04-06 20:33:42,777 epoch 125 - iter 2976/3720 - loss 0.02634226 - time (sec): 511.27 - samples/sec: 4930.17 - lr: 0.025000
2023-04-06 20:34:46,447 epoch 125 - iter 3348/3720 - loss 0.02642968 - time (sec): 574.94 - samples/sec: 4932.09 - lr: 0.025000
2023-04-06 20:35:51,005 epoch 125 - iter 3720/3720 - loss 0.02641503 - time (sec): 639.50 - samples/sec: 4928.42 - lr: 0.025000
2023-04-06 20:35:51,005 ----------------------------------------------------------------------------------------------------
2023-04-06 20:35:51,005 EPOCH 125 done: loss 0.0264 - lr 0.025000
2023-04-06 20:35:51,005 BAD EPOCHS (no improvement): 0
2023-04-06 20:35:51,009 ----------------------------------------------------------------------------------------------------
2023-04-06 20:36:55,001 epoch 126 - iter 372/3720 - loss 0.02531438 - time (sec): 63.99 - samples/sec: 4911.63 - lr: 0.025000
2023-04-06 20:37:58,546 epoch 126 - iter 744/3720 - loss 0.02554386 - time (sec): 127.54 - samples/sec: 4928.08 - lr: 0.025000
2023-04-06 20:39:02,530 epoch 126 - iter 1116/3720 - loss 0.02568486 - time (sec): 191.52 - samples/sec: 4926.48 - lr: 0.025000
2023-04-06 20:40:06,252 epoch 126 - iter 1488/3720 - loss 0.02567360 - time (sec): 255.24 - samples/sec: 4930.82 - lr: 0.025000
2023-04-06 20:41:08,786 epoch 126 - iter 1860/3720 - loss 0.02587454 - time (sec): 317.78 - samples/sec: 4952.65 - lr: 0.025000
2023-04-06 20:42:11,901 epoch 126 - iter 2232/3720 - loss 0.02600226 - time (sec): 380.89 - samples/sec: 4964.09 - lr: 0.025000
2023-04-06 20:43:14,515 epoch 126 - iter 2604/3720 - loss 0.02608638 - time (sec): 443.51 - samples/sec: 4975.03 - lr: 0.025000
2023-04-06 20:44:16,686 epoch 126 - iter 2976/3720 - loss 0.02612202 - time (sec): 505.68 - samples/sec: 4983.84 - lr: 0.025000
2023-04-06 20:45:19,444 epoch 126 - iter 3348/3720 - loss 0.02619542 - time (sec): 568.44 - samples/sec: 4990.47 - lr: 0.025000
2023-04-06 20:46:23,292 epoch 126 - iter 3720/3720 - loss 0.02624532 - time (sec): 632.28 - samples/sec: 4984.65 - lr: 0.025000
2023-04-06 20:46:23,292 ----------------------------------------------------------------------------------------------------
2023-04-06 20:46:23,292 EPOCH 126 done: loss 0.0262 - lr 0.025000
2023-04-06 20:46:23,292 BAD EPOCHS (no improvement): 0
2023-04-06 20:46:23,295 ----------------------------------------------------------------------------------------------------
2023-04-06 20:47:27,596 epoch 127 - iter 372/3720 - loss 0.02579680 - time (sec): 64.30 - samples/sec: 4904.80 - lr: 0.025000
2023-04-06 20:48:31,740 epoch 127 - iter 744/3720 - loss 0.02596528 - time (sec): 128.44 - samples/sec: 4907.61 - lr: 0.025000
2023-04-06 20:49:36,560 epoch 127 - iter 1116/3720 - loss 0.02608173 - time (sec): 193.26 - samples/sec: 4904.64 - lr: 0.025000
2023-04-06 20:50:40,080 epoch 127 - iter 1488/3720 - loss 0.02601741 - time (sec): 256.78 - samples/sec: 4922.38 - lr: 0.025000
2023-04-06 20:51:42,737 epoch 127 - iter 1860/3720 - loss 0.02604764 - time (sec): 319.44 - samples/sec: 4942.99 - lr: 0.025000
2023-04-06 20:52:45,980 epoch 127 - iter 2232/3720 - loss 0.02608747 - time (sec): 382.68 - samples/sec: 4948.80 - lr: 0.025000
2023-04-06 20:53:49,492 epoch 127 - iter 2604/3720 - loss 0.02615918 - time (sec): 446.20 - samples/sec: 4952.40 - lr: 0.025000
2023-04-06 20:54:53,390 epoch 127 - iter 2976/3720 - loss 0.02624227 - time (sec): 510.09 - samples/sec: 4945.33 - lr: 0.025000
2023-04-06 20:55:58,069 epoch 127 - iter 3348/3720 - loss 0.02620766 - time (sec): 574.77 - samples/sec: 4939.41 - lr: 0.025000
2023-04-06 20:57:00,176 epoch 127 - iter 3720/3720 - loss 0.02619732 - time (sec): 636.88 - samples/sec: 4948.67 - lr: 0.025000
2023-04-06 20:57:00,177 ----------------------------------------------------------------------------------------------------
2023-04-06 20:57:00,177 EPOCH 127 done: loss 0.0262 - lr 0.025000
2023-04-06 20:57:00,177 BAD EPOCHS (no improvement): 0
2023-04-06 20:57:00,180 ----------------------------------------------------------------------------------------------------
2023-04-06 20:58:03,014 epoch 128 - iter 372/3720 - loss 0.02628506 - time (sec): 62.83 - samples/sec: 4991.67 - lr: 0.025000
2023-04-06 20:59:05,789 epoch 128 - iter 744/3720 - loss 0.02647100 - time (sec): 125.61 - samples/sec: 4990.64 - lr: 0.025000
2023-04-06 21:00:11,004 epoch 128 - iter 1116/3720 - loss 0.02659542 - time (sec): 190.82 - samples/sec: 4946.11 - lr: 0.025000
2023-04-06 21:01:16,208 epoch 128 - iter 1488/3720 - loss 0.02650373 - time (sec): 256.03 - samples/sec: 4932.21 - lr: 0.025000
2023-04-06 21:02:19,235 epoch 128 - iter 1860/3720 - loss 0.02648294 - time (sec): 319.06 - samples/sec: 4937.35 - lr: 0.025000
2023-04-06 21:03:23,840 epoch 128 - iter 2232/3720 - loss 0.02646224 - time (sec): 383.66 - samples/sec: 4930.41 - lr: 0.025000
2023-04-06 21:04:27,528 epoch 128 - iter 2604/3720 - loss 0.02654571 - time (sec): 447.35 - samples/sec: 4931.29 - lr: 0.025000
2023-04-06 21:05:31,354 epoch 128 - iter 2976/3720 - loss 0.02643899 - time (sec): 511.17 - samples/sec: 4932.34 - lr: 0.025000
2023-04-06 21:06:35,909 epoch 128 - iter 3348/3720 - loss 0.02642192 - time (sec): 575.73 - samples/sec: 4929.00 - lr: 0.025000
2023-04-06 21:07:39,186 epoch 128 - iter 3720/3720 - loss 0.02639418 - time (sec): 639.01 - samples/sec: 4932.21 - lr: 0.025000
2023-04-06 21:07:39,186 ----------------------------------------------------------------------------------------------------
2023-04-06 21:07:39,186 EPOCH 128 done: loss 0.0264 - lr 0.025000
2023-04-06 21:07:39,186 BAD EPOCHS (no improvement): 1
2023-04-06 21:07:39,189 ----------------------------------------------------------------------------------------------------
2023-04-06 21:08:42,876 epoch 129 - iter 372/3720 - loss 0.02643114 - time (sec): 63.69 - samples/sec: 4904.80 - lr: 0.025000
2023-04-06 21:09:47,140 epoch 129 - iter 744/3720 - loss 0.02633930 - time (sec): 127.95 - samples/sec: 4917.39 - lr: 0.025000
2023-04-06 21:10:50,200 epoch 129 - iter 1116/3720 - loss 0.02634387 - time (sec): 191.01 - samples/sec: 4942.11 - lr: 0.025000
2023-04-06 21:11:53,293 epoch 129 - iter 1488/3720 - loss 0.02641411 - time (sec): 254.10 - samples/sec: 4952.45 - lr: 0.025000
2023-04-06 21:12:55,714 epoch 129 - iter 1860/3720 - loss 0.02637481 - time (sec): 316.53 - samples/sec: 4975.52 - lr: 0.025000
2023-04-06 21:14:00,163 epoch 129 - iter 2232/3720 - loss 0.02623052 - time (sec): 380.97 - samples/sec: 4973.52 - lr: 0.025000
2023-04-06 21:15:03,340 epoch 129 - iter 2604/3720 - loss 0.02630420 - time (sec): 444.15 - samples/sec: 4974.64 - lr: 0.025000
2023-04-06 21:16:06,768 epoch 129 - iter 2976/3720 - loss 0.02640706 - time (sec): 507.58 - samples/sec: 4970.72 - lr: 0.025000
2023-04-06 21:17:10,516 epoch 129 - iter 3348/3720 - loss 0.02638523 - time (sec): 571.33 - samples/sec: 4966.94 - lr: 0.025000
2023-04-06 21:18:15,192 epoch 129 - iter 3720/3720 - loss 0.02632172 - time (sec): 636.00 - samples/sec: 4955.50 - lr: 0.025000
2023-04-06 21:18:15,192 ----------------------------------------------------------------------------------------------------
2023-04-06 21:18:15,192 EPOCH 129 done: loss 0.0263 - lr 0.025000
2023-04-06 21:18:15,192 BAD EPOCHS (no improvement): 2
2023-04-06 21:18:15,199 ----------------------------------------------------------------------------------------------------
2023-04-06 21:19:18,502 epoch 130 - iter 372/3720 - loss 0.02577645 - time (sec): 63.30 - samples/sec: 4955.91 - lr: 0.025000
2023-04-06 21:20:21,849 epoch 130 - iter 744/3720 - loss 0.02608502 - time (sec): 126.65 - samples/sec: 4970.02 - lr: 0.025000
2023-04-06 21:21:25,546 epoch 130 - iter 1116/3720 - loss 0.02598848 - time (sec): 190.35 - samples/sec: 4965.13 - lr: 0.025000
2023-04-06 21:22:27,917 epoch 130 - iter 1488/3720 - loss 0.02586645 - time (sec): 252.72 - samples/sec: 4985.07 - lr: 0.025000
2023-04-06 21:23:31,506 epoch 130 - iter 1860/3720 - loss 0.02590813 - time (sec): 316.31 - samples/sec: 4981.76 - lr: 0.025000
2023-04-06 21:24:34,761 epoch 130 - iter 2232/3720 - loss 0.02608655 - time (sec): 379.56 - samples/sec: 4981.43 - lr: 0.025000
2023-04-06 21:25:37,983 epoch 130 - iter 2604/3720 - loss 0.02619504 - time (sec): 442.78 - samples/sec: 4980.55 - lr: 0.025000
2023-04-06 21:26:40,873 epoch 130 - iter 2976/3720 - loss 0.02619835 - time (sec): 505.67 - samples/sec: 4984.34 - lr: 0.025000
2023-04-06 21:27:42,738 epoch 130 - iter 3348/3720 - loss 0.02622439 - time (sec): 567.54 - samples/sec: 4997.30 - lr: 0.025000
2023-04-06 21:28:45,512 epoch 130 - iter 3720/3720 - loss 0.02614202 - time (sec): 630.31 - samples/sec: 5000.24 - lr: 0.025000
2023-04-06 21:28:45,512 ----------------------------------------------------------------------------------------------------
2023-04-06 21:28:45,512 EPOCH 130 done: loss 0.0261 - lr 0.025000
2023-04-06 21:28:45,512 BAD EPOCHS (no improvement): 0
2023-04-06 21:28:45,518 ----------------------------------------------------------------------------------------------------
2023-04-06 21:29:48,226 epoch 131 - iter 372/3720 - loss 0.02583416 - time (sec): 62.71 - samples/sec: 5029.38 - lr: 0.025000
2023-04-06 21:30:50,711 epoch 131 - iter 744/3720 - loss 0.02579379 - time (sec): 125.19 - samples/sec: 5035.47 - lr: 0.025000
2023-04-06 21:31:53,354 epoch 131 - iter 1116/3720 - loss 0.02578966 - time (sec): 187.84 - samples/sec: 5043.32 - lr: 0.025000
2023-04-06 21:32:56,132 epoch 131 - iter 1488/3720 - loss 0.02576554 - time (sec): 250.61 - samples/sec: 5044.14 - lr: 0.025000
2023-04-06 21:33:58,300 epoch 131 - iter 1860/3720 - loss 0.02566748 - time (sec): 312.78 - samples/sec: 5045.38 - lr: 0.025000
2023-04-06 21:35:02,035 epoch 131 - iter 2232/3720 - loss 0.02583357 - time (sec): 376.52 - samples/sec: 5028.45 - lr: 0.025000
2023-04-06 21:36:04,422 epoch 131 - iter 2604/3720 - loss 0.02604849 - time (sec): 438.90 - samples/sec: 5026.52 - lr: 0.025000
2023-04-06 21:37:08,289 epoch 131 - iter 2976/3720 - loss 0.02610628 - time (sec): 502.77 - samples/sec: 5013.34 - lr: 0.025000
2023-04-06 21:38:11,661 epoch 131 - iter 3348/3720 - loss 0.02608858 - time (sec): 566.14 - samples/sec: 5005.48 - lr: 0.025000
2023-04-06 21:39:16,384 epoch 131 - iter 3720/3720 - loss 0.02614340 - time (sec): 630.87 - samples/sec: 4995.85 - lr: 0.025000
2023-04-06 21:39:16,384 ----------------------------------------------------------------------------------------------------
2023-04-06 21:39:16,384 EPOCH 131 done: loss 0.0261 - lr 0.025000
2023-04-06 21:39:16,384 BAD EPOCHS (no improvement): 1
2023-04-06 21:39:16,388 ----------------------------------------------------------------------------------------------------
2023-04-06 21:40:19,287 epoch 132 - iter 372/3720 - loss 0.02608236 - time (sec): 62.90 - samples/sec: 4969.94 - lr: 0.025000
2023-04-06 21:41:23,718 epoch 132 - iter 744/3720 - loss 0.02601802 - time (sec): 127.33 - samples/sec: 4945.46 - lr: 0.025000
2023-04-06 21:42:27,863 epoch 132 - iter 1116/3720 - loss 0.02631864 - time (sec): 191.48 - samples/sec: 4937.69 - lr: 0.025000
2023-04-06 21:43:32,268 epoch 132 - iter 1488/3720 - loss 0.02620156 - time (sec): 255.88 - samples/sec: 4933.57 - lr: 0.025000
2023-04-06 21:44:37,090 epoch 132 - iter 1860/3720 - loss 0.02619527 - time (sec): 320.70 - samples/sec: 4920.44 - lr: 0.025000
2023-04-06 21:45:40,680 epoch 132 - iter 2232/3720 - loss 0.02618574 - time (sec): 384.29 - samples/sec: 4921.06 - lr: 0.025000
2023-04-06 21:46:44,513 epoch 132 - iter 2604/3720 - loss 0.02604139 - time (sec): 448.12 - samples/sec: 4917.79 - lr: 0.025000
2023-04-06 21:47:49,229 epoch 132 - iter 2976/3720 - loss 0.02614251 - time (sec): 512.84 - samples/sec: 4913.54 - lr: 0.025000
2023-04-06 21:48:51,742 epoch 132 - iter 3348/3720 - loss 0.02614245 - time (sec): 575.35 - samples/sec: 4925.30 - lr: 0.025000
2023-04-06 21:49:55,623 epoch 132 - iter 3720/3720 - loss 0.02612145 - time (sec): 639.24 - samples/sec: 4930.44 - lr: 0.025000
2023-04-06 21:49:55,623 ----------------------------------------------------------------------------------------------------
2023-04-06 21:49:55,623 EPOCH 132 done: loss 0.0261 - lr 0.025000
2023-04-06 21:49:55,623 BAD EPOCHS (no improvement): 0
2023-04-06 21:49:55,627 ----------------------------------------------------------------------------------------------------
2023-04-06 21:50:58,443 epoch 133 - iter 372/3720 - loss 0.02553210 - time (sec): 62.82 - samples/sec: 5011.88 - lr: 0.025000
2023-04-06 21:52:02,486 epoch 133 - iter 744/3720 - loss 0.02560492 - time (sec): 126.86 - samples/sec: 4979.06 - lr: 0.025000
2023-04-06 21:53:04,156 epoch 133 - iter 1116/3720 - loss 0.02608778 - time (sec): 188.53 - samples/sec: 5017.29 - lr: 0.025000
2023-04-06 21:54:07,561 epoch 133 - iter 1488/3720 - loss 0.02616909 - time (sec): 251.93 - samples/sec: 5004.50 - lr: 0.025000
2023-04-06 21:55:12,056 epoch 133 - iter 1860/3720 - loss 0.02626976 - time (sec): 316.43 - samples/sec: 4980.32 - lr: 0.025000
2023-04-06 21:56:15,780 epoch 133 - iter 2232/3720 - loss 0.02615927 - time (sec): 380.15 - samples/sec: 4974.76 - lr: 0.025000
2023-04-06 21:57:19,597 epoch 133 - iter 2604/3720 - loss 0.02605486 - time (sec): 443.97 - samples/sec: 4969.11 - lr: 0.025000
2023-04-06 21:58:23,279 epoch 133 - iter 2976/3720 - loss 0.02617783 - time (sec): 507.65 - samples/sec: 4967.51 - lr: 0.025000
2023-04-06 21:59:27,050 epoch 133 - iter 3348/3720 - loss 0.02620108 - time (sec): 571.42 - samples/sec: 4963.89 - lr: 0.025000
2023-04-06 22:00:30,946 epoch 133 - iter 3720/3720 - loss 0.02611399 - time (sec): 635.32 - samples/sec: 4960.83 - lr: 0.025000
2023-04-06 22:00:30,946 ----------------------------------------------------------------------------------------------------
2023-04-06 22:00:30,946 EPOCH 133 done: loss 0.0261 - lr 0.025000
2023-04-06 22:00:30,946 BAD EPOCHS (no improvement): 0
2023-04-06 22:00:30,950 ----------------------------------------------------------------------------------------------------
2023-04-06 22:01:35,438 epoch 134 - iter 372/3720 - loss 0.02583291 - time (sec): 64.49 - samples/sec: 4920.43 - lr: 0.025000
2023-04-06 22:02:40,061 epoch 134 - iter 744/3720 - loss 0.02600139 - time (sec): 129.11 - samples/sec: 4909.15 - lr: 0.025000
2023-04-06 22:03:44,224 epoch 134 - iter 1116/3720 - loss 0.02617786 - time (sec): 193.27 - samples/sec: 4915.91 - lr: 0.025000
2023-04-06 22:04:48,602 epoch 134 - iter 1488/3720 - loss 0.02603403 - time (sec): 257.65 - samples/sec: 4911.66 - lr: 0.025000
2023-04-06 22:05:52,644 epoch 134 - iter 1860/3720 - loss 0.02626694 - time (sec): 321.69 - samples/sec: 4921.62 - lr: 0.025000
2023-04-06 22:06:54,744 epoch 134 - iter 2232/3720 - loss 0.02613598 - time (sec): 383.79 - samples/sec: 4937.22 - lr: 0.025000
2023-04-06 22:07:57,460 epoch 134 - iter 2604/3720 - loss 0.02610478 - time (sec): 446.51 - samples/sec: 4951.08 - lr: 0.025000
2023-04-06 22:08:59,147 epoch 134 - iter 2976/3720 - loss 0.02603591 - time (sec): 508.20 - samples/sec: 4967.64 - lr: 0.025000
2023-04-06 22:10:01,299 epoch 134 - iter 3348/3720 - loss 0.02596891 - time (sec): 570.35 - samples/sec: 4975.07 - lr: 0.025000
2023-04-06 22:11:05,089 epoch 134 - iter 3720/3720 - loss 0.02599401 - time (sec): 634.14 - samples/sec: 4970.06 - lr: 0.025000
2023-04-06 22:11:05,089 ----------------------------------------------------------------------------------------------------
2023-04-06 22:11:05,089 EPOCH 134 done: loss 0.0260 - lr 0.025000
2023-04-06 22:11:05,089 BAD EPOCHS (no improvement): 0
2023-04-06 22:11:05,119 ----------------------------------------------------------------------------------------------------
2023-04-06 22:12:09,174 epoch 135 - iter 372/3720 - loss 0.02566066 - time (sec): 64.05 - samples/sec: 4883.81 - lr: 0.025000
2023-04-06 22:13:13,125 epoch 135 - iter 744/3720 - loss 0.02587147 - time (sec): 128.01 - samples/sec: 4891.86 - lr: 0.025000
2023-04-06 22:14:16,754 epoch 135 - iter 1116/3720 - loss 0.02587474 - time (sec): 191.63 - samples/sec: 4907.99 - lr: 0.025000
2023-04-06 22:15:21,193 epoch 135 - iter 1488/3720 - loss 0.02595556 - time (sec): 256.07 - samples/sec: 4911.45 - lr: 0.025000
2023-04-06 22:16:24,594 epoch 135 - iter 1860/3720 - loss 0.02582688 - time (sec): 319.47 - samples/sec: 4928.54 - lr: 0.025000
2023-04-06 22:17:28,680 epoch 135 - iter 2232/3720 - loss 0.02583154 - time (sec): 383.56 - samples/sec: 4931.94 - lr: 0.025000
2023-04-06 22:18:33,286 epoch 135 - iter 2604/3720 - loss 0.02586650 - time (sec): 448.17 - samples/sec: 4928.42 - lr: 0.025000
2023-04-06 22:19:36,631 epoch 135 - iter 2976/3720 - loss 0.02592220 - time (sec): 511.51 - samples/sec: 4930.47 - lr: 0.025000
2023-04-06 22:20:40,501 epoch 135 - iter 3348/3720 - loss 0.02587278 - time (sec): 575.38 - samples/sec: 4930.62 - lr: 0.025000
2023-04-06 22:21:44,435 epoch 135 - iter 3720/3720 - loss 0.02590732 - time (sec): 639.32 - samples/sec: 4929.82 - lr: 0.025000
2023-04-06 22:21:44,435 ----------------------------------------------------------------------------------------------------
2023-04-06 22:21:44,435 EPOCH 135 done: loss 0.0259 - lr 0.025000
2023-04-06 22:21:44,435 BAD EPOCHS (no improvement): 0
2023-04-06 22:21:44,439 ----------------------------------------------------------------------------------------------------
2023-04-06 22:22:48,281 epoch 136 - iter 372/3720 - loss 0.02723063 - time (sec): 63.84 - samples/sec: 4950.72 - lr: 0.025000
2023-04-06 22:23:52,174 epoch 136 - iter 744/3720 - loss 0.02611622 - time (sec): 127.74 - samples/sec: 4951.16 - lr: 0.025000
2023-04-06 22:24:56,334 epoch 136 - iter 1116/3720 - loss 0.02612083 - time (sec): 191.90 - samples/sec: 4946.21 - lr: 0.025000
2023-04-06 22:25:59,241 epoch 136 - iter 1488/3720 - loss 0.02621783 - time (sec): 254.80 - samples/sec: 4957.76 - lr: 0.025000
2023-04-06 22:27:02,838 epoch 136 - iter 1860/3720 - loss 0.02605925 - time (sec): 318.40 - samples/sec: 4953.83 - lr: 0.025000
2023-04-06 22:28:06,527 epoch 136 - iter 2232/3720 - loss 0.02603578 - time (sec): 382.09 - samples/sec: 4955.32 - lr: 0.025000
2023-04-06 22:29:09,400 epoch 136 - iter 2604/3720 - loss 0.02598420 - time (sec): 444.96 - samples/sec: 4960.21 - lr: 0.025000
2023-04-06 22:30:12,756 epoch 136 - iter 2976/3720 - loss 0.02593483 - time (sec): 508.32 - samples/sec: 4960.75 - lr: 0.025000
2023-04-06 22:31:16,277 epoch 136 - iter 3348/3720 - loss 0.02593681 - time (sec): 571.84 - samples/sec: 4960.80 - lr: 0.025000
2023-04-06 22:32:19,806 epoch 136 - iter 3720/3720 - loss 0.02599687 - time (sec): 635.37 - samples/sec: 4960.46 - lr: 0.025000
2023-04-06 22:32:19,806 ----------------------------------------------------------------------------------------------------
2023-04-06 22:32:19,806 EPOCH 136 done: loss 0.0260 - lr 0.025000
2023-04-06 22:32:19,806 BAD EPOCHS (no improvement): 1
2023-04-06 22:32:19,809 ----------------------------------------------------------------------------------------------------
2023-04-06 22:33:23,643 epoch 137 - iter 372/3720 - loss 0.02613913 - time (sec): 63.83 - samples/sec: 4924.51 - lr: 0.025000
2023-04-06 22:34:29,340 epoch 137 - iter 744/3720 - loss 0.02598281 - time (sec): 129.53 - samples/sec: 4881.99 - lr: 0.025000
2023-04-06 22:35:33,482 epoch 137 - iter 1116/3720 - loss 0.02592690 - time (sec): 193.67 - samples/sec: 4886.97 - lr: 0.025000
2023-04-06 22:36:36,862 epoch 137 - iter 1488/3720 - loss 0.02599989 - time (sec): 257.05 - samples/sec: 4909.45 - lr: 0.025000
2023-04-06 22:37:40,900 epoch 137 - iter 1860/3720 - loss 0.02584012 - time (sec): 321.09 - samples/sec: 4915.39 - lr: 0.025000
2023-04-06 22:38:43,964 epoch 137 - iter 2232/3720 - loss 0.02583791 - time (sec): 384.15 - samples/sec: 4928.64 - lr: 0.025000
2023-04-06 22:39:47,727 epoch 137 - iter 2604/3720 - loss 0.02584277 - time (sec): 447.92 - samples/sec: 4929.96 - lr: 0.025000
2023-04-06 22:40:51,228 epoch 137 - iter 2976/3720 - loss 0.02588664 - time (sec): 511.42 - samples/sec: 4931.63 - lr: 0.025000
2023-04-06 22:41:55,001 epoch 137 - iter 3348/3720 - loss 0.02585971 - time (sec): 575.19 - samples/sec: 4932.24 - lr: 0.025000
2023-04-06 22:42:58,501 epoch 137 - iter 3720/3720 - loss 0.02583166 - time (sec): 638.69 - samples/sec: 4934.64 - lr: 0.025000
2023-04-06 22:42:58,501 ----------------------------------------------------------------------------------------------------
2023-04-06 22:42:58,501 EPOCH 137 done: loss 0.0258 - lr 0.025000
2023-04-06 22:42:58,501 BAD EPOCHS (no improvement): 0
2023-04-06 22:42:58,504 ----------------------------------------------------------------------------------------------------
2023-04-06 22:44:02,262 epoch 138 - iter 372/3720 - loss 0.02638899 - time (sec): 63.76 - samples/sec: 4950.99 - lr: 0.025000
2023-04-06 22:45:06,774 epoch 138 - iter 744/3720 - loss 0.02577679 - time (sec): 128.27 - samples/sec: 4924.26 - lr: 0.025000
2023-04-06 22:46:11,548 epoch 138 - iter 1116/3720 - loss 0.02563858 - time (sec): 193.04 - samples/sec: 4906.53 - lr: 0.025000
2023-04-06 22:47:17,012 epoch 138 - iter 1488/3720 - loss 0.02551228 - time (sec): 258.51 - samples/sec: 4890.39 - lr: 0.025000
2023-04-06 22:48:21,572 epoch 138 - iter 1860/3720 - loss 0.02556028 - time (sec): 323.07 - samples/sec: 4886.61 - lr: 0.025000
2023-04-06 22:49:24,678 epoch 138 - iter 2232/3720 - loss 0.02568383 - time (sec): 386.17 - samples/sec: 4907.24 - lr: 0.025000
2023-04-06 22:50:27,797 epoch 138 - iter 2604/3720 - loss 0.02579370 - time (sec): 449.29 - samples/sec: 4917.43 - lr: 0.025000
2023-04-06 22:51:30,979 epoch 138 - iter 2976/3720 - loss 0.02573020 - time (sec): 512.47 - samples/sec: 4923.16 - lr: 0.025000
2023-04-06 22:52:33,807 epoch 138 - iter 3348/3720 - loss 0.02588175 - time (sec): 575.30 - samples/sec: 4934.35 - lr: 0.025000
2023-04-06 22:53:36,022 epoch 138 - iter 3720/3720 - loss 0.02588153 - time (sec): 637.52 - samples/sec: 4943.73 - lr: 0.025000
2023-04-06 22:53:36,022 ----------------------------------------------------------------------------------------------------
2023-04-06 22:53:36,022 EPOCH 138 done: loss 0.0259 - lr 0.025000
2023-04-06 22:53:36,022 BAD EPOCHS (no improvement): 1
2023-04-06 22:53:36,026 ----------------------------------------------------------------------------------------------------
2023-04-06 22:54:38,481 epoch 139 - iter 372/3720 - loss 0.02552485 - time (sec): 62.45 - samples/sec: 5022.53 - lr: 0.025000
2023-04-06 22:55:41,481 epoch 139 - iter 744/3720 - loss 0.02553804 - time (sec): 125.46 - samples/sec: 5017.18 - lr: 0.025000
2023-04-06 22:56:44,429 epoch 139 - iter 1116/3720 - loss 0.02573355 - time (sec): 188.40 - samples/sec: 5018.26 - lr: 0.025000
2023-04-06 22:57:46,288 epoch 139 - iter 1488/3720 - loss 0.02583800 - time (sec): 250.26 - samples/sec: 5034.55 - lr: 0.025000
2023-04-06 22:58:50,182 epoch 139 - iter 1860/3720 - loss 0.02573281 - time (sec): 314.16 - samples/sec: 5007.64 - lr: 0.025000
2023-04-06 22:59:54,958 epoch 139 - iter 2232/3720 - loss 0.02593784 - time (sec): 378.93 - samples/sec: 4982.69 - lr: 0.025000
2023-04-06 23:00:59,157 epoch 139 - iter 2604/3720 - loss 0.02592564 - time (sec): 443.13 - samples/sec: 4976.41 - lr: 0.025000
2023-04-06 23:02:03,185 epoch 139 - iter 2976/3720 - loss 0.02590531 - time (sec): 507.16 - samples/sec: 4971.44 - lr: 0.025000
2023-04-06 23:03:07,721 epoch 139 - iter 3348/3720 - loss 0.02590306 - time (sec): 571.69 - samples/sec: 4965.32 - lr: 0.025000
2023-04-06 23:04:11,651 epoch 139 - iter 3720/3720 - loss 0.02592914 - time (sec): 635.63 - samples/sec: 4958.44 - lr: 0.025000
2023-04-06 23:04:11,652 ----------------------------------------------------------------------------------------------------
2023-04-06 23:04:11,652 EPOCH 139 done: loss 0.0259 - lr 0.025000
2023-04-06 23:04:11,652 BAD EPOCHS (no improvement): 2
2023-04-06 23:04:11,655 ----------------------------------------------------------------------------------------------------
2023-04-06 23:05:15,980 epoch 140 - iter 372/3720 - loss 0.02533624 - time (sec): 64.32 - samples/sec: 4923.12 - lr: 0.025000
2023-04-06 23:06:19,205 epoch 140 - iter 744/3720 - loss 0.02598887 - time (sec): 127.55 - samples/sec: 4940.86 - lr: 0.025000
2023-04-06 23:07:23,079 epoch 140 - iter 1116/3720 - loss 0.02585332 - time (sec): 191.42 - samples/sec: 4950.32 - lr: 0.025000
2023-04-06 23:08:26,418 epoch 140 - iter 1488/3720 - loss 0.02574556 - time (sec): 254.76 - samples/sec: 4964.48 - lr: 0.025000
2023-04-06 23:09:30,357 epoch 140 - iter 1860/3720 - loss 0.02578041 - time (sec): 318.70 - samples/sec: 4957.90 - lr: 0.025000
2023-04-06 23:10:33,812 epoch 140 - iter 2232/3720 - loss 0.02575237 - time (sec): 382.16 - samples/sec: 4957.11 - lr: 0.025000
2023-04-06 23:11:37,488 epoch 140 - iter 2604/3720 - loss 0.02572028 - time (sec): 445.83 - samples/sec: 4954.54 - lr: 0.025000
2023-04-06 23:12:40,708 epoch 140 - iter 2976/3720 - loss 0.02579612 - time (sec): 509.05 - samples/sec: 4956.48 - lr: 0.025000
2023-04-06 23:13:44,271 epoch 140 - iter 3348/3720 - loss 0.02577860 - time (sec): 572.62 - samples/sec: 4955.38 - lr: 0.025000
2023-04-06 23:14:47,348 epoch 140 - iter 3720/3720 - loss 0.02574111 - time (sec): 635.69 - samples/sec: 4957.92 - lr: 0.025000
2023-04-06 23:14:47,348 ----------------------------------------------------------------------------------------------------
2023-04-06 23:14:47,348 EPOCH 140 done: loss 0.0257 - lr 0.025000
2023-04-06 23:14:47,348 BAD EPOCHS (no improvement): 0
2023-04-06 23:14:47,352 ----------------------------------------------------------------------------------------------------
2023-04-06 23:15:51,919 epoch 141 - iter 372/3720 - loss 0.02442453 - time (sec): 64.57 - samples/sec: 4882.76 - lr: 0.025000
2023-04-06 23:16:55,411 epoch 141 - iter 744/3720 - loss 0.02513508 - time (sec): 128.06 - samples/sec: 4921.75 - lr: 0.025000
2023-04-06 23:17:59,499 epoch 141 - iter 1116/3720 - loss 0.02526491 - time (sec): 192.15 - samples/sec: 4924.39 - lr: 0.025000
2023-04-06 23:19:03,941 epoch 141 - iter 1488/3720 - loss 0.02534890 - time (sec): 256.59 - samples/sec: 4927.69 - lr: 0.025000
2023-04-06 23:20:07,287 epoch 141 - iter 1860/3720 - loss 0.02549156 - time (sec): 319.94 - samples/sec: 4932.14 - lr: 0.025000
2023-04-06 23:21:11,308 epoch 141 - iter 2232/3720 - loss 0.02555004 - time (sec): 383.96 - samples/sec: 4926.40 - lr: 0.025000
2023-04-06 23:22:14,759 epoch 141 - iter 2604/3720 - loss 0.02561058 - time (sec): 447.41 - samples/sec: 4932.97 - lr: 0.025000
2023-04-06 23:23:16,727 epoch 141 - iter 2976/3720 - loss 0.02566289 - time (sec): 509.38 - samples/sec: 4946.11 - lr: 0.025000
2023-04-06 23:24:20,412 epoch 141 - iter 3348/3720 - loss 0.02566191 - time (sec): 573.06 - samples/sec: 4949.10 - lr: 0.025000
2023-04-06 23:25:24,711 epoch 141 - iter 3720/3720 - loss 0.02572472 - time (sec): 637.36 - samples/sec: 4944.95 - lr: 0.025000
2023-04-06 23:25:24,711 ----------------------------------------------------------------------------------------------------
2023-04-06 23:25:24,711 EPOCH 141 done: loss 0.0257 - lr 0.025000
2023-04-06 23:25:24,712 BAD EPOCHS (no improvement): 0
2023-04-06 23:25:24,715 ----------------------------------------------------------------------------------------------------
2023-04-06 23:26:28,871 epoch 142 - iter 372/3720 - loss 0.02454281 - time (sec): 64.16 - samples/sec: 4919.81 - lr: 0.025000
2023-04-06 23:27:33,203 epoch 142 - iter 744/3720 - loss 0.02554346 - time (sec): 128.49 - samples/sec: 4916.30 - lr: 0.025000
2023-04-06 23:28:38,013 epoch 142 - iter 1116/3720 - loss 0.02583124 - time (sec): 193.30 - samples/sec: 4895.76 - lr: 0.025000
2023-04-06 23:29:42,387 epoch 142 - iter 1488/3720 - loss 0.02585711 - time (sec): 257.67 - samples/sec: 4901.13 - lr: 0.025000
2023-04-06 23:30:45,098 epoch 142 - iter 1860/3720 - loss 0.02574555 - time (sec): 320.38 - samples/sec: 4915.59 - lr: 0.025000
2023-04-06 23:31:49,118 epoch 142 - iter 2232/3720 - loss 0.02584393 - time (sec): 384.40 - samples/sec: 4915.32 - lr: 0.025000
2023-04-06 23:32:52,542 epoch 142 - iter 2604/3720 - loss 0.02579463 - time (sec): 447.83 - samples/sec: 4920.97 - lr: 0.025000
2023-04-06 23:33:55,836 epoch 142 - iter 2976/3720 - loss 0.02586720 - time (sec): 511.12 - samples/sec: 4928.60 - lr: 0.025000
2023-04-06 23:35:00,469 epoch 142 - iter 3348/3720 - loss 0.02582206 - time (sec): 575.75 - samples/sec: 4926.09 - lr: 0.025000
2023-04-06 23:36:04,441 epoch 142 - iter 3720/3720 - loss 0.02575175 - time (sec): 639.73 - samples/sec: 4926.66 - lr: 0.025000
2023-04-06 23:36:04,441 ----------------------------------------------------------------------------------------------------
2023-04-06 23:36:04,441 EPOCH 142 done: loss 0.0258 - lr 0.025000
2023-04-06 23:36:04,441 BAD EPOCHS (no improvement): 1
2023-04-06 23:36:04,445 ----------------------------------------------------------------------------------------------------
2023-04-06 23:37:08,262 epoch 143 - iter 372/3720 - loss 0.02531212 - time (sec): 63.82 - samples/sec: 4952.41 - lr: 0.025000
2023-04-06 23:38:12,616 epoch 143 - iter 744/3720 - loss 0.02551014 - time (sec): 128.17 - samples/sec: 4931.40 - lr: 0.025000
2023-04-06 23:39:16,962 epoch 143 - iter 1116/3720 - loss 0.02552322 - time (sec): 192.52 - samples/sec: 4926.55 - lr: 0.025000
2023-04-06 23:40:20,555 epoch 143 - iter 1488/3720 - loss 0.02556579 - time (sec): 256.11 - samples/sec: 4932.25 - lr: 0.025000
2023-04-06 23:41:24,062 epoch 143 - iter 1860/3720 - loss 0.02581501 - time (sec): 319.62 - samples/sec: 4934.83 - lr: 0.025000
2023-04-06 23:42:27,208 epoch 143 - iter 2232/3720 - loss 0.02578200 - time (sec): 382.76 - samples/sec: 4945.20 - lr: 0.025000
2023-04-06 23:43:31,595 epoch 143 - iter 2604/3720 - loss 0.02572099 - time (sec): 447.15 - samples/sec: 4935.20 - lr: 0.025000
2023-04-06 23:44:35,952 epoch 143 - iter 2976/3720 - loss 0.02558884 - time (sec): 511.51 - samples/sec: 4927.93 - lr: 0.025000
2023-04-06 23:45:40,135 epoch 143 - iter 3348/3720 - loss 0.02559362 - time (sec): 575.69 - samples/sec: 4929.64 - lr: 0.025000
2023-04-06 23:46:43,409 epoch 143 - iter 3720/3720 - loss 0.02562851 - time (sec): 638.96 - samples/sec: 4932.53 - lr: 0.025000
2023-04-06 23:46:43,410 ----------------------------------------------------------------------------------------------------
2023-04-06 23:46:43,410 EPOCH 143 done: loss 0.0256 - lr 0.025000
2023-04-06 23:46:43,410 BAD EPOCHS (no improvement): 0
2023-04-06 23:46:43,413 ----------------------------------------------------------------------------------------------------
2023-04-06 23:47:47,065 epoch 144 - iter 372/3720 - loss 0.02503982 - time (sec): 63.65 - samples/sec: 4937.81 - lr: 0.025000
2023-04-06 23:48:50,955 epoch 144 - iter 744/3720 - loss 0.02565553 - time (sec): 127.54 - samples/sec: 4928.72 - lr: 0.025000
2023-04-06 23:49:55,285 epoch 144 - iter 1116/3720 - loss 0.02550931 - time (sec): 191.87 - samples/sec: 4924.62 - lr: 0.025000
2023-04-06 23:50:59,428 epoch 144 - iter 1488/3720 - loss 0.02547788 - time (sec): 256.02 - samples/sec: 4923.84 - lr: 0.025000
2023-04-06 23:52:02,385 epoch 144 - iter 1860/3720 - loss 0.02544773 - time (sec): 318.97 - samples/sec: 4939.46 - lr: 0.025000
2023-04-06 23:53:05,940 epoch 144 - iter 2232/3720 - loss 0.02567112 - time (sec): 382.53 - samples/sec: 4940.52 - lr: 0.025000
2023-04-06 23:54:10,215 epoch 144 - iter 2604/3720 - loss 0.02565937 - time (sec): 446.80 - samples/sec: 4937.55 - lr: 0.025000
2023-04-06 23:55:14,241 epoch 144 - iter 2976/3720 - loss 0.02566799 - time (sec): 510.83 - samples/sec: 4931.94 - lr: 0.025000
2023-04-06 23:56:18,506 epoch 144 - iter 3348/3720 - loss 0.02562112 - time (sec): 575.09 - samples/sec: 4931.61 - lr: 0.025000
2023-04-06 23:57:21,784 epoch 144 - iter 3720/3720 - loss 0.02559035 - time (sec): 638.37 - samples/sec: 4937.12 - lr: 0.025000
2023-04-06 23:57:21,785 ----------------------------------------------------------------------------------------------------
2023-04-06 23:57:21,785 EPOCH 144 done: loss 0.0256 - lr 0.025000
2023-04-06 23:57:21,785 BAD EPOCHS (no improvement): 0
2023-04-06 23:57:21,788 ----------------------------------------------------------------------------------------------------
2023-04-06 23:58:24,501 epoch 145 - iter 372/3720 - loss 0.02545164 - time (sec): 62.71 - samples/sec: 5042.47 - lr: 0.025000
2023-04-06 23:59:28,252 epoch 145 - iter 744/3720 - loss 0.02574462 - time (sec): 126.46 - samples/sec: 5001.06 - lr: 0.025000
2023-04-07 00:00:32,273 epoch 145 - iter 1116/3720 - loss 0.02563708 - time (sec): 190.48 - samples/sec: 4972.09 - lr: 0.025000
2023-04-07 00:01:34,593 epoch 145 - iter 1488/3720 - loss 0.02546344 - time (sec): 252.80 - samples/sec: 4993.69 - lr: 0.025000
2023-04-07 00:02:36,830 epoch 145 - iter 1860/3720 - loss 0.02561387 - time (sec): 315.04 - samples/sec: 4997.70 - lr: 0.025000
2023-04-07 00:03:39,276 epoch 145 - iter 2232/3720 - loss 0.02561802 - time (sec): 377.49 - samples/sec: 5002.92 - lr: 0.025000
2023-04-07 00:04:43,702 epoch 145 - iter 2604/3720 - loss 0.02560137 - time (sec): 441.91 - samples/sec: 4985.64 - lr: 0.025000
2023-04-07 00:05:47,108 epoch 145 - iter 2976/3720 - loss 0.02570301 - time (sec): 505.32 - samples/sec: 4982.07 - lr: 0.025000
2023-04-07 00:06:51,444 epoch 145 - iter 3348/3720 - loss 0.02567101 - time (sec): 569.66 - samples/sec: 4971.75 - lr: 0.025000
2023-04-07 00:07:56,444 epoch 145 - iter 3720/3720 - loss 0.02563443 - time (sec): 634.66 - samples/sec: 4966.02 - lr: 0.025000
2023-04-07 00:07:56,444 ----------------------------------------------------------------------------------------------------
2023-04-07 00:07:56,444 EPOCH 145 done: loss 0.0256 - lr 0.025000
2023-04-07 00:07:56,445 BAD EPOCHS (no improvement): 1
2023-04-07 00:07:56,448 ----------------------------------------------------------------------------------------------------
2023-04-07 00:08:59,126 epoch 146 - iter 372/3720 - loss 0.02589090 - time (sec): 62.68 - samples/sec: 5008.35 - lr: 0.025000
2023-04-07 00:10:03,439 epoch 146 - iter 744/3720 - loss 0.02526705 - time (sec): 126.99 - samples/sec: 4952.15 - lr: 0.025000
2023-04-07 00:11:07,046 epoch 146 - iter 1116/3720 - loss 0.02550601 - time (sec): 190.60 - samples/sec: 4958.67 - lr: 0.025000
2023-04-07 00:12:10,548 epoch 146 - iter 1488/3720 - loss 0.02536717 - time (sec): 254.10 - samples/sec: 4959.25 - lr: 0.025000
2023-04-07 00:13:14,382 epoch 146 - iter 1860/3720 - loss 0.02534244 - time (sec): 317.93 - samples/sec: 4958.88 - lr: 0.025000
2023-04-07 00:14:17,039 epoch 146 - iter 2232/3720 - loss 0.02528017 - time (sec): 380.59 - samples/sec: 4968.09 - lr: 0.025000
2023-04-07 00:15:19,654 epoch 146 - iter 2604/3720 - loss 0.02524634 - time (sec): 443.21 - samples/sec: 4976.96 - lr: 0.025000
2023-04-07 00:16:23,204 epoch 146 - iter 2976/3720 - loss 0.02522499 - time (sec): 506.76 - samples/sec: 4970.64 - lr: 0.025000
2023-04-07 00:17:28,245 epoch 146 - iter 3348/3720 - loss 0.02534385 - time (sec): 571.80 - samples/sec: 4961.30 - lr: 0.025000
2023-04-07 00:18:32,331 epoch 146 - iter 3720/3720 - loss 0.02535678 - time (sec): 635.88 - samples/sec: 4956.43 - lr: 0.025000
2023-04-07 00:18:32,331 ----------------------------------------------------------------------------------------------------
2023-04-07 00:18:32,331 EPOCH 146 done: loss 0.0254 - lr 0.025000
2023-04-07 00:18:32,332 BAD EPOCHS (no improvement): 0
2023-04-07 00:18:32,365 ----------------------------------------------------------------------------------------------------
2023-04-07 00:19:37,026 epoch 147 - iter 372/3720 - loss 0.02507179 - time (sec): 64.66 - samples/sec: 4889.70 - lr: 0.025000
2023-04-07 00:20:41,357 epoch 147 - iter 744/3720 - loss 0.02501378 - time (sec): 128.99 - samples/sec: 4900.85 - lr: 0.025000
2023-04-07 00:21:45,141 epoch 147 - iter 1116/3720 - loss 0.02522842 - time (sec): 192.78 - samples/sec: 4910.37 - lr: 0.025000
2023-04-07 00:22:49,600 epoch 147 - iter 1488/3720 - loss 0.02517993 - time (sec): 257.23 - samples/sec: 4916.48 - lr: 0.025000
2023-04-07 00:23:53,037 epoch 147 - iter 1860/3720 - loss 0.02536139 - time (sec): 320.67 - samples/sec: 4925.55 - lr: 0.025000
2023-04-07 00:24:57,619 epoch 147 - iter 2232/3720 - loss 0.02547593 - time (sec): 385.25 - samples/sec: 4920.70 - lr: 0.025000
2023-04-07 00:26:01,113 epoch 147 - iter 2604/3720 - loss 0.02549843 - time (sec): 448.75 - samples/sec: 4925.88 - lr: 0.025000
2023-04-07 00:27:04,506 epoch 147 - iter 2976/3720 - loss 0.02552815 - time (sec): 512.14 - samples/sec: 4930.06 - lr: 0.025000
2023-04-07 00:28:08,452 epoch 147 - iter 3348/3720 - loss 0.02553373 - time (sec): 576.09 - samples/sec: 4927.51 - lr: 0.025000
2023-04-07 00:29:11,291 epoch 147 - iter 3720/3720 - loss 0.02552517 - time (sec): 638.93 - samples/sec: 4932.83 - lr: 0.025000
2023-04-07 00:29:11,291 ----------------------------------------------------------------------------------------------------
2023-04-07 00:29:11,291 EPOCH 147 done: loss 0.0255 - lr 0.025000
2023-04-07 00:29:11,291 BAD EPOCHS (no improvement): 1
2023-04-07 00:29:11,295 ----------------------------------------------------------------------------------------------------
2023-04-07 00:30:15,880 epoch 148 - iter 372/3720 - loss 0.02500274 - time (sec): 64.58 - samples/sec: 4882.34 - lr: 0.025000
2023-04-07 00:31:20,464 epoch 148 - iter 744/3720 - loss 0.02497199 - time (sec): 129.17 - samples/sec: 4898.07 - lr: 0.025000
2023-04-07 00:32:23,169 epoch 148 - iter 1116/3720 - loss 0.02508581 - time (sec): 191.87 - samples/sec: 4931.86 - lr: 0.025000
2023-04-07 00:33:27,782 epoch 148 - iter 1488/3720 - loss 0.02529478 - time (sec): 256.49 - samples/sec: 4920.74 - lr: 0.025000
2023-04-07 00:34:31,503 epoch 148 - iter 1860/3720 - loss 0.02533457 - time (sec): 320.21 - samples/sec: 4925.54 - lr: 0.025000
2023-04-07 00:35:35,200 epoch 148 - iter 2232/3720 - loss 0.02534177 - time (sec): 383.90 - samples/sec: 4930.17 - lr: 0.025000
2023-04-07 00:36:37,690 epoch 148 - iter 2604/3720 - loss 0.02534918 - time (sec): 446.40 - samples/sec: 4940.98 - lr: 0.025000
2023-04-07 00:37:41,976 epoch 148 - iter 2976/3720 - loss 0.02543365 - time (sec): 510.68 - samples/sec: 4937.32 - lr: 0.025000
2023-04-07 00:38:45,417 epoch 148 - iter 3348/3720 - loss 0.02546538 - time (sec): 574.12 - samples/sec: 4939.68 - lr: 0.025000
2023-04-07 00:39:49,356 epoch 148 - iter 3720/3720 - loss 0.02551808 - time (sec): 638.06 - samples/sec: 4939.51 - lr: 0.025000
2023-04-07 00:39:49,356 ----------------------------------------------------------------------------------------------------
2023-04-07 00:39:49,356 EPOCH 148 done: loss 0.0255 - lr 0.025000
2023-04-07 00:39:49,356 BAD EPOCHS (no improvement): 2
2023-04-07 00:39:49,360 ----------------------------------------------------------------------------------------------------
2023-04-07 00:40:53,940 epoch 149 - iter 372/3720 - loss 0.02527195 - time (sec): 64.58 - samples/sec: 4907.18 - lr: 0.025000
2023-04-07 00:41:58,077 epoch 149 - iter 744/3720 - loss 0.02541228 - time (sec): 128.72 - samples/sec: 4931.90 - lr: 0.025000
2023-04-07 00:43:01,475 epoch 149 - iter 1116/3720 - loss 0.02531123 - time (sec): 192.12 - samples/sec: 4926.60 - lr: 0.025000
2023-04-07 00:44:05,865 epoch 149 - iter 1488/3720 - loss 0.02574382 - time (sec): 256.50 - samples/sec: 4921.44 - lr: 0.025000
2023-04-07 00:45:08,754 epoch 149 - iter 1860/3720 - loss 0.02557559 - time (sec): 319.39 - samples/sec: 4937.89 - lr: 0.025000
2023-04-07 00:46:12,748 epoch 149 - iter 2232/3720 - loss 0.02538772 - time (sec): 383.39 - samples/sec: 4934.65 - lr: 0.025000
2023-04-07 00:47:17,990 epoch 149 - iter 2604/3720 - loss 0.02539580 - time (sec): 448.63 - samples/sec: 4924.78 - lr: 0.025000
2023-04-07 00:48:21,744 epoch 149 - iter 2976/3720 - loss 0.02543328 - time (sec): 512.38 - samples/sec: 4928.43 - lr: 0.025000
2023-04-07 00:49:24,803 epoch 149 - iter 3348/3720 - loss 0.02537053 - time (sec): 575.44 - samples/sec: 4934.66 - lr: 0.025000
2023-04-07 00:50:28,000 epoch 149 - iter 3720/3720 - loss 0.02537367 - time (sec): 638.64 - samples/sec: 4935.04 - lr: 0.025000
2023-04-07 00:50:28,000 ----------------------------------------------------------------------------------------------------
2023-04-07 00:50:28,001 EPOCH 149 done: loss 0.0254 - lr 0.025000
2023-04-07 00:50:28,001 BAD EPOCHS (no improvement): 3
2023-04-07 00:50:28,004 ----------------------------------------------------------------------------------------------------
2023-04-07 00:51:31,132 epoch 150 - iter 372/3720 - loss 0.02509917 - time (sec): 63.13 - samples/sec: 4999.77 - lr: 0.025000
2023-04-07 00:52:33,491 epoch 150 - iter 744/3720 - loss 0.02528357 - time (sec): 125.49 - samples/sec: 5040.43 - lr: 0.025000
2023-04-07 00:53:37,008 epoch 150 - iter 1116/3720 - loss 0.02521386 - time (sec): 189.00 - samples/sec: 5023.44 - lr: 0.025000
2023-04-07 00:54:39,974 epoch 150 - iter 1488/3720 - loss 0.02525906 - time (sec): 251.97 - samples/sec: 5026.77 - lr: 0.025000
2023-04-07 00:55:42,900 epoch 150 - iter 1860/3720 - loss 0.02537075 - time (sec): 314.90 - samples/sec: 5022.86 - lr: 0.025000
2023-04-07 00:56:44,701 epoch 150 - iter 2232/3720 - loss 0.02534848 - time (sec): 376.70 - samples/sec: 5032.75 - lr: 0.025000
2023-04-07 00:57:48,820 epoch 150 - iter 2604/3720 - loss 0.02528790 - time (sec): 440.82 - samples/sec: 5021.57 - lr: 0.025000
2023-04-07 00:58:52,332 epoch 150 - iter 2976/3720 - loss 0.02534070 - time (sec): 504.33 - samples/sec: 5010.36 - lr: 0.025000
2023-04-07 00:59:55,540 epoch 150 - iter 3348/3720 - loss 0.02529852 - time (sec): 567.54 - samples/sec: 5002.01 - lr: 0.025000
2023-04-07 01:00:59,069 epoch 150 - iter 3720/3720 - loss 0.02530371 - time (sec): 631.06 - samples/sec: 4994.28 - lr: 0.025000
2023-04-07 01:00:59,070 ----------------------------------------------------------------------------------------------------
2023-04-07 01:00:59,070 EPOCH 150 done: loss 0.0253 - lr 0.025000
2023-04-07 01:00:59,070 BAD EPOCHS (no improvement): 0
2023-04-07 01:01:06,016 ----------------------------------------------------------------------------------------------------
2023-04-07 01:01:06,017 Testing using last state of model ...
2023-04-07 01:02:10,849 Evaluating as a multi-label problem: False
2023-04-07 01:02:11,002 0.9066 0.9053 0.9059 0.8494
2023-04-07 01:02:11,002
Results:
- F-score (micro) 0.9059
- F-score (macro) 0.8941
- Accuracy 0.8494
By class:
              precision    recall  f1-score   support

         LOC     0.8913    0.9020    0.8967     11495
         PER     0.9592    0.9659    0.9625      7444
        MISC     0.8615    0.8310    0.8460      3946
         ORG     0.8869    0.8559    0.8712      2429

   micro avg     0.9066    0.9053    0.9059     25314
   macro avg     0.8998    0.8887    0.8941     25314
weighted avg     0.9062    0.9053    0.9057     25314
2023-04-07 01:02:11,002 ----------------------------------------------------------------------------------------------------
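Note: the aggregate scores in the results above can be cross-checked against the per-class rows. The sketch below is not part of the training run. It assumes the usual definitions: macro F1 is the unweighted mean of the class F1 scores, weighted F1 is the support-weighted mean, and micro F1 is the harmonic mean of micro precision and recall. The variable names are illustrative, and the inputs are the four-decimal values copied from the log, so agreement is only expected up to rounding.

```python
# Per-class (precision, recall, f1, support) copied from the "By class" table.
by_class = {
    "LOC": (0.8913, 0.9020, 0.8967, 11495),
    "PER": (0.9592, 0.9659, 0.9625, 7444),
    "MISC": (0.8615, 0.8310, 0.8460, 3946),
    "ORG": (0.8869, 0.8559, 0.8712, 2429),
}

# Micro-averaged precision and recall as reported in the log.
micro_precision, micro_recall = 0.9066, 0.9053


def f1(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 if both are zero)."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


total_support = sum(support for _, _, _, support in by_class.values())

# Macro F1: plain average over classes, ignoring class frequency.
macro_f1 = sum(score for _, _, score, _ in by_class.values()) / len(by_class)

# Weighted F1: average over classes, weighted by support.
weighted_f1 = (
    sum(score * support for _, _, score, support in by_class.values())
    / total_support
)

# Micro F1: computed from the pooled precision and recall.
micro_f1 = f1(micro_precision, micro_recall)

print(f"support  {total_support}")
print(f"macro    {macro_f1:.4f}")
print(f"weighted {weighted_f1:.4f}")
print(f"micro    {micro_f1:.4f}")
```

Averaging the rounded per-class precision values the same way may differ from the logged macro precision in the last digit, since the log was presumably computed from unrounded scores.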