BirdAST / BirdAST_Baseline_GroupKFold_training.log
shiyi-li's picture
Upload 9 files
cd32054 verified
################################### TRAIN_CONFIG ###################################
dataset_dir: ./Audio_XenoCanto
labels_list: ./xeno_labels.csv
model_name: BirdAST_Baseline_GroupKFold
backbone_name: MIT/ast-finetuned-audioset-10-10-0.4593
n_classes: 728
audio_sr: 16000
segment_length: 10
fft_window: 0.025
hop_window_length: 0.01
n_mels: 128
low_cut: 1000
high_cut: 8000
top_db: 100
batch_size: 16
num_workers: 0
n_splits: 5
log_dir: ./training_logs
max_lr: 1e-05
epochs: 10
weight_decay: 0.01
lr_final_div: 1000
amp: True
grad_accum_steps: 1
max_grad_norm: 10000000.0
print_epoch_freq: 1
print_freq: 500
random_seed: 2046
copy: <classmethod(<function Config.copy at 0x7b4f57baf1c0>)>
################################################################################
Failed to detect the name of this notebook, you can set it manually with the WANDB_NOTEBOOK_NAME environment variable to enable code saving.
Epoch 1 [0/559] | Train Loss: 0.3797 Grad: 132458.4531 LR: 4.0008e-07 | Elapse: 5.22s
Epoch 1 [500/559] | Train Loss: 0.1767 Grad: 17217.5918 LR: 9.7549e-06 | Elapse: 632.27s
Epoch 1 [558/559] | Train Loss: 0.1659 Grad: 38565.3086 LR: 1.0000e-05 | Elapse: 704.92s
Epoch 1 [0/140] | Valid Loss: 0.0956 | Elapse: 1.77s
Epoch 1 [139/140] | Valid Loss: 0.1626 | Elapse: 179.13s
Epoch 1 - Train Loss: 0.1659 - Valid Loss: 0.5170 - Elapsed Time: 902.38s
- Epoch 1: Best model found with loss = 0.5170.
Epoch 2 [0/559] | Train Loss: 0.3837 Grad: 82366.4531 LR: 1.0000e-05 | Elapse: 1.39s
Epoch 2 [500/559] | Train Loss: 0.1670 Grad: 26346.2246 LR: 9.7564e-06 | Elapse: 647.59s
Epoch 2 [558/559] | Train Loss: 0.1563 Grad: 53784.7227 LR: 9.6974e-06 | Elapse: 716.52s
Epoch 2 [0/140] | Valid Loss: 0.0949 | Elapse: 1.36s
Epoch 2 [139/140] | Valid Loss: 0.1759 | Elapse: 176.02s
Epoch 2 - Train Loss: 0.1563 - Valid Loss: 0.5562 - Elapsed Time: 910.59s
- Epoch 2: Best model found with loss = 0.5562.
Epoch 3 [0/559] | Train Loss: 0.3296 Grad: 136677.4531 LR: 9.6963e-06 | Elapse: 1.60s
Epoch 3 [500/559] | Train Loss: 0.1347 Grad: 29127.7148 LR: 8.9422e-06 | Elapse: 630.69s
Epoch 3 [558/559] | Train Loss: 0.1259 Grad: 57361.0430 LR: 8.8283e-06 | Elapse: 700.52s
Epoch 3 [0/140] | Valid Loss: 0.0909 | Elapse: 1.56s
Epoch 3 [139/140] | Valid Loss: 0.1843 | Elapse: 176.22s
Epoch 3 - Train Loss: 0.1259 - Valid Loss: 0.6019 - Elapsed Time: 894.87s
- Epoch 3: Best model found with loss = 0.6019.
Epoch 4 [0/559] | Train Loss: 0.2495 Grad: 174822.3438 LR: 8.8263e-06 | Elapse: 1.03s
Epoch 4 [500/559] | Train Loss: 0.0971 Grad: 30384.9941 LR: 7.6526e-06 | Elapse: 616.92s
Epoch 4 [558/559] | Train Loss: 0.0909 Grad: 54755.8555 LR: 7.4974e-06 | Elapse: 686.08s
Epoch 4 [0/140] | Valid Loss: 0.0883 | Elapse: 0.96s
Epoch 4 [139/140] | Valid Loss: 0.1906 | Elapse: 170.98s
Epoch 4 - Train Loss: 0.0909 - Valid Loss: 0.6292 - Elapsed Time: 875.26s
- Epoch 4: Best model found with loss = 0.6292.
Epoch 5 [0/559] | Train Loss: 0.1445 Grad: 179717.0781 LR: 7.4947e-06 | Elapse: 1.67s
Epoch 5 [500/559] | Train Loss: 0.0679 Grad: 31367.4883 LR: 6.0431e-06 | Elapse: 636.79s
Epoch 5 [558/559] | Train Loss: 0.0638 Grad: 46204.8477 LR: 5.8653e-06 | Elapse: 710.08s
Epoch 5 [0/140] | Valid Loss: 0.0862 | Elapse: 1.37s
Epoch 5 [139/140] | Valid Loss: 0.1974 | Elapse: 172.42s
Epoch 5 - Train Loss: 0.0638 - Valid Loss: 0.6417 - Elapsed Time: 900.70s
- Epoch 5: Best model found with loss = 0.6417.
Epoch 6 [0/559] | Train Loss: 0.0752 Grad: 150651.5312 LR: 5.8623e-06 | Elapse: 1.26s
Epoch 6 [500/559] | Train Loss: 0.0498 Grad: 30212.4238 LR: 4.3078e-06 | Elapse: 625.35s
Epoch 6 [558/559] | Train Loss: 0.0471 Grad: 45234.8984 LR: 4.1289e-06 | Elapse: 698.58s
Epoch 6 [0/140] | Valid Loss: 0.0843 | Elapse: 1.56s
Epoch 6 [139/140] | Valid Loss: 0.2014 | Elapse: 168.62s
Epoch 6 - Train Loss: 0.0471 - Valid Loss: 0.6506 - Elapsed Time: 885.11s
- Epoch 6: Best model found with loss = 0.6506.
Epoch 7 [0/559] | Train Loss: 0.0401 Grad: 110378.2734 LR: 4.1258e-06 | Elapse: 1.55s
Epoch 7 [500/559] | Train Loss: 0.0401 Grad: 29949.4160 LR: 2.6560e-06 | Elapse: 747.46s
Epoch 7 [558/559] | Train Loss: 0.0381 Grad: 47635.7148 LR: 2.4976e-06 | Elapse: 850.90s
Epoch 7 [0/140] | Valid Loss: 0.0835 | Elapse: 1.84s
Epoch 7 [139/140] | Valid Loss: 0.2044 | Elapse: 247.83s
Epoch 7 - Train Loss: 0.0381 - Valid Loss: 0.6516 - Elapsed Time: 1122.23s
- Epoch 7: Best model found with loss = 0.6516.
Epoch 8 [0/559] | Train Loss: 0.0310 Grad: 93998.0625 LR: 2.4949e-06 | Elapse: 2.01s
Epoch 8 [500/559] | Train Loss: 0.0364 Grad: 34944.8828 LR: 1.2869e-06 | Elapse: 898.62s
Epoch 8 [558/559] | Train Loss: 0.0347 Grad: 48920.4258 LR: 1.1681e-06 | Elapse: 1001.16s
Epoch 8 [0/140] | Valid Loss: 0.0855 | Elapse: 1.69s
Epoch 8 [139/140] | Valid Loss: 0.2072 | Elapse: 250.89s
Epoch 8 - Train Loss: 0.0347 - Valid Loss: 0.6495 - Elapsed Time: 1275.92s
Epoch 9 [0/559] | Train Loss: 0.0334 Grad: 111821.3047 LR: 1.1661e-06 | Elapse: 1.79s
Epoch 9 [500/559] | Train Loss: 0.0380 Grad: 48075.5664 LR: 3.6575e-07 | Elapse: 896.79s
Epoch 9 [558/559] | Train Loss: 0.0362 Grad: 48004.2852 LR: 3.0086e-07 | Elapse: 999.50s
Epoch 9 [0/140] | Valid Loss: 0.0802 | Elapse: 1.68s
Epoch 9 [139/140] | Valid Loss: 0.2040 | Elapse: 247.83s
Epoch 9 - Train Loss: 0.0362 - Valid Loss: 0.6773 - Elapsed Time: 1272.24s
- Epoch 9: Best model found with loss = 0.6773.
Epoch 10 [0/559] | Train Loss: 0.0419 Grad: 138725.0625 LR: 2.9979e-07 | Elapse: 1.85s
Epoch 10 [500/559] | Train Loss: 0.0442 Grad: 51908.7266 LR: 3.5668e-09 | Elapse: 851.64s
Epoch 10 [558/559] | Train Loss: 0.0418 Grad: 36428.0664 LR: 4.0097e-10 | Elapse: 950.39s
Epoch 10 [0/140] | Valid Loss: 0.0763 | Elapse: 1.74s
Epoch 10 [139/140] | Valid Loss: 0.2015 | Elapse: 253.39s
Epoch 10 - Train Loss: 0.0418 - Valid Loss: 0.6896 - Elapsed Time: 1228.92s
- Epoch 10: Best model found with loss = 0.6896.
Fold 0 | Time: 171.93min | Overall Evaluation Loss: 0.6896
Epoch 1 [0/559] | Train Loss: 0.4015 Grad: 130138.6250 LR: 4.0008e-07 | Elapse: 1.81s
Epoch 1 [500/559] | Train Loss: 0.1759 Grad: 863.0330 LR: 9.7549e-06 | Elapse: 869.10s
Epoch 1 [558/559] | Train Loss: 0.1663 Grad: 33445.6641 LR: 1.0000e-05 | Elapse: 956.09s
Epoch 1 [0/140] | Valid Loss: 0.2185 | Elapse: 1.43s
Epoch 1 [139/140] | Valid Loss: 0.1571 | Elapse: 206.29s
Epoch 1 - Train Loss: 0.1663 - Valid Loss: 0.5072 - Elapsed Time: 1181.07s
- Epoch 1: Best model found with loss = 0.5072.
Epoch 2 [0/559] | Train Loss: 0.3793 Grad: 81459.7891 LR: 1.0000e-05 | Elapse: 1.45s
Epoch 2 [500/559] | Train Loss: 0.1659 Grad: 1246.6095 LR: 9.7564e-06 | Elapse: 724.53s
Epoch 2 [558/559] | Train Loss: 0.1560 Grad: 45349.8438 LR: 9.6974e-06 | Elapse: 796.86s
Epoch 2 [0/140] | Valid Loss: 0.2406 | Elapse: 1.39s
Epoch 2 [139/140] | Valid Loss: 0.1642 | Elapse: 172.62s
Epoch 2 - Train Loss: 0.1560 - Valid Loss: 0.5597 - Elapsed Time: 988.42s
- Epoch 2: Best model found with loss = 0.5597.
Epoch 3 [0/559] | Train Loss: 0.3372 Grad: 126511.1250 LR: 9.6963e-06 | Elapse: 1.61s
Epoch 3 [500/559] | Train Loss: 0.1332 Grad: 1709.5671 LR: 8.9422e-06 | Elapse: 626.50s
Epoch 3 [558/559] | Train Loss: 0.1245 Grad: 48516.1641 LR: 8.8283e-06 | Elapse: 698.54s
Epoch 3 [0/140] | Valid Loss: 0.2499 | Elapse: 1.15s
Epoch 3 [139/140] | Valid Loss: 0.1690 | Elapse: 175.40s
Epoch 3 - Train Loss: 0.1245 - Valid Loss: 0.5997 - Elapsed Time: 892.90s
- Epoch 3: Best model found with loss = 0.5997.
Epoch 4 [0/559] | Train Loss: 0.2329 Grad: 165485.4688 LR: 8.8263e-06 | Elapse: 1.42s
Epoch 4 [500/559] | Train Loss: 0.0928 Grad: 2085.9751 LR: 7.6526e-06 | Elapse: 617.80s
Epoch 4 [558/559] | Train Loss: 0.0867 Grad: 45565.9609 LR: 7.4974e-06 | Elapse: 690.54s
Epoch 4 [0/140] | Valid Loss: 0.2734 | Elapse: 1.55s
Epoch 4 [139/140] | Valid Loss: 0.1746 | Elapse: 167.59s
Epoch 4 - Train Loss: 0.0867 - Valid Loss: 0.6215 - Elapsed Time: 877.07s
- Epoch 4: Best model found with loss = 0.6215.
Epoch 5 [0/559] | Train Loss: 0.1356 Grad: 175726.2500 LR: 7.4947e-06 | Elapse: 1.12s
Epoch 5 [500/559] | Train Loss: 0.0635 Grad: 2302.8323 LR: 6.0431e-06 | Elapse: 619.62s
Epoch 5 [558/559] | Train Loss: 0.0595 Grad: 41125.3477 LR: 5.8653e-06 | Elapse: 690.76s
Epoch 5 [0/140] | Valid Loss: 0.3010 | Elapse: 1.17s
Epoch 5 [139/140] | Valid Loss: 0.1791 | Elapse: 169.10s
Epoch 5 - Train Loss: 0.0595 - Valid Loss: 0.6472 - Elapsed Time: 878.90s
- Epoch 5: Best model found with loss = 0.6472.
Epoch 6 [0/559] | Train Loss: 0.0700 Grad: 136908.1094 LR: 5.8623e-06 | Elapse: 1.13s
Epoch 6 [500/559] | Train Loss: 0.0446 Grad: 2514.1721 LR: 4.3078e-06 | Elapse: 625.46s
Epoch 6 [558/559] | Train Loss: 0.0420 Grad: 37248.8633 LR: 4.1289e-06 | Elapse: 697.66s
Epoch 6 [0/140] | Valid Loss: 0.3092 | Elapse: 1.25s
Epoch 6 [139/140] | Valid Loss: 0.1812 | Elapse: 171.69s
Epoch 6 - Train Loss: 0.0420 - Valid Loss: 0.6583 - Elapsed Time: 888.53s
- Epoch 6: Best model found with loss = 0.6583.
Epoch 7 [0/559] | Train Loss: 0.0358 Grad: 92237.4297 LR: 4.1258e-06 | Elapse: 1.29s
Epoch 7 [500/559] | Train Loss: 0.0349 Grad: 2724.6714 LR: 2.6560e-06 | Elapse: 625.38s
Epoch 7 [558/559] | Train Loss: 0.0330 Grad: 36025.4375 LR: 2.4976e-06 | Elapse: 692.62s
Epoch 7 [0/140] | Valid Loss: 0.3133 | Elapse: 1.35s
Epoch 7 [139/140] | Valid Loss: 0.1820 | Elapse: 169.90s
Epoch 7 - Train Loss: 0.0330 - Valid Loss: 0.6669 - Elapsed Time: 881.25s
- Epoch 7: Best model found with loss = 0.6669.
Epoch 8 [0/559] | Train Loss: 0.0239 Grad: 68634.6406 LR: 2.4949e-06 | Elapse: 0.94s
Epoch 8 [500/559] | Train Loss: 0.0310 Grad: 2664.3289 LR: 1.2869e-06 | Elapse: 623.73s
Epoch 8 [558/559] | Train Loss: 0.0293 Grad: 35448.2188 LR: 1.1681e-06 | Elapse: 693.87s
Epoch 8 [0/140] | Valid Loss: 0.3229 | Elapse: 1.65s
Epoch 8 [139/140] | Valid Loss: 0.1835 | Elapse: 170.20s
Epoch 8 - Train Loss: 0.0293 - Valid Loss: 0.6775 - Elapsed Time: 883.04s
- Epoch 8: Best model found with loss = 0.6775.
Epoch 9 [0/559] | Train Loss: 0.0214 Grad: 67280.5625 LR: 1.1661e-06 | Elapse: 1.50s
Epoch 9 [500/559] | Train Loss: 0.0320 Grad: 2366.7151 LR: 3.6575e-07 | Elapse: 627.09s
Epoch 9 [558/559] | Train Loss: 0.0303 Grad: 33299.4180 LR: 3.0086e-07 | Elapse: 700.43s
Epoch 9 [0/140] | Valid Loss: 0.3247 | Elapse: 1.65s
Epoch 9 [139/140] | Valid Loss: 0.1822 | Elapse: 171.60s
Epoch 9 - Train Loss: 0.0303 - Valid Loss: 0.6887 - Elapsed Time: 891.22s
- Epoch 9: Best model found with loss = 0.6887.
Epoch 10 [0/559] | Train Loss: 0.0396 Grad: 140012.0156 LR: 2.9979e-07 | Elapse: 1.47s
Epoch 10 [500/559] | Train Loss: 0.0399 Grad: 2683.7830 LR: 3.5668e-09 | Elapse: 627.50s
Epoch 10 [558/559] | Train Loss: 0.0374 Grad: 31579.2891 LR: 4.0097e-10 | Elapse: 699.21s
Epoch 10 [0/140] | Valid Loss: 0.3429 | Elapse: 1.75s
Epoch 10 [139/140] | Valid Loss: 0.1868 | Elapse: 170.59s
Epoch 10 - Train Loss: 0.0374 - Valid Loss: 0.6865 - Elapsed Time: 888.77s
Fold 1 | Time: 154.91min | Overall Evaluation Loss: 0.5993
Epoch 1 [0/559] | Train Loss: 0.4080 Grad: 124709.3203 LR: 4.0008e-07 | Elapse: 1.14s
Epoch 1 [500/559] | Train Loss: 0.1747 Grad: 1045.7129 LR: 9.7549e-06 | Elapse: 628.84s
Epoch 1 [558/559] | Train Loss: 0.1648 Grad: 1430.1704 LR: 1.0000e-05 | Elapse: 702.27s
Epoch 1 [0/140] | Valid Loss: 0.0024 | Elapse: 1.05s
Epoch 1 [139/140] | Valid Loss: 0.1646 | Elapse: 172.09s
Epoch 1 - Train Loss: 0.1648 - Valid Loss: 0.5391 - Elapsed Time: 892.89s
- Epoch 1: Best model found with loss = 0.5391.
Epoch 2 [0/559] | Train Loss: 0.3756 Grad: 93382.1719 LR: 1.0000e-05 | Elapse: 1.35s
Epoch 2 [500/559] | Train Loss: 0.1645 Grad: 1447.0669 LR: 9.7564e-06 | Elapse: 626.54s
Epoch 2 [558/559] | Train Loss: 0.1548 Grad: 2336.7964 LR: 9.6974e-06 | Elapse: 695.28s
Epoch 2 [0/140] | Valid Loss: 0.0028 | Elapse: 1.35s
Epoch 2 [139/140] | Valid Loss: 0.1744 | Elapse: 168.90s
Epoch 2 - Train Loss: 0.1548 - Valid Loss: 0.5480 - Elapsed Time: 882.83s
- Epoch 2: Best model found with loss = 0.5480.
Epoch 3 [0/559] | Train Loss: 0.3395 Grad: 155200.7188 LR: 9.6963e-06 | Elapse: 1.21s
Epoch 3 [500/559] | Train Loss: 0.1350 Grad: 1883.7952 LR: 8.9422e-06 | Elapse: 616.61s
Epoch 3 [558/559] | Train Loss: 0.1265 Grad: 3005.8718 LR: 8.8283e-06 | Elapse: 684.27s
Epoch 3 [0/140] | Valid Loss: 0.0033 | Elapse: 1.33s
Epoch 3 [139/140] | Valid Loss: 0.1848 | Elapse: 169.68s
Epoch 3 - Train Loss: 0.1265 - Valid Loss: 0.5793 - Elapsed Time: 872.76s
- Epoch 3: Best model found with loss = 0.5793.
Epoch 4 [0/559] | Train Loss: 0.2507 Grad: 184021.4375 LR: 8.8263e-06 | Elapse: 1.55s
Epoch 4 [500/559] | Train Loss: 0.0979 Grad: 2342.1018 LR: 7.6526e-06 | Elapse: 620.24s
Epoch 4 [558/559] | Train Loss: 0.0919 Grad: 3157.7532 LR: 7.4974e-06 | Elapse: 694.89s
Epoch 4 [0/140] | Valid Loss: 0.0033 | Elapse: 1.65s
Epoch 4 [139/140] | Valid Loss: 0.1921 | Elapse: 171.19s
Epoch 4 - Train Loss: 0.0919 - Valid Loss: 0.5966 - Elapsed Time: 884.77s
- Epoch 4: Best model found with loss = 0.5966.
Epoch 5 [0/559] | Train Loss: 0.1526 Grad: 191586.5938 LR: 7.4947e-06 | Elapse: 1.38s
Epoch 5 [500/559] | Train Loss: 0.0690 Grad: 2454.4775 LR: 6.0431e-06 | Elapse: 619.48s
Epoch 5 [558/559] | Train Loss: 0.0652 Grad: 3468.5071 LR: 5.8653e-06 | Elapse: 691.21s
Epoch 5 [0/140] | Valid Loss: 0.0035 | Elapse: 1.45s
Epoch 5 [139/140] | Valid Loss: 0.1998 | Elapse: 171.49s
Epoch 5 - Train Loss: 0.0652 - Valid Loss: 0.6213 - Elapsed Time: 881.09s
- Epoch 5: Best model found with loss = 0.6213.
Epoch 6 [0/559] | Train Loss: 0.0984 Grad: 176191.0312 LR: 5.8623e-06 | Elapse: 1.48s
Epoch 6 [500/559] | Train Loss: 0.0498 Grad: 2697.2048 LR: 4.3078e-06 | Elapse: 626.78s
Epoch 6 [558/559] | Train Loss: 0.0471 Grad: 3713.1016 LR: 4.1289e-06 | Elapse: 698.05s
Epoch 6 [0/140] | Valid Loss: 0.0033 | Elapse: 1.41s
Epoch 6 [139/140] | Valid Loss: 0.2037 | Elapse: 168.17s
Epoch 6 - Train Loss: 0.0471 - Valid Loss: 0.6357 - Elapsed Time: 885.22s
- Epoch 6: Best model found with loss = 0.6357.
Epoch 7 [0/559] | Train Loss: 0.0513 Grad: 126208.9375 LR: 4.1258e-06 | Elapse: 1.46s
Epoch 7 [500/559] | Train Loss: 0.0387 Grad: 2829.2466 LR: 2.6560e-06 | Elapse: 634.97s
Epoch 7 [558/559] | Train Loss: 0.0369 Grad: 3826.6626 LR: 2.4976e-06 | Elapse: 705.57s
Epoch 7 [0/140] | Valid Loss: 0.0033 | Elapse: 0.98s
Epoch 7 [139/140] | Valid Loss: 0.2071 | Elapse: 172.22s
Epoch 7 - Train Loss: 0.0369 - Valid Loss: 0.6424 - Elapsed Time: 896.53s
- Epoch 7: Best model found with loss = 0.6424.
Epoch 8 [0/559] | Train Loss: 0.0380 Grad: 107768.7891 LR: 2.4949e-06 | Elapse: 1.63s
Epoch 8 [500/559] | Train Loss: 0.0336 Grad: 2959.4180 LR: 1.2869e-06 | Elapse: 626.93s
Epoch 8 [558/559] | Train Loss: 0.0322 Grad: 3683.2998 LR: 1.1681e-06 | Elapse: 702.47s
Epoch 8 [0/140] | Valid Loss: 0.0034 | Elapse: 1.14s
Epoch 8 [139/140] | Valid Loss: 0.2092 | Elapse: 171.83s
Epoch 8 - Train Loss: 0.0322 - Valid Loss: 0.6436 - Elapsed Time: 892.56s
- Epoch 8: Best model found with loss = 0.6436.
Epoch 9 [0/559] | Train Loss: 0.0356 Grad: 110887.7266 LR: 1.1661e-06 | Elapse: 1.26s
Epoch 9 [500/559] | Train Loss: 0.0349 Grad: 2969.2019 LR: 3.6575e-07 | Elapse: 618.59s
Epoch 9 [558/559] | Train Loss: 0.0333 Grad: 3657.1890 LR: 3.0086e-07 | Elapse: 689.52s
Epoch 9 [0/140] | Valid Loss: 0.0034 | Elapse: 0.85s
Epoch 9 [139/140] | Valid Loss: 0.2080 | Elapse: 169.88s
Epoch 9 - Train Loss: 0.0333 - Valid Loss: 0.6454 - Elapsed Time: 877.97s
- Epoch 9: Best model found with loss = 0.6454.
Epoch 10 [0/559] | Train Loss: 0.0413 Grad: 124596.9844 LR: 2.9979e-07 | Elapse: 1.29s
Epoch 10 [500/559] | Train Loss: 0.0474 Grad: 3126.1436 LR: 3.5668e-09 | Elapse: 627.28s
Epoch 10 [558/559] | Train Loss: 0.0448 Grad: 4568.4751 LR: 4.0097e-10 | Elapse: 698.02s
Epoch 10 [0/140] | Valid Loss: 0.0033 | Elapse: 1.65s
Epoch 10 [139/140] | Valid Loss: 0.2082 | Elapse: 171.99s
Epoch 10 - Train Loss: 0.0448 - Valid Loss: 0.6580 - Elapsed Time: 888.30s
- Epoch 10: Best model found with loss = 0.6580.
Fold 2 | Time: 148.58min | Overall Evaluation Loss: 0.5356
Epoch 1 [0/559] | Train Loss: 0.3735 Grad: 136774.1406 LR: 4.0008e-07 | Elapse: 1.12s
Epoch 1 [500/559] | Train Loss: 0.1727 Grad: 19389.6543 LR: 9.7549e-06 | Elapse: 623.82s
Epoch 1 [558/559] | Train Loss: 0.1621 Grad: 33160.3281 LR: 1.0000e-05 | Elapse: 697.46s
Epoch 1 [0/140] | Valid Loss: 0.0017 | Elapse: 1.14s
Epoch 1 [139/140] | Valid Loss: 0.1746 | Elapse: 169.70s
Epoch 1 - Train Loss: 0.1621 - Valid Loss: 0.5274 - Elapsed Time: 887.75s
- Epoch 1: Best model found with loss = 0.5274.
Epoch 2 [0/559] | Train Loss: 0.3857 Grad: 82156.1875 LR: 1.0000e-05 | Elapse: 1.27s
Epoch 2 [500/559] | Train Loss: 0.1630 Grad: 29308.9199 LR: 9.7564e-06 | Elapse: 623.37s
Epoch 2 [558/559] | Train Loss: 0.1524 Grad: 44503.8945 LR: 9.6974e-06 | Elapse: 693.21s
Epoch 2 [0/140] | Valid Loss: 0.0018 | Elapse: 1.15s
Epoch 2 [139/140] | Valid Loss: 0.1843 | Elapse: 176.11s
Epoch 2 - Train Loss: 0.1524 - Valid Loss: 0.5781 - Elapsed Time: 889.88s
- Epoch 2: Best model found with loss = 0.5781.
Epoch 3 [0/559] | Train Loss: 0.3332 Grad: 135450.9531 LR: 9.6963e-06 | Elapse: 1.49s
Epoch 3 [500/559] | Train Loss: 0.1318 Grad: 32993.6094 LR: 8.9422e-06 | Elapse: 622.89s
Epoch 3 [558/559] | Train Loss: 0.1228 Grad: 51153.7461 LR: 8.8283e-06 | Elapse: 691.03s
Epoch 3 [0/140] | Valid Loss: 0.0020 | Elapse: 1.04s
Epoch 3 [139/140] | Valid Loss: 0.1926 | Elapse: 168.78s
Epoch 3 - Train Loss: 0.1228 - Valid Loss: 0.6165 - Elapsed Time: 880.74s
- Epoch 3: Best model found with loss = 0.6165.
Epoch 4 [0/559] | Train Loss: 0.2050 Grad: 158852.4688 LR: 8.8263e-06 | Elapse: 1.24s
Epoch 4 [500/559] | Train Loss: 0.0946 Grad: 32502.8730 LR: 7.6526e-06 | Elapse: 611.44s
Epoch 4 [558/559] | Train Loss: 0.0882 Grad: 52789.3359 LR: 7.4974e-06 | Elapse: 684.08s
Epoch 4 [0/140] | Valid Loss: 0.0021 | Elapse: 1.26s
Epoch 4 [139/140] | Valid Loss: 0.2005 | Elapse: 173.50s
Epoch 4 - Train Loss: 0.0882 - Valid Loss: 0.6403 - Elapsed Time: 878.81s
- Epoch 4: Best model found with loss = 0.6403.
Epoch 5 [0/559] | Train Loss: 0.1045 Grad: 160419.8594 LR: 7.4947e-06 | Elapse: 1.13s
Epoch 5 [500/559] | Train Loss: 0.0674 Grad: 33515.8281 LR: 6.0431e-06 | Elapse: 622.62s
Epoch 5 [558/559] | Train Loss: 0.0630 Grad: 48679.0625 LR: 5.8653e-06 | Elapse: 694.96s
Epoch 5 [0/140] | Valid Loss: 0.0022 | Elapse: 1.26s
Epoch 5 [139/140] | Valid Loss: 0.2054 | Elapse: 174.00s
Epoch 5 - Train Loss: 0.0630 - Valid Loss: 0.6581 - Elapsed Time: 889.52s
- Epoch 5: Best model found with loss = 0.6581.
Epoch 6 [0/559] | Train Loss: 0.0513 Grad: 123881.2109 LR: 5.8623e-06 | Elapse: 1.20s
Epoch 6 [500/559] | Train Loss: 0.0489 Grad: 34166.4883 LR: 4.3078e-06 | Elapse: 619.33s
Epoch 6 [558/559] | Train Loss: 0.0459 Grad: 46318.1602 LR: 4.1289e-06 | Elapse: 692.04s
Epoch 6 [0/140] | Valid Loss: 0.0022 | Elapse: 1.06s
Epoch 6 [139/140] | Valid Loss: 0.2085 | Elapse: 175.60s
Epoch 6 - Train Loss: 0.0459 - Valid Loss: 0.6727 - Elapsed Time: 888.27s
- Epoch 6: Best model found with loss = 0.6727.
Epoch 7 [0/559] | Train Loss: 0.0245 Grad: 69471.7734 LR: 4.1258e-06 | Elapse: 1.23s
Epoch 7 [500/559] | Train Loss: 0.0379 Grad: 33260.8320 LR: 2.6560e-06 | Elapse: 633.33s
Epoch 7 [558/559] | Train Loss: 0.0358 Grad: 43805.9805 LR: 2.4976e-06 | Elapse: 707.50s
Epoch 7 [0/140] | Valid Loss: 0.0023 | Elapse: 1.22s
Epoch 7 [139/140] | Valid Loss: 0.2125 | Elapse: 173.57s
Epoch 7 - Train Loss: 0.0358 - Valid Loss: 0.6797 - Elapsed Time: 901.75s
- Epoch 7: Best model found with loss = 0.6797.
Epoch 8 [0/559] | Train Loss: 0.0170 Grad: 45662.7891 LR: 2.4949e-06 | Elapse: 1.28s
Epoch 8 [500/559] | Train Loss: 0.0332 Grad: 33284.9766 LR: 1.2869e-06 | Elapse: 636.77s
Epoch 8 [558/559] | Train Loss: 0.0315 Grad: 45330.4883 LR: 1.1681e-06 | Elapse: 709.71s
Epoch 8 [0/140] | Valid Loss: 0.0023 | Elapse: 1.45s
Epoch 8 [139/140] | Valid Loss: 0.2158 | Elapse: 172.70s
Epoch 8 - Train Loss: 0.0315 - Valid Loss: 0.6806 - Elapsed Time: 903.01s
- Epoch 8: Best model found with loss = 0.6806.
Epoch 9 [0/559] | Train Loss: 0.0181 Grad: 55811.3711 LR: 1.1661e-06 | Elapse: 1.26s
Epoch 9 [500/559] | Train Loss: 0.0337 Grad: 36090.6758 LR: 3.6575e-07 | Elapse: 622.66s
Epoch 9 [558/559] | Train Loss: 0.0319 Grad: 40806.4766 LR: 3.0086e-07 | Elapse: 695.00s
Epoch 9 [0/140] | Valid Loss: 0.0024 | Elapse: 1.55s
Epoch 9 [139/140] | Valid Loss: 0.2160 | Elapse: 172.99s
Epoch 9 - Train Loss: 0.0319 - Valid Loss: 0.6900 - Elapsed Time: 888.59s
- Epoch 9: Best model found with loss = 0.6900.
Epoch 10 [0/559] | Train Loss: 0.0291 Grad: 108929.7500 LR: 2.9979e-07 | Elapse: 1.67s
Epoch 10 [500/559] | Train Loss: 0.0408 Grad: 33068.8359 LR: 3.5668e-09 | Elapse: 628.66s
Epoch 10 [558/559] | Train Loss: 0.0381 Grad: 40680.0781 LR: 4.0097e-10 | Elapse: 701.00s
Epoch 10 [0/140] | Valid Loss: 0.0026 | Elapse: 1.65s
Epoch 10 [139/140] | Valid Loss: 0.2175 | Elapse: 172.10s
Epoch 10 - Train Loss: 0.0381 - Valid Loss: 0.6948 - Elapsed Time: 893.76s
- Epoch 10: Best model found with loss = 0.6948.
Fold 3 | Time: 149.63min | Overall Evaluation Loss: 0.4956
Epoch 1 [0/559] | Train Loss: 0.0050 Grad: 2809.6936 LR: 4.0008e-07 | Elapse: 1.47s
Epoch 1 [500/559] | Train Loss: 0.1740 Grad: 374.4365 LR: 9.7549e-06 | Elapse: 619.57s
Epoch 1 [558/559] | Train Loss: 0.1637 Grad: 36396.9766 LR: 1.0000e-05 | Elapse: 689.00s
Epoch 1 [0/140] | Valid Loss: 0.4124 | Elapse: 1.45s
Epoch 1 [139/140] | Valid Loss: 0.1685 | Elapse: 171.89s
Epoch 1 - Train Loss: 0.1637 - Valid Loss: 0.5389 - Elapsed Time: 881.37s
- Epoch 1: Best model found with loss = 0.5389.
Epoch 2 [0/559] | Train Loss: 0.0050 Grad: 1995.7759 LR: 1.0000e-05 | Elapse: 1.59s
Epoch 2 [500/559] | Train Loss: 0.1633 Grad: 583.9670 LR: 9.7564e-06 | Elapse: 624.89s
Epoch 2 [558/559] | Train Loss: 0.1530 Grad: 46425.1641 LR: 9.6974e-06 | Elapse: 694.86s
Epoch 2 [0/140] | Valid Loss: 0.4686 | Elapse: 1.01s
Epoch 2 [139/140] | Valid Loss: 0.1789 | Elapse: 167.87s
Epoch 2 - Train Loss: 0.1530 - Valid Loss: 0.5844 - Elapsed Time: 882.92s
- Epoch 2: Best model found with loss = 0.5844.
Epoch 3 [0/559] | Train Loss: 0.0053 Grad: 3130.1858 LR: 9.6963e-06 | Elapse: 1.07s
Epoch 3 [500/559] | Train Loss: 0.1322 Grad: 783.8658 LR: 8.9422e-06 | Elapse: 627.07s
Epoch 3 [558/559] | Train Loss: 0.1232 Grad: 45816.0273 LR: 8.8283e-06 | Elapse: 699.61s
Epoch 3 [0/140] | Valid Loss: 0.4931 | Elapse: 1.25s
Epoch 3 [139/140] | Valid Loss: 0.1861 | Elapse: 167.99s
Epoch 3 - Train Loss: 0.1232 - Valid Loss: 0.6180 - Elapsed Time: 887.79s
- Epoch 3: Best model found with loss = 0.6180.
Epoch 4 [0/559] | Train Loss: 0.0056 Grad: 4049.7507 LR: 8.8263e-06 | Elapse: 1.48s
Epoch 4 [500/559] | Train Loss: 0.0952 Grad: 915.9907 LR: 7.6526e-06 | Elapse: 621.37s
Epoch 4 [558/559] | Train Loss: 0.0887 Grad: 42097.1250 LR: 7.4974e-06 | Elapse: 692.63s
Epoch 4 [0/140] | Valid Loss: 0.4977 | Elapse: 1.44s
Epoch 4 [139/140] | Valid Loss: 0.1917 | Elapse: 166.80s
Epoch 4 - Train Loss: 0.0887 - Valid Loss: 0.6386 - Elapsed Time: 879.67s
- Epoch 4: Best model found with loss = 0.6386.
Epoch 5 [0/559] | Train Loss: 0.0056 Grad: 4627.5327 LR: 7.4947e-06 | Elapse: 1.31s
Epoch 5 [500/559] | Train Loss: 0.0673 Grad: 1042.5446 LR: 6.0431e-06 | Elapse: 623.91s
Epoch 5 [558/559] | Train Loss: 0.0628 Grad: 39756.8047 LR: 5.8653e-06 | Elapse: 695.74s
Epoch 5 [0/140] | Valid Loss: 0.4978 | Elapse: 1.65s
Epoch 5 [139/140] | Valid Loss: 0.1959 | Elapse: 172.59s
Epoch 5 - Train Loss: 0.0628 - Valid Loss: 0.6606 - Elapsed Time: 888.42s
- Epoch 5: Best model found with loss = 0.6606.
Epoch 6 [0/559] | Train Loss: 0.0055 Grad: 4887.3267 LR: 5.8623e-06 | Elapse: 1.38s
Epoch 6 [500/559] | Train Loss: 0.0492 Grad: 1069.9318 LR: 4.3078e-06 | Elapse: 619.50s
Epoch 6 [558/559] | Train Loss: 0.0460 Grad: 38461.5625 LR: 4.1289e-06 | Elapse: 692.72s
Epoch 6 [0/140] | Valid Loss: 0.5020 | Elapse: 1.05s
Epoch 6 [139/140] | Valid Loss: 0.1990 | Elapse: 174.79s
Epoch 6 - Train Loss: 0.0460 - Valid Loss: 0.6746 - Elapsed Time: 887.61s
- Epoch 6: Best model found with loss = 0.6746.
Epoch 7 [0/559] | Train Loss: 0.0054 Grad: 5169.7212 LR: 4.1258e-06 | Elapse: 1.07s
Epoch 7 [500/559] | Train Loss: 0.0381 Grad: 1063.5841 LR: 2.6560e-06 | Elapse: 621.07s
Epoch 7 [558/559] | Train Loss: 0.0359 Grad: 35426.7031 LR: 2.4976e-06 | Elapse: 693.61s
Epoch 7 [0/140] | Valid Loss: 0.5056 | Elapse: 1.28s
Epoch 7 [139/140] | Valid Loss: 0.2010 | Elapse: 169.21s
Epoch 7 - Train Loss: 0.0359 - Valid Loss: 0.6811 - Elapsed Time: 883.41s
- Epoch 7: Best model found with loss = 0.6811.
Epoch 8 [0/559] | Train Loss: 0.0054 Grad: 5201.8013 LR: 2.4949e-06 | Elapse: 1.16s
Epoch 8 [500/559] | Train Loss: 0.0335 Grad: 1033.7025 LR: 1.2869e-06 | Elapse: 621.26s
Epoch 8 [558/559] | Train Loss: 0.0316 Grad: 32125.7207 LR: 1.1681e-06 | Elapse: 691.80s
Epoch 8 [0/140] | Valid Loss: 0.5071 | Elapse: 1.45s
Epoch 8 [139/140] | Valid Loss: 0.2006 | Elapse: 174.00s
Epoch 8 - Train Loss: 0.0316 - Valid Loss: 0.6861 - Elapsed Time: 885.99s
- Epoch 8: Best model found with loss = 0.6861.
Epoch 9 [0/559] | Train Loss: 0.0054 Grad: 5315.4302 LR: 1.1661e-06 | Elapse: 1.47s
Epoch 9 [500/559] | Train Loss: 0.0337 Grad: 1095.0151 LR: 3.6575e-07 | Elapse: 622.67s
Epoch 9 [558/559] | Train Loss: 0.0319 Grad: 27265.7305 LR: 3.0086e-07 | Elapse: 694.01s
Epoch 9 [0/140] | Valid Loss: 0.4932 | Elapse: 1.35s
Epoch 9 [139/140] | Valid Loss: 0.1994 | Elapse: 174.70s
Epoch 9 - Train Loss: 0.0319 - Valid Loss: 0.6887 - Elapsed Time: 888.81s
- Epoch 9: Best model found with loss = 0.6887.
Epoch 10 [0/559] | Train Loss: 0.0052 Grad: 5499.5928 LR: 2.9979e-07 | Elapse: 1.36s
Epoch 10 [500/559] | Train Loss: 0.0392 Grad: 1228.2296 LR: 3.5668e-09 | Elapse: 626.25s
Epoch 10 [558/559] | Train Loss: 0.0367 Grad: 28973.5898 LR: 4.0097e-10 | Elapse: 696.89s
Epoch 10 [0/140] | Valid Loss: 0.5141 | Elapse: 1.16s
Epoch 10 [139/140] | Valid Loss: 0.2049 | Elapse: 174.49s
Epoch 10 - Train Loss: 0.0367 - Valid Loss: 0.6837 - Elapsed Time: 891.96s
Fold 4 | Time: 149.09min | Overall Evaluation Loss: 0.4522