diff --git "a/wandb/run-20220303_033953-1eigbhyo/files/output.log" "b/wandb/run-20220303_033953-1eigbhyo/files/output.log" --- "a/wandb/run-20220303_033953-1eigbhyo/files/output.log" +++ "b/wandb/run-20220303_033953-1eigbhyo/files/output.log" @@ -2303,3 +2303,1529 @@ [INFO|feature_extraction_utils.py:324] 2022-03-03 04:44:38,919 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonerations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|feature_extraction_utils.py:324] 2022-03-03 04:44:38,919 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonerations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 03/03/2022 04:46:11 - WARNING - huggingface_hub.repository - Adding files tracked by Git LFS: ['wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb']. This may take a bit of time if the files are large. 
+[INFO|feature_extraction_utils.py:324] 2022-03-03 04:44:38,919 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonerations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[INFO|feature_extraction_utils.py:324] 2022-03-03 04:44:38,919 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonerations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:46:45,669 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 56%|██████████████████████████████████████████ | 501/892 [1:06:52<35:26:29, 326.32s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 56%|██████████████████████████████████████████ | 501/892 [1:06:52<35:26:29, 326.32s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 6.2615, 'learning_rate': 0.000998, 'epoch': 0.56} + 56%|██████████████████████████████████████████ | 501/892 [1:06:52<35:26:29, 326.32s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 56%|██████████████████████████████████████████ | 501/892 [1:06:52<35:26:29, 326.32s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 56%|██████████████████████████████████████████▏ | 502/892 [1:07:00<24:59:45, 230.73s/it]g-point 
operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 56%|██████████████████████████████████████████▏ | 502/892 [1:07:00<24:59:45, 230.73s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9686, 'learning_rate': 0.001, 'epoch': 0.56} + 56%|██████████████████████████████████████████▏ | 502/892 [1:07:00<24:59:45, 230.73s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 56%|██████████████████████████████████████████▏ | 502/892 [1:07:00<24:59:45, 230.73s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 56%|██████████████████████████████████████████▎ | 503/892 [1:07:07<17:41:58, 163.80s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 56%|██████████████████████████████████████████▎ | 503/892 [1:07:07<17:41:58, 163.80s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.2496, 'learning_rate': 0.0009974489795918369, 'epoch': 0.56} + 56%|██████████████████████████████████████████▎ | 503/892 [1:07:07<17:41:58, 163.80s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:47:10,289 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:47:10,289 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.4162, 'learning_rate': 0.0009948979591836735, 'epoch': 0.57} +[WARNING|modeling_utils.py:388] 2022-03-03 04:47:10,289 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:47:10,289 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 57%|███████████████████████████████████████████▌ | 505/892 [1:07:22<9:01:48, 84.00s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 57%|███████████████████████████████████████████▌ | 505/892 [1:07:22<9:01:48, 84.00s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.3041, 'learning_rate': 0.00099234693877551, 'epoch': 0.57} + 57%|███████████████████████████████████████████▌ | 505/892 [1:07:22<9:01:48, 84.00s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 57%|███████████████████████████████████████████▌ | 505/892 [1:07:22<9:01:48, 84.00s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed + 57%|███████████████████████████████████████████▌ | 505/892 [1:07:22<9:01:48, 84.00s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 57%|███████████████████████████████████████████▋ | 506/892 [1:07:29<6:32:37, 61.03s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 57%|███████████████████████████████████████████▋ | 506/892 [1:07:29<6:32:37, 61.03s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:47:30,497 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 57%|███████████████████████████████████████████▊ | 507/892 [1:07:37<4:48:05, 44.90s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 57%|███████████████████████████████████████████▊ | 507/892 [1:07:37<4:48:05, 44.90s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.3472, 'learning_rate': 0.0009872448979591838, 'epoch': 0.57} + 57%|███████████████████████████████████████████▊ | 507/892 [1:07:37<4:48:05, 44.90s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 57%|███████████████████████████████████████████▊ | 507/892 [1:07:37<4:48:05, 44.90s/it]g-point 
operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 57%|███████████████████████████████████████████▊ | 508/892 [1:07:44<3:34:55, 33.58s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 57%|███████████████████████████████████████████▊ | 508/892 [1:07:44<3:34:55, 33.58s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.1626, 'learning_rate': 0.0009846938775510204, 'epoch': 0.57} +[WARNING|modeling_utils.py:388] 2022-03-03 04:47:44,895 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 57%|███████████████████████████████████████████▉ | 509/892 [1:07:51<2:43:50, 25.67s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 57%|███████████████████████████████████████████▉ | 509/892 [1:07:51<2:43:50, 25.67s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.3774, 'learning_rate': 0.0009821428571428572, 'epoch': 0.57} + 57%|███████████████████████████████████████████▉ | 509/892 [1:07:51<2:43:50, 25.67s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 57%|███████████████████████████████████████████▉ | 509/892 [1:07:51<2:43:50, 25.67s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed + 57%|████████████████████████████████████████████ | 510/892 [1:07:58<2:08:08, 20.13s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 57%|████████████████████████████████████████████ | 510/892 [1:07:58<2:08:08, 20.13s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:47:57,402 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:47:57,402 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 57%|████████████████████████████████████████████ | 511/892 [1:08:05<1:43:04, 16.23s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 57%|████████████████████████████████████████████ | 511/892 [1:08:05<1:43:04, 16.23s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.1559, 'learning_rate': 0.0009770408163265307, 'epoch': 0.57} + 57%|████████████████████████████████████████████ | 511/892 [1:08:05<1:43:04, 16.23s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 
57%|████████████████████████████████████████████ | 511/892 [1:08:05<1:43:04, 16.23s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 57%|████████████████████████████████████████████▏ | 512/892 [1:08:13<1:25:26, 13.49s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 57%|████████████████████████████████████████████▏ | 512/892 [1:08:13<1:25:26, 13.49s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.0161, 'learning_rate': 0.0009744897959183674, 'epoch': 0.57} +[WARNING|modeling_utils.py:388] 2022-03-03 04:48:13,349 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|████████████████████████████████████████████▎ | 513/892 [1:08:20<1:12:48, 11.53s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|████████████████████████████████████████████▎ | 513/892 [1:08:20<1:12:48, 11.53s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.6675, 'learning_rate': 0.0009719387755102041, 'epoch': 0.58} + 58%|████████████████████████████████████████████▎ | 513/892 [1:08:20<1:12:48, 11.53s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|████████████████████████████████████████████▎ | 513/892 
[1:08:20<1:12:48, 11.53s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|████████████████████████████████████████████▎ | 514/892 [1:08:26<1:03:57, 10.15s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|████████████████████████████████████████████▎ | 514/892 [1:08:26<1:03:57, 10.15s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:48:25,476 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:48:25,476 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|█████████████████████████████████████████████▌ | 515/892 [1:08:33<57:37, 9.17s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|█████████████████████████████████████████████▌ | 515/892 [1:08:33<57:37, 9.17s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.2748, 'learning_rate': 0.0009668367346938776, 'epoch': 0.58} +[WARNING|modeling_utils.py:388] 2022-03-03 04:48:34,070 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|█████████████████████████████████████████████▋ | 516/892 [1:08:40<53:13, 8.49s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|█████████████████████████████████████████████▋ | 516/892 [1:08:40<53:13, 8.49s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.033, 'learning_rate': 0.0009642857142857143, 'epoch': 0.58} + 58%|█████████████████████████████████████████████▋ | 516/892 [1:08:40<53:13, 8.49s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|█████████████████████████████████████████████▋ | 516/892 [1:08:40<53:13, 8.49s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|█████████████████████████████████████████████▊ | 517/892 [1:08:47<50:00, 8.00s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|█████████████████████████████████████████████▊ | 517/892 [1:08:47<50:00, 8.00s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.2472, 'learning_rate': 0.0009617346938775511, 'epoch': 0.58} +[WARNING|modeling_utils.py:388] 2022-03-03 04:48:48,037 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will 
not be computed + 58%|█████████████████████████████████████████████▉ | 518/892 [1:08:54<48:13, 7.74s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|█████████████████████████████████████████████▉ | 518/892 [1:08:54<48:13, 7.74s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.1146, 'learning_rate': 0.0009591836734693877, 'epoch': 0.58} + 58%|█████████████████████████████████████████████▉ | 518/892 [1:08:54<48:13, 7.74s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|█████████████████████████████████████████████▉ | 518/892 [1:08:54<48:13, 7.74s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|█████████████████████████████████████████████▉ | 518/892 [1:08:54<48:13, 7.74s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|█████████████████████████████████████████████▉ | 519/892 [1:09:01<46:49, 7.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:48:58,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|█████████████████████████████████████████████▉ | 519/892 [1:09:01<46:49, 7.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:48:58,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|█████████████████████████████████████████████▉ | 519/892 [1:09:01<46:49, 7.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:48:58,570 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed + 58%|█████████████████████████████████████████████▉ | 519/892 [1:09:01<46:49, 7.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:48:58,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|██████████████████████████████████████████████ | 520/892 [1:09:08<45:12, 7.29s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:48:58,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|██████████████████████████████████████████████ | 520/892 [1:09:08<45:12, 7.29s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:48:58,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:49:08,655 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:48:58,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:49:08,655 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:48:58,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|██████████████████████████████████████████████▏ | 521/892 [1:09:15<44:05, 7.13s/it]g-point operations will not be computed-03 04:48:58,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|██████████████████████████████████████████████▏ | 521/892 [1:09:15<44:05, 7.13s/it]g-point operations will not be computed-03 04:48:58,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|██████████████████████████████████████████████▏ | 521/892 [1:09:15<44:05, 7.13s/it]g-point operations will not 
be computed-03 04:48:58,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 58%|██████████████████████████████████████████████▏ | 521/892 [1:09:15<44:05, 7.13s/it]g-point operations will not be computed-03 04:48:58,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 59%|██████████████████████████████████████████████▏ | 522/892 [1:09:21<43:02, 6.98s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 59%|██████████████████████████████████████████████▏ | 522/892 [1:09:21<43:02, 6.98s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.0594, 'learning_rate': 0.0009489795918367348, 'epoch': 0.59} + 59%|██████████████████████████████████████████████▏ | 522/892 [1:09:21<43:02, 6.98s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 59%|██████████████████████████████████████████████▎ | 523/892 [1:09:28<42:01, 6.83s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 59%|██████████████████████████████████████████████▎ | 523/892 [1:09:28<42:01, 6.83s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:49:26,680 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
+[WARNING|modeling_utils.py:388] 2022-03-03 04:49:26,680 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 59%|██████████████████████████████████████████████▍ | 524/892 [1:09:34<41:13, 6.72s/it]g-point operations will not be computed-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 59%|██████████████████████████████████████████████▍ | 524/892 [1:09:34<41:13, 6.72s/it]g-point operations will not be computed-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:49:33,064 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:49:33,064 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:49:33,064 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 59%|██████████████████████████████████████████████▍ | 525/892 [1:09:41<41:23, 6.77s/it]g-point operations will not be computed-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 59%|██████████████████████████████████████████████▍ | 525/892 [1:09:41<41:23, 6.77s/it]g-point operations will not be 
computed-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 59%|██████████████████████████████████████████████▍ | 525/892 [1:09:41<41:23, 6.77s/it]g-point operations will not be computed-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 59%|██████████████████████████████████████████████▍ | 525/892 [1:09:41<41:23, 6.77s/it]g-point operations will not be computed-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:49:43,191 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:49:43,191 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:49:43,191 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:49:49,526 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:49:49,526 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed +{'loss': 5.1616, 'learning_rate': 0.0009362244897959184, 'epoch': 0.59} +[WARNING|modeling_utils.py:388] 2022-03-03 04:49:49,526 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:49:55,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:49:55,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.1956, 'learning_rate': 0.0009336734693877551, 'epoch': 0.59} +[WARNING|modeling_utils.py:388] 2022-03-03 04:49:55,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:49:55,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:49:55,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 59%|██████████████████████████████████████████████▊ | 529/892 [1:10:06<38:41, 6.40s/it][WARNING|modeling_utils.py:388] 2022-03-03 
04:50:03,594 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 59%|██████████████████████████████████████████████▊ | 529/892 [1:10:06<38:41, 6.40s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:50:03,594 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 59%|██████████████████████████████████████████████▊ | 529/892 [1:10:06<38:41, 6.40s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:50:03,594 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 59%|██████████████████████████████████████████████▊ | 529/892 [1:10:06<38:41, 6.40s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:50:03,594 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 59%|██████████████████████████████████████████████▉ | 530/892 [1:10:13<38:09, 6.33s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:50:09,807 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 59%|██████████████████████████████████████████████▉ | 530/892 [1:10:13<38:09, 6.33s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:50:09,807 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 59%|██████████████████████████████████████████████▉ | 530/892 [1:10:13<38:09, 6.33s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:50:09,807 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 59%|██████████████████████████████████████████████▉ | 530/892 [1:10:13<38:09, 6.33s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:50:09,807 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 60%|███████████████████████████████████████████████ | 531/892 [1:10:19<37:46, 6.28s/it][WARNING|modeling_utils.py:388] 2022-03-03 
04:50:15,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 60%|███████████████████████████████████████████████ | 531/892 [1:10:19<37:46, 6.28s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:50:15,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:50:20,412 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:50:15,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:50:20,412 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:50:15,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.8953, 'learning_rate': 0.000923469387755102, 'epoch': 0.6} +[WARNING|modeling_utils.py:388] 2022-03-03 04:50:20,412 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:50:15,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:50:26,393 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:50:15,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:50:26,393 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:50:15,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9381, 'learning_rate': 0.0009209183673469387, 'epoch': 0.6} +[WARNING|modeling_utils.py:388] 2022-03-03 04:50:26,393 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed-03 04:50:15,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:50:32,176 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:50:15,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:50:32,176 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:50:15,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9541, 'learning_rate': 0.0009183673469387756, 'epoch': 0.6} +[WARNING|modeling_utils.py:388] 2022-03-03 04:50:36,524 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:50:15,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:50:36,524 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:50:15,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 60%|███████████████████████████████████████████████▍ | 535/892 [1:10:42<35:26, 5.96s/it]g-point operations will not be computed-03 04:50:15,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:50:40,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:50:15,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:50:40,794 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed-03 04:50:15,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:50:40,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:50:15,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 60%|███████████████████████████████████████████████▍ | 536/892 [1:10:48<34:45, 5.86s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:50:45,050 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 60%|███████████████████████████████████████████████▍ | 536/892 [1:10:48<34:45, 5.86s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:50:45,050 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:50:49,153 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:50:45,050 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:50:49,153 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:50:45,050 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.3974, 'learning_rate': 0.0009107142857142857, 'epoch': 0.6} +[WARNING|modeling_utils.py:388] 2022-03-03 04:50:53,284 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:50:45,050 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 60%|███████████████████████████████████████████████▋ | 538/892 [1:10:59<33:29, 
5.68s/it]g-point operations will not be computed-03 04:50:45,050 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 60%|███████████████████████████████████████████████▋ | 538/892 [1:10:59<33:29, 5.68s/it]g-point operations will not be computed-03 04:50:45,050 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:50:57,284 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:50:45,050 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:50:57,284 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:50:45,050 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:50:57,284 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:50:45,050 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 60%|███████████████████████████████████████████████▋ | 539/892 [1:11:04<32:30, 5.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 60%|███████████████████████████████████████████████▋ | 539/892 [1:11:04<32:30, 5.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 60%|███████████████████████████████████████████████▋ | 539/892 [1:11:04<32:30, 5.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not 
be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:04,663 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:07,029 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:09,186 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:09,186 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:11,377 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:13,408 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:13,408 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
+[WARNING|modeling_utils.py:388] 2022-03-03 04:51:15,475 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:17,332 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:17,332 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:19,221 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:20,931 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:20,931 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:22,620 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 
2022-03-03 04:51:22,620 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:25,742 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:27,168 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:27,168 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:28,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:28,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:31,307 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:31,307 >> Could 
not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:33,788 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:34,893 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:34,893 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:37,546 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:37,546 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.8482, 'learning_rate': 0.0008775510204081633, 'epoch': 0.62} +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:41,462 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
+[WARNING|modeling_utils.py:388] 2022-03-03 04:51:41,462 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:45,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:45,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.2635, 'learning_rate': 0.000875, 'epoch': 0.62} +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:48,934 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:48,934 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:48,934 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:52,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:52,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:52,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:59,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:59,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.2408, 'learning_rate': 0.0008698979591836736, 'epoch': 0.62} +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:59,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:59,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:51:59,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed + 62%|█████████████████████████████████████████████████ | 554/892 [1:12:12<35:12, 6.25s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 62%|█████████████████████████████████████████████████ | 554/892 [1:12:12<35:12, 6.25s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:52:12,494 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 62%|█████████████████████████████████████████████████▏ | 555/892 [1:12:19<36:36, 6.52s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 62%|█████████████████████████████████████████████████▏ | 555/892 [1:12:19<36:36, 6.52s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.0966, 'learning_rate': 0.0008647959183673469, 'epoch': 0.62} + 62%|█████████████████████████████████████████████████▏ | 555/892 [1:12:19<36:36, 6.52s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 62%|█████████████████████████████████████████████████▏ | 555/892 [1:12:19<36:36, 6.52s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 62%|█████████████████████████████████████████████████▏ 
| 555/892 [1:12:19<36:36, 6.52s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 62%|█████████████████████████████████████████████████▏ | 556/892 [1:12:26<37:35, 6.71s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 62%|█████████████████████████████████████████████████▏ | 556/892 [1:12:26<37:35, 6.71s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:52:26,739 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 62%|█████████████████████████████████████████████████▎ | 557/892 [1:12:33<38:04, 6.82s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 62%|█████████████████████████████████████████████████▎ | 557/892 [1:12:33<38:04, 6.82s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9667, 'learning_rate': 0.0008596938775510205, 'epoch': 0.62} + 62%|█████████████████████████████████████████████████▎ | 557/892 [1:12:33<38:04, 6.82s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 62%|█████████████████████████████████████████████████▎ | 557/892 [1:12:33<38:04, 6.82s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens 
of the input, floating-point operations will not be computed + 63%|█████████████████████████████████████████████████▍ | 558/892 [1:12:40<38:24, 6.90s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 63%|█████████████████████████████████████████████████▍ | 558/892 [1:12:40<38:24, 6.90s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.2674, 'learning_rate': 0.0008571428571428571, 'epoch': 0.63} +[WARNING|modeling_utils.py:388] 2022-03-03 04:52:40,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 63%|█████████████████████████████████████████████████▌ | 559/892 [1:12:47<38:34, 6.95s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 63%|█████████████████████████████████████████████████▌ | 559/892 [1:12:47<38:34, 6.95s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.0393, 'learning_rate': 0.0008545918367346938, 'epoch': 0.63} + 63%|█████████████████████████████████████████████████▌ | 559/892 [1:12:47<38:34, 6.95s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 63%|█████████████████████████████████████████████████▌ | 559/892 [1:12:47<38:34, 6.95s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
+ 63%|█████████████████████████████████████████████████▌ | 559/892 [1:12:47<38:34, 6.95s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 63%|█████████████████████████████████████████████████▌ | 560/892 [1:12:54<38:29, 6.96s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:52:51,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 63%|█████████████████████████████████████████████████▌ | 560/892 [1:12:54<38:29, 6.96s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:52:51,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 63%|█████████████████████████████████████████████████▌ | 560/892 [1:12:54<38:29, 6.96s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:52:51,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 63%|█████████████████████████████████████████████████▋ | 561/892 [1:13:01<38:14, 6.93s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:52:51,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 63%|█████████████████████████████████████████████████▋ | 561/892 [1:13:01<38:14, 6.93s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:52:51,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.1893, 'learning_rate': 0.0008494897959183674, 'epoch': 0.63} + 63%|█████████████████████████████████████████████████▋ | 561/892 [1:13:01<38:14, 6.93s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:52:51,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:53:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed-03 04:52:51,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:53:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:52:51,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.0583, 'learning_rate': 0.0008469387755102041, 'epoch': 0.63} +[WARNING|modeling_utils.py:388] 2022-03-03 04:53:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:52:51,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:53:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:52:51,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:53:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:52:51,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 63%|█████████████████████████████████████████████████▊ | 563/892 [1:13:15<37:41, 6.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 63%|█████████████████████████████████████████████████▊ | 563/892 [1:13:15<37:41, 6.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 63%|█████████████████████████████████████████████████▊ | 563/892 [1:13:15<37:41, 6.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:53:11,922 >> Could not estimate 
the number of tokens of the input, floating-point operations will not be computed + 63%|█████████████████████████████████████████████████▉ | 564/892 [1:13:21<37:20, 6.83s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 63%|█████████████████████████████████████████████████▉ | 564/892 [1:13:21<37:20, 6.83s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:53:20,330 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:53:20,330 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:53:20,330 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 63%|██████████████████████████████████████████████████ | 565/892 [1:13:28<37:08, 6.81s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 63%|██████████████████████████████████████████████████ | 565/892 [1:13:28<37:08, 6.81s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 63%|██████████████████████████████████████████████████ | 565/892 
[1:13:28<37:08, 6.81s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:53:30,448 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:53:30,448 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.06, 'learning_rate': 0.0008367346938775511, 'epoch': 0.63} +[WARNING|modeling_utils.py:388] 2022-03-03 04:53:30,448 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:53:30,448 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:53:30,448 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 64%|██████████████████████████████████████████████████▏ | 567/892 [1:13:42<36:40, 6.77s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:53:40,518 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:53:40,518 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:53:40,518 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 64%|██████████████████████████████████████████████████▎ | 568/892 [1:13:48<36:23, 6.74s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:53:47,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:53:47,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 64%|██████████████████████████████████████████████████▍ | 569/892 [1:13:55<35:59, 6.69s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 64%|██████████████████████████████████████████████████▍ | 569/892 [1:13:55<35:59, 6.69s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate 
the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9932, 'learning_rate': 0.0008290816326530613, 'epoch': 0.64} + 64%|██████████████████████████████████████████████████▍ | 569/892 [1:13:55<35:59, 6.69s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:53:56,897 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:53:56,897 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.1723, 'learning_rate': 0.000826530612244898, 'epoch': 0.64} +[WARNING|modeling_utils.py:388] 2022-03-03 04:53:56,897 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:54:03,452 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:54:03,452 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9847, 'learning_rate': 0.0008239795918367348, 'epoch': 0.64} +[WARNING|modeling_utils.py:388] 
2022-03-03 04:54:03,452 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:54:03,452 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:54:03,452 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 64%|██████████████████████████████████████████████████▋ | 572/892 [1:14:14<35:06, 6.58s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:54:13,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:54:13,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:54:13,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 64%|██████████████████████████████████████████████████▋ | 573/892 [1:14:21<34:46, 
6.54s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 64%|██████████████████████████████████████████████████▋ | 573/892 [1:14:21<34:46, 6.54s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:54:21,254 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:54:21,254 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 64%|██████████████████████████████████████████████████▊ | 574/892 [1:14:27<34:28, 6.51s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 64%|██████████████████████████████████████████████████▊ | 574/892 [1:14:27<34:28, 6.51s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:54:27,670 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:54:27,670 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:54:27,670 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.3417, 'learning_rate': 0.0008137755102040817, 'epoch': 0.64} +[WARNING|modeling_utils.py:388] 2022-03-03 04:54:27,670 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:54:27,670 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:54:27,670 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 65%|███████████████████████████████████████████████████ | 576/892 [1:14:41<34:33, 6.56s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:37,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 65%|███████████████████████████████████████████████████ | 576/892 [1:14:41<34:33, 6.56s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:37,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 65%|███████████████████████████████████████████████████ | 576/892 [1:14:41<34:33, 6.56s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:37,811 >> Could not estimate the number of tokens of the input, floating-point operations will not 
be computed + 65%|███████████████████████████████████████████████████ | 576/892 [1:14:41<34:33, 6.56s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:37,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 65%|███████████████████████████████████████████████████ | 577/892 [1:14:47<34:08, 6.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:44,148 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 65%|███████████████████████████████████████████████████ | 577/892 [1:14:47<34:08, 6.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:44,148 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 65%|███████████████████████████████████████████████████ | 577/892 [1:14:47<34:08, 6.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:44,148 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 65%|███████████████████████████████████████████████████ | 577/892 [1:14:47<34:08, 6.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:44,148 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 65%|███████████████████████████████████████████████████▏ | 578/892 [1:14:53<33:37, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:50,389 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 65%|███████████████████████████████████████████████████▏ | 578/892 [1:14:53<33:37, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:50,389 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 65%|███████████████████████████████████████████████████▏ | 578/892 [1:14:53<33:37, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:50,389 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed + 65%|███████████████████████████████████████████████████▏ | 578/892 [1:14:53<33:37, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:50,389 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 65%|███████████████████████████████████████████████████▎ | 579/892 [1:14:59<33:13, 6.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:56,623 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 65%|███████████████████████████████████████████████████▎ | 579/892 [1:14:59<33:13, 6.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:56,623 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 65%|███████████████████████████████████████████████████▎ | 579/892 [1:14:59<33:13, 6.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:56,623 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 65%|███████████████████████████████████████████████████▎ | 579/892 [1:14:59<33:13, 6.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:56,623 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 65%|███████████████████████████████████████████████████▎ | 580/892 [1:15:06<32:50, 6.31s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:55:02,772 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 65%|███████████████████████████████████████████████████▎ | 580/892 [1:15:06<32:50, 6.31s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:55:02,772 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 65%|███████████████████████████████████████████████████▎ | 580/892 [1:15:06<32:50, 6.31s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:55:02,772 >> Could not estimate 
the number of tokens of the input, floating-point operations will not be computed + 65%|███████████████████████████████████████████████████▎ | 580/892 [1:15:06<32:50, 6.31s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:55:02,772 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 65%|███████████████████████████████████████████████████▍ | 581/892 [1:15:12<32:23, 6.25s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:55:08,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 65%|███████████████████████████████████████████████████▍ | 581/892 [1:15:12<32:23, 6.25s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:55:08,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:55:13,296 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:08,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:55:13,296 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:08,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9729, 'learning_rate': 0.0007959183673469387, 'epoch': 0.65} +[WARNING|modeling_utils.py:388] 2022-03-03 04:55:13,296 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:08,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:55:19,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:08,885 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:55:19,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:08,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.4895, 'learning_rate': 0.0007933673469387756, 'epoch': 0.65} +[WARNING|modeling_utils.py:388] 2022-03-03 04:55:19,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:08,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:55:25,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:08,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:55:25,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:08,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.0296, 'learning_rate': 0.0007908163265306123, 'epoch': 0.65} +[WARNING|modeling_utils.py:388] 2022-03-03 04:55:29,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:08,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 66%|███████████████████████████████████████████████████▊ | 585/892 [1:15:35<30:19, 5.93s/it]g-point operations will not be computed-03 04:55:08,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 66%|███████████████████████████████████████████████████▊ | 585/892 [1:15:35<30:19, 5.93s/it]g-point operations will not be computed-03 
04:55:08,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:55:33,671 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:08,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:55:33,671 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:08,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:55:33,671 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:08,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 66%|███████████████████████████████████████████████████▉ | 586/892 [1:15:41<29:45, 5.83s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:55:37,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 66%|███████████████████████████████████████████████████▉ | 586/892 [1:15:41<29:45, 5.83s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:55:37,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 66%|███████████████████████████████████████████████████▉ | 586/892 [1:15:41<29:45, 5.83s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:55:37,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:55:41,978 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:37,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:55:41,978 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:37,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:55:46,070 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:37,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:55:46,070 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:37,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 66%|████████████████████████████████████████████████████ | 588/892 [1:15:52<28:38, 5.65s/it]g-point operations will not be computed-03 04:55:37,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:55:50,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:37,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:55:50,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:37,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:55:50,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:37,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 
66%|████████████████████████████████████████████████████▏ | 589/892 [1:15:57<27:54, 5.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:55:54,008 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 66%|████████████████████████████████████████████████████▏ | 589/892 [1:15:57<27:54, 5.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:55:54,008 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 66%|████████████████████████████████████████████████████▏ | 589/892 [1:15:57<27:54, 5.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:55:54,008 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:55:57,782 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:54,008 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:56:00,400 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:54,008 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:56:00,400 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:54,008 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:56:00,400 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:54,008 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 66%|████████████████████████████████████████████████████▎ | 591/892 [1:16:07<26:31, 
5.29s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:04,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:56:06,318 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:56:04,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:56:06,318 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:56:04,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 66%|████████████████████████████████████████████████████▍ | 592/892 [1:16:12<25:28, 5.09s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:08,612 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:56:10,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:56:08,612 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:56:10,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:56:08,612 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 66%|████████████████████████████████████████████████████▌ | 593/892 [1:16:16<24:17, 4.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:12,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:56:14,949 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:56:12,932 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:56:14,949 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:56:12,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 67%|████████████████████████████████████████████████████▌ | 594/892 [1:16:20<23:06, 4.65s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:16,929 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:56:18,711 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:56:16,929 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:56:18,711 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:56:16,929 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 67%|████████████████████████████████████████████████████▋ | 595/892 [1:16:24<21:33, 4.36s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:20,532 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 67%|████████████████████████████████████████████████████▊ | 596/892 [1:16:27<19:54, 4.04s/it]g-point operations will not be computed-03 04:56:20,532 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 67%|████████████████████████████████████████████████████▊ | 596/892 [1:16:27<19:54, 4.04s/it]g-point operations will not be computed-03 04:56:20,532 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 
04:56:25,160 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:56:23,719 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:56:25,160 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:56:23,719 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 67%|████████████████████████████████████████████████████▊ | 597/892 [1:16:30<18:15, 3.72s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:26,574 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 67%|████████████████████████████████████████████████████▉ | 598/892 [1:16:33<16:32, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:29,093 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 67%|████████████████████████████████████████████████████▉ | 598/892 [1:16:33<16:32, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:29,093 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 67%|█████████████████████████████████████████████████████ | 599/892 [1:16:35<14:54, 3.05s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:31,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 67%|█████████████████████████████████████████████████████ | 599/892 [1:16:35<14:54, 3.05s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:31,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:56:32,362 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:56:31,348 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:56:32,362 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:56:31,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 67%|█████████████████████████████████████████████████████▏ | 600/892 [1:16:38<14:17, 2.94s/it]g-point operations will not be computed-03 04:56:31,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 67%|█████████████████████████████████████████████████████▏ | 600/892 [1:16:38<14:17, 2.94s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:35,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:56:39,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:56:35,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 67%|█████████████████████████████████████████████████████▏ | 601/892 [1:16:45<21:06, 4.35s/it]g-point operations will not be computed-03 04:56:35,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 67%|█████████████████████████████████████████████████████▏ | 601/892 [1:16:45<21:06, 4.35s/it]g-point operations will not be computed-03 04:56:35,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 67%|█████████████████████████████████████████████████████▏ | 601/892 [1:16:45<21:06, 4.35s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:42,944 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 67%|█████████████████████████████████████████████████████▏ | 601/892 
[1:16:45<21:06, 4.35s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:42,944 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:56:46,572 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:56:42,944 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:56:46,572 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:56:42,944 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 67%|█████████████████████████████████████████████████████▎ | 602/892 [1:16:53<25:20, 5.24s/it]g-point operations will not be computed-03 04:56:42,944 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 67%|█████████████████████████████████████████████████████▎ | 602/892 [1:16:53<25:20, 5.24s/it]g-point operations will not be computed-03 04:56:42,944 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:56:53,869 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:56:42,944 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 68%|█████████████████████████████████████████████████████▍ | 603/892 [1:17:00<28:14, 5.86s/it]g-point operations will not be computed-03 04:56:42,944 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 68%|█████████████████████████████████████████████████████▍ | 603/892 [1:17:00<28:14, 5.86s/it]g-point operations will not be computed-03 04:56:42,944 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed +{'loss': 5.2736, 'learning_rate': 0.0007423469387755102, 'epoch': 0.68} + 68%|█████████████████████████████████████████████████████▍ | 603/892 [1:17:00<28:14, 5.86s/it]g-point operations will not be computed-03 04:56:42,944 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 68%|█████████████████████████████████████████████████████▍ | 603/892 [1:17:00<28:14, 5.86s/it]g-point operations will not be computed-03 04:56:42,944 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 68%|█████████████████████████████████████████████████████▍ | 604/892 [1:17:07<30:09, 6.28s/it]g-point operations will not be computed-03 04:56:42,944 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 68%|█████████████████████████████████████████████████████▍ | 604/892 [1:17:07<30:09, 6.28s/it]g-point operations will not be computed-03 04:56:42,944 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.1739, 'learning_rate': 0.0007397959183673469, 'epoch': 0.68} + 68%|█████████████████████████████████████████████████████▍ | 604/892 [1:17:07<30:09, 6.28s/it]g-point operations will not be computed-03 04:56:42,944 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 68%|█████████████████████████████████████████████████████▍ | 604/892 [1:17:07<30:09, 6.28s/it]g-point operations will not be computed-03 04:56:42,944 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 68%|█████████████████████████████████████████████████████▍ | 604/892 [1:17:07<30:09, 6.28s/it]g-point operations will not be computed-03 04:56:42,944 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 
68%|█████████████████████████████████████████████████████▌ | 605/892 [1:17:15<31:24, 6.57s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:57:12,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 68%|█████████████████████████████████████████████████████▌ | 605/892 [1:17:15<31:24, 6.57s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:57:12,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 68%|█████████████████████████████████████████████████████▌ | 605/892 [1:17:15<31:24, 6.57s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:57:12,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 68%|█████████████████████████████████████████████████████▋ | 606/892 [1:17:22<32:06, 6.73s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:57:12,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 68%|█████████████████████████████████████████████████████▋ | 606/892 [1:17:22<32:06, 6.73s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:57:12,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9385, 'learning_rate': 0.0007346938775510205, 'epoch': 0.68} + 68%|█████████████████████████████████████████████████████▋ | 606/892 [1:17:22<32:06, 6.73s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:57:12,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:57:24,305 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:57:12,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:57:24,305 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed-03 04:57:12,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.867, 'learning_rate': 0.0007321428571428571, 'epoch': 0.68} +[WARNING|modeling_utils.py:388] 2022-03-03 04:57:24,305 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:57:12,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:57:24,305 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:57:12,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 68%|█████████████████████████████████████████████████████▊ | 608/892 [1:17:36<32:44, 6.92s/it]g-point operations will not be computed-03 04:57:12,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 68%|█████████████████████████████████████████████████████▊ | 608/892 [1:17:36<32:44, 6.92s/it]g-point operations will not be computed-03 04:57:12,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.0564, 'learning_rate': 0.0007295918367346938, 'epoch': 0.68} +[WARNING|modeling_utils.py:388] 2022-03-03 04:57:36,738 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:57:12,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 68%|█████████████████████████████████████████████████████▉ | 609/892 [1:17:43<32:48, 6.95s/it]g-point operations will not be computed-03 04:57:12,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 68%|█████████████████████████████████████████████████████▉ | 
609/892 [1:17:43<32:48, 6.95s/it]g-point operations will not be computed-03 04:57:12,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.1008, 'learning_rate': 0.0007270408163265307, 'epoch': 0.68} + 68%|█████████████████████████████████████████████████████▉ | 609/892 [1:17:43<32:48, 6.95s/it]g-point operations will not be computed-03 04:57:12,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 68%|█████████████████████████████████████████████████████▉ | 609/892 [1:17:43<32:48, 6.95s/it]g-point operations will not be computed-03 04:57:12,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 68%|█████████████████████████████████████████████████████▉ | 609/892 [1:17:43<32:48, 6.95s/it]g-point operations will not be computed-03 04:57:12,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 68%|██████████████████████████████████████████████████████ | 610/892 [1:17:50<32:32, 6.92s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:57:47,114 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 68%|██████████████████████████████████████████████████████ | 610/892 [1:17:50<32:32, 6.92s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:57:47,114 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 68%|██████████████████████████████████████████████████████ | 610/892 [1:17:50<32:32, 6.92s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:57:47,114 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 68%|██████████████████████████████████████████████████████ | 611/892 [1:17:57<32:18, 6.90s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:57:47,114 >> Could not estimate the number of tokens of 
the input, floating-point operations will not be computed + 68%|██████████████████████████████████████████████████████ | 611/892 [1:17:57<32:18, 6.90s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:57:47,114 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9015, 'learning_rate': 0.0007219387755102041, 'epoch': 0.68} + 68%|██████████████████████████████████████████████████████ | 611/892 [1:17:57<32:18, 6.90s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:57:47,114 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:57:59,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:57:47,114 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:57:59,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:57:47,114 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.1184, 'learning_rate': 0.0007193877551020408, 'epoch': 0.69} +[WARNING|modeling_utils.py:388] 2022-03-03 04:57:59,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:57:47,114 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:57:59,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:57:47,114 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:57:59,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed-03 04:57:47,114 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 69%|██████████████████████████████████████████████████████▎ | 613/892 [1:18:10<32:02, 6.89s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:58:10,996 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 69%|██████████████████████████████████████████████████████▍ | 614/892 [1:18:17<31:39, 6.83s/it]g-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 69%|██████████████████████████████████████████████████████▍ | 614/892 [1:18:17<31:39, 6.83s/it]g-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.0992, 'learning_rate': 0.0007142857142857143, 'epoch': 0.69} +[WARNING|modeling_utils.py:388] 2022-03-03 04:58:17,718 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 69%|██████████████████████████████████████████████████████▍ | 615/892 [1:18:24<31:25, 6.81s/it]g-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 69%|██████████████████████████████████████████████████████▍ | 615/892 [1:18:24<31:25, 6.81s/it]g-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed +{'loss': 5.1966, 'learning_rate': 0.0007117346938775511, 'epoch': 0.69} + 69%|██████████████████████████████████████████████████████▍ | 615/892 [1:18:24<31:25, 6.81s/it]g-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:58:26,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:58:26,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:58:26,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9135, 'learning_rate': 0.0007091836734693877, 'epoch': 0.69} +[WARNING|modeling_utils.py:388] 2022-03-03 04:58:26,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 69%|██████████████████████████████████████████████████████▋ | 617/892 [1:18:37<30:59, 6.76s/it]g-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 69%|██████████████████████████████████████████████████████▋ | 617/892 [1:18:37<30:59, 6.76s/it]g-point operations will not be computed-03 04:58:07,674 >> Could 
not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.1225, 'learning_rate': 0.0007066326530612245, 'epoch': 0.69} +[WARNING|modeling_utils.py:388] 2022-03-03 04:58:37,786 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 69%|██████████████████████████████████████████████████████▋ | 618/892 [1:18:44<30:40, 6.72s/it]g-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 69%|██████████████████████████████████████████████████████▋ | 618/892 [1:18:44<30:40, 6.72s/it]g-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.8598, 'learning_rate': 0.0007040816326530613, 'epoch': 0.69} +[WARNING|modeling_utils.py:388] 2022-03-03 04:58:44,368 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 69%|██████████████████████████████████████████████████████▊ | 619/892 [1:18:50<30:21, 6.67s/it]g-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 69%|██████████████████████████████████████████████████████▊ | 619/892 [1:18:50<30:21, 6.67s/it]g-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.8324, 'learning_rate': 0.000701530612244898, 'epoch': 0.69} + 69%|██████████████████████████████████████████████████████▊ | 619/892 [1:18:50<30:21, 
6.67s/it]g-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:58:52,555 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:58:52,555 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.01, 'learning_rate': 0.0006989795918367347, 'epoch': 0.7} +[WARNING|modeling_utils.py:388] 2022-03-03 04:58:52,555 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:58:52,555 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 70%|██████████████████████████████████████████████████████▉ | 621/892 [1:19:04<29:56, 6.63s/it]g-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 70%|██████████████████████████████████████████████████████▉ | 621/892 [1:19:04<29:56, 6.63s/it]g-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:59:02,391 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:59:02,391 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 70%|███████████████████████████████████████████████████████ | 622/892 [1:19:10<29:33, 6.57s/it]g-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 70%|███████████████████████████████████████████████████████ | 622/892 [1:19:10<29:33, 6.57s/it]g-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:59:08,907 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:59:08,907 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 70%|███████████████████████████████████████████████████████▏ | 623/892 [1:19:17<29:22, 6.55s/it]g-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 70%|███████████████████████████████████████████████████████▏ | 623/892 [1:19:17<29:22, 6.55s/it]g-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed +{'loss': 5.05, 'learning_rate': 0.0006913265306122449, 'epoch': 0.7} + 70%|███████████████████████████████████████████████████████▏ | 623/892 [1:19:17<29:22, 6.55s/it]g-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:59:18,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:59:18,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.0356, 'learning_rate': 0.0006887755102040817, 'epoch': 0.7} +[WARNING|modeling_utils.py:388] 2022-03-03 04:59:18,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:59:18,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 70%|███████████████████████████████████████████████████████▎ | 625/892 [1:19:30<29:36, 6.66s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:59:27,198 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 70%|███████████████████████████████████████████████████████▎ | 625/892 [1:19:30<29:36, 6.66s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:59:27,198 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.3868, 'learning_rate': 0.0006862244897959184, 'epoch': 0.7} + 70%|███████████████████████████████████████████████████████▎ | 625/892 [1:19:30<29:36, 6.66s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:59:27,198 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 70%|███████████████████��███████████████████████████████████▍ | 626/892 [1:19:36<29:03, 6.55s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 70%|███████████████████████████████████████████████████████▍ | 626/892 [1:19:36<29:03, 6.55s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.0528, 'learning_rate': 0.0006836734693877551, 'epoch': 0.7} + 70%|███████████████████████████████████████████████████████▍ | 626/892 [1:19:36<29:03, 6.55s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 70%|███████████████████████████████████████████████████████▌ | 627/892 [1:19:43<28:31, 6.46s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 70%|███████████████████████████████████████████████████████▌ | 627/892 [1:19:43<28:31, 6.46s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:59:41,238 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:59:41,238 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 70%|███████████████████████████████████████████████████████▌ | 628/892 [1:19:49<28:05, 6.39s/it]g-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 70%|███████████████████████████████████████████████████████▌ | 628/892 [1:19:49<28:05, 6.39s/it]g-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:59:47,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:59:47,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 71%|███████████████████████████████████████████████████████▋ | 629/892 [1:19:55<27:43, 6.33s/it]g-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 71%|███████████████████████████████████████████████████████▋ | 629/892 [1:19:55<27:43, 6.33s/it]g-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:59:53,579 
>> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:59:53,579 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 71%|███████████████████████████████████████████████████████▊ | 630/892 [1:20:01<27:23, 6.27s/it]g-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 71%|███████████████████████████████████████████████████████▊ | 630/892 [1:20:01<27:23, 6.27s/it]g-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:59:59,729 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 04:59:59,729 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 71%|███████████████████████████████████████████████████████▉ | 631/892 [1:20:07<27:02, 6.22s/it]g-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 71%|███████████████████████████████████████████████████████▉ | 631/892 [1:20:07<27:02, 6.22s/it]g-point operations will not be computed-03 04:59:33,491 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:05,756 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:05,756 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 71%|███████████████████████████████████████████████████████▉ | 632/892 [1:20:13<26:42, 6.16s/it]g-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 71%|███████████████████████████████████████████████████████▉ | 632/892 [1:20:13<26:42, 6.16s/it]g-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:11,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:11,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 71%|████████████████████████████████████████████████████████ | 633/892 [1:20:19<26:23, 6.11s/it]g-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
+ 71%|████████████████████████████████████████████████████████ | 633/892 [1:20:19<26:23, 6.11s/it]g-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:17,736 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:17,736 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:17,736 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 71%|████████████████████████████████████████████████████████▏ | 634/892 [1:20:25<26:06, 6.07s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 71%|████████████████████████████████████████████████████████▏ | 634/892 [1:20:25<26:06, 6.07s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:26,398 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:26,398 >> Could not estimate the number of tokens 
of the input, floating-point operations will not be computed-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.1228, 'learning_rate': 0.0006607142857142857, 'epoch': 0.71} +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:26,398 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:26,398 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:32,147 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:32,147 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:36,376 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:36,376 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 
71%|████████████████████████████████████████████████████████▍ | 637/892 [1:20:42<24:39, 5.80s/it]g-point operations will not be computed-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:40,548 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:40,548 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:40,548 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 72%|████████████████████████████████████████████████████████▌ | 638/892 [1:20:48<24:11, 5.71s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:00:44,686 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:47,300 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:44,686 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:47,300 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:44,686 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 72%|████████████████████████████████████████████████████████▌ | 639/892 
[1:20:53<23:38, 5.61s/it]g-point operations will not be computed-03 05:00:44,686 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:51,277 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:44,686 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:51,277 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:44,686 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:51,277 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:44,686 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 72%|████████████████████████████████████████████████████████▋ | 640/892 [1:20:58<23:00, 5.48s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 72%|████████████████████████████████████████████████████████▋ | 640/892 [1:20:58<23:00, 5.48s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 72%|████████████████████████████████████████████████████████▋ | 640/892 [1:20:58<23:00, 5.48s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:00:58,664 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:01,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:03,344 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:03,344 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:05,552 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:07,562 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:07,562 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:09,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of 
the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:11,519 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:11,519 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:13,532 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:15,292 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:15,292 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:17,085 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:17,085 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:18,672 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:21,708 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:21,708 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:23,166 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:23,166 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:25,672 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:25,672 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
+[WARNING|modeling_utils.py:388] 2022-03-03 05:01:27,857 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:29,335 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:29,335 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.8832, 'learning_rate': 0.0006224489795918367, 'epoch': 0.73} +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:33,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:33,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:37,091 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:37,091 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed +{'loss': 5.3171, 'learning_rate': 0.0006198979591836736, 'epoch': 0.73} +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:40,828 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:40,828 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:40,828 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 73%|█████████████████████████████████████████████████████████▋ | 652/892 [1:21:49<21:03, 5.26s/it]g-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:48,004 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:48,004 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:01:48,004 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed + 73%|█████████████████████████████████████████████████████████▊ | 653/892 [1:21:56<23:16, 5.84s/it]g-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 73%|█████████████████████████████████████████████████████████▊ | 653/892 [1:21:56<23:16, 5.84s/it]g-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 73%|█████████████████████████████████████████████████████████▊ | 653/892 [1:21:56<23:16, 5.84s/it]g-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 73%|█████████████████████████████████████████████████████████▊ | 653/892 [1:21:56<23:16, 5.84s/it]g-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 73%|█████████████████████████████████████████████████████████▊ | 653/892 [1:21:56<23:16, 5.84s/it]g-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 73%|█████████████████████████████████████████████████████████▉ | 654/892 [1:22:03<24:47, 6.25s/it]g-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 73%|█████████████████████████████████████████████████████████▉ | 654/892 [1:22:03<24:47, 6.25s/it]g-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:02:04,212 >> Could not estimate the number of tokens of the
input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 73%|██████████████████████████████████████████████████████████ | 655/892 [1:22:10<25:48, 6.53s/it]g-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 73%|██████████████████████████████████████████████████████████ | 655/892 [1:22:10<25:48, 6.53s/it]g-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9983, 'learning_rate': 0.0006096938775510205, 'epoch': 0.73} + 73%|██████████████████████████████████████████████████████████ | 655/892 [1:22:10<25:48, 6.53s/it]g-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 73%|██████████████████████████████████████████████████████████ | 655/892 [1:22:10<25:48, 6.53s/it]g-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 73%|██████████████████████████████████████████████████████████ | 655/892 [1:22:10<25:48, 6.53s/it]g-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████ | 656/892 [1:22:17<26:16, 6.68s/it]g-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████ | 656/892 [1:22:17<26:16, 6.68s/it]g-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:02:18,403 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▏ | 657/892 [1:22:25<26:40, 6.81s/it]g-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▏ | 657/892 [1:22:25<26:40, 6.81s/it]g-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9124, 'learning_rate': 0.0006045918367346938, 'epoch': 0.74} + 74%|██████████████████████████████████████████████████████████▏ | 657/892 [1:22:25<26:40, 6.81s/it]g-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▏ | 657/892 [1:22:25<26:40, 6.81s/it]g-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▏ | 657/892 [1:22:25<26:40, 6.81s/it]g-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▎ | 658/892 [1:22:32<26:47, 6.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:02:29,012 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 
74%|██████████████████████████████████████████████████████████▎ | 658/892 [1:22:32<26:47, 6.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:02:29,012 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▎ | 658/892 [1:22:32<26:47, 6.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:02:29,012 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▎ | 659/892 [1:22:39<26:51, 6.92s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:02:29,012 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▎ | 659/892 [1:22:39<26:51, 6.92s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:02:29,012 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9621, 'learning_rate': 0.0005994897959183674, 'epoch': 0.74} + 74%|██████████████████████████████████████████████████████████▎ | 659/892 [1:22:39<26:51, 6.92s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:02:29,012 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▎ | 659/892 [1:22:39<26:51, 6.92s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:02:29,012 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▎ | 659/892 [1:22:39<26:51, 6.92s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:02:29,012 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▍ | 660/892 [1:22:46<26:43, 
6.91s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:02:42,876 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▍ | 660/892 [1:22:46<26:43, 6.91s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:02:42,876 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▍ | 660/892 [1:22:46<26:43, 6.91s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:02:42,876 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▍ | 660/892 [1:22:46<26:43, 6.91s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:02:42,876 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▌ | 661/892 [1:22:52<26:31, 6.89s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:02:42,876 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:02:51,361 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:02:42,876 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:02:51,361 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:02:42,876 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▋ | 662/892 [1:22:59<26:24, 6.89s/it]g-point operations will not be computed-03 05:02:42,876 >> Could not estimate the number of tokens of the input,
floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▋ | 662/892 [1:22:59<26:24, 6.89s/it]g-point operations will not be computed-03 05:02:42,876 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.7113, 'learning_rate': 0.0005918367346938776, 'epoch': 0.74} + 74%|██████████████████████████████████████████████████████████▋ | 662/892 [1:22:59<26:24, 6.89s/it]g-point operations will not be computed-03 05:02:42,876 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▋ | 662/892 [1:22:59<26:24, 6.89s/it]g-point operations will not be computed-03 05:02:42,876 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▋ | 662/892 [1:22:59<26:24, 6.89s/it]g-point operations will not be computed-03 05:02:42,876 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▋ | 663/892 [1:23:06<26:13, 6.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▋ | 663/892 [1:23:06<26:13, 6.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▋ | 663/892 [1:23:06<26:13, 6.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +
74%|██████████████████████████████████████████████████████████▋ | 663/892 [1:23:06<26:13, 6.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▊ | 664/892 [1:23:13<25:56, 6.83s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 74%|██████████████████████████████████████████████████████████▊ | 664/892 [1:23:13<25:56, 6.83s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:03:13,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 75%|██████████████████████████████████████████████████████████▉ | 665/892 [1:23:20<25:44, 6.80s/it]g-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 75%|██████████████████████████████████████████████████████████▉ | 665/892 [1:23:20<25:44, 6.80s/it]g-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.8416, 'learning_rate': 0.0005841836734693877, 'epoch': 0.75} + 75%|██████████████████████████████████████████████████████████▉ | 665/892 [1:23:20<25:44, 6.80s/it]g-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:03:21,812 >> Could not estimate 
the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:03:21,812 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.1135, 'learning_rate': 0.0005816326530612245, 'epoch': 0.75} +[WARNING|modeling_utils.py:388] 2022-03-03 05:03:21,812 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:03:28,503 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:03:28,503 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9375, 'learning_rate': 0.0005790816326530613, 'epoch': 0.75} +[WARNING|modeling_utils.py:388] 2022-03-03 05:03:28,503 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:03:28,503 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed + 75%|███████████████████████████████████████████████████████████▏ | 668/892 [1:23:40<25:06, 6.72s/it]g-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 75%|███████████████████████████████████████████████████████████▏ | 668/892 [1:23:40<25:06, 6.72s/it]g-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.909, 'learning_rate': 0.000576530612244898, 'epoch': 0.75} +[WARNING|modeling_utils.py:388] 2022-03-03 05:03:40,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 75%|███████████████████████████████████████████████████████████▎ | 669/892 [1:23:46<24:52, 6.69s/it]g-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 75%|███████████████████████████████████████████████████████████▎ | 669/892 [1:23:46<24:52, 6.69s/it]g-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.1056, 'learning_rate': 0.0005739795918367347, 'epoch': 0.75} + 75%|███████████████████████████████████████████████████████████▎ | 669/892 [1:23:46<24:52, 6.69s/it]g-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:03:48,351 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:03:48,351 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.0018, 'learning_rate': 0.0005714285714285714, 'epoch': 0.75} +[WARNING|modeling_utils.py:388] 2022-03-03 05:03:48,351 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:03:54,874 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:03:54,874 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.7696, 'learning_rate': 0.0005688775510204082, 'epoch': 0.75} +[WARNING|modeling_utils.py:388] 2022-03-03 05:03:54,874 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:03:54,874 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:03:54,874 >> Could not estimate the number of tokens 
of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 75%|███████████████████████████████████████████████████████████▌ | 672/892 [1:24:06<24:04, 6.57s/it]g-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:04:04,659 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:04:04,659 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 75%|███████████████████████████████████████████████████████████▌ | 673/892 [1:24:12<23:57, 6.56s/it]g-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 75%|███████████████████████████████████████████████████████████▌ | 673/892 [1:24:12<23:57, 6.56s/it]g-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:04:11,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:04:11,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed + 76%|███████████████████████████████████████████████████████████▋ | 674/892 [1:24:19<23:42, 6.53s/it]g-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 76%|███████████████████████████████████████████████████████████▋ | 674/892 [1:24:19<23:42, 6.53s/it]g-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.8107, 'learning_rate': 0.0005612244897959184, 'epoch': 0.76} +[WARNING|modeling_utils.py:388] 2022-03-03 05:04:19,175 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:04:19,175 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:04:19,175 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9513, 'learning_rate': 0.0005586734693877551, 'epoch': 0.76} +[WARNING|modeling_utils.py:388] 2022-03-03 05:04:19,175 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:04:19,175 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:04:19,175 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 76%|███████████████████████████████████████████████████████████▊ | 676/892 [1:24:32<23:38, 6.57s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:04:29,270 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 76%|███████████████████████████████████████████████████████████▊ | 676/892 [1:24:32<23:38, 6.57s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:04:29,270 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 76%|███████████████████████████████████████████████████████████▊ | 676/892 [1:24:32<23:38, 6.57s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:04:29,270 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 76%|███████████████████████████████████████████████████████████▊ | 676/892 [1:24:32<23:38, 6.57s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:04:29,270 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 76%|███████████████████████████████████████████████████████████▉ | 677/892 [1:24:38<23:15, 6.49s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:04:35,572 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 76%|███████████████████████████████████████████████████████████▉ | 677/892 [1:24:38<23:15, 6.49s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:04:35,572 >> Could not estimate the number of tokens of the input, floating-point operations
will not be computed + 76%|███████████████████████████████████████████████████████████▉ | 677/892 [1:24:38<23:15, 6.49s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:04:35,572 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 76%|███████████████████████████████████████████████████████████▉ | 677/892 [1:24:38<23:15, 6.49s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:04:35,572 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 76%|████████████████████████████████████████████████████████████ | 678/892 [1:24:45<22:51, 6.41s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:04:35,572 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:04:43,342 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:04:35,572 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:04:43,342 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:04:35,572 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:04:43,342 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:04:35,572 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 76%|████████████████████████████████████████████████████████████▏ | 679/892 [1:24:51<22:34, 6.36s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:04:48,029 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 76%|████████████████████████████████████████████████████████████▏ | 679/892 
[1:24:51<22:34, 6.36s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:04:48,029 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 76%|████████████████████████████████████████████████████████████▏ | 679/892 [1:24:51<22:34, 6.36s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:04:48,029 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 76%|████████████████████████████████████████████████████████████▏ | 679/892 [1:24:51<22:34, 6.36s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:04:48,029 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 76%|████████████████████████████████████████████████████████████▏ | 680/892 [1:24:57<22:14, 6.29s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:04:54,139 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 76%|████████████████████████████████████████████████████████████▏ | 680/892 [1:24:57<22:14, 6.29s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:04:54,139 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 76%|████████████████████████████████████████████████████████████▏ | 680/892 [1:24:57<22:14, 6.29s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:04:54,139 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 76%|████████████████████████████████████████████████████████████▎ | 681/892 [1:25:03<21:55, 6.23s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:05:00,232 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 76%|████████████████████████████████████████████████████████████▎ | 681/892 [1:25:03<21:55, 6.23s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:05:00,232 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed +{'loss': 5.1294, 'learning_rate': 0.0005433673469387756, 'epoch': 0.76} + 76%|████████████████████████████████████████████████████████████▎ | 681/892 [1:25:03<21:55, 6.23s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:05:00,232 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 76%|████████████████████████████████████████████████████████████▎ | 681/892 [1:25:03<21:55, 6.23s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:05:00,232 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 76%|████████████████████████████████████████████████████████████▍ | 682/892 [1:25:09<21:34, 6.17s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:05:06,228 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 76%|████████████████████████████████████████████████████████████▍ | 682/892 [1:25:09<21:34, 6.17s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:05:06,228 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 76%|████████████████████████████████████████████████████████████▍ | 682/892 [1:25:09<21:34, 6.17s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:05:06,228 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 76%|████████████████████████████████████████████████████████████▍ | 682/892 [1:25:09<21:34, 6.17s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:05:06,228 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 77%|████████████████████████████████████████████████████████████▍ | 683/892 [1:25:15<21:18, 6.12s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:05:12,207 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +
77%|████████████████████████████████████████████████████████████▍ | 683/892 [1:25:15<21:18, 6.12s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:05:12,207 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:05:16,482 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:12,207 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:05:16,482 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:12,207 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.217, 'learning_rate': 0.0005357142857142857, 'epoch': 0.77} +[WARNING|modeling_utils.py:388] 2022-03-03 05:05:16,482 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:12,207 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:05:22,165 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:12,207 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:05:22,165 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:12,207 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9345, 'learning_rate': 0.0005331632653061225, 'epoch': 0.77} +[WARNING|modeling_utils.py:388] 2022-03-03 05:05:26,447 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:12,207 >> Could 
not estimate the number of tokens of the input, floating-point operations will not be computed + 77%|████████████████████████████████████████████████████████████▊ | 686/892 [1:25:32<20:03, 5.84s/it]g-point operations will not be computed-03 05:05:12,207 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 77%|████████████████████████████████████████████████████████████▊ | 686/892 [1:25:32<20:03, 5.84s/it]g-point operations will not be computed-03 05:05:12,207 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:05:30,662 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:12,207 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:05:30,662 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:12,207 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 77%|████████████████████████████████████████████████████████████▊ | 687/892 [1:25:38<19:40, 5.76s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:05:34,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 77%|████████████████████████████████████████████████████████████▊ | 687/892 [1:25:38<19:40, 5.76s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:05:34,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9354, 'learning_rate': 0.0005280612244897959, 'epoch': 0.77} +[WARNING|modeling_utils.py:388] 2022-03-03 05:05:38,842 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:34,827 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:05:38,842 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:34,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9864, 'learning_rate': 0.0005255102040816326, 'epoch': 0.77} +[WARNING|modeling_utils.py:388] 2022-03-03 05:05:42,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:34,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 77%|█████████████████████████████████████████████████████████████ | 689/892 [1:25:49<18:49, 5.56s/it]g-point operations will not be computed-03 05:05:34,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 77%|█████████████████████████████████████████████████████████████ | 689/892 [1:25:49<18:49, 5.56s/it]g-point operations will not be computed-03 05:05:34,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:05:46,777 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:34,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:05:46,777 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:34,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 77%|█████████████████████████████████████████████████████████████ | 690/892 [1:25:54<18:14, 5.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:05:50,567 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed + 77%|█████████████████████████████████████████████████████████████ | 690/892 [1:25:54<18:14, 5.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:05:52,978 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 77%|█████████████████████████████████████████████████████████████▏ | 691/892 [1:25:59<17:40, 5.28s/it]g-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 77%|█████████████████████████████████████████████████████████████▏ | 691/892 [1:25:59<17:40, 5.28s/it]g-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:05:56,585 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:05:56,585 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:05:58,818 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
+[WARNING|modeling_utils.py:388] 2022-03-03 05:06:01,068 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:01,068 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:03,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:05,124 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:05,124 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:06,951 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:08,846 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 
2022-03-03 05:06:08,846 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:12,275 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:13,813 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:13,813 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:15,391 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:15,391 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:18,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:18,190 >> Could 
not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:19,412 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:19,412 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:21,790 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:22,950 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:22,950 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:24,443 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:24,443 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:28,447 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:32,090 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:32,090 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.7946, 'learning_rate': 0.0004923469387755102, 'epoch': 0.79} +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:35,844 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:35,844 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:35,844 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 
05:06:39,488 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:39,488 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:39,488 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:39,488 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:06:39,488 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 79%|██████████████████████████████████████████████████████████████▎ | 703/892 [1:26:51<18:31, 5.88s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:06:48,669 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 79%|██████████████████████████████████████████████████████████████▎ | 703/892 [1:26:51<18:31, 5.88s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:06:48,669 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 79%|██████████████████████████████████████████████████████████████▎ | 703/892 [1:26:51<18:31, 
5.88s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:06:48,669 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+ 79%|██████████████████████████████████████████████████████████████▎ | 704/892 [1:26:58<19:36, 6.26s/it]
+{'loss': 4.9163, 'learning_rate': 0.0004846938775510204, 'epoch': 0.79}
+ 79%|██████████████████████████████████████████████████████████████▍ | 705/892 [1:27:06<20:21, 6.53s/it]
+[WARNING|modeling_utils.py:388] 2022-03-03 05:07:04,727 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+ 79%|██████████████████████████████████████████████████████████████▌ | 706/892 [1:27:13<20:44, 6.69s/it]
+[WARNING|modeling_utils.py:388] 2022-03-03 05:07:15,228 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+{'loss': 4.9996, 'learning_rate': 0.00047704081632653065, 'epoch': 0.79}
+ 79%|██████████████████████████████████████████████████████████████▋ | 708/892 [1:27:27<21:05, 6.88s/it]
+[WARNING|modeling_utils.py:388] 2022-03-03 05:07:29,341 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+{'loss': 4.9985, 'learning_rate': 0.0004719387755102041, 'epoch': 0.79}
+ 80%|██████████████████████████████████████████████████████████████▉ | 710/892 [1:27:41<21:07, 6.97s/it]
+[WARNING|modeling_utils.py:388] 2022-03-03 05:07:43,222 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+{'loss': 5.1134, 'learning_rate': 0.00046683673469387755, 'epoch': 0.8}
+ 80%|███████████████████████████████████████████████████████████████ | 712/892 [1:27:55<20:46, 6.93s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:07:51,954 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+ 80%|███████████████████████████████████████████████████████████████▏ | 713/892 [1:28:02<20:38, 6.92s/it]
+ 80%|███████████████████████████████████████████████████████████████▏ | 714/892 [1:28:08<20:32, 6.92s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:08:05,732 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+ 80%|███████████████████████████████████████████████████████████████▎ | 715/892 [1:28:15<20:14, 6.86s/it]
+[WARNING|modeling_utils.py:388] 2022-03-03 05:08:14,072 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+ 80%|███████████████████████████████████████████████████████████████▍ | 716/892 [1:28:22<19:57, 6.81s/it]
+{'loss': 4.8231, 'learning_rate': 0.00045408163265306124, 'epoch': 0.8}
+[WARNING|modeling_utils.py:388] 2022-03-03 05:08:22,455 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+ 80%|███████████████████████████████████████████████████████████████▌ | 717/892 [1:28:29<19:47, 6.79s/it]
+{'loss': 5.09, 'learning_rate': 0.00045153061224489796, 'epoch': 0.8}
+ 80%|███████████████████████████████████████████████████████████████▌ | 718/892 [1:28:35<19:38, 6.77s/it]
+[WARNING|modeling_utils.py:388] 2022-03-03 05:08:34,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+ 81%|███████████████████████████████████████████████████████████████▋ | 719/892 [1:28:42<19:23, 6.73s/it]
+[WARNING|modeling_utils.py:388] 2022-03-03 05:08:40,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+ 81%|███████████████████████████████████████████████████████████████▊ | 720/892 [1:28:49<19:10, 6.69s/it]
+{'loss': 4.841, 'learning_rate': 0.00044387755102040814, 'epoch': 0.81}
+[WARNING|modeling_utils.py:388] 2022-03-03 05:08:49,035 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+ 81%|███████████████████████████████████████████████████████████████▊ | 721/892 [1:28:55<18:56, 6.65s/it]
+{'loss': 4.7433, 'learning_rate': 0.0004413265306122449, 'epoch': 0.81}
+[WARNING|modeling_utils.py:388] 2022-03-03 05:08:57,162 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+{'loss': 4.7101, 'learning_rate': 0.00043877551020408165, 'epoch': 0.81}
+[WARNING|modeling_utils.py:388] 2022-03-03 05:09:03,705 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+{'loss': 4.8642, 'learning_rate': 0.00043622448979591837, 'epoch': 0.81}
+ 81%|████████████████████████████████████████████████████████████████ | 724/892 [1:29:15<18:16, 6.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:09:11,775 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+ 81%|████████████████████████████████████████████████████████████████▏ | 725/892 [1:29:22<18:33, 6.67s/it]
+{'loss': 4.8054, 'learning_rate': 0.0004311224489795919, 'epoch': 0.81}
+[WARNING|modeling_utils.py:388] 2022-03-03 05:09:21,908 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+ 81%|████████████████████████████████████████████████████████████████▎ | 726/892 [1:29:28<18:11, 6.57s/it]
+{'loss': 5.0236, 'learning_rate': 0.00042857142857142855, 'epoch': 0.81}
+[WARNING|modeling_utils.py:388] 2022-03-03 05:09:29,764 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+{'loss': 4.8533, 'learning_rate': 0.00042602040816326533, 'epoch': 0.82}
+[WARNING|modeling_utils.py:388] 2022-03-03 05:09:34,443 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+ 82%|████████████████████████████████████████████████████████████████▍ | 728/892 [1:29:40<17:33, 6.42s/it]
+{'loss': 4.6959, 'learning_rate': 0.00042346938775510206, 'epoch': 0.82}
+[WARNING|modeling_utils.py:388] 2022-03-03 05:09:42,143 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+{'loss': 4.6739, 'learning_rate': 0.0004209183673469388, 'epoch': 0.82}
+[WARNING|modeling_utils.py:388] 2022-03-03 05:09:48,314 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+{'loss': 5.0312, 'learning_rate': 0.00041836734693877556, 'epoch': 0.82}
+[WARNING|modeling_utils.py:388] 2022-03-03 05:09:54,478 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+[WARNING|modeling_utils.py:388] 2022-03-03 05:10:00,474 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+[WARNING|modeling_utils.py:388] 2022-03-03 05:10:04,959 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+ 82%|████████████████████████████████████████████████████████████████▉ | 733/892 [1:30:11<16:11, 6.11s/it]
+[WARNING|modeling_utils.py:388] 2022-03-03 05:10:09,365 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+ 82%|█████████████████████████████████████████████████████████████████ | 734/892 [1:30:17<15:47, 6.00s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:10:13,673 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+[WARNING|modeling_utils.py:388] 2022-03-03 05:10:17,894 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+{'loss': 4.6994, 'learning_rate': 0.0004056122448979592, 'epoch': 0.82}
+[WARNING|modeling_utils.py:388] 2022-03-03 05:10:23,455 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+{'loss': 4.8075, 'learning_rate': 0.0004030612244897959, 'epoch': 0.83}
+[WARNING|modeling_utils.py:388] 2022-03-03 05:10:27,571 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+ 83%|█████████████████████████████████████████████████████████████████▎ | 737/892 [1:30:33<14:42, 5.69s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:10:30,303 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+{'loss': 4.8765, 'learning_rate': 0.00040051020408163264, 'epoch': 0.83}
+[WARNING|modeling_utils.py:388] 2022-03-03 05:10:34,166 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+{'loss': 4.843, 'learning_rate': 0.00039795918367346937, 'epoch': 0.83}
+[WARNING|modeling_utils.py:388] 2022-03-03 05:10:38,040 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+ 83%|█████████████████████████████████████████████████████████████████▍ | 739/892 [1:30:44<13:51, 5.43s/it]
+[WARNING|modeling_utils.py:388] 2022-03-03 05:10:41,893 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+ 83%|█████████████████████████████████████████████████████████████████▌ | 740/892 [1:30:49<13:27, 5.31s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+[WARNING|modeling_utils.py:388] 2022-03-03 05:10:47,964 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+ 83%|█████████████████████████████████████████████████████████████████▋ | 741/892 [1:30:54<12:58, 5.15s/it]
+[WARNING|modeling_utils.py:388] 2022-03-03 05:10:51,463 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+[WARNING|modeling_utils.py:388] 2022-03-03 05:10:53,689 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+[WARNING|modeling_utils.py:388] 2022-03-03 05:10:55,971 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+[WARNING|modeling_utils.py:388] 2022-03-03 05:10:58,072 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+[WARNING|modeling_utils.py:388] 2022-03-03 05:11:00,209 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+[WARNING|modeling_utils.py:388] 2022-03-03 05:11:02,145 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+[WARNING|modeling_utils.py:388] 2022-03-03 05:11:04,125 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+[WARNING|modeling_utils.py:388] 2022-03-03 05:11:05,926 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
+[WARNING|modeling_utils.py:388] 2022-03-03 05:11:07,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be
computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:07,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:10,949 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:12,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:12,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:14,978 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:14,978 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:16,220 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:16,220 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:18,508 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:20,037 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:20,037 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9339, 'learning_rate': 0.00036734693877551024, 'epoch': 0.84} +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:24,014 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:24,014 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:27,686 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:27,686 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9065, 'learning_rate': 0.0003647959183673469, 'epoch': 0.84} +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:31,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:35,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:35,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.8276, 'learning_rate': 0.0003622448979591837, 'epoch': 0.84} +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:35,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:35,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 
84%|██████████████████████████████████████████████████████████████████▋ | 753/892 [1:31:47<13:39, 5.90s/it]g-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 84%|██████████████████████████████████████████████████████████████████▋ | 753/892 [1:31:47<13:39, 5.90s/it]g-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.8596, 'learning_rate': 0.0003596938775510204, 'epoch': 0.84} + 84%|██████████████████████████████████████████████████████████████████▋ | 753/892 [1:31:47<13:39, 5.90s/it]g-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:49,716 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:49,716 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.8511, 'learning_rate': 0.00035714285714285714, 'epoch': 0.85} +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:49,716 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:11:49,716 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed + 85%|██████████████████████████████████████████████████████████████████▊ | 755/892 [1:32:01<15:03, 6.60s/it]g-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 85%|██████████████████████████████████████████████████████████████████▊ | 755/892 [1:32:01<15:03, 6.60s/it]g-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.8633, 'learning_rate': 0.00035459183673469387, 'epoch': 0.85} + 85%|██████████████████████████████████████████████████████████████████▊ | 755/892 [1:32:01<15:03, 6.60s/it]g-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:12:04,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:12:04,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:12:07,691 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:12:07,691 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed + 85%|███████████████████████████████████████████████████████████████████ | 757/892 [1:32:16<15:24, 6.85s/it]g-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 85%|███████████████████████████████████████████████████████████████████ | 757/892 [1:32:16<15:24, 6.85s/it]g-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.0629, 'learning_rate': 0.0003494897959183674, 'epoch': 0.85} + 85%|███████████████████████████████████████████████████████████████████ | 757/892 [1:32:16<15:24, 6.85s/it]g-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:12:18,217 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:12:18,217 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:12:18,217 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.8626, 'learning_rate': 0.0003469387755102041, 'epoch': 0.85} +[WARNING|modeling_utils.py:388] 2022-03-03 05:12:18,217 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:12:18,217 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 85%|███████████████████████████████████████████████████████████████████▏ | 759/892 [1:32:30<15:27, 6.97s/it]g-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 85%|███████████████████████████████████████████████████████████████████▏ | 759/892 [1:32:30<15:27, 6.97s/it]g-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 85%|███████████████████████████████████████████████████████████████████▏ | 759/892 [1:32:30<15:27, 6.97s/it]g-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 85%|███████████████████████████████████████████████████████████████████▏ | 759/892 [1:32:30<15:27, 6.97s/it]g-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:12:32,237 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:12:32,237 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:12:32,237 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:12:32,237 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:12:32,237 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 85%|███████████████████████████████████████████████████████████████████▍ | 761/892 [1:32:44<15:06, 6.92s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 85%|███████████████████████████████████████████████████████████████████▍ | 761/892 [1:32:44<15:06, 6.92s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 85%|███████████████████████████████████████████████████████████████████▍ | 761/892 [1:32:44<15:06, 6.92s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 85%|███████████████████████████████████████████████████████████████████▍ | 761/892 [1:32:44<15:06, 6.92s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed + 85%|███████████████████████████████████████████████████████████████████▍ | 762/892 [1:32:50<14:55, 6.89s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 85%|███████████████████████████████████████████████████████████████████▍ | 762/892 [1:32:50<14:55, 6.89s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:12:51,046 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:12:51,046 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 86%|███████████████████████████████████████████████████████████████████▌ | 763/892 [1:32:57<14:46, 6.87s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 86%|███████████████████████████████████████████████████████████████████▌ | 763/892 [1:32:57<14:46, 6.87s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 86%|███████████████████████████████████████████████████████████████████▌ | 763/892 [1:32:57<14:46, 6.87s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 
86%|███████████████████████████████████████████████████████████████████▌ | 763/892 [1:32:57<14:46, 6.87s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 86%|███████████████████████████████████████████████████████████████████▌ | 763/892 [1:32:57<14:46, 6.87s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 86%|███████████████████████████████████████████████████████████████████▋ | 764/892 [1:33:04<14:37, 6.86s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:13:03,135 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:13:03,135 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:13:03,135 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 86%|███████████████████████████████████████████████████████████████████▊ | 765/892 [1:33:11<14:32, 6.87s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 86%|███████████████████████████████████████████████████████████████████▊ | 
765/892 [1:33:11<14:32, 6.87s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:13:11,522 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 86%|███████████████████████████████████████████████████████████████████▊ | 766/892 [1:33:18<14:20, 6.83s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 86%|███████████████████████████████████████████████████████████████████▊ | 766/892 [1:33:18<14:20, 6.83s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.0025, 'learning_rate': 0.00032653061224489796, 'epoch': 0.86} + 86%|███████████████████████████████████████████████████████████████████▊ | 766/892 [1:33:18<14:20, 6.83s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:13:19,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:13:19,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.6712, 'learning_rate': 0.0003239795918367347, 'epoch': 0.86} 
+[WARNING|modeling_utils.py:388] 2022-03-03 05:13:19,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:13:19,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:13:19,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 86%|████████████████████████████████████████████████████████████████████ | 768/892 [1:33:31<13:57, 6.76s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 86%|████████████████████████████████████████████████████████████████████ | 768/892 [1:33:31<13:57, 6.76s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:13:31,662 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:13:31,662 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 
86%|████████████████████████████████████████████████████████████████████ | 769/892 [1:33:38<13:52, 6.77s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 86%|████████████████████████████████████████████████████████████████████ | 769/892 [1:33:38<13:52, 6.77s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 86%|████████████████████████████████████████████████████████████████████ | 769/892 [1:33:38<13:52, 6.77s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:13:40,029 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:13:40,029 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.7581, 'learning_rate': 0.0003163265306122449, 'epoch': 0.86} +[WARNING|modeling_utils.py:388] 2022-03-03 05:13:40,029 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:13:40,029 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:13:40,029 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 86%|████████████████████████████████████████████████████████████████████▎ | 771/892 [1:33:51<13:32, 6.71s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 86%|████████████████████████████████████████████████████████████████████▎ | 771/892 [1:33:51<13:32, 6.71s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 86%|████████████████████████████████████████████████████████████████████▎ | 771/892 [1:33:51<13:32, 6.71s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 86%|████████████████████████████████████████████████████████████████████▎ | 771/892 [1:33:51<13:32, 6.71s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 87%|████████████████████████████████████████████████████████████████████▎ | 772/892 [1:33:58<13:23, 6.70s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:13:56,712 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:13:56,712 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 87%|████████████████████████████████████████████████████████████████████▍ | 773/892 [1:34:04<13:13, 6.67s/it]g-point operations will not be computed-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 87%|████████████████████████████████████████████████████████████████████▍ | 773/892 [1:34:04<13:13, 6.67s/it]g-point operations will not be computed-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9918, 'learning_rate': 0.0003086734693877551, 'epoch': 0.87} + 87%|████████████████████████████████████████████████████████████████████▍ | 773/892 [1:34:04<13:13, 6.67s/it]g-point operations will not be computed-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:14:06,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:14:06,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.5347, 'learning_rate': 0.0003061224489795919, 'epoch': 0.87} +[WARNING|modeling_utils.py:388] 2022-03-03 05:14:06,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will 
not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:14:06,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:14:06,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 87%|████████████████████████████████████████████████████████████████████▋ | 775/892 [1:34:18<13:05, 6.72s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:14:15,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 87%|████████████████████████████████████████████████████████████████████▋ | 775/892 [1:34:18<13:05, 6.72s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:14:15,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 87%|████████████████████████████████████████████████████████████████████▋ | 775/892 [1:34:18<13:05, 6.72s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:14:15,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 87%|████████████████████████████████████████████████████████████████████▋ | 775/892 [1:34:18<13:05, 6.72s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:14:15,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 87%|████████████████████████████████████████████████████████████████████▋ | 776/892 [1:34:24<12:47, 6.62s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 
87%|████████████████████████████████████████████████████████████████████▋ | 776/892 [1:34:24<12:47, 6.62s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 87%|████████████████████████████████████████████████████████████████████▋ | 776/892 [1:34:24<12:47, 6.62s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 87%|████████████████████████████████████████████████████████████████████▋ | 776/892 [1:34:24<12:47, 6.62s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 87%|████████████████████████████████████████████████████████████████████▊ | 777/892 [1:34:31<12:30, 6.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:14:29,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:14:29,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:14:29,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 
87%|█████████████████████████████��██████████████████████████████████████▉ | 778/892 [1:34:37<12:17, 6.47s/it]g-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:14:35,631 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:14:35,631 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:14:35,631 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 87%|████████████████████████████████████████████████████████████████████▉ | 779/892 [1:34:43<12:03, 6.41s/it]g-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:14:41,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:14:41,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 
05:14:41,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 87%|█████████████████████████████████████████████████████████████████████ | 780/892 [1:34:49<11:51, 6.35s/it]g-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:14:48,067 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:14:48,067 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 88%|█████████████████████████████████████████████████████████████████████▏ | 781/892 [1:34:56<11:38, 6.29s/it]g-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 88%|█████████████████████████████████████████████████████████████████████▏ | 781/892 [1:34:56<11:38, 6.29s/it]g-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:14:54,187 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:14:54,187 >> Could not estimate the number of tokens of 
the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 88%|█████████████████████████████████████████████████████████████████████▎ | 782/892 [1:35:02<11:29, 6.27s/it]g-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 88%|█████████████████████████████████████████████████████████████████████▎ | 782/892 [1:35:02<11:29, 6.27s/it]g-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:15:00,380 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:15:00,380 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 88%|█████████████████████████████████████████████████████████████████████▎ | 783/892 [1:35:08<11:17, 6.22s/it]g-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 88%|█████████████████████████████████████████████████████████████████████▎ | 783/892 [1:35:08<11:17, 6.22s/it]g-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.8777, 'learning_rate': 0.00028316326530612246, 'epoch': 0.88} +[WARNING|modeling_utils.py:388] 2022-03-03 05:15:07,906 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 88%|█████████████████████████████████████████████████████████████████████▍ | 784/892 [1:35:14<11:02, 6.14s/it]g-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 88%|█████████████████████████████████████████████████████████████████████▍ | 784/892 [1:35:14<11:02, 6.14s/it]g-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:15:12,314 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:15:12,314 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:15:12,314 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 88%|█████████████████████████████████████████████████████████████████████▌ | 785/892 [1:35:20<10:45, 6.03s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:15:16,673 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 88%|█████████████████████████████████████████████████████████████████████▌ | 785/892 [1:35:20<10:45, 6.03s/it][WARNING|modeling_utils.py:388] 
2022-03-03 05:15:16,673 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:15:20,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:16,673 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:15:20,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:16,673 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.6596, 'learning_rate': 0.00027551020408163264, 'epoch': 0.88} +[WARNING|modeling_utils.py:388] 2022-03-03 05:15:20,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:16,673 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:15:26,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:16,673 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:15:26,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:16,673 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.8652, 'learning_rate': 0.00027295918367346936, 'epoch': 0.88} +[WARNING|modeling_utils.py:388] 2022-03-03 05:15:30,554 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:16,673 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 
88%|█████████████████████████████████████████████████████████████████████▊ | 788/892 [1:35:36<09:54, 5.71s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:15:33,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 88%|█████████████████████████████████████████████████████████████████████▊ | 788/892 [1:35:36<09:54, 5.71s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:15:33,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.8039, 'learning_rate': 0.00027040816326530614, 'epoch': 0.88} +[WARNING|modeling_utils.py:388] 2022-03-03 05:15:37,133 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:33,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:15:37,133 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:33,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.7242, 'learning_rate': 0.00026785714285714287, 'epoch': 0.88} +[WARNING|modeling_utils.py:388] 2022-03-03 05:15:41,085 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:33,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 89%|█████████████████████████████████████████████████████████████████████▉ | 790/892 [1:35:47<09:16, 5.46s/it]g-point operations will not be computed-03 05:15:33,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 89%|█████████████████████████████████████████████████████████████████████▉ | 790/892 [1:35:47<09:16, 5.46s/it]g-point operations will not be computed-03 05:15:33,253 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:15:44,906 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:33,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:15:47,312 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:33,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:15:47,312 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:33,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:15:49,749 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:33,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:15:49,749 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:33,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 89%|██████████████████████████████████████████████████████████████████████▏ | 792/892 [1:35:57<08:34, 5.15s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:15:53,280 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 89%|██████████████████████████████████████████████████████████████████████▏ | 792/892 [1:35:57<08:34, 5.15s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:15:53,280 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:15:55,460 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:53,280 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 89%|██████████████████████████████████████████████████████████████████████▏ | 793/892 [1:36:01<08:08, 4.94s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:15:57,639 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 89%|██████████████████████████████████████████████████████████████████████▏ | 793/892 [1:36:01<08:08, 4.94s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:15:57,639 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:15:59,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:57,639 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 89%|██████████████████████████████████████████████████████████████████████▎ | 794/892 [1:36:05<07:39, 4.69s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:16:01,706 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 89%|██████████████████████████████████████████████████████████████████████▎ | 794/892 [1:36:05<07:39, 4.69s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:16:01,706 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:16:03,515 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:01,706 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 
89%|██████████████████████████████████████████████████████████████████████▍ | 795/892 [1:36:09<07:08, 4.41s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:16:05,405 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 89%|██████████████████████████████████████████████████████████████████████▍ | 795/892 [1:36:09<07:08, 4.41s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:16:05,405 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:16:07,053 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:05,405 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:16:07,053 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:05,405 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 89%|██████████████████████████████████████████████████████████████████████▍ | 796/892 [1:36:12<06:35, 4.12s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:16:08,705 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 89%|██████████████████████████████████████████████████████████████████████▍ | 796/892 [1:36:12<06:35, 4.12s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:16:08,705 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 89%|██████████████████████████████████████████████████████████████████████▌ | 797/892 [1:36:15<06:00, 3.79s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:16:11,649 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 
89%|██████████████████████████████████████████████████████████████████████▌ | 797/892 [1:36:15<06:00, 3.79s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:16:11,649 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 89%|██████████████████████████████████████████████████████████████████████▋ | 798/892 [1:36:18<05:26, 3.47s/it]g-point operations will not be computed-03 05:16:11,649 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:16:15,450 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:14,299 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:16:15,450 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:14,299 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:16:17,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:16,666 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:16:17,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:16,666 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 90%|██████████████████████████████████████████████████████████████████████▊ | 800/892 [1:36:23<04:37, 3.02s/it]g-point operations will not be computed-03 05:16:16,666 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 
90%|██████████████████████████████████████████████████████████████████████▊ | 800/892 [1:36:23<04:37, 3.02s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:16:20,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 90%|██████████████████████████████████████████████████████████████████████▊ | 800/892 [1:36:23<04:37, 3.02s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:16:20,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:16:24,659 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:20,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:16:24,659 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:20,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 90%|██████████████████████████████████████████████████████████████████████▉ | 801/892 [1:36:31<06:45, 4.46s/it]g-point operations will not be computed-03 05:16:20,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 90%|██████████████████████████████████████████████████████████████████████▉ | 801/892 [1:36:31<06:45, 4.46s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:16:32,046 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 
90%|███████████████████████████████████████████████████████████████████████ | 802/892 [1:36:38<07:58, 5.32s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 90%|███████████████████████████████████████████████████████████████████████ | 802/892 [1:36:38<07:58, 5.32s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.6433, 'learning_rate': 0.00023469387755102041, 'epoch': 0.9} + 90%|███████████████████████████████████████████████████████████████████████ | 802/892 [1:36:38<07:58, 5.32s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 90%|███████████████████████████████████████████████████████████████████████ | 802/892 [1:36:38<07:58, 5.32s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 90%|███████████████████████████████████████████████████████████████████████ | 802/892 [1:36:38<07:58, 5.32s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 90%|███████████████████████████████████████████████████████████████████████ | 803/892 [1:36:46<08:45, 5.90s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 90%|███████████████████████████████████████████████████████████████████████ | 803/892 [1:36:46<08:45, 5.90s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
+[WARNING|modeling_utils.py:388] 2022-03-03 05:16:46,564 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:16:46,564 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 90%|███████████████████████████████████████████████████████████████████████▏ | 804/892 [1:36:53<09:13, 6.29s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 90%|███████████████████████████████████████████████████████████████████████▏ | 804/892 [1:36:53<09:13, 6.29s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 90%|███████████████████████████████████████████████████████████████████████▏ | 804/892 [1:36:53<09:13, 6.29s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 90%|███████████████████████████████████████████████████████████████████████▏ | 804/892 [1:36:53<09:13, 6.29s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 90%|███████████████████████████████████████████████████████████████████████▏ | 804/892 [1:36:53<09:13, 6.29s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 
90%|███████████████████████████████████████████████████████████████████████▎ | 805/892 [1:37:00<09:32, 6.58s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 90%|███████████████████████████████████████████████████████████████████████▎ | 805/892 [1:37:00<09:32, 6.58s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:17:00,889 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 90%|███████████████████████████████████████████████████████████████████████▍ | 806/892 [1:37:07<09:39, 6.74s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 90%|███████████████████████████████████████████████████████████████████████▍ | 806/892 [1:37:07<09:39, 6.74s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9597, 'learning_rate': 0.00022448979591836734, 'epoch': 0.9} + 90%|███████████████████████████████████████████████████████████████████████▍ | 806/892 [1:37:07<09:39, 6.74s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 90%|███████████████████████████████████████████████████████████████████████▍ | 806/892 [1:37:07<09:39, 6.74s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 
90%|███████████████████████████████████████████████████████████████████████▍ | 806/892 [1:37:07<09:39, 6.74s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 90%|███████████████████████████████████████████████████████████████████████▍ | 807/892 [1:37:14<09:41, 6.85s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 90%|███████████████████████████████████████████████████████████████████████▍ | 807/892 [1:37:14<09:41, 6.85s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 90%|███████████████████████████████████████████████████████████████████████▍ | 807/892 [1:37:14<09:41, 6.85s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:17:16,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:17:16,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.6865, 'learning_rate': 0.00021938775510204082, 'epoch': 0.91} +[WARNING|modeling_utils.py:388] 2022-03-03 05:17:16,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not 
be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:17:16,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:17:16,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 91%|███████████████████████████████████████████████████████████████████████▋ | 809/892 [1:37:28<09:35, 6.93s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:17:27,300 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:17:27,300 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:17:27,300 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 91%|███████████████████████████████████████████████████████████████████████▋ | 810/892 [1:37:35<09:27, 6.92s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 
91%|███████████████████████████████████████████████████████████████████████▋ | 810/892 [1:37:35<09:27, 6.92s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 91%|███████████████████████████████████████████████████████████████████████▋ | 810/892 [1:37:35<09:27, 6.92s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:17:37,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:17:37,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.8973, 'learning_rate': 0.00021173469387755103, 'epoch': 0.91} +[WARNING|modeling_utils.py:388] 2022-03-03 05:17:37,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:17:37,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:17:37,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point
operations will not be computed + 91%|███████████████████████████████████████████████████████████████████████▉ | 812/892 [1:37:49<09:11, 6.90s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:17:47,994 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:17:47,994 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 91%|████████████████████████████████████████████████████████████████████████ | 813/892 [1:37:56<09:04, 6.89s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 91%|████████████████████████████████████████████████████████████████████████ | 813/892 [1:37:56<09:04, 6.89s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.5545, 'learning_rate': 0.0002066326530612245, 'epoch': 0.91} + 91%|████████████████████████████████████████████████████████████████████████ | 813/892 [1:37:56<09:04, 6.89s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:17:58,222 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:17:58,222 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.7104, 'learning_rate': 0.00020408163265306123, 'epoch': 0.91} +[WARNING|modeling_utils.py:388] 2022-03-03 05:17:58,222 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:17:58,222 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:17:58,222 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 91%|████████████████████████████████████████████████████████████████████████▏ | 815/892 [1:38:09<08:46, 6.84s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:18:08,343 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:18:08,343 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate 
the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:18:08,343 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 91%|████████████████████████████████████████████████████████████████████████▎ | 816/892 [1:38:16<08:36, 6.79s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 91%|████████████████████████████████████████████████████████████████████████▎ | 816/892 [1:38:16<08:36, 6.79s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:18:16,635 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 92%|████████████████████████████████████████████████████████████████████████▎ | 817/892 [1:38:23<08:25, 6.74s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 92%|████████████████████████████████████████████████████████████████████████▎ | 817/892 [1:38:23<08:25, 6.74s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.762, 'learning_rate': 0.00019642857142857144, 'epoch': 0.92} + 92%|████████████████████████████████████████████████████████████████████████▎ | 817/892 [1:38:23<08:25, 6.74s/it]g-point operations will not be computed-03 05:16:28,444 >> Could 
not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:18:24,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:18:24,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.6711, 'learning_rate': 0.00019387755102040816, 'epoch': 0.92} +[WARNING|modeling_utils.py:388] 2022-03-03 05:18:24,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:18:31,463 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:18:31,463 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.7525, 'learning_rate': 0.0001913265306122449, 'epoch': 0.92} +[WARNING|modeling_utils.py:388] 2022-03-03 05:18:31,463 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:18:31,463 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:18:31,463 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 92%|████████████████████████████████████████████████████████████████████████▌ | 820/892 [1:38:42<07:57, 6.63s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 92%|████████████████████████████████████████████████████████████████████████▌ | 820/892 [1:38:42<07:57, 6.63s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:18:43,024 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 92%|████████████████████████████████████████████████████████████████████████▋ | 821/892 [1:38:49<07:50, 6.63s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 92%|████████████████████████████████████████████████████████████████████████▋ | 821/892 [1:38:49<07:50, 6.63s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.628, 'learning_rate': 0.0001862244897959184, 'epoch': 0.92} +[WARNING|modeling_utils.py:388] 2022-03-03 
05:18:49,507 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 92%|████████████████████████████████████████████████████████████████████████▊ | 822/892 [1:38:56<07:41, 6.60s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 92%|████████████████████████████████████████████████████████████████████████▊ | 822/892 [1:38:56<07:41, 6.60s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.4963, 'learning_rate': 0.00018367346938775512, 'epoch': 0.92} + 92%|████████████████████████████████████████████████████████████████████████▊ | 822/892 [1:38:56<07:41, 6.60s/it]g-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:18:57,628 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:18:57,628 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.6785, 'learning_rate': 0.00018112244897959185, 'epoch': 0.92} +[WARNING|modeling_utils.py:388] 2022-03-03 05:18:57,628 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:19:03,965 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:19:03,965 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.6588, 'learning_rate': 0.00017857142857142857, 'epoch': 0.92} +[WARNING|modeling_utils.py:388] 2022-03-03 05:19:03,965 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:19:03,965 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 92%|█████████████████████████████████████████████████████████████████████████ | 825/892 [1:39:15<07:21, 6.59s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:19:12,516 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 92%|█████████████████████████████████████████████████████████████████████████ | 825/892 [1:39:15<07:21, 6.59s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:19:12,516 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 5.0121, 'learning_rate': 0.00017602040816326532, 'epoch': 0.92} + 92%|█████████████████████████████████████████████████████████████████████████ | 825/892 [1:39:15<07:21, 
6.59s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:19:12,516 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 92%|█████████████████████████████████████████████████████████████████████████ | 825/892 [1:39:15<07:21, 6.59s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:19:12,516 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 93%|█████████████████████████████████████████████████████████████████████████▏ | 826/892 [1:39:22<07:10, 6.52s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:19:18,751 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 93%|█████████████████████████████████████████████████████████████████████████▏ | 826/892 [1:39:22<07:10, 6.52s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:19:18,751 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 93%|█████████████████████████████████████████████████████████████████████████▏ | 826/892 [1:39:22<07:10, 6.52s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:19:18,751 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 93%|█████████████████████████████████████████████████████████████████████████▏ | 826/892 [1:39:22<07:10, 6.52s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:19:18,751 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 93%|█████████████████████████████████████████████████████████████████████████▏ | 827/892 [1:39:28<06:57, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:19:24,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 93%|█████████████████████████████████████████████████████████████████████████▏ | 827/892 [1:39:28<06:57, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:19:24,910 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed + 93%|█████████████████████████████████████████████████████████████████████████▏ | 827/892 [1:39:28<06:57, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:19:24,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 93%|█████████████████████████████████████████████████████████████████████████▏ | 827/892 [1:39:28<06:57, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:19:24,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 93%|█████████████████████████████████████████████████████████████████████████▎ | 828/892 [1:39:34<06:45, 6.34s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:19:31,080 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 93%|█████████████████████████████████████████████████████████████████████████▎ | 828/892 [1:39:34<06:45, 6.34s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:19:31,080 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 93%|█████████████████████████████████████████████████████████████████████████▎ | 828/892 [1:39:34<06:45, 6.34s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:19:31,080 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 93%|█████████████████████████████████████████████████████████████████████████▎ | 828/892 [1:39:34<06:45, 6.34s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:19:31,080 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 93%|█████████████████████████████████████████████████████████████████████████▍ | 829/892 [1:39:40<06:34, 6.26s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:19:37,122 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed + 93%|█████████████████████████████████████████████████████████████████████████▍ | 829/892 [1:39:40<06:34, 6.26s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:19:37,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 93%|█████████████████████████████████████████████████████████████████████████▍ | 829/892 [1:39:40<06:34, 6.26s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:19:37,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 93%|█████████████████████████████████████████████████████████████████████████▍ | 829/892 [1:39:40<06:34, 6.26s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:19:37,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 93%|█████████████████████████████████████████████████████████████████████████▌ | 830/892 [1:39:46<06:24, 6.20s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 93%|█████████████████████████████████████████████████████████████████████████▌ | 830/892 [1:39:46<06:24, 6.20s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:19:47,665 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:19:47,665 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 
4.4503, 'learning_rate': 0.00016071428571428573, 'epoch': 0.93} +[WARNING|modeling_utils.py:388] 2022-03-03 05:19:47,665 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:19:53,680 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:19:53,680 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.5233, 'learning_rate': 0.00015816326530612246, 'epoch': 0.93} +[WARNING|modeling_utils.py:388] 2022-03-03 05:19:53,680 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:19:59,733 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:19:59,733 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.7851, 'learning_rate': 0.00015561224489795918, 'epoch': 0.93} +[WARNING|modeling_utils.py:388] 2022-03-03 05:19:59,733 >> Could not estimate the number of tokens 
of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:05,542 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:05,542 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.5912, 'learning_rate': 0.00015306122448979594, 'epoch': 0.93} +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:09,795 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:09,795 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 94%|█████████████████████████████████████████████████████████████████████████▉ | 835/892 [1:40:16<05:36, 5.90s/it]g-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 94%|█████████████████████████████████████████████████████████████████████████▉ | 835/892 [1:40:16<05:36, 5.90s/it]g-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:15,471 >> Could 
not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:15,471 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 94%|██████████████████████████████████████████████████████████████████████████ | 836/892 [1:40:21<05:26, 5.82s/it]g-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:19,656 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:22,281 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:22,281 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.5167, 'learning_rate': 0.00014540816326530611, 'epoch': 0.94} +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:26,276 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
+[WARNING|modeling_utils.py:388] 2022-03-03 05:20:26,276 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 94%|██████████████████████████████████████████████████████████████████████████▏ | 838/892 [1:40:32<05:00, 5.57s/it]g-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:30,147 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:32,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:32,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.5436, 'learning_rate': 0.0001403061224489796, 'epoch': 0.94} +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:36,243 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:36,243 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:19:43,197 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed + 94%|██████████████████████████████████████████████████████████████████████████▍ | 840/892 [1:40:42<04:32, 5.23s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 94%|██████████████████████████████████████████████████████████████████████████▍ | 840/892 [1:40:42<04:32, 5.23s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:41,936 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:41,936 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:44,137 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:46,193 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:46,193 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:48,249 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:48,249 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:50,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:52,111 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:52,111 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:53,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:57,154 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
+[WARNING|modeling_utils.py:388] 2022-03-03 05:20:57,154 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:58,733 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:20:58,733 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:00,168 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:02,912 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:02,912 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:04,280 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 
2022-03-03 05:21:04,280 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:06,663 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:06,663 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:08,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:10,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:10,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.7704, 'learning_rate': 0.00011224489795918367, 'epoch': 0.95} +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:14,447 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:14,447 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:14,447 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:18,131 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:18,131 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:21,886 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:25,488 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:25,488 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9124, 
'learning_rate': 0.00010714285714285714, 'epoch': 0.96} +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:29,193 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:29,193 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:29,193 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 96%|███████████████████████████████████████████████████████████████████████████▌ | 853/892 [1:41:37<03:47, 5.84s/it]g-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 96%|███████████████████████████████████████████████████████████████████████████▌ | 853/892 [1:41:37<03:47, 5.84s/it]g-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 96%|███████████████████████████████████████████████████████████████████████████▌ | 853/892 [1:41:37<03:47, 5.84s/it]g-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:39,955 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will 
not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:39,955 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.6014, 'learning_rate': 0.00010204081632653062, 'epoch': 0.96} +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:39,955 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:39,955 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:39,955 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 96%|███████████████████████████████████████████████████████████████████████████▋ | 855/892 [1:41:52<04:01, 6.51s/it]g-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 96%|███████████████████████████████████████████████████████████████████████████▋ | 855/892 [1:41:52<04:01, 6.51s/it]g-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:21:52,490 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed + 96%|███████████████████████████████████████████████████████████████████████████▊ | 856/892 [1:41:59<04:00, 6.69s/it]g-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 96%|███████████████████████████████████████████████████████████████████████████▊ | 856/892 [1:41:59<04:00, 6.69s/it]g-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.5992, 'learning_rate': 9.693877551020408e-05, 'epoch': 0.96} + 96%|███████████████████████████████████████████████████████████████████████████▊ | 856/892 [1:41:59<04:00, 6.69s/it]g-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 96%|███████████████████████████████████████████████████████████████████████████▊ | 856/892 [1:41:59<04:00, 6.69s/it]g-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 96%|███████████████████████████████████████████████████████████████████████████▊ | 856/892 [1:41:59<04:00, 6.69s/it]g-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 96%|███████████████████████████████████████████████████████████████████████████▉ | 857/892 [1:42:06<03:58, 6.81s/it]g-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:22:04,869 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:22:04,869 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 96%|███████████████████████████████████████████████████████████████████████████▉ | 858/892 [1:42:13<03:53, 6.86s/it]g-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 96%|███████████████████████████████████████████████████████████████████████████▉ | 858/892 [1:42:13<03:53, 6.86s/it]g-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.6767, 'learning_rate': 9.183673469387756e-05, 'epoch': 0.96} + 96%|███████████████████████████████████████████████████████████████████████████▉ | 858/892 [1:42:13<03:53, 6.86s/it]g-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:22:15,246 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:22:15,246 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.4785, 'learning_rate': 8.928571428571429e-05, 'epoch': 0.96} +[WARNING|modeling_utils.py:388] 2022-03-03 05:22:15,246 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:22:15,246 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 96%|████████████████████████████████████████████████████████████████████████████▏ | 860/892 [1:42:27<03:39, 6.87s/it]g-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 96%|████████████████████████████████████████████████████████████████████████████▏ | 860/892 [1:42:27<03:39, 6.87s/it]g-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.6034, 'learning_rate': 8.673469387755102e-05, 'epoch': 0.96} +[WARNING|modeling_utils.py:388] 2022-03-03 05:22:27,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 97%|████████████████████████████████████████████████████████████████████████████▎ | 861/892 [1:42:33<03:32, 6.85s/it]g-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 97%|████████████████████████████████████████████████████████████████████████████▎ | 861/892 [1:42:33<03:32, 6.85s/it]g-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.926, 'learning_rate': 8.418367346938775e-05, 
'epoch': 0.97} + 97%|████████████████████████████████████████████████████████████████████████████▎ | 861/892 [1:42:33<03:32, 6.85s/it]g-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:22:35,638 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:22:35,638 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.6059, 'learning_rate': 8.163265306122449e-05, 'epoch': 0.97} +[WARNING|modeling_utils.py:388] 2022-03-03 05:22:35,638 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:22:35,638 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 97%|████████████████████████████████████████████████████████████████████████████▍ | 863/892 [1:42:47<03:17, 6.81s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 97%|████████████████████████████████████████████████████████████████████████████▍ | 863/892 [1:42:47<03:17, 6.81s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:22:44,150 >> Could not estimate the number of tokens of the
input, floating-point operations will not be computed +{'loss': 4.6713, 'learning_rate': 7.908163265306123e-05, 'epoch': 0.97} + 97%|████████████████████████████████████████████████████████████████████████████▍ | 863/892 [1:42:47<03:17, 6.81s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 97%|████████████████████████████████████████████████████████████████████████████▌ | 864/892 [1:42:54<03:10, 6.79s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 97%|████████████████████████████████████████████████████████████████████████████▌ | 864/892 [1:42:54<03:10, 6.79s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.9771, 'learning_rate': 7.653061224489797e-05, 'epoch': 0.97} +[WARNING|modeling_utils.py:388] 2022-03-03 05:22:54,218 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 97%|████████████████████████████████████████████████████████████████████████████▌ | 865/892 [1:43:00<03:02, 6.76s/it]g-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 97%|████████████████████████████████████████████████████████████████████████████▌ | 865/892 [1:43:00<03:02, 6.76s/it]g-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.7271, 'learning_rate': 7.39795918367347e-05, 'epoch': 0.97} + 
97%|████████████████████████████████████████████████████████████████████████████▌ | 865/892 [1:43:00<03:02, 6.76s/it]g-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:23:02,543 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:23:02,543 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.7206, 'learning_rate': 7.142857142857142e-05, 'epoch': 0.97} +[WARNING|modeling_utils.py:388] 2022-03-03 05:23:02,543 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:23:02,543 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:23:02,543 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 97%|████████████████████████████████████████████████████████████████████████████▊ | 867/892 [1:43:14<02:47, 6.68s/it]g-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:23:12,420 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:23:12,420 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:23:12,420 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 97%|████████████████████████████████████████████████████████████████████████████▊ | 868/892 [1:43:20<02:39, 6.64s/it]g-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 97%|████████████████████████████████████████████████████████████████████████████▊ | 868/892 [1:43:20<02:39, 6.64s/it]g-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:23:20,605 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:23:20,605 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not 
be computed + 97%|████████████████████████████████████████████████████████████████████████████▉ | 869/892 [1:43:27<02:32, 6.62s/it]g-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 97%|████████████████████████████████████████████████████████████████████████████▉ | 869/892 [1:43:27<02:32, 6.62s/it]g-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:23:27,063 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:23:27,063 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 98%|█████████████████████████████████████████████████████████████████████████████ | 870/892 [1:43:33<02:24, 6.57s/it]g-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 98%|█████████████████████████████████████████████████████████████████████████████ | 870/892 [1:43:33<02:24, 6.57s/it]g-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 98%|█████████████████████████████████████████████████████████████████████████████ | 870/892 [1:43:33<02:24, 6.57s/it]g-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
+[WARNING|modeling_utils.py:388] 2022-03-03 05:23:35,050 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:23:35,050 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.5866, 'learning_rate': 5.8673469387755104e-05, 'epoch': 0.98} +[WARNING|modeling_utils.py:388] 2022-03-03 05:23:35,050 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:23:41,400 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:23:41,400 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.6059, 'learning_rate': 5.6122448979591836e-05, 'epoch': 0.98} +[WARNING|modeling_utils.py:388] 2022-03-03 05:23:41,400 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:23:47,770 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 
05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:23:47,770 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.6245, 'learning_rate': 5.357142857142857e-05, 'epoch': 0.98} +[WARNING|modeling_utils.py:388] 2022-03-03 05:23:47,770 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:23:47,770 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:23:47,770 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 98%|█████████████████████████████████████████████████████████████████████████████▍ | 874/892 [1:43:58<01:54, 6.38s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:23:55,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 98%|█████████████████████████████████████████████████████████████████████████████▍ | 874/892 [1:43:58<01:54, 6.38s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:23:55,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 98%|█████████████████████████████████████████████████████████████████████████████▍ | 874/892 [1:43:58<01:54, 
6.38s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:23:55,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 98%|█████████████████████████████████████████████████████████████████████████████▍ | 874/892 [1:43:58<01:54, 6.38s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:23:55,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 98%|█████████████████████████████████████████████████████████████████████████████▍ | 875/892 [1:44:06<01:51, 6.56s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:23:55,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 98%|█████████████████████████████████████████████████████████████████████████████▍ | 875/892 [1:44:06<01:51, 6.56s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:23:55,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:24:05,688 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:23:55,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:24:05,688 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:23:55,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 98%|█████████████████████████████████████████████████████████████████████████████▌ | 876/892 [1:44:12<01:43, 6.45s/it]g-point operations will not be computed-03 05:23:55,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 98%|█████████████████████████████████████████████████████████████████████████████▌ | 876/892 [1:44:12<01:43, 6.45s/it]g-point operations will
not be computed-03 05:23:55,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:24:11,744 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:23:55,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:24:11,744 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:23:55,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 98%|█████████████████████████████████████████████████████████████████████████████▋ | 877/892 [1:44:18<01:34, 6.32s/it]g-point operations will not be computed-03 05:23:55,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:24:16,239 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:23:55,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:24:16,239 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:23:55,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:24:16,239 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:23:55,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 98%|█████████████████████████████████████████████████████████████████████████████▊ | 878/892 [1:44:24<01:26, 6.21s/it]g-point operations will not be computed-03 05:23:55,643 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:24:22,131 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:23:55,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:24:22,131 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:23:55,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:24:22,131 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:23:55,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 99%|█████████████████████████████████████████████████████████████████████████████▊ | 879/892 [1:44:29<01:19, 6.08s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:24:26,456 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 99%|█████████████████████████████████████████████████████████████████████████████▊ | 879/892 [1:44:29<01:19, 6.08s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:24:26,456 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:24:30,581 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:24:26,456 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:24:30,581 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:24:26,456 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed +{'loss': 4.9238, 'learning_rate': 3.571428571428571e-05, 'epoch': 0.99} +[WARNING|modeling_utils.py:388] 2022-03-03 05:24:34,829 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:24:26,456 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 99%|██████████████████████████████████████████████████████████████████████████████ | 881/892 [1:44:41<01:04, 5.84s/it]g-point operations will not be computed-03 05:24:26,456 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 99%|██████████████████████████████████████████████████████████████████████████████ | 881/892 [1:44:41<01:04, 5.84s/it]g-point operations will not be computed-03 05:24:26,456 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.4336, 'learning_rate': 3.316326530612245e-05, 'epoch': 0.99} +[WARNING|modeling_utils.py:388] 2022-03-03 05:24:40,272 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:24:26,456 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:24:40,272 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:24:26,456 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 99%|██████████████████████████████████████████████████████████████████████████████ | 882/892 [1:44:46<00:57, 5.71s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:24:42,942 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 99%|██████████████████████████████████████████████████████████████████████████████ | 882/892 
[1:44:46<00:57, 5.71s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:24:42,942 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:24:46,779 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:24:42,942 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:24:46,779 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:24:42,942 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.5835, 'learning_rate': 2.8061224489795918e-05, 'epoch': 0.99} +[WARNING|modeling_utils.py:388] 2022-03-03 05:24:50,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:24:42,942 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:24:50,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:24:42,942 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 99%|██████████████████████████████████████████████████████████████████████████████▎| 884/892 [1:44:56<00:42, 5.35s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:24:52,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:24:55,082 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:24:52,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 
05:24:55,082 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:24:52,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 99%|██████████████████████████████████████████████████████████████████████████████▍| 885/892 [1:45:01<00:35, 5.10s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:24:57,308 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:24:59,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:24:57,308 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 99%|██████████████████████████████████████████████████████████████████████████████▍| 886/892 [1:45:05<00:29, 4.83s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:25:01,460 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 99%|██████████████████████████████████████████████████████████████████████████████▍| 886/892 [1:45:05<00:29, 4.83s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:25:01,460 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:25:03,354 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:25:01,460 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:25:03,354 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:25:01,460 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed + 
99%|██████████████████████████████████████████████████████████████████████████████▌| 887/892 [1:45:09<00:22, 4.55s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:25:05,256 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:25:06,977 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:25:05,256 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:25:06,977 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:25:05,256 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +100%|██████████████████████████████████████████████████████████████████████████████▋| 888/892 [1:45:12<00:16, 4.24s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:25:08,651 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +100%|██████████████████████████████████████████████████████████████████████████████▋| 889/892 [1:45:15<00:11, 3.93s/it]g-point operations will not be computed-03 05:25:08,651 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +100%|██████████████████████████████████████████████████████████████████████████████▋| 889/892 [1:45:15<00:11, 3.93s/it]g-point operations will not be computed-03 05:25:08,651 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 2022-03-03 05:25:13,172 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:25:11,799 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[WARNING|modeling_utils.py:388] 
2022-03-03 05:25:13,172 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:25:11,799 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +100%|██████████████████████████████████████████████████████████████████████████████▊| 890/892 [1:45:18<00:07, 3.60s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:25:14,508 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +100%|██████████████████████████████████████████████████████████████████████████████▊| 890/892 [1:45:18<00:07, 3.60s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:25:14,508 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +100%|██████████████████████████████████████████████████████████████████████████████▉| 891/892 [1:45:21<00:03, 3.24s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:25:16,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +100%|██████████████████████████████████████████████████████████████████████████████▉| 891/892 [1:45:21<00:03, 3.24s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:25:16,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +{'loss': 4.2487, 'learning_rate': 5.102040816326531e-06, 'epoch': 1.0} +[INFO|trainer.py:2114] 2022-03-03 05:25:18,320 >> Saving model checkpoint to ./=)███| 892/892 [1:45:23<00:00, 2.87s/it][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed +[INFO|trainer.py:2114] 2022-03-03 05:25:34,749 >> Saving model checkpoint to ./ ./pytorch_model.bin:23<00:00, 2.87s/it][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
+[INFO|modeling_utils.py:1081] 2022-03-03 05:25:51,178 >> Model weights saved in ./pytorch_model.bin:23<00:00, 2.87s/it][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed