0%| | 0/892 [00:00> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:39:59,420 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:40:01,391 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7965, 'learning_rate': 0.0, 'epoch': 0.0} [WARNING|modeling_utils.py:388] 2022-03-03 03:40:03,254 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 0%| | 1/892 [00:08<2:01:09, 8.16s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:40:05,163 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:40:06,993 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:40:08,826 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.0502, 'learning_rate': 0.0, 'epoch': 0.0} [WARNING|modeling_utils.py:388] 2022-03-03 03:40:10,745 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 0%|▏ | 2/892 [00:15<1:55:12, 7.77s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:40:12,607 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:40:14,440 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:40:16,244 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9118, 'learning_rate': 2e-06, 'epoch': 0.0} [WARNING|modeling_utils.py:388] 2022-03-03 
03:40:18,063 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 0%|▎ | 3/892 [00:23<1:52:51, 7.62s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:40:20,009 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:40:21,814 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:40:23,612 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8924, 'learning_rate': 4e-06, 'epoch': 0.0} [WARNING|modeling_utils.py:388] 2022-03-03 03:40:25,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 0%|▎ | 4/892 [00:30<1:50:39, 7.48s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:40:27,321 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:40:29,111 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:40:30,880 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.713, 'learning_rate': 6e-06, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-03 03:40:32,632 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▍ | 5/892 [00:37<1:49:17, 7.39s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:40:34,548 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:40:36,343 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:40:38,147 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7862, 'learning_rate': 8e-06, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-03 03:40:39,891 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▌ | 6/892 [00:44<1:48:29, 7.35s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:40:41,784 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:40:43,517 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:40:45,281 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7066, 'learning_rate': 1e-05, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-03 03:40:47,099 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 7/892 [00:52<1:47:41, 7.30s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:40:49,006 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:40:50,749 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:40:52,520 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:40:54,288 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5681, 'learning_rate': 1.2e-05, 'epoch': 0.01} 1%|▋ | 8/892 [00:59<1:47:03, 7.27s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:40:56,169 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-03 03:40:57,908 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:40:59,672 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6685, 'learning_rate': 1.4e-05, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-03 03:41:01,402 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▊ | 9/892 [01:06<1:46:12, 7.22s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:41:03,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:41:05,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:41:06,784 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:41:08,504 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4888, 'learning_rate': 1.6e-05, 'epoch': 0.01} 1%|▉ | 10/892 [01:13<1:45:34, 7.18s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:41:10,366 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:41:12,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:41:13,792 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.566, 'learning_rate': 1.8e-05, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-03 03:41:15,483 >> Could not estimate the number of tokens of the input, floating-point operations will not 
be computed 1%|▉ | 11/892 [01:20<1:44:32, 7.12s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:41:17,298 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:41:18,995 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:41:20,718 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:41:22,412 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|█ | 12/892 [01:27<1:43:34, 7.06s/it] 1%|█ | 12/892 [01:27<1:43:34, 7.06s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:41:24,244 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:41:26,016 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:41:27,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4308, 'learning_rate': 2.2e-05, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-03 03:41:29,495 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|█▏ | 13/892 [01:34<1:43:33, 7.07s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:41:31,319 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:41:33,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:41:34,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-03 03:41:36,547 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▎ | 14/892 [01:41<1:43:21, 7.06s/it] 2%|█▎ | 14/892 [01:41<1:43:21, 7.06s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:41:38,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:41:40,160 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:41:41,841 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3274, 'learning_rate': 2.6e-05, 'epoch': 0.02} [WARNING|modeling_utils.py:388] 2022-03-03 03:41:43,515 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▎ | 15/892 [01:48<1:42:49, 7.04s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:41:45,302 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:41:47,010 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:41:48,688 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:41:50,355 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▍ | 16/892 [01:55<1:41:50, 6.98s/it] 2%|█▍ | 16/892 [01:55<1:41:50, 6.98s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:41:52,141 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:41:53,817 >> Could not estimate the number of tokens of the input, floating-point operations 
will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:41:55,524 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:41:57,194 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▌ | 17/892 [02:02<1:41:07, 6.93s/it] 2%|█▌ | 17/892 [02:02<1:41:07, 6.93s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:41:58,940 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:42:00,565 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:42:02,250 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5591, 'learning_rate': 3.2e-05, 'epoch': 0.02} [WARNING|modeling_utils.py:388] 2022-03-03 03:42:03,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▌ | 18/892 [02:08<1:40:05, 6.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:42:05,679 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:42:07,344 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:42:09,026 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2522, 'learning_rate': 3.4000000000000007e-05, 'epoch': 0.02} [WARNING|modeling_utils.py:388] 2022-03-03 03:42:10,708 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▋ | 19/892 [02:15<1:39:38, 6.85s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:42:12,477 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:42:14,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:42:15,737 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:42:17,386 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 20/892 [02:22<1:38:46, 6.80s/it] 2%|█▊ | 20/892 [02:22<1:38:46, 6.80s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:42:19,099 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:42:20,734 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:42:22,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:42:23,901 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▉ | 21/892 [02:28<1:37:26, 6.71s/it] 2%|█▉ | 21/892 [02:28<1:37:26, 6.71s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:42:25,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:42:27,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:42:28,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:42:30,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
{'loss': 4.3206, 'learning_rate': 4e-05, 'epoch': 0.02} 2%|█▉ | 22/892 [02:35<1:36:27, 6.65s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:42:32,088 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:42:33,666 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:42:35,291 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3652, 'learning_rate': 4.2000000000000004e-05, 'epoch': 0.03} [WARNING|modeling_utils.py:388] 2022-03-03 03:42:36,867 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██ | 23/892 [02:41<1:35:29, 6.59s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:42:38,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:42:40,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:42:41,798 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:42:43,381 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▏ | 24/892 [02:48<1:35:02, 6.57s/it] 3%|██▏ | 24/892 [02:48<1:35:02, 6.57s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:42:45,049 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:42:46,604 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:42:48,173 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:42:50,259 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
3%|██▏ | 25/892 [02:55<1:36:15, 6.66s/it] 3%|██▏ | 25/892 [02:55<1:36:15, 6.66s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:42:51,987 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:42:53,546 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:42:56,681 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
3%|██▎ | 26/892 [03:01<1:35:06, 6.59s/it] 3%|██▎ | 26/892 [03:01<1:35:06, 6.59s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:42:58,358 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
3%|██▍ | 27/892 [03:07<1:33:54, 6.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:43:04,653 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:43:07,775 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
3%|██▌ | 28/892 [03:14<1:32:58, 6.46s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:43:11,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1507, 'learning_rate': 5.2e-05, 'epoch': 0.03} [WARNING|modeling_utils.py:388] 2022-03-03 03:43:14,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
3%|██▌ | 29/892 [03:20<1:32:08, 6.41s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:43:17,275 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1699, 'learning_rate': 5.4e-05, 'epoch': 0.03} [WARNING|modeling_utils.py:388] 2022-03-03 03:43:20,284 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
3%|██▋ | 30/892 [03:26<1:30:54, 6.33s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:43:23,386 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3846, 'learning_rate': 5.6e-05, 'epoch': 0.03} [WARNING|modeling_utils.py:388] 2022-03-03 03:43:26,368 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
3%|██▊ | 31/892 [03:32<1:29:47, 6.26s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:43:29,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|██▊ | 32/892 [03:38<1:28:43, 6.19s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:43:35,487 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|██▉ | 33/892 [03:44<1:27:39, 6.12s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:43:41,393 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:43:44,213 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███ | 34/892 [03:50<1:25:55, 6.01s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:43:47,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:43:49,905 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███▏ | 35/892 [03:56<1:24:24, 5.91s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:43:52,807 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:43:55,575 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███▏ | 36/892 [04:01<1:23:11, 5.83s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:43:58,386 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███▎ | 37/892 [04:07<1:21:19, 5.71s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:44:03,781 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:44:06,411 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███▍ | 38/892 [04:12<1:19:45, 5.60s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:44:09,103 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███▍ | 39/892 [04:17<1:18:07, 5.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:44:14,320 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:44:16,782 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███▌ | 40/892 [04:22<1:16:06, 5.36s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:44:19,316 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|███▋ | 41/892 [04:27<1:14:14, 5.23s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:44:24,184 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|███▊ | 42/892 [04:32<1:11:16, 5.03s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:44:28,688 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|███▊ | 43/892 [04:36<1:08:06, 4.81s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:44:32,913 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|███▉ | 44/892 [04:40<1:04:25, 4.56s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:44:36,781 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|████ | 45/892 [04:44<1:00:07, 4.26s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:44:40,237 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:44:41,807 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|████▏ | 46/892 [04:47<55:39, 3.95s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:44:43,380 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|████▎ | 47/892 [04:50<51:13, 3.64s/it]
5%|████▍ | 48/892 [04:53<46:47, 3.33s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:44:46,210 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:44:48,771 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:44:49,868 >> Could not estimate the number of tokens of the input, floating-point operations will not
be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:44:52,175 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:44:51,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 50/892 [04:58<41:40, 2.97s/it]g-point operations will not be computed-03 03:44:51,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 50/892 [04:58<41:40, 2.97s/it]g-point operations will not be computed-03 03:44:51,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 50/892 [04:58<41:40, 2.97s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:44:55,307 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 50/892 [04:58<41:40, 2.97s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:44:55,307 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:44:59,047 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:44:55,307 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 51/892 [05:05<1:01:11, 4.37s/it]g-point operations will not be computed-03 03:44:55,307 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 51/892 [05:05<1:01:11, 4.37s/it]g-point operations will not be computed-03 03:44:55,307 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 51/892 [05:05<1:01:11, 4.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:45:02,790 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 
51/892 [05:05<1:01:11, 4.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:45:02,790 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 52/892 [05:13<1:13:38, 5.26s/it]g-point operations will not be computed-03 03:45:02,790 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 52/892 [05:13<1:13:38, 5.26s/it]g-point operations will not be computed-03 03:45:02,790 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 52/892 [05:13<1:13:38, 5.26s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:45:10,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 52/892 [05:13<1:13:38, 5.26s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:45:10,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:45:13,772 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:45:10,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 53/892 [05:20<1:22:24, 5.89s/it]g-point operations will not be computed-03 03:45:10,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 53/892 [05:20<1:22:24, 5.89s/it]g-point operations will not be computed-03 03:45:10,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 53/892 [05:20<1:22:24, 5.89s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:45:17,462 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 53/892 [05:20<1:22:24, 5.89s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:45:17,462 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:45:20,984 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:45:17,462 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:45:20,984 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:45:17,462 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 54/892 [05:27<1:27:40, 6.28s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:45:24,637 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 54/892 [05:27<1:27:40, 6.28s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:45:24,637 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:45:28,175 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:45:24,637 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 55/892 [05:34<1:31:15, 6.54s/it]g-point operations will not be computed-03 03:45:24,637 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 55/892 [05:34<1:31:15, 6.54s/it]g-point operations will not be computed-03 03:45:24,637 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 55/892 [05:34<1:31:15, 6.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:45:31,863 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 55/892 [05:34<1:31:15, 
6.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:45:31,863 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:45:35,365 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:45:31,863 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:45:35,365 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:45:31,863 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|█████ | 56/892 [05:42<1:33:52, 6.74s/it]g-point operations will not be computed-03 03:45:31,863 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|█████ | 56/892 [05:42<1:33:52, 6.74s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:45:38,997 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:45:42,461 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:45:38,997 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|█████ | 57/892 [05:49<1:35:24, 6.86s/it]g-point operations will not be computed-03 03:45:38,997 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|█████ | 57/892 [05:49<1:35:24, 6.86s/it]g-point operations will not be computed-03 03:45:38,997 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|█████ | 57/892 [05:49<1:35:24, 6.86s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:45:46,082 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed 6%|█████ | 57/892 [05:49<1:35:24, 6.86s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:45:46,082 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:45:49,583 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:45:46,082 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:45:49,583 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:45:46,082 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▏ | 58/892 [05:56<1:36:04, 6.91s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:45:53,151 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▏ | 58/892 [05:56<1:36:04, 6.91s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:45:53,151 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:45:56,629 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:45:53,151 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 59/892 [06:03<1:36:45, 6.97s/it]g-point operations will not be computed-03 03:45:53,151 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 59/892 [06:03<1:36:45, 6.97s/it]g-point operations will not be computed-03 03:45:53,151 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 59/892 [06:03<1:36:45, 6.97s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:46:00,183 >> Could 
not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 59/892 [06:03<1:36:45, 6.97s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:46:00,183 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:46:03,649 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:46:00,183 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:46:03,649 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:46:00,183 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 60/892 [06:10<1:36:37, 6.97s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:46:07,180 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 60/892 [06:10<1:36:37, 6.97s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:46:07,180 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:46:10,596 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:46:07,180 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 61/892 [06:17<1:36:21, 6.96s/it]g-point operations will not be computed-03 03:46:07,180 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 61/892 [06:17<1:36:21, 6.96s/it]g-point operations will not be computed-03 03:46:07,180 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 61/892 [06:17<1:36:21, 
6.96s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:46:14,100 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 61/892 [06:17<1:36:21, 6.96s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:46:14,100 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 62/892 [06:24<1:36:06, 6.95s/it]g-point operations will not be computed-03 03:46:14,100 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 62/892 [06:24<1:36:06, 6.95s/it]g-point operations will not be computed-03 03:46:14,100 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 62/892 [06:24<1:36:06, 6.95s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:46:20,987 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 62/892 [06:24<1:36:06, 6.95s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:46:20,987 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:46:24,316 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:46:20,987 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 63/892 [06:30<1:35:13, 6.89s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:46:27,747 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 63/892 [06:30<1:35:13, 6.89s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:46:27,747 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2169, 'learning_rate': 0.000122, 'epoch': 0.07} [WARNING|modeling_utils.py:388] 2022-03-03 03:46:31,135 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:46:27,747 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 64/892 [06:37<1:34:54, 6.88s/it]g-point operations will not be computed-03 03:46:27,747 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 64/892 [06:37<1:34:54, 6.88s/it]g-point operations will not be computed-03 03:46:27,747 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 64/892 [06:37<1:34:54, 6.88s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:46:34,603 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:46:37,908 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:46:34,603 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:46:37,908 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:46:34,603 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▊ | 65/892 [06:44<1:34:15, 6.84s/it]g-point operations will not be computed-03 03:46:34,603 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▊ | 65/892 [06:44<1:34:15, 6.84s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:46:41,300 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:46:44,551 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:46:41,300 >> Could not estimate 
the number of tokens of the input, floating-point operations will not be computed 7%|█████▉ | 66/892 [06:51<1:33:17, 6.78s/it]g-point operations will not be computed-03 03:46:41,300 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▉ | 66/892 [06:51<1:33:17, 6.78s/it]g-point operations will not be computed-03 03:46:41,300 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▉ | 66/892 [06:51<1:33:17, 6.78s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:46:47,947 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▉ | 66/892 [06:51<1:33:17, 6.78s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:46:47,947 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:46:51,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:46:47,947 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:46:51,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:46:47,947 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 67/892 [06:57<1:32:59, 6.76s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:46:54,687 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 67/892 [06:57<1:32:59, 6.76s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:46:54,687 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:46:57,943 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed-03 03:46:54,687 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:46:57,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:46:54,687 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 68/892 [07:04<1:32:36, 6.74s/it]g-point operations will not be computed-03 03:46:54,687 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 68/892 [07:04<1:32:36, 6.74s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:47:01,351 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:47:04,633 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:47:01,351 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:47:04,633 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:47:01,351 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 69/892 [07:11<1:32:11, 6.72s/it]g-point operations will not be computed-03 03:47:01,351 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 69/892 [07:11<1:32:11, 6.72s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:47:08,087 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 69/892 [07:11<1:32:11, 6.72s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:47:08,087 >> Could not estimate the number of tokens of the input, floating-point operations will 
not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:47:11,330 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:47:08,087 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:47:11,330 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:47:08,087 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 70/892 [07:17<1:31:42, 6.69s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:47:14,657 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 70/892 [07:17<1:31:42, 6.69s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:47:14,657 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:47:17,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:47:14,657 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:47:17,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:47:14,657 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 71/892 [07:24<1:31:02, 6.65s/it]g-point operations will not be computed-03 03:47:14,657 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 71/892 [07:24<1:31:02, 6.65s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:47:21,230 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 
03:47:24,447 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:47:21,230 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:47:24,447 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:47:21,230 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▍ | 72/892 [07:31<1:30:37, 6.63s/it]g-point operations will not be computed-03 03:47:21,230 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▍ | 72/892 [07:31<1:30:37, 6.63s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:47:27,753 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:47:30,937 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:47:27,753 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:47:30,937 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:47:27,753 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▌ | 73/892 [07:37<1:29:47, 6.58s/it]g-point operations will not be computed-03 03:47:27,753 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▌ | 73/892 [07:37<1:29:47, 6.58s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:47:34,223 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:47:37,397 >> Could not estimate the number of tokens of 
the input, floating-point operations will not be computed-03 03:47:34,223 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:47:37,397 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:47:34,223 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▋ | 74/892 [07:43<1:29:13, 6.54s/it]g-point operations will not be computed-03 03:47:34,223 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▋ | 74/892 [07:43<1:29:13, 6.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:47:40,659 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▋ | 74/892 [07:43<1:29:13, 6.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:47:40,659 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:47:43,745 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:47:40,659 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:47:43,745 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:47:40,659 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▋ | 75/892 [07:50<1:30:15, 6.63s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:47:47,519 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▋ | 75/892 [07:50<1:30:15, 6.63s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:47:47,519 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:47:50,624 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:47:47,519 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:47:50,624 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:47:47,519 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▊ | 76/892 [07:57<1:28:55, 6.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:47:53,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▊ | 76/892 [07:57<1:28:55, 6.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:47:53,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:47:56,947 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:47:53,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:47:56,947 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:47:53,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▉ | 77/892 [08:03<1:28:03, 6.48s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:48:00,124 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▉ | 77/892 [08:03<1:28:03, 6.48s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:48:00,124 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-03 03:48:03,166 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
9%|██████▉ | 78/892 [08:09<1:26:44, 6.39s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:48:06,260 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:48:09,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
9%|███████ | 79/892 [08:15<1:25:41, 6.32s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:48:12,469 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:48:15,480 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
9%|███████▏ | 80/892 [08:21<1:24:49, 6.27s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:48:18,552 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:48:21,509 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
9%|███████▎ | 81/892 [08:27<1:23:41, 6.19s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:48:24,555 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:48:28,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2937, 'learning_rate': 0.00016, 'epoch': 0.09}
9%|███████▍ | 83/892 [08:39<1:22:00, 6.08s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:48:36,455 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:48:40,653 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.4215, 'learning_rate': 0.000164, 'epoch': 0.09}
[WARNING|modeling_utils.py:388] 2022-03-03 03:48:45,002 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
10%|███████▌ | 85/892 [08:51<1:19:33, 5.91s/it]
{'loss': 4.4092, 'learning_rate': 0.00016600000000000002, 'epoch': 0.1}
[WARNING|modeling_utils.py:388] 2022-03-03 03:48:50,654 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
10%|███████▋ | 86/892 [08:56<1:18:11, 5.82s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:48:54,849 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:48:57,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.3505, 'learning_rate': 0.00017, 'epoch': 0.1}
[WARNING|modeling_utils.py:388] 2022-03-03 03:49:01,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
10%|███████▉ | 88/892 [09:07<1:15:23, 5.63s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:49:05,655 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
10%|███████▉ | 89/892 [09:13<1:13:41, 5.51s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:49:09,557 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:49:13,308 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:49:15,781 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:49:18,082 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.7094, 'learning_rate': 0.000178, 'epoch': 0.1}
[WARNING|modeling_utils.py:388] 2022-03-03 03:49:21,546 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
10%|████████▎ | 92/892 [09:27<1:06:50, 5.01s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:49:23,831 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:49:25,890 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
10%|████████▎ | 93/892 [09:31<1:03:51, 4.80s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:49:28,036 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:49:29,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
11%|████████▍ | 94/892 [09:35<1:00:22, 4.54s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:49:31,882 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:49:33,690 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
11%|████████▋ | 95/892 [09:39<56:53, 4.28s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:49:35,512 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:49:37,164 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
11%|████████▊ | 96/892 [09:42<53:14, 4.01s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:49:38,785 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
11%|████████▉ | 97/892 [09:45<48:55, 3.69s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:49:41,661 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
11%|█████████ | 98/892 [09:48<44:53, 3.39s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:49:44,278 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
11%|█████████ | 99/892 [09:50<40:55, 3.10s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:49:46,626 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:49:47,602 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
11%|█████████ | 100/892 [09:53<39:00, 2.96s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:49:50,715 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:49:54,506 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
11%|█████████▏ | 101/892 [10:01<57:51, 4.39s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:49:58,263 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:50:01,927 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
11%|█████████ | 102/892 [10:08<1:09:45, 5.30s/it]
{'loss': 4.6228, 'learning_rate': 0.0002, 'epoch': 0.11}
12%|█████████ | 103/892 [10:15<1:17:00, 5.86s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:50:14,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
12%|█████████▏ | 104/892 [10:23<1:22:21, 6.27s/it]
{'loss': 4.4669, 'learning_rate': 0.000204, 'epoch': 0.12}
12%|█████████▎ | 105/892 [10:30<1:26:06, 6.56s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:50:28,984 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
12%|█████████▍ | 106/892 [10:37<1:27:53, 6.71s/it]
{'loss': 4.4893, 'learning_rate': 0.000208, 'epoch': 0.12}
12%|█████████▍ | 107/892 [10:44<1:28:42, 6.78s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:50:41,234 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
12%|█████████▌ | 108/892 [10:51<1:29:23, 6.84s/it]
{'loss': 4.6778, 'learning_rate': 0.000212, 'epoch': 0.12}
[WARNING|modeling_utils.py:388] 2022-03-03 03:50:53,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2605, 'learning_rate': 0.000214, 'epoch': 0.12}
12%|█████████▋ | 110/892 [11:05<1:30:26, 6.94s/it]
{'loss': 4.431, 'learning_rate': 0.000216, 'epoch': 0.12}
[WARNING|modeling_utils.py:388] 2022-03-03 03:51:05,648 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
12%|█████████▊ | 111/892 [11:12<1:29:57, 6.91s/it]
{'loss': 4.52, 'learning_rate': 0.000218, 'epoch': 0.12}
[WARNING|modeling_utils.py:388] 2022-03-03 03:51:14,097 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.5446, 'learning_rate': 0.00022, 'epoch': 0.13}
13%|██████████ | 113/892 [11:25<1:29:06, 6.86s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:51:24,389 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
13%|██████████ | 114/892 [11:32<1:28:41, 6.84s/it]
{'loss': 4.3684, 'learning_rate': 0.000224, 'epoch': 0.13}
[WARNING|modeling_utils.py:388] 2022-03-03 03:51:34,441 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.5161, 'learning_rate': 0.00022600000000000002, 'epoch': 0.13}
13%|██████████▎ | 116/892 [11:46<1:27:26, 6.76s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:51:44,488 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
13%|██████████▎ | 117/892 [11:52<1:27:00, 6.74s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:51:52,725 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
13%|██████████▍ | 118/892 [11:59<1:26:15, 6.69s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:52:00,969 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.5091, 'learning_rate': 0.00023400000000000002, 'epoch': 0.13}
[WARNING|modeling_utils.py:388] 2022-03-03 03:52:00,969 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
13%|██████████▋ | 120/892 [12:12<1:25:29, 6.64s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:52:09,310 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
14%|██████████▋ | 121/892 [12:19<1:25:13, 6.63s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:52:17,455 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
14%|██████████▊ | 122/892 [12:25<1:24:19, 6.57s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:52:23,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
14%|██████████▉ | 123/892 [12:31<1:23:39, 6.53s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:52:31,832 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
14%|██████████▉ | 124/892 [12:38<1:22:56, 6.48s/it]
{'loss': 4.4989, 'learning_rate': 0.000244, 'epoch': 0.14}
[WARNING|modeling_utils.py:388] 2022-03-03 03:52:38,152 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
14%|███████████ | 125/892 [12:45<1:24:15, 6.59s/it]
{'loss': 4.55, 'learning_rate': 0.000246, 'epoch': 0.14}
[WARNING|modeling_utils.py:388] 2022-03-03 03:52:46,603 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.5464, 'learning_rate': 0.000248, 'epoch': 0.14}
14%|███████████▏ | 127/892 [12:57<1:22:36, 6.48s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:52:54,626 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
14%|███████████▎ | 128/892 [13:04<1:21:19, 6.39s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:53:00,792 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
14%|███████████▍ | 129/892 [13:10<1:20:28, 6.33s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:53:06,906 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
15%|███████████▌ | 130/892 [13:16<1:19:12, 6.24s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:53:13,014 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:53:17,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.4956, 'learning_rate': 0.00025800000000000004, 'epoch': 0.15}
[WARNING|modeling_utils.py:388] 2022-03-03 03:53:23,499 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.6595, 'learning_rate': 0.00026000000000000003, 'epoch': 0.15}
[WARNING|modeling_utils.py:388] 2022-03-03 03:53:29,389 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.5596, 'learning_rate': 0.000262, 'epoch': 0.15}
[WARNING|modeling_utils.py:388] 2022-03-03 03:53:33,721 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
15%|███████████▊ | 134/892 [13:40<1:15:19, 5.96s/it]
{'loss': 4.6203, 'learning_rate': 0.000264, 'epoch': 0.15}
[WARNING|modeling_utils.py:388] 2022-03-03 03:53:39,399 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
15%|███████████▉ | 135/892 [13:45<1:14:12, 5.88s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:53:43,610 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
15%|████████████ | 136/892 [13:51<1:12:45, 5.78s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:53:47,741 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:53:50,405 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
15%|████████████▏ | 137/892 [13:56<1:11:14, 5.66s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:53:54,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
15%|████████████▏ | 138/892 [14:01<1:09:39, 5.54s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:53:58,383 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:54:02,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:54:04,562 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:54:06,889 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.4214, 'learning_rate': 0.00027600000000000004, 'epoch': 0.16}
[WARNING|modeling_utils.py:388] 2022-03-03 03:54:10,342 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|████████████▍ | 141/892 [14:16<1:02:46, 5.02s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:54:12,534 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:54:14,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
16%|████████████▉ | 142/892 [14:20<59:34, 4.77s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:54:16,719 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:54:19,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:54:21,518 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:54:23,303 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:54:26,697 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:54:28,356 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:54:31,301 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:54:32,639 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:54:35,218 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:54:36,449 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:54:38,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:54:40,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:54:44,165 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:54:47,880 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:54:51,622 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:54:55,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.9597, 'learning_rate': 0.0003, 'epoch': 0.17}
[WARNING|modeling_utils.py:388] 2022-03-03 03:54:58,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
17%|█████████████▌ | 153/892 [15:07<1:11:04, 5.77s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:55:07,675 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
17%|█████████████▋ | 154/892 [15:14<1:16:03, 6.18s/it]
{'loss': 4.5698, 'learning_rate': 0.000304, 'epoch': 0.17}
17%|█████████████▋ | 155/892 [15:21<1:19:17, 6.46s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:55:20,074 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
17%|█████████████▊ | 156/892 [15:28<1:21:17, 6.63s/it]
{'loss': 4.7222, 'learning_rate': 0.000308, 'epoch': 0.17}
[WARNING|modeling_utils.py:388] 2022-03-03 03:55:30,541 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.0516, 'learning_rate': 0.00031, 'epoch': 0.18}
18%|█████████████▉ | 158/892 [15:42<1:23:25, 6.82s/it]
{'loss': 4.8873, 'learning_rate': 0.000312, 'epoch': 0.18}
[WARNING|modeling_utils.py:388] 2022-03-03 03:55:44,566 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.9982, 'learning_rate': 0.000314, 'epoch': 0.18}
18%|██████████████▏ | 160/892 [15:56<1:24:11, 6.90s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 03:55:54,982 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
18%|██████████████▎ | 161/892 [16:03<1:23:42, 6.87s/it]
{'loss': 4.6326, 'learning_rate': 0.00031800000000000003, 'epoch': 0.18}
18%|██████████████▎ | 
162/892 [16:10<1:23:22, 6.85s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:56:06,920 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▎ | 162/892 [16:10<1:23:22, 6.85s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:56:06,920 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▎ | 162/892 [16:10<1:23:22, 6.85s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:56:06,920 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▎ | 162/892 [16:10<1:23:22, 6.85s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:56:06,920 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▍ | 163/892 [16:16<1:22:55, 6.83s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:56:06,920 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:56:15,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:06,920 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:56:15,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:06,920 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▌ | 164/892 [16:23<1:22:31, 6.80s/it]g-point operations will not be computed-03 03:56:06,920 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▌ | 164/892 [16:23<1:22:31, 6.80s/it]g-point operations will not be computed-03 03:56:06,920 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed {'loss': 4.8179, 'learning_rate': 0.000324, 'epoch': 0.18} 18%|██████████████▌ | 164/892 [16:23<1:22:31, 6.80s/it]g-point operations will not be computed-03 03:56:06,920 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:56:25,404 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:06,920 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:56:25,404 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:06,920 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6294, 'learning_rate': 0.000326, 'epoch': 0.18} [WARNING|modeling_utils.py:388] 2022-03-03 03:56:25,404 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:06,920 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:56:25,404 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:06,920 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:56:25,404 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:06,920 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▋ | 166/892 [16:37<1:21:39, 6.75s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
19%|██████████████▋ | 166/892 [16:37<1:21:39, 6.75s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▋ | 166/892 [16:37<1:21:39, 6.75s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▊ | 167/892 [16:43<1:21:01, 6.71s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▊ | 167/892 [16:43<1:21:01, 6.71s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:56:41,994 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:56:41,994 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▉ | 168/892 [16:50<1:20:26, 6.67s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▉ | 168/892 [16:50<1:20:26, 6.67s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5294, 'learning_rate': 0.00033200000000000005, 'epoch': 0.19} [WARNING|modeling_utils.py:388] 2022-03-03 03:56:50,151 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▉ | 169/892 [16:56<1:19:40, 6.61s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▉ | 169/892 [16:56<1:19:40, 6.61s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.4458, 'learning_rate': 0.00033400000000000004, 'epoch': 0.19} [WARNING|modeling_utils.py:388] 2022-03-03 03:56:56,661 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████ | 170/892 [17:03<1:19:26, 6.60s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████ | 170/892 [17:03<1:19:26, 6.60s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7599, 'learning_rate': 0.00033600000000000004, 'epoch': 0.19} 19%|███████████████ | 170/892 [17:03<1:19:26, 6.60s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:57:04,814 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-03 03:57:04,814 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8152, 'learning_rate': 0.00033800000000000003, 'epoch': 0.19} [WARNING|modeling_utils.py:388] 2022-03-03 03:57:04,814 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:57:04,814 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▏ | 172/892 [17:16<1:18:35, 6.55s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▏ | 172/892 [17:16<1:18:35, 6.55s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:57:14,595 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:57:14,595 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▎ | 173/892 [17:22<1:17:54, 6.50s/it]g-point operations will not be 
computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▎ | 173/892 [17:22<1:17:54, 6.50s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:57:20,905 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:57:20,905 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 174/892 [17:28<1:17:05, 6.44s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 174/892 [17:28<1:17:05, 6.44s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:57:27,194 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:57:27,194 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 175/892 [17:35<1:18:15, 6.55s/it]g-point operations will not be computed-03 
03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 175/892 [17:35<1:18:15, 6.55s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5978, 'learning_rate': 0.000346, 'epoch': 0.2} [WARNING|modeling_utils.py:388] 2022-03-03 03:57:35,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▌ | 176/892 [17:42<1:17:18, 6.48s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▌ | 176/892 [17:42<1:17:18, 6.48s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6663, 'learning_rate': 0.000348, 'epoch': 0.2} 20%|███████████████▌ | 176/892 [17:42<1:17:18, 6.48s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▌ | 176/892 [17:42<1:17:18, 6.48s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:57:43,259 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:57:43,259 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:57:43,259 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:57:43,259 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:57:49,403 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:57:49,403 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:57:49,403 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:57:49,403 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:57:55,433 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:57:55,433 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:00,061 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:00,061 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▉ | 180/892 [18:06<1:13:24, 6.19s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▉ | 180/892 [18:06<1:13:24, 6.19s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:06,039 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:06,039 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|████████████████ | 181/892 
[18:12<1:12:30, 6.12s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|████████████████ | 181/892 [18:12<1:12:30, 6.12s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:11,956 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:11,956 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|████████████████ | 182/892 [18:18<1:11:50, 6.07s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|████████████████ | 182/892 [18:18<1:11:50, 6.07s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:17,818 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:17,818 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 183/892 
[18:24<1:10:45, 5.99s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:22,129 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:22,129 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:22,129 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▎ | 184/892 [18:29<1:09:32, 5.89s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:27,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:30,476 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:30,476 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate 
the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8057, 'learning_rate': 0.000366, 'epoch': 0.21} [WARNING|modeling_utils.py:388] 2022-03-03 03:58:34,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:34,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▍ | 186/892 [18:40<1:07:02, 5.70s/it]g-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:38,686 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:38,686 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:38,686 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:56:33,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▌ | 187/892 [18:46<1:05:53, 5.61s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:58:42,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▌ | 
187/892 [18:46<1:05:53, 5.61s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:58:42,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:46,678 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:58:42,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:46,678 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:58:42,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5574, 'learning_rate': 0.000372, 'epoch': 0.21} [WARNING|modeling_utils.py:388] 2022-03-03 03:58:50,597 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:58:42,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:50,597 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:58:42,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▋ | 189/892 [18:56<1:03:24, 5.41s/it]g-point operations will not be computed-03 03:58:42,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:54,391 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:58:42,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:56,826 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed-03 03:58:42,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:58:56,826 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:58:42,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6756, 'learning_rate': 0.00037600000000000003, 'epoch': 0.21} [WARNING|modeling_utils.py:388] 2022-03-03 03:59:00,510 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:58:42,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:59:00,510 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:58:42,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▉ | 191/892 [19:06<1:00:11, 5.15s/it][WARNING|modeling_utils.py:388] 2022-03-03 03:59:02,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:59:05,088 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:59:02,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:59:05,088 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:59:02,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▍ | 192/892 [19:11<57:53, 4.96s/it]g-point operations will not be computed-03 03:59:02,887 >> Could not estimate the number of tokens of 
the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:59:08,422 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:59:02,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:59:08,422 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:59:02,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:59:10,492 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:59:02,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:59:12,564 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:59:02,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:59:12,564 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:59:02,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:59:14,413 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:59:02,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:59:16,319 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:59:02,887 >> Could not estimate the number of tokens of the input, floating-point operations 
will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:59:16,319 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:59:02,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:59:18,031 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:59:02,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:59:21,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:59:02,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:59:21,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:59:02,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:59:22,935 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:59:02,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:59:22,935 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:59:02,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 03:59:24,248 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 03:59:02,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-03 03:59:26,775 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:59:29,047 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:59:30,152 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:59:31,658 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:59:35,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:59:39,221 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.9359, 'learning_rate': 0.000398, 'epoch': 0.23}
[WARNING|modeling_utils.py:388] 2022-03-03 03:59:42,940 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:59:46,562 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 03:59:52,032 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
23%|█████████████████▉ | 203/892 [19:58<1:06:47, 5.82s/it]
{'loss': 5.4457, 'learning_rate': 0.000402, 'epoch': 0.23}
23%|██████████████████ | 204/892 [20:05<1:11:02, 6.20s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:00:04,451 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
23%|██████████████████▏ | 205/892 [20:12<1:13:49, 6.45s/it]
{'loss': 4.7054, 'learning_rate': 0.00040600000000000006, 'epoch': 0.23}
23%|██████████████████▏ | 206/892 [20:20<1:16:04, 6.65s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:00:16,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
23%|██████████████████▎ | 207/892 [20:27<1:17:16, 6.77s/it]
{'loss': 4.7219, 'learning_rate': 0.00041, 'epoch': 0.23}
[WARNING|modeling_utils.py:388] 2022-03-03 04:00:27,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
23%|██████████████████▍ | 208/892 [20:34<1:17:57, 6.84s/it]
{'loss': 4.5005, 'learning_rate': 0.000412, 'epoch': 0.23}
23%|██████████████████▌ | 209/892 [20:40<1:18:05, 6.86s/it]
{'loss': 4.7144, 'learning_rate': 0.000414, 'epoch': 0.23}
[WARNING|modeling_utils.py:388] 2022-03-03 04:00:41,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
24%|██████████████████▌ | 210/892 [20:47<1:18:14, 6.88s/it]
{'loss': 4.8477, 'learning_rate': 0.000416, 'epoch': 0.24}
[WARNING|modeling_utils.py:388] 2022-03-03 04:00:49,863 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:00:53,301 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
24%|██████████████████▊ | 212/892 [21:01<1:17:42, 6.86s/it]
{'loss': 4.8034, 'learning_rate': 0.00042, 'epoch': 0.24}
[WARNING|modeling_utils.py:388] 2022-03-03 04:01:03,470 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.9817, 'learning_rate': 0.000422, 'epoch': 0.24}
24%|██████████████████▉ | 214/892 [21:15<1:17:02, 6.82s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:01:12,047 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.8543, 'learning_rate': 0.000424, 'epoch': 0.24}
24%|███████████████████ | 215/892 [21:21<1:16:57, 6.82s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:01:20,405 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
24%|███████████████████▏ | 216/892 [21:28<1:16:21, 6.78s/it]
{'loss': 4.8308, 'learning_rate': 0.000428, 'epoch': 0.24}
[WARNING|modeling_utils.py:388] 2022-03-03 04:01:30,301 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
24%|███████████████████▎ | 218/892 [21:41<1:15:08, 6.69s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:01:38,639 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
25%|███████████████████▍ | 219/892 [21:48<1:14:30, 6.64s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:01:46,778 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
25%|███████████████████▍ | 220/892 [21:55<1:14:17, 6.63s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:01:53,394 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
25%|███████████████████▌ | 221/892 [22:01<1:13:48, 6.60s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:02:03,050 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.7249, 'learning_rate': 0.00044, 'epoch': 0.25}
[WARNING|modeling_utils.py:388] 2022-03-03 04:02:09,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.9072, 'learning_rate': 0.000442, 'epoch': 0.25}
25%|███████████████████▊ | 224/892 [22:20<1:12:01, 6.47s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:02:17,436 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
25%|███████████████████▉ | 225/892 [22:27<1:12:46, 6.55s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:02:25,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
25%|████████████████████ | 226/892 [22:33<1:11:58, 6.48s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:02:32,166 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
25%|████████████████████ | 227/892 [22:40<1:11:01, 6.41s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:02:38,240 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|████████████████████▏ | 228/892 [22:46<1:10:03, 6.33s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:02:44,385 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|████████████████████▎ | 229/892 [22:52<1:09:08, 6.26s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:02:50,391 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|████████████████████▎ | 230/892 [22:58<1:08:19, 6.19s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:02:54,977 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|████████████████████▍ | 231/892 [23:04<1:07:40, 6.14s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:03:01,008 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|████████████████████▌ | 232/892 [23:10<1:07:02, 6.10s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:03:08,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|████████████████████▋ | 233/892 [23:16<1:05:58, 6.01s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:03:12,714 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:03:16,868 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.9863, 'learning_rate': 0.00046400000000000006, 'epoch': 0.26}
[WARNING|modeling_utils.py:388] 2022-03-03 04:03:21,079 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
26%|████████████████████▊ | 235/892 [23:27<1:03:37, 5.81s/it]g-point operations will not be computed-03 04:03:12,714 >> Could not estimate the number of tokens of the input, floating-point operations will not
be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:03:25,300 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:03:12,714 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:03:25,300 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:03:12,714 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:03:25,300 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:03:12,714 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▉ | 236/892 [23:32<1:02:34, 5.72s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:03:29,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▉ | 236/892 [23:32<1:02:34, 5.72s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:03:29,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:03:33,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:03:29,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:03:33,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:03:29,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.703, 'learning_rate': 0.00047, 'epoch': 0.27} [WARNING|modeling_utils.py:388] 2022-03-03 04:03:37,362 >> Could not estimate the number 
of tokens of the input, floating-point operations will not be computed-03 04:03:29,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████ | 238/892 [23:43<1:00:11, 5.52s/it]g-point operations will not be computed-03 04:03:29,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████ | 238/892 [23:43<1:00:11, 5.52s/it]g-point operations will not be computed-03 04:03:29,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:03:41,314 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:03:29,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:03:41,314 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:03:29,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:03:41,314 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:03:29,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▋ | 239/892 [23:48<58:53, 5.41s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:03:45,156 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:03:47,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:03:45,156 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▊ 
| 240/892 [23:53<57:24, 5.28s/it]g-point operations will not be computed-03 04:03:45,156 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▊ | 240/892 [23:53<57:24, 5.28s/it]g-point operations will not be computed-03 04:03:45,156 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:03:51,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:03:45,156 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:03:53,569 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:03:45,156 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:03:53,569 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:03:45,156 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:03:55,844 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:03:45,156 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:03:57,927 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:03:45,156 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:03:57,927 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:03:45,156 >> Could 
not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.3981, 'learning_rate': 0.00048, 'epoch': 0.27} [WARNING|modeling_utils.py:388] 2022-03-03 04:04:01,120 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:03:45,156 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:04:01,120 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:03:45,156 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|██████████████████████ | 243/892 [24:07<50:35, 4.68s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:04:03,151 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|██████████████████████▏ | 244/892 [24:10<47:43, 4.42s/it]g-point operations will not be computed-03 04:04:03,151 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|██████████████████████▏ | 244/892 [24:10<47:43, 4.42s/it]g-point operations will not be computed-03 04:04:03,151 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|██████████████████████▏ | 244/892 [24:10<47:43, 4.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:04:06,894 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|██████████████████████▏ | 245/892 [24:14<44:50, 4.16s/it]g-point operations will not be computed-03 04:04:06,894 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|██████████████████████▏ | 245/892 [24:14<44:50, 4.16s/it]g-point operations will not be computed-03 04:04:06,894 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:04:11,931 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:04:10,371 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▎ | 246/892 [24:17<41:45, 3.88s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:04:13,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▎ | 246/892 [24:17<41:45, 3.88s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:04:13,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▍ | 247/892 [24:20<38:34, 3.59s/it]g-point operations will not be computed-03 04:04:13,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▍ | 247/892 [24:20<38:34, 3.59s/it]g-point operations will not be computed-03 04:04:13,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▌ | 248/892 [24:23<35:20, 3.29s/it]g-point operations will not be computed-03 04:04:16,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▌ | 248/892 [24:23<35:20, 3.29s/it]g-point operations will not be computed-03 04:04:16,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:04:19,947 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:04:18,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:04:22,106 >> Could 
not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:04:21,102 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:04:22,106 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:04:21,102 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▋ | 250/892 [24:28<30:50, 2.88s/it]g-point operations will not be computed-03 04:04:21,102 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▋ | 250/892 [24:28<30:50, 2.88s/it]g-point operations will not be computed-03 04:04:21,102 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▋ | 250/892 [24:28<30:50, 2.88s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:04:25,135 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▋ | 250/892 [24:28<30:50, 2.88s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:04:25,135 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:04:28,833 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:04:25,135 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▊ | 251/892 [24:35<45:52, 4.29s/it]g-point operations will not be computed-03 04:04:25,135 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▊ | 251/892 [24:35<45:52, 4.29s/it]g-point operations will not be computed-03 04:04:25,135 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▊ | 251/892 [24:35<45:52, 4.29s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:04:32,655 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:04:36,225 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:04:32,655 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▉ | 252/892 [24:42<55:30, 5.20s/it]g-point operations will not be computed-03 04:04:32,655 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▉ | 252/892 [24:42<55:30, 5.20s/it]g-point operations will not be computed-03 04:04:32,655 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▉ | 252/892 [24:42<55:30, 5.20s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▉ | 252/892 [24:42<55:30, 5.20s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▉ | 252/892 [24:42<55:30, 5.20s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▍ | 253/892 [24:50<1:01:55, 5.81s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▍ | 253/892 [24:50<1:01:55, 
5.81s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.0797, 'learning_rate': 0.0005020000000000001, 'epoch': 0.28} [WARNING|modeling_utils.py:388] 2022-03-03 04:04:50,594 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▍ | 254/892 [24:57<1:05:56, 6.20s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▍ | 254/892 [24:57<1:05:56, 6.20s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8901, 'learning_rate': 0.000504, 'epoch': 0.28} 28%|██████████████████████▍ | 254/892 [24:57<1:05:56, 6.20s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▍ | 254/892 [24:57<1:05:56, 6.20s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▍ | 254/892 [24:57<1:05:56, 6.20s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▌ | 255/892 [25:04<1:08:40, 6.47s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:05:03,048 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:05:03,048 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:05:03,048 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▋ | 256/892 [25:11<1:10:31, 6.65s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▋ | 256/892 [25:11<1:10:31, 6.65s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▋ | 256/892 [25:11<1:10:31, 6.65s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:05:13,557 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:05:13,557 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 
5.1246, 'learning_rate': 0.00051, 'epoch': 0.29} [WARNING|modeling_utils.py:388] 2022-03-03 04:05:13,557 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:05:13,557 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:05:13,557 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▊ | 258/892 [25:25<1:12:17, 6.84s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▊ | 258/892 [25:25<1:12:17, 6.84s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:05:25,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▉ | 259/892 [25:32<1:12:33, 6.88s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▉ | 259/892 [25:32<1:12:33, 6.88s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens 
of the input, floating-point operations will not be computed {'loss': 4.8121, 'learning_rate': 0.000514, 'epoch': 0.29} 29%|██████████████████████▉ | 259/892 [25:32<1:12:33, 6.88s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▉ | 259/892 [25:32<1:12:33, 6.88s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▉ | 259/892 [25:32<1:12:33, 6.88s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████ | 260/892 [25:39<1:12:47, 6.91s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:05:38,111 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:05:38,111 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████ | 261/892 [25:46<1:12:52, 6.93s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████ | 261/892 [25:46<1:12:52, 6.93s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations 
will not be computed {'loss': 4.8734, 'learning_rate': 0.000518, 'epoch': 0.29} [WARNING|modeling_utils.py:388] 2022-03-03 04:05:46,641 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▏ | 262/892 [25:53<1:12:21, 6.89s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▏ | 262/892 [25:53<1:12:21, 6.89s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9371, 'learning_rate': 0.0005200000000000001, 'epoch': 0.29} 29%|███████████████████████▏ | 262/892 [25:53<1:12:21, 6.89s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▏ | 262/892 [25:53<1:12:21, 6.89s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▏ | 262/892 [25:53<1:12:21, 6.89s/it]g-point operations will not be computed-03 04:04:39,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▎ | 263/892 [26:00<1:11:58, 6.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:05:56,898 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▎ | 263/892 [26:00<1:11:58, 6.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:05:56,898 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
29%|███████████████████████▎ | 263/892 [26:00<1:11:58, 6.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:05:56,898 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▍ | 264/892 [26:06<1:11:29, 6.83s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:05:56,898 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▍ | 264/892 [26:06<1:11:29, 6.83s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:05:56,898 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8513, 'learning_rate': 0.000524, 'epoch': 0.3} 30%|███████████████████████▍ | 264/892 [26:06<1:11:29, 6.83s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:05:56,898 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:06:08,664 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:05:56,898 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:06:08,664 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:05:56,898 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.242, 'learning_rate': 0.000526, 'epoch': 0.3} [WARNING|modeling_utils.py:388] 2022-03-03 04:06:08,664 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:05:56,898 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:06:15,424 >> Could not estimate the number of tokens of the input, floating-point operations will not 
be computed-03 04:05:56,898 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:06:15,424 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:05:56,898 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.0909, 'learning_rate': 0.000528, 'epoch': 0.3} [WARNING|modeling_utils.py:388] 2022-03-03 04:06:15,424 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:05:56,898 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:06:15,424 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:05:56,898 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▋ | 267/892 [26:27<1:10:29, 6.77s/it]g-point operations will not be computed-03 04:05:56,898 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▋ | 267/892 [26:27<1:10:29, 6.77s/it]g-point operations will not be computed-03 04:05:56,898 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:06:25,495 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:05:56,898 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:06:25,495 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:05:56,898 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed
30%|███████████████████████▋ | 268/892 [26:33<1:10:02, 6.74s/it]
{'loss': 5.2137, 'learning_rate': 0.000532, 'epoch': 0.3}
[WARNING|modeling_utils.py:388] 2022-03-03 04:06:35,387 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.7544, 'learning_rate': 0.0005340000000000001, 'epoch': 0.3}
[WARNING|modeling_utils.py:388] 2022-03-03 04:06:42,020 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.0531, 'learning_rate': 0.000536, 'epoch': 0.3}
30%|████████████████████████ | 271/892 [26:53<1:08:54, 6.66s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:06:50,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
30%|████████████████████████ | 272/892 [27:00<1:08:23, 6.62s/it]
{'loss': 4.9672, 'learning_rate': 0.00054, 'epoch': 0.3}
[WARNING|modeling_utils.py:388] 2022-03-03 04:07:00,018 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
31%|████████████████████████▏ | 273/892 [27:06<1:07:41, 6.56s/it]
{'loss': 5.1118, 'learning_rate': 0.0005420000000000001, 'epoch': 0.31}
[WARNING|modeling_utils.py:388] 2022-03-03 04:07:08,046 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.766, 'learning_rate': 0.0005440000000000001, 'epoch': 0.31}
[WARNING|modeling_utils.py:388] 2022-03-03 04:07:14,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.2889, 'learning_rate': 0.000546, 'epoch': 0.31}
31%|████████████████████████▍ | 276/892 [27:26<1:07:15, 6.55s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:07:22,958 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
31%|████████████████████████▌ | 277/892 [27:32<1:06:20, 6.47s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:07:29,208 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
31%|████████████████████████▌ | 278/892 [27:38<1:05:25, 6.39s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:07:35,376 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
31%|████████████████████████▋ | 279/892 [27:44<1:04:37, 6.32s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:07:41,497 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
31%|████████████████████████▊ | 280/892 [27:50<1:03:46, 6.25s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:07:47,600 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:07:52,059 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.7573, 'learning_rate': 0.000558, 'epoch': 0.32}
[WARNING|modeling_utils.py:388] 2022-03-03 04:07:58,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.7975, 'learning_rate': 0.0005600000000000001, 'epoch': 0.32}
[WARNING|modeling_utils.py:388] 2022-03-03 04:08:03,916 >> Could not estimate the number of tokens of the
input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:08:09,572 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:08:13,815 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
32%|█████████████████████████▉ | 285/892 [28:20<59:03, 5.84s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:08:18,011 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
32%|█████████████████████████▉ | 286/892 [28:25<57:58, 5.74s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:08:22,135 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:08:26,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:08:28,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
32%|██████████████████████████▏ | 288/892 [28:36<55:26, 5.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:08:32,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:08:36,319 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:08:38,857 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:08:41,189 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.1764, 'learning_rate': 0.000576, 'epoch': 0.33}
[WARNING|modeling_utils.py:388] 2022-03-03 04:08:44,711 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|██████████████████████████▍ | 291/892 [28:50<50:31, 5.04s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:08:48,062 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:08:50,144 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:08:52,282 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:08:54,227 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:08:56,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:08:57,988 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:08:59,835 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:09:03,144 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:09:04,649 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:09:06,145 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:09:08,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:09:11,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:09:13,450 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:09:14,939 >> Could not
estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.4224, 'learning_rate': 0.000596, 'epoch': 0.34}
[WARNING|modeling_utils.py:388] 2022-03-03 04:09:18,905 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:09:22,669 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:09:26,419 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:09:29,990 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:09:33,734 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
34%|███████████████████████████▌ | 303/892 [29:42<57:41, 5.88s/it]
{'loss': 4.911, 'learning_rate': 0.000602, 'epoch': 0.34}
34%|██████████████████████████▉ | 304/892 [29:49<1:01:40, 6.29s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:09:50,050 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
34%|███████████████████████████ | 305/892 [29:56<1:03:53, 6.53s/it]
{'loss': 5.0028, 'learning_rate': 0.000606, 'epoch': 0.34}
34%|███████████████████████████ | 306/892 [30:03<1:05:17, 6.69s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:10:04,144 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
34%|███████████████████████████▏ | 307/892 [30:10<1:06:21, 6.81s/it]
{'loss': 4.8418, 'learning_rate': 0.00061, 'epoch': 0.34}
[WARNING|modeling_utils.py:388] 2022-03-03 04:10:12,944 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.2496, 'learning_rate': 0.000612, 'epoch': 0.35}
35%|███████████████████████████▎ | 309/892 [30:24<1:07:08, 6.91s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:10:25,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
35%|███████████████████████████▍ | 310/892 [30:31<1:07:14, 6.93s/it]
{'loss': 5.1208, 'learning_rate': 0.000616, 'epoch': 0.35}
35%|███████████████████████████▌ | 311/892 [30:38<1:07:11, 6.94s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
35%|███████████████████████████▋ | 312/892 [30:45<1:06:40, 6.90s/it]
{'loss': 4.9914, 'learning_rate': 0.00062, 'epoch': 0.35}
[WARNING|modeling_utils.py:388] 2022-03-03 04:10:45,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.042, 'learning_rate': 0.000622, 'epoch': 0.35}
[WARNING|modeling_utils.py:388] 2022-03-03 04:10:45,862 >> Could not
estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▊ | 314/892 [30:59<1:05:57, 6.85s/it]g-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:10:57,702 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:10:57,702 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▉ | 315/892 [31:06<1:05:42, 6.83s/it]g-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▉ | 315/892 [31:06<1:05:42, 6.83s/it]g-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8236, 'learning_rate': 0.000626, 'epoch': 0.35} 35%|███████████████████████████▉ | 315/892 [31:06<1:05:42, 6.83s/it]g-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:11:07,826 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:11:07,826 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.1062, 'learning_rate': 0.000628, 'epoch': 0.35} [WARNING|modeling_utils.py:388] 2022-03-03 04:11:07,826 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:11:14,500 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:11:14,500 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.823, 'learning_rate': 0.00063, 'epoch': 0.36} [WARNING|modeling_utils.py:388] 2022-03-03 04:11:14,500 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:11:14,500 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:11:14,500 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 
04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▏ | 318/892 [31:26<1:04:20, 6.72s/it]g-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▏ | 318/892 [31:26<1:04:20, 6.72s/it]g-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:11:26,146 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▎ | 319/892 [31:32<1:04:00, 6.70s/it]g-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▎ | 319/892 [31:32<1:04:00, 6.70s/it]g-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8754, 'learning_rate': 0.000634, 'epoch': 0.36} 36%|████████████████████████████▎ | 319/892 [31:32<1:04:00, 6.70s/it]g-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:11:34,420 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:11:34,420 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8727, 'learning_rate': 0.0006360000000000001, 'epoch': 0.36} [WARNING|modeling_utils.py:388] 2022-03-03 04:11:34,420 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:11:40,961 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:11:40,961 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.2652, 'learning_rate': 0.000638, 'epoch': 0.36} [WARNING|modeling_utils.py:388] 2022-03-03 04:11:40,961 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:11:40,961 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▌ | 322/892 [31:52<1:02:47, 6.61s/it]g-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▌ | 322/892 [31:52<1:02:47, 
6.61s/it]g-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:11:50,790 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:11:50,790 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▌ | 323/892 [31:58<1:02:16, 6.57s/it]g-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▌ | 323/892 [31:58<1:02:16, 6.57s/it]g-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:11:57,247 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:11:57,247 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▋ | 324/892 [32:05<1:01:48, 6.53s/it]g-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
36%|████████████████████████████▋ | 324/892 [32:05<1:01:48, 6.53s/it]g-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:12:03,620 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:12:03,620 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▊ | 325/892 [32:12<1:02:28, 6.61s/it]g-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▊ | 325/892 [32:12<1:02:28, 6.61s/it]g-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.1065, 'learning_rate': 0.000646, 'epoch': 0.36} 36%|████████████████████████████▊ | 325/892 [32:12<1:02:28, 6.61s/it]g-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:12:13,524 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:12:13,524 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could 
not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.0236, 'learning_rate': 0.000648, 'epoch': 0.37} [WARNING|modeling_utils.py:388] 2022-03-03 04:12:13,524 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:12:19,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:12:19,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7007, 'learning_rate': 0.0006500000000000001, 'epoch': 0.37} [WARNING|modeling_utils.py:388] 2022-03-03 04:12:19,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:12:25,993 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:12:25,993 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9706, 'learning_rate': 0.000652, 'epoch': 0.37} [WARNING|modeling_utils.py:388] 
2022-03-03 04:12:25,993 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:12:25,993 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:12:25,993 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:10:35,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▉ | 329/892 [32:37<59:17, 6.32s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:12:33,814 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▉ | 329/892 [32:37<59:17, 6.32s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:12:33,814 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▉ | 329/892 [32:37<59:17, 6.32s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:12:33,814 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.3813, 'learning_rate': 0.000656, 'epoch': 0.37} 37%|█████████████████████████████▉ | 330/892 [32:43<58:35, 6.26s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:12:39,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▉ | 330/892 [32:43<58:35, 6.26s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:12:39,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed 37%|█████████████████████████████▉ | 330/892 [32:43<58:35, 6.26s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:12:39,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:12:44,275 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:12:39,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:12:44,275 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:12:39,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:12:44,275 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:12:39,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:12:50,226 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:12:39,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:12:50,226 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:12:39,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8357, 'learning_rate': 0.00066, 'epoch': 0.37} [WARNING|modeling_utils.py:388] 2022-03-03 04:12:50,226 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:12:39,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-03 04:12:50,226 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:12:39,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:12:56,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:12:39,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:12:56,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:12:39,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:13:00,534 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:12:39,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:13:00,534 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:12:39,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████▎ | 334/892 [33:06<55:29, 5.97s/it]g-point operations will not be computed-03 04:12:39,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:13:04,795 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:12:39,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:13:04,795 >> Could not estimate the number 
of tokens of the input, floating-point operations will not be computed-03 04:12:39,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:13:04,795 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:12:39,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▍ | 335/892 [33:12<54:35, 5.88s/it]g-point operations will not be computed-03 04:12:39,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:13:10,469 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:12:39,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:13:10,469 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:12:39,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:13:10,469 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:12:39,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▌ | 336/892 [33:18<53:45, 5.80s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:13:14,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▌ | 336/892 [33:18<53:45, 5.80s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:13:14,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
38%|██████████████████████████████▌ | 336/892 [33:18<53:45, 5.80s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:13:14,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:13:18,656 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:13:14,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:13:21,387 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:13:14,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:13:21,387 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:13:14,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:13:21,387 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:13:14,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▋ | 338/892 [33:28<51:27, 5.57s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:13:25,342 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▋ | 338/892 [33:28<51:27, 5.57s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:13:25,342 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:13:29,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:13:25,342 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:13:29,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:13:25,342 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:13:31,649 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:13:25,342 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:13:31,649 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:13:25,342 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:13:31,649 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:13:25,342 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▊ | 340/892 [33:39<48:43, 5.30s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:13:35,401 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:13:37,697 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:13:35,401 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▉ | 341/892 [33:43<47:10, 5.14s/it]g-point operations will not be computed-03 04:13:35,401 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▉ | 341/892 
[33:43<47:10, 5.14s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:13:41,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:13:43,250 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:13:45,377 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:13:47,330 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:13:49,344 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:13:51,165 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:13:53,005 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:13:54,673 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:13:57,897 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:13:59,460 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:14:02,203 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:14:03,410 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:14:05,733 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:14:08,343 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.1623, 'learning_rate': 0.000696, 'epoch': 0.39}
[WARNING|modeling_utils.py:388] 2022-03-03 04:14:12,316 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:14:15,987 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.7402, 'learning_rate': 0.0006979999999999999, 'epoch': 0.39}
[WARNING|modeling_utils.py:388] 2022-03-03 04:14:19,694 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:14:23,255 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:14:28,687 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
40%|████████████████████████████████ | 353/892 [34:35<52:06, 5.80s/it]
{'loss': 5.1392, 'learning_rate': 0.0007019999999999999, 'epoch': 0.4}
40%|████████████████████████████████▏ | 354/892 [34:42<55:39, 6.21s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:14:42,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
40%|████████████████████████████████▏ | 355/892 [34:49<57:49, 6.46s/it]
{'loss': 6.0127, 'learning_rate': 0.0007059999999999999, 'epoch': 0.4}
40%|████████████████████████████████▎ | 356/892 [34:56<59:16, 6.64s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:14:56,996 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
40%|███████████████████████████████▌ | 357/892 [35:03<1:00:02, 6.73s/it]
{'loss': 5.3195, 'learning_rate': 0.00071, 'epoch': 0.4}
[WARNING|modeling_utils.py:388] 2022-03-03 04:15:05,708 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.2799, 'learning_rate': 0.000712, 'epoch': 0.4}
[WARNING|modeling_utils.py:388] 2022-03-03 04:15:09,274 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
40%|███████████████████████████████▊ | 359/892 [35:17<1:01:03, 6.87s/it]
{'loss': 5.2069, 'learning_rate': 0.000714, 'epoch': 0.4}
[WARNING|modeling_utils.py:388] 2022-03-03 04:15:19,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.1036, 'learning_rate': 0.000716, 'epoch': 0.4}
40%|███████████████████████████████▉ | 361/892 [35:31<1:00:36, 6.85s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:15:28,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
41%|████████████████████████████████ | 362/892 [35:38<1:00:17, 6.83s/it]
{'loss': 5.0837, 'learning_rate': 0.0007199999999999999, 'epoch': 0.41}
[WARNING|modeling_utils.py:388] 2022-03-03 04:15:39,930 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.8481, 'learning_rate': 0.000722, 'epoch': 0.41}
41%|█████████████████████████████████ | 364/892 [35:51<59:54, 6.81s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:15:50,129 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
41%|█████████████████████████████████▏ | 365/892 [35:58<59:30, 6.78s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:15:56,817 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
41%|█████████████████████████████████▏ | 366/892 [36:05<59:06, 6.74s/it]
{'loss': 5.2159, 'learning_rate': 0.000728, 'epoch': 0.41}
[WARNING|modeling_utils.py:388] 2022-03-03 04:16:06,633 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.0045, 'learning_rate': 0.00073, 'epoch': 0.41}
[WARNING|modeling_utils.py:388] 2022-03-03 04:16:13,237 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.1269, 'learning_rate': 0.000732, 'epoch': 0.41}
41%|█████████████████████████████████▌ | 369/892 [36:24<57:53, 6.64s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:16:23,166 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
41%|█████████████████████████████████▌ | 370/892 [36:31<57:34, 6.62s/it]
{'loss': 5.2436, 'learning_rate': 0.000736, 'epoch': 0.41}
[WARNING|modeling_utils.py:388] 2022-03-03 04:16:31,301 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
42%|█████████████████████████████████▋ | 371/892 [36:37<57:10, 6.58s/it]
{'loss': 4.9266, 'learning_rate': 0.000738, 'epoch': 0.42}
[WARNING|modeling_utils.py:388] 2022-03-03 04:16:37,762 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
42%|█████████████████████████████████▊ | 372/892 [36:44<56:58, 6.57s/it]
{'loss': 5.1939, 'learning_rate': 0.00074, 'epoch': 0.42}
[WARNING|modeling_utils.py:388] 2022-03-03 04:16:45,914 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.1638, 'learning_rate': 0.000742, 'epoch': 0.42}
42%|█████████████████████████████████▉ | 374/892 [36:57<55:50, 6.47s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:16:53,962 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.5325, 'learning_rate': 0.000744, 'epoch': 0.42}
42%|██████████████████████████████████ | 375/892 [37:04<56:40, 6.58s/it]
{'loss': 4.8427, 'learning_rate': 0.000746, 'epoch': 0.42}
[WARNING|modeling_utils.py:388] 2022-03-03 04:17:03,847 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
42%|██████████████████████████████████▏ | 376/892 [37:10<56:03, 6.52s/it]
{'loss': 5.1271, 'learning_rate': 0.000748, 'epoch': 0.42}
[WARNING|modeling_utils.py:388] 2022-03-03 04:17:11,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:17:16,218 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
42%|██████████████████████████████████▎ | 378/892 [37:22<54:08, 6.32s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:17:22,355 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
42%|██████████████████████████████████▍ | 379/892 [37:28<53:30, 6.26s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:17:28,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
43%|██████████████████████████████████▌ | 380/892 [37:34<52:54, 6.20s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:17:34,340 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
43%|██████████████████████████████████▌ | 381/892 [37:40<52:09, 6.13s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:17:38,842 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
43%|██████████████████████████████████▋ | 382/892 [37:46<51:34, 6.07s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:17:44,753 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
43%|██████████████████████████████████▊ | 383/892 [37:52<50:55, 6.00s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:17:49,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:17:53,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.0459, 'learning_rate': 0.000764, 'epoch': 0.43}
[WARNING|modeling_utils.py:388] 2022-03-03 04:17:53,336 >> Could not estimate the number of tokens of the input, floating-point operations will
not be computed-03 04:17:49,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:17:59,033 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:17:49,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:17:59,033 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:17:49,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9586, 'learning_rate': 0.0007660000000000001, 'epoch': 0.43} [WARNING|modeling_utils.py:388] 2022-03-03 04:18:03,220 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:17:49,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:18:03,220 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:17:49,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|███████████████████████████████████ | 386/892 [38:09<48:27, 5.75s/it]g-point operations will not be computed-03 04:17:49,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:18:07,306 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:17:49,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:18:09,919 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed-03 04:17:49,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:18:09,919 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:17:49,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.0477, 'learning_rate': 0.0007700000000000001, 'epoch': 0.43} [WARNING|modeling_utils.py:388] 2022-03-03 04:18:13,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:17:49,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|███████████████████████████████████▏ | 388/892 [38:20<46:26, 5.53s/it]g-point operations will not be computed-03 04:17:49,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|███████████████████████████████████▏ | 388/892 [38:20<46:26, 5.53s/it]g-point operations will not be computed-03 04:17:49,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:18:17,824 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:17:49,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:18:17,824 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:17:49,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:18:17,824 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:17:49,123 >> Could not estimate the number 
of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▎ | 389/892 [38:25<45:35, 5.44s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:18:21,778 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▎ | 389/892 [38:25<45:35, 5.44s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:18:21,778 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:18:25,433 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:18:21,778 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:18:25,433 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:18:21,778 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:18:27,878 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:18:21,778 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:18:30,182 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:18:21,778 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:18:30,182 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:18:21,778 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:18:32,556 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:18:21,778 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:18:32,556 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:18:21,778 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:18:32,556 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:18:21,778 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▌ | 392/892 [38:39<41:17, 4.96s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:18:35,860 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:18:37,905 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:18:35,860 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:18:37,905 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:18:35,860 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▋ | 393/892 [38:43<39:29, 4.75s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:18:40,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:18:42,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:18:40,078 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:18:42,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:18:40,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▊ | 394/892 [38:47<37:33, 4.52s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:18:44,008 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▊ | 395/892 [38:51<35:25, 4.28s/it]g-point operations will not be computed-03 04:18:44,008 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▊ | 395/892 [38:51<35:25, 4.28s/it]g-point operations will not be computed-03 04:18:44,008 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▊ | 395/892 [38:51<35:25, 4.28s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:18:47,630 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▉ | 396/892 [38:54<32:59, 3.99s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:18:50,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▉ | 396/892 [38:54<32:59, 3.99s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:18:50,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████ | 397/892 [38:57<30:16, 3.67s/it]g-point operations will not be computed-03 04:18:50,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed 45%|████████████████████████████████████ | 397/892 [38:57<30:16, 3.67s/it]g-point operations will not be computed-03 04:18:50,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:18:54,901 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:18:53,659 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:18:54,901 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:18:53,659 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▏ | 398/892 [39:00<27:30, 3.34s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:18:56,152 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▏ | 399/892 [39:02<24:51, 3.03s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:18:58,389 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▏ | 399/892 [39:02<24:51, 3.03s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:18:58,389 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▎ | 400/892 [39:05<23:47, 2.90s/it]g-point operations will not be computed-03 04:18:58,389 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▎ | 400/892 [39:05<23:47, 2.90s/it]g-point operations will not be computed-03 04:18:58,389 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
45%|████████████████████████████████████▎ | 400/892 [39:05<23:47, 2.90s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:19:02,462 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▎ | 400/892 [39:05<23:47, 2.90s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:19:02,462 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:19:06,179 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:19:02,462 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▍ | 401/892 [39:12<35:20, 4.32s/it]g-point operations will not be computed-03 04:19:02,462 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▍ | 401/892 [39:12<35:20, 4.32s/it]g-point operations will not be computed-03 04:19:02,462 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▍ | 401/892 [39:12<35:20, 4.32s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:19:09,915 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▍ | 401/892 [39:12<35:20, 4.32s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:19:09,915 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:19:13,483 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:19:09,915 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-03 04:19:13,483 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:19:09,915 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▌ | 402/892 [39:20<42:28, 5.20s/it]g-point operations will not be computed-03 04:19:09,915 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:19:18,952 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:19:09,915 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:19:18,952 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:19:09,915 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▌ | 403/892 [39:27<47:20, 5.81s/it]g-point operations will not be computed-03 04:19:09,915 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▌ | 403/892 [39:27<47:20, 5.81s/it]g-point operations will not be computed-03 04:19:09,915 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.4463, 'learning_rate': 0.0008020000000000001, 'epoch': 0.45} 45%|████████████████████████████████████▌ | 403/892 [39:27<47:20, 5.81s/it]g-point operations will not be computed-03 04:19:09,915 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▌ | 403/892 [39:27<47:20, 5.81s/it]g-point operations will not be computed-03 04:19:09,915 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▋ | 404/892 [39:34<50:37, 6.22s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▋ | 404/892 [39:34<50:37, 6.22s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.0944, 'learning_rate': 0.000804, 'epoch': 0.45} 45%|████████████████████████████████████▋ | 404/892 [39:34<50:37, 6.22s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▊ | 405/892 [39:41<52:50, 6.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▊ | 405/892 [39:41<52:50, 6.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.3147, 'learning_rate': 0.0008060000000000001, 'epoch': 0.45} 45%|████████████████████████████████████▊ | 405/892 [39:41<52:50, 6.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▊ | 405/892 [39:41<52:50, 6.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▊ | 405/892 [39:41<52:50, 6.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:19:31,578 
>> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|████████████████████████████████████▊ | 406/892 [39:48<54:04, 6.67s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:19:47,481 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:19:47,481 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|████████████████████████████████████▉ | 407/892 [39:55<54:39, 6.76s/it]g-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|████████████████████████████████████▉ | 407/892 [39:55<54:39, 6.76s/it]g-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.163, 'learning_rate': 0.0008100000000000001, 'epoch': 0.46} 46%|████████████████████████████████████▉ | 407/892 [39:55<54:39, 6.76s/it]g-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:19:57,953 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 
04:19:57,953 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.0185, 'learning_rate': 0.0008120000000000001, 'epoch': 0.46} [WARNING|modeling_utils.py:388] 2022-03-03 04:19:57,953 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:19:57,953 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▏ | 409/892 [40:09<55:34, 6.90s/it]g-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▏ | 409/892 [40:09<55:34, 6.90s/it]g-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:20:08,517 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:20:08,517 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:20:08,517 >> Could not estimate the number of tokens 
of the input, floating-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▏ | 410/892 [40:16<55:33, 6.92s/it]g-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▏ | 410/892 [40:16<55:33, 6.92s/it]g-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▏ | 410/892 [40:16<55:33, 6.92s/it]g-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▏ | 410/892 [40:16<55:33, 6.92s/it]g-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▏ | 410/892 [40:16<55:33, 6.92s/it]g-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▎ | 411/892 [40:23<55:16, 6.89s/it]g-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:20:22,134 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:20:22,134 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:20:22,134 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▍ | 412/892 [40:30<54:36, 6.83s/it]g-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:20:28,921 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:20:28,921 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:20:28,921 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▌ | 413/892 [40:37<54:29, 6.82s/it]g-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▌ | 413/892 [40:37<54:29, 6.82s/it]g-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
46%|█████████████████████████████████████▌ | 413/892 [40:37<54:29, 6.82s/it]g-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:20:38,970 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:20:38,970 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8392, 'learning_rate': 0.000824, 'epoch': 0.46} [WARNING|modeling_utils.py:388] 2022-03-03 04:20:38,970 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:20:38,970 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:20:38,970 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|█████████████████████████████████████▋ | 415/892 [40:50<53:46, 6.76s/it]g-point operations will not be computed-03 04:19:31,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:20:49,063 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:20:49,063 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
47%|█████████████████████████████████████▊ | 416/892 [40:57<53:26, 6.74s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:20:59,011 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.9795, 'learning_rate': 0.00083, 'epoch': 0.47}
[WARNING|modeling_utils.py:388] 2022-03-03 04:21:05,741 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.216, 'learning_rate': 0.000832, 'epoch': 0.47}
47%|██████████████████████████████████████ | 419/892 [41:17<52:39, 6.68s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:21:14,060 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
47%|██████████████████████████████████████▏ | 420/892 [41:23<52:15, 6.64s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:21:23,759 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
47%|██████████████████████████████████████▏ | 421/892 [41:30<51:47, 6.60s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:21:31,843 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.2313, 'learning_rate': 0.00084, 'epoch': 0.47}
[WARNING|modeling_utils.py:388] 2022-03-03 04:21:38,217 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 6.1135, 'learning_rate': 0.000842, 'epoch': 0.47}
[WARNING|modeling_utils.py:388] 2022-03-03 04:21:44,711 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.0778, 'learning_rate': 0.000844, 'epoch': 0.48}
48%|██████████████████████████████████████▌ | 425/892 [41:56<51:27, 6.61s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:21:54,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
48%|██████████████████████████████████████▋ | 426/892 [42:02<50:41, 6.53s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:22:01,089 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
48%|██████████████████████████████████████▊ | 427/892 [42:09<49:56, 6.44s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:22:07,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
48%|██████████████████████████████████████▊ | 428/892 [42:15<49:17, 6.37s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:22:15,132 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
48%|██████████████████████████████████████▉ | 429/892 [42:21<48:54, 6.34s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:22:21,292 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
48%|███████████████████████████████████████ | 430/892 [42:27<48:25, 6.29s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:22:27,397 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
48%|███████████████████████████████████████▏ | 431/892 [42:33<47:50, 6.23s/it]
{'loss': 5.4418, 'learning_rate': 0.000858, 'epoch': 0.48}
[WARNING|modeling_utils.py:388] 2022-03-03 04:22:33,429 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
48%|███████████████████████████████████████▏ | 432/892 [42:39<47:17, 6.17s/it]
{'loss': 5.1764, 'learning_rate': 0.00086, 'epoch': 0.48}
[WARNING|modeling_utils.py:388] 2022-03-03 04:22:39,465 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
49%|███████████████████████████████████████▎ | 433/892 [42:45<46:52, 6.13s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:22:43,946 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
49%|███████████████████████████████████████▍ | 434/892 [42:51<46:16, 6.06s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:22:49,749 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
49%|███████████████████████████████████████▌ | 435/892 [42:57<45:23, 5.96s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:22:54,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:22:58,238 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.2176, 'learning_rate': 0.0008680000000000001, 'epoch': 0.49}
[WARNING|modeling_utils.py:388] 2022-03-03 04:23:02,456 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
49%|███████████████████████████████████████▋ | 437/892 [43:08<43:47, 5.78s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:23:06,552 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
49%|███████████████████████████████████████▊ | 438/892 [43:14<42:53, 5.67s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:23:10,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:23:14,481 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.1533, 'learning_rate': 0.000874, 'epoch': 0.49}
[WARNING|modeling_utils.py:388] 2022-03-03 04:23:18,277 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
49%|███████████████████████████████████████▉ | 440/892 [43:24<40:37, 5.39s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:23:22,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:23:24,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:23:26,736 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:23:28,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.9092, 'learning_rate': 0.00088, 'epoch': 0.5}
50%|████████████████████████████████████████▏ | 443/892 [43:38<35:52, 4.79s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:23:34,281 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
50%|████████████████████████████████████████▎ | 444/892 [43:42<34:01, 4.56s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:23:38,239 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
50%|████████████████████████████████████████▍ | 445/892 [43:45<32:08, 4.32s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:23:41,898 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
50%|████████████████████████████████████████▌ | 446/892 [43:49<29:55, 4.03s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:23:45,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:23:46,566 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
50%|████████████████████████████████████████▌ | 447/892 [43:52<27:30, 3.71s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:23:48,148 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
50%|████████████████████████████████████████▋ | 448/892 [43:55<27:09, 3.67s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:23:51,788 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
50%|████████████████████████████████████████▊ | 449/892 [43:59<26:29, 3.59s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:23:55,026 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
50%|████████████████████████████████████████▊ | 450/892 [44:02<24:40, 3.35s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:23:59,117 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:24:02,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
51%|████████████████████████████████████████▉ | 451/892 [44:09<34:13, 4.66s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:24:06,642 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
51%|█████████████████████████████████████████ | 452/892 [44:16<40:00, 5.45s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:24:13,915 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.48, 'learning_rate': 0.0009000000000000001, 'epoch': 0.51}
51%|█████████████████████████████████████████▏ | 453/892 [44:24<43:55, 6.00s/it]
{'loss': 5.4334, 'learning_rate': 0.000902, 'epoch': 0.51}
51%|█████████████████████████████████████████▏ | 454/892 [44:31<46:19, 6.35s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:24:31,832 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
51%|█████████████████████████████████████████▎ | 455/892 [44:38<47:54, 6.58s/it]
{'loss': 5.4256, 'learning_rate': 0.000906, 'epoch': 0.51}
51%|█████████████████████████████████████████▍ | 456/892 [44:45<48:50, 6.72s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.3247, 'learning_rate': 0.0009080000000000001, 'epoch': 0.51}
51%|█████████████████████████████████████████▍ | 457/892 [44:52<49:18, 6.80s/it]
{'loss': 5.2336, 'learning_rate': 0.00091, 'epoch': 0.51}
51%|█████████████████████████████████████████▌ | 458/892 [44:59<49:43, 6.87s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:24:58,227 >> Could not
estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▋ | 459/892 [45:06<49:53, 6.91s/it]g-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▋ | 459/892 [45:06<49:53, 6.91s/it]g-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.2095, 'learning_rate': 0.0009140000000000001, 'epoch': 0.51} 51%|█████████████████████████████████████████▋ | 459/892 [45:06<49:53, 6.91s/it]g-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:25:08,562 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:25:08,562 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.3384, 'learning_rate': 0.000916, 'epoch': 0.52} [WARNING|modeling_utils.py:388] 2022-03-03 04:25:08,562 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:25:08,562 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed 52%|█████████████████████████████████████████▊ | 461/892 [45:20<49:21, 6.87s/it]g-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|█████████████████████████████████████████▊ | 461/892 [45:20<49:21, 6.87s/it]g-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:25:18,786 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:25:18,786 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|█████████████████████████████████████████▉ | 462/892 [45:27<49:08, 6.86s/it]g-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|█████████████████████████████████████████▉ | 462/892 [45:27<49:08, 6.86s/it]g-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.3328, 'learning_rate': 0.00092, 'epoch': 0.52} [WARNING|modeling_utils.py:388] 2022-03-03 04:25:27,261 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████ | 463/892 [45:33<48:43, 
6.82s/it]g-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████ | 463/892 [45:33<48:43, 6.82s/it]g-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.5567, 'learning_rate': 0.0009220000000000001, 'epoch': 0.52} 52%|██████████████████████████████████████████ | 463/892 [45:33<48:43, 6.82s/it]g-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:25:35,582 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:25:35,582 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:25:38,969 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:25:38,969 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▏ | 465/892 [45:47<47:53, 6.73s/it]g-point operations will not be computed-03 04:24:42,464 >> Could not estimate 
the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▏ | 465/892 [45:47<47:53, 6.73s/it]g-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:25:45,590 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:25:45,590 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▎ | 466/892 [45:53<47:46, 6.73s/it]g-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▎ | 466/892 [45:53<47:46, 6.73s/it]g-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.3116, 'learning_rate': 0.0009280000000000001, 'epoch': 0.52} 52%|██████████████████████████████████████████▎ | 466/892 [45:53<47:46, 6.73s/it]g-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:25:55,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 
2022-03-03 04:25:55,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.0797, 'learning_rate': 0.00093, 'epoch': 0.52} [WARNING|modeling_utils.py:388] 2022-03-03 04:25:55,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:01,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:01,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.1186, 'learning_rate': 0.0009320000000000001, 'epoch': 0.52} [WARNING|modeling_utils.py:388] 2022-03-03 04:26:01,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:01,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:01,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of 
the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▌ | 469/892 [46:13<46:44, 6.63s/it]g-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:11,915 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:11,915 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:11,915 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▋ | 470/892 [46:20<46:21, 6.59s/it]g-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:18,353 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:18,353 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:18,353 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▊ | 471/892 [46:26<45:48, 6.53s/it]g-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:24,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:24,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:24,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▊ | 472/892 [46:32<45:29, 6.50s/it]g-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▊ | 472/892 [46:32<45:29, 6.50s/it]g-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▊ | 472/892 [46:32<45:29, 6.50s/it]g-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed 53%|██████████████████████████████████████████▊ | 472/892 [46:32<45:29, 6.50s/it]g-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:34,256 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:34,256 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:34,256 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:34,256 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:40,513 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:40,513 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 
04:26:40,513 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:40,513 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:40,513 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:24:42,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████▏ | 475/892 [46:52<45:08, 6.49s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:26:48,965 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████▏ | 475/892 [46:52<45:08, 6.49s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:26:48,965 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:53,515 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:26:48,965 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:53,515 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:26:48,965 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.418, 'learning_rate': 0.000948, 'epoch': 0.53} [WARNING|modeling_utils.py:388] 2022-03-03 04:26:53,515 >> Could not estimate the number of tokens 
of the input, floating-point operations will not be computed-03 04:26:48,965 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:59,670 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:26:48,965 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:26:59,670 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:26:48,965 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.5494, 'learning_rate': 0.00095, 'epoch': 0.53} [WARNING|modeling_utils.py:388] 2022-03-03 04:26:59,670 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:26:48,965 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:27:05,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:26:48,965 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:27:05,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:26:48,965 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.0258, 'learning_rate': 0.0009519999999999999, 'epoch': 0.54} [WARNING|modeling_utils.py:388] 2022-03-03 04:27:05,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:26:48,965 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-03 04:27:11,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.0919, 'learning_rate': 0.000954, 'epoch': 0.54}
[WARNING|modeling_utils.py:388] 2022-03-03 04:27:17,895 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.0294, 'learning_rate': 0.0009559999999999999, 'epoch': 0.54}
54%|███████████████████████████████████████████▋ | 481/892 [47:28<41:47, 6.10s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:27:25,411 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:27:28,314 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
54%|███████████████████████████████████████████▊ | 482/892 [47:34<41:21, 6.05s/it]
{'loss': 5.3622, 'learning_rate': 0.00096, 'epoch': 0.54}
[WARNING|modeling_utils.py:388] 2022-03-03 04:27:34,128 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
54%|███████████████████████████████████████████▊ | 483/892 [47:40<40:39, 5.96s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:27:39,836 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
54%|███████████████████████████████████████████▉ | 484/892 [47:46<39:58, 5.88s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:27:44,069 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
54%|████████████████████████████████████████████ | 485/892 [47:51<39:18, 5.79s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:27:48,269 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:27:52,365 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.9877, 'learning_rate': 0.000968, 'epoch': 0.54}
[WARNING|modeling_utils.py:388] 2022-03-03 04:27:56,503 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
55%|████████████████████████████████████████████▏ | 487/892 [48:02<38:08, 5.65s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:28:00,551 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
55%|████████████████████████████████████████████▎ | 488/892 [48:08<37:18, 5.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:28:04,495 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:28:06,998 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
55%|████████████████████████████████████████████▍ | 489/892 [48:13<36:18, 5.41s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:28:10,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
55%|████████████████████████████████████████████▍ | 490/892 [48:18<35:17, 5.27s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:28:14,459 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:28:16,739 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
55%|████████████████████████████████████████████▌ | 491/892 [48:22<34:00, 5.09s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:28:22,252 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:28:24,341 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:28:26,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:28:28,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:28:29,894 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:28:33,229 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:28:33,229 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:28:34,846 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:28:34,846 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:28:36,309 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:28:39,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 
2022-03-03 04:28:39,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:28:40,418 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:28:40,418 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:28:42,751 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:28:42,751 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:28:44,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:28:44,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 04:28:46,559 >> Batch size = 
8aluation *****e number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 0%| | 0/331 [00:00> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▌ | 2/331 [00:02<06:42, 1.22s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▊ | 3/331 [00:04<08:53, 1.63s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|█ | 4/331 [00:06<10:05, 1.85s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▎ | 5/331 [00:09<11:40, 2.15s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▌ | 6/331 [00:12<12:45, 2.35s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 7/331 [00:14<12:49, 2.38s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|██ | 8/331 [00:17<13:07, 2.44s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▎ | 9/331 [00:20<13:39, 2.55s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▍ | 10/331 [00:23<14:37, 2.73s/it]g-point operations will not be computed-03 
04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▋ | 11/331 [00:25<14:08, 2.65s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▉ | 12/331 [00:28<13:56, 2.62s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 13/331 [00:30<13:46, 2.60s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▍ | 14/331 [00:33<13:33, 2.57s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▋ | 15/331 [00:36<14:55, 2.83s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▉ | 16/331 [00:40<15:48, 3.01s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▏ | 17/331 [00:43<15:54, 3.04s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▍ | 18/331 [00:45<14:36, 2.80s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 19/331 [00:48<14:16, 2.74s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 20/331 [00:50<13:24, 2.59s/it]g-point operations will not be 
computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|█████▏ | 21/331 [00:53<13:48, 2.67s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 22/331 [00:56<14:59, 2.91s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 23/331 [01:00<16:26, 3.20s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▉ | 24/331 [01:04<17:22, 3.40s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 25/331 [01:07<16:42, 3.28s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▍ | 26/331 [01:09<15:30, 3.05s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▋ | 27/331 [01:13<15:34, 3.08s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▉ | 28/331 [01:15<15:05, 2.99s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▏ | 29/331 [01:18<14:36, 2.90s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▍ | 30/331 [01:21<14:00, 
2.79s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▋ | 31/331 [01:23<13:25, 2.69s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 32/331 [01:26<13:07, 2.63s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████▏ | 33/331 [01:28<13:08, 2.64s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████▍ | 34/331 [01:31<12:59, 2.63s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▋ | 35/331 [01:34<13:10, 2.67s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 36/331 [01:37<13:48, 2.81s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|█████████▏ | 37/331 [01:40<14:35, 2.98s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|█████████▍ | 38/331 [01:43<14:48, 3.03s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▋ | 39/331 [01:46<14:50, 3.05s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed 12%|█████████▉ | 40/331 [01:49<13:33, 2.79s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|██████████▏ | 41/331 [01:51<12:53, 2.67s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▍ | 42/331 [01:54<13:45, 2.86s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▋ | 43/331 [01:58<14:32, 3.03s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▉ | 44/331 [02:01<14:55, 3.12s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▏ | 45/331 [02:04<14:02, 2.95s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▍ | 46/331 [02:06<12:56, 2.73s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▋ | 47/331 [02:08<12:06, 2.56s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▉ | 48/331 [02:11<12:24, 2.63s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████▏ | 49/331 [02:14<13:02, 2.78s/it]g-point operations will not be computed-03 
04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████▍ | 50/331 [02:17<12:53, 2.75s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████▋ | 51/331 [02:20<13:13, 2.84s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▉ | 52/331 [02:22<12:39, 2.72s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|█████████████▏ | 53/331 [02:25<12:39, 2.73s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|█████████████▍ | 54/331 [02:27<12:06, 2.62s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▋ | 55/331 [02:31<13:06, 2.85s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▊ | 56/331 [02:33<12:52, 2.81s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|██████████████ | 57/331 [02:36<12:27, 2.73s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▎ | 58/331 [02:39<12:57, 2.85s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will 
not be computed 18%|██████████████▌ | 59/331 [02:41<12:15, 2.71s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▊ | 60/331 [02:44<11:54, 2.64s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|███████████████ | 61/331 [02:47<12:18, 2.73s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▎ | 62/331 [02:49<12:19, 2.75s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▌ | 63/331 [02:53<13:29, 3.02s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▊ | 64/331 [02:56<13:06, 2.95s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|████████████████ | 65/331 [02:59<12:53, 2.91s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|████████████████▎ | 66/331 [03:03<14:12, 3.22s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|████████████████▌ | 67/331 [03:06<14:46, 3.36s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▊ | 68/331 [03:10<14:51, 3.39s/it]g-point 
operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|█████████████████ | 69/331 [03:13<14:26, 3.31s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|█████████████████▎ | 70/331 [03:16<14:10, 3.26s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|█████████████████▌ | 71/331 [03:20<14:20, 3.31s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▊ | 72/331 [03:23<14:17, 3.31s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|██████████████████ | 73/331 [03:26<13:45, 3.20s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|██████████████████▎ | 74/331 [03:29<13:23, 3.13s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▌ | 75/331 [03:32<13:30, 3.16s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▊ | 76/331 [03:35<12:48, 3.02s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|███████████████████ | 77/331 [03:37<12:27, 2.94s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▎ | 78/331 [03:40<11:51, 2.81s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▌ | 79/331 [03:42<11:22, 2.71s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▊ | 80/331 [03:45<11:13, 2.68s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|████████████████████ | 81/331 [03:48<11:39, 2.80s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████▎ | 82/331 [03:51<11:25, 2.75s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████▌ | 83/331 [03:54<11:49, 2.86s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████▊ | 84/331 [03:57<12:36, 3.06s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|█████████████████████ | 85/331 [04:00<11:41, 2.85s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|█████████████████████▎ | 86/331 [04:03<12:18, 3.01s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed 26%|█████████████████████▌ | 87/331 [04:06<11:55, 2.93s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▊ | 88/331 [04:09<11:38, 2.87s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|██████████████████████ | 89/331 [04:11<10:48, 2.68s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|██████████████████████▎ | 90/331 [04:13<10:19, 2.57s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|██████████████████████▌ | 91/331 [04:16<10:49, 2.71s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▊ | 92/331 [04:18<10:09, 2.55s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|███████████████████████ | 93/331 [04:21<10:17, 2.59s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|███████████████████████▎ | 94/331 [04:24<10:32, 2.67s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▌ | 95/331 [04:27<10:39, 2.71s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 29%|███████████████████████▊ | 96/331 [04:29<10:41, 2.73s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|████████████████████████ | 97/331 [04:32<10:17, 2.64s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▎ | 98/331 [04:35<10:37, 2.74s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▌ | 99/331 [04:38<10:30, 2.72s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▍ | 100/331 [04:40<10:04, 2.62s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▋ | 101/331 [04:43<09:58, 2.60s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▉ | 102/331 [04:46<10:49, 2.83s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▏ | 103/331 [04:48<10:16, 2.71s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▍ | 104/331 [04:51<10:15, 2.71s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed 32%|█████████████████████████▋ | 105/331 [04:54<10:16, 2.73s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|█████████████████████████▉ | 106/331 [04:57<10:15, 2.74s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|██████████████████████████▏ | 107/331 [04:59<09:30, 2.55s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|██████████████████████████▍ | 108/331 [05:01<09:16, 2.50s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|██████████████████████████▋ | 109/331 [05:04<09:16, 2.51s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|██████████████████████████▉ | 110/331 [05:07<09:46, 2.65s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▏ | 111/331 [05:09<09:48, 2.68s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▍ | 112/331 [05:12<09:49, 2.69s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▋ | 113/331 [05:14<09:19, 2.57s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate 
the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▉ | 114/331 [05:17<09:24, 2.60s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|████████████████████████████▏ | 115/331 [05:20<09:21, 2.60s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|████████████████████████████▍ | 116/331 [05:22<09:40, 2.70s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|████████████████████████████▋ | 117/331 [05:25<09:39, 2.71s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▉ | 118/331 [05:28<09:21, 2.64s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|█████████████████████████████ | 119/331 [05:30<09:25, 2.67s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|█████████████████████████████▎ | 120/331 [05:33<09:25, 2.68s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▌ | 121/331 [05:36<09:56, 2.84s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▊ | 122/331 [05:39<09:44, 2.80s/it]g-point operations will not be 
computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████ | 123/331 [05:42<10:21, 2.99s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████▎ | 124/331 [05:45<10:08, 2.94s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▌ | 125/331 [05:49<10:43, 3.13s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▊ | 126/331 [05:52<10:46, 3.15s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|███████████████████████████████ | 127/331 [05:56<11:09, 3.28s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▎ | 128/331 [05:59<11:07, 3.29s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▌ | 129/331 [06:02<10:53, 3.24s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
40%|████████████████████████████████ | 131/331 [06:09<11:12, 3.36s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▎ | 132/331 [06:12<10:38, 3.21s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▌ | 133/331 [06:14<10:00, 3.03s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▊ | 134/331 [06:17<09:36, 2.93s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|█████████████████████████████████ | 135/331 [06:20<09:41, 2.97s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|█████████████████████████████████▎ | 136/331 [06:24<09:57, 3.06s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|█████████████████████████████████▌ | 137/331 [06:27<10:21, 3.21s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 42%|█████████████████████████████████▊ | 138/331 [06:31<10:36, 3.30s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 42%|██████████████████████████████████ | 139/331 [06:33<09:23, 2.94s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 42%|██████████████████████████████████▎ | 140/331 [06:36<10:03, 3.16s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▌ | 141/331 [06:39<09:31, 3.01s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▉ | 143/331 [06:45<09:42, 3.10s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▏ | 144/331 [06:48<09:16, 2.98s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▍ | 145/331 [06:51<09:09, 2.95s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▋ | 146/331 [06:54<09:31, 3.09s/it]g-point 
operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▉ | 147/331 [06:57<09:11, 3.00s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▏ | 148/331 [06:59<08:33, 2.80s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▍ | 149/331 [07:02<08:00, 2.64s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▋ | 150/331 [07:05<08:19, 2.76s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|████████████████████████████████████▉ | 151/331 [07:07<08:09, 2.72s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▏ | 152/331 [07:10<07:49, 2.62s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▍ | 153/331 [07:12<07:47, 2.63s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|█████████████████████████████████████▋ | 154/331 [07:15<08:06, 2.75s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 47%|█████████████████████████████████████▉ | 155/331 [07:19<08:29, 2.90s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▏ | 156/331 [07:22<08:43, 2.99s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▍ | 157/331 [07:25<09:01, 3.11s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▋ | 158/331 [07:28<09:06, 3.16s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▉ | 159/331 [07:32<09:07, 3.18s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████▏ | 160/331 [07:34<08:36, 3.02s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▍ | 161/331 [07:37<08:22, 2.96s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▋ | 162/331 [07:41<08:53, 3.16s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▉ | 163/331 
[07:44<08:58, 3.20s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▏ | 164/331 [07:47<08:28, 3.04s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▍ | 165/331 [07:50<08:15, 2.98s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▌ | 166/331 [07:52<08:04, 2.94s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▊ | 167/331 [07:56<08:15, 3.02s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████ | 168/331 [07:58<07:47, 2.87s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▎ | 169/331 [08:01<07:54, 2.93s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▌ | 170/331 [08:04<07:30, 2.80s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|█████████████████████████████████████████▊ | 171/331 [08:06<07:25, 2.78s/it]g-point operations will not be computed-03 04:28:19,045 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████ | 172/331 [08:09<07:08, 2.69s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▎ | 173/331 [08:12<07:17, 2.77s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▌ | 174/331 [08:14<06:56, 2.65s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▊ | 175/331 [08:17<07:00, 2.70s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████ | 176/331 [08:20<06:46, 2.62s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████▎ | 177/331 [08:23<07:07, 2.78s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▌ | 178/331 [08:26<07:34, 2.97s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▊ | 179/331 [08:30<07:57, 3.14s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed 54%|████████████████████████████████████████████ | 180/331 [08:33<07:50, 3.12s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▎ | 181/331 [08:36<07:45, 3.10s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▌ | 182/331 [08:38<07:09, 2.88s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▊ | 183/331 [08:40<06:36, 2.68s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████ | 184/331 [08:42<06:09, 2.52s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▎ | 185/331 [08:44<05:42, 2.35s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▌ | 186/331 [08:47<05:53, 2.43s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▊ | 187/331 [08:50<06:23, 2.66s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
57%|██████████████████████████████████████████████ | 188/331 [08:53<06:23, 2.68s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|██████████████████████████████████████████████▎ | 189/331 [08:55<06:06, 2.58s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|██████████████████████████████████████████████▍ | 190/331 [08:58<05:52, 2.50s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 58%|██████████████████████████████████████████████▋ | 191/331 [09:00<05:48, 2.49s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 58%|██████████████████████████████████████████████▉ | 192/331 [09:02<05:37, 2.43s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 58%|███████████████████████████████████████████████▏ | 193/331 [09:06<06:04, 2.64s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|███████████████████████████████████████████████▍ | 194/331 [09:08<05:43, 2.51s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|███████████████████████████████████████████████▋ | 195/331 [09:10<05:34, 2.46s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
59%|███████████████████████████████████████████████▉ | 196/331 [09:13<05:40, 2.52s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▏ | 197/331 [09:16<05:55, 2.65s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▍ | 198/331 [09:18<05:39, 2.55s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▋ | 199/331 [09:21<05:43, 2.60s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▉ | 200/331 [09:23<05:25, 2.48s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▏ | 201/331 [09:25<05:22, 2.48s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▍ | 202/331 [09:28<05:29, 2.55s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▋ | 203/331 [09:31<05:27, 2.56s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
62%|█████████████████████████████████████████████████▉ | 204/331 [09:34<05:47, 2.73s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▏ | 205/331 [09:37<05:49, 2.77s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▍ | 206/331 [09:39<05:44, 2.75s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|██████████████████████████████████████████████████▋ | 207/331 [09:43<05:58, 2.89s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|██████████████████████████████████████████████████▉ | 208/331 [09:46<06:03, 2.96s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▏ | 209/331 [09:48<05:33, 2.74s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▍ | 210/331 [09:50<05:10, 2.57s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▋ | 211/331 [09:53<05:12, 2.61s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
64%|███████████████████████████████████████████████████▉ | 212/331 [09:55<04:58, 2.51s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|████████████████████████████████████████████████████ | 213/331 [09:58<04:57, 2.52s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▎ | 214/331 [10:00<04:42, 2.41s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▌ | 215/331 [10:02<04:29, 2.32s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▊ | 216/331 [10:05<04:57, 2.59s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████ | 217/331 [10:08<04:57, 2.61s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████▎ | 218/331 [10:11<05:10, 2.75s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████▌ | 219/331 [10:14<05:06, 2.73s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed 66%|█████████████████████████████████████████████████████▊ | 220/331 [10:16<04:52, 2.64s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|██████████████████████████████████████████████████████ | 221/331 [10:19<04:54, 2.68s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|██████████████████████████████████████████████████████▎ | 222/331 [10:21<04:38, 2.55s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|██████████████████████████████████████████████████████▌ | 223/331 [10:24<04:39, 2.59s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▊ | 224/331 [10:26<04:40, 2.62s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████ | 225/331 [10:29<04:39, 2.64s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████▎ | 226/331 [10:32<04:54, 2.80s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▌ | 227/331 [10:35<04:47, 2.76s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▊ | 228/331 [10:38<04:39, 2.72s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|████████████████████████████████████████████████████████ | 229/331 [10:40<04:38, 2.73s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|████████████████████████████████████████████████████████▎ | 230/331 [10:43<04:30, 2.67s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▌ | 231/331 [10:46<04:36, 2.77s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▊ | 232/331 [10:48<04:30, 2.73s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|█████████████████████████████████████████████████████████ | 233/331 [10:52<04:37, 2.83s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▎ | 234/331 [10:54<04:22, 2.71s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▌ | 235/331 [10:56<04:11, 2.62s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▊ | 236/331 [11:00<04:37, 2.92s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|█████████████████████████████████████████████████████████▉ | 237/331 [11:03<04:48, 3.07s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▏ | 238/331 [11:06<04:45, 3.07s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▍ | 239/331 [11:10<04:43, 3.09s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|██████████████████████████████████████████████████████████▋ | 240/331 [11:13<04:45, 3.14s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|██████████████████████████████████████████████████████████▉ | 241/331 [11:16<04:48, 3.20s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▏ | 242/331 [11:19<04:45, 3.21s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▍ | 243/331 [11:23<04:42, 3.21s/it]g-point 
operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▋ | 244/331 [11:26<04:47, 3.30s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▉ | 245/331 [11:29<04:33, 3.18s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|████████████████████████████████████████████████████████████▏ | 246/331 [11:33<04:44, 3.35s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|████████████████████████████████████████████████████████████▍ | 247/331 [11:36<04:34, 3.26s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|████████████████████████████████████████████████████████████▋ | 248/331 [11:38<04:14, 3.07s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|████████████████████████████████████████████████████████████▉ | 249/331 [11:41<03:53, 2.84s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|█████████████████████████████████████████████████████████████▏ | 250/331 [11:43<03:40, 2.72s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
76%|█████████████████████████████████████████████████████████████▍ | 251/331 [11:46<03:43, 2.79s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|█████████████████████████████████████████████████████████████▋ | 252/331 [11:49<03:31, 2.67s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|█████████████████████████████████████████████████████████████▉ | 253/331 [11:52<03:39, 2.81s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|██████████████████████████████████████████████████████████████▏ | 254/331 [11:54<03:33, 2.77s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|██████████████████████████████████████████████████████████████▍ | 255/331 [11:58<03:38, 2.87s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|██████████████████████████████████████████████████████████████▋ | 256/331 [12:00<03:29, 2.79s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|██████████████████████████████████████████████████████████████▉ | 257/331 [12:03<03:33, 2.88s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|███████████████████████████████████████████████████████████████▏ | 258/331 [12:06<03:20, 2.74s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed 78%|███████████████████████████████████████████████████████████████▍ | 259/331 [12:08<03:14, 2.71s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|███████████████████████████████████████████████████████████████▋ | 260/331 [12:11<03:18, 2.80s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|███████████████████████████████████████████████████████████████▊ | 261/331 [12:14<03:04, 2.64s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|████████████████████████████████████████████████████████████████ | 262/331 [12:16<03:03, 2.66s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|████████████████████████████████████████████████████████████████▎ | 263/331 [12:19<03:10, 2.80s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|████████████████████████████████████████████████████████████████▌ | 264/331 [12:22<03:00, 2.69s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|████████████████████████████████████████████████████████████████▊ | 265/331 [12:24<02:53, 2.63s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|█████████████████████████████████████████████████████████████████ | 
266/331 [12:27<02:47, 2.57s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|█████████████████████████████████████████████████████████████████▎ | 267/331 [12:30<02:55, 2.74s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|█████████████████████████████████████████████████████████████████▌ | 268/331 [12:33<02:52, 2.74s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|█████████████████████████████████████████████████████████████████▊ | 269/331 [12:36<02:59, 2.90s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|██████████████████████████████████████████████████████████████████ | 270/331 [12:39<02:56, 2.90s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|██████████████████████████████████████████████████████████████████▎ | 271/331 [12:42<03:00, 3.01s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|██████████████████████████████████████████████████████████████████▌ | 272/331 [12:45<02:51, 2.90s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|██████████████████████████████████████████████████████████████████▊ | 273/331 [12:48<02:50, 2.95s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 83%|███████████████████████████████████████████████████████████████████ | 274/331 [12:51<02:55, 3.07s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 83%|███████████████████████████████████████████████████████████████████▎ | 275/331 [12:54<02:55, 3.14s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 83%|███████████████████████████████████████████████████████████████████▌ | 276/331 [12:57<02:42, 2.96s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|███████████████████████████████████████████████████████████████████▊ | 277/331 [13:00<02:35, 2.88s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|████████████████████████████████████████████████████████████████████ | 278/331 [13:02<02:30, 2.83s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|████████████████████████████████████████████████████████████████████▎ | 279/331 [13:06<02:39, 3.06s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 85%|████████████████████████████████████████████████████████████████████▌ | 280/331 [13:09<02:33, 3.02s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 85%|████████████████████████████████████████████████████████████████████▊ | 281/331 
[13:12<02:36, 3.13s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 85%|█████████████████████████████████████████████████████████████████████ | 282/331 [13:15<02:33, 3.14s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 85%|█████████████████████████████████████████████████████████████████████▎ | 283/331 [13:19<02:34, 3.22s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|█████████████████████████████████████████████████████████████████████▍ | 284/331 [13:22<02:34, 3.29s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|█████████████████████████████████████████████████████████████████████▋ | 285/331 [13:26<02:33, 3.33s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|█████████████████████████████████████████████████████████████████████▉ | 286/331 [13:29<02:31, 3.36s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|██████████████████████████████████████████████████████████████████████▏ | 287/331 [13:33<02:31, 3.45s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|██████████████████████████████████████████████████████████████████████▍ | 288/331 [13:36<02:27, 3.43s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens 
of the input, floating-point operations will not be computed 87%|██████████████████████████████████████████████████████████████████████▋ | 289/331 [13:39<02:15, 3.22s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|██████████████████████████████████████████████████████████████████████▋ | 289/331 [13:39<02:15, 3.22s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|██████████████████████████████████████████████████████████████████████▋ | 289/331 [13:39<02:15, 3.22s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 88%|███████████████████████████████████████████████████████████████████████▏ | 291/331 [13:44<01:55, 2.88s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 88%|███████████████████████████████████████████████████████████████████████▍ | 292/331 [13:47<01:49, 2.82s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 89%|███████████████████████████████████████████████████████████████████████▋ | 293/331 [13:50<01:47, 2.83s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 89%|███████████████████████████████████████████████████████████████████████▉ | 294/331 [13:52<01:39, 2.70s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
89%|████████████████████████████████████████████████████████████████████████▏ | 295/331 [13:54<01:35, 2.64s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 89%|████████████████████████████████████████████████████████████████████████▍ | 296/331 [13:57<01:29, 2.55s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 90%|████████████████████████████████████████████████████████████████████████▋ | 297/331 [14:00<01:36, 2.85s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 90%|████████████████████████████████████████████████████████████████████████▉ | 298/331 [14:04<01:41, 3.06s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 90%|█████████████████████████████████████████████████████████████████████████▏ | 299/331 [14:07<01:34, 2.95s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 91%|█████████████████████████████████████████████████████████████████████████▍ | 300/331 [14:10<01:30, 2.93s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 91%|█████████████████████████████████████████████████████████████████████████▋ | 301/331 [14:12<01:26, 2.88s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 91%|█████████████████████████████████████████████████████████████████████████▉ | 302/331 
[14:15<01:21, 2.82s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 92%|██████████████████████████████████████████████████████████████████████████▏ | 303/331 [14:17<01:15, 2.71s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 92%|██████████████████████████████████████████████████████████████████████████▍ | 304/331 [14:20<01:15, 2.79s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 92%|██████████████████████████████████████████████████████████████████████████▋ | 305/331 [14:24<01:15, 2.92s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 92%|██████████████████████████████████████████████████████████████████████████▉ | 306/331 [14:27<01:17, 3.09s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 93%|███████████████████████████████████████████████████████████████████████████▏ | 307/331 [14:31<01:17, 3.23s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 93%|███████████████████████████████████████████████████████████████████████████▎ | 308/331 [14:35<01:18, 3.42s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 93%|███████████████████████████████████████████████████████████████████████████▌ | 309/331 [14:38<01:16, 3.46s/it]g-point operations will not be computed-03 04:28:19,045 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed 94%|███████████████████████████████████████████████████████████████████████████▊ | 310/331 [14:41<01:07, 3.21s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 94%|████████████████████████████████████████████████████████████████████████████ | 311/331 [14:44<01:04, 3.21s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 94%|████████████████████████████████████████████████████████████████████████████▎ | 312/331 [14:46<00:57, 3.01s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 95%|████████████████████████████████████████████████████████████████████████████▌ | 313/331 [14:49<00:52, 2.94s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 95%|████████████████████████████████████████████████████████████████████████████▊ | 314/331 [14:52<00:50, 2.97s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 95%|█████████████████████████████████████████████████████████████████████████████ | 315/331 [14:56<00:48, 3.06s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 95%|█████████████████████████████████████████████████████████████████████████████▎ | 316/331 [14:59<00:46, 3.08s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed 96%|█████████████████████████████████████████████████████████████████████████████▌ | 317/331 [15:02<00:45, 3.22s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 96%|█████████████████████████████████████████████████████████████████████████████▊ | 318/331 [15:05<00:39, 3.04s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 96%|██████████████████████████████████████████████████████████████████████████████ | 319/331 [15:07<00:34, 2.90s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 97%|██████████████████████████████████████████████████████████████████████████████▎ | 320/331 [15:10<00:32, 2.91s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 97%|██████████████████████████████████████████████████████████████████████████████▌ | 321/331 [15:13<00:28, 2.87s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 97%|██████████████████████████████████████████████████████████████████████████████▊ | 322/331 [15:16<00:27, 3.02s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 98%|███████████████████████████████████████████████████████████████████████████████ | 323/331 [15:19<00:23, 2.92s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
98%|███████████████████████████████████████████████████████████████████████████████▎ | 324/331 [15:22<00:21, 3.02s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 98%|███████████████████████████████████████████████████████████████████████████████▌ | 325/331 [15:26<00:18, 3.05s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 98%|███████████████████████████████████████████████████████████████████████████████▊ | 326/331 [15:29<00:15, 3.09s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 99%|████████████████████████████████████████████████████████████████████████████████ | 327/331 [15:32<00:12, 3.10s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 99%|████████████████████████████████████████████████████████████████████████████████▎| 328/331 [15:35<00:09, 3.14s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 99%|████████████████████████████████████████████████████████████████████████████████▌| 329/331 [15:38<00:06, 3.07s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 100%|████████████████████████████████████████████████████████████████████████████████▊| 330/331 [15:42<00:03, 3.24s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
100%|█████████████████████████████████████████████████████████████████████████████████| 331/331 [15:43<00:00, 2.81s/it]
03/03/2022 04:44:33 - INFO - datasets.metric - Removing /home/sanchit_huggingface_co/.cache/huggingface/metrics/wer/default/default_experiment-1-0.arrow
[INFO|configuration_utils.py:438] 2022-03-03 04:44:33,883 >> Configuration saved in ./checkpoint-500/config.json
[INFO|feature_extraction_utils.py:324] 2022-03-03 04:44:38,919 >> Configuration saved in ./checkpoint-500/preprocessor_config.json
03/03/2022 04:46:11 - WARNING - huggingface_hub.repository - Adding files tracked by Git LFS: ['wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb']. This may take a bit of time if the files are large.
[WARNING|modeling_utils.py:388] 2022-03-03 04:46:45,669 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
56%|██████████████████████████████████████████ | 501/892 [1:06:52<35:26:29, 326.32s/it]
{'loss': 6.2615, 'learning_rate': 0.000998, 'epoch': 0.56}
56%|██████████████████████████████████████████▏ | 502/892 
[1:07:00<24:59:45, 230.73s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|██████████████████████████████████████████▏ | 502/892 [1:07:00<24:59:45, 230.73s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9686, 'learning_rate': 0.001, 'epoch': 0.56} 56%|██████████████████████████████████████████▏ | 502/892 [1:07:00<24:59:45, 230.73s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|██████████████████████████████████████████▏ | 502/892 [1:07:00<24:59:45, 230.73s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|██████████████████████████████████████████▎ | 503/892 [1:07:07<17:41:58, 163.80s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|██████████████████████████████████████████▎ | 503/892 [1:07:07<17:41:58, 163.80s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.2496, 'learning_rate': 0.0009974489795918369, 'epoch': 0.56} 56%|██████████████████████████████████████████▎ | 503/892 [1:07:07<17:41:58, 163.80s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:47:10,289 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:47:10,289 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.4162, 'learning_rate': 0.0009948979591836735, 'epoch': 0.57} [WARNING|modeling_utils.py:388] 2022-03-03 04:47:10,289 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:47:10,289 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|███████████████████████████████████████████▌ | 505/892 [1:07:22<9:01:48, 84.00s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|███████████████████████████████████████████▌ | 505/892 [1:07:22<9:01:48, 84.00s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.3041, 'learning_rate': 0.00099234693877551, 'epoch': 0.57} 57%|███████████████████████████████████████████▌ | 505/892 [1:07:22<9:01:48, 84.00s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|███████████████████████████████████████████▌ | 505/892 [1:07:22<9:01:48, 84.00s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens 
of the input, floating-point operations will not be computed 57%|███████████████████████████████████████████▌ | 505/892 [1:07:22<9:01:48, 84.00s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|███████████████████████████████████████████▋ | 506/892 [1:07:29<6:32:37, 61.03s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|███████████████████████████████████████████▋ | 506/892 [1:07:29<6:32:37, 61.03s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:47:30,497 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|███████████████████████████████████████████▊ | 507/892 [1:07:37<4:48:05, 44.90s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|███████████████████████████████████████████▊ | 507/892 [1:07:37<4:48:05, 44.90s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.3472, 'learning_rate': 0.0009872448979591838, 'epoch': 0.57} 57%|███████████████████████████████████████████▊ | 507/892 [1:07:37<4:48:05, 44.90s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|███████████████████████████████████████████▊ | 507/892 [1:07:37<4:48:05, 44.90s/it]g-point 
operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|███████████████████████████████████████████▊ | 508/892 [1:07:44<3:34:55, 33.58s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|███████████████████████████████████████████▊ | 508/892 [1:07:44<3:34:55, 33.58s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.1626, 'learning_rate': 0.0009846938775510204, 'epoch': 0.57} [WARNING|modeling_utils.py:388] 2022-03-03 04:47:44,895 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|███████████████████████████████████████████▉ | 509/892 [1:07:51<2:43:50, 25.67s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|███████████████████████████████████████████▉ | 509/892 [1:07:51<2:43:50, 25.67s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.3774, 'learning_rate': 0.0009821428571428572, 'epoch': 0.57} 57%|███████████████████████████████████████████▉ | 509/892 [1:07:51<2:43:50, 25.67s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|███████████████████████████████████████████▉ | 509/892 [1:07:51<2:43:50, 25.67s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of 
the input, floating-point operations will not be computed 57%|████████████████████████████████████████████ | 510/892 [1:07:58<2:08:08, 20.13s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|████████████████████████████████████████████ | 510/892 [1:07:58<2:08:08, 20.13s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:47:57,402 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:47:57,402 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|████████████████████████████████████████████ | 511/892 [1:08:05<1:43:04, 16.23s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|████████████████████████████████████████████ | 511/892 [1:08:05<1:43:04, 16.23s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.1559, 'learning_rate': 0.0009770408163265307, 'epoch': 0.57} 57%|████████████████████████████████████████████ | 511/892 [1:08:05<1:43:04, 16.23s/it]g-point operations will not be computed-03 04:28:19,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|████████████████████████████████████████████ | 511/892 
[1:08:05<1:43:04, 16.23s/it]
57%|████████████████████████████████████████████▏ | 512/892 [1:08:13<1:25:26, 13.49s/it]
{'loss': 5.0161, 'learning_rate': 0.0009744897959183674, 'epoch': 0.57}
[WARNING|modeling_utils.py:388] 2022-03-03 04:48:13,349 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
58%|████████████████████████████████████████████▎ | 513/892 [1:08:20<1:12:48, 11.53s/it]
{'loss': 5.6675, 'learning_rate': 0.0009719387755102041, 'epoch': 0.58}
58%|████████████████████████████████████████████▎ | 514/892 [1:08:26<1:03:57, 10.15s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:48:25,476 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
58%|█████████████████████████████████████████████▌ | 515/892 [1:08:33<57:37, 9.17s/it]
{'loss': 5.2748, 'learning_rate': 0.0009668367346938776, 'epoch': 0.58}
[WARNING|modeling_utils.py:388] 2022-03-03 04:48:34,070 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
58%|█████████████████████████████████████████████▋ | 516/892 [1:08:40<53:13, 8.49s/it]
{'loss': 5.033, 'learning_rate': 0.0009642857142857143, 'epoch': 0.58}
58%|█████████████████████████████████████████████▊ | 517/892 [1:08:47<50:00, 8.00s/it]
{'loss': 5.2472, 'learning_rate': 0.0009617346938775511, 'epoch': 0.58}
[WARNING|modeling_utils.py:388] 2022-03-03 04:48:48,037 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
58%|█████████████████████████████████████████████▉ | 518/892 [1:08:54<48:13, 7.74s/it]
{'loss': 5.1146, 'learning_rate': 0.0009591836734693877, 'epoch': 0.58}
58%|█████████████████████████████████████████████▉ | 519/892 [1:09:01<46:49, 7.53s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:48:58,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
58%|██████████████████████████████████████████████ | 520/892 [1:09:08<45:12, 7.29s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:49:08,655 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
58%|██████████████████████████████████████████████▏ | 521/892 [1:09:15<44:05, 7.13s/it]
59%|██████████████████████████████████████████████▏ | 522/892 [1:09:21<43:02, 6.98s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:49:18,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.0594, 'learning_rate': 0.0009489795918367348, 'epoch': 0.59}
59%|██████████████████████████████████████████████▎ | 523/892 [1:09:28<42:01, 6.83s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:49:26,680 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
59%|██████████████████████████████████████████████▍ | 524/892 [1:09:34<41:13, 6.72s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:49:33,064 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
59%|██████████████████████████████████████████████▍ | 525/892 [1:09:41<41:23, 6.77s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:49:43,191 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:49:49,526 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.1616, 'learning_rate': 0.0009362244897959184, 'epoch': 0.59}
[WARNING|modeling_utils.py:388] 2022-03-03 04:49:55,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.1956, 'learning_rate': 0.0009336734693877551, 'epoch': 0.59}
59%|██████████████████████████████████████████████▊ | 529/892 [1:10:06<38:41, 6.40s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:50:03,594 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
59%|██████████████████████████████████████████████▉ | 530/892 [1:10:13<38:09, 6.33s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:50:09,807 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
60%|███████████████████████████████████████████████ | 531/892 [1:10:19<37:46, 6.28s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:50:15,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:50:20,412 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.8953, 'learning_rate': 0.000923469387755102, 'epoch': 0.6}
[WARNING|modeling_utils.py:388] 2022-03-03 04:50:26,393 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.9381, 'learning_rate': 0.0009209183673469387, 'epoch': 0.6}
[WARNING|modeling_utils.py:388] 2022-03-03 04:50:32,176 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.9541, 'learning_rate': 0.0009183673469387756, 'epoch': 0.6}
[WARNING|modeling_utils.py:388] 2022-03-03 04:50:36,524 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
60%|███████████████████████████████████████████████▍ | 535/892 [1:10:42<35:26, 5.96s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:50:40,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
60%|███████████████████████████████████████████████▍ | 536/892 [1:10:48<34:45, 5.86s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:50:45,050 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:50:49,153 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.3974, 'learning_rate': 0.0009107142857142857, 'epoch': 0.6}
[WARNING|modeling_utils.py:388] 2022-03-03 04:50:53,284 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
60%|███████████████████████████████████████████████▋ | 538/892 [1:10:59<33:29, 5.68s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:50:57,284 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
60%|███████████████████████████████████████████████▋ | 539/892 [1:11:04<32:30, 5.53s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:51:04,663 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:51:07,029 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:51:09,186 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:51:11,377 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:51:13,408 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:51:15,475 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:51:17,332 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:51:19,221 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:51:20,931 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:51:22,620 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:51:25,742 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:51:27,168 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:51:28,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:51:31,307 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:51:33,788 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:51:34,893 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:51:37,546 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.8482, 'learning_rate': 0.0008775510204081633, 'epoch': 0.62}
[WARNING|modeling_utils.py:388] 2022-03-03 04:51:41,462 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:51:45,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.2635, 'learning_rate': 0.000875, 'epoch': 0.62}
[WARNING|modeling_utils.py:388] 2022-03-03 04:51:48,934 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:51:52,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 04:51:59,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.2408, 'learning_rate': 0.0008698979591836736, 'epoch': 0.62}
62%|█████████████████████████████████████████████████ | 554/892 [1:12:12<35:12, 6.25s/it]
Could not estimate
the number of tokens of the input, floating-point operations will not be computed 62%|█████████████████████████████████████████████████ | 554/892 [1:12:12<35:12, 6.25s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:52:12,494 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|█████████████████████████████████████████████████▏ | 555/892 [1:12:19<36:36, 6.52s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|█████████████████████████████████████████████████▏ | 555/892 [1:12:19<36:36, 6.52s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.0966, 'learning_rate': 0.0008647959183673469, 'epoch': 0.62} 62%|█████████████████████████████████████████████████▏ | 555/892 [1:12:19<36:36, 6.52s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|█████████████████████████████████████████████████▏ | 555/892 [1:12:19<36:36, 6.52s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|█████████████████████████████████████████████████▏ | 555/892 [1:12:19<36:36, 6.52s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|█████████████████████████████████████████████████▏ | 556/892 
[1:12:26<37:35, 6.71s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|█████████████████████████████████████████████████▏ | 556/892 [1:12:26<37:35, 6.71s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:52:26,739 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|█████████████████████████████████████████████████▎ | 557/892 [1:12:33<38:04, 6.82s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|█████████████████████████████████████████████████▎ | 557/892 [1:12:33<38:04, 6.82s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9667, 'learning_rate': 0.0008596938775510205, 'epoch': 0.62} 62%|█████████████████████████████████████████████████▎ | 557/892 [1:12:33<38:04, 6.82s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|█████████████████████████████████████████████████▎ | 557/892 [1:12:33<38:04, 6.82s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|█████████████████████████████████████████████████▍ | 558/892 [1:12:40<38:24, 6.90s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 63%|█████████████████████████████████████████████████▍ | 558/892 [1:12:40<38:24, 6.90s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.2674, 'learning_rate': 0.0008571428571428571, 'epoch': 0.63} [WARNING|modeling_utils.py:388] 2022-03-03 04:52:40,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|█████████████████████████████████████████████████▌ | 559/892 [1:12:47<38:34, 6.95s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|█████████████████████████████████████████████████▌ | 559/892 [1:12:47<38:34, 6.95s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.0393, 'learning_rate': 0.0008545918367346938, 'epoch': 0.63} 63%|█████████████████████████████████████████████████▌ | 559/892 [1:12:47<38:34, 6.95s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|█████████████████████████████████████████████████▌ | 559/892 [1:12:47<38:34, 6.95s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|█████████████████████████████████████████████████▌ | 559/892 [1:12:47<38:34, 6.95s/it]g-point operations will not be computed-03 04:51:01,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
63%|█████████████████████████████████████████████████▌ | 560/892 [1:12:54<38:29, 6.96s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:52:51,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|█████████████████████████████████████████████████▌ | 560/892 [1:12:54<38:29, 6.96s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:52:51,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|█████████████████████████████████████████████████▌ | 560/892 [1:12:54<38:29, 6.96s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:52:51,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|█████████████████████████████████████████████████▋ | 561/892 [1:13:01<38:14, 6.93s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:52:51,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|█████████████████████████████████████████████████▋ | 561/892 [1:13:01<38:14, 6.93s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:52:51,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.1893, 'learning_rate': 0.0008494897959183674, 'epoch': 0.63} 63%|█████████████████████████████████████████████████▋ | 561/892 [1:13:01<38:14, 6.93s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:52:51,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:53:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:52:51,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:53:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not 
be computed-03 04:52:51,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.0583, 'learning_rate': 0.0008469387755102041, 'epoch': 0.63} [WARNING|modeling_utils.py:388] 2022-03-03 04:53:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:52:51,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:53:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:52:51,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:53:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:52:51,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|█████████████████████████████████████████████████▊ | 563/892 [1:13:15<37:41, 6.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|█████████████████████████████████████████████████▊ | 563/892 [1:13:15<37:41, 6.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|█████████████████████████████████████████████████▊ | 563/892 [1:13:15<37:41, 6.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|█████████████████████████████████████████████████▉ | 564/892 [1:13:21<37:20, 6.83s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:53:11,922 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 63%|█████████████████████████████████████████████████▉ | 564/892 [1:13:21<37:20, 6.83s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:53:20,330 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:53:20,330 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:53:20,330 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|██████████████████████████████████████████████████ | 565/892 [1:13:28<37:08, 6.81s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|██████████████████████████████████████████████████ | 565/892 [1:13:28<37:08, 6.81s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|██████████████████████████████████████████████████ | 565/892 [1:13:28<37:08, 6.81s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:53:30,448 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:53:30,448 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.06, 'learning_rate': 0.0008367346938775511, 'epoch': 0.63} [WARNING|modeling_utils.py:388] 2022-03-03 04:53:30,448 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:53:30,448 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:53:30,448 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|██████████████████████████████████████████████████▏ | 567/892 [1:13:42<36:40, 6.77s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:53:40,518 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:53:40,518 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:53:40,518 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|██████████████████████████████████████████████████▎ | 568/892 [1:13:48<36:23, 6.74s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:53:47,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:53:47,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|██████████████████████████████████████████████████▍ | 569/892 [1:13:55<35:59, 6.69s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|██████████████████████████████████████████████████▍ | 569/892 [1:13:55<35:59, 6.69s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9932, 'learning_rate': 0.0008290816326530613, 'epoch': 0.64} 64%|██████████████████████████████████████████████████▍ | 569/892 [1:13:55<35:59, 6.69s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:53:56,897 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:53:56,897 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.1723, 'learning_rate': 0.000826530612244898, 'epoch': 0.64} [WARNING|modeling_utils.py:388] 2022-03-03 04:53:56,897 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:54:03,452 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:54:03,452 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9847, 'learning_rate': 0.0008239795918367348, 'epoch': 0.64} [WARNING|modeling_utils.py:388] 2022-03-03 04:54:03,452 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:54:03,452 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:54:03,452 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|██████████████████████████████████████████████████▋ | 572/892 [1:14:14<35:06, 6.58s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:54:13,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:54:13,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:54:13,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|██████████████████████████████████████████████████▋ | 573/892 [1:14:21<34:46, 6.54s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|██████████████████████████████████████████████████▋ | 573/892 [1:14:21<34:46, 6.54s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number 
of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:54:21,254 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:54:21,254 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|██████████████████████████████████████████████████▊ | 574/892 [1:14:27<34:28, 6.51s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|██████████████████████████████████████████████████▊ | 574/892 [1:14:27<34:28, 6.51s/it]g-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:54:27,670 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:54:27,670 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:54:27,670 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.3417, 
'learning_rate': 0.0008137755102040817, 'epoch': 0.64} [WARNING|modeling_utils.py:388] 2022-03-03 04:54:27,670 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:54:27,670 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:54:27,670 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:53:11,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|███████████████████████████████████████████████████ | 576/892 [1:14:41<34:33, 6.56s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:37,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|███████████████████████████████████████████████████ | 576/892 [1:14:41<34:33, 6.56s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:37,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|███████████████████████████████████████████████████ | 576/892 [1:14:41<34:33, 6.56s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:37,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|███████████████████████████████████████████████████ | 576/892 [1:14:41<34:33, 6.56s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:37,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|███████████████████████████████████████████████████ | 577/892 [1:14:47<34:08, 
6.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:44,148 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|███████████████████████████████████████████████████ | 577/892 [1:14:47<34:08, 6.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:44,148 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|███████████████████████████████████████████████████ | 577/892 [1:14:47<34:08, 6.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:44,148 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|███████████████████████████████████████████████████ | 577/892 [1:14:47<34:08, 6.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:44,148 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|███████████████████████████████████████████████████▏ | 578/892 [1:14:53<33:37, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:50,389 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|███████████████████████████████████████████████████▏ | 578/892 [1:14:53<33:37, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:50,389 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|███████████████████████████████████████████████████▏ | 578/892 [1:14:53<33:37, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:50,389 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|███████████████████████████████████████████████████▏ | 578/892 [1:14:53<33:37, 6.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:50,389 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|███████████████████████████████████████████████████▎ | 579/892 
[1:14:59<33:13, 6.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:56,623 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|███████████████████████████████████████████████████▎ | 579/892 [1:14:59<33:13, 6.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:56,623 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|███████████████████████████████████████████████████▎ | 579/892 [1:14:59<33:13, 6.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:56,623 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|███████████████████████████████████████████████████▎ | 579/892 [1:14:59<33:13, 6.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:54:56,623 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|███████████████████████████████████████████████████▎ | 580/892 [1:15:06<32:50, 6.31s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:55:02,772 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|███████████████████████████████████████████████████▎ | 580/892 [1:15:06<32:50, 6.31s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:55:02,772 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|███████████████████████████████████████████████████▎ | 580/892 [1:15:06<32:50, 6.31s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:55:02,772 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|███████████████████████████████████████████████████▎ | 580/892 [1:15:06<32:50, 6.31s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:55:02,772 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
65%|███████████████████████████████████████████████████▍ | 581/892 [1:15:12<32:23, 6.25s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:55:08,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|███████████████████████████████████████████████████▍ | 581/892 [1:15:12<32:23, 6.25s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:55:08,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:55:13,296 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:08,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:55:13,296 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:08,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9729, 'learning_rate': 0.0007959183673469387, 'epoch': 0.65} [WARNING|modeling_utils.py:388] 2022-03-03 04:55:13,296 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:08,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:55:19,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:08,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:55:19,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:55:08,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.4895, 
'learning_rate': 0.0007933673469387756, 'epoch': 0.65} [WARNING|modeling_utils.py:388] 2022-03-03 04:55:25,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.0296, 'learning_rate': 0.0007908163265306123, 'epoch': 0.65} [WARNING|modeling_utils.py:388] 2022-03-03 04:55:29,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|███████████████████████████████████████████████████▊ | 585/892 [1:15:35<30:19, 5.93s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:55:33,671 >> Could not estimate the number of tokens of the input, floating-point
operations will not be computed 66%|███████████████████████████████████████████████████▉ | 586/892 [1:15:41<29:45, 5.83s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:55:37,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:55:41,978 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:55:46,070 >> Could not estimate the number of
tokens of the input, floating-point operations will not be computed 66%|████████████████████████████████████████████████████ | 588/892 [1:15:52<28:38, 5.65s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:55:50,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|████████████████████████████████████████████████████▏ | 589/892 [1:15:57<27:54, 5.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:55:54,008 >> Could not estimate the number of
tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:55:57,782 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:56:00,400 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|████████████████████████████████████████████████████▎ | 591/892 [1:16:07<26:31, 5.29s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:04,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:56:06,318 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
66%|████████████████████████████████████████████████████▍ | 592/892 [1:16:12<25:28, 5.09s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:08,612 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:56:10,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|████████████████████████████████████████████████████▌ | 593/892 [1:16:16<24:17, 4.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:12,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:56:14,949 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|████████████████████████████████████████████████████▌ | 594/892 [1:16:20<23:06,
4.65s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:16,929 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:56:18,711 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|████████████████████████████████████████████████████▋ | 595/892 [1:16:24<21:33, 4.36s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:20,532 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|████████████████████████████████████████████████████▊ | 596/892 [1:16:27<19:54, 4.04s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:23,719 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:56:25,160 >> Could not estimate the number of tokens of the
input, floating-point operations will not be computed 67%|████████████████████████████████████████████████████▊ | 597/892 [1:16:30<18:15, 3.72s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:26,574 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|████████████████████████████████████████████████████▉ | 598/892 [1:16:33<16:32, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:29,093 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|█████████████████████████████████████████████████████ | 599/892 [1:16:35<14:54, 3.05s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:31,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:56:32,362 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
67%|█████████████████████████████████████████████████████▏ | 600/892 [1:16:38<14:17, 2.94s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:35,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:56:39,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|█████████████████████████████████████████████████████▏ | 601/892 [1:16:45<21:06, 4.35s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:42,944 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388]
2022-03-03 04:56:46,572 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|█████████████████████████████████████████████████████▎ | 602/892 [1:16:53<25:20, 5.24s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:56:53,869 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|█████████████████████████████████████████████████████▍ | 603/892 [1:17:00<28:14, 5.86s/it] {'loss': 5.2736, 'learning_rate': 0.0007423469387755102, 'epoch': 0.68}
68%|█████████████████████████████████████████████████████▍ | 604/892 [1:17:07<30:09, 6.28s/it] {'loss': 5.1739, 'learning_rate': 0.0007397959183673469, 'epoch': 0.68} 68%|█████████████████████████████████████████████████████▌ | 605/892 [1:17:15<31:24, 6.57s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:57:12,041 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
68%|█████████████████████████████████████████████████████▋ | 606/892 [1:17:22<32:06, 6.73s/it] {'loss': 4.9385, 'learning_rate': 0.0007346938775510205, 'epoch': 0.68} [WARNING|modeling_utils.py:388] 2022-03-03 04:57:24,305 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.867, 'learning_rate': 0.0007321428571428571, 'epoch': 0.68}
68%|█████████████████████████████████████████████████████▊ | 608/892 [1:17:36<32:44, 6.92s/it] {'loss': 5.0564, 'learning_rate': 0.0007295918367346938, 'epoch': 0.68} [WARNING|modeling_utils.py:388] 2022-03-03 04:57:36,738 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|█████████████████████████████████████████████████████▉ | 609/892 [1:17:43<32:48, 6.95s/it] {'loss': 5.1008, 'learning_rate': 0.0007270408163265307, 'epoch': 0.68}
68%|██████████████████████████████████████████████████████ | 610/892 [1:17:50<32:32, 6.92s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:57:47,114 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████ | 611/892 [1:17:57<32:18, 6.90s/it] {'loss': 4.9015, 'learning_rate': 0.0007219387755102041, 'epoch': 0.68} [WARNING|modeling_utils.py:388] 2022-03-03 04:57:59,045 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.1184, 'learning_rate': 0.0007193877551020408, 'epoch': 0.69}
69%|██████████████████████████████████████████████████████▎ | 613/892 [1:18:10<32:02, 6.89s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:58:10,996 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|██████████████████████████████████████████████████████▍ | 614/892 [1:18:17<31:39, 6.83s/it] {'loss': 5.0992, 'learning_rate': 0.0007142857142857143, 'epoch': 0.69} [WARNING|modeling_utils.py:388] 2022-03-03 04:58:17,718 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|██████████████████████████████████████████████████████▍ | 615/892 [1:18:24<31:25, 6.81s/it] {'loss': 5.1966, 'learning_rate': 0.0007117346938775511, 'epoch': 0.69} [WARNING|modeling_utils.py:388] 2022-03-03 04:58:26,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9135, 'learning_rate': 0.0007091836734693877, 'epoch': 0.69}
69%|██████████████████████████████████████████████████████▋ | 617/892 [1:18:37<30:59, 6.76s/it] {'loss': 5.1225, 'learning_rate': 0.0007066326530612245, 'epoch': 0.69} [WARNING|modeling_utils.py:388] 2022-03-03 04:58:37,786 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|██████████████████████████████████████████████████████▋ | 618/892 [1:18:44<30:40, 6.72s/it] {'loss': 4.8598, 'learning_rate': 0.0007040816326530613, 'epoch': 0.69} [WARNING|modeling_utils.py:388] 2022-03-03 04:58:44,368 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|██████████████████████████████████████████████████████▊ | 619/892 [1:18:50<30:21, 6.67s/it] {'loss': 4.8324, 'learning_rate': 0.000701530612244898, 'epoch': 0.69} [WARNING|modeling_utils.py:388] 2022-03-03 04:58:52,555 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.01, 'learning_rate': 0.0006989795918367347, 'epoch': 0.7}
70%|██████████████████████████████████████████████████████▉ | 621/892 [1:19:04<29:56, 6.63s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:59:02,391 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|███████████████████████████████████████████████████████ | 622/892 [1:19:10<29:33, 6.57s/it]g-point operations will not be computed-03 04:58:07,674 >> Could not
estimate the number of tokens of the input, floating-point operations will not be computed 70%|███████████████████████████████████████████████████████ | 622/892 [1:19:10<29:33, 6.57s/it]g-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:59:08,907 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:59:08,907 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|███████████████████████████████████████████████████████▏ | 623/892 [1:19:17<29:22, 6.55s/it]g-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|███████████████████████████████████████████████████████▏ | 623/892 [1:19:17<29:22, 6.55s/it]g-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.05, 'learning_rate': 0.0006913265306122449, 'epoch': 0.7} 70%|███████████████████████████████████████████████████████▏ | 623/892 [1:19:17<29:22, 6.55s/it]g-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:59:18,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:59:18,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.0356, 'learning_rate': 0.0006887755102040817, 'epoch': 0.7} [WARNING|modeling_utils.py:388] 2022-03-03 04:59:18,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:59:18,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:58:07,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|███████████████████████████████████████████████████████▎ | 625/892 [1:19:30<29:36, 6.66s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:59:27,198 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|███████████████████████████████████████████████████████▎ | 625/892 [1:19:30<29:36, 6.66s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:59:27,198 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.3868, 'learning_rate': 0.0006862244897959184, 'epoch': 0.7} 70%|███████████████████████████████████████████████████████▎ | 625/892 [1:19:30<29:36, 6.66s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:59:27,198 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|███████████████████████████████████████████████████████▍ | 626/892 [1:19:36<29:03, 6.55s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:59:33,491 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 70%|███████████████████████████████████████████████████████▍ | 626/892 [1:19:36<29:03, 6.55s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.0528, 'learning_rate': 0.0006836734693877551, 'epoch': 0.7} 70%|███████████████████████████████████████████████████████▍ | 626/892 [1:19:36<29:03, 6.55s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|███████████████████████████████████████████████████████▌ | 627/892 [1:19:43<28:31, 6.46s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|███████████████████████████████████████████████████████▌ | 627/892 [1:19:43<28:31, 6.46s/it][WARNING|modeling_utils.py:388] 2022-03-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:59:41,238 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:59:41,238 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|███████████████████████████████████████████████████████▌ | 628/892 [1:19:49<28:05, 6.39s/it]g-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
70%|███████████████████████████████████████████████████████▌ | 628/892 [1:19:49<28:05, 6.39s/it]g-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:59:47,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:59:47,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|███████████████████████████████████████████████████████▋ | 629/892 [1:19:55<27:43, 6.33s/it]g-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|███████████████████████████████████████████████████████▋ | 629/892 [1:19:55<27:43, 6.33s/it]g-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:59:53,579 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:59:53,579 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|███████████████████████████████████████████████████████▊ | 630/892 [1:20:01<27:23, 6.27s/it]g-point 
operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|███████████████████████████████████████████████████████▊ | 630/892 [1:20:01<27:23, 6.27s/it]g-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:59:59,729 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 04:59:59,729 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|███████████████████████████████████████████████████████▉ | 631/892 [1:20:07<27:02, 6.22s/it]g-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|███████████████████████████████████████████████████████▉ | 631/892 [1:20:07<27:02, 6.22s/it]g-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:00:05,756 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:00:05,756 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 71%|███████████████████████████████████████████████████████▉ | 632/892 [1:20:13<26:42, 6.16s/it]g-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|███████████████████████████████████████████████████████▉ | 632/892 [1:20:13<26:42, 6.16s/it]g-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:00:11,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:00:11,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|████████████████████████████████████████████████████████ | 633/892 [1:20:19<26:23, 6.11s/it]g-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|████████████████████████████████████████████████████████ | 633/892 [1:20:19<26:23, 6.11s/it]g-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:00:17,736 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:00:17,736 >> Could not estimate the number 
of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:00:17,736 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 04:59:33,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|████████████████████████████████████████████████████████▏ | 634/892 [1:20:25<26:06, 6.07s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|████████████████████████████████████████████████████████▏ | 634/892 [1:20:25<26:06, 6.07s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:00:26,398 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:00:26,398 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.1228, 'learning_rate': 0.0006607142857142857, 'epoch': 0.71} [WARNING|modeling_utils.py:388] 2022-03-03 05:00:26,398 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:00:26,398 >> Could not estimate the number of tokens of 
the input, floating-point operations will not be computed-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:00:32,147 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:00:32,147 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:00:36,376 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:00:36,376 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|████████████████████████████████████████████████████████▍ | 637/892 [1:20:42<24:39, 5.80s/it]g-point operations will not be computed-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:00:40,548 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:00:40,548 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 
05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:00:40,548 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:22,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|████████████████████████████████████████████████████████▌ | 638/892 [1:20:48<24:11, 5.71s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:00:44,686 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:00:47,300 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:44,686 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:00:47,300 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:44,686 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|████████████████████████████████████████████████████████▌ | 639/892 [1:20:53<23:38, 5.61s/it]g-point operations will not be computed-03 05:00:44,686 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:00:51,277 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:44,686 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:00:51,277 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:44,686 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:00:51,277 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:44,686 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|████████████████████████████████████████████████████████▋ | 640/892 [1:20:58<23:00, 5.48s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|████████████████████████████████████████████████████████▋ | 640/892 [1:20:58<23:00, 5.48s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|████████████████████████████████████████████████████████▋ | 640/892 [1:20:58<23:00, 5.48s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:00:58,664 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:01,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:03,344 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:03,344 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:05,552 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:07,562 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:07,562 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:09,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:11,519 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:11,519 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:13,532 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:15,292 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:15,292 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:17,085 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:17,085 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:18,672 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:21,708 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:21,708 >> Could not estimate the number of tokens of the input, floating-point operations will 
not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:23,166 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:23,166 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:25,672 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:25,672 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:27,857 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:29,335 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:29,335 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8832, 'learning_rate': 0.0006224489795918367, 'epoch': 0.73} [WARNING|modeling_utils.py:388] 2022-03-03 05:01:33,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:33,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:37,091 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:37,091 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.3171, 'learning_rate': 0.0006198979591836736, 'epoch': 0.73} [WARNING|modeling_utils.py:388] 2022-03-03 05:01:40,828 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:40,828 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:00:55,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:01:40,828 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed
73%|█████████████████████████████████████████████████████████▋ | 652/892 [1:21:49<21:03, 5.26s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:01:48,004 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
73%|█████████████████████████████████████████████████████████▊ | 653/892 [1:21:56<23:16, 5.84s/it]
73%|█████████████████████████████████████████████████████████▉ | 654/892 [1:22:03<24:47, 6.25s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:02:04,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
73%|██████████████████████████████████████████████████████████ | 655/892 [1:22:10<25:48, 6.53s/it]
{'loss': 4.9983, 'learning_rate': 0.0006096938775510205, 'epoch': 0.73}
74%|██████████████████████████████████████████████████████████ | 656/892 [1:22:17<26:16, 6.68s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:02:18,403 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
74%|██████████████████████████████████████████████████████████▏ | 657/892 [1:22:25<26:40, 6.81s/it]
{'loss': 4.9124, 'learning_rate': 0.0006045918367346938, 'epoch': 0.74}
74%|██████████████████████████████████████████████████████████▎ | 658/892 [1:22:32<26:47, 6.87s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:02:29,012 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
74%|██████████████████████████████████████████████████████████▎ | 659/892 [1:22:39<26:51, 6.92s/it]
{'loss': 4.9621, 'learning_rate': 0.0005994897959183674, 'epoch': 0.74}
74%|██████████████████████████████████████████████████████████▍ | 660/892 [1:22:46<26:43, 6.91s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:02:42,876 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
74%|██████████████████████████████████████████████████████████▌ | 661/892 [1:22:52<26:31, 6.89s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:02:51,361 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
74%|██████████████████████████████████████████████████████████▋ | 662/892 [1:22:59<26:24, 6.89s/it]
{'loss': 4.7113, 'learning_rate': 0.0005918367346938776, 'epoch': 0.74}
74%|██████████████████████████████████████████████████████████▋ | 663/892 [1:23:06<26:13, 6.87s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:03:03,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
74%|██████████████████████████████████████████████████████████▊ | 664/892 [1:23:13<25:56, 6.83s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:03:13,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
75%|██████████████████████████████████████████████████████████▉ | 665/892 [1:23:20<25:44, 6.80s/it]
{'loss': 4.8416, 'learning_rate': 0.0005841836734693877, 'epoch': 0.75}
[WARNING|modeling_utils.py:388] 2022-03-03 05:03:21,812 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.1135, 'learning_rate': 0.0005816326530612245, 'epoch': 0.75}
[WARNING|modeling_utils.py:388] 2022-03-03 05:03:28,503 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.9375, 'learning_rate': 0.0005790816326530613, 'epoch': 0.75}
75%|███████████████████████████████████████████████████████████▏ | 668/892 [1:23:40<25:06, 6.72s/it]
{'loss': 4.909, 'learning_rate': 0.000576530612244898, 'epoch': 0.75}
[WARNING|modeling_utils.py:388] 2022-03-03 05:03:40,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
75%|███████████████████████████████████████████████████████████▎ | 669/892 [1:23:46<24:52, 6.69s/it]
{'loss': 5.1056, 'learning_rate': 0.0005739795918367347, 'epoch': 0.75}
[WARNING|modeling_utils.py:388] 2022-03-03 05:03:48,351 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.0018, 'learning_rate': 0.0005714285714285714, 'epoch': 0.75}
[WARNING|modeling_utils.py:388] 2022-03-03 05:03:54,874 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.7696, 'learning_rate': 0.0005688775510204082, 'epoch': 0.75}
75%|███████████████████████████████████████████████████████████▌ | 672/892 [1:24:06<24:04, 6.57s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:04:04,659 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
75%|███████████████████████████████████████████████████████████▌ | 673/892 [1:24:12<23:57, 6.56s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:04:11,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
76%|███████████████████████████████████████████████████████████▋ | 674/892 [1:24:19<23:42, 6.53s/it]
{'loss': 4.8107, 'learning_rate': 0.0005612244897959184, 'epoch': 0.76}
[WARNING|modeling_utils.py:388] 2022-03-03 05:04:19,175 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.9513, 'learning_rate': 0.0005586734693877551, 'epoch': 0.76}
76%|███████████████████████████████████████████████████████████▊ | 676/892 [1:24:32<23:38, 6.57s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:04:29,270 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
76%|███████████████████████████████████████████████████████████▉ | 677/892 [1:24:38<23:15, 6.49s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:04:35,572 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
76%|████████████████████████████████████████████████████████████ | 678/892 [1:24:45<22:51, 6.41s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:04:43,342 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
76%|████████████████████████████████████████████████████████████▏ | 679/892 [1:24:51<22:34, 6.36s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:04:48,029 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
76%|████████████████████████████████████████████████████████████▏ | 680/892 [1:24:57<22:14, 6.29s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:04:54,139 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
76%|████████████████████████████████████████████████████████████▎ | 681/892 [1:25:03<21:55, 6.23s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:05:00,232 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.1294, 'learning_rate': 0.0005433673469387756, 'epoch': 0.76}
76%|████████████████████████████████████████████████████████████▍ | 682/892 [1:25:09<21:34, 6.17s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:05:06,228 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
77%|████████████████████████████████████████████████████████████▍ | 683/892 [1:25:15<21:18, 6.12s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:05:12,207 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:05:16,482 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.217, 'learning_rate': 0.0005357142857142857, 'epoch': 0.77}
[WARNING|modeling_utils.py:388] 2022-03-03 05:05:22,165 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.9345, 'learning_rate': 0.0005331632653061225, 'epoch': 0.77}
[WARNING|modeling_utils.py:388] 2022-03-03 05:05:26,447 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
77%|████████████████████████████████████████████████████████████▊ | 686/892 [1:25:32<20:03, 5.84s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:05:30,662 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
77%|████████████████████████████████████████████████████████████▊ | 687/892 [1:25:38<19:40, 5.76s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:05:34,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.9354, 'learning_rate': 0.0005280612244897959, 'epoch': 0.77}
[WARNING|modeling_utils.py:388] 2022-03-03 05:05:38,842 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.9864, 'learning_rate': 0.0005255102040816326, 'epoch': 0.77}
[WARNING|modeling_utils.py:388] 2022-03-03 05:05:42,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
77%|█████████████████████████████████████████████████████████████ | 689/892 [1:25:49<18:49, 5.56s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:05:46,777 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
77%|█████████████████████████████████████████████████████████████ | 690/892 [1:25:54<18:14, 5.42s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:05:52,978 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
77%|█████████████████████████████████████████████████████████████▏ | 691/892 [1:25:59<17:40, 5.28s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:05:56,585 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:05:58,818 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:06:01,068 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:06:03,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:06:05,124 >> Could not estimate the number of tokens of the input, floating-point operations
will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:06:05,124 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:06:06,951 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:06:08,846 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:06:08,846 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:06:12,275 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:06:13,813 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:06:13,813 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:05:50,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-03 05:06:15,391 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:06:18,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:06:19,412 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:06:21,790 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:06:22,950 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:06:24,443 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:06:28,447 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:06:32,090 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.7946, 'learning_rate': 0.0004923469387755102, 'epoch': 0.79}
[WARNING|modeling_utils.py:388] 2022-03-03 05:06:35,844 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:06:39,488 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
79%|██████████████████████████████████████████████████████████████▎ | 703/892 [1:26:51<18:31, 5.88s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:06:48,669 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
79%|██████████████████████████████████████████████████████████████▎ | 704/892 [1:26:58<19:36, 6.26s/it]
{'loss': 4.9163, 'learning_rate': 0.0004846938775510204, 'epoch': 0.79}
79%|██████████████████████████████████████████████████████████████▍ | 705/892 [1:27:06<20:21, 6.53s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:07:04,727 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
79%|██████████████████████████████████████████████████████████████▌ | 706/892 [1:27:13<20:44, 6.69s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:07:15,228 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.9996, 'learning_rate': 0.00047704081632653065, 'epoch': 0.79}
79%|██████████████████████████████████████████████████████████████▋ | 708/892 [1:27:27<21:05, 6.88s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:07:29,341 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.9985, 'learning_rate': 0.0004719387755102041, 'epoch': 0.79}
80%|██████████████████████████████████████████████████████████████▉ | 710/892 [1:27:41<21:07, 6.97s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:07:43,222 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.1134, 'learning_rate': 0.00046683673469387755, 'epoch': 0.8}
80%|███████████████████████████████████████████████████████████████ | 712/892 [1:27:55<20:46, 6.93s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:07:51,954 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
80%|███████████████████████████████████████████████████████████████▏ | 713/892 [1:28:02<20:38, 6.92s/it]
80%|███████████████████████████████████████████████████████████████▏ | 714/892 [1:28:08<20:32, 6.92s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:08:05,732 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
80%|███████████████████████████████████████████████████████████████▎ | 715/892 [1:28:15<20:14, 6.86s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:08:14,072 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
80%|███████████████████████████████████████████████████████████████▍ | 716/892 [1:28:22<19:57, 6.81s/it]
{'loss': 4.8231, 'learning_rate': 0.00045408163265306124, 'epoch': 0.8}
[WARNING|modeling_utils.py:388] 2022-03-03 05:08:22,455 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
80%|███████████████████████████████████████████████████████████████▌ | 717/892 [1:28:29<19:47, 6.79s/it]
{'loss': 5.09, 'learning_rate': 0.00045153061224489796, 'epoch': 0.8}
80%|███████████████████████████████████████████████████████████████▌ | 718/892 [1:28:35<19:38, 6.77s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:08:34,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
81%|███████████████████████████████████████████████████████████████▋ | 719/892 [1:28:42<19:23, 6.73s/it]g-point operations will not be computed-03 05:08:05,732 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:08:40,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:08:05,732 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:08:40,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:08:05,732 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▊ | 720/892 [1:28:49<19:10, 6.69s/it]g-point operations will not be computed-03 05:08:05,732 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▊ | 720/892 [1:28:49<19:10, 6.69s/it]g-point operations will not be computed-03 05:08:05,732 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.841, 'learning_rate': 0.00044387755102040814, 'epoch': 0.81} [WARNING|modeling_utils.py:388] 2022-03-03 05:08:49,035 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:08:05,732 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▊ | 721/892 [1:28:55<18:56, 6.65s/it]g-point operations will not be computed-03 05:08:05,732 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
{'loss': 4.7433, 'learning_rate': 0.0004413265306122449, 'epoch': 0.81}
[WARNING|modeling_utils.py:388] 2022-03-03 05:08:57,162 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.7101, 'learning_rate': 0.00043877551020408165, 'epoch': 0.81}
[WARNING|modeling_utils.py:388] 2022-03-03 05:09:03,705 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.8642, 'learning_rate': 0.00043622448979591837, 'epoch': 0.81}
81%|████████████████████████████████████████████████████████████████ | 724/892 [1:29:15<18:16, 6.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:09:11,775 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
81%|████████████████████████████████████████████████████████████████▏ | 725/892 [1:29:22<18:33, 6.67s/it]
{'loss': 4.8054, 'learning_rate': 0.0004311224489795919, 'epoch': 0.81}
[WARNING|modeling_utils.py:388] 2022-03-03 05:09:21,908 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
81%|████████████████████████████████████████████████████████████████▎ | 726/892 [1:29:28<18:11, 6.57s/it]
{'loss': 5.0236, 'learning_rate': 0.00042857142857142855, 'epoch': 0.81}
[WARNING|modeling_utils.py:388] 2022-03-03 05:09:29,764 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.8533, 'learning_rate': 0.00042602040816326533, 'epoch': 0.82}
[WARNING|modeling_utils.py:388] 2022-03-03 05:09:34,443 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
82%|████████████████████████████████████████████████████████████████▍ | 728/892 [1:29:40<17:33, 6.42s/it]
{'loss': 4.6959, 'learning_rate': 0.00042346938775510206, 'epoch': 0.82}
[WARNING|modeling_utils.py:388] 2022-03-03 05:09:42,143 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.6739, 'learning_rate': 0.0004209183673469388, 'epoch': 0.82}
[WARNING|modeling_utils.py:388] 2022-03-03 05:09:48,314 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.0312, 'learning_rate': 0.00041836734693877556, 'epoch': 0.82}
[WARNING|modeling_utils.py:388] 2022-03-03 05:09:54,478 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:10:00,474 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:10:04,959 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
82%|████████████████████████████████████████████████████████████████▉ | 733/892 [1:30:11<16:11, 6.11s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:10:09,365 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
82%|█████████████████████████████████████████████████████████████████ | 734/892 [1:30:17<15:47, 6.00s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:10:13,673 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:10:17,894 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.6994, 'learning_rate': 0.0004056122448979592, 'epoch': 0.82}
[WARNING|modeling_utils.py:388] 2022-03-03 05:10:23,455 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.8075, 'learning_rate': 0.0004030612244897959, 'epoch': 0.83}
[WARNING|modeling_utils.py:388] 2022-03-03 05:10:27,571 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
83%|█████████████████████████████████████████████████████████████████▎ | 737/892 [1:30:33<14:42, 5.69s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:10:30,303 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.8765, 'learning_rate': 0.00040051020408163264, 'epoch': 0.83}
[WARNING|modeling_utils.py:388] 2022-03-03 05:10:34,166 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.843, 'learning_rate': 0.00039795918367346937, 'epoch': 0.83}
[WARNING|modeling_utils.py:388] 2022-03-03 05:10:38,040 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
83%|█████████████████████████████████████████████████████████████████▍ | 739/892 [1:30:44<13:51, 5.43s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:10:41,893 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
83%|█████████████████████████████████████████████████████████████████▌ | 740/892 [1:30:49<13:27, 5.31s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:10:45,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:10:47,964 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
83%|█████████████████████████████████████████████████████████████████▋ | 741/892 [1:30:54<12:58, 5.15s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:10:51,463 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:10:53,689 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:10:55,971 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:10:58,072 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:11:00,209 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:11:02,145 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:11:04,125 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:11:05,926 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:11:07,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:11:10,949 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:11:12,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:11:14,978 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:11:16,220 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:11:18,508 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:11:20,037 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.9339, 'learning_rate': 0.00036734693877551024, 'epoch': 0.84}
[WARNING|modeling_utils.py:388] 2022-03-03 05:11:24,014 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:11:27,686 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.9065, 'learning_rate': 0.0003647959183673469, 'epoch': 0.84}
[WARNING|modeling_utils.py:388] 2022-03-03 05:11:31,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:11:35,123 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.8276, 'learning_rate': 0.0003622448979591837, 'epoch': 0.84}
84%|██████████████████████████████████████████████████████████████████▋ | 753/892 [1:31:47<13:39, 5.90s/it]
{'loss': 4.8596, 'learning_rate': 0.0003596938775510204, 'epoch': 0.84}
[WARNING|modeling_utils.py:388] 2022-03-03 05:11:49,716 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.8511, 'learning_rate': 0.00035714285714285714, 'epoch': 0.85}
85%|██████████████████████████████████████████████████████████████████▊ | 755/892 [1:32:01<15:03, 6.60s/it]
{'loss': 4.8633, 'learning_rate': 0.00035459183673469387, 'epoch': 0.85}
[WARNING|modeling_utils.py:388] 2022-03-03 05:12:04,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:12:07,691 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
85%|███████████████████████████████████████████████████████████████████ | 757/892 [1:32:16<15:24, 6.85s/it]
{'loss': 5.0629, 'learning_rate': 0.0003494897959183674, 'epoch': 0.85}
[WARNING|modeling_utils.py:388] 2022-03-03 05:12:18,217 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.8626, 'learning_rate': 0.0003469387755102041, 'epoch': 0.85}
85%|███████████████████████████████████████████████████████████████████▏ | 759/892 [1:32:30<15:27, 6.97s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:12:32,237 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
85%|███████████████████████████████████████████████████████████████████▍ | 761/892 [1:32:44<15:06, 6.92s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:12:40,820 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed 85%|███████████████████████████████████████████████████████████████████▍ | 761/892 [1:32:44<15:06, 6.92s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 85%|███████████████████████████████████████████████████████████████████▍ | 761/892 [1:32:44<15:06, 6.92s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 85%|███████████████████████████████████████████████████████████████████▍ | 762/892 [1:32:50<14:55, 6.89s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 85%|███████████████████████████████████████████████████████████████████▍ | 762/892 [1:32:50<14:55, 6.89s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:12:51,046 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:12:51,046 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|███████████████████████████████████████████████████████████████████▌ | 763/892 [1:32:57<14:46, 6.87s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
86%|███████████████████████████████████████████████████████████████████▌ | 763/892 [1:32:57<14:46, 6.87s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|███████████████████████████████████████████████████████████████████▌ | 763/892 [1:32:57<14:46, 6.87s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|███████████████████████████████████████████████████████████████████▌ | 763/892 [1:32:57<14:46, 6.87s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|███████████████████████████████████████████████████████████████████▌ | 763/892 [1:32:57<14:46, 6.87s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|███████████████████████████████████████████████████████████████████▋ | 764/892 [1:33:04<14:37, 6.86s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:13:03,135 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:13:03,135 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:13:03,135 >> Could not estimate the number of tokens 
of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|███████████████████████████████████████████████████████████████████▊ | 765/892 [1:33:11<14:32, 6.87s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|███████████████████████████████████████████████████████████████████▊ | 765/892 [1:33:11<14:32, 6.87s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:13:11,522 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|███████████████████████████████████████████████████████████████████▊ | 766/892 [1:33:18<14:20, 6.83s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|███████████████████████████████████████████████████████████████████▊ | 766/892 [1:33:18<14:20, 6.83s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.0025, 'learning_rate': 0.00032653061224489796, 'epoch': 0.86} 86%|███████████████████████████████████████████████████████████████████▊ | 766/892 [1:33:18<14:20, 6.83s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:13:19,855 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:13:19,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6712, 'learning_rate': 0.0003239795918367347, 'epoch': 0.86} [WARNING|modeling_utils.py:388] 2022-03-03 05:13:19,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:13:19,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:13:19,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|████████████████████████████████████████████████████████████████████ | 768/892 [1:33:31<13:57, 6.76s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|████████████████████████████████████████████████████████████████████ | 768/892 [1:33:31<13:57, 6.76s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:13:31,662 >> Could not estimate the number of tokens of 
the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:13:31,662 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|████████████████████████████████████████████████████████████████████ | 769/892 [1:33:38<13:52, 6.77s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|████████████████████████████████████████████████████████████████████ | 769/892 [1:33:38<13:52, 6.77s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|████████████████████████████████████████████████████████████████████ | 769/892 [1:33:38<13:52, 6.77s/it]g-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:13:40,029 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:13:40,029 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7581, 'learning_rate': 0.0003163265306122449, 'epoch': 0.86} [WARNING|modeling_utils.py:388] 2022-03-03 05:13:40,029 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:13:40,029 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:13:40,029 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:12:40,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|████████████████████████████████████████████████████████████████████▎ | 771/892 [1:33:51<13:32, 6.71s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|████████████████████████████████████████████████████████████████████▎ | 771/892 [1:33:51<13:32, 6.71s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|████████████████████████████████████████████████████████████████████▎ | 771/892 [1:33:51<13:32, 6.71s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|████████████████████████████████████████████████████████████████████▎ | 771/892 [1:33:51<13:32, 6.71s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|████████████████████████████████████████████████████████████████████▎ | 772/892 [1:33:58<13:23, 6.70s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:13:48,407 >> Could not estimate 
the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:13:56,712 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:13:56,712 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|████████████████████████████████████████████████████████████████████▍ | 773/892 [1:34:04<13:13, 6.67s/it]g-point operations will not be computed-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|████████████████████████████████████████████████████████████████████▍ | 773/892 [1:34:04<13:13, 6.67s/it]g-point operations will not be computed-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9918, 'learning_rate': 0.0003086734693877551, 'epoch': 0.87} 87%|████████████████████████████████████████████████████████████████████▍ | 773/892 [1:34:04<13:13, 6.67s/it]g-point operations will not be computed-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:14:06,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:14:06,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:13:48,407 >> Could not estimate 
the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5347, 'learning_rate': 0.0003061224489795919, 'epoch': 0.87} [WARNING|modeling_utils.py:388] 2022-03-03 05:14:06,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:14:06,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:14:06,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:13:48,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|████████████████████████████████████████████████████████████████████▋ | 775/892 [1:34:18<13:05, 6.72s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:14:15,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|████████████████████████████████████████████████████████████████████▋ | 775/892 [1:34:18<13:05, 6.72s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:14:15,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|████████████████████████████████████████████████████████████████████▋ | 775/892 [1:34:18<13:05, 6.72s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:14:15,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|████████████████████████████████████████████████████████████████████▋ | 775/892 [1:34:18<13:05, 6.72s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:14:15,136 >> Could not estimate the number 
of tokens of the input, floating-point operations will not be computed 87%|████████████████████████████████████████████████████████████████████▋ | 776/892 [1:34:24<12:47, 6.62s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|████████████████████████████████████████████████████████████████████▋ | 776/892 [1:34:24<12:47, 6.62s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|████████████████████████████████████████████████████████████████████▋ | 776/892 [1:34:24<12:47, 6.62s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|████████████████████████████████████████████████████████████████████▋ | 776/892 [1:34:24<12:47, 6.62s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|████████████████████████████████████████████████████████████████████▊ | 777/892 [1:34:31<12:30, 6.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:14:29,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:14:29,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-03 05:14:29,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|████████████████████████████████████████████████████████████████████▉ | 778/892 [1:34:37<12:17, 6.47s/it]g-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:14:35,631 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:14:35,631 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:14:35,631 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|████████████████████████████████████████████████████████████████████▉ | 779/892 [1:34:43<12:03, 6.41s/it]g-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:14:41,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:14:41,885 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:14:41,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|█████████████████████████████████████████████████████████████████████ | 780/892 [1:34:49<11:51, 6.35s/it]g-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:14:48,067 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:14:48,067 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 88%|█████████████████████████████████████████████████████████████████████▏ | 781/892 [1:34:56<11:38, 6.29s/it]g-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 88%|█████████████████████████████████████████████████████████████████████▏ | 781/892 [1:34:56<11:38, 6.29s/it]g-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:14:54,187 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:14:54,187 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 88%|█████████████████████████████████████████████████████████████████████▎ | 782/892 [1:35:02<11:29, 6.27s/it]g-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 88%|█████████████████████████████████████████████████████████████████████▎ | 782/892 [1:35:02<11:29, 6.27s/it]g-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:15:00,380 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:15:00,380 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 88%|█████████████████████████████████████████████████████████████████████▎ | 783/892 [1:35:08<11:17, 6.22s/it]g-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 88%|█████████████████████████████████████████████████████████████████████▎ | 783/892 [1:35:08<11:17, 6.22s/it]g-point operations will not be computed-03 05:14:21,415 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed {'loss': 4.8777, 'learning_rate': 0.00028316326530612246, 'epoch': 0.88} [WARNING|modeling_utils.py:388] 2022-03-03 05:15:07,906 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 88%|█████████████████████████████████████████████████████████████████████▍ | 784/892 [1:35:14<11:02, 6.14s/it]g-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 88%|█████████████████████████████████████████████████████████████████████▍ | 784/892 [1:35:14<11:02, 6.14s/it]g-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:15:12,314 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:15:12,314 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:15:12,314 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:14:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 88%|█████████████████████████████████████████████████████████████████████▌ | 785/892 [1:35:20<10:45, 6.03s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:15:16,673 >> Could not estimate 
the number of tokens of the input, floating-point operations will not be computed 88%|█████████████████████████████████████████████████████████████████████▌ | 785/892 [1:35:20<10:45, 6.03s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:15:16,673 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:15:20,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:16,673 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:15:20,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:16,673 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6596, 'learning_rate': 0.00027551020408163264, 'epoch': 0.88} [WARNING|modeling_utils.py:388] 2022-03-03 05:15:20,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:16,673 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:15:26,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:16,673 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:15:26,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:16,673 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8652, 'learning_rate': 0.00027295918367346936, 'epoch': 0.88} [WARNING|modeling_utils.py:388] 2022-03-03 05:15:30,554 >> Could not estimate the number of tokens of 
the input, floating-point operations will not be computed-03 05:15:16,673 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 88%|█████████████████████████████████████████████████████████████████████▊ | 788/892 [1:35:36<09:54, 5.71s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:15:33,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 88%|█████████████████████████████████████████████████████████████████████▊ | 788/892 [1:35:36<09:54, 5.71s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:15:33,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8039, 'learning_rate': 0.00027040816326530614, 'epoch': 0.88} [WARNING|modeling_utils.py:388] 2022-03-03 05:15:37,133 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:33,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:15:37,133 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:33,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7242, 'learning_rate': 0.00026785714285714287, 'epoch': 0.88} [WARNING|modeling_utils.py:388] 2022-03-03 05:15:41,085 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:33,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 89%|█████████████████████████████████████████████████████████████████████▉ | 790/892 [1:35:47<09:16, 5.46s/it]g-point operations will not be computed-03 05:15:33,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
89%|█████████████████████████████████████████████████████████████████████▉ | 790/892 [1:35:47<09:16, 5.46s/it]g-point operations will not be computed-03 05:15:33,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:15:44,906 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:33,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:15:47,312 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:33,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:15:47,312 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:33,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:15:49,749 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:33,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:15:49,749 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:15:33,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 89%|██████████████████████████████████████████████████████████████████████▏ | 792/892 [1:35:57<08:34, 5.15s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:15:53,280 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-03 05:15:55,460 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
89%|██████████████████████████████████████████████████████████████████████▏ | 793/892 [1:36:01<08:08, 4.94s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:15:57,639 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:15:59,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
89%|██████████████████████████████████████████████████████████████████████▎ | 794/892 [1:36:05<07:39, 4.69s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:16:01,706 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:16:03,515 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
89%|██████████████████████████████████████████████████████████████████████▍ | 795/892 [1:36:09<07:08, 4.41s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:16:05,405 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:16:07,053 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
89%|██████████████████████████████████████████████████████████████████████▍ | 796/892 [1:36:12<06:35, 4.12s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:16:08,705 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
89%|██████████████████████████████████████████████████████████████████████▌ | 797/892 [1:36:15<06:00, 3.79s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:16:11,649 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
89%|██████████████████████████████████████████████████████████████████████▋ | 798/892 [1:36:18<05:26, 3.47s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:16:14,299 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:16:15,450 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:16:16,666 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:16:17,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
90%|██████████████████████████████████████████████████████████████████████▊ | 800/892 [1:36:23<04:37, 3.02s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:16:20,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:16:24,659 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
90%|██████████████████████████████████████████████████████████████████████▉ | 801/892 [1:36:31<06:45, 4.46s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:16:28,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:16:32,046 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
90%|███████████████████████████████████████████████████████████████████████ | 802/892 [1:36:38<07:58, 5.32s/it]
{'loss': 4.6433, 'learning_rate': 0.00023469387755102041, 'epoch': 0.9}
90%|███████████████████████████████████████████████████████████████████████ | 803/892 [1:36:46<08:45, 5.90s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:16:46,564 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
90%|███████████████████████████████████████████████████████████████████████▏ | 804/892 [1:36:53<09:13, 6.29s/it]
90%|███████████████████████████████████████████████████████████████████████▎ | 805/892 [1:37:00<09:32, 6.58s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:17:00,889 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
90%|███████████████████████████████████████████████████████████████████████▍ | 806/892 [1:37:07<09:39, 6.74s/it]
{'loss': 4.9597, 'learning_rate': 0.00022448979591836734, 'epoch': 0.9}
90%|███████████████████████████████████████████████████████████████████████▍ | 807/892 [1:37:14<09:41, 6.85s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:17:16,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.6865, 'learning_rate': 0.00021938775510204082, 'epoch': 0.91}
91%|███████████████████████████████████████████████████████████████████████▋ | 809/892 [1:37:28<09:35, 6.93s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:17:27,300 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
91%|███████████████████████████████████████████████████████████████████████▋ | 810/892 [1:37:35<09:27, 6.92s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:17:37,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.8973, 'learning_rate': 0.00021173469387755103, 'epoch': 0.91}
91%|███████████████████████████████████████████████████████████████████████▉ | 812/892 [1:37:49<09:11, 6.90s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:17:47,994 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
91%|████████████████████████████████████████████████████████████████████████ | 813/892 [1:37:56<09:04, 6.89s/it]
{'loss': 4.5545, 'learning_rate': 0.0002066326530612245, 'epoch': 0.91}
[WARNING|modeling_utils.py:388] 2022-03-03 05:17:58,222 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.7104, 'learning_rate': 0.00020408163265306123, 'epoch': 0.91}
91%|████████████████████████████████████████████████████████████████████████▏ | 815/892 [1:38:09<08:46, 6.84s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:18:08,343 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
91%|████████████████████████████████████████████████████████████████████████▎ | 816/892 [1:38:16<08:36, 6.79s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:18:16,635 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
92%|████████████████████████████████████████████████████████████████████████▎ | 817/892 [1:38:23<08:25, 6.74s/it]
{'loss': 4.762, 'learning_rate': 0.00019642857142857144, 'epoch': 0.92}
[WARNING|modeling_utils.py:388] 2022-03-03 05:18:24,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.6711, 'learning_rate': 0.00019387755102040816, 'epoch': 0.92}
[WARNING|modeling_utils.py:388] 2022-03-03 05:18:31,463 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.7525, 'learning_rate': 0.0001913265306122449, 'epoch': 0.92}
92%|████████████████████████████████████████████████████████████████████████▌ | 820/892 [1:38:42<07:57, 6.63s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:18:43,024 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
92%|████████████████████████████████████████████████████████████████████████▋ | 821/892 [1:38:49<07:50, 6.63s/it]
{'loss': 4.628, 'learning_rate': 0.0001862244897959184, 'epoch': 0.92}
[WARNING|modeling_utils.py:388] 2022-03-03 05:18:49,507 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
92%|████████████████████████████████████████████████████████████████████████▊ | 822/892 [1:38:56<07:41, 6.60s/it]
{'loss': 4.4963, 'learning_rate': 0.00018367346938775512, 'epoch': 0.92}
[WARNING|modeling_utils.py:388] 2022-03-03 05:18:57,628 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.6785, 'learning_rate': 0.00018112244897959185, 'epoch': 0.92}
[WARNING|modeling_utils.py:388] 2022-03-03 05:19:03,965 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.6588, 'learning_rate': 0.00017857142857142857, 'epoch': 0.92}
92%|█████████████████████████████████████████████████████████████████████████ | 825/892 [1:39:15<07:21, 6.59s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:19:12,516 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.0121, 'learning_rate': 0.00017602040816326532, 'epoch': 0.92}
93%|█████████████████████████████████████████████████████████████████████████▏ | 826/892 [1:39:22<07:10, 6.52s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:19:18,751 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
93%|█████████████████████████████████████████████████████████████████████████▏ | 827/892 [1:39:28<06:57, 6.42s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:19:24,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
93%|█████████████████████████████████████████████████████████████████████████▎ | 828/892 [1:39:34<06:45, 6.34s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:19:31,080 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
93%|█████████████████████████████████████████████████████████████████████████▍ | 829/892 [1:39:40<06:34, 6.26s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:19:37,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
93%|█████████████████████████████████████████████████████████████████████████▌ | 830/892 [1:39:46<06:24, 6.20s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:19:43,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:19:47,665 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.4503, 'learning_rate': 0.00016071428571428573, 'epoch': 0.93}
[WARNING|modeling_utils.py:388] 2022-03-03 05:19:53,680 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.5233, 'learning_rate': 0.00015816326530612246, 'epoch': 0.93}
[WARNING|modeling_utils.py:388] 2022-03-03 05:19:59,733 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.7851, 'learning_rate': 0.00015561224489795918, 'epoch': 0.93}
[WARNING|modeling_utils.py:388] 2022-03-03 05:20:05,542 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.5912, 'learning_rate': 0.00015306122448979594, 'epoch': 0.93}
[WARNING|modeling_utils.py:388] 2022-03-03 05:20:09,795 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
94%|█████████████████████████████████████████████████████████████████████████▉ | 835/892 [1:40:16<05:36, 5.90s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:20:15,471 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
94%|██████████████████████████████████████████████████████████████████████████ | 836/892 [1:40:21<05:26, 5.82s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:20:19,656 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:20:22,281 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.5167, 'learning_rate': 0.00014540816326530611, 'epoch': 0.94}
[WARNING|modeling_utils.py:388] 2022-03-03 05:20:26,276 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
94%|██████████████████████████████████████████████████████████████████████████▏ | 838/892 [1:40:32<05:00, 5.57s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:20:30,147 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:20:32,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.5436, 'learning_rate': 0.0001403061224489796, 'epoch': 0.94}
[WARNING|modeling_utils.py:388] 2022-03-03 05:20:36,243 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
94%|██████████████████████████████████████████████████████████████████████████▍ | 840/892 [1:40:42<04:32, 5.23s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:20:38,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:20:41,936 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:20:44,137 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:20:46,193 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:20:48,249 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:20:50,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:20:52,111 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:20:53,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:20:57,154 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:20:58,733 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:21:00,168 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:21:02,912 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:21:04,280 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:21:06,663 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:21:08,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:21:10,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.7704, 'learning_rate': 0.00011224489795918367, 'epoch': 0.95}
[WARNING|modeling_utils.py:388] 2022-03-03 05:21:14,447 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:21:18,131 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:21:21,886 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:21:25,488 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.9124, 'learning_rate': 0.00010714285714285714, 'epoch': 0.96}
[WARNING|modeling_utils.py:388] 2022-03-03 05:21:29,193 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
96%|███████████████████████████████████████████████████████████████████████████▌ | 853/892 [1:41:37<03:47, 5.84s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:21:39,955 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.6014, 'learning_rate': 0.00010204081632653062, 'epoch': 0.96}
96%|███████████████████████████████████████████████████████████████████████████▋ | 855/892 [1:41:52<04:01, 6.51s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:21:52,490 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
96%|███████████████████████████████████████████████████████████████████████████▊ | 856/892 [1:41:59<04:00, 6.69s/it]
{'loss': 4.5992, 'learning_rate': 9.693877551020408e-05, 'epoch': 0.96}
96%|███████████████████████████████████████████████████████████████████████████▉ | 857/892 [1:42:06<03:58, 6.81s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:22:04,869 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
96%|███████████████████████████████████████████████████████████████████████████▉ | 858/892 [1:42:13<03:53, 6.86s/it]
{'loss': 4.6767, 'learning_rate': 9.183673469387756e-05, 'epoch': 0.96}
[WARNING|modeling_utils.py:388] 2022-03-03 05:22:15,246 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.4785, 'learning_rate': 8.928571428571429e-05, 'epoch': 0.96}
96%|████████████████████████████████████████████████████████████████████████████▏ | 860/892 [1:42:27<03:39, 6.87s/it]
{'loss': 4.6034, 'learning_rate': 8.673469387755102e-05, 'epoch': 0.96}
[WARNING|modeling_utils.py:388] 2022-03-03 05:22:27,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
97%|████████████████████████████████████████████████████████████████████████████▎ | 861/892 [1:42:33<03:32, 6.85s/it]
{'loss': 4.926, 'learning_rate': 8.418367346938775e-05, 'epoch': 0.97}
[WARNING|modeling_utils.py:388] 2022-03-03 05:22:35,638 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.6059, 'learning_rate': 8.163265306122449e-05, 'epoch': 0.97}
97%|████████████████████████████████████████████████████████████████████████████▍ | 863/892 [1:42:47<03:17, 6.81s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.6713, 'learning_rate': 7.908163265306123e-05, 'epoch': 0.97}
97%|████████████████████████████████████████████████████████████████████████████▌ | 864/892 [1:42:54<03:10, 6.79s/it]
{'loss': 4.9771, 'learning_rate': 7.653061224489797e-05, 'epoch': 0.97}
[WARNING|modeling_utils.py:388] 2022-03-03 05:22:54,218 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
97%|████████████████████████████████████████████████████████████████████████████▌ | 865/892 [1:43:00<03:02, 6.76s/it]
{'loss': 4.7271, 'learning_rate': 7.39795918367347e-05, 'epoch': 0.97}
[WARNING|modeling_utils.py:388] 2022-03-03 05:23:02,543 >> Could not estimate the number of tokens of the input, floating-point
operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:23:02,543 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7206, 'learning_rate': 7.142857142857142e-05, 'epoch': 0.97} [WARNING|modeling_utils.py:388] 2022-03-03 05:23:02,543 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:23:02,543 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:23:02,543 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 97%|████████████████████████████████████████████████████████████████████████████▊ | 867/892 [1:43:14<02:47, 6.68s/it]g-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:23:12,420 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:23:12,420 >> Could not estimate the number of tokens 
of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:23:12,420 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 97%|████████████████████████████████████████████████████████████████████████████▊ | 868/892 [1:43:20<02:39, 6.64s/it]g-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 97%|████████████████████████████████████████████████████████████████████████████▊ | 868/892 [1:43:20<02:39, 6.64s/it]g-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:23:20,605 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:23:20,605 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 97%|████████████████████████████████████████████████████████████████████████████▉ | 869/892 [1:43:27<02:32, 6.62s/it]g-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 97%|████████████████████████████████████████████████████████████████████████████▉ | 869/892 [1:43:27<02:32, 6.62s/it]g-point operations will not be 
computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:23:27,063 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:23:27,063 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 98%|█████████████████████████████████████████████████████████████████████████████ | 870/892 [1:43:33<02:24, 6.57s/it]g-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 98%|█████████████████████████████████████████████████████████████████████████████ | 870/892 [1:43:33<02:24, 6.57s/it]g-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 98%|█████████████████████████████████████████████████████████████████████████████ | 870/892 [1:43:33<02:24, 6.57s/it]g-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:23:35,050 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:23:35,050 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate 
the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5866, 'learning_rate': 5.8673469387755104e-05, 'epoch': 0.98} [WARNING|modeling_utils.py:388] 2022-03-03 05:23:35,050 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:23:41,400 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:23:41,400 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6059, 'learning_rate': 5.6122448979591836e-05, 'epoch': 0.98} [WARNING|modeling_utils.py:388] 2022-03-03 05:23:41,400 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:23:47,770 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:23:47,770 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6245, 'learning_rate': 5.357142857142857e-05, 'epoch': 0.98} 
[WARNING|modeling_utils.py:388] 2022-03-03 05:23:47,770 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:23:47,770 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 05:23:47,770 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 05:22:44,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 98%|█████████████████████████████████████████████████████████████████████████████▍ | 874/892 [1:43:58<01:54, 6.38s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:23:55,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 98%|█████████████████████████████████████████████████████████████████████████████▍ | 874/892 [1:43:58<01:54, 6.38s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:23:55,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 98%|█████████████████████████████████████████████████████████████████████████████▍ | 874/892 [1:43:58<01:54, 6.38s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:23:55,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 98%|█████████████████████████████████████████████████████████████████████████████▍ | 874/892 [1:43:58<01:54, 6.38s/it][WARNING|modeling_utils.py:388] 2022-03-03 05:23:55,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
98%|█████████████████████████████████████████████████████████████████████████████▍ | 875/892 [1:44:06<01:51, 6.56s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:24:05,688 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
98%|█████████████████████████████████████████████████████████████████████████████▌ | 876/892 [1:44:12<01:43, 6.45s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:24:11,744 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
98%|█████████████████████████████████████████████████████████████████████████████▋ | 877/892 [1:44:18<01:34, 6.32s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:24:16,239 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
98%|█████████████████████████████████████████████████████████████████████████████▊ | 878/892 [1:44:24<01:26, 6.21s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:24:22,131 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
99%|█████████████████████████████████████████████████████████████████████████████▊ | 879/892 [1:44:29<01:19, 6.08s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:24:26,456 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:24:30,581 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.9238, 'learning_rate': 3.571428571428571e-05, 'epoch': 0.99}
[WARNING|modeling_utils.py:388] 2022-03-03 05:24:34,829 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
99%|██████████████████████████████████████████████████████████████████████████████ | 881/892 [1:44:41<01:04, 5.84s/it]
{'loss': 4.4336, 'learning_rate': 3.316326530612245e-05, 'epoch': 0.99}
[WARNING|modeling_utils.py:388] 2022-03-03 05:24:40,272 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
99%|██████████████████████████████████████████████████████████████████████████████ | 882/892 [1:44:46<00:57, 5.71s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:24:42,942 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:24:46,779 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.5835, 'learning_rate': 2.8061224489795918e-05, 'epoch': 0.99}
[WARNING|modeling_utils.py:388] 2022-03-03 05:24:50,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
99%|██████████████████████████████████████████████████████████████████████████████▎| 884/892 [1:44:56<00:42, 5.35s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:24:52,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:24:55,082 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
99%|██████████████████████████████████████████████████████████████████████████████▍| 885/892 [1:45:01<00:35, 5.10s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:24:57,308 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:24:59,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
99%|██████████████████████████████████████████████████████████████████████████████▍| 886/892 [1:45:05<00:29, 4.83s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:25:01,460 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:25:03,354 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
99%|██████████████████████████████████████████████████████████████████████████████▌| 887/892 [1:45:09<00:22, 4.55s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:25:05,256 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:25:06,977 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
100%|██████████████████████████████████████████████████████████████████████████████▋| 888/892 [1:45:12<00:16, 4.24s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:25:08,651 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
100%|██████████████████████████████████████████████████████████████████████████████▋| 889/892 [1:45:15<00:11, 3.93s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:25:11,799 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-03 05:25:13,172 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
100%|██████████████████████████████████████████████████████████████████████████████▊| 890/892 [1:45:18<00:07, 3.60s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:25:14,508 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
100%|██████████████████████████████████████████████████████████████████████████████▉| 891/892 [1:45:21<00:03, 3.24s/it]
[WARNING|modeling_utils.py:388] 2022-03-03 05:25:16,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2487, 'learning_rate': 5.102040816326531e-06, 'epoch': 1.0}
100%|███████████████████████████████████████████████████████████████████████████████| 892/892 [1:45:23<00:00, 2.87s/it]
[INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >>
[INFO|trainer.py:2114] 2022-03-03 05:25:18,320 >> Saving model checkpoint to ./
[INFO|trainer.py:2114] 2022-03-03 05:25:34,749 >> Saving model checkpoint to ./
[INFO|modeling_utils.py:1081] 2022-03-03 05:25:51,178 >> Model weights saved in ./pytorch_model.bin
Upload file pytorch_model.bin: 0%| | 64.0k/2.99G [00:01<27:14:47, 32.7kB/s]
Upload file pytorch_model.bin: 1%|▌ | 34.6M/2.99G [00:03<03:43, 14.2MB/s]
Upload file pytorch_model.bin: 2%|█ | 72.8M/2.99G [00:05<02:55, 17.9MB/s]
Upload file pytorch_model.bin: 4%|█▊ | 113M/2.99G [00:07<02:38, 19.5MB/s]
Upload file pytorch_model.bin: 5%|██▍ | 152M/2.99G [00:09<02:32, 20.1MB/s]
Upload file pytorch_model.bin: 6%|███ | 192M/2.99G [00:11<02:25, 20.7MB/s]
Upload file pytorch_model.bin: 8%|███▋ | 234M/2.99G [00:13<02:19, 21.2MB/s]
Upload file pytorch_model.bin: 9%|████▎ | 275M/2.99G [00:15<02:16, 21.4MB/s]
Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s]
Upload file
wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file 
wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file 
wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file 
wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file 
wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file 
wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file 
wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed bd3cc24..a0891d8 main -> main3953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed bd3cc24..a0891d8 main -> 
main3953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:21<00:00, 18.1MB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 03/03/2022 05:30:11 - WARNING - huggingface_hub.repository - To https://huggingface.co/sanchit-gandhi/wav2vec2-gpt2-wandb-grid-search Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|████████████| 53.0M/53.0M [02:39<00:00, 192kB/s][INFO|trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|modelcard.py:460] 2022-03-03 05:30:14,945 >> Dropping the following result as it does not have all the necessary fields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 0%| | 32.0k/53.0M [00:00> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed a0891d8..ca0bee7 main -> main3953-1eigbhyo/run-1eigbhyo.wandb: 0%| | 32.0k/53.0M [00:00> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed a0891d8..ca0bee7 main -> main3953-1eigbhyo/run-1eigbhyo.wandb: 0%| | 32.0k/53.0M [00:00> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 03/03/2022 05:30:21 - WARNING - huggingface_hub.repository - To https://huggingface.co/sanchit-gandhi/wav2vec2-gpt2-wandb-grid-search Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed Upload file 
wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed ***** train metrics ***** epoch = 1.0 train_loss = 4.9449 train_runtime = 1:45:24.93 train_samples = 28538 train_samples_per_second = 4.512 train_steps_per_second = 0.141 [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 
operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> 
Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M 
[00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 
2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> 
Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M 
[00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 
2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-03 05:25:18,318 >> 6,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[INFO|trainer.py:2369] 2022-03-03 05:30:24,628 >> Batch size = 8
03/03/2022 05:49:02 - INFO - datasets.metric - Removing /home/sanchit_huggingface_co/.cache/huggingface/metrics/wer/default/default_experiment-1-0.arrow
***** eval metrics *****
epoch = 1.0
eval_loss = 4.8277
eval_runtime = 0:18:38.06
eval_samples = 2642
eval_samples_per_second = 2.363
eval_steps_per_second = 0.296
[INFO|trainer.py:2114] 2022-03-03 05:49:02,692 >> Saving model checkpoint to ./
[INFO|modeling_utils.py:1081] 2022-03-03 05:49:19,093 >> Model weights saved in ./pytorch_model.bin
03/03/2022 05:49:50 - WARNING - huggingface_hub.repository - To https://huggingface.co/sanchit-gandhi/wav2vec2-gpt2-wandb-grid-search
Upload file wandb/run-20220303_033953-1eigbhyo/run-1eigbhyo.wandb: 100%|███████████| 53.2M/53.2M [00:03<00:00, 18.5MB/s]
return ModelInfo(**d)f.finetuned_from)formers/src/transformers/modelcard.py", line 611, in from_trainercard31, in mainule>