0%| | 0/1784 [00:00> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9039, 'learning_rate': 0.0, 'epoch': 0.0} [WARNING|modeling_utils.py:388] 2022-03-01 01:31:17,839 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 0%| | 1/1784 [00:04<2:04:19, 4.18s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:31:19,720 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.0473, 'learning_rate': 0.0, 'epoch': 0.0} [WARNING|modeling_utils.py:388] 2022-03-01 01:31:21,548 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 0%| | 2/1784 [00:07<1:55:56, 3.90s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:31:23,403 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.1588, 'learning_rate': 6.000000000000001e-08, 'epoch': 0.0} [WARNING|modeling_utils.py:388] 2022-03-01 01:31:25,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 0%|▏ | 3/1784 [00:11<1:55:51, 3.90s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:31:27,296 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7, 'learning_rate': 1.2000000000000002e-07, 'epoch': 0.0} [WARNING|modeling_utils.py:388] 2022-03-01 01:31:29,064 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 0%|▏ | 4/1784 [00:15<1:53:03, 3.81s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:31:30,961 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7898, 'learning_rate': 1.8e-07, 'epoch': 0.0} [WARNING|modeling_utils.py:388] 2022-03-01 01:31:32,728 >> Could not estimate the number of tokens of the input, floating-point operations 
will not be computed 0%|▏ | 5/1784 [00:19<1:51:24, 3.76s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:31:34,590 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8486, 'learning_rate': 2.4000000000000003e-07, 'epoch': 0.0} [WARNING|modeling_utils.py:388] 2022-03-01 01:31:36,363 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 0%|▎ | 6/1784 [00:22<1:50:07, 3.72s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:31:38,217 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:31:39,974 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7039, 'learning_rate': 3.0000000000000004e-07, 'epoch': 0.0} 0%|▎ | 7/1784 [00:26<1:49:02, 3.68s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:31:41,817 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:31:43,581 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 0%|▎ | 8/1784 [00:29<1:48:17, 3.66s/it] 0%|▎ | 8/1784 [00:29<1:48:17, 3.66s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:31:45,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:31:47,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▍ | 9/1784 [00:33<1:47:39, 3.64s/it] 1%|▍ | 9/1784 [00:33<1:47:39, 3.64s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:31:48,966 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7626, 'learning_rate': 4.800000000000001e-07, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-01 
01:31:50,682 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▍ | 10/1784 [00:37<1:46:20, 3.60s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:31:52,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8163, 'learning_rate': 5.4e-07, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-01 01:31:54,142 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▍ | 11/1784 [00:40<1:45:02, 3.55s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:31:55,904 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7764, 'learning_rate': 6.000000000000001e-07, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-01 01:31:57,564 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▌ | 12/1784 [00:43<1:43:47, 3.51s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:31:59,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9868, 'learning_rate': 6.599999999999999e-07, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-01 01:32:01,033 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▌ | 13/1784 [00:47<1:43:19, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:32:02,812 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:32:04,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.601, 'learning_rate': 7.2e-07, 'epoch': 0.01} 1%|▌ | 14/1784 [00:50<1:42:28, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:32:06,210 >> Could not estimate the number of tokens of the input, floating-point operations will 
not be computed {'loss': 4.7462, 'learning_rate': 7.799999999999999e-07, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-01 01:32:07,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 15/1784 [00:54<1:41:49, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:32:09,574 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8539, 'learning_rate': 8.4e-07, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-01 01:32:11,231 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 16/1784 [00:57<1:41:06, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:32:12,968 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:32:14,572 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▊ | 17/1784 [01:00<1:40:15, 3.40s/it] 1%|▊ | 17/1784 [01:00<1:40:15, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:32:16,278 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7755, 'learning_rate': 9.600000000000001e-07, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-01 01:32:17,880 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▊ | 18/1784 [01:04<1:39:21, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:32:19,575 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7324, 'learning_rate': 1.0200000000000002e-06, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-01 01:32:21,170 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▊ | 19/1784 [01:07<1:38:32, 
3.35s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:32:22,876 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:32:24,469 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▉ | 20/1784 [01:10<1:38:01, 3.33s/it] 1%|▉ | 20/1784 [01:10<1:38:01, 3.33s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:32:26,178 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7546, 'learning_rate': 1.14e-06, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-01 01:32:27,753 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▉ | 21/1784 [01:14<1:37:31, 3.32s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:32:29,457 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.0524, 'learning_rate': 1.2000000000000002e-06, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-01 01:32:31,022 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▉ | 22/1784 [01:17<1:37:01, 3.30s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:32:32,671 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9026, 'learning_rate': 1.26e-06, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-01 01:32:34,255 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|█ | 23/1784 [01:20<1:36:20, 3.28s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:32:35,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6494, 'learning_rate': 1.3199999999999999e-06, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-01 01:32:37,444 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed 1%|█ | 24/1784 [01:23<1:35:27, 3.25s/it] 1%|█ | 24/1784 [01:23<1:35:27, 3.25s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:32:39,108 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:32:40,635 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|█ | 25/1784 [01:27<1:34:51, 3.24s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:32:42,287 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9052, 'learning_rate': 1.44e-06, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-01 01:32:43,826 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|█▏ | 26/1784 [01:30<1:34:24, 3.22s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:32:45,455 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7384, 'learning_rate': 1.5e-06, 'epoch': 0.02} [WARNING|modeling_utils.py:388] 2022-03-01 01:32:46,975 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▏ | 27/1784 [01:33<1:33:43, 3.20s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:32:48,620 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4794, 'learning_rate': 1.5599999999999999e-06, 'epoch': 0.02} [WARNING|modeling_utils.py:388] 2022-03-01 01:32:50,142 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▏ | 28/1784 [01:36<1:33:21, 3.19s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:32:51,734 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 
01:32:53,282 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▎ | 29/1784 [01:39<1:32:53, 3.18s/it] 2%|█▎ | 29/1784 [01:39<1:32:53, 3.18s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:32:54,903 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6417, 'learning_rate': 1.68e-06, 'epoch': 0.02} [WARNING|modeling_utils.py:388] 2022-03-01 01:32:56,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▎ | 30/1784 [01:42<1:32:22, 3.16s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:32:57,983 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:32:59,483 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▎ | 31/1784 [01:45<1:31:35, 3.13s/it] 2%|█▎ | 31/1784 [01:45<1:31:35, 3.13s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:33:01,097 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8925, 'learning_rate': 1.74e-06, 'epoch': 0.02} [WARNING|modeling_utils.py:388] 2022-03-01 01:33:02,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▍ | 32/1784 [01:48<1:30:27, 3.10s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:33:04,047 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:33:05,470 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▍ | 33/1784 [01:51<1:29:19, 3.06s/it] 2%|█▍ | 33/1784 [01:51<1:29:19, 3.06s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:33:06,995 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed [WARNING|modeling_utils.py:388] 2022-03-01 01:33:08,360 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5056, 'learning_rate': 1.86e-06, 'epoch': 0.02} 2%|█▌ | 34/1784 [01:54<1:27:47, 3.01s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:33:09,873 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7537, 'learning_rate': 1.9200000000000003e-06, 'epoch': 0.02} [WARNING|modeling_utils.py:388] 2022-03-01 01:33:11,234 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▌ | 35/1784 [01:57<1:26:32, 2.97s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:33:12,694 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:33:14,071 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9873, 'learning_rate': 1.98e-06, 'epoch': 0.02} 2%|█▌ | 36/1784 [02:00<1:25:21, 2.93s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:33:15,528 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6271, 'learning_rate': 2.0400000000000004e-06, 'epoch': 0.02} [WARNING|modeling_utils.py:388] 2022-03-01 01:33:16,850 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▋ | 37/1784 [02:03<1:23:59, 2.88s/it] 2%|█▋ | 37/1784 [02:03<1:23:59, 2.88s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:33:18,296 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:33:19,618 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▋ | 38/1784 [02:06<1:22:55, 2.85s/it][WARNING|modeling_utils.py:388] 
2022-03-01 01:33:20,990 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:33:22,255 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8251, 'learning_rate': 2.16e-06, 'epoch': 0.02} 2%|█▋ | 39/1784 [02:08<1:21:01, 2.79s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:33:23,596 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.816, 'learning_rate': 2.22e-06, 'epoch': 0.02} [WARNING|modeling_utils.py:388] 2022-03-01 01:33:24,828 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 40/1784 [02:11<1:19:06, 2.72s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:33:26,142 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6273, 'learning_rate': 2.28e-06, 'epoch': 0.02} [WARNING|modeling_utils.py:388] 2022-03-01 01:33:27,293 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 41/1784 [02:13<1:16:49, 2.64s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:33:28,530 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:33:29,642 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 42/1784 [02:16<1:14:12, 2.56s/it] 2%|█▊ | 42/1784 [02:16<1:14:12, 2.56s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:33:30,835 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:33:31,880 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▉ | 43/1784 [02:18<1:11:24, 2.46s/it] 2%|█▉ | 
43/1784 [02:18<1:11:24, 2.46s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:33:32,974 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:33:33,952 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▉ | 44/1784 [02:20<1:07:58, 2.34s/it] 2%|█▉ | 44/1784 [02:20<1:07:58, 2.34s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:33:34,975 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:33:35,867 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|█▉ | 45/1784 [02:22<1:04:12, 2.22s/it] {'loss': 4.8068, 'learning_rate': 2.52e-06, 'epoch': 0.03} 3%|█▉ | 45/1784 [02:22<1:04:12, 2.22s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:33:36,790 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:33:37,557 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██ | 46/1784 [02:23<59:36, 2.06s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:33:38,381 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9131, 'learning_rate': 2.6399999999999997e-06, 'epoch': 0.03} [WARNING|modeling_utils.py:388] 2022-03-01 01:33:39,080 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▏ | 47/1784 [02:25<54:55, 1.90s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:33:39,832 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:33:40,449 >> Could not estimate the number of tokens of the input, floating-point operations will not 
be computed
3%|██▏ | 48/1784 [02:26<50:19, 1.74s/it]
{'loss': 5.2795, 'learning_rate': 2.7e-06, 'epoch': 0.03}
[WARNING|modeling_utils.py:388] 2022-03-01 01:33:41,102 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 01:33:41,636 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
3%|██▏ | 49/1784 [02:28<45:29, 1.57s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:33:42,256 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.3139, 'learning_rate': 2.82e-06, 'epoch': 0.03}
[WARNING|modeling_utils.py:388] 2022-03-01 01:33:43,290 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
3%|██▎ | 50/1784 [02:29<46:10, 1.60s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:33:45,360 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
3%|██▎ | 51/1784 [02:33<1:06:11, 2.29s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:33:49,111 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
3%|██▎ | 52/1784 [02:37<1:18:20, 2.71s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:33:52,801 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 01:33:54,584 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
3%|██▎ | 53/1784 [02:40<1:26:41, 3.00s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:33:56,431 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
3%|██▍ | 54/1784 [02:44<1:31:50, 3.19s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:34:00,025 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
3%|██▍ | 55/1784 [02:48<1:35:17, 3.31s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:34:03,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
3%|██▍ | 56/1784 [02:51<1:37:16, 3.38s/it]
3%|██▌ | 57/1784 [02:55<1:38:51, 3.43s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:34:07,157 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 01:34:10,671 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
3%|██▌ | 58/1784 [02:58<1:39:06, 3.45s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:34:14,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
3%|██▌ | 59/1784 [03:02<1:39:09, 3.45s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:34:17,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
3%|██▋ | 60/1784 [03:05<1:39:32, 3.46s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:34:21,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
3%|██▋ | 61/1784 [03:09<1:39:20, 3.46s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:34:24,538 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
3%|██▋ | 62/1784 [03:12<1:38:53, 3.45s/it]
4%|██▊ | 63/1784 [03:15<1:38:23, 3.43s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:34:27,931 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 01:34:31,311 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|██▊ | 64/1784 [03:19<1:37:51, 3.41s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:34:34,683 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|██▉ | 65/1784 [03:22<1:37:32, 3.40s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:34:38,063 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|██▉ | 66/1784 [03:26<1:37:11, 3.39s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:34:41,434 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|██▉ | 67/1784 [03:29<1:36:47, 3.38s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:34:44,757 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███ | 68/1784 [03:32<1:35:45, 3.35s/it]
4%|███ | 69/1784 [03:36<1:35:09, 3.33s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:34:48,028 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 01:34:51,291 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███ | 70/1784 [03:39<1:34:40, 3.31s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:34:54,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███▏ | 71/1784 [03:42<1:33:56, 3.29s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:34:57,782 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███▏ | 72/1784 [03:45<1:33:29, 3.28s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:35:01,054 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███▏ | 73/1784 [03:49<1:33:10, 3.27s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:35:04,278 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███▎ | 74/1784 [03:52<1:32:43, 3.25s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:35:07,488 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███▎ | 75/1784 [03:55<1:32:28, 3.25s/it]
4%|███▎ | 76/1784 [03:58<1:31:30, 3.21s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:35:10,693 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 01:35:13,871 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███▍ | 77/1784 [04:01<1:30:55, 3.20s/it]
4%|███▍ | 78/1784 [04:04<1:30:13, 3.17s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:35:16,955 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 01:35:20,095 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
4%|███▍ | 79/1784 [04:07<1:29:35, 3.15s/it]
4%|███▌ | 80/1784 [04:11<1:28:58, 3.13s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:35:23,183 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 01:35:26,279 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
5%|███▌ | 81/1784 [04:14<1:28:18, 3.11s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:35:29,275 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 01:35:29,275 >> Could not
estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▋ | 82/1784 [04:17<1:27:14, 3.08s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:32,272 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▋ | 83/1784 [04:20<1:26:25, 3.05s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:32,272 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▋ | 83/1784 [04:20<1:26:25, 3.05s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:32,272 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▋ | 84/1784 [04:22<1:24:47, 2.99s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:35,179 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▋ | 84/1784 [04:22<1:24:47, 2.99s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:35,179 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▋ | 84/1784 [04:22<1:24:47, 2.99s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:38,033 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▊ | 85/1784 [04:25<1:23:12, 2.94s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:38,033 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▊ | 85/1784 [04:25<1:23:12, 2.94s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:38,033 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▊ | 86/1784 [04:28<1:22:06, 2.90s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:40,841 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▊ | 86/1784 [04:28<1:22:06, 2.90s/it][WARNING|modeling_utils.py:388] 
2022-03-01 01:35:40,841 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▊ | 87/1784 [04:31<1:20:40, 2.85s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:43,638 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▊ | 87/1784 [04:31<1:20:40, 2.85s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:43,638 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▊ | 87/1784 [04:31<1:20:40, 2.85s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:46,297 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▉ | 88/1784 [04:33<1:18:33, 2.78s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:48,903 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▉ | 88/1784 [04:33<1:18:33, 2.78s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:48,903 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▉ | 89/1784 [04:36<1:16:49, 2.72s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:48,903 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▉ | 89/1784 [04:36<1:16:49, 2.72s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:48,903 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▉ | 90/1784 [04:39<1:15:22, 2.67s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:51,461 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▉ | 90/1784 [04:39<1:15:22, 2.67s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:51,461 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████ | 91/1784 [04:41<1:12:36, 
2.57s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:53,919 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████ | 91/1784 [04:41<1:12:36, 2.57s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:53,919 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████ | 91/1784 [04:41<1:12:36, 2.57s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:56,210 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████ | 91/1784 [04:41<1:12:36, 2.57s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:56,210 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████ | 93/1784 [04:45<1:06:33, 2.36s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:58,384 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████ | 93/1784 [04:45<1:06:33, 2.36s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:35:58,384 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▏ | 94/1784 [04:47<1:02:22, 2.21s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:00,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▏ | 94/1784 [04:47<1:02:22, 2.21s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:00,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▎ | 95/1784 [04:49<57:58, 2.06s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:02,174 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▎ | 95/1784 [04:49<57:58, 2.06s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:02,174 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
5%|████▎ | 96/1784 [04:50<53:46, 1.91s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:05,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▎ | 96/1784 [04:50<53:46, 1.91s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:05,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.287, 'learning_rate': 5.58e-06, 'epoch': 0.05} 5%|████▍ | 98/1784 [04:53<45:32, 1.62s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:07,860 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▍ | 98/1784 [04:53<45:32, 1.62s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:07,860 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 99/1784 [04:54<41:52, 1.49s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:09,016 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 99/1784 [04:54<41:52, 1.49s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:09,016 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 100/1784 [04:56<42:44, 1.52s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:09,016 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 100/1784 [04:56<42:44, 1.52s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:09,016 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 101/1784 [05:00<1:01:47, 2.20s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:12,009 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 101/1784 [05:00<1:01:47, 2.20s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:12,009 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 102/1784 [05:03<1:13:50, 2.63s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:15,671 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 102/1784 [05:03<1:13:50, 2.63s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:15,671 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 102/1784 [05:03<1:13:50, 2.63s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:19,267 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 103/1784 [05:07<1:21:55, 2.92s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:19,267 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 103/1784 [05:07<1:21:55, 2.92s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:19,267 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 103/1784 [05:07<1:21:55, 2.92s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:22,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 104/1784 [05:11<1:27:48, 3.14s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:22,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 104/1784 [05:11<1:27:48, 3.14s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:22,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 104/1784 [05:11<1:27:48, 3.14s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:26,489 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 104/1784 [05:11<1:27:48, 
3.14s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:26,489 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 105/1784 [05:14<1:31:04, 3.25s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:29,995 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 106/1784 [05:18<1:33:21, 3.34s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:29,995 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 106/1784 [05:18<1:33:21, 3.34s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:29,995 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 106/1784 [05:18<1:33:21, 3.34s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:33,543 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 107/1784 [05:21<1:34:39, 3.39s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:33,543 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 107/1784 [05:21<1:34:39, 3.39s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:33,543 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 107/1784 [05:21<1:34:39, 3.39s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:37,043 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 108/1784 [05:25<1:35:28, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:37,043 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 108/1784 [05:25<1:35:28, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:37,043 >> Could not estimate the number of tokens of the input, floating-point operations will 
not be computed 6%|████▊ | 109/1784 [05:28<1:35:48, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:40,460 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 109/1784 [05:28<1:35:48, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:40,460 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 109/1784 [05:28<1:35:48, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:44,013 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 110/1784 [05:32<1:36:28, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:44,013 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 110/1784 [05:32<1:36:28, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:44,013 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 110/1784 [05:32<1:36:28, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:47,480 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 111/1784 [05:35<1:36:17, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:47,480 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 111/1784 [05:35<1:36:17, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:47,480 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 111/1784 [05:35<1:36:17, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:50,885 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 111/1784 [05:35<1:36:17, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:50,885 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed 6%|████▉ | 112/1784 [05:38<1:35:40, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:54,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 113/1784 [05:42<1:35:19, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:54,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 113/1784 [05:42<1:35:19, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:54,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 113/1784 [05:42<1:35:19, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:57,690 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 114/1784 [05:45<1:35:10, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:57,690 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 114/1784 [05:45<1:35:10, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:36:57,690 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|█████ | 115/1784 [05:49<1:34:22, 3.39s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:01,039 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|█████ | 115/1784 [05:49<1:34:22, 3.39s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:01,039 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|█████ | 115/1784 [05:49<1:34:22, 3.39s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:04,385 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████ | 116/1784 [05:52<1:33:44, 3.37s/it][WARNING|modeling_utils.py:388] 
2022-03-01 01:37:04,385 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████ | 116/1784 [05:52<1:33:44, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:04,385 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████ | 116/1784 [05:52<1:33:44, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:07,702 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████ | 117/1784 [05:55<1:33:19, 3.36s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:10,974 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████ | 117/1784 [05:55<1:33:19, 3.36s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:10,974 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▏ | 118/1784 [05:58<1:32:17, 3.32s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:10,974 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▏ | 118/1784 [05:58<1:32:17, 3.32s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:10,974 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▏ | 118/1784 [05:58<1:32:17, 3.32s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:14,258 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▏ | 119/1784 [06:02<1:31:36, 3.30s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:14,258 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▏ | 119/1784 [06:02<1:31:36, 3.30s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:14,258 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▏ | 120/1784 
[06:05<1:30:51, 3.28s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:17,461 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▏ | 120/1784 [06:05<1:30:51, 3.28s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:17,461 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▏ | 120/1784 [06:05<1:30:51, 3.28s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:20,681 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 121/1784 [06:08<1:30:20, 3.26s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:20,681 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 121/1784 [06:08<1:30:20, 3.26s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:20,681 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 122/1784 [06:11<1:30:06, 3.25s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:23,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 122/1784 [06:11<1:30:06, 3.25s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:27,140 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 122/1784 [06:11<1:30:06, 3.25s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:27,140 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 123/1784 [06:15<1:29:33, 3.24s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:27,140 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 123/1784 [06:15<1:29:33, 3.24s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:27,140 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 7%|█████▍ | 123/1784 [06:15<1:29:33, 3.24s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:30,314 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 124/1784 [06:18<1:28:42, 3.21s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:33,422 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 124/1784 [06:18<1:28:42, 3.21s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:33,422 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 125/1784 [06:21<1:27:52, 3.18s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:33,422 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 125/1784 [06:21<1:27:52, 3.18s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:33,422 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 125/1784 [06:21<1:27:52, 3.18s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:36,523 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 126/1784 [06:24<1:27:20, 3.16s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:39,678 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 126/1784 [06:24<1:27:20, 3.16s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:39,678 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 127/1784 [06:27<1:26:56, 3.15s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:39,678 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 127/1784 [06:27<1:26:56, 3.15s/it][WARNING|modeling_utils.py:388] 2022-03-01 
01:37:39,678 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 127/1784 [06:27<1:26:56, 3.15s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:42,770 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 127/1784 [06:27<1:26:56, 3.15s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:42,770 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 128/1784 [06:30<1:27:13, 3.16s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:42,770 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 128/1784 [06:30<1:27:13, 3.16s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:42,770 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 129/1784 [06:33<1:26:39, 3.14s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:45,945 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 129/1784 [06:33<1:26:39, 3.14s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:49,043 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 129/1784 [06:33<1:26:39, 3.14s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:49,043 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 130/1784 [06:36<1:25:31, 3.10s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:51,991 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 130/1784 [06:36<1:25:31, 3.10s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:51,991 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 131/1784 
[06:39<1:24:44, 3.08s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:51,991 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 131/1784 [06:39<1:24:44, 3.08s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:55,037 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 131/1784 [06:39<1:24:44, 3.08s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:55,037 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▊ | 132/1784 [06:42<1:24:04, 3.05s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:57,978 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▊ | 132/1784 [06:42<1:24:04, 3.05s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:57,978 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▊ | 133/1784 [06:45<1:22:54, 3.01s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:57,978 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▊ | 133/1784 [06:45<1:22:54, 3.01s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:37:57,978 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▊ | 133/1784 [06:45<1:22:54, 3.01s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:00,856 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|█████▊ | 134/1784 [06:48<1:21:17, 2.96s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:03,706 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|█████▊ | 134/1784 [06:48<1:21:17, 2.96s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:03,706 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 8%|█████▉ | 135/1784 [06:51<1:20:16, 2.92s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:03,706 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|█████▉ | 135/1784 [06:51<1:20:16, 2.92s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:03,706 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|█████▉ | 136/1784 [06:54<1:19:21, 2.89s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:06,522 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|█████▉ | 136/1784 [06:54<1:19:21, 2.89s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:06,522 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|█████▉ | 136/1784 [06:54<1:19:21, 2.89s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:09,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|█████▉ | 137/1784 [06:56<1:17:48, 2.83s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:11,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|█████▉ | 137/1784 [06:56<1:17:48, 2.83s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:11,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 138/1784 [06:59<1:16:33, 2.79s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:11,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 138/1784 [06:59<1:16:33, 2.79s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:11,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 139/1784 [07:02<1:14:38, 2.72s/it][WARNING|modeling_utils.py:388] 2022-03-01 
01:38:14,624 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 139/1784 [07:02<1:14:38, 2.72s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:14,624 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 140/1784 [07:04<1:13:07, 2.67s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:17,184 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 140/1784 [07:04<1:13:07, 2.67s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:19,645 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 140/1784 [07:04<1:13:07, 2.67s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:19,645 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 141/1784 [07:07<1:10:53, 2.59s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:21,990 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 141/1784 [07:07<1:10:53, 2.59s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:21,990 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 142/1784 [07:09<1:08:14, 2.49s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:24,202 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 142/1784 [07:09<1:08:14, 2.49s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:24,202 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 143/1784 [07:11<1:05:29, 2.39s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:26,258 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 144/1784 
[07:13<1:01:46, 2.26s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:28,133 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4011, 'learning_rate': 8.400000000000001e-06, 'epoch': 0.08} 8%|██████▌ | 146/1784 [07:17<54:33, 2.00s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:29,917 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.484, 'learning_rate': 8.52e-06, 'epoch': 0.08} 8%|██████▌ | 147/1784 [07:18<50:18, 1.84s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:32,930 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5693, 'learning_rate': 8.64e-06, 'epoch': 0.08} 8%|██████▋ | 149/1784 [07:21<42:16, 1.55s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:34,203 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▋ | 150/1784 [07:22<42:43, 1.57s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:35,358 >> Could not estimate the number of tokens of the input, floating-point operations will
not be computed 8%|██████▋ | 150/1784 [07:22<42:43, 1.57s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:38,369 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▌ | 151/1784 [07:26<1:00:27, 2.22s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:38,369 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▌ | 151/1784 [07:26<1:00:27, 2.22s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:41,951 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▋ | 152/1784 [07:30<1:11:40, 2.64s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:41,951 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▋ | 152/1784 [07:30<1:11:40, 2.64s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:45,555 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▋ | 153/1784 [07:33<1:19:58, 2.94s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:45,555 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▋ | 154/1784 [07:37<1:25:09, 3.13s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:49,192 >> Could not
estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▋ | 154/1784 [07:37<1:25:09, 3.13s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:52,759 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▊ | 155/1784 [07:40<1:28:37, 3.26s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:52,759 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▊ | 155/1784 [07:40<1:28:37, 3.26s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:56,310 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▊ | 156/1784 [07:44<1:30:25, 3.33s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:56,310 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▊ | 156/1784 [07:44<1:30:25, 3.33s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:38:59,809 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▊ | 157/1784 [07:47<1:32:04,
3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:39:03,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▉ | 158/1784 [07:51<1:33:01, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:39:03,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▉ | 158/1784 [07:51<1:33:01, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:39:06,824 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▉ | 159/1784 [07:54<1:33:13, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:39:06,824 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▉ | 159/1784 [07:54<1:33:13, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:39:10,337 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▉ | 160/1784 [07:58<1:33:35, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:39:13,801 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 161/1784 [08:01<1:33:25, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:39:13,801 >> Could not estimate the number of tokens of the input, floating-point
operations will not be computed 9%|███████ | 161/1784 [08:01<1:33:25, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:39:17,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 162/1784 [08:05<1:32:59, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:39:17,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 162/1784 [08:05<1:32:59, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:39:20,669 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▏ | 163/1784 [08:08<1:32:57, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:39:24,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▏ | 164/1784 [08:12<1:32:41, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:39:24,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5006, 'learning_rate': 9.600000000000001e-06, 'epoch': 0.09} 9%|███████▏ | 165/1784
[08:15<1:32:22, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:39:24,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.287, 'learning_rate': 9.66e-06, 'epoch': 0.09} 9%|███████▎ | 166/1784 [08:18<1:31:41, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:39:34,173 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▎ | 167/1784 [08:22<1:30:51, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:39:34,173 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3238, 'learning_rate': 9.780000000000001e-06, 'epoch': 0.09} 9%|███████▎ | 168/1784 [08:25<1:29:51, 3.34s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:39:34,173 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:39:42,272 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1272, 'learning_rate': 9.9e-06, 'epoch': 0.09} 10%|███████▍ | 170/1784 [08:31<1:28:31, 3.29s/it] {'loss': 4.2639, 'learning_rate': 9.960000000000001e-06, 'epoch': 0.1} 10%|███████▍ | 171/1784 [08:35<1:27:59, 3.27s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:39:50,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▌ | 172/1784 [08:38<1:27:49, 3.27s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:39:50,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2672, 'learning_rate': 1.008e-05, 'epoch': 0.1} 10%|███████▌ |
173/1784 [08:41<1:27:53, 3.27s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:39:50,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:39:58,469 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1786, 'learning_rate': 1.02e-05, 'epoch': 0.1} 10%|███████▋ | 175/1784 [08:48<1:26:15, 3.22s/it] [WARNING|modeling_utils.py:388] 2022-03-01 01:40:04,745 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1529, 'learning_rate': 1.032e-05, 'epoch': 0.1} 10%|███████▋ | 177/1784 [08:54<1:24:39, 3.16s/it] {'loss': 4.205, 'learning_rate': 1.0379999999999999e-05, 'epoch': 0.1} 10%|███████▊ | 178/1784 [08:57<1:23:59, 3.14s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:40:12,509 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▊ | 179/1784 [09:00<1:23:10, 3.11s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:40:12,509 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:40:16,964 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4296, 'learning_rate': 1.0559999999999999e-05, 'epoch': 0.1} 10%|███████▉ | 181/1784 [09:06<1:21:32, 3.05s/it] [WARNING|modeling_utils.py:388] 2022-03-01 01:40:22,893 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2324, 'learning_rate': 1.068e-05, 'epoch': 0.1} 10%|████████ | 183/1784 [09:12<1:19:57, 3.00s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:40:27,287 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████ | 184/1784 [09:15<1:18:27, 2.94s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:40:27,287 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3776, 'learning_rate': 1.08e-05, 'epoch': 0.1} 10%|████████ | 184/1784 [09:15<1:18:27, 2.94s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:40:27,287 >> Could not estimate the number of tokens of the input, floating-point operations
will not be computed 10%|████████ | 185/1784 [09:17<1:18:01, 2.93s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:40:33,018 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████▏ | 186/1784 [09:20<1:17:05, 2.89s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:40:33,018 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:40:37,208 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.242, 'learning_rate': 1.098e-05, 'epoch': 0.1} 11%|████████▏ | 188/1784 [09:26<1:15:38, 2.84s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:40:41,364 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▎ | 189/1784
[09:29<1:14:03, 2.79s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:40:41,364 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:40:45,143 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:40:47,520 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:40:49,761 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:40:51,860 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:40:53,840 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:40:55,733 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:40:57,483 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:40:59,018 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:41:01,623 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3407, 'learning_rate': 1.164e-05, 'epoch': 0.11} [WARNING|modeling_utils.py:388] 2022-03-01 01:41:03,267 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2983, 'learning_rate': 1.1760000000000001e-05, 'epoch': 0.11} [WARNING|modeling_utils.py:388] 2022-03-01 01:41:07,055 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2789, 'learning_rate': 1.182e-05, 'epoch': 0.11} 11%|████████▊ | 202/1784 [09:57<1:10:25, 2.67s/it] {'loss': 4.1983, 'learning_rate': 1.1880000000000001e-05, 'epoch': 0.11} 11%|████████▉ | 203/1784 [10:00<1:17:30, 2.94s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:41:16,098 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 204/1784 [10:04<1:22:20, 3.13s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:41:16,098 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.059, 'learning_rate': 1.2e-05, 'epoch': 0.11} 11%|████████▉ | 205/1784 [10:07<1:25:13,
3.24s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:41:16,098 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2062, 'learning_rate': 1.2060000000000001e-05, 'epoch': 0.11} 12%|█████████ | 206/1784 [10:11<1:27:05, 3.31s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:41:16,098 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:41:28,290 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3501, 'learning_rate': 1.2180000000000002e-05, 'epoch': 0.12} 12%|█████████ | 208/1784 [10:18<1:29:29, 3.41s/it] {'loss': 4.0598, 'learning_rate': 1.224e-05, 'epoch': 0.12} 12%|█████████▏
| 209/1784 [10:21<1:29:54, 3.43s/it] {'loss': 4.1062, 'learning_rate': 1.2299999999999999e-05, 'epoch': 0.12} 12%|█████████▏ | 210/1784 [10:25<1:29:46, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:41:40,442 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▏ | 211/1784 [10:28<1:29:39, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:41:40,442 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2774, 'learning_rate': 1.242e-05, 'epoch': 0.12} 12%|█████████▎ | 212/1784 [10:31<1:29:18, 3.41s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:41:40,442 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:41:48,882 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1994, 'learning_rate': 1.254e-05, 'epoch': 0.12} 12%|█████████▎ | 214/1784 [10:38<1:28:35, 3.39s/it] {'loss': 4.0105, 'learning_rate': 1.26e-05, 'epoch': 0.12} 12%|█████████▍ | 215/1784 [10:41<1:27:56, 3.36s/it] [WARNING|modeling_utils.py:388] 2022-03-01 01:41:58,836 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.8713, 'learning_rate': 1.272e-05, 'epoch': 0.12} 12%|█████████▍ | 217/1784 [10:48<1:27:19,
3.34s/it] {'loss': 4.1284, 'learning_rate': 1.278e-05, 'epoch': 0.12} 12%|█████████▌ | 218/1784 [10:51<1:27:09, 3.34s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:42:07,186 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▌ | 219/1784 [10:55<1:26:29, 3.32s/it] {'loss': 4.1782, 'learning_rate': 1.29e-05, 'epoch': 0.12} 12%|█████████▌ | 220/1784 [10:58<1:25:59, 3.30s/it] [WARNING|modeling_utils.py:388] 2022-03-01 01:42:15,284 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.3053, 'learning_rate': 1.302e-05, 'epoch': 0.12} 12%|█████████▋ | 222/1784 [11:04<1:25:18, 3.28s/it] [WARNING|modeling_utils.py:388] 2022-03-01 01:42:21,809 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1366, 'learning_rate': 1.314e-05, 'epoch': 0.12} 13%|█████████▊ | 224/1784 [11:11<1:24:45, 3.26s/it]
13%|█████████▊ | 225/1784 [11:14<1:23:46, 3.22s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:42:29,808 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|█████████▉ | 226/1784 [11:17<1:23:09, 3.20s/it] 13%|█████████▉ | 227/1784 [11:20<1:22:19, 3.17s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:42:36,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|█████████▉ | 228/1784 [11:23<1:21:51, 3.16s/it]
13%|██████████ | 229/1784 [11:27<1:21:33, 3.15s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:42:42,279 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 230/1784 [11:30<1:20:55, 3.12s/it] 13%|██████████ | 231/1784 [11:33<1:20:02, 3.09s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:42:48,316 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▏ | 232/1784 [11:36<1:19:12,
3.06s/it] {'loss': 4.2574, 'learning_rate': 1.3680000000000001e-05, 'epoch': 0.13} 13%|██████████▏ | 233/1784 [11:39<1:17:51, 3.01s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:42:54,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▏ | 234/1784 [11:41<1:16:21, 2.96s/it] [WARNING|modeling_utils.py:388] 2022-03-01 01:42:58,269 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.121, 'learning_rate': 1.3860000000000001e-05, 'epoch': 0.13} 13%|██████████▎ | 236/1784 [11:47<1:14:10,
2.87s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:43:02,484 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▎ | 237/1784 [11:50<1:12:36, 2.82s/it] [WARNING|modeling_utils.py:388] 2022-03-01 01:43:06,315 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:43:08,841 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
13%|██████████▍ | 240/1784 [11:57<1:07:04, 2.61s/it] [WARNING|modeling_utils.py:388] 2022-03-01 01:43:13,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:43:15,573 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:43:17,480 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:43:19,216 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:43:22,300 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6007, 'learning_rate': 1.446e-05, 'epoch': 0.14}
[WARNING|modeling_utils.py:388] 2022-03-01 01:43:23,666 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:43:26,100 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4152, 'learning_rate': 1.464e-05, 'epoch': 0.14} [WARNING|modeling_utils.py:388] 2022-03-01 01:43:27,720 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2647, 'learning_rate': 1.4760000000000001e-05, 'epoch': 0.14} [WARNING|modeling_utils.py:388] 2022-03-01 01:43:31,587 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1991, 'learning_rate': 1.482e-05, 'epoch': 0.14}
[WARNING|modeling_utils.py:388] 2022-03-01 01:43:35,237 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:43:38,899 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1763, 'learning_rate': 1.4940000000000001e-05, 'epoch': 0.14} 14%|███████████ | 254/1784 [12:28<1:20:29, 3.16s/it] 14%|███████████▏ | 255/1784 [12:32<1:23:14, 3.27s/it] 14%|███████████▏ | 256/1784 [12:35<1:24:44, 3.33s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:43:53,063 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2139, 'learning_rate': 1.518e-05, 'epoch': 0.14} 14%|███████████▎ | 258/1784 [12:42<1:27:01, 3.42s/it] {'loss': 4.3484, 'learning_rate': 1.524e-05, 'epoch': 0.14} 15%|███████████▎ | 259/1784 [12:46<1:27:24, 3.44s/it] {'loss': 4.4464, 'learning_rate': 1.53e-05, 'epoch': 0.15}
15%|███████████▎ | 260/1784 [12:49<1:27:38, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:44:05,275 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▍ | 261/1784 [12:53<1:27:11, 3.43s/it] {'loss': 4.0849, 'learning_rate': 1.542e-05, 'epoch': 0.15} 15%|███████████▍ | 262/1784 [12:56<1:26:15, 3.40s/it] [WARNING|modeling_utils.py:388] 2022-03-01 01:44:13,621 >> Could not estimate the number of
tokens of the input, floating-point operations will not be computed {'loss': 4.2621, 'learning_rate': 1.554e-05, 'epoch': 0.15} 15%|███████████▌ | 264/1784 [13:03<1:26:18, 3.41s/it] {'loss': 4.0818, 'learning_rate': 1.56e-05, 'epoch': 0.15} 15%|███████████▌ | 265/1784 [13:06<1:25:39, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:44:22,112 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▋ | 266/1784 [13:10<1:24:57, 3.36s/it] {'loss': 4.0042, 'learning_rate': 1.5720000000000002e-05, 'epoch': 0.15} 15%|███████████▋ | 267/1784 [13:13<1:24:41, 3.35s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:44:30,252 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.221, 'learning_rate': 1.584e-05, 'epoch': 0.15} 15%|███████████▊ | 269/1784 [13:19<1:23:13, 3.30s/it] 15%|███████████▊ | 270/1784 [13:23<1:23:06, 3.29s/it] [WARNING|modeling_utils.py:388] 2022-03-01 01:44:40,026 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.3152, 'learning_rate': 1.6020000000000002e-05, 'epoch': 0.15} 15%|███████████▉ | 272/1784 [13:29<1:21:52, 3.25s/it] [WARNING|modeling_utils.py:388] 2022-03-01 01:44:46,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0427, 'learning_rate': 1.614e-05, 'epoch': 0.15} 15%|███████████▉ | 274/1784 [13:36<1:21:12, 3.23s/it] [WARNING|modeling_utils.py:388] 2022-03-01 01:44:52,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2973, 'learning_rate': 1.626e-05, 'epoch': 0.15} 15%|████████████ | 276/1784 [13:42<1:20:24, 3.20s/it] {'loss': 4.2771, 'learning_rate': 1.6320000000000003e-05, 'epoch': 0.15} 16%|████████████ | 277/1784 [13:45<1:20:01, 3.19s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:45:00,743 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▏ | 278/1784 [13:48<1:19:29, 3.17s/it] {'loss': 4.3386, 'learning_rate': 1.6440000000000002e-05, 'epoch':
0.16} 16%|████████████▏ | 279/1784 [13:51<1:18:47, 3.14s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:45:00,743 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▏ | 279/1784 [13:51<1:18:47, 3.14s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:45:00,743 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:45:08,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:00,743 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:45:08,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:00,743 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.201, 'learning_rate': 1.656e-05, 'epoch': 0.16} 16%|████████████▎ | 281/1784 [13:57<1:17:04, 3.08s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:45:12,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▎ | 281/1784 [13:57<1:17:04, 3.08s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:45:12,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▎ | 282/1784 [14:00<1:16:21, 3.05s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:45:12,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▎ | 282/1784 [14:00<1:16:21, 3.05s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:45:12,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3703, 'learning_rate': 1.6680000000000003e-05, 'epoch': 0.16} 16%|████████████▎ 
| 283/1784 [14:03<1:15:16, 3.01s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:45:18,786 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▎ | 283/1784 [14:03<1:15:16, 3.01s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:45:18,786 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▍ | 284/1784 [14:06<1:14:09, 2.97s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:45:18,786 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▍ | 284/1784 [14:06<1:14:09, 2.97s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:45:18,786 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:45:23,012 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:18,786 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:45:23,012 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:18,786 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2593, 'learning_rate': 1.686e-05, 'epoch': 0.16} 16%|████████████▌ | 286/1784 [14:12<1:12:29, 2.90s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:45:27,277 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▌ | 286/1784 [14:12<1:12:29, 2.90s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:45:27,277 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▌ | 287/1784 [14:15<1:11:37, 2.87s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:45:27,277 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▌ | 287/1784 [14:15<1:11:37, 2.87s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:45:27,277 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:45:31,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:27,277 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:45:31,265 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:27,277 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4772, 'learning_rate': 1.704e-05, 'epoch': 0.16} 16%|████████████▋ | 289/1784 [14:20<1:08:11, 2.74s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:45:35,143 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▋ | 289/1784 [14:20<1:08:11, 2.74s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:45:35,143 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▋ | 290/1784 [14:22<1:05:59, 2.65s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:45:37,538 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▋ | 290/1784 [14:22<1:05:59, 2.65s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:45:37,538 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▋ | 291/1784 [14:25<1:03:37, 2.56s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed 16%|████████████▋ | 291/1784 [14:25<1:03:37, 2.56s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▊ | 292/1784 [14:27<1:00:57, 2.45s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▊ | 292/1784 [14:27<1:00:57, 2.45s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:45:42,873 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:45:42,873 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:45:44,780 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:45:44,780 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:45:46,621 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:45:46,621 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:45:49,927 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:45:49,927 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:45:51,332 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:45:51,332 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2866, 'learning_rate': 1.758e-05, 'epoch': 0.17} [WARNING|modeling_utils.py:388] 2022-03-01 01:45:52,614 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:45:52,614 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:45:54,345 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:45:54,345 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5097, 'learning_rate': 1.776e-05, 'epoch': 0.17} [WARNING|modeling_utils.py:388] 2022-03-01 01:45:58,256 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:45:58,256 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:46:01,951 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:46:01,951 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1269, 'learning_rate': 1.7879999999999998e-05, 'epoch': 0.17} [WARNING|modeling_utils.py:388] 2022-03-01 01:46:01,951 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▏ | 303/1784 [14:51<1:14:02, 3.00s/it]g-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▏ | 303/1784 [14:51<1:14:02, 3.00s/it]g-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▏ | 303/1784 [14:51<1:14:02, 3.00s/it]g-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▎ | 304/1784 [14:55<1:18:34, 3.19s/it]g-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▎ | 304/1784 [14:55<1:18:34, 3.19s/it]g-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▎ | 304/1784 [14:55<1:18:34, 3.19s/it]g-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▎ | 305/1784 [14:59<1:21:35, 3.31s/it]g-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▎ | 305/1784 [14:59<1:21:35, 3.31s/it]g-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:46:16,320 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:46:16,320 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:46:16,320 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▍ | 307/1784 [15:06<1:23:56, 3.41s/it]g-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▍ | 307/1784 [15:06<1:23:56, 3.41s/it]g-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▍ | 307/1784 [15:06<1:23:56, 3.41s/it]g-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▍ | 308/1784 [15:09<1:24:28, 3.43s/it]g-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▍ | 308/1784 [15:09<1:24:28, 3.43s/it]g-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▍ | 308/1784 [15:09<1:24:28, 3.43s/it]g-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed 17%|█████████████▌ | 309/1784 [15:13<1:25:20, 3.47s/it]g-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:46:30,286 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:46:30,286 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3007, 'learning_rate': 1.836e-05, 'epoch': 0.17} 17%|█████████████▌ | 311/1784 [15:20<1:24:34, 3.45s/it]g-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▌ | 311/1784 [15:20<1:24:34, 3.45s/it]g-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.301, 'learning_rate': 1.842e-05, 'epoch': 0.17} 17%|█████████████▌ | 311/1784 [15:20<1:24:34, 3.45s/it]g-point operations will not be computed-01 01:45:39,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▋ | 312/1784 [15:23<1:24:13, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▋ | 312/1784 [15:23<1:24:13, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed 18%|█████████████▋ | 313/1784 [15:26<1:23:36, 3.41s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▋ | 313/1784 [15:26<1:23:36, 3.41s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▋ | 313/1784 [15:26<1:23:36, 3.41s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▋ | 314/1784 [15:30<1:22:56, 3.39s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:46:47,138 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:46:47,138 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3755, 'learning_rate': 1.866e-05, 'epoch': 0.18} [WARNING|modeling_utils.py:388] 2022-03-01 01:46:47,138 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 316/1784 [15:36<1:21:50, 3.34s/it]g-point operations will not be computed-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
18%|█████████████▊ | 316/1784 [15:36<1:21:50, 3.34s/it]g-point operations will not be computed-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 316/1784 [15:36<1:21:50, 3.34s/it]g-point operations will not be computed-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 317/1784 [15:40<1:21:19, 3.33s/it]g-point operations will not be computed-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 317/1784 [15:40<1:21:19, 3.33s/it]g-point operations will not be computed-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:46:57,014 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:46:57,014 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:46:57,014 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 319/1784 [15:46<1:20:42, 3.31s/it]g-point operations will not be computed-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:47:03,578 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:47:03,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2926, 'learning_rate': 1.896e-05, 'epoch': 0.18} [WARNING|modeling_utils.py:388] 2022-03-01 01:47:03,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████ | 321/1784 [15:53<1:20:10, 3.29s/it]g-point operations will not be computed-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████ | 321/1784 [15:53<1:20:10, 3.29s/it]g-point operations will not be computed-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████ | 321/1784 [15:53<1:20:10, 3.29s/it]g-point operations will not be computed-01 01:46:38,837 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████ | 322/1784 [15:56<1:19:58, 3.28s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:47:11,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████ | 322/1784 [15:56<1:19:58, 3.28s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:47:11,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████ | 323/1784 [15:59<1:19:37, 3.27s/it][WARNING|modeling_utils.py:388] 2022-03-01 
01:47:11,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████ | 323/1784 [15:59<1:19:37, 3.27s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:47:11,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████ | 323/1784 [15:59<1:19:37, 3.27s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:47:11,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▏ | 324/1784 [16:02<1:18:13, 3.21s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:47:11,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▏ | 324/1784 [16:02<1:18:13, 3.21s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:47:11,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:47:19,522 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:47:11,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:47:19,522 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:47:11,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:47:19,522 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:47:11,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▎ | 326/1784 [16:09<1:16:52, 3.16s/it]g-point operations will not be computed-01 01:47:11,755 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 18%|██████████████▎ | 326/1784 [16:09<1:16:52, 3.16s/it]g-point operations will not be computed-01 01:47:11,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:47:25,784 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:47:11,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:47:25,784 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:47:11,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:47:25,784 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:47:11,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▎ | 328/1784 [16:15<1:15:49, 3.12s/it]g-point operations will not be computed-01 01:47:11,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▎ | 328/1784 [16:15<1:15:49, 3.12s/it]g-point operations will not be computed-01 01:47:11,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:47:31,879 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:47:11,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:47:31,879 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:47:11,755 >> Could 
not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:47:31,879 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:47:11,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▍ | 330/1784 [16:21<1:14:51, 3.09s/it]g-point operations will not be computed-01 01:47:11,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▍ | 330/1784 [16:21<1:14:51, 3.09s/it]g-point operations will not be computed-01 01:47:11,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:47:37,917 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:47:11,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:47:37,917 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:47:11,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▌ | 332/1784 [16:27<1:13:29, 3.04s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:47:42,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▌ | 332/1784 [16:27<1:13:29, 3.04s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:47:42,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▌ | 333/1784 [16:30<1:12:41, 3.01s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:47:42,444 >> Could not estimate the number of tokens of the input, floating-point operations will 
not be computed 19%|██████████████▌ | 333/1784 [16:30<1:12:41, 3.01s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:47:42,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3714, 'learning_rate': 1.974e-05, 'epoch': 0.19} 19%|██████████████▌ | 334/1784 [16:33<1:11:36, 2.96s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:47:48,184 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▌ | 334/1784 [16:33<1:11:36, 2.96s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:47:48,184 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▋ | 335/1784 [16:35<1:10:35, 2.92s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:47:48,184 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▋ | 335/1784 [16:35<1:10:35, 2.92s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:47:48,184 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:47:52,341 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:47:48,184 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:47:52,341 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:47:48,184 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.152, 'learning_rate': 1.9920000000000002e-05, 'epoch': 0.19} 19%|██████████████▋ | 337/1784 [16:41<1:08:44, 2.85s/it]g-point operations will not be computed-01 01:47:48,184 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed
19%|██████████████▋ | 337/1784 [16:41<1:08:44, 2.85s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:47:57,808 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 01:48:00,447 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2196, 'learning_rate': 2.01e-05, 'epoch': 0.19}
19%|██████████████▊ | 340/1784 [16:49<1:04:54, 2.70s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:48:04,339 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
19%|██████████████▉ | 341/1784 [16:51<1:03:40, 2.65s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:48:07,930 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 01:48:10,167 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 01:48:12,294 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 01:48:14,223 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 01:48:15,989 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 01:48:19,063 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.6586, 'learning_rate': 2.0580000000000003e-05, 'epoch': 0.19}
[WARNING|modeling_utils.py:388] 2022-03-01 01:48:20,301 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 01:48:21,984 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2302, 'learning_rate': 2.0759999999999998e-05, 'epoch': 0.2}
[WARNING|modeling_utils.py:388] 2022-03-01 01:48:25,850 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.3514, 'learning_rate': 2.082e-05, 'epoch': 0.2}
20%|███████████████▍ | 352/1784 [17:15<1:04:46, 2.71s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:48:31,356 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
20%|███████████████▍ | 353/1784 [17:19<1:11:21, 2.99s/it]
{'loss': 4.0628, 'learning_rate': 2.094e-05, 'epoch': 0.2}
20%|███████████████▍ | 354/1784 [17:23<1:15:40, 3.18s/it]
{'loss': 4.1922, 'learning_rate': 2.1e-05, 'epoch': 0.2}
20%|███████████████▌ | 355/1784 [17:26<1:18:37, 3.30s/it]
{'loss': 3.8978, 'learning_rate': 2.1059999999999998e-05, 'epoch': 0.2}
20%|███████████████▌ | 356/1784 [17:30<1:20:30, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:48:45,685 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
20%|███████████████▌ | 357/1784 [17:33<1:21:06, 3.41s/it]
{'loss': 4.3353, 'learning_rate': 2.118e-05, 'epoch': 0.2}
20%|███████████████▋ | 358/1784 [17:37<1:21:55, 3.45s/it]
{'loss': 4.3339, 'learning_rate': 2.124e-05, 'epoch': 0.2}
20%|███████████████▋ | 359/1784 [17:40<1:22:24, 3.47s/it]
{'loss': 4.4832, 'learning_rate': 2.13e-05, 'epoch': 0.2}
20%|███████████████▋ | 360/1784 [17:44<1:22:39, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:48:59,742 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
20%|███████████████▊ | 361/1784 [17:47<1:22:25, 3.48s/it]
{'loss': 4.2226, 'learning_rate': 2.1419999999999998e-05, 'epoch': 0.2}
20%|███████████████▊ | 362/1784 [17:51<1:21:44, 3.45s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:49:08,168 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
20%|███████████████▉ | 364/1784 [17:57<1:20:42, 3.41s/it]
{'loss': 4.2995, 'learning_rate': 2.16e-05, 'epoch': 0.2}
20%|███████████████▉ | 365/1784 [18:01<1:20:12, 3.39s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:49:18,206 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.3832, 'learning_rate': 2.172e-05, 'epoch': 0.21}
21%|████████████████ | 367/1784 [18:07<1:19:09, 3.35s/it]
21%|████████████████ | 368/1784 [18:11<1:18:36, 3.33s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:49:26,518 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
21%|████████████████▏ | 369/1784 [18:14<1:18:34, 3.33s/it]
21%|████████████████▏ | 370/1784 [18:17<1:18:27, 3.33s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:49:34,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1984, 'learning_rate': 2.202e-05, 'epoch': 0.21}
21%|████████████████▎ | 372/1784 [18:24<1:17:20, 3.29s/it]
21%|████████████████▎ | 373/1784 [18:27<1:16:28, 3.25s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:49:42,778 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
21%|████████████████▎ | 374/1784 [18:30<1:15:42, 3.22s/it]
21%|████████████████▍ | 375/1784 [18:33<1:15:00, 3.19s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:49:50,537 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
21%|████████████████▍ | 377/1784 [18:40<1:14:16, 3.17s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:49:56,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
21%|████████████████▌ | 379/1784 [18:46<1:12:58, 3.12s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:50:02,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
21%|████████████████▋ | 381/1784 [18:52<1:11:44, 3.07s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:50:07,417 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
21%|████████████████▋ | 382/1784 [18:55<1:11:06, 3.04s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:50:11,748 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2409, 'learning_rate': 2.274e-05, 'epoch': 0.21}
22%|████████████████▊ | 384/1784 [19:00<1:08:46, 2.95s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:50:17,373 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.3583, 'learning_rate': 2.286e-05, 'epoch': 0.22}
22%|████████████████▉ | 386/1784 [19:06<1:06:34, 2.86s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:50:21,562 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|████████████████▉ | 387/1784 [19:09<1:05:41, 2.82s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:50:25,487 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 01:50:28,044 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.3362, 'learning_rate': 2.3100000000000002e-05, 'epoch': 0.22}
22%|█████████████████ | 390/1784 [19:16<1:01:01, 2.63s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:50:31,703 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|█████████████████▌ | 391/1784 [19:19<58:30, 2.52s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:50:33,919 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|█████████████████▌ | 392/1784 [19:21<56:03, 2.42s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:50:36,023 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|█████████████████▌ | 393/1784 [19:23<53:08, 2.29s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:50:37,950 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|█████████████████▋ | 394/1784 [19:25<50:06, 2.16s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:50:39,763 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|█████████████████▋ | 395/1784 [19:26<47:10, 2.04s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:50:41,382 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.3343, 'learning_rate': 2.3520000000000002e-05, 'epoch': 0.22}
22%|█████████████████▊ | 397/1784 [19:29<39:47, 1.72s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:50:42,810 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.7341, 'learning_rate': 2.364e-05, 'epoch': 0.22}
22%|█████████████████▉ | 399/1784 [19:32<33:19, 1.44s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:50:45,256 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|█████████████████▉ | 400/1784 [19:33<34:31, 1.50s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:50:46,384 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 01:50:49,418 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
22%|█████████████████▉ | 401/1784 [19:37<50:44, 2.20s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:50:53,174 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
23%|█████████████████▌ | 402/1784 [19:41<1:00:49, 2.64s/it]
23%|█████████████████▌ | 403/1784 [19:44<1:07:39, 2.94s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:51:02,080 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.3194, 'learning_rate': 2.4e-05, 'epoch': 0.23}
23%|█████████████████▋ | 405/1784 [19:52<1:14:47, 3.25s/it]
23%|█████████████████▊ | 406/1784 [19:55<1:16:45, 3.34s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:51:12,677 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1186, 'learning_rate': 2.4180000000000002e-05, 'epoch': 0.23}
23%|█████████████████▊ | 408/1784 [20:02<1:17:49, 3.39s/it]
23%|█████████████████▉ | 409/1784 [20:05<1:17:55, 3.40s/it]
23%|█████████████████▉ | 410/1784 [20:09<1:17:53, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
23%|█████████████████▉ | 411/1784 [20:12<1:17:59, 3.41s/it]
23%|██████████████████ | 412/1784 [20:16<1:18:06, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point
operations will not be computed 23%|██████████████████ | 412/1784 [20:16<1:18:06, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████ | 413/1784 [20:19<1:18:17, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████ | 413/1784 [20:19<1:18:17, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:51:36,597 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:51:36,597 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:51:36,597 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▏ | 415/1784 [20:26<1:17:33, 3.40s/it]g-point operations will not be computed-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:51:43,309 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:51:43,309 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1832, 'learning_rate': 2.472e-05, 'epoch': 0.23} [WARNING|modeling_utils.py:388] 2022-03-01 01:51:43,309 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▏ | 417/1784 [20:33<1:16:29, 3.36s/it]g-point operations will not be computed-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▏ | 417/1784 [20:33<1:16:29, 3.36s/it]g-point operations will not be computed-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▏ | 417/1784 [20:33<1:16:29, 3.36s/it]g-point operations will not be computed-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▎ | 418/1784 [20:36<1:16:28, 3.36s/it]g-point operations will not be computed-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:51:53,230 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:51:53,230 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:51:24,668 >> Could 
not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.289, 'learning_rate': 2.49e-05, 'epoch': 0.23} [WARNING|modeling_utils.py:388] 2022-03-01 01:51:53,230 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▎ | 420/1784 [20:42<1:15:42, 3.33s/it]g-point operations will not be computed-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:51:59,911 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:51:59,911 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1223, 'learning_rate': 2.502e-05, 'epoch': 0.24} [WARNING|modeling_utils.py:388] 2022-03-01 01:51:59,911 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▍ | 422/1784 [20:49<1:15:02, 3.31s/it]g-point operations will not be computed-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▍ | 422/1784 [20:49<1:15:02, 3.31s/it]g-point operations will not be computed-01 01:51:24,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
24%|██████████████████▍ | 423/1784 [20:52<1:14:22, 3.28s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:52:08,038 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▌ | 424/1784 [20:56<1:14:03, 3.27s/it] {'loss': 4.5131, 'learning_rate': 2.52e-05, 'epoch': 0.24} 24%|██████████████████▌ | 425/1784 [20:59<1:13:31, 3.25s/it] [WARNING|modeling_utils.py:388] 2022-03-01 01:52:15,926 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2671, 'learning_rate': 2.5319999999999998e-05,
'epoch': 0.24} 24%|██████████████████▋ | 427/1784 [21:05<1:11:55, 3.18s/it] [WARNING|modeling_utils.py:388] 2022-03-01 01:52:22,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1743, 'learning_rate': 2.544e-05, 'epoch': 0.24} 24%|██████████████████▊ | 429/1784 [21:11<1:10:12, 3.11s/it] [WARNING|modeling_utils.py:388] 2022-03-01 01:52:28,120 >> Could not estimate the
number of tokens of the input, floating-point operations will not be computed {'loss': 4.2877, 'learning_rate': 2.556e-05, 'epoch': 0.24} 24%|██████████████████▊ | 431/1784 [21:17<1:08:52, 3.05s/it] [WARNING|modeling_utils.py:388] 2022-03-01 01:52:34,074 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2805, 'learning_rate': 2.568e-05, 'epoch': 0.24} 24%|██████████████████▉ | 433/1784 [21:23<1:06:45, 2.97s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:52:38,381 >>
Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▉ | 434/1784 [21:26<1:05:58, 2.93s/it] [WARNING|modeling_utils.py:388] 2022-03-01 01:52:42,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3591, 'learning_rate': 2.586e-05, 'epoch': 0.24} 24%|███████████████████ | 436/1784 [21:31<1:03:46, 2.84s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:52:46,658 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████ | 437/1784 [21:34<1:02:37, 2.79s/it] [WARNING|modeling_utils.py:388] 2022-03-01 01:52:50,623 >> Could not estimate the number of tokens of the input,
floating-point operations will not be computed {'loss': 4.4241, 'learning_rate': 2.604e-05, 'epoch': 0.25} 25%|███████████████████▏ | 439/1784 [21:39<1:00:38, 2.71s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:52:54,543 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████▋ | 440/1784 [21:42<59:16, 2.65s/it] [WARNING|modeling_utils.py:388] 2022-03-01
01:52:58,095 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:53:00,256 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:53:02,221 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:53:04,088 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:53:05,813 >> Could not estimate the
number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:53:08,829 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.705, 'learning_rate': 2.658e-05, 'epoch': 0.25} [WARNING|modeling_utils.py:388] 2022-03-01 01:53:10,072 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:53:12,899 >>
Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.516, 'learning_rate': 2.676e-05, 'epoch': 0.25} [WARNING|modeling_utils.py:388] 2022-03-01 01:53:16,757 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9647, 'learning_rate': 2.682e-05, 'epoch': 0.25} [WARNING|modeling_utils.py:388] 2022-03-01 01:53:20,475 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1498, 'learning_rate': 2.688e-05, 'epoch': 0.25} 25%|███████████████████▊ | 453/1784 [22:10<1:05:31, 2.95s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 01:53:27,706 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9839, 'learning_rate': 2.7000000000000002e-05, 'epoch': 0.25} [WARNING|modeling_utils.py:388] 2022-03-01 01:53:31,269 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0633, 'learning_rate': 2.7060000000000002e-05, 'epoch': 0.26} 26%|███████████████████▉ | 456/1784 [22:21<1:14:30, 3.37s/it] {'loss': 4.1811, 'learning_rate': 2.712e-05, 'epoch': 0.26} 26%|███████████████████▉ | 457/1784 [22:24<1:15:34, 3.42s/it]
26%|████████████████████ | 458/1784 [22:28<1:16:19, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:53:43,726 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████ | 459/1784 [22:31<1:16:18, 3.46s/it] {'loss': 3.9451, 'learning_rate': 2.7300000000000003e-05, 'epoch': 0.26} {'loss': 4.1622, 'learning_rate': 2.7360000000000002e-05, 'epoch': 0.26} 26%|████████████████████ | 460/1784 [22:35<1:16:20,
3.46s/it] 26%|████████████████████▏ | 461/1784 [22:38<1:16:20, 3.46s/it] [WARNING|modeling_utils.py:388] 2022-03-01 01:53:55,737 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0575, 'learning_rate': 2.748e-05, 'epoch': 0.26} 26%|████████████████████▏ | 463/1784 [22:45<1:15:44, 3.44s/it] {'loss': 4.1324, 'learning_rate': 2.754e-05, 'epoch': 0.26} 26%|████████████████████▎ | 464/1784 [22:48<1:15:23, 3.43s/it] [WARNING|modeling_utils.py:388]
2022-03-01 01:54:05,952 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1026, 'learning_rate': 2.7660000000000003e-05, 'epoch': 0.26} 26%|████████████████████▎ | 466/1784 [22:55<1:14:39, 3.40s/it] {'loss': 4.3026, 'learning_rate': 2.7720000000000002e-05, 'epoch': 0.26} 26%|████████████████████▍ | 467/1784 [22:59<1:14:30, 3.39s/it] 26%|████████████████████▍ | 468/1784 [23:02<1:14:06, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:54:14,399 >> Could not estimate the number of tokens of the input, floating-point operations will not be
computed {'loss': 4.2504, 'learning_rate': 2.784e-05, 'epoch': 0.26} 26%|████████████████████▍ | 468/1784 [23:02<1:14:06, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:54:14,399 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▌ | 469/1784 [23:05<1:13:37, 3.36s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:54:14,399 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:54:22,616 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:54:14,399 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:54:22,616 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:54:14,399 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1554, 'learning_rate': 2.7960000000000003e-05, 'epoch': 0.26} 26%|████████████████████▌ | 471/1784 [23:12<1:12:31, 3.31s/it]g-point operations will not be computed-01 01:54:14,399 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▌ | 471/1784 [23:12<1:12:31, 3.31s/it]g-point operations will not be computed-01 01:54:14,399 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.34, 'learning_rate': 2.8020000000000003e-05, 'epoch': 0.26} 26%|████████████████████▌ | 471/1784 [23:12<1:12:31, 3.31s/it]g-point operations will not be computed-01 01:54:14,399 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▋ | 472/1784 [23:15<1:12:19, 3.31s/it][WARNING|modeling_utils.py:388] 2022-03-01 
01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▋ | 473/1784 [23:18<1:11:49, 3.29s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▋ | 473/1784 [23:18<1:11:49, 3.29s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1717, 'learning_rate': 2.8139999999999998e-05, 'epoch': 0.27} 27%|████████████████████▋ | 474/1784 [23:22<1:11:27, 3.27s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▋ | 474/1784 [23:22<1:11:27, 3.27s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:54:38,873 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:54:38,873 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2665, 'learning_rate': 2.826e-05, 'epoch': 0.27} 27%|████████████████████▊ | 476/1784 [23:28<1:10:22, 3.23s/it]g-point operations will not be computed-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▊ | 476/1784 [23:28<1:10:22, 3.23s/it]g-point 
operations will not be computed-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:54:45,192 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:54:45,192 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3676, 'learning_rate': 2.838e-05, 'epoch': 0.27} 27%|████████████████████▉ | 478/1784 [23:34<1:09:21, 3.19s/it]g-point operations will not be computed-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▉ | 478/1784 [23:34<1:09:21, 3.19s/it]g-point operations will not be computed-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:54:51,456 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:54:51,456 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3501, 'learning_rate': 2.8499999999999998e-05, 'epoch': 0.27} 27%|████████████████████▉ | 480/1784 [23:41<1:08:45, 3.16s/it]g-point operations will not be computed-01 01:54:30,859 >> Could not estimate the number 
of tokens of the input, floating-point operations will not be computed 27%|████████████████████▉ | 480/1784 [23:41<1:08:45, 3.16s/it]g-point operations will not be computed-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:54:57,653 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:54:57,653 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0092, 'learning_rate': 2.862e-05, 'epoch': 0.27} 27%|█████████████████████ | 482/1784 [23:47<1:07:30, 3.11s/it]g-point operations will not be computed-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████ | 482/1784 [23:47<1:07:30, 3.11s/it]g-point operations will not be computed-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:55:03,711 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:55:03,711 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4792, 'learning_rate': 2.874e-05, 'epoch': 0.27} 
27%|█████████████████████▏ | 484/1784 [23:53<1:05:41, 3.03s/it]g-point operations will not be computed-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▏ | 484/1784 [23:53<1:05:41, 3.03s/it]g-point operations will not be computed-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:55:09,543 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:55:09,543 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:54:30,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4669, 'learning_rate': 2.8859999999999998e-05, 'epoch': 0.27} 27%|█████████████████████▏ | 486/1784 [23:58<1:03:49, 2.95s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:55:13,825 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▏ | 486/1784 [23:58<1:03:49, 2.95s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:55:13,825 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▎ | 487/1784 [24:01<1:02:30, 2.89s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:55:13,825 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▎ | 487/1784 [24:01<1:02:30, 2.89s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:55:13,825 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-01 01:55:17,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:55:13,825 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:55:17,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:55:13,825 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4734, 'learning_rate': 2.904e-05, 'epoch': 0.27} 27%|█████████████████████▉ | 489/1784 [24:06<59:42, 2.77s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▉ | 489/1784 [24:06<59:42, 2.77s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▉ | 490/1784 [24:09<58:20, 2.71s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▉ | 490/1784 [24:09<58:20, 2.71s/it][WARNING|modeling_utils.py:388] 2022-03-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:55:25,492 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:55:25,492 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:55:27,856 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:55:27,856 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:55:30,030 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:55:30,030 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:55:32,042 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:55:32,042 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:55:33,936 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:55:33,936 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:55:35,681 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:55:35,681 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:55:37,240 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:55:37,240 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1813, 'learning_rate': 2.958e-05, 'epoch': 0.28} [WARNING|modeling_utils.py:388] 2022-03-01 01:55:39,819 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 01:55:39,819 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-01 01:55:41,648 >> Batch size = 8aluation *****e number of tokens of the input, floating-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2369] 2022-03-01 01:55:41,648 >> Batch size = 8aluation *****e number of tokens of the input, floating-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 0%| | 0/331 [00:00> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▌ | 2/331 [00:02<06:41, 1.22s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▊ | 3/331 [00:04<09:03, 1.66s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|█ | 4/331 [00:06<10:08, 1.86s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▎ | 5/331 [00:09<11:45, 2.16s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▌ | 6/331 [00:12<12:49, 2.37s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 7/331 [00:14<12:58, 2.40s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|██ | 8/331 [00:17<13:18, 2.47s/it]g-point operations will not be computed-01 
01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▎ | 9/331 [00:20<13:48, 2.57s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▍ | 10/331 [00:23<14:32, 2.72s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▋ | 11/331 [00:25<14:00, 2.63s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▉ | 12/331 [00:28<13:50, 2.60s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 13/331 [00:30<13:31, 2.55s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▍ | 14/331 [00:33<13:31, 2.56s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▋ | 15/331 [00:36<14:52, 2.82s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▉ | 16/331 [00:40<15:46, 3.01s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▏ | 17/331 [00:43<15:55, 3.04s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▍ | 18/331 [00:45<14:39, 2.81s/it]g-point operations will not be computed-01 
01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 19/331 [00:48<14:16, 2.74s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 20/331 [00:50<13:16, 2.56s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|█████▏ | 21/331 [00:53<13:49, 2.68s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|█████▏ | 21/331 [00:53<13:49, 2.68s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|█████▏ | 21/331 [00:53<13:49, 2.68s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 23/331 [01:00<16:23, 3.19s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▉ | 24/331 [01:04<17:10, 3.36s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 25/331 [01:07<16:27, 3.23s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▍ | 26/331 [01:09<15:19, 3.01s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▋ | 27/331 [01:12<15:22, 3.03s/it]g-point operations 
will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▉ | 28/331 [01:15<14:53, 2.95s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▏ | 29/331 [01:18<14:31, 2.88s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▍ | 30/331 [01:20<13:49, 2.76s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▋ | 31/331 [01:23<13:07, 2.63s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 32/331 [01:25<12:54, 2.59s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████▏ | 33/331 [01:28<13:02, 2.63s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████▍ | 34/331 [01:30<13:03, 2.64s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▋ | 35/331 [01:33<13:15, 2.69s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 36/331 [01:36<13:52, 2.82s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|█████████▏ 
| 37/331 [01:40<14:38, 2.99s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|█████████▍ | 38/331 [01:43<14:49, 3.04s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▋ | 39/331 [01:46<14:51, 3.05s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▉ | 40/331 [01:48<13:40, 2.82s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|██████████▏ | 41/331 [01:51<13:03, 2.70s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▍ | 42/331 [01:54<13:56, 2.90s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▋ | 43/331 [01:58<14:38, 3.05s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▉ | 44/331 [02:01<15:04, 3.15s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▏ | 45/331 [02:03<14:12, 2.98s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▍ | 46/331 [02:06<13:05, 2.76s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed 14%|███████████▋ | 47/331 [02:08<12:16, 2.59s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▉ | 48/331 [02:11<12:37, 2.68s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████▏ | 49/331 [02:14<13:16, 2.82s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████▍ | 50/331 [02:17<13:05, 2.80s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████▋ | 51/331 [02:20<13:19, 2.86s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▉ | 52/331 [02:22<12:41, 2.73s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|█████████████▏ | 53/331 [02:25<12:39, 2.73s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|█████████████▍ | 54/331 [02:27<12:10, 2.64s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▋ | 55/331 [02:31<13:14, 2.88s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▊ | 56/331 
[02:33<12:59, 2.84s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|██████████████ | 57/331 [02:36<12:34, 2.75s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▎ | 58/331 [02:39<12:58, 2.85s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▌ | 59/331 [02:41<12:08, 2.68s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▊ | 60/331 [02:44<11:49, 2.62s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|███████████████ | 61/331 [02:47<12:19, 2.74s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▎ | 62/331 [02:50<12:16, 2.74s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▌ | 63/331 [02:53<13:25, 3.00s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▊ | 64/331 [02:56<13:00, 2.92s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|████████████████ | 65/331 [02:59<12:44, 2.87s/it]g-point operations will not be computed-01 01:55:21,805 >> Could 
not estimate the number of tokens of the input, floating-point operations will not be computed 20%|████████████████▎ | 66/331 [03:02<13:49, 3.13s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|████████████████▌ | 67/331 [03:06<14:24, 3.27s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▊ | 68/331 [03:10<14:36, 3.33s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|█████████████████ | 69/331 [03:13<14:11, 3.25s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|█████████████████▎ | 70/331 [03:16<14:00, 3.22s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|█████████████████▌ | 71/331 [03:19<14:01, 3.24s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▊ | 72/331 [03:22<14:02, 3.25s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|██████████████████ | 73/331 [03:25<13:35, 3.16s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|██████████████████▎ | 74/331 [03:28<13:21, 3.12s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed 23%|██████████████████▌ | 75/331 [03:32<13:34, 3.18s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▊ | 76/331 [03:34<12:45, 3.00s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|███████████████████ | 77/331 [03:37<12:24, 2.93s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▎ | 78/331 [03:39<11:51, 2.81s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▌ | 79/331 [03:42<11:28, 2.73s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▊ | 80/331 [03:45<11:17, 2.70s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|████████████████████ | 81/331 [03:48<11:42, 2.81s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████▎ | 82/331 [03:50<11:26, 2.76s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████▌ | 83/331 [03:54<11:52, 2.87s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
25%|████████████████████▊ | 84/331 [03:57<12:40, 3.08s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|█████████████████████ | 85/331 [03:59<11:48, 2.88s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|█████████████████████▎ | 86/331 [04:03<12:22, 3.03s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|█████████████████████▌ | 87/331 [04:06<12:02, 2.96s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▊ | 88/331 [04:08<11:44, 2.90s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|██████████████████████ | 89/331 [04:11<10:59, 2.72s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|██████████████████████▎ | 90/331 [04:13<10:30, 2.62s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|██████████████████████▌ | 91/331 [04:16<10:56, 2.73s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▊ | 92/331 [04:18<10:06, 2.54s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|███████████████████████ 
| 93/331 [04:21<10:14, 2.58s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|███████████████████████▎ | 94/331 [04:24<10:31, 2.67s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▌ | 95/331 [04:26<10:32, 2.68s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▊ | 96/331 [04:29<10:40, 2.73s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|████████████████████████ | 97/331 [04:32<10:19, 2.65s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▎ | 98/331 [04:35<10:38, 2.74s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▌ | 99/331 [04:37<10:35, 2.74s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▍ | 100/331 [04:40<10:12, 2.65s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▋ | 101/331 [04:43<10:07, 2.64s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▉ | 
102/331 [04:46<10:58, 2.88s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▏ | 103/331 [04:48<10:29, 2.76s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▍ | 104/331 [04:51<10:27, 2.77s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|█████████████████████████▋ | 105/331 [04:54<10:28, 2.78s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|█████████████████████████▉ | 106/331 [04:57<10:26, 2.79s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|██████████████████████████▏ | 107/331 [04:59<09:42, 2.60s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|██████████████████████████▍ | 108/331 [05:01<09:25, 2.54s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|██████████████████████████▋ | 109/331 [05:04<09:19, 2.52s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|██████████████████████████▉ | 110/331 [05:07<09:44, 2.64s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
34%|███████████████████████████▏ | 111/331 [05:09<09:45, 2.66s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▍ | 112/331 [05:12<09:48, 2.69s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▋ | 113/331 [05:15<09:23, 2.58s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▉ | 114/331 [05:17<09:27, 2.61s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|████████████████████████████▏ | 115/331 [05:20<09:26, 2.62s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|████████████████████████████▍ | 116/331 [05:23<09:45, 2.72s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|████████████████████████████▋ | 117/331 [05:26<09:44, 2.73s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▉ | 118/331 [05:28<09:31, 2.68s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|█████████████████████████████ | 119/331 [05:31<09:22, 2.65s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 36%|█████████████████████████████▎ | 120/331 [05:33<09:20, 2.65s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▌ | 121/331 [05:37<09:53, 2.83s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▊ | 122/331 [05:39<09:43, 2.79s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████ | 123/331 [05:43<10:25, 3.01s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████▎ | 124/331 [05:46<10:13, 2.96s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▌ | 125/331 [05:49<10:44, 3.13s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▊ | 126/331 [05:52<10:44, 3.14s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|███████████████████████████████ | 127/331 [05:56<11:09, 3.28s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▎ | 128/331 [05:59<11:12, 3.31s/it]g-point operations will not be computed-01 
01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▌ | 129/331 [06:02<10:51, 3.23s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▊ | 130/331 [06:06<10:54, 3.26s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████ | 131/331 [06:09<11:10, 3.35s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▎ | 132/331 [06:12<10:31, 3.17s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▌ | 133/331 [06:15<09:52, 2.99s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▊ | 134/331 [06:17<09:28, 2.89s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|█████████████████████████████████ | 135/331 [06:20<09:35, 2.93s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|█████████████████████████████████▎ | 136/331 [06:24<09:50, 3.03s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
41%|█████████████████████████████████▌ | 137/331 [06:27<10:06, 3.13s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 42%|█████████████████████████████████▊ | 138/331 [06:30<10:19, 3.21s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 42%|██████████████████████████████████ | 139/331 [06:32<09:14, 2.89s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 42%|██████████████████████████████████▎ | 140/331 [06:36<09:54, 3.11s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▌ | 141/331 [06:39<09:33, 3.02s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▋ | 142/331 [06:42<09:16, 2.94s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▉ | 143/331 [06:45<09:41, 3.10s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▏ | 144/331 [06:48<09:15, 2.97s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▍ | 145/331 [06:51<09:09, 2.95s/it]g-point operations will not be computed-01 01:55:21,805 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▋ | 146/331 [06:54<09:35, 3.11s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▉ | 147/331 [06:57<09:13, 3.01s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▏ | 148/331 [06:59<08:38, 2.83s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▍ | 149/331 [07:02<08:10, 2.70s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▋ | 150/331 [07:05<08:34, 2.84s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|████████████████████████████████████▉ | 151/331 [07:08<08:22, 2.79s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▏ | 152/331 [07:10<07:56, 2.66s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▍ | 153/331 [07:13<07:49, 2.64s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
47%|█████████████████████████████████████▋ | 154/331 [07:16<08:09, 2.77s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|█████████████████████████████████████▉ | 155/331 [07:19<08:31, 2.91s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▏ | 156/331 [07:22<08:42, 2.99s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▍ | 157/331 [07:25<09:03, 3.12s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▋ | 158/331 [07:29<09:07, 3.16s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▉ | 159/331 [07:32<09:07, 3.18s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████▏ | 160/331 [07:35<08:35, 3.02s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▍ | 161/331 [07:37<08:25, 2.97s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▋ | 162/331 [07:41<08:50, 3.14s/it]g-point operations will 
not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▉ | 163/331 [07:44<08:58, 3.20s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▏ | 164/331 [07:47<08:29, 3.05s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▍ | 165/331 [07:50<08:16, 2.99s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▌ | 166/331 [07:53<08:06, 2.95s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▊ | 167/331 [07:56<08:17, 3.04s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████ | 168/331 [07:59<07:51, 2.89s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▎ | 169/331 [08:02<07:55, 2.94s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▌ | 170/331 [08:04<07:30, 2.80s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed 52%|█████████████████████████████████████████▊ | 171/331 [08:07<07:24, 2.78s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████ | 172/331 [08:09<07:02, 2.66s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▎ | 173/331 [08:12<07:11, 2.73s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▌ | 174/331 [08:15<06:56, 2.66s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▊ | 175/331 [08:17<07:04, 2.72s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████ | 176/331 [08:20<06:50, 2.65s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████▎ | 177/331 [08:23<07:14, 2.82s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▌ | 178/331 [08:27<07:43, 3.03s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
54%|███████████████████████████████████████████▊ | 179/331 [08:30<07:59, 3.16s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|████████████████████████████████████████████ | 180/331 [08:33<07:52, 3.13s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▎ | 181/331 [08:36<07:42, 3.08s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▌ | 182/331 [08:38<07:02, 2.84s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▊ | 183/331 [08:41<06:29, 2.63s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████ | 184/331 [08:43<06:06, 2.49s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▎ | 185/331 [08:45<05:41, 2.34s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▌ | 186/331 [08:47<05:50, 2.42s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▊ | 
187/331 [08:50<06:20, 2.64s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|██████████████████████████████████████████████ | 188/331 [08:53<06:17, 2.64s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|██████████████████████████████████████████████▎ | 189/331 [08:55<05:59, 2.53s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|██████████████████████████████████████████████▍ | 190/331 [08:58<05:44, 2.45s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 58%|██████████████████████████████████████████████▉ | 192/331 [09:02<05:34, 2.40s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 58%|███████████████████████████████████████████████▏ | 193/331 [09:06<06:03, 2.64s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|███████████████████████████████████████████████▍ | 194/331 [09:08<05:46, 2.53s/it]g-point 
operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|███████████████████████████████████████████████▋ | 195/331 [09:10<05:40, 2.50s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|███████████████████████████████████████████████▉ | 196/331 [09:13<05:44, 2.55s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▏ | 197/331 [09:16<05:58, 2.67s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▍ | 198/331 [09:18<05:39, 2.55s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▋ | 199/331 [09:21<05:42, 2.59s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▉ | 200/331 [09:23<05:24, 2.48s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▏ | 201/331 [09:26<05:21, 2.48s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▍ | 202/331 [09:28<05:28, 2.54s/it]g-point operations will not be 
computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▋ | 203/331 [09:31<05:26, 2.55s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|█████████████████████████████████████████████████▉ | 204/331 [09:34<05:47, 2.73s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▏ | 205/331 [09:37<05:52, 2.80s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▍ | 206/331 [09:40<05:48, 2.79s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|██████████████████████████████████████████████████▋ | 207/331 [09:43<06:03, 2.93s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|██████████████████████████████████████████████████▉ | 208/331 [09:46<06:06, 2.98s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▏ | 209/331 [09:48<05:35, 2.75s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▍ | 210/331 [09:50<05:12, 2.59s/it]g-point operations will not be 
computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▋ | 211/331 [09:53<05:17, 2.65s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▉ | 212/331 [09:56<05:02, 2.54s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|████████████████████████████████████████████████████ | 213/331 [09:58<05:01, 2.56s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▎ | 214/331 [10:00<04:47, 2.46s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▌ | 215/331 [10:02<04:33, 2.36s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▊ | 216/331 [10:06<05:00, 2.62s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████ | 217/331 [10:08<04:58, 2.62s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████▎ | 218/331 [10:11<05:12, 2.76s/it]g-point operations will 
not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████▌ | 219/331 [10:14<05:08, 2.75s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████▊ | 220/331 [10:17<04:55, 2.67s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|██████████████████████████████████████████████████████ | 221/331 [10:19<04:57, 2.71s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|██████████████████████████████████████████████████████▎ | 222/331 [10:22<04:43, 2.60s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|██████████████████████████████████████████████████████▌ | 223/331 [10:25<04:46, 2.65s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▊ | 224/331 [10:27<04:47, 2.68s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████ | 225/331 [10:30<04:43, 2.68s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████▎ | 226/331 [10:33<04:57, 
2.83s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▌ | 227/331 [10:36<04:49, 2.78s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▊ | 228/331 [10:38<04:40, 2.72s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|████████████████████████████████████████████████████████ | 229/331 [10:41<04:35, 2.70s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|████████████████████████████████████████████████████████▎ | 230/331 [10:44<04:26, 2.64s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▌ | 231/331 [10:47<04:34, 2.74s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▊ | 232/331 [10:49<04:27, 2.70s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|█████████████████████████████████████████████████████████ | 233/331 [10:52<04:33, 2.79s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
71%|█████████████████████████████████████████████████████████▎ | 234/331 [10:55<04:19, 2.68s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▌ | 235/331 [10:57<04:07, 2.58s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▊ | 236/331 [11:00<04:34, 2.88s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|█████████████████████████████████████████████████████████▉ | 237/331 [11:04<04:44, 3.03s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▏ | 238/331 [11:07<04:40, 3.01s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▍ | 239/331 [11:10<04:40, 3.05s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|██████████████████████████████████████████████████████████▋ | 240/331 [11:13<04:44, 3.12s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|██████████████████████████████████████████████████████████▉ | 241/331 [11:17<04:47, 3.20s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▏ | 242/331 [11:20<04:47, 3.23s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▍ | 243/331 [11:23<04:46, 3.25s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▋ | 244/331 [11:27<04:49, 3.33s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▉ | 245/331 [11:30<04:39, 3.25s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|████████████████████████████████████████████████████████████▏ | 246/331 [11:34<04:48, 3.39s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|████████████████████████████████████████████████████████████▍ | 247/331 [11:37<04:36, 3.30s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|████████████████████████████████████████████████████████████▋ | 248/331 [11:39<04:15, 3.07s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|████████████████████████████████████████████████████████████▉ | 249/331 [11:42<03:55, 2.87s/it]g-point operations will not be 
computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|█████████████████████████████████████████████████████████████▏ | 250/331 [11:44<03:44, 2.77s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|█████████████████████████████████████████████████████████████▍ | 251/331 [11:47<03:46, 2.83s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|█████████████████████████████████████████████████████████████▋ | 252/331 [11:50<03:34, 2.71s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|█████████████████████████████████████████████████████████████▉ | 253/331 [11:53<03:42, 2.86s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|██████████████████████████████████████████████████████████████▏ | 254/331 [11:55<03:34, 2.79s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|██████████████████████████████████████████████████████████████▍ | 255/331 [11:58<03:39, 2.88s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|██████████████████████████████████████████████████████████████▋ | 256/331 [12:01<03:30, 2.80s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
78%|██████████████████████████████████████████████████████████████▉ | 257/331 [12:04<03:34, 2.89s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|███████████████████████████████████████████████████████████████▏ | 258/331 [12:07<03:21, 2.76s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|███████████████████████████████████████████████████████████████▍ | 259/331 [12:09<03:17, 2.74s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|███████████████████████████████████████████████████████████████▋ | 260/331 [12:12<03:20, 2.82s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|███████████████████████████████████████████████████████████████▊ | 261/331 [12:15<03:05, 2.66s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|████████████████████████████████████████████████████████████████ | 262/331 [12:17<03:03, 2.66s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|████████████████████████████████████████████████████████████████▎ | 263/331 [12:20<03:11, 2.81s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|████████████████████████████████████████████████████████████████▌ | 264/331 [12:23<03:02, 2.73s/it]g-point operations will not be computed-01 01:55:21,805 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|████████████████████████████████████████████████████████████████▊ | 265/331 [12:25<02:55, 2.67s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|█████████████████████████████████████████████████████████████████ | 266/331 [12:28<02:49, 2.61s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|█████████████████████████████████████████████████████████████████▎ | 267/331 [12:31<02:59, 2.80s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|█████████████████████████████████████████████████████████████████▌ | 268/331 [12:34<02:56, 2.79s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|█████████████████████████████████████████████████████████████████▊ | 269/331 [12:37<03:02, 2.94s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|██████████████████████████████████████████████████████████████████ | 270/331 [12:40<02:57, 2.92s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|██████████████████████████████████████████████████████████████████▎ | 271/331 [12:43<03:01, 3.03s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
82%|██████████████████████████████████████████████████████████████████▌ | 272/331 [12:46<02:52, 2.92s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|██████████████████████████████████████████████████████████████████▊ | 273/331 [12:49<02:51, 2.96s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 83%|███████████████████████████████████████████████████████████████████ | 274/331 [12:53<02:57, 3.11s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 83%|███████████████████████████████████████████████████████████████████▎ | 275/331 [12:56<02:56, 3.14s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 83%|███████████████████████████████████████████████████████████████████▌ | 276/331 [12:58<02:42, 2.95s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|███████████████████████████████████████████████████████████████████▊ | 277/331 [13:01<02:34, 2.87s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|████████████████████████████████████████████████████████████████████ | 278/331 [13:04<02:30, 2.84s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|████████████████████████████████████████████████████████████████████▎ | 279/331 [13:07<02:40, 3.09s/it]g-point operations will not be 
computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 85%|████████████████████████████████████████████████████████████████████▌ | 280/331 [13:10<02:34, 3.03s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 85%|████████████████████████████████████████████████████████████████████▊ | 281/331 [13:14<02:35, 3.10s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 85%|█████████████████████████████████████████████████████████████████████ | 282/331 [13:17<02:32, 3.11s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 85%|█████████████████████████████████████████████████████████████████████▎ | 283/331 [13:20<02:33, 3.19s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|█████████████████████████████████████████████████████████████████████▍ | 284/331 [13:24<02:35, 3.30s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|█████████████████████████████████████████████████████████████████████▋ | 285/331 [13:27<02:34, 3.35s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|█████████████████████████████████████████████████████████████████████▉ | 286/331 [13:31<02:31, 3.37s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed 87%|██████████████████████████████████████████████████████████████████████▏ | 287/331 [13:34<02:32, 3.46s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|██████████████████████████████████████████████████████████████████████▍ | 288/331 [13:38<02:27, 3.42s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|██████████████████████████████████████████████████████████████████████▋ | 289/331 [13:40<02:15, 3.23s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 88%|██████████████████████████████████████████████████████████████████████▉ | 290/331 [13:43<02:05, 3.07s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 88%|███████████████████████████████████████████████████████████████████████▏ | 291/331 [13:46<01:56, 2.92s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 88%|███████████████████████████████████████████████████████████████████████▍ | 292/331 [13:48<01:50, 2.83s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 89%|███████████████████████████████████████████████████████████████████████▋ | 293/331 [13:51<01:46, 2.81s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 89%|███████████████████████████████████████████████████████████████████████▉ | 294/331 [13:53<01:39, 
2.68s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 89%|████████████████████████████████████████████████████████████████████████▏ | 295/331 [13:56<01:33, 2.60s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 89%|████████████████████████████████████████████████████████████████████████▍ | 296/331 [13:58<01:28, 2.52s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 90%|████████████████████████████████████████████████████████████████████████▋ | 297/331 [14:02<01:35, 2.81s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 90%|████████████████████████████████████████████████████████████████████████▉ | 298/331 [14:05<01:40, 3.05s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 90%|█████████████████████████████████████████████████████████████████████████▏ | 299/331 [14:08<01:34, 2.96s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 91%|█████████████████████████████████████████████████████████████████████████▍ | 300/331 [14:11<01:31, 2.95s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 91%|█████████████████████████████████████████████████████████████████████████▋ | 301/331 [14:14<01:27, 2.91s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number 
of tokens of the input, floating-point operations will not be computed 91%|█████████████████████████████████████████████████████████████████████████▉ | 302/331 [14:16<01:22, 2.86s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 92%|██████████████████████████████████████████████████████████████████████████▏ | 303/331 [14:19<01:16, 2.75s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 92%|██████████████████████████████████████████████████████████████████████████▍ | 304/331 [14:22<01:16, 2.82s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 92%|██████████████████████████████████████████████████████████████████████████▋ | 305/331 [14:25<01:15, 2.91s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 92%|██████████████████████████████████████████████████████████████████████████▉ | 306/331 [14:29<01:17, 3.08s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 93%|███████████████████████████████████████████████████████████████████████████▏ | 307/331 [14:32<01:17, 3.23s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 93%|███████████████████████████████████████████████████████████████████████████▎ | 308/331 [14:36<01:18, 3.40s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
93%|███████████████████████████████████████████████████████████████████████████▌ | 309/331 [14:39<01:15, 3.43s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 94%|███████████████████████████████████████████████████████████████████████████▊ | 310/331 [14:42<01:06, 3.18s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 94%|████████████████████████████████████████████████████████████████████████████ | 311/331 [14:45<01:03, 3.19s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 94%|████████████████████████████████████████████████████████████████████████████▎ | 312/331 [14:48<00:56, 2.99s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 95%|████████████████████████████████████████████████████████████████████████████▌ | 313/331 [14:51<00:52, 2.93s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 95%|████████████████████████████████████████████████████████████████████████████▊ | 314/331 [14:54<00:50, 2.98s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 95%|█████████████████████████████████████████████████████████████████████████████ | 315/331 [14:57<00:49, 3.09s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
95%|█████████████████████████████████████████████████████████████████████████████▎ | 316/331 [15:00<00:46, 3.11s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 96%|█████████████████████████████████████████████████████████████████████████████▌ | 317/331 [15:04<00:45, 3.24s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 96%|█████████████████████████████████████████████████████████████████████████████▊ | 318/331 [15:06<00:39, 3.04s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 96%|██████████████████████████████████████████████████████████████████████████████ | 319/331 [15:09<00:34, 2.88s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 97%|██████████████████████████████████████████████████████████████████████████████▎ | 320/331 [15:12<00:31, 2.90s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 97%|██████████████████████████████████████████████████████████████████████████████▌ | 321/331 [15:15<00:28, 2.88s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 97%|██████████████████████████████████████████████████████████████████████████████▊ | 322/331 [15:18<00:27, 3.03s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
98%|███████████████████████████████████████████████████████████████████████████████ | 323/331 [15:21<00:23, 2.95s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 98%|███████████████████████████████████████████████████████████████████████████████▎ | 324/331 [15:24<00:21, 3.05s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 98%|███████████████████████████████████████████████████████████████████████████████▌ | 325/331 [15:27<00:18, 3.07s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 98%|███████████████████████████████████████████████████████████████████████████████▊ | 326/331 [15:30<00:15, 3.11s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 99%|████████████████████████████████████████████████████████████████████████████████ | 327/331 [15:33<00:12, 3.11s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 99%|████████████████████████████████████████████████████████████████████████████████▎| 328/331 [15:37<00:09, 3.16s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 99%|████████████████████████████████████████████████████████████████████████████████▌| 329/331 [15:40<00:06, 3.07s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
100%|████████████████████████████████████████████████████████████████████████████████▊| 330/331 [15:43<00:03, 3.25s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 100%|█████████████████████████████████████████████████████████████████████████████████| 331/331 [15:45<00:00, 2.83s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 100%|█████████████████████████████████████████████████████████████████████████████████| 331/331 [15:45<00:00, 2.83s/it]g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 03/01/2022 02:11:30 - INFO - datasets.metric - Removing /home/sanchit_huggingface_co/.cache/huggingface/metrics/wer/default/default_experiment-1-0.arrow [INFO|configuration_utils.py:438] 2022-03-01 02:11:30,432 >> Configuration saved in ./checkpoint-500/config.json g-point operations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|feature_extraction_utils.py:324] 2022-03-01 02:11:46,345 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonerations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|feature_extraction_utils.py:324] 2022-03-01 02:11:46,345 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonerations will not be computed-01 01:55:21,805 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|feature_extraction_utils.py:324] 2022-03-01 02:11:46,345 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonerations will not be computed-01 01:55:21,805 
>> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[INFO|feature_extraction_utils.py:324] 2022-03-01 02:11:46,345 >> Configuration saved in ./checkpoint-500/preprocessor_config.json
03/01/2022 02:13:19 - WARNING - huggingface_hub.repository - Adding files tracked by Git LFS: ['wandb/run-20220228_223243-2ay2wvge/run-2ay2wvge.wandb', 'wandb/run-20220228_231357-3lq2qpez/run-3lq2qpez.wandb', 'wandb/run-20220301_002446-2vmlu6y4/run-2vmlu6y4.wandb', 'wandb/run-20220301_013112-3e2necnj/run-3e2necnj.wandb']. This may take a bit of time if the files are large.
28%|█████████████████████ | 501/1784 [42:37<116:55:25, 328.08s/it]
{'loss': 4.3698, 'learning_rate': 2.982e-05, 'epoch': 0.28}
28%|█████████████████████▍ | 502/1784 [42:41<82:11:42, 230.81s/it]
{'loss': 4.0329, 'learning_rate': 2.9880000000000002e-05, 'epoch': 0.28}
28%|█████████████████████▍ | 503/1784 [42:45<57:53:08, 162.68s/it]
28%|█████████████████████▍ | 504/1784 [42:48<40:53:02, 114.99s/it]
28%|█████████████████████▊ | 505/1784 [42:52<28:59:36, 81.61s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:14:08,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
28%|█████████████████████▊ | 506/1784 [42:56<20:40:25, 58.24s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:14:08,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.3151, 'learning_rate': 2.9953271028037384e-05, 'epoch': 0.28}
28%|█████████████████████▉ | 507/1784 [43:00<14:50:47, 41.85s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:14:08,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1362, 'learning_rate': 2.9929906542056074e-05, 'epoch': 0.28}
28%|█████████████████████▉ | 508/1784 [43:03<10:46:24, 30.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:14:08,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1223, 'learning_rate': 2.9906542056074768e-05, 'epoch': 0.28}
29%|██████████████████████▎ | 509/1784 [43:07<7:56:39, 22.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:14:08,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:14:24,847 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:14:24,847 >> Could not estimate the
number of tokens of the input, floating-point operations will not be computed-01 02:14:08,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4417, 'learning_rate': 2.985981308411215e-05, 'epoch': 0.29} 29%|██████████████████████▎ | 511/1784 [43:14<4:32:02, 12.82s/it]g-point operations will not be computed-01 02:14:08,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▎ | 511/1784 [43:14<4:32:02, 12.82s/it]g-point operations will not be computed-01 02:14:08,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.299, 'learning_rate': 2.983644859813084e-05, 'epoch': 0.29} 29%|██████████████████████▍ | 512/1784 [43:18<3:32:42, 10.03s/it]g-point operations will not be computed-01 02:14:08,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▍ | 512/1784 [43:18<3:32:42, 10.03s/it]g-point operations will not be computed-01 02:14:08,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1538, 'learning_rate': 2.9813084112149534e-05, 'epoch': 0.29} 29%|██████████████████████▍ | 512/1784 [43:18<3:32:42, 10.03s/it]g-point operations will not be computed-01 02:14:08,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▍ | 513/1784 [43:21<2:51:23, 8.09s/it]g-point operations will not be computed-01 02:14:08,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▍ | 513/1784 [43:21<2:51:23, 8.09s/it]g-point operations will not be computed-01 02:14:08,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
29%|██████████████████████▍ | 513/1784 [43:21<2:51:23, 8.09s/it]g-point operations will not be computed-01 02:14:08,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▍ | 514/1784 [43:25<2:22:44, 6.74s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:14:40,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▌ | 515/1784 [43:29<2:03:16, 5.83s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:14:40,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▌ | 515/1784 [43:29<2:03:16, 5.83s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:14:40,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1283, 'learning_rate': 2.9742990654205608e-05, 'epoch': 0.29} 29%|██████████████████████▌ | 516/1784 [43:32<1:49:49, 5.20s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:14:40,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▌ | 516/1784 [43:32<1:49:49, 5.20s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:14:40,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3935, 'learning_rate': 2.97196261682243e-05, 'epoch': 0.29} 29%|██████████████████████▌ | 516/1784 [43:32<1:49:49, 5.20s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:14:40,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▌ | 517/1784 [43:36<1:38:30, 4.66s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:14:40,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:14:53,309 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:14:40,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:14:53,309 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:14:40,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5719, 'learning_rate': 2.9672897196261685e-05, 'epoch': 0.29} 29%|██████████████████████▋ | 519/1784 [43:43<1:24:56, 4.03s/it]g-point operations will not be computed-01 02:14:40,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▋ | 519/1784 [43:43<1:24:56, 4.03s/it]g-point operations will not be computed-01 02:14:40,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3025, 'learning_rate': 2.9649532710280375e-05, 'epoch': 0.29} 29%|██████████████████████▋ | 519/1784 [43:43<1:24:56, 4.03s/it]g-point operations will not be computed-01 02:14:40,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▋ | 520/1784 [43:46<1:20:56, 3.84s/it]g-point operations will not be computed-01 02:14:40,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:15:03,492 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:14:40,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:15:03,492 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:14:40,928 
>> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9659, 'learning_rate': 2.9602803738317758e-05, 'epoch': 0.29} 29%|██████████████████████▊ | 522/1784 [43:53<1:15:16, 3.58s/it]g-point operations will not be computed-01 02:14:40,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▊ | 522/1784 [43:53<1:15:16, 3.58s/it]g-point operations will not be computed-01 02:14:40,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1495, 'learning_rate': 2.957943925233645e-05, 'epoch': 0.29} 29%|██████████████████████▊ | 522/1784 [43:53<1:15:16, 3.58s/it]g-point operations will not be computed-01 02:14:40,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▊ | 523/1784 [43:56<1:13:49, 3.51s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▉ | 524/1784 [43:59<1:12:32, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▉ | 524/1784 [43:59<1:12:32, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3021, 'learning_rate': 2.953271028037383e-05, 'epoch': 0.29} 29%|██████████████████████▉ | 525/1784 [44:03<1:11:23, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▉ | 525/1784 [44:03<1:11:23, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 
02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:15:19,951 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:15:19,951 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.289, 'learning_rate': 2.9485981308411218e-05, 'epoch': 0.29} 30%|███████████████████████ | 527/1784 [44:09<1:09:38, 3.32s/it]g-point operations will not be computed-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████ | 527/1784 [44:09<1:09:38, 3.32s/it]g-point operations will not be computed-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.425, 'learning_rate': 2.9462616822429905e-05, 'epoch': 0.3} 30%|███████████████████████ | 527/1784 [44:09<1:09:38, 3.32s/it]g-point operations will not be computed-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████ | 528/1784 [44:12<1:09:37, 3.33s/it]g-point operations will not be computed-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:15:29,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-01 02:15:29,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3364, 'learning_rate': 2.941588785046729e-05, 'epoch': 0.3} [WARNING|modeling_utils.py:388] 2022-03-01 02:15:29,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▏ | 530/1784 [44:19<1:08:06, 3.26s/it]g-point operations will not be computed-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:15:36,124 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:15:36,124 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1333, 'learning_rate': 2.936915887850467e-05, 'epoch': 0.3} 30%|███████████████████████▎ | 532/1784 [44:25<1:06:52, 3.20s/it]g-point operations will not be computed-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▎ | 532/1784 [44:25<1:06:52, 3.20s/it]g-point operations will not be computed-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:15:42,365 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:15:42,365 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0705, 'learning_rate': 2.932242990654206e-05, 'epoch': 0.3} 30%|███████████████████████▎ | 534/1784 [44:31<1:04:55, 3.12s/it]g-point operations will not be computed-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▎ | 534/1784 [44:31<1:04:55, 3.12s/it]g-point operations will not be computed-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:15:48,269 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:15:48,269 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3871, 'learning_rate': 2.927570093457944e-05, 'epoch': 0.3} [WARNING|modeling_utils.py:388] 2022-03-01 02:15:48,269 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:15:11,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▍ | 536/1784 [44:37<1:02:27, 
3.00s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:15:52,628 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▍ | 537/1784 [44:40<1:01:19, 2.95s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:15:52,628 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▍ | 537/1784 [44:40<1:01:19, 2.95s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:15:52,628 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:15:56,731 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:15:52,628 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:15:56,731 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:15:52,628 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4381, 'learning_rate': 2.9205607476635515e-05, 'epoch': 0.3} [WARNING|modeling_utils.py:388] 2022-03-01 02:15:56,731 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:15:52,628 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▏ | 539/1784 [44:45<58:40, 2.83s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:00,761 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▏ | 540/1784 [44:48<56:58, 2.75s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:00,761 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
30%|████████████████████████▏ | 540/1784 [44:48<56:58, 2.75s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:00,761 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:16:04,379 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:16:00,761 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:16:04,379 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:16:00,761 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5107, 'learning_rate': 2.913551401869159e-05, 'epoch': 0.3} [WARNING|modeling_utils.py:388] 2022-03-01 02:16:04,379 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:16:00,761 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▎ | 543/1784 [44:55<50:18, 2.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:07,834 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▎ | 543/1784 [44:55<50:18, 2.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:07,834 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▎ | 543/1784 [44:55<50:18, 2.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:09,927 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▎ | 543/1784 [44:55<50:18, 2.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:09,927 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed 31%|████████████████████████▍ | 545/1784 [44:59<44:56, 2.18s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:11,882 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▍ | 545/1784 [44:59<44:56, 2.18s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:11,882 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▍ | 546/1784 [45:00<41:48, 2.03s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:15,246 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▍ | 546/1784 [45:00<41:48, 2.03s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:15,246 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▌ | 547/1784 [45:02<38:25, 1.86s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:16,649 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▌ | 547/1784 [45:02<38:25, 1.86s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:16,649 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7226, 'learning_rate': 2.899532710280374e-05, 'epoch': 0.31} 31%|████████████████████████▌ | 549/1784 [45:04<31:54, 1.55s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:19,056 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▌ | 549/1784 [45:04<31:54, 1.55s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:19,056 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▋ | 550/1784 [45:06<32:44, 1.59s/it][WARNING|modeling_utils.py:388] 
2022-03-01 02:16:19,056 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▋ | 550/1784 [45:06<32:44, 1.59s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:19,056 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▋ | 550/1784 [45:06<32:44, 1.59s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:22,189 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▋ | 551/1784 [45:10<47:17, 2.30s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:22,189 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▋ | 551/1784 [45:10<47:17, 2.30s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:22,189 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▋ | 551/1784 [45:10<47:17, 2.30s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▊ | 552/1784 [45:14<56:54, 2.77s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▊ | 552/1784 [45:14<56:54, 2.77s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.484, 'learning_rate': 2.8878504672897196e-05, 'epoch': 0.31} 31%|████████████████████████▊ | 552/1784 [45:14<56:54, 2.77s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
31%|████████████████████████▏ | 553/1784 [45:17<1:02:03, 3.02s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:16:35,139 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:16:35,139 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1946, 'learning_rate': 2.8831775700934582e-05, 'epoch': 0.31} 31%|████████████████████████▎ | 555/1784 [45:25<1:07:43, 3.31s/it]g-point operations will not be computed-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▎ | 555/1784 [45:25<1:07:43, 3.31s/it]g-point operations will not be computed-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.105, 'learning_rate': 2.8808411214953272e-05, 'epoch': 0.31} 31%|████████████████████████▎ | 556/1784 [45:28<1:09:15, 3.38s/it]g-point operations will not be computed-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▎ | 556/1784 [45:28<1:09:15, 3.38s/it]g-point operations will not be computed-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2699, 'learning_rate': 2.8785046728971962e-05, 'epoch': 0.31} 31%|████████████████████████▎ | 556/1784 [45:28<1:09:15, 3.38s/it]g-point operations will 
not be computed-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▎ | 557/1784 [45:32<1:10:10, 3.43s/it]g-point operations will not be computed-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:16:49,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:16:49,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.108, 'learning_rate': 2.873831775700935e-05, 'epoch': 0.31} 31%|████████████████████████▍ | 559/1784 [45:39<1:11:21, 3.49s/it]g-point operations will not be computed-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▍ | 559/1784 [45:39<1:11:21, 3.49s/it]g-point operations will not be computed-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.065, 'learning_rate': 2.8714953271028036e-05, 'epoch': 0.31} 31%|████████████████████████▍ | 560/1784 [45:42<1:11:46, 3.52s/it]g-point operations will not be computed-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▍ | 560/1784 [45:42<1:11:46, 3.52s/it]g-point operations will not be computed-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-01 02:16:59,981 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:16:59,981 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3704, 'learning_rate': 2.8668224299065423e-05, 'epoch': 0.31} 32%|████████████████████████▌ | 562/1784 [45:49<1:10:40, 3.47s/it]g-point operations will not be computed-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|████████████████████████▌ | 562/1784 [45:49<1:10:40, 3.47s/it]g-point operations will not be computed-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.275, 'learning_rate': 2.8644859813084116e-05, 'epoch': 0.32} 32%|████████████████████████▌ | 563/1784 [45:53<1:10:09, 3.45s/it]g-point operations will not be computed-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|████████████████████████▌ | 563/1784 [45:53<1:10:09, 3.45s/it]g-point operations will not be computed-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9922, 'learning_rate': 2.8621495327102803e-05, 'epoch': 0.32} 32%|████████████████████████▌ | 563/1784 [45:53<1:10:09, 3.45s/it]g-point operations will not be computed-01 02:16:26,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|████████████████████████▋ | 564/1784 [45:56<1:09:58, 
3.44s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:17:11,960 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
32%|████████████████████████▋ | 565/1784 [45:59<1:09:31, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:17:11,960 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.4709, 'learning_rate': 2.857476635514019e-05, 'epoch': 0.32}
32%|████████████████████████▋ | 566/1784 [46:03<1:09:12, 3.41s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:17:11,960 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:17:20,334 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2284, 'learning_rate': 2.852803738317757e-05, 'epoch': 0.32}
32%|████████████████████████▊ | 568/1784 [46:10<1:09:04, 3.41s/it]
{'loss': 4.4302, 'learning_rate': 2.8504672897196263e-05, 'epoch': 0.32}
32%|████████████████████████▉ | 569/1784 [46:13<1:08:33, 3.39s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:17:30,391 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.4577, 'learning_rate': 2.8457943925233646e-05, 'epoch': 0.32}
32%|████████████████████████▉ | 571/1784 [46:20<1:07:18, 3.33s/it]
{'loss': 4.1258, 'learning_rate': 2.8434579439252336e-05, 'epoch': 0.32}
32%|█████████████████████████ | 572/1784 [46:23<1:06:52, 3.31s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:17:38,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
32%|█████████████████████████ | 573/1784 [46:26<1:06:10, 3.28s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:17:38,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1741, 'learning_rate': 2.838785046728972e-05, 'epoch': 0.32}
32%|█████████████████████████ | 574/1784 [46:29<1:05:38, 3.25s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:17:38,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:17:46,503 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1338, 'learning_rate': 2.8341121495327103e-05, 'epoch': 0.32}
32%|█████████████████████████▏ | 576/1784 [46:36<1:04:29, 3.20s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:17:52,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2055, 'learning_rate': 2.8294392523364487e-05, 'epoch': 0.32}
32%|█████████████████████████▎ | 578/1784 [46:42<1:03:13, 3.15s/it]
{'loss': 4.0812, 'learning_rate': 2.827102803738318e-05, 'epoch': 0.32}
32%|█████████████████████████▎ | 579/1784 [46:45<1:02:54, 3.13s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:18:00,500 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|█████████████████████████▎ | 580/1784 [46:48<1:02:30, 3.11s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:18:00,500 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.4458, 'learning_rate': 2.822429906542056e-05, 'epoch': 0.33}
33%|█████████████████████████▍ | 581/1784 [46:51<1:01:43, 3.08s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:18:06,520 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|█████████████████████████▍ | 582/1784 [46:54<1:01:01, 3.05s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:18:06,520 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:18:10,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:18:13,623 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|██████████████████████████▏ | 585/1784 [47:02<58:03, 2.90s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:18:19,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.218, 'learning_rate': 2.8084112149532714e-05, 'epoch': 0.33}
33%|██████████████████████████▎ | 587/1784 [47:08<56:11, 2.82s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:18:23,285 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|██████████████████████████▎ | 588/1784 [47:10<54:52, 2.75s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:18:23,285 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:18:27,022 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:18:29,455 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:18:31,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1973, 'learning_rate': 2.7967289719626167e-05, 'epoch': 0.33}
33%|██████████████████████████▌ | 592/1784 [47:20<48:01, 2.42s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:18:35,059 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|██████████████████████████▌ | 593/1784 [47:22<45:58, 2.32s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:18:37,073 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|██████████████████████████▋ | 595/1784 [47:26<40:52, 2.06s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:18:38,908 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|██████████████████████████▋ | 596/1784 [47:27<37:59, 1.92s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:18:40,572 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
33%|██████████████████████████▊ | 597/1784 [47:29<34:53, 1.76s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:18:43,410 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.5597, 'learning_rate': 2.782710280373832e-05, 'epoch': 0.33}
34%|██████████████████████████▊ | 599/1784 [47:31<29:25, 1.49s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:18:45,767 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
34%|██████████████████████████▉ | 600/1784 [47:33<30:07, 1.53s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:18:45,767 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
34%|██████████████████████████▉ | 600/1784 [47:33<30:07, 1.53s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:18:48,803 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
34%|██████████████████████████▉ | 601/1784 [47:37<43:53, 2.23s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:18:48,803 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
34%|██████████████████████████▉ | 601/1784 [47:37<43:53, 2.23s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:18:52,569 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
34%|██████████████████████████▉ | 602/1784 [47:40<52:26, 2.66s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:18:52,569 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.4654, 'learning_rate': 2.7710280373831777e-05, 'epoch': 0.34}
34%|███████████████████████████ | 603/1784 [47:44<58:19, 2.96s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:18:59,867 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
34%|██████████████████████████▍ | 604/1784 [47:48<1:02:23, 3.17s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:18:59,867 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2439, 'learning_rate': 2.766355140186916e-05, 'epoch': 0.34}
34%|██████████████████████████▍ | 605/1784 [47:51<1:04:33, 3.29s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:18:59,867 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.072, 'learning_rate': 2.764018691588785e-05, 'epoch': 0.34}
34%|██████████████████████████▍ | 606/1784 [47:55<1:05:57, 3.36s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:18:59,867 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:19:12,279 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1668, 'learning_rate': 2.7593457943925234e-05, 'epoch': 0.34}
34%|██████████████████████████▌ | 608/1784 [48:02<1:07:44, 3.46s/it]
{'loss': 4.212, 'learning_rate': 2.7570093457943924e-05, 'epoch': 0.34}
34%|██████████████████████████▋ | 609/1784 [48:05<1:08:12, 3.48s/it]
{'loss': 4.4345, 'learning_rate': 2.7546728971962618e-05, 'epoch': 0.34}
34%|██████████████████████████▋ | 610/1784 [48:09<1:08:02, 3.48s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:19:26,347 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.4643, 'learning_rate': 2.75e-05, 'epoch': 0.34}
34%|██████████████████████████▊ | 612/1784 [48:16<1:07:50, 3.47s/it]
{'loss': 4.1591, 'learning_rate': 2.747663551401869e-05, 'epoch': 0.34}
34%|██████████████████████████▊ | 613/1784 [48:19<1:07:33, 3.46s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:19:36,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1714, 'learning_rate': 2.7429906542056078e-05, 'epoch': 0.34}
34%|██████████████████████████▉ | 615/1784 [48:26<1:06:25, 3.41s/it]
{'loss': 4.3276, 'learning_rate': 2.7406542056074764e-05, 'epoch': 0.34}
35%|██████████████████████████▉ | 616/1784 [48:29<1:05:50, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:19:44,978 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
35%|██████████████████████████▉ | 617/1784 [48:32<1:05:31, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:19:44,978 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1613, 'learning_rate': 2.735981308411215e-05, 'epoch': 0.35}
35%|███████████████████████████ | 618/1784 [48:36<1:05:03, 3.35s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:19:44,978 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:19:53,186 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.0714, 'learning_rate': 2.731308411214953e-05, 'epoch': 0.35}
35%|███████████████████████████ | 620/1784 [48:42<1:04:23, 3.32s/it]
{'loss': 4.3086, 'learning_rate': 2.7289719626168225e-05, 'epoch': 0.35}
35%|███████████████████████████▏ | 621/1784 [48:46<1:03:55, 3.30s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:20:02,986 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.3431, 'learning_rate': 2.7242990654205608e-05, 'epoch': 0.35}
35%|███████████████████████████▏ | 623/1784 [48:52<1:03:04, 3.26s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:20:09,371 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.5547, 'learning_rate': 2.719626168224299e-05, 'epoch': 0.35}
35%|███████████████████████████▎ | 625/1784 [48:58<1:02:18, 3.23s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:20:15,745 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1948, 'learning_rate': 2.7149532710280375e-05, 'epoch': 0.35}
35%|███████████████████████████▍ | 627/1784 [49:05<1:01:10, 3.17s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:20:21,937 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
35%|███████████████████████████▌ | 629/1784 [49:11<1:00:22, 3.14s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:20:28,067 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1649, 'learning_rate': 2.705607476635514e-05, 'epoch': 0.35}
35%|████████████████████████████▎ | 631/1784 [49:17<59:11, 3.08s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:20:34,042 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2827, 'learning_rate': 2.7009345794392525e-05, 'epoch': 0.35}
35%|████████████████████████████▍ | 633/1784 [49:23<57:44, 3.01s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:20:38,471 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
36%|████████████████████████████▍ | 634/1784 [49:26<56:46, 2.96s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:20:38,471 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:20:42,688 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2469, 'learning_rate': 2.69392523364486e-05, 'epoch': 0.36}
36%|████████████████████████████▌ | 636/1784 [49:31<55:16, 2.89s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:20:48,173 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
36%|████████████████████████████▌ | 638/1784 [49:37<53:09, 2.78s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:20:52,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
36%|████████████████████████████▋ | 639/1784 [49:39<51:47, 2.71s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:20:52,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:20:55,829 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:20:58,168 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:21:00,413 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:21:02,499 >> Could not estimate the number of tokens of the input, floating-point operations will not be
computed-01 02:20:52,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:21:02,499 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:20:52,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:21:04,493 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:20:52,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:21:04,493 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:20:52,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:21:06,320 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:20:52,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:21:06,320 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:20:52,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:21:07,982 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:20:52,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:21:07,982 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:20:52,190 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:21:09,563 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:20:52,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:21:09,563 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:20:52,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.3826, 'learning_rate': 2.663551401869159e-05, 'epoch': 0.36} [WARNING|modeling_utils.py:388] 2022-03-01 02:21:12,209 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:20:52,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:21:12,209 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:20:52,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:21:13,918 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:20:52,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:21:13,918 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:20:52,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:21:13,918 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed-01 02:20:52,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:21:17,772 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:20:52,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:21:17,772 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:20:52,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:21:17,772 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:20:52,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▏ | 652/1784 [50:07<51:08, 2.71s/it]g-point operations will not be computed-01 02:20:52,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▏ | 652/1784 [50:07<51:08, 2.71s/it]g-point operations will not be computed-01 02:20:52,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▏ | 652/1784 [50:07<51:08, 2.71s/it]g-point operations will not be computed-01 02:20:52,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▎ | 653/1784 [50:11<56:13, 2.98s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:26,936 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▎ | 654/1784 [50:15<59:36, 3.16s/it][WARNING|modeling_utils.py:388] 2022-03-01 
02:21:26,936 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▎ | 654/1784 [50:15<59:36, 3.16s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:26,936 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1909, 'learning_rate': 2.649532710280374e-05, 'epoch': 0.37} 37%|█████████████████████████████▎ | 654/1784 [50:15<59:36, 3.16s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:26,936 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|████████████████████████████▋ | 655/1784 [50:18<1:01:53, 3.29s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:26,936 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|████████████████████████████▋ | 655/1784 [50:18<1:01:53, 3.29s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:26,936 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|████████████████████████████▋ | 655/1784 [50:18<1:01:53, 3.29s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:26,936 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|████████████████████████████▋ | 656/1784 [50:22<1:03:04, 3.36s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:26,936 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|████████████████████████████▋ | 656/1784 [50:22<1:03:04, 3.36s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:26,936 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|████████████████████████████▋ | 656/1784 [50:22<1:03:04, 3.36s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:26,936 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed 37%|████████████████████████████▋ | 657/1784 [50:25<1:03:46, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:41,085 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|████████████████████████████▋ | 657/1784 [50:25<1:03:46, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:41,085 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|████████████████████████████▊ | 658/1784 [50:29<1:04:35, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:41,085 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|████████████████████████████▊ | 658/1784 [50:29<1:04:35, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:41,085 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|████████████████████████████▊ | 658/1784 [50:29<1:04:35, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:41,085 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|████████████████████████████▊ | 659/1784 [50:32<1:05:18, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:41,085 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|████████████████████████████▊ | 659/1784 [50:32<1:05:18, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:41,085 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|████████████████████████████▊ | 659/1784 [50:32<1:05:18, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:41,085 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|████████████████████████████▊ | 660/1784 [50:36<1:05:21, 3.49s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:41,085 
>> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|████████████████████████████▊ | 660/1784 [50:36<1:05:21, 3.49s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:41,085 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|████████████████████████████▊ | 660/1784 [50:36<1:05:21, 3.49s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:41,085 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|████████████████████████████▉ | 661/1784 [50:39<1:05:14, 3.49s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:55,108 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|████████████████████████████▉ | 661/1784 [50:39<1:05:14, 3.49s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:55,108 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|████████████████████████████▉ | 662/1784 [50:43<1:04:43, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:55,108 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|████████████████████████████▉ | 662/1784 [50:43<1:04:43, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:55,108 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|████████████████████████████▉ | 663/1784 [50:46<1:04:09, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:55,108 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|████████████████████████████▉ | 663/1784 [50:46<1:04:09, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:21:55,108 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3017, 'learning_rate': 
2.6285046728971963e-05, 'epoch': 0.37} 37%|█████████████████████████████ | 664/1784 [50:49<1:04:05, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:22:05,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████ | 664/1784 [50:49<1:04:05, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:22:05,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████ | 665/1784 [50:53<1:03:46, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:22:05,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████ | 665/1784 [50:53<1:03:46, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:22:05,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1274, 'learning_rate': 2.6238317757009346e-05, 'epoch': 0.37} 37%|█████████████████████████████ | 666/1784 [50:56<1:03:32, 3.41s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:22:05,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████ | 666/1784 [50:56<1:03:32, 3.41s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:22:05,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:22:13,682 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:22:05,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:22:13,682 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:22:05,317 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed {'loss': 4.2019, 'learning_rate': 2.619158878504673e-05, 'epoch': 0.37} [WARNING|modeling_utils.py:388] 2022-03-01 02:22:13,682 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:22:05,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▏ | 668/1784 [51:03<1:03:00, 3.39s/it]g-point operations will not be computed-01 02:22:05,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▏ | 668/1784 [51:03<1:03:00, 3.39s/it]g-point operations will not be computed-01 02:22:05,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▏ | 668/1784 [51:03<1:03:00, 3.39s/it]g-point operations will not be computed-01 02:22:05,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|█████████████████████████████▎ | 669/1784 [51:06<1:02:20, 3.35s/it]g-point operations will not be computed-01 02:22:05,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|█████████████████████████████▎ | 669/1784 [51:06<1:02:20, 3.35s/it]g-point operations will not be computed-01 02:22:05,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:22:23,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:22:05,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:22:23,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed-01 02:22:05,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:22:23,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:22:05,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|█████████████████████████████▎ | 671/1784 [51:13<1:01:41, 3.33s/it]g-point operations will not be computed-01 02:22:05,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:22:30,219 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:22:05,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:22:30,219 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:22:05,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5609, 'learning_rate': 2.6074766355140186e-05, 'epoch': 0.38} [WARNING|modeling_utils.py:388] 2022-03-01 02:22:30,219 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:22:05,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|█████████████████████████████▍ | 673/1784 [51:19<1:01:22, 3.31s/it]g-point operations will not be computed-01 02:22:05,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|█████████████████████████████▍ | 673/1784 [51:19<1:01:22, 3.31s/it]g-point operations will not be computed-01 02:22:05,317 >> Could not estimate the number of tokens of the input, floating-point operations will 
not be computed 38%|█████████████████████████████▍ | 674/1784 [51:23<1:01:04, 3.30s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:22:38,499 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|█████████████████████████████▍ | 674/1784 [51:23<1:01:04, 3.30s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:22:38,499 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|█████████████████████████████▌ | 675/1784 [51:26<1:00:43, 3.29s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:22:38,499 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|█████████████████████████████▌ | 675/1784 [51:26<1:00:43, 3.29s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:22:38,499 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0495, 'learning_rate': 2.600467289719626e-05, 'epoch': 0.38} 38%|█████████████████████████████▌ | 676/1784 [51:29<1:00:04, 3.25s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:22:38,499 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|█████████████████████████████▌ | 676/1784 [51:29<1:00:04, 3.25s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:22:38,499 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:22:46,385 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:22:38,499 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:22:46,385 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:22:38,499 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed {'loss': 4.1967, 'learning_rate': 2.5957943925233647e-05, 'epoch': 0.38} 38%|██████████████████████████████▍ | 678/1784 [51:35<59:17, 3.22s/it]g-point operations will not be computed-01 02:22:38,499 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▍ | 678/1784 [51:35<59:17, 3.22s/it]g-point operations will not be computed-01 02:22:38,499 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1595, 'learning_rate': 2.593457943925234e-05, 'epoch': 0.38} 38%|██████████████████████████████▍ | 679/1784 [51:39<58:48, 3.19s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:22:54,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▍ | 679/1784 [51:39<58:48, 3.19s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:22:54,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▍ | 680/1784 [51:42<58:09, 3.16s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:22:54,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▍ | 680/1784 [51:42<58:09, 3.16s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:22:54,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:22:58,907 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:22:54,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:22:58,907 >> Could not estimate the number of tokens of the input, floating-point operations will not 
be computed-01 02:22:54,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1382, 'learning_rate': 2.5864485981308413e-05, 'epoch': 0.38} [WARNING|modeling_utils.py:388] 2022-03-01 02:22:58,907 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:22:54,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▌ | 682/1784 [51:48<56:57, 3.10s/it]g-point operations will not be computed-01 02:22:54,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:23:04,920 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:22:54,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:23:04,920 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:22:54,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4054, 'learning_rate': 2.5817757009345793e-05, 'epoch': 0.38} [WARNING|modeling_utils.py:388] 2022-03-01 02:23:04,920 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:22:54,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▋ | 684/1784 [51:54<55:26, 3.02s/it]g-point operations will not be computed-01 02:22:54,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▋ | 684/1784 [51:54<55:26, 3.02s/it]g-point operations will not be computed-01 02:22:54,304 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:23:10,678 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:22:54,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:23:10,678 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:22:54,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:23:10,678 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:22:54,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▊ | 686/1784 [51:59<53:11, 2.91s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:23:14,843 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▊ | 686/1784 [51:59<53:11, 2.91s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:23:14,843 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|██████████████████████████████▊ | 687/1784 [52:02<51:59, 2.84s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:23:14,843 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|██████████████████████████████▊ | 687/1784 [52:02<51:59, 2.84s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:23:14,843 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:23:18,819 >> Could not estimate the number of tokens of the input, floating-point operations will not 
be computed-01 02:23:14,843 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:23:21,412 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:23:14,843 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:23:21,412 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:23:14,843 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2607, 'learning_rate': 2.5677570093457944e-05, 'epoch': 0.39} [WARNING|modeling_utils.py:388] 2022-03-01 02:23:21,412 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:23:14,843 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|██████████████████████████████▉ | 690/1784 [52:10<48:52, 2.68s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:23:25,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|██████████████████████████████▉ | 690/1784 [52:10<48:52, 2.68s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:23:25,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|██████████████████████████████▉ | 691/1784 [52:12<47:26, 2.60s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:23:27,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|██████████████████████████████▉ | 691/1784 [52:12<47:26, 2.60s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:23:27,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
39%|███████████████████████████████ | 692/1784 [52:15<45:44, 2.51s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:23:27,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████ | 692/1784 [52:15<45:44, 2.51s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:23:27,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:23:30,796 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:23:27,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:23:30,796 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:23:27,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:23:32,722 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:23:27,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:23:32,722 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:23:27,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:23:34,520 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:23:27,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:23:34,520 >> Could not estimate the number of tokens of the input, floating-point operations 
will not be computed {'loss': 4.5045, 'learning_rate': 2.5514018691588784e-05, 'epoch': 0.39} [WARNING|modeling_utils.py:388] 2022-03-01 02:23:37,513 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:23:38,792 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:23:41,563 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2691, 'learning_rate': 2.542056074766355e-05, 'epoch': 0.39} [WARNING|modeling_utils.py:388]
2022-03-01 02:23:41,563 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:23:45,462 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:23:49,097 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▌ | 703/1784 [52:39<53:05,
2.95s/it] 39%|███████████████████████████████▌ | 704/1784 [52:42<56:21, 3.13s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:23:58,148 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|███████████████████████████████▌ | 705/1784 [52:46<58:50, 3.27s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:23:58,148 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2303, 'learning_rate': 2.530373831775701e-05, 'epoch': 0.4} 40%|███████████████████████████████▌ | 705/1784 [52:46<58:50, 3.27s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:23:58,148 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|███████████████████████████████▋ | 706/1784 [52:49<59:55, 3.34s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:23:58,148 >> Could not
estimate the number of tokens of the input, floating-point operations will not be computed 40%|██████████████████████████████▉ | 707/1784 [52:53<1:00:49, 3.39s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:23:58,148 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:24:10,370 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1924, 'learning_rate': 2.5233644859813084e-05, 'epoch': 0.4} [WARNING|modeling_utils.py:388] 2022-03-01 02:24:10,370 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|██████████████████████████████▉ | 709/1784 [53:00<1:01:33,
3.44s/it] 40%|███████████████████████████████ | 710/1784 [53:03<1:01:06, 3.41s/it] [WARNING|modeling_utils.py:388] 2022-03-01 02:24:20,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3738, 'learning_rate': 2.5163551401869158e-05, 'epoch': 0.4} [WARNING|modeling_utils.py:388] 2022-03-01 02:24:20,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|███████████████████████████████▏ | 712/1784 [53:10<1:00:37, 3.39s/it]
40%|███████████████████████████████▏ | 713/1784 [53:13<1:00:40, 3.40s/it] [WARNING|modeling_utils.py:388] 2022-03-01 02:24:30,763 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5749, 'learning_rate': 2.5093457943925234e-05, 'epoch': 0.4} [WARNING|modeling_utils.py:388] 2022-03-01 02:24:30,763 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|███████████████████████████████▎ | 715/1784 [53:20<1:00:20, 3.39s/it] 40%|███████████████████████████████▎ | 716/1784 [53:23<1:00:06, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:24:39,192 >> Could not
estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▏ | 717/1784 [53:27<59:39, 3.35s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:24:39,192 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1837, 'learning_rate': 2.5023364485981308e-05, 'epoch': 0.4} 40%|████████████████████████████████▏ | 717/1784 [53:27<59:39, 3.35s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:24:39,192 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▏ | 718/1784 [53:30<59:37, 3.36s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:24:39,192 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▏ | 719/1784 [53:33<59:16, 3.34s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:24:49,127 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▎ | 720/1784 [53:37<58:54, 3.32s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:24:49,127 >> Could not estimate the number of tokens of the input, floating-point operations
will not be computed {'loss': 4.2529, 'learning_rate': 2.4953271028037385e-05, 'epoch': 0.4} 40%|████████████████████████████████▎ | 720/1784 [53:37<58:54, 3.32s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:24:49,127 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▎ | 721/1784 [53:40<58:40, 3.31s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:24:49,127 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:24:57,260 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3385, 'learning_rate': 2.4906542056074768e-05, 'epoch': 0.4} [WARNING|modeling_utils.py:388] 2022-03-01 02:24:57,260 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|████████████████████████████████▍ | 723/1784 [53:46<57:55, 3.28s/it] [WARNING|modeling_utils.py:388] 2022-03-01 02:25:03,716 >> Could
not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4662, 'learning_rate': 2.4859813084112148e-05, 'epoch': 0.41} [WARNING|modeling_utils.py:388] 2022-03-01 02:25:03,716 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|████████████████████████████████▌ | 725/1784 [53:53<56:49, 3.22s/it] [WARNING|modeling_utils.py:388] 2022-03-01 02:25:09,963 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.188, 'learning_rate': 2.4813084112149535e-05, 'epoch': 0.41} [WARNING|modeling_utils.py:388] 2022-03-01 02:25:09,963 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
41%|████████████████████████████████▌ | 727/1784 [53:59<55:49, 3.17s/it] 41%|████████████████████████████████▋ | 728/1784 [54:02<55:28, 3.15s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:25:17,808 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|████████████████████████████████▋ | 729/1784 [54:05<55:10, 3.14s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:25:17,808 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|████████████████████████████████▋ | 730/1784 [54:08<54:51, 3.12s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:25:23,960 >> Could not
estimate the number of tokens of the input, floating-point operations will not be computed 41%|████████████████████████████████▊ | 731/1784 [54:11<54:23, 3.10s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:25:23,960 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|████████████████████████████████▊ | 732/1784 [54:14<53:52, 3.07s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:25:29,969 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|████████████████████████████████▊ | 733/1784 [54:17<53:18, 3.04s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:25:29,969 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:25:34,343 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2196, 'learning_rate': 2.462616822429907e-05, 'epoch': 0.41} [WARNING|modeling_utils.py:388] 2022-03-01 02:25:34,343 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|████████████████████████████████▉ | 735/1784 [54:23<52:04, 2.98s/it] [WARNING|modeling_utils.py:388] 2022-03-01 02:25:40,149 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1694, 'learning_rate': 2.457943925233645e-05, 'epoch': 0.41} [WARNING|modeling_utils.py:388] 2022-03-01 02:25:40,149 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|█████████████████████████████████ | 737/1784 [54:29<50:40, 2.90s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:25:44,342 >> Could not estimate the number of tokens of the input, floating-point operations will not be
computed 41%|█████████████████████████████████ | 738/1784 [54:32<49:26, 2.84s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:25:44,342 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:25:48,235 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2423, 'learning_rate': 2.4509345794392522e-05, 'epoch': 0.41} [WARNING|modeling_utils.py:388] 2022-03-01 02:25:48,235 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|█████████████████████████████████▏ | 740/1784 [54:37<47:13, 2.71s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:25:52,114 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 42%|█████████████████████████████████▏ | 741/1784 [54:39<45:41, 2.63s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:25:54,476 >> Could not
estimate the number of tokens of the input, floating-point operations will not be computed 42%|█████████████████████████████████▎ | 742/1784 [54:41<44:09, 2.54s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 42%|█████████████████████████████████▎ | 743/1784 [54:44<42:25, 2.45s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:25:59,938 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:26:02,015 >> Could not estimate the number of tokens of the input, floating-point
operations will not be computed {'loss': 4.7925, 'learning_rate': 2.4369158878504672e-05, 'epoch': 0.42} [WARNING|modeling_utils.py:388] 2022-03-01 02:26:03,764 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:26:06,561 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5483, 'learning_rate': 2.429906542056075e-05, 'epoch': 0.42} [WARNING|modeling_utils.py:388] 2022-03-01 02:26:07,770 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:26:09,458 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:26:13,316 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:26:17,002 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.256, 'learning_rate': 2.4205607476635516e-05, 'epoch': 0.42} 42%|█████████████████████████████████▊ | 753/1784 [55:07<51:21, 2.99s/it]
{'loss': 4.2916, 'learning_rate': 2.4182242990654206e-05, 'epoch': 0.42} 42%|█████████████████████████████████▊ | 754/1784 [55:10<54:38, 3.18s/it] {'loss': 4.0228, 'learning_rate': 2.41588785046729e-05, 'epoch': 0.42} 42%|█████████████████████████████████▊ | 754/1784 [55:10<54:38, 3.18s/it] 42%|█████████████████████████████████▊ | 755/1784 [55:14<56:37, 3.30s/it] 42%|█████████████████████████████████▉ | 756/1784 [55:17<57:44, 3.37s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:26:34,979 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0716, 'learning_rate': 2.4088785046728972e-05, 'epoch': 0.42} 42%|█████████████████████████████████▉ | 758/1784 [55:24<58:57, 3.45s/it] {'loss': 4.2561, 'learning_rate': 2.4065420560747666e-05, 'epoch': 0.42} 42%|█████████████████████████████████▉ | 758/1784 [55:24<58:57, 3.45s/it] 43%|██████████████████████████████████ | 759/1784 [55:28<59:15, 3.47s/it] [WARNING|modeling_utils.py:388] 2022-03-01 02:26:45,476 >> Could not
estimate the number of tokens of the input, floating-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1544, 'learning_rate': 2.4018691588785046e-05, 'epoch': 0.43} 43%|██████████████████████████████████▏ | 761/1784 [55:35<58:59, 3.46s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▏ | 761/1784 [55:35<58:59, 3.46s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5181, 'learning_rate': 2.399532710280374e-05, 'epoch': 0.43} 43%|██████████████████████████████████▏ | 761/1784 [55:35<58:59, 3.46s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▏ | 762/1784 [55:38<58:46, 3.45s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:26:55,757 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:26:55,757 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2302, 'learning_rate': 2.394859813084112e-05, 'epoch': 0.43} 43%|██████████████████████████████████▎ | 764/1784 [55:45<58:39, 
3.45s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▎ | 764/1784 [55:45<58:39, 3.45s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0103, 'learning_rate': 2.3925233644859813e-05, 'epoch': 0.43} 43%|██████████████████████████████████▎ | 765/1784 [55:49<58:21, 3.44s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▎ | 765/1784 [55:49<58:21, 3.44s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:27:05,931 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:27:05,931 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.188, 'learning_rate': 2.38785046728972e-05, 'epoch': 0.43} 43%|██████████████████████████████████▍ | 767/1784 [55:55<57:34, 3.40s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▍ | 767/1784 [55:55<57:34, 3.40s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:27:12,725 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:27:12,725 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:27:12,725 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▍ | 769/1784 [56:02<57:18, 3.39s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▍ | 769/1784 [56:02<57:18, 3.39s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5287, 'learning_rate': 2.3808411214953273e-05, 'epoch': 0.43} 43%|██████████████████████████████████▌ | 770/1784 [56:05<56:48, 3.36s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▌ | 770/1784 [56:05<56:48, 3.36s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:27:22,683 >> Could not estimate the number of tokens of 
the input, floating-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:27:22,683 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1368, 'learning_rate': 2.3761682242990653e-05, 'epoch': 0.43} 43%|██████████████████████████████████▌ | 772/1784 [56:12<56:14, 3.33s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▌ | 772/1784 [56:12<56:14, 3.33s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1505, 'learning_rate': 2.3738317757009346e-05, 'epoch': 0.43} 43%|██████████████████████████████████▌ | 772/1784 [56:12<56:14, 3.33s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▋ | 773/1784 [56:15<55:48, 3.31s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:27:32,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:27:32,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:25:56,765 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2193, 'learning_rate': 2.369158878504673e-05, 'epoch': 0.43} 43%|██████████████████████████████████▊ | 775/1784 [56:22<54:31, 3.24s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▊ | 775/1784 [56:22<54:31, 3.24s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:27:38,796 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:27:38,796 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1377, 'learning_rate': 2.3644859813084113e-05, 'epoch': 0.43} 44%|██████████████████████████████████▊ | 777/1784 [56:28<53:48, 3.21s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|██████████████████████████████████▊ | 777/1784 [56:28<53:48, 3.21s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:27:45,052 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:27:45,052 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.421, 'learning_rate': 2.3598130841121497e-05, 'epoch': 0.44} 44%|██████████████████████████████████▉ | 779/1784 [56:34<52:40, 3.14s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|██████████████████████████████████▉ | 779/1784 [56:34<52:40, 3.14s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:27:51,227 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:27:51,227 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3547, 'learning_rate': 2.355140186915888e-05, 'epoch': 0.44} 44%|███████████████████████████████████ | 781/1784 [56:40<51:51, 3.10s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████ | 781/1784 [56:40<51:51, 3.10s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 
02:27:57,256 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:27:57,256 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.347, 'learning_rate': 2.3504672897196263e-05, 'epoch': 0.44} 44%|███████████████████████████████████ | 783/1784 [56:46<50:49, 3.05s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████ | 783/1784 [56:46<50:49, 3.05s/it]g-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:03,124 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:03,124 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1498, 'learning_rate': 2.3457943925233643e-05, 'epoch': 0.44} [WARNING|modeling_utils.py:388] 2022-03-01 02:28:03,124 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:25:56,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
44%|███████████████████████████████████▏ | 785/1784 [56:52<48:54, 2.94s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:28:07,371 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▏ | 786/1784 [56:55<47:57, 2.88s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:28:07,371 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▏ | 786/1784 [56:55<47:57, 2.88s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:28:07,371 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:11,360 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:07,371 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:11,360 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:07,371 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.8649, 'learning_rate': 2.338785046728972e-05, 'epoch': 0.44} [WARNING|modeling_utils.py:388] 2022-03-01 02:28:11,360 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:07,371 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▎ | 788/1784 [57:00<45:59, 2.77s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:28:15,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▍ | 789/1784 [57:02<44:39, 2.69s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:28:17,821 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▍ | 789/1784 [57:02<44:39, 2.69s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▍ | 790/1784 [57:05<43:36, 2.63s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▍ | 790/1784 [57:05<43:36, 2.63s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:21,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:23,656 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:23,656 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:25,769 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:25,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:27,744 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:27,744 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:29,591 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:29,591 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:31,267 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:31,267 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2447, 'learning_rate': 2.3177570093457944e-05, 'epoch': 0.45} [WARNING|modeling_utils.py:388] 2022-03-01 02:28:34,091 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:34,091 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:35,276 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:35,276 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:36,920 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:36,920 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4033, 'learning_rate': 2.308411214953271e-05, 'epoch': 0.45} 
[WARNING|modeling_utils.py:388] 2022-03-01 02:28:36,920 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:40,760 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:44,395 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:44,395 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.292, 'learning_rate': 2.3037383177570094e-05, 'epoch': 0.45} 45%|████████████████████████████████████ | 803/1784 [57:34<48:30, 2.97s/it]g-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████ | 803/1784 [57:34<48:30, 2.97s/it]g-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.322, 'learning_rate': 2.3014018691588784e-05, 'epoch': 0.45} 45%|████████████████████████████████████ | 804/1784 [57:38<51:45, 3.17s/it]g-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
45%|████████████████████████████████████ | 804/1784 [57:38<51:45, 3.17s/it]g-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1898, 'learning_rate': 2.2990654205607477e-05, 'epoch': 0.45} 45%|████████████████████████████████████ | 805/1784 [57:41<53:48, 3.30s/it]g-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████ | 805/1784 [57:41<53:48, 3.30s/it]g-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:58,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:28:58,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3549, 'learning_rate': 2.294392523364486e-05, 'epoch': 0.45} 45%|████████████████████████████████████▏ | 807/1784 [57:48<55:30, 3.41s/it]g-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▏ | 807/1784 [57:48<55:30, 3.41s/it]g-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2268, 'learning_rate': 2.292056074766355e-05, 'epoch': 0.45} 45%|████████████████████████████████████▏ | 
808/1784 [57:52<55:47, 3.43s/it]g-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▏ | 808/1784 [57:52<55:47, 3.43s/it]g-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9229, 'learning_rate': 2.2897196261682244e-05, 'epoch': 0.45} 45%|████████████████████████████████████▎ | 809/1784 [57:55<55:58, 3.44s/it]g-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▎ | 809/1784 [57:55<55:58, 3.44s/it]g-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:29:12,814 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:29:12,814 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3893, 'learning_rate': 2.2850467289719628e-05, 'epoch': 0.45} 45%|████████████████████████████████████▎ | 811/1784 [58:02<55:33, 3.43s/it]g-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▎ | 811/1784 [58:02<55:33, 3.43s/it]g-point operations will not be computed-01 02:28:17,821 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed {'loss': 4.2611, 'learning_rate': 2.2827102803738318e-05, 'epoch': 0.45} 45%|████████████████████████████████████▎ | 811/1784 [58:02<55:33, 3.43s/it]g-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|████████████████████████████████████▍ | 812/1784 [58:05<55:07, 3.40s/it]g-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:29:22,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:29:22,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2921, 'learning_rate': 2.27803738317757e-05, 'epoch': 0.46} 46%|████████████████████████████████████▌ | 814/1784 [58:12<55:09, 3.41s/it]g-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|████████████████████████████████████▌ | 814/1784 [58:12<55:09, 3.41s/it]g-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0179, 'learning_rate': 2.2757009345794394e-05, 'epoch': 0.46} 46%|████████████████████████████████████▌ | 814/1784 [58:12<55:09, 3.41s/it]g-point operations will not be computed-01 02:28:17,821 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 46%|████████████████████████████████████▌ | 815/1784 [58:16<54:55, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:29:31,458 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|████████████████████████████████████▌ | 816/1784 [58:19<54:47, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:29:31,458 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|████████████████████████████████████▌ | 816/1784 [58:19<54:47, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:29:31,458 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1997, 'learning_rate': 2.2710280373831774e-05, 'epoch': 0.46} 46%|████████████████████████████████████▋ | 817/1784 [58:22<54:46, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:29:31,458 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|████████████████████████████████████▋ | 817/1784 [58:22<54:46, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:29:31,458 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0681, 'learning_rate': 2.2686915887850468e-05, 'epoch': 0.46} 46%|████████████████████████████████████▋ | 817/1784 [58:22<54:46, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:29:31,458 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|████████████████████████████████████▋ | 818/1784 [58:26<54:26, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:29:41,576 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|████████████████████████████████████▋ | 819/1784 [58:29<54:09, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:29:41,576 >> Could not estimate 
the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1792, 'learning_rate': 2.2640186915887848e-05, 'epoch': 0.46} 46%|████████████████████████████████████▊ | 820/1784 [58:32<53:47, 3.35s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:29:41,576 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:29:49,733 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4142, 'learning_rate': 2.2593457943925235e-05, 'epoch': 0.46} 46%|████████████████████████████████████▊ | 822/1784 [58:39<52:59, 3.30s/it][WARNING|modeling_utils.py:388]
2022-03-01 02:29:56,230 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5255, 'learning_rate': 2.2546728971962615e-05, 'epoch': 0.46} 46%|████████████████████████████████████▉ | 824/1784 [58:45<52:16, 3.27s/it] {'loss': 4.2801, 'learning_rate': 2.2523364485981308e-05, 'epoch': 0.46} 46%|████████████████████████████████████▉ | 825/1784 [58:49<51:57, 3.25s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:30:05,878 >> Could not estimate the number of tokens of
the input, floating-point operations will not be computed {'loss': 4.2408, 'learning_rate': 2.247663551401869e-05, 'epoch': 0.46} 46%|█████████████████████████████████████ | 827/1784 [58:55<51:10, 3.21s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:30:12,172 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1209, 'learning_rate': 2.2429906542056075e-05, 'epoch': 0.46} 46%|█████████████████████████████████████▏ | 829/1784 [59:01<50:40, 3.18s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:30:18,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1085, 'learning_rate': 2.2383177570093458e-05, 'epoch': 0.47} 47%|█████████████████████████████████████▎ | 831/1784 [59:07<49:59, 3.15s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:30:24,602 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5035, 'learning_rate': 2.233644859813084e-05, 'epoch': 0.47} 47%|█████████████████████████████████████▎ | 833/1784 [59:14<49:12, 3.10s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:30:30,692 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0138, 'learning_rate': 2.2289719626168225e-05, 'epoch': 0.47} 47%|█████████████████████████████████████▍ | 835/1784 [59:20<48:18, 3.05s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:30:35,173 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|█████████████████████████████████████▍ | 836/1784 [59:22<47:17, 2.99s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:30:35,173 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:30:39,371 >> Could not estimate the
number of tokens of the input, floating-point operations will not be computed {'loss': 4.3247, 'learning_rate': 2.22196261682243e-05, 'epoch': 0.47} 47%|█████████████████████████████████████▌ | 838/1784 [59:28<45:46, 2.90s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:30:44,891 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2826, 'learning_rate': 2.2149532710280372e-05, 'epoch': 0.47} [WARNING|modeling_utils.py:388] 2022-03-01 02:30:47,515 >>
Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|█████████████████████████████████████▋ | 841/1784 [59:36<42:31, 2.71s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:30:51,334 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|█████████████████████████████████████▊ | 842/1784 [59:38<41:09, 2.62s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:30:53,676 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|█████████████████████████████████████▊ | 843/1784 [59:41<39:17, 2.50s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:30:53,676 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:30:56,800 >> Could not
estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:30:58,798 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:31:00,616 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9984, 'learning_rate': 2.1985981308411215e-05, 'epoch': 0.47} [WARNING|modeling_utils.py:388] 2022-03-01 02:31:03,700 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388]
2022-03-01 02:31:04,901 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:31:06,552 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:31:10,444 >> Could not
estimate the number of tokens of the input, floating-point operations will not be computed 48%|█████████████████████████████████████▎ | 852/1784 [1:00:00<42:33, 2.74s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:31:17,844 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|█████████████████████████████████████▎ | 854/1784 [1:00:07<49:35, 3.20s/it] 48%|█████████████████████████████████████▍ | 855/1784 [1:00:11<51:07, 3.30s/it] 48%|█████████████████████████████████████▍ | 856/1784 [1:00:14<52:18, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:31:32,152 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1449, 'learning_rate': 2.1752336448598132e-05, 'epoch': 0.48} 48%|█████████████████████████████████████▌ | 858/1784 [1:00:22<53:18, 3.45s/it] 48%|█████████████████████████████████████▌ | 859/1784 [1:00:25<53:31, 3.47s/it] 48%|█████████████████████████████████████▌ | 860/1784 [1:00:29<53:41, 3.49s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:31:44,518 >> Could not estimate the number of tokens of the input, floating-point
operations will not be computed 48%|█████████████████████████████████████▋ | 861/1784 [1:00:32<53:47, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:31:44,518 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|█████████████████████████████████████▋ | 862/1784 [1:00:36<53:24, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:31:44,518 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|█████████████████████████████████████▋ | 863/1784 [1:00:39<53:11, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:31:54,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|█████████████████████████████████████▊ | 864/1784
[1:00:42<52:53, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:31:54,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|█████████████████████████████████████▊ | 865/1784 [1:00:46<52:35, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:31:54,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:32:03,296 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.272, 'learning_rate': 2.1542056074766356e-05, 'epoch': 0.49} 49%|█████████████████████████████████████▉ | 867/1784 [1:00:53<52:15, 3.42s/it] 49%|█████████████████████████████████████▉ | 868/1784 [1:00:56<51:48, 3.39s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:32:13,387 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2667, 'learning_rate': 2.147196261682243e-05, 'epoch': 0.49} 49%|██████████████████████████████████████ | 870/1784 [1:01:03<51:07,
3.36s/it] 49%|██████████████████████████████████████ | 871/1784 [1:01:06<50:38, 3.33s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:32:23,189 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|██████████████████████████████████████▏ | 873/1784 [1:01:12<49:43, 3.27s/it][WARNING|modeling_utils.py:388]
2022-03-01 02:32:29,618 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.109, 'learning_rate': 2.135514018691589e-05, 'epoch': 0.49} 49%|██████████████████████████████████████▎ | 875/1784 [1:01:19<48:56, 3.23s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:32:35,940 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2466, 'learning_rate': 2.130841121495327e-05, 'epoch': 0.49} 49%|██████████████████████████████████████▎ | 877/1784 [1:01:25<48:06, 3.18s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:32:42,195 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2792, 'learning_rate': 2.1261682242990657e-05, 'epoch': 0.49} 49%|██████████████████████████████████████▍ | 879/1784 [1:01:31<47:30, 3.15s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:32:48,410 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss':
4.3134, 'learning_rate': 2.1214953271028037e-05, 'epoch': 0.49} [WARNING|modeling_utils.py:388] 2022-03-01 02:32:48,410 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:31:54,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|██████████████████████████████████████▌ | 881/1784 [1:01:37<46:54, 3.12s/it]g-point operations will not be computed-01 02:31:54,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|██████████████████████████████████████▌ | 881/1784 [1:01:37<46:54, 3.12s/it]g-point operations will not be computed-01 02:31:54,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|██████████████████████████████████████▌ | 881/1784 [1:01:37<46:54, 3.12s/it]g-point operations will not be computed-01 02:31:54,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|██████████████████████████████████████▌ | 882/1784 [1:01:40<46:26, 3.09s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:32:56,072 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|██████████████████████████████████████▌ | 882/1784 [1:01:40<46:26, 3.09s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:32:56,072 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|██████████████████████████████████████▌ | 883/1784 [1:01:43<46:07, 3.07s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:32:56,072 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|██████████████████████████████████████▌ | 883/1784 [1:01:43<46:07, 3.07s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:32:56,072 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed 49%|██████████████████████████████████████▌ | 883/1784 [1:01:43<46:07, 3.07s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:32:56,072 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|██████████████████████████████████████▋ | 884/1784 [1:01:46<45:08, 3.01s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:01,902 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|██████████████████████████████████████▋ | 884/1784 [1:01:46<45:08, 3.01s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:01,902 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|██████████████████████████████████████▋ | 885/1784 [1:01:49<44:32, 2.97s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:01,902 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|██████████████████████████████████████▋ | 885/1784 [1:01:49<44:32, 2.97s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:01,902 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:33:06,082 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:33:01,902 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:33:06,082 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:33:01,902 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:33:06,082 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:33:01,902 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed 50%|██████████████████████████████████████▊ | 887/1784 [1:01:55<42:51, 2.87s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:10,237 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|██████████████████████████████████████▊ | 887/1784 [1:01:55<42:51, 2.87s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:10,237 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|██████████████████████████████████████▊ | 888/1784 [1:01:57<42:02, 2.81s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:10,237 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|██████████████████████████████████████▊ | 888/1784 [1:01:57<42:02, 2.81s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:10,237 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:33:14,107 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:33:10,237 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:33:14,107 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:33:10,237 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:33:14,107 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:33:10,237 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|██████████████████████████████████████▉ | 890/1784 [1:02:03<40:02, 2.69s/it][WARNING|modeling_utils.py:388] 
2022-03-01 02:33:17,938 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|██████████████████████████████████████▉ | 890/1784 [1:02:03<40:02, 2.69s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:17,938 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|██████████████████████████████████████▉ | 891/1784 [1:02:05<38:46, 2.61s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:20,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|██████████████████████████████████████▉ | 891/1784 [1:02:05<38:46, 2.61s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:20,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|███████████████████████████████████████ | 892/1784 [1:02:07<37:05, 2.49s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:22,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|███████████████████████████████████████ | 892/1784 [1:02:07<37:05, 2.49s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:22,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|███████████████████████████████████████ | 893/1784 [1:02:09<35:22, 2.38s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:24,507 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|███████████████████████████████████████ | 893/1784 [1:02:09<35:22, 2.38s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:24,507 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|███████████████████████████████████████ | 894/1784 [1:02:11<33:48, 2.28s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:26,445 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed 50%|███████████████████████████████████████ | 894/1784 [1:02:11<33:48, 2.28s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:26,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|███████████████████████████████████████▏ | 895/1784 [1:02:13<31:45, 2.14s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:28,179 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|███████████████████████████████████████▏ | 895/1784 [1:02:13<31:45, 2.14s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:28,179 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|███████████████████████████████████████▏ | 897/1784 [1:02:16<27:14, 1.84s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:29,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|███████████████████████████████████████▏ | 897/1784 [1:02:16<27:14, 1.84s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:29,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2464, 'learning_rate': 2.0817757009345794e-05, 'epoch': 0.5} 50%|███████████████████████████████████████▎ | 898/1784 [1:02:18<24:58, 1.69s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:32,428 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|███████████████████████████████████████▎ | 898/1784 [1:02:18<24:58, 1.69s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:32,428 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|███████████████████████████████████████▎ | 900/1784 [1:02:21<23:12, 1.57s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:33,580 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed 50%|███████████████████████████████████████▎ | 900/1784 [1:02:21<23:12, 1.57s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:33,580 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|███████████████████████████████████████▎ | 900/1784 [1:02:21<23:12, 1.57s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:36,688 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|███████████████████████████████████████▍ | 901/1784 [1:02:24<33:32, 2.28s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:36,688 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|███████████████████████████████████████▍ | 901/1784 [1:02:24<33:32, 2.28s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:36,688 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|███████████████████████████████████████▍ | 901/1784 [1:02:24<33:32, 2.28s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|███████████████████████████████████████▍ | 901/1784 [1:02:24<33:32, 2.28s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|███████████████████████████████████████▍ | 902/1784 [1:02:28<39:44, 2.70s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|███████████████████████████████████████▍ | 902/1784 [1:02:28<39:44, 2.70s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
51%|███████████████████████████████████████▍ | 902/1784 [1:02:28<39:44, 2.70s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|███████████████████████████████████████▍ | 903/1784 [1:02:32<43:43, 2.98s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|███████████████████████████████████████▍ | 903/1784 [1:02:32<43:43, 2.98s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|███████████████████████████████████████▍ | 903/1784 [1:02:32<43:43, 2.98s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|███████████████████████████████████████▌ | 904/1784 [1:02:35<46:37, 3.18s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:33:53,026 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:33:53,026 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2496, 'learning_rate': 2.0630841121495327e-05, 'epoch': 0.51} [WARNING|modeling_utils.py:388] 2022-03-01 02:33:53,026 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|███████████████████████████████████████▌ | 906/1784 [1:02:42<49:13, 3.36s/it]g-point operations will not be computed-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|███████████████████████████████████████▌ | 906/1784 [1:02:42<49:13, 3.36s/it]g-point operations will not be computed-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|███████████████████████████████████████▌ | 906/1784 [1:02:42<49:13, 3.36s/it]g-point operations will not be computed-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|███████████████████████████████████████▋ | 907/1784 [1:02:46<50:01, 3.42s/it]g-point operations will not be computed-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:34:03,606 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:34:03,606 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1458, 'learning_rate': 2.05607476635514e-05, 'epoch': 0.51} 51%|███████████████████████████████████████▋ | 909/1784 [1:02:53<50:18, 3.45s/it]g-point operations will not be computed-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
51%|███████████████████████████████████████▋ | 909/1784 [1:02:53<50:18, 3.45s/it]g-point operations will not be computed-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2969, 'learning_rate': 2.0537383177570094e-05, 'epoch': 0.51} 51%|███████████████████████████████████████▋ | 909/1784 [1:02:53<50:18, 3.45s/it]g-point operations will not be computed-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|███████████████████████████████████████▊ | 910/1784 [1:02:56<50:01, 3.43s/it]g-point operations will not be computed-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|███████████████████████████████████████▊ | 910/1784 [1:02:56<50:01, 3.43s/it]g-point operations will not be computed-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|███████████████████████████████████████▊ | 910/1784 [1:02:56<50:01, 3.43s/it]g-point operations will not be computed-01 02:33:40,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|███████████████████████████████████████▊ | 911/1784 [1:03:00<49:45, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:34:15,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|███████████████████████████████████████▊ | 912/1784 [1:03:03<49:25, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:34:15,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|███████████████████████████████████████▊ | 912/1784 [1:03:03<49:25, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:34:15,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed {'loss': 4.0777, 'learning_rate': 2.0467289719626168e-05, 'epoch': 0.51} 51%|███████████████████████████████████████▊ | 912/1784 [1:03:03<49:25, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:34:15,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|███████████████████████████████████████▉ | 913/1784 [1:03:07<49:18, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:34:15,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:34:23,970 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:34:15,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:34:23,970 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:34:15,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1676, 'learning_rate': 2.042056074766355e-05, 'epoch': 0.51} [WARNING|modeling_utils.py:388] 2022-03-01 02:34:23,970 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:34:15,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|████████████████████████████████████████ | 915/1784 [1:03:13<48:46, 3.37s/it]g-point operations will not be computed-01 02:34:15,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|████████████████████████████████████████ | 915/1784 [1:03:13<48:46, 3.37s/it]g-point operations will not be computed-01 02:34:15,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
51%|████████████████████████████████████████ | 915/1784 [1:03:13<48:46, 3.37s/it]g-point operations will not be computed-01 02:34:15,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|████████████████████████████████████████ | 916/1784 [1:03:17<48:39, 3.36s/it]g-point operations will not be computed-01 02:34:15,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:34:33,964 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:34:15,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:34:33,964 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:34:15,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0691, 'learning_rate': 2.0350467289719628e-05, 'epoch': 0.51} [WARNING|modeling_utils.py:388] 2022-03-01 02:34:33,964 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:34:15,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|████████████████████████████████████████▏ | 918/1784 [1:03:23<48:02, 3.33s/it]g-point operations will not be computed-01 02:34:15,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|████████████████████████████████████████▏ | 918/1784 [1:03:23<48:02, 3.33s/it]g-point operations will not be computed-01 02:34:15,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|████████████████████████████████████████▏ | 918/1784 [1:03:23<48:02, 3.33s/it]g-point operations will not be 
computed-01 02:34:15,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|████████████████████████████████████████▏ | 919/1784 [1:03:26<48:04, 3.34s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:34:42,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|████████████████████████████████████████▏ | 920/1784 [1:03:30<48:03, 3.34s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:34:42,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|████████████████████████████████████████▏ | 920/1784 [1:03:30<48:03, 3.34s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:34:42,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0056, 'learning_rate': 2.02803738317757e-05, 'epoch': 0.52} 52%|████████████████████████████████████████▏ | 920/1784 [1:03:30<48:03, 3.34s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:34:42,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|████████████████████████████████████████▎ | 921/1784 [1:03:33<47:42, 3.32s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:34:42,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:34:50,442 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:34:42,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:34:50,442 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:34:42,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9573, 
'learning_rate': 2.0233644859813085e-05, 'epoch': 0.52} [WARNING|modeling_utils.py:388] 2022-03-01 02:34:50,442 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:34:42,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|████████████████████████████████████████▎ | 923/1784 [1:03:40<46:55, 3.27s/it]g-point operations will not be computed-01 02:34:42,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:34:56,898 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:34:42,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:34:56,898 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:34:42,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1254, 'learning_rate': 2.0186915887850468e-05, 'epoch': 0.52} 52%|████████████████████████████████████████▍ | 925/1784 [1:03:46<46:26, 3.24s/it]g-point operations will not be computed-01 02:34:42,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|████████████████████████████████████████▍ | 925/1784 [1:03:46<46:26, 3.24s/it]g-point operations will not be computed-01 02:34:42,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2993, 'learning_rate': 2.0163551401869158e-05, 'epoch': 0.52} 52%|████████████████████████████████████████▍ | 925/1784 [1:03:46<46:26, 3.24s/it]g-point operations will not be computed-01 02:34:42,322 >> Could not estimate the number of tokens of the input, floating-point operations will not 
be computed 52%|████████████████████████████████████████▍ | 926/1784 [1:03:49<46:05, 3.22s/it]g-point operations will not be computed-01 02:34:42,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:35:06,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:34:42,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:35:06,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:34:42,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1477, 'learning_rate': 2.0116822429906545e-05, 'epoch': 0.52} [WARNING|modeling_utils.py:388] 2022-03-01 02:35:06,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:34:42,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|████████████████████████████████████████▌ | 928/1784 [1:03:55<45:11, 3.17s/it]g-point operations will not be computed-01 02:34:42,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:35:12,601 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:34:42,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:35:12,601 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:34:42,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1204, 
'learning_rate': 2.0070093457943925e-05, 'epoch': 0.52}
[WARNING|modeling_utils.py:388] 2022-03-01 02:35:12,601 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
52%|████████████████████████████████████████▋ | 930/1784 [1:04:02<44:35, 3.13s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:35:18,744 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.4103, 'learning_rate': 2.002336448598131e-05, 'epoch': 0.52}
52%|████████████████████████████████████████▋ | 932/1784 [1:04:08<43:51, 3.09s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:35:24,751 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2804, 'learning_rate': 1.997663551401869e-05, 'epoch': 0.52}
52%|████████████████████████████████████████▊ | 934/1784 [1:04:14<42:34, 3.01s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:35:29,166 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
52%|████████████████████████████████████████▉ | 935/1784 [1:04:16<42:06, 2.98s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:35:33,420 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.3072, 'learning_rate': 1.9906542056074765e-05, 'epoch': 0.52}
53%|████████████████████████████████████████▉ | 937/1784 [1:04:22<41:05, 2.91s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:35:37,681 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
53%|█████████████████████████████████████████ | 938/1784 [1:04:25<40:21, 2.86s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:35:41,657 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 3.954, 'learning_rate': 1.9836448598130842e-05, 'epoch': 0.53}
53%|█████████████████████████████████████████ | 940/1784 [1:04:30<38:32, 2.74s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:35:45,594 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
53%|█████████████████████████████████████████▏ | 941/1784 [1:04:33<37:36, 2.68s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:35:48,028 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
53%|█████████████████████████████████████████▏ | 942/1784 [1:04:35<36:08, 2.58s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:35:51,308 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:35:53,293 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:35:55,074 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:35:56,724 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.3206, 'learning_rate': 1.9672897196261682e-05, 'epoch': 0.53}
[WARNING|modeling_utils.py:388] 2022-03-01 02:35:59,590 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:36:00,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 5.1039, 'learning_rate': 1.960280373831776e-05, 'epoch': 0.53}
[WARNING|modeling_utils.py:388] 2022-03-01 02:36:02,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:36:06,281 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:36:10,028 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2665, 'learning_rate': 1.9532710280373832e-05, 'epoch': 0.53}
53%|█████████████████████████████████████████▋ | 953/1784 [1:05:00<41:30, 3.00s/it]
{'loss': 4.3006, 'learning_rate': 1.9509345794392522e-05, 'epoch': 0.53}
53%|█████████████████████████████████████████▋ | 954/1784 [1:05:03<44:05, 3.19s/it]
{'loss': 3.9481, 'learning_rate': 1.9485981308411216e-05, 'epoch': 0.53}
54%|█████████████████████████████████████████▊ | 955/1784 [1:05:07<45:46, 3.31s/it]
54%|█████████████████████████████████████████▊ | 956/1784 [1:05:10<46:56, 3.40s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:36:26,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
54%|█████████████████████████████████████████▊ | 957/1784 [1:05:14<47:46, 3.47s/it]
{'loss': 4.2548, 'learning_rate': 1.941588785046729e-05, 'epoch': 0.54}
54%|█████████████████████████████████████████▉ | 958/1784 [1:05:18<48:03, 3.49s/it]
{'loss': 4.0172, 'learning_rate': 1.9392523364485982e-05, 'epoch': 0.54}
54%|█████████████████████████████████████████▉ | 959/1784 [1:05:21<48:08, 3.50s/it]
54%|█████████████████████████████████████████▉ | 960/1784 [1:05:25<48:15, 3.51s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:36:40,533 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
54%|██████████████████████████████████████████ | 961/1784 [1:05:28<47:51, 3.49s/it]
{'loss': 4.111, 'learning_rate': 1.9322429906542056e-05, 'epoch': 0.54}
54%|██████████████████████████████████████████ | 962/1784 [1:05:32<47:43, 3.48s/it]
54%|██████████████████████████████████████████ | 963/1784 [1:05:35<47:19, 3.46s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:36:50,848 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
54%|██████████████████████████████████████████▏ | 964/1784 [1:05:38<47:01, 3.44s/it]
{'loss': 4.2684, 'learning_rate': 1.925233644859813e-05, 'epoch': 0.54}
54%|██████████████████████████████████████████▏ | 965/1784 [1:05:42<46:45, 3.43s/it]
{'loss': 4.1267, 'learning_rate': 1.9228971962616823e-05, 'epoch': 0.54}
54%|██████████████████████████████████████████▏ | 966/1784 [1:05:45<46:25, 3.41s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:37:00,963 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
54%|██████████████████████████████████████████▎ | 967/1784 [1:05:48<46:08, 3.39s/it]
{'loss': 4.2444, 'learning_rate': 1.9182242990654206e-05, 'epoch': 0.54}
54%|██████████████████████████████████████████▎ | 968/1784 [1:05:52<45:52, 3.37s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:37:09,290 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1988, 'learning_rate': 1.913551401869159e-05, 'epoch': 0.54}
54%|██████████████████████████████████████████▍ | 970/1784 [1:05:59<45:37, 3.36s/it]
{'loss': 4.1309, 'learning_rate': 1.9112149532710283e-05, 'epoch': 0.54}
54%|██████████████████████████████████████████▍ | 971/1784 [1:06:02<45:18, 3.34s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:37:19,162 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1522, 'learning_rate': 1.9065420560747663e-05, 'epoch': 0.54}
55%|██████████████████████████████████████████▌ | 973/1784 [1:06:08<44:23, 3.28s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:37:25,654 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.4322, 'learning_rate': 1.9018691588785046e-05, 'epoch': 0.55}
55%|██████████████████████████████████████████▋ | 975/1784 [1:06:15<43:54, 3.26s/it]
{'loss': 4.3223, 'learning_rate': 1.899532710280374e-05, 'epoch': 0.55}
55%|██████████████████████████████████████████▋ | 976/1784 [1:06:18<43:23, 3.22s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:37:33,631 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
55%|██████████████████████████████████████████▋ | 977/1784 [1:06:21<43:11, 3.21s/it]
{'loss': 4.5155, 'learning_rate': 1.8948598130841123e-05, 'epoch': 0.55}
55%|██████████████████████████████████████████▊ | 978/1784 [1:06:24<42:48, 3.19s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:37:39,927 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
55%|██████████████████████████████████████████▊ | 979/1784 [1:06:27<42:25, 3.16s/it]
{'loss': 4.0066, 'learning_rate': 1.8901869158878507e-05, 'epoch': 0.55}
55%|██████████████████████████████████████████▊ | 980/1784 [1:06:30<42:00, 3.14s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:37:46,058 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
55%|██████████████████████████████████████████▉ | 981/1784 [1:06:33<41:32, 3.10s/it]
{'loss': 3.8829, 'learning_rate': 1.8855140186915887e-05, 'epoch': 0.55}
55%|██████████████████████████████████████████▉ | 982/1784 [1:06:36<41:03, 3.07s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:37:52,018 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
55%|██████████████████████████████████████████▉ | 983/1784 [1:06:39<40:22, 3.02s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:37:56,268 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.327, 'learning_rate': 1.8785046728971963e-05, 'epoch': 0.55}
55%|███████████████████████████████████████████ | 985/1784 [1:06:45<38:55, 2.92s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:38:00,550 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
55%|███████████████████████████████████████████ | 986/1784 [1:06:48<38:23, 2.89s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:38:04,603 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.3195, 'learning_rate': 1.871495327102804e-05, 'epoch': 0.55}
55%|███████████████████████████████████████████▏ | 988/1784 [1:06:53<36:40, 2.76s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:38:08,509 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
55%|███████████████████████████████████████████▏ | 989/1784 [1:06:56<35:29, 2.68s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
55%|███████████████████████████████████████████▎ | 990/1784 [1:06:58<34:14, 2.59s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:38:14,275 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:38:16,358 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:38:18,288 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:38:20,091 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:38:21,759 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 02:38:24,650 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 3.786, 'learning_rate': 1.850467289719626e-05, 'epoch': 0.56}
[WARNING|modeling_utils.py:388] 2022-03-01 02:38:27,072 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.8163, 'learning_rate': 1.8457943925233647e-05, 'epoch': 0.56}
[INFO|trainer.py:2369] 2022-03-01 02:38:28,899 >> Batch size = 8
{'loss': 4.3994, 'learning_rate': 1.8411214953271027e-05, 'epoch': 0.56}
0%| | 0/331 [00:00> Could not estimate the number of tokens of the input, floating-point operations will not be computed
1%|▌ | 2/331 [00:02<06:14, 1.14s/it]
1%|▊ | 3/331 [00:04<08:21, 1.53s/it]
1%|█ | 4/331 [00:06<09:33, 1.75s/it]
2%|█▎ | 5/331 [00:09<11:12, 2.06s/it]
2%|█▌ | 6/331 [00:11<12:16, 2.27s/it]
2%|█▊ | 7/331 [00:14<12:23, 2.30s/it]
2%|██ | 8/331 [00:16<12:52, 2.39s/it]
3%|██▎ | 9/331 [00:19<13:27, 2.51s/it]
3%|██▍ | 10/331 [00:22<14:14, 2.66s/it]g-point operations
will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▋ | 11/331 [00:25<13:55, 2.61s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▉ | 12/331 [00:27<13:38, 2.57s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 13/331 [00:29<13:26, 2.54s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▍ | 14/331 [00:32<13:18, 2.52s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▋ | 15/331 [00:35<14:38, 2.78s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▉ | 16/331 [00:39<15:32, 2.96s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▏ | 17/331 [00:42<15:34, 2.98s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▍ | 18/331 [00:44<14:12, 2.72s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 19/331 [00:47<14:02, 2.70s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 20/331 [00:49<13:14, 2.55s/it]g-point 
operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|█████▏ | 21/331 [00:52<13:48, 2.67s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 22/331 [00:55<14:47, 2.87s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 23/331 [00:59<15:53, 3.10s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▉ | 24/331 [01:02<16:43, 3.27s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 25/331 [01:05<15:59, 3.14s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▍ | 26/331 [01:08<14:51, 2.92s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▋ | 27/331 [01:11<14:56, 2.95s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▉ | 28/331 [01:13<14:34, 2.88s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▏ | 29/331 [01:16<14:15, 2.83s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▍ | 30/331 
[01:18<13:34, 2.71s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▋ | 31/331 [01:21<12:55, 2.59s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 32/331 [01:23<12:43, 2.55s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████▏ | 33/331 [01:26<12:55, 2.60s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████▍ | 34/331 [01:28<12:50, 2.59s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▋ | 35/331 [01:31<12:57, 2.63s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 36/331 [01:34<13:34, 2.76s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|█████████▏ | 37/331 [01:38<14:20, 2.93s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|█████████▍ | 38/331 [01:41<14:36, 2.99s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▋ | 39/331 [01:44<14:43, 3.03s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 12%|█████████▉ | 40/331 [01:46<13:23, 2.76s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|██████████▏ | 41/331 [01:48<12:49, 2.65s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▍ | 42/331 [01:52<13:40, 2.84s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▋ | 43/331 [01:55<14:27, 3.01s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▉ | 44/331 [01:58<14:58, 3.13s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▏ | 45/331 [02:01<14:10, 2.97s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▍ | 46/331 [02:03<13:09, 2.77s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▋ | 47/331 [02:06<12:21, 2.61s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▉ | 48/331 [02:08<12:38, 2.68s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████▏ | 49/331 [02:12<13:10, 2.80s/it]g-point operations will not 
be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████▍ | 50/331 [02:14<12:57, 2.77s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████▋ | 51/331 [02:17<13:12, 2.83s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▉ | 52/331 [02:20<12:38, 2.72s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|█████████████▏ | 53/331 [02:22<12:41, 2.74s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|█████████████▍ | 54/331 [02:25<12:01, 2.60s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▋ | 55/331 [02:28<13:01, 2.83s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▊ | 56/331 [02:31<12:50, 2.80s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|██████████████ | 57/331 [02:33<12:28, 2.73s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▎ | 58/331 [02:37<12:57, 2.85s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed 18%|██████████████▌ | 59/331 [02:39<12:15, 2.70s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▊ | 60/331 [02:41<11:55, 2.64s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|███████████████ | 61/331 [02:44<12:14, 2.72s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▎ | 62/331 [02:47<12:08, 2.71s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▌ | 63/331 [02:50<13:11, 2.95s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▊ | 64/331 [02:53<12:41, 2.85s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|████████████████ | 65/331 [02:56<12:25, 2.80s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|████████████████▎ | 66/331 [03:00<13:36, 3.08s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|████████████████▌ | 67/331 [03:03<14:17, 3.25s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▊ | 68/331 [03:06<14:20, 
3.27s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|█████████████████ | 69/331 [03:10<14:05, 3.23s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|█████████████████▎ | 70/331 [03:13<13:53, 3.19s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|█████████████████▌ | 71/331 [03:16<14:00, 3.23s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▊ | 72/331 [03:19<13:54, 3.22s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|██████████████████ | 73/331 [03:22<13:17, 3.09s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|██████████████████▎ | 74/331 [03:25<13:00, 3.04s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▌ | 75/331 [03:28<13:09, 3.08s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▊ | 76/331 [03:31<12:28, 2.93s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|███████████████████ | 77/331 [03:33<12:11, 2.88s/it]g-point operations will not be computed-01 
02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▎ | 78/331 [03:36<11:43, 2.78s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▌ | 79/331 [03:39<11:18, 2.69s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▊ | 80/331 [03:41<11:13, 2.68s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|████████████████████ | 81/331 [03:44<11:44, 2.82s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████▎ | 82/331 [03:47<11:32, 2.78s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████▎ | 82/331 [03:47<11:32, 2.78s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████▎ | 82/331 [03:47<11:32, 2.78s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████▊ | 84/331 [03:54<12:40, 3.08s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|█████████████████████ | 85/331 [03:56<11:50, 2.89s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed 26%|█████████████████████▎ | 86/331 [04:00<12:29, 3.06s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|█████████████████████▌ | 87/331 [04:02<12:09, 2.99s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▊ | 88/331 [04:05<11:47, 2.91s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|██████████████████████ | 89/331 [04:07<10:59, 2.72s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|██████████████████████▎ | 90/331 [04:10<10:32, 2.63s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|██████████████████████▌ | 91/331 [04:13<11:03, 2.76s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▊ | 92/331 [04:15<10:20, 2.60s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|███████████████████████ | 93/331 [04:18<10:27, 2.64s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|███████████████████████▎ | 94/331 [04:21<10:43, 2.72s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of 
the input, floating-point operations will not be computed 29%|███████████████████████▌ | 95/331 [04:24<10:48, 2.75s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▊ | 96/331 [04:26<10:47, 2.76s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|████████████████████████ | 97/331 [04:29<10:13, 2.62s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▎ | 98/331 [04:32<10:32, 2.71s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▌ | 99/331 [04:34<10:30, 2.72s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▍ | 100/331 [04:37<10:08, 2.63s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▋ | 101/331 [04:39<10:02, 2.62s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▉ | 102/331 [04:43<10:54, 2.86s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▏ | 103/331 [04:45<10:26, 2.75s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of 
the input, floating-point operations will not be computed 31%|█████████████████████████▍ | 104/331 [04:48<10:22, 2.74s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|█████████████████████████▋ | 105/331 [04:51<10:23, 2.76s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|█████████████████████████▉ | 106/331 [04:53<10:12, 2.72s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|██████████████████████████▏ | 107/331 [04:56<09:33, 2.56s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|██████████████████████████▍ | 108/331 [04:58<09:22, 2.52s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|██████████████████████████▋ | 109/331 [05:01<09:21, 2.53s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|██████████████████████████▉ | 110/331 [05:03<09:47, 2.66s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▏ | 111/331 [05:06<09:54, 2.70s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▍ | 112/331 [05:09<09:54, 2.72s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate 
the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▋ | 113/331 [05:11<09:25, 2.59s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▉ | 114/331 [05:14<09:30, 2.63s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|████████████████████████████▏ | 115/331 [05:17<09:32, 2.65s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|████████████████████████████▍ | 116/331 [05:20<09:47, 2.73s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|████████████████████████████▋ | 117/331 [05:22<09:46, 2.74s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▉ | 118/331 [05:25<09:30, 2.68s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|█████████████████████████████ | 119/331 [05:28<09:28, 2.68s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|█████████████████████████████▎ | 120/331 [05:30<09:26, 2.68s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▌ | 121/331 [05:33<09:47, 2.80s/it]g-point operations will not be 
computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▊ | 122/331 [05:36<09:36, 2.76s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████ | 123/331 [05:39<10:10, 2.94s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████▎ | 124/331 [05:42<10:00, 2.90s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▌ | 125/331 [05:46<10:31, 3.07s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▊ | 126/331 [05:49<10:37, 3.11s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|███████████████████████████████ | 127/331 [05:52<11:02, 3.25s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▎ | 128/331 [05:56<11:03, 3.27s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▌ | 129/331 [05:59<10:49, 3.22s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▊ | 
130/331 [06:02<10:55, 3.26s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████ | 131/331 [06:06<11:06, 3.33s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████ | 131/331 [06:06<11:06, 3.33s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████ | 131/331 [06:06<11:06, 3.33s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▌ | 133/331 [06:11<09:44, 2.95s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▊ | 134/331 [06:14<09:24, 2.86s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|█████████████████████████████████ | 135/331 [06:17<09:34, 2.93s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|█████████████████████████████████▎ | 136/331 [06:20<09:51, 3.03s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|█████████████████████████████████▌ | 137/331 [06:24<10:14, 3.17s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 42%|█████████████████████████████████▊ | 138/331 [06:27<10:28, 3.26s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 42%|██████████████████████████████████ | 139/331 [06:29<09:19, 2.91s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 42%|██████████████████████████████████▎ | 140/331 [06:33<09:52, 3.10s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▌ | 141/331 [06:35<09:19, 2.95s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▋ | 142/331 [06:38<09:02, 2.87s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▉ | 143/331 [06:41<09:24, 3.00s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▏ | 144/331 [06:44<09:00, 2.89s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▍ | 145/331 [06:47<08:56, 2.89s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▋ | 146/331 [06:50<09:24, 3.05s/it]g-point 
operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▉ | 147/331 [06:53<09:05, 2.96s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▏ | 148/331 [06:55<08:30, 2.79s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▍ | 149/331 [06:58<08:01, 2.64s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▋ | 150/331 [07:01<08:20, 2.77s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|████████████████████████████████████▉ | 151/331 [07:03<08:10, 2.73s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▏ | 152/331 [07:06<07:44, 2.60s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▍ | 153/331 [07:08<07:37, 2.57s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|█████████████████████████████████████▋ | 154/331 [07:11<07:58, 2.71s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 47%|█████████████████████████████████████▉ | 155/331 [07:14<08:21, 2.85s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▏ | 156/331 [07:17<08:33, 2.93s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▍ | 157/331 [07:21<08:56, 3.08s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▋ | 158/331 [07:24<09:02, 3.14s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▉ | 159/331 [07:27<09:08, 3.19s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████▏ | 160/331 [07:30<08:37, 3.02s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▍ | 161/331 [07:33<08:23, 2.96s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▋ | 162/331 [07:36<08:45, 3.11s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▉ | 163/331 
[07:40<08:50, 3.16s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▏ | 164/331 [07:42<08:21, 3.00s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▍ | 165/331 [07:45<08:11, 2.96s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▌ | 166/331 [07:48<07:57, 2.90s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▊ | 167/331 [07:51<08:08, 2.98s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████ | 168/331 [07:54<07:43, 2.84s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▎ | 169/331 [07:57<07:49, 2.90s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▌ | 170/331 [07:59<07:29, 2.79s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|█████████████████████████████████████████▊ | 171/331 [08:02<07:27, 2.80s/it]g-point operations will not be computed-01 02:38:10,941 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████ | 172/331 [08:04<07:08, 2.69s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▎ | 173/331 [08:07<07:19, 2.78s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▌ | 174/331 [08:10<06:57, 2.66s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▊ | 175/331 [08:13<07:05, 2.73s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████ | 176/331 [08:15<06:51, 2.65s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████▎ | 177/331 [08:18<07:08, 2.79s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▌ | 178/331 [08:22<07:35, 2.98s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▊ | 179/331 [08:25<07:56, 3.14s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed 54%|████████████████████████████████████████████ | 180/331 [08:28<07:49, 3.11s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▎ | 181/331 [08:31<07:41, 3.08s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▌ | 182/331 [08:34<07:05, 2.86s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▊ | 183/331 [08:36<06:34, 2.67s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████ | 184/331 [08:38<06:07, 2.50s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▎ | 185/331 [08:40<05:41, 2.34s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▌ | 186/331 [08:42<05:51, 2.43s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▊ | 187/331 [08:46<06:20, 2.64s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
57%|██████████████████████████████████████████████ | 188/331 [08:48<06:17, 2.64s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|██████████████████████████████████████████████▎ | 189/331 [08:50<05:57, 2.52s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|██████████████████████████████████████████████▍ | 190/331 [08:53<05:41, 2.43s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 58%|██████████████████████████████████████████████▋ | 191/331 [08:55<05:38, 2.42s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 58%|██████████████████████████████████████████████▉ | 192/331 [08:57<05:27, 2.36s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 58%|███████████████████████████████████████████████▏ | 193/331 [09:00<05:54, 2.57s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|███████████████████████████████████████████████▍ | 194/331 [09:03<05:38, 2.47s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|███████████████████████████████████████████████▋ | 195/331 [09:05<05:32, 2.44s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
59%|███████████████████████████████████████████████▉ | 196/331 [09:08<05:38, 2.51s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▏ | 197/331 [09:11<05:53, 2.64s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▍ | 198/331 [09:13<05:34, 2.52s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▋ | 199/331 [09:15<05:39, 2.57s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▉ | 200/331 [09:18<05:24, 2.48s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▏ | 201/331 [09:20<05:18, 2.45s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▍ | 202/331 [09:23<05:29, 2.55s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▋ | 203/331 [09:26<05:28, 2.56s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
62%|█████████████████████████████████████████████████▉ | 204/331 [09:29<05:50, 2.76s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▏ | 205/331 [09:32<05:54, 2.81s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▍ | 206/331 [09:34<05:49, 2.80s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|██████████████████████████████████████████████████▋ | 207/331 [09:38<06:04, 2.94s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|██████████████████████████████████████████████████▉ | 208/331 [09:41<06:06, 2.98s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▏ | 209/331 [09:43<05:34, 2.74s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▍ | 210/331 [09:45<05:08, 2.55s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▋ | 211/331 [09:48<05:10, 2.59s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
64%|███████████████████████████████████████████████████▉ | 212/331 [09:50<04:56, 2.49s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|████████████████████████████████████████████████████ | 213/331 [09:53<04:56, 2.52s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▎ | 214/331 [09:55<04:41, 2.41s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▌ | 215/331 [09:57<04:31, 2.34s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▊ | 216/331 [10:00<05:00, 2.62s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████ | 217/331 [10:03<05:00, 2.64s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████▎ | 218/331 [10:06<05:13, 2.78s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████▌ | 219/331 [10:09<05:08, 2.75s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed 66%|█████████████████████████████████████████████████████▊ | 220/331 [10:11<04:55, 2.67s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|██████████████████████████████████████████████████████ | 221/331 [10:14<04:58, 2.71s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|██████████████████████████████████████████████████████▎ | 222/331 [10:16<04:44, 2.61s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|██████████████████████████████████████████████████████▌ | 223/331 [10:19<04:47, 2.66s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▊ | 224/331 [10:22<04:47, 2.68s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████ | 225/331 [10:25<04:45, 2.70s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████▎ | 226/331 [10:28<04:57, 2.83s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▌ | 227/331 [10:30<04:49, 2.78s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▊ | 228/331 [10:33<04:40, 2.72s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|████████████████████████████████████████████████████████ | 229/331 [10:36<04:36, 2.71s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|████████████████████████████████████████████████████████▎ | 230/331 [10:38<04:28, 2.66s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▌ | 231/331 [10:41<04:34, 2.74s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▊ | 232/331 [10:44<04:28, 2.72s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|█████████████████████████████████████████████████████████ | 233/331 [10:47<04:34, 2.80s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▎ | 234/331 [10:49<04:20, 2.69s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▌ | 235/331 [10:52<04:11, 2.62s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▊ | 236/331 [10:55<04:36, 2.91s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|█████████████████████████████████████████████████████████▉ | 237/331 [10:59<04:46, 3.04s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▏ | 238/331 [11:02<04:40, 3.02s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▍ | 239/331 [11:05<04:38, 3.03s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|██████████████████████████████████████████████████████████▋ | 240/331 [11:08<04:42, 3.10s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|██████████████████████████████████████████████████████████▉ | 241/331 [11:11<04:45, 3.17s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▏ | 242/331 [11:15<04:45, 3.21s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▍ | 243/331 [11:18<04:44, 3.23s/it]g-point 
operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▋ | 244/331 [11:21<04:50, 3.34s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▉ | 245/331 [11:24<04:36, 3.21s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|████████████████████████████████████████████████████████████▏ | 246/331 [11:28<04:45, 3.35s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|████████████████████████████████████████████████████████████▍ | 247/331 [11:31<04:32, 3.24s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|████████████████████████████████████████████████████████████▋ | 248/331 [11:34<04:13, 3.05s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|████████████████████████████████████████████████████████████▉ | 249/331 [11:36<03:53, 2.85s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|█████████████████████████████████████████████████████████████▏ | 250/331 [11:38<03:39, 2.71s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
76%|█████████████████████████████████████████████████████████████▍ | 251/331 [11:41<03:40, 2.76s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|█████████████████████████████████████████████████████████████▋ | 252/331 [11:44<03:28, 2.64s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|█████████████████████████████████████████████████████████████▉ | 253/331 [11:47<03:37, 2.79s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|██████████████████████████████████████████████████████████████▏ | 254/331 [11:49<03:32, 2.76s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|██████████████████████████████████████████████████████████████▍ | 255/331 [11:52<03:36, 2.84s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|██████████████████████████████████████████████████████████████▋ | 256/331 [11:55<03:28, 2.79s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|██████████████████████████████████████████████████████████████▉ | 257/331 [11:58<03:34, 2.90s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|███████████████████████████████████████████████████████████████▏ | 258/331 [12:01<03:19, 2.73s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed 78%|███████████████████████████████████████████████████████████████▍ | 259/331 [12:03<03:13, 2.69s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|███████████████████████████████████████████████████████████████▋ | 260/331 [12:06<03:15, 2.76s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|███████████████████████████████████████████████████████████████▊ | 261/331 [12:08<03:01, 2.59s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|████████████████████████████████████████████████████████████████ | 262/331 [12:11<03:00, 2.61s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|████████████████████████████████████████████████████████████████▎ | 263/331 [12:14<03:08, 2.77s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|████████████████████████████████████████████████████████████████▌ | 264/331 [12:17<02:59, 2.68s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|████████████████████████████████████████████████████████████████▊ | 265/331 [12:19<02:53, 2.63s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|█████████████████████████████████████████████████████████████████ | 
266/331 [12:21<02:45, 2.55s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|█████████████████████████████████████████████████████████████████▎ | 267/331 [12:25<02:56, 2.76s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|█████████████████████████████████████████████████████████████████▌ | 268/331 [12:27<02:53, 2.76s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|█████████████████████████████████████████████████████████████████▊ | 269/331 [12:31<03:01, 2.93s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|██████████████████████████████████████████████████████████████████ | 270/331 [12:34<02:56, 2.89s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|██████████████████████████████████████████████████████████████████▎ | 271/331 [12:37<02:59, 2.99s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|██████████████████████████████████████████████████████████████████▌ | 272/331 [12:39<02:49, 2.88s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|██████████████████████████████████████████████████████████████████▊ | 273/331 [12:42<02:48, 2.91s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 83%|███████████████████████████████████████████████████████████████████ | 274/331 [12:46<02:55, 3.08s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 83%|███████████████████████████████████████████████████████████████████▎ | 275/331 [12:49<02:55, 3.13s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 83%|███████████████████████████████████████████████████████████████████▌ | 276/331 [12:52<02:42, 2.95s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|███████████████████████████████████████████████████████████████████▊ | 277/331 [12:54<02:35, 2.88s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|████████████████████████████████████████████████████████████████████ | 278/331 [12:57<02:30, 2.83s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|████████████████████████████████████████████████████████████████████▎ | 279/331 [13:01<02:38, 3.05s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 85%|████████████████████████████████████████████████████████████████████▌ | 280/331 [13:04<02:32, 3.00s/it]g-point operations will not be computed-01 02:38:10,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 85%|████████████████████████████████████████████████████████████████████▊ | 281/331 
[13:07<02:33, 3.07s/it]
85%|█████████████████████████████████████████████████████████████████████ | 282/331 [13:10<02:30, 3.07s/it]
85%|█████████████████████████████████████████████████████████████████████▎ | 283/331 [13:13<02:31, 3.16s/it]
86%|█████████████████████████████████████████████████████████████████████▍ | 284/331 [13:17<02:32, 3.25s/it]
86%|█████████████████████████████████████████████████████████████████████▋ | 285/331 [13:20<02:32, 3.32s/it]
86%|█████████████████████████████████████████████████████████████████████▉ | 286/331 [13:24<02:30, 3.35s/it]
87%|██████████████████████████████████████████████████████████████████████▏ | 287/331 [13:27<02:31, 3.45s/it]
87%|██████████████████████████████████████████████████████████████████████▍ | 288/331 [13:31<02:27, 3.43s/it]
87%|██████████████████████████████████████████████████████████████████████▋ | 289/331 [13:33<02:15, 3.23s/it]
88%|██████████████████████████████████████████████████████████████████████▉ | 290/331 [13:36<02:04, 3.05s/it]
88%|███████████████████████████████████████████████████████████████████████▏ | 291/331 [13:39<01:55, 2.89s/it]
88%|███████████████████████████████████████████████████████████████████████▍ | 292/331 [13:41<01:50, 2.83s/it]
89%|███████████████████████████████████████████████████████████████████████▋ | 293/331 [13:44<01:47, 2.83s/it]
89%|███████████████████████████████████████████████████████████████████████▉ | 294/331 [13:46<01:39, 2.69s/it]
89%|████████████████████████████████████████████████████████████████████████▏ | 295/331 [13:49<01:34, 2.63s/it]
89%|████████████████████████████████████████████████████████████████████████▍ | 296/331 [13:51<01:29, 2.56s/it]
90%|████████████████████████████████████████████████████████████████████████▋ | 297/331 [13:55<01:36, 2.83s/it]
90%|████████████████████████████████████████████████████████████████████████▉ | 298/331 [13:58<01:40, 3.06s/it]
90%|█████████████████████████████████████████████████████████████████████████▏ | 299/331 [14:01<01:33, 2.93s/it]
91%|█████████████████████████████████████████████████████████████████████████▍ | 300/331 [14:04<01:30, 2.92s/it]
91%|█████████████████████████████████████████████████████████████████████████▋ | 301/331 [14:07<01:26, 2.87s/it]
91%|█████████████████████████████████████████████████████████████████████████▉ | 302/331 [14:09<01:21, 2.82s/it]
92%|██████████████████████████████████████████████████████████████████████████▏ | 303/331 [14:12<01:16, 2.74s/it]
92%|██████████████████████████████████████████████████████████████████████████▍ | 304/331 [14:15<01:16, 2.82s/it]
92%|██████████████████████████████████████████████████████████████████████████▋ | 305/331 [14:18<01:16, 2.94s/it]
92%|██████████████████████████████████████████████████████████████████████████▉ | 306/331 [14:22<01:17, 3.09s/it]
93%|███████████████████████████████████████████████████████████████████████████▏ | 307/331 [14:25<01:17, 3.23s/it]
93%|███████████████████████████████████████████████████████████████████████████▎ | 308/331 [14:29<01:18, 3.40s/it]
94%|███████████████████████████████████████████████████████████████████████████▊ | 310/331 [14:35<01:07, 3.20s/it]
94%|████████████████████████████████████████████████████████████████████████████ | 311/331 [14:38<01:04, 3.21s/it]
94%|████████████████████████████████████████████████████████████████████████████▎ | 312/331 [14:41<00:57, 3.00s/it]
95%|████████████████████████████████████████████████████████████████████████████▌ | 313/331 [14:44<00:52, 2.93s/it]
95%|████████████████████████████████████████████████████████████████████████████▊ | 314/331 [14:47<00:50, 2.97s/it]
95%|█████████████████████████████████████████████████████████████████████████████ | 315/331 [14:50<00:49, 3.06s/it]
95%|█████████████████████████████████████████████████████████████████████████████▎ | 316/331 [14:53<00:46, 3.08s/it]
96%|█████████████████████████████████████████████████████████████████████████████▌ | 317/331 [14:57<00:45, 3.22s/it]
96%|█████████████████████████████████████████████████████████████████████████████▊ | 318/331 [14:59<00:39, 3.04s/it]
96%|██████████████████████████████████████████████████████████████████████████████ | 319/331 [15:02<00:35, 2.92s/it]
97%|██████████████████████████████████████████████████████████████████████████████▎ | 320/331 [15:05<00:32, 2.95s/it]
97%|██████████████████████████████████████████████████████████████████████████████▌ | 321/331 [15:08<00:28, 2.89s/it]
97%|██████████████████████████████████████████████████████████████████████████████▊ | 322/331 [15:11<00:27, 3.06s/it]
98%|███████████████████████████████████████████████████████████████████████████████ | 323/331 [15:14<00:23, 2.96s/it]
98%|███████████████████████████████████████████████████████████████████████████████▎ | 324/331 [15:17<00:21, 3.06s/it]
98%|███████████████████████████████████████████████████████████████████████████████▌ | 325/331 [15:20<00:18, 3.08s/it]
98%|███████████████████████████████████████████████████████████████████████████████▊ | 326/331 [15:24<00:15, 3.14s/it]
99%|████████████████████████████████████████████████████████████████████████████████ | 327/331 [15:27<00:12, 3.14s/it]
99%|████████████████████████████████████████████████████████████████████████████████▎| 328/331 [15:30<00:09, 3.14s/it]
99%|████████████████████████████████████████████████████████████████████████████████▌| 329/331 [15:33<00:06, 3.08s/it]
100%|████████████████████████████████████████████████████████████████████████████████▊| 330/331 [15:36<00:03, 3.23s/it]
[INFO|configuration_utils.py:438] 2022-03-01 02:54:10,274 >> Configuration saved in ./checkpoint-1000/config.json
03/01/2022 02:54:10 - INFO - datasets.metric - Removing /home/sanchit_huggingface_co/.cache/huggingface/metrics/wer/default/default_experiment-1-0.arrow
[INFO|feature_extraction_utils.py:324] 2022-03-01 02:54:15,236 >> Configuration saved in ./checkpoint-1000/preprocessor_config.json
56%|████████████████████████████████████████▉ | 1001/1784 [1:25:03<69:58:23, 321.72s/it]
56%|█████████████████████████████████████████ | 1002/1784 [1:25:07<49:09:35, 226.31s/it]
56%|█████████████████████████████████████████ | 1003/1784 [1:25:11<34:36:38, 159.54s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:56:28,800 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2058, 'learning_rate': 1.8317757009345794e-05, 'epoch': 0.56}
56%|█████████████████████████████████████████▋ | 1005/1784 [1:25:18<17:19:36, 80.07s/it]
{'loss': 4.1705, 'learning_rate': 1.8294392523364487e-05, 'epoch': 0.56}
56%|█████████████████████████████████████████▋ | 1006/1784 [1:25:22<12:21:08, 57.16s/it]
{'loss': 4.2186, 'learning_rate': 1.8271028037383177e-05, 'epoch': 0.56}
56%|██████████████████████████████████████████▎ | 1007/1784 [1:25:26<8:51:49, 41.07s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:56:43,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.4482, 'learning_rate': 1.822429906542056e-05, 'epoch': 0.57}
57%|██████████████████████████████████████████▍ | 1009/1784 [1:25:33<4:43:45, 21.97s/it]
{'loss': 4.3113, 'learning_rate': 1.820093457943925e-05, 'epoch': 0.57}
57%|██████████████████████████████████████████▍ | 1010/1784 [1:25:36<3:32:23, 16.46s/it]
{'loss': 4.0683, 'learning_rate': 1.8177570093457944e-05, 'epoch': 0.57}
57%|██████████████████████████████████████████▌ | 1011/1784 [1:25:40<2:42:10, 12.59s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:56:57,605 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1346, 'learning_rate': 1.8130841121495328e-05, 'epoch': 0.57}
57%|██████████████████████████████████████████▌ | 1013/1784 [1:25:47<1:42:00, 7.94s/it]
{'loss': 4.2404, 'learning_rate': 1.8107476635514018e-05, 'epoch': 0.57}
57%|██████████████████████████████████████████▋ | 1014/1784 [1:25:50<1:24:44, 6.60s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:57:08,030 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.6737, 'learning_rate': 1.8060747663551404e-05, 'epoch': 0.57}
57%|██████████████████████████████████████████▋ | 1016/1784 [1:25:57<1:04:17, 5.02s/it]
{'loss': 4.2312, 'learning_rate': 1.803738317757009e-05, 'epoch': 0.57}
57%|███████████████████████████████████████████▉ | 1017/1784 [1:26:01<58:07, 4.55s/it]
{'loss': 4.2907, 'learning_rate': 1.8014018691588784e-05, 'epoch': 0.57}
57%|███████████████████████████████████████████▉ | 1018/1784 [1:26:04<53:44, 4.21s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:57:21,863 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.3023, 'learning_rate': 1.796728971962617e-05, 'epoch': 0.57}
57%|████████████████████████████████████████████ | 1020/1784 [1:26:11<48:32, 3.81s/it]
{'loss': 4.262, 'learning_rate': 1.7943925233644858e-05, 'epoch': 0.57}
57%|████████████████████████████████████████████ | 1021/1784 [1:26:15<47:00, 3.70s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:57:32,092 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2461, 'learning_rate': 1.7897196261682245e-05, 'epoch': 0.57}
57%|████████████████████████████████████████████▏ | 1023/1784 [1:26:21<44:53, 3.54s/it]
{'loss': 4.175, 'learning_rate': 1.7873831775700935e-05, 'epoch': 0.57}
57%|████████████████████████████████████████████▏ | 1024/1784 [1:26:25<44:03, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:57:40,572 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
57%|████████████████████████████████████████████▏ | 1025/1784 [1:26:28<43:30, 3.44s/it]
{'loss': 4.245, 'learning_rate': 1.7827102803738318e-05, 'epoch': 0.57}
58%|████████████████████████████████████████████▎ | 1026/1784 [1:26:31<42:31, 3.37s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:57:48,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2568, 'learning_rate': 1.77803738317757e-05, 'epoch': 0.58}
58%|████████████████████████████████████████████▎ | 1028/1784 [1:26:38<41:23, 3.28s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:57:55,003 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1602, 'learning_rate': 1.7733644859813085e-05, 'epoch': 0.58}
58%|████████████████████████████████████████████▍ | 1030/1784 [1:26:44<40:29, 3.22s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:58:01,291 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2435, 'learning_rate': 1.7686915887850468e-05, 'epoch': 0.58}
58%|████████████████████████████████████████████▌ | 1032/1784 [1:26:50<39:44, 3.17s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:58:07,502 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
58%|████████████████████████████████████████████▋ | 1034/1784 [1:26:56<39:04, 3.13s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:58:13,569 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
58%|████████████████████████████████████████████▋ | 1036/1784 [1:27:02<37:54, 3.04s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 02:58:19,379 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2128, 'learning_rate': 1.7546728971962615e-05, 'epoch': 0.58}
58%|████████████████████████████████████████████▊ | 1038/1784 [1:27:08<36:26, 2.93s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
58%|████████████████████████████████████████████▊ | 
1039/1784 [1:27:11<35:14, 2.84s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 58%|████████████████████████████████████████████▊ | 1039/1784 [1:27:11<35:14, 2.84s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:27,285 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:27,285 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:29,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:29,634 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:31,866 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:31,866 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:33,889 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:33,889 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:35,748 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:35,748 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:37,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:37,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:38,904 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:38,904 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:41,663 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:41,663 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2418, 'learning_rate': 1.7313084112149535e-05, 'epoch': 0.59} [WARNING|modeling_utils.py:388] 2022-03-01 02:58:42,899 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:42,899 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:44,607 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:44,607 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.1032, 'learning_rate': 1.724299065420561e-05, 'epoch': 0.59} [WARNING|modeling_utils.py:388] 2022-03-01 02:58:44,607 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:48,532 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:48,532 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:52,272 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:52,272 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:58:52,272 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|█████████████████████████████████████████████▍ | 1053/1784 [1:27:42<36:49, 3.02s/it]g-point operations will not be 
computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|█████████████████████████████████████████████▍ | 1053/1784 [1:27:42<36:49, 3.02s/it]g-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|█████████████████████████████████████████████▍ | 1053/1784 [1:27:42<36:49, 3.02s/it]g-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|█████████████████████████████████████████████▍ | 1054/1784 [1:27:46<39:12, 3.22s/it]g-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|█████████████████████████████████████████████▍ | 1054/1784 [1:27:46<39:12, 3.22s/it]g-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|█████████████████████████████████████████████▍ | 1054/1784 [1:27:46<39:12, 3.22s/it]g-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|█████████████████████████████████████████████▌ | 1055/1784 [1:27:49<40:33, 3.34s/it]g-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|█████████████████████████████████████████████▌ | 1055/1784 [1:27:49<40:33, 3.34s/it]g-point operations will not be computed-01 02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|█████████████████████████████████████████████▌ | 1055/1784 [1:27:49<40:33, 3.34s/it]g-point operations will not be computed-01 
02:58:23,545 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|█████████████████████████████████████████████▌ | 1056/1784 [1:27:53<41:35, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|█████████████████████████████████████████████▌ | 1056/1784 [1:27:53<41:35, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|█████████████████████████████████████████████▌ | 1057/1784 [1:27:56<42:10, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|█████████████████████████████████████████████▌ | 1057/1784 [1:27:56<42:10, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|█████████████████████████████████████████████▌ | 1057/1784 [1:27:56<42:10, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|█████████████████████████████████████████████▋ | 1058/1784 [1:28:00<42:14, 3.49s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|█████████████████████████████████████████████▋ | 1058/1784 [1:28:00<42:14, 3.49s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|█████████████████████████████████████████████▋ | 1058/1784 [1:28:00<42:14, 3.49s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:59:08,811 >> 
Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|█████████████████████████████████████████████▋ | 1059/1784 [1:28:04<42:27, 3.51s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:59:21,166 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:59:21,166 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0166, 'learning_rate': 1.7009345794392523e-05, 'epoch': 0.59} [WARNING|modeling_utils.py:388] 2022-03-01 02:59:21,166 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|█████████████████████████████████████████████▊ | 1061/1784 [1:28:11<42:20, 3.51s/it]g-point operations will not be computed-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|█████████████████████████████████████████████▊ | 1061/1784 [1:28:11<42:20, 3.51s/it]g-point operations will not be computed-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|█████████████████████████████████████████████▊ | 1061/1784 [1:28:11<42:20, 3.51s/it]g-point operations will not be computed-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed 60%|█████████████████████████████████████████████▊ | 1062/1784 [1:28:14<42:03, 3.49s/it]g-point operations will not be computed-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|█████████████████████████████████████████████▊ | 1062/1784 [1:28:14<42:03, 3.49s/it]g-point operations will not be computed-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|█████████████████████████████████████████████▊ | 1062/1784 [1:28:14<42:03, 3.49s/it]g-point operations will not be computed-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|█████████████████████████████████████████████▉ | 1063/1784 [1:28:17<41:43, 3.47s/it]g-point operations will not be computed-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|█████████████████████████████████████████████▉ | 1063/1784 [1:28:17<41:43, 3.47s/it]g-point operations will not be computed-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:59:34,930 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:59:34,930 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:59:34,930 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:59:08,811 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed 60%|█████████████████████████████████████████████▉ | 1065/1784 [1:28:24<41:15, 3.44s/it]g-point operations will not be computed-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|█████████████████████████████████████████████▉ | 1065/1784 [1:28:24<41:15, 3.44s/it]g-point operations will not be computed-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|█████████████████████████████████████████████▉ | 1065/1784 [1:28:24<41:15, 3.44s/it]g-point operations will not be computed-01 02:59:08,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|██████████████████████████████████████████████ | 1066/1784 [1:28:28<40:46, 3.41s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:59:43,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|██████████████████████████████████████████████ | 1066/1784 [1:28:28<40:46, 3.41s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:59:43,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|██████████████████████████████████████████████ | 1067/1784 [1:28:31<40:38, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:59:43,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|██████████████████████████████████████████████ | 1067/1784 [1:28:31<40:38, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:59:43,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|██████████████████████████████████████████████ | 1067/1784 [1:28:31<40:38, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:59:43,409 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed 60%|██████████████████████████████████████████████ | 1068/1784 [1:28:34<40:37, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 02:59:43,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:59:51,808 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:59:43,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 02:59:51,808 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:59:43,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4147, 'learning_rate': 1.6799065420560746e-05, 'epoch': 0.6} [WARNING|modeling_utils.py:388] 2022-03-01 02:59:51,808 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:59:43,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|██████████████████████████████████████████████▏ | 1070/1784 [1:28:41<40:08, 3.37s/it]g-point operations will not be computed-01 02:59:43,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|██████████████████████████████████████████████▏ | 1070/1784 [1:28:41<40:08, 3.37s/it]g-point operations will not be computed-01 02:59:43,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|██████████████████████████████████████████████▏ | 1070/1784 [1:28:41<40:08, 3.37s/it]g-point operations will not be computed-01 02:59:43,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
60%|██████████████████████████████████████████████▏ | 1071/1784 [1:28:44<39:51, 3.35s/it]g-point operations will not be computed-01 02:59:43,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|██████████████████████████████████████████████▏ | 1071/1784 [1:28:44<39:51, 3.35s/it]g-point operations will not be computed-01 02:59:43,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:00:01,764 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:59:43,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:00:01,764 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:59:43,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:00:01,764 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:59:43,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|██████████████████████████████████████████████▎ | 1073/1784 [1:28:51<39:15, 3.31s/it]g-point operations will not be computed-01 02:59:43,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:00:08,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:59:43,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:00:08,271 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed-01 02:59:43,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3168, 'learning_rate': 1.6682242990654206e-05, 'epoch': 0.6} [WARNING|modeling_utils.py:388] 2022-03-01 03:00:08,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 02:59:43,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|██████████████████████████████████████████████▍ | 1075/1784 [1:28:57<38:44, 3.28s/it]g-point operations will not be computed-01 02:59:43,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|██████████████████████████████████████████████▍ | 1075/1784 [1:28:57<38:44, 3.28s/it]g-point operations will not be computed-01 02:59:43,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|██████████████████████████████████████████████▍ | 1075/1784 [1:28:57<38:44, 3.28s/it]g-point operations will not be computed-01 02:59:43,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|██████████████████████████████████████████████▍ | 1076/1784 [1:29:01<38:18, 3.25s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:00:16,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|██████████████████████████████████████████████▍ | 1076/1784 [1:29:01<38:18, 3.25s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:00:16,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|██████████████████████████████████████████████▍ | 1077/1784 [1:29:04<37:58, 3.22s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:00:16,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
60%|██████████████████████████████████████████████▍ | 1077/1784 [1:29:04<37:58, 3.22s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:00:16,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|██████████████████████████████████████████████▍ | 1077/1784 [1:29:04<37:58, 3.22s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:00:16,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|██████████████████████████████████████████████▌ | 1078/1784 [1:29:07<37:37, 3.20s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:00:16,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|██████████████████████████████████████████████▌ | 1078/1784 [1:29:07<37:37, 3.20s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:00:16,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:00:24,121 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:00:16,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:00:24,121 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:00:16,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|██████████████████████████████████████████████▌ | 1080/1784 [1:29:13<36:55, 3.15s/it]g-point operations will not be computed-01 03:00:16,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|██████████████████████████████████████████████▌ | 1080/1784 [1:29:13<36:55, 3.15s/it]g-point operations will not be computed-01 03:00:16,328 >> Could not estimate the number of tokens of the 
input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:00:30,278 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:00:16,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:00:30,278 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:00:16,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1615, 'learning_rate': 1.6518691588785047e-05, 'epoch': 0.61} [WARNING|modeling_utils.py:388] 2022-03-01 03:00:30,278 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:00:16,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|██████████████████████████████████████████████▋ | 1082/1784 [1:29:19<36:14, 3.10s/it]g-point operations will not be computed-01 03:00:16,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|██████████████████████████████████████████████▋ | 1082/1784 [1:29:19<36:14, 3.10s/it]g-point operations will not be computed-01 03:00:16,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:00:36,315 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:00:16,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:00:39,191 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:00:16,328 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:00:39,191 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:00:16,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2254, 'learning_rate': 1.644859813084112e-05, 'epoch': 0.61} [WARNING|modeling_utils.py:388] 2022-03-01 03:00:39,191 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:00:16,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|██████████████████████████████████████████████▊ | 1085/1784 [1:29:28<34:29, 2.96s/it]g-point operations will not be computed-01 03:00:16,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:00:44,847 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:00:16,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:00:44,847 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:00:16,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2585, 'learning_rate': 1.6401869158878507e-05, 'epoch': 0.61} [WARNING|modeling_utils.py:388] 2022-03-01 03:00:44,847 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:00:16,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|██████████████████████████████████████████████▉ | 1087/1784 [1:29:34<33:36, 2.89s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:00:49,126 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed 61%|██████████████████████████████████████████████▉ | 1087/1784 [1:29:34<33:36, 2.89s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:00:49,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|██████████████████████████████████████████████▉ | 1088/1784 [1:29:36<33:06, 2.85s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:00:49,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:00:53,064 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:00:49,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:00:53,064 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:00:49,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3585, 'learning_rate': 1.633177570093458e-05, 'epoch': 0.61} [WARNING|modeling_utils.py:388] 2022-03-01 03:00:53,064 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:00:49,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|███████████████████████████████████████████████ | 1090/1784 [1:29:41<31:17, 2.70s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:00:56,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|███████████████████████████████████████████████ | 1090/1784 [1:29:41<31:17, 2.70s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:00:56,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
61%|███████████████████████████████████████████████ | 1091/1784 [1:29:44<29:44, 2.58s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:00:59,033 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|███████████████████████████████████████████████▏ | 1092/1784 [1:29:46<28:19, 2.46s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:01:01,127 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|███████████████████████████████████████████████▏ | 1093/1784 [1:29:48<26:45, 2.32s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:01:03,067 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|███████████████████████████████████████████████▏ | 1094/1784 [1:29:50<25:10, 2.19s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:01:04,866 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.3113, 'learning_rate': 1.619158878504673e-05, 'epoch': 0.61} 61%|███████████████████████████████████████████████▎ | 1096/1784 [1:29:53<21:52, 1.91s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:01:06,512 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|███████████████████████████████████████████████▎ | 1097/1784 [1:29:55<20:22, 1.78s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:01:09,453 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|███████████████████████████████████████████████▍ | 1099/1784 [1:29:57<17:35, 1.54s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:01:10,785 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3936, 'learning_rate': 1.6098130841121497e-05, 'epoch': 0.62} [WARNING|modeling_utils.py:388] 2022-03-01 03:01:11,999 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|███████████████████████████████████████████████▍ | 1100/1784 [1:29:59<18:02, 1.58s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:01:15,080 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|███████████████████████████████████████████████▌ | 1101/1784 [1:30:03<25:53, 2.27s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:01:18,856 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|███████████████████████████████████████████████▌ | 1102/1784 [1:30:07<30:53, 2.72s/it] {'loss': 4.3251, 'learning_rate': 1.602803738317757e-05, 'epoch': 0.62} 62%|███████████████████████████████████████████████▌ | 1103/1784 [1:30:10<34:08, 3.01s/it] {'loss': 4.2298, 'learning_rate': 1.6004672897196264e-05, 'epoch': 0.62} 62%|███████████████████████████████████████████████▋ | 1104/1784 [1:30:14<35:58, 3.17s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:01:31,520 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|███████████████████████████████████████████████▋ | 1106/1784 [1:30:21<38:19, 3.39s/it] {'loss': 4.0129, 'learning_rate': 1.5934579439252337e-05, 'epoch': 0.62}
62%|███████████████████████████████████████████████▊ | 1107/1784 [1:30:25<38:47, 3.44s/it] {'loss': 3.8364, 'learning_rate': 1.591121495327103e-05, 'epoch': 0.62} 62%|███████████████████████████████████████████████▊ | 1108/1784 [1:30:28<39:08, 3.47s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:01:45,723 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2688, 'learning_rate': 1.586448598130841e-05, 'epoch': 0.62} 62%|███████████████████████████████████████████████▉ | 1110/1784 [1:30:35<39:12, 3.49s/it]
{'loss': 4.2297, 'learning_rate': 1.5841121495327104e-05, 'epoch': 0.62} 62%|███████████████████████████████████████████████▉ | 1111/1784 [1:30:39<39:02, 3.48s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:01:56,156 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3571, 'learning_rate': 1.5794392523364484e-05, 'epoch': 0.62} 62%|████████████████████████████████████████████████ | 1113/1784 [1:30:46<38:47, 3.47s/it] {'loss': 3.7861, 'learning_rate': 1.5771028037383178e-05, 'epoch': 0.62} 62%|████████████████████████████████████████████████ | 1114/1784 [1:30:49<38:36, 3.46s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:02:06,411 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4363, 'learning_rate': 1.572429906542056e-05, 'epoch': 0.62} 63%|████████████████████████████████████████████████▏ | 1116/1784 [1:30:56<37:55, 3.41s/it] {'loss': 4.2552, 'learning_rate': 1.570093457943925e-05, 'epoch': 0.63} 63%|████████████████████████████████████████████████▏ | 1117/1784 [1:30:59<37:50, 3.40s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:02:16,565 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1694, 'learning_rate': 1.5654205607476634e-05, 'epoch': 0.63} 63%|████████████████████████████████████████████████▎ | 1119/1784 [1:31:06<37:35, 3.39s/it] {'loss': 4.1184, 'learning_rate': 1.5630841121495328e-05, 'epoch': 0.63} 63%|████████████████████████████████████████████████▎ | 1120/1784 [1:31:09<37:21, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:02:24,981 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
63%|████████████████████████████████████████████████▍ | 1121/1784 [1:31:13<37:10, 3.36s/it] {'loss': 4.214, 'learning_rate': 1.558411214953271e-05, 'epoch': 0.63} 63%|████████████████████████████████████████████████▍ | 1122/1784 [1:31:16<36:53, 3.34s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:02:33,142 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0021, 'learning_rate': 1.5537383177570095e-05, 'epoch': 0.63} 63%|████████████████████████████████████████████████▌ | 1124/1784 [1:31:22<36:10, 3.29s/it]
{'loss': 4.0581, 'learning_rate': 1.5514018691588785e-05, 'epoch': 0.63} 63%|████████████████████████████████████████████████▌ | 1125/1784 [1:31:25<35:51, 3.27s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:02:41,221 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|████████████████████████████████████████████████▌ | 1126/1784 [1:31:29<35:35, 3.25s/it] 63%|████████████████████████████████████████████████▋ | 1127/1784 [1:31:32<35:17, 3.22s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:02:47,584 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|████████████████████████████████████████████████▋ | 1128/1784 [1:31:35<35:04, 3.21s/it] 63%|████████████████████████████████████████████████▋ | 1129/1784 [1:31:38<34:56, 3.20s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:02:53,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|████████████████████████████████████████████████▊ | 1130/1784 [1:31:41<34:42, 3.18s/it] {'loss': 4.47, 'learning_rate': 1.5373831775700935e-05, 'epoch': 0.63} 63%|████████████████████████████████████████████████▊ | 1131/1784 [1:31:44<34:17, 3.15s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:03:00,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|████████████████████████████████████████████████▊ | 1132/1784 [1:31:47<33:52, 3.12s/it] 64%|████████████████████████████████████████████████▉ | 1133/1784 [1:31:51<33:33, 3.09s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:03:07,561 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|████████████████████████████████████████████████▉ | 1135/1784 [1:31:56<32:35, 3.01s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:03:11,975 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|█████████████████████████████████████████████████ | 1136/1784 [1:31:59<32:04, 2.97s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:03:16,205 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|█████████████████████████████████████████████████ | 1138/1784 [1:32:05<31:07, 2.89s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:03:21,691 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:03:24,349 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:03:24,349 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:03:11,975 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2967, 'learning_rate': 1.5140186915887848e-05, 'epoch': 0.64} 64%|█████████████████████████████████████████████████▏ | 1141/1784 [1:32:13<29:02, 2.71s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:03:28,184 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|█████████████████████████████████████████████████▏ | 1141/1784 [1:32:13<29:02, 2.71s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:03:28,184 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|█████████████████████████████████████████████████▎ | 1142/1784 [1:32:15<28:09, 2.63s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:03:30,581 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|█████████████████████████████████████████████████▎ | 1142/1784 [1:32:15<28:09, 2.63s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:03:30,581 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|█████████████████████████████████████████████████▎ | 1143/1784 [1:32:18<27:08, 2.54s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:03:30,581 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|█████████████████████████████████████████████████▎ | 1143/1784 [1:32:18<27:08, 2.54s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:03:30,581 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:03:33,830 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed-01 03:03:30,581 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:03:33,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:03:30,581 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:03:35,828 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:03:30,581 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:03:35,828 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:03:30,581 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:03:37,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:03:30,581 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:03:37,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:03:30,581 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2848, 'learning_rate': 1.4976635514018692e-05, 'epoch': 0.64} [WARNING|modeling_utils.py:388] 2022-03-01 03:03:39,219 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:03:30,581 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:03:39,219 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed-01 03:03:30,581 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:03:41,788 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:03:30,581 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:03:41,788 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:03:30,581 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:03:43,460 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:03:30,581 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:03:43,460 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:03:30,581 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:03:43,460 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:03:30,581 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:03:47,359 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:03:30,581 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:03:47,359 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed 65%|█████████████████████████████████████████████████▋ | 1152/1784 [1:32:37<28:51, 2.74s/it] 65%|█████████████████████████████████████████████████▊ | 1153/1784 [1:32:41<31:43, 3.02s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:03:56,638 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|█████████████████████████████████████████████████▊ | 1154/1784 [1:32:44<33:36, 3.20s/it] {'loss': 4.2827, 'learning_rate':
1.4813084112149532e-05, 'epoch': 0.65} 65%|█████████████████████████████████████████████████▊ | 1155/1784 [1:32:48<34:44, 3.31s/it] {'loss': 4.2893, 'learning_rate': 1.4789719626168226e-05, 'epoch': 0.65} 65%|█████████████████████████████████████████████████▉ | 1156/1784 [1:32:51<35:31, 3.39s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:04:09,114 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
65%|█████████████████████████████████████████████████▉ | 1158/1784 [1:32:59<36:17, 3.48s/it] {'loss': 4.1321, 'learning_rate': 1.4719626168224299e-05, 'epoch': 0.65} 65%|██████████████████████████████████████████████████ | 1159/1784 [1:33:02<36:30, 3.50s/it] {'loss': 4.0284, 'learning_rate': 1.4696261682242993e-05, 'epoch': 0.65} 65%|██████████████████████████████████████████████████ | 1160/1784 [1:33:06<36:24, 3.50s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:04:23,135 >>
Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3155, 'learning_rate': 1.4649532710280374e-05, 'epoch': 0.65} 65%|██████████████████████████████████████████████████▏ | 1162/1784 [1:33:13<35:59, 3.47s/it] {'loss': 4.1251, 'learning_rate': 1.4626168224299066e-05, 'epoch': 0.65} 65%|██████████████████████████████████████████████████▏ | 1163/1784 [1:33:16<35:47, 3.46s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:04:33,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0236, 'learning_rate': 1.457943925233645e-05,
'epoch': 0.65} 65%|██████████████████████████████████████████████████▎ | 1165/1784 [1:33:23<35:28, 3.44s/it] {'loss': 4.0506, 'learning_rate': 1.4556074766355141e-05, 'epoch': 0.65} 65%|██████████████████████████████████████████████████▎ | 1166/1784 [1:33:26<35:12, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:04:41,931 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|██████████████████████████████████████████████████▎ | 1167/1784 [1:33:29<34:58, 3.40s/it] {'loss': 4.0952, 'learning_rate': 1.4509345794392524e-05, 'epoch': 0.65} 65%|██████████████████████████████████████████████████▍ | 1168/1784
[1:33:33<34:52, 3.40s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:04:50,314 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|██████████████████████████████████████████████████▍ | 1170/1784 [1:33:40<34:17, 3.35s/it] {'loss': 4.3295, 'learning_rate': 1.4439252336448598e-05, 'epoch': 0.66}
66%|██████████████████████████████████████████████████▌ | 1171/1784 [1:33:43<34:08, 3.34s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:05:00,160 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|██████████████████████████████████████████████████▋ | 1173/1784 [1:33:49<33:37, 3.30s/it]
66%|██████████████████████████████████████████████████▋ | 1174/1784 [1:33:53<33:27, 3.29s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:05:08,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|██████████████████████████████████████████████████▋ | 1175/1784 [1:33:56<33:20, 3.28s/it] 66%|██████████████████████████████████████████████████▊ | 1176/1784 [1:33:59<33:01, 3.26s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:05:16,318 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2658, 'learning_rate': 1.427570093457944e-05, 'epoch': 0.66}
66%|██████████████████████████████████████████████████▊ | 1178/1784 [1:34:05<32:23, 3.21s/it] {'loss': 4.2669, 'learning_rate': 1.4252336448598131e-05, 'epoch': 0.66} [WARNING|modeling_utils.py:388] 2022-03-01 03:05:22,616 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|██████████████████████████████████████████████████▉ | 1180/1784 [1:34:12<31:52, 3.17s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:05:28,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0168, 'learning_rate': 1.4182242990654207e-05, 'epoch': 0.66}
66%|███████████████████████████████████████████████████ | 1182/1784 [1:34:18<30:56, 3.08s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:05:34,774 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|███████████████████████████████████████████████████ | 1184/1784 [1:34:24<30:06, 3.01s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:05:39,148 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|███████████████████████████████████████████████████▏ | 1185/1784 [1:34:26<29:36, 2.96s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:05:43,384 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9412, 'learning_rate': 1.4065420560747663e-05, 'epoch': 0.66}
67%|███████████████████████████████████████████████████▏ | 1187/1784 [1:34:32<28:40, 2.88s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:05:47,551 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
67%|███████████████████████████████████████████████████▎ | 1188/1784 [1:34:35<28:01, 2.82s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:05:51,482 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9693, 'learning_rate': 1.399532710280374e-05, 'epoch': 0.67} 67%|███████████████████████████████████████████████████▎ | 1190/1784 [1:34:40<26:46, 2.71s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:05:55,334 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|███████████████████████████████████████████████████▍ | 1191/1784
[1:34:42<25:55, 2.62s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:05:57,678 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|███████████████████████████████████████████████████▍ | 1192/1784 [1:34:45<24:51, 2.52s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:05:59,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|███████████████████████████████████████████████████▍ | 1193/1784 [1:34:47<23:52, 2.42s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:06:02,927 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:06:04,702 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:06:07,860 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0778, 'learning_rate': 1.3808411214953272e-05, 'epoch': 0.67} [WARNING|modeling_utils.py:388] 2022-03-01 03:06:09,203 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:06:12,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.0137, 'learning_rate': 1.3738317757009345e-05, 'epoch': 0.67} [WARNING|modeling_utils.py:388] 2022-03-01 03:06:15,917 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.063, 'learning_rate': 1.3714953271028039e-05, 'epoch': 0.67} [WARNING|modeling_utils.py:388] 2022-03-01 03:06:19,657 >> Could not estimate the number of tokens of the input,
floating-point operations will not be computed 67%|███████████████████████████████████████████████████▉ | 1203/1784 [1:35:09<29:02, 3.00s/it] 67%|███████████████████████████████████████████████████▉ | 1204/1784 [1:35:13<30:51, 3.19s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:06:30,546 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.8923, 'learning_rate': 1.3621495327102804e-05, 'epoch': 0.68} 68%|████████████████████████████████████████████████████ | 1206/1784 [1:35:20<32:29, 3.37s/it]
{'loss': 4.2806, 'learning_rate': 1.3598130841121496e-05, 'epoch': 0.68} 68%|████████████████████████████████████████████████████ | 1207/1784 [1:35:23<32:47, 3.41s/it] 68%|████████████████████████████████████████████████████▏ | 1208/1784 [1:35:27<33:07, 3.45s/it] [WARNING|modeling_utils.py:388] 2022-03-01
03:06:44,667 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:05:59,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0846, 'learning_rate': 1.352803738317757e-05, 'epoch': 0.68} [WARNING|modeling_utils.py:388] 2022-03-01 03:06:44,667 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:05:59,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|████████████████████████████████████████████████████▏ | 1210/1784 [1:35:34<32:57, 3.45s/it]g-point operations will not be computed-01 03:05:59,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|████████████████████████████████████████████████████▏ | 1210/1784 [1:35:34<32:57, 3.45s/it]g-point operations will not be computed-01 03:05:59,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|████████████████████████████████████████████████████▏ | 1210/1784 [1:35:34<32:57, 3.45s/it]g-point operations will not be computed-01 03:05:59,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|████████████████████████████████████████████████████▎ | 1211/1784 [1:35:37<32:42, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:06:53,149 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|████████████████████████████████████████████████████▎ | 1212/1784 [1:35:41<32:33, 3.41s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:06:53,149 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|████████████████████████████████████████████████████▎ | 1212/1784 [1:35:41<32:33, 3.41s/it][WARNING|modeling_utils.py:388] 
2022-03-01 03:06:53,149 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3152, 'learning_rate': 1.3457943925233644e-05, 'epoch': 0.68} 68%|████████████████████████████████████████████████████▎ | 1212/1784 [1:35:41<32:33, 3.41s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:06:53,149 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|████████████████████████████████████████████████████▎ | 1213/1784 [1:35:44<32:24, 3.41s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:06:53,149 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|████████████████████████████████████████████████████▎ | 1213/1784 [1:35:44<32:24, 3.41s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:06:53,149 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|████████████████████████████████████████████████████▎ | 1213/1784 [1:35:44<32:24, 3.41s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:06:53,149 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|████████████████████████████████████████████████████▍ | 1214/1784 [1:35:48<32:22, 3.41s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:06:53,149 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:07:05,002 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:06:53,149 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:07:05,002 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:06:53,149 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed {'loss': 4.1032, 'learning_rate': 1.3387850467289721e-05, 'epoch': 0.68} [WARNING|modeling_utils.py:388] 2022-03-01 03:07:05,002 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:06:53,149 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|████████████████████████████████████████████████████▍ | 1216/1784 [1:35:54<31:57, 3.38s/it]g-point operations will not be computed-01 03:06:53,149 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|████████████████████████████████████████████████████▍ | 1216/1784 [1:35:54<31:57, 3.38s/it]g-point operations will not be computed-01 03:06:53,149 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|████████████████████████████████████████████████████▍ | 1216/1784 [1:35:54<31:57, 3.38s/it]g-point operations will not be computed-01 03:06:53,149 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|████████████████████████████████████████████████████▌ | 1217/1784 [1:35:58<31:52, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|████████████████████████████████████████████████████▌ | 1218/1784 [1:36:01<31:48, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|████████████████████████████████████████████████████▌ | 1218/1784 [1:36:01<31:48, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9534, 'learning_rate': 1.3317757009345794e-05, 'epoch': 
0.68} 68%|████████████████████████████████████████████████████▌ | 1218/1784 [1:36:01<31:48, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|████████████████████████████████████████████████████▌ | 1219/1784 [1:36:04<31:37, 3.36s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:07:21,711 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:07:21,711 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2816, 'learning_rate': 1.3271028037383178e-05, 'epoch': 0.68} [WARNING|modeling_utils.py:388] 2022-03-01 03:07:21,711 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|████████████████████████████████████████████████████▋ | 1221/1784 [1:36:11<31:20, 3.34s/it]g-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|████████████████████████████████████████████████████▋ | 1221/1784 [1:36:11<31:20, 3.34s/it]g-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|████████████████████████████████████████████████████▋ | 
1221/1784 [1:36:11<31:20, 3.34s/it]g-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|████████████████████████████████████████████████████▋ | 1222/1784 [1:36:14<31:04, 3.32s/it]g-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:07:31,517 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:07:31,517 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2744, 'learning_rate': 1.3200934579439253e-05, 'epoch': 0.69} [WARNING|modeling_utils.py:388] 2022-03-01 03:07:31,517 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|████████████████████████████████████████████████████▊ | 1224/1784 [1:36:21<30:23, 3.26s/it]g-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:07:37,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:07:37,916 >> Could not estimate the number of tokens of 
the input, floating-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2383, 'learning_rate': 1.3154205607476636e-05, 'epoch': 0.69} [WARNING|modeling_utils.py:388] 2022-03-01 03:07:37,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|████████████████████████████████████████████████████▉ | 1226/1784 [1:36:27<30:00, 3.23s/it]g-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:07:44,286 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:07:44,286 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0618, 'learning_rate': 1.310747663551402e-05, 'epoch': 0.69} 69%|█████████████████████████████████████████████████████ | 1228/1784 [1:36:33<29:40, 3.20s/it]g-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|█████████████████████████████████████████████████████ | 1228/1784 [1:36:33<29:40, 3.20s/it]g-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:07:50,552 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:07:50,552 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0311, 'learning_rate': 1.3060747663551403e-05, 'epoch': 0.69} 69%|█████████████████████████████████████████████████████ | 1230/1784 [1:36:40<29:06, 3.15s/it]g-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|█████████████████████████████████████████████████████ | 1230/1784 [1:36:40<29:06, 3.15s/it]g-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2391, 'learning_rate': 1.3037383177570093e-05, 'epoch': 0.69} 69%|█████████████████████████████████████████████████████ | 1230/1784 [1:36:40<29:06, 3.15s/it]g-point operations will not be computed-01 03:07:13,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|█████████████████████████████████████████████████████▏ | 1231/1784 [1:36:43<28:59, 3.15s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:07:58,376 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|█████████████████████████████████████████████████████▏ | 1232/1784 [1:36:46<28:41, 3.12s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:07:58,376 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|█████████████████████████████████████████████████████▏ | 1232/1784 
[1:36:46<28:41, 3.12s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:07:58,376 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0725, 'learning_rate': 1.2990654205607477e-05, 'epoch': 0.69} 69%|█████████████████████████████████████████████████████▏ | 1232/1784 [1:36:46<28:41, 3.12s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:07:58,376 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|█████████████████████████████████████████████████████▏ | 1233/1784 [1:36:49<28:15, 3.08s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:08:04,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|█████████████████████████████████████████████████████▎ | 1234/1784 [1:36:52<28:02, 3.06s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:08:04,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|█████████████████████████████████████████████████████▎ | 1234/1784 [1:36:52<28:02, 3.06s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:08:04,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0475, 'learning_rate': 1.294392523364486e-05, 'epoch': 0.69} 69%|█████████████████████████████████████████████████████▎ | 1234/1784 [1:36:52<28:02, 3.06s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:08:04,375 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|█████████████████████████████████████████████████████▎ | 1235/1784 [1:36:55<27:44, 3.03s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:08:10,308 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|█████████████████████████████████████████████████████▎ | 1235/1784 [1:36:55<27:44, 3.03s/it][WARNING|modeling_utils.py:388] 
2022-03-01 03:08:10,308 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|█████████████████████████████████████████████████████▎ | 1236/1784 [1:36:58<27:16, 2.99s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:08:10,308 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:08:14,537 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:10,308 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:08:14,537 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:10,308 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:08:17,278 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:10,308 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:08:17,278 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:10,308 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0024, 'learning_rate': 1.2850467289719627e-05, 'epoch': 0.69} 69%|█████████████████████████████████████████████████████▍ | 1239/1784 [1:37:06<25:41, 2.83s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|█████████████████████████████████████████████████████▍ | 1239/1784 [1:37:06<25:41, 2.83s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:08:21,352 >> Could not 
estimate the number of tokens of the input, floating-point operations will not be computed 70%|█████████████████████████████████████████████████████▌ | 1240/1784 [1:37:08<24:56, 2.75s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|█████████████████████████████████████████████████████▌ | 1240/1784 [1:37:08<24:56, 2.75s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:08:25,032 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:08:25,032 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:08:27,405 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:08:27,405 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:08:29,588 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed [WARNING|modeling_utils.py:388] 2022-03-01 03:08:29,588 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:08:31,624 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:08:31,624 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:08:33,532 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:08:33,532 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:08:35,261 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:08:35,261 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
[WARNING|modeling_utils.py:388] 2022-03-01 03:08:36,792 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:08:36,792 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4063, 'learning_rate': 1.2640186915887852e-05, 'epoch': 0.7} [WARNING|modeling_utils.py:388] 2022-03-01 03:08:39,356 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:08:39,356 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:08:41,024 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:08:41,024 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5105, 'learning_rate': 1.2570093457943925e-05, 'epoch': 0.7} [WARNING|modeling_utils.py:388] 2022-03-01 03:08:44,872 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 
>> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:08:44,872 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2447, 'learning_rate': 1.2546728971962617e-05, 'epoch': 0.7} 70%|██████████████████████████████████████████████████████ | 1252/1784 [1:37:34<23:53, 2.69s/it]g-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|██████████████████████████████████████████████████████ | 1252/1784 [1:37:34<23:53, 2.69s/it]g-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1262, 'learning_rate': 1.2523364485981309e-05, 'epoch': 0.7} 70%|██████████████████████████████████████████████████████ | 1253/1784 [1:37:38<26:31, 3.00s/it]g-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|██████████████████████████████████████████████████████ | 1253/1784 [1:37:38<26:31, 3.00s/it]g-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:08:55,828 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:08:55,828 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 
03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.292, 'learning_rate': 1.2476635514018692e-05, 'epoch': 0.7} 70%|██████████████████████████████████████████████████████▏ | 1255/1784 [1:37:45<29:11, 3.31s/it]g-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|██████████████████████████████████████████████████████▏ | 1255/1784 [1:37:45<29:11, 3.31s/it]g-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3866, 'learning_rate': 1.2453271028037384e-05, 'epoch': 0.7} 70%|██████████████████████████████████████████████████████▏ | 1256/1784 [1:37:49<29:55, 3.40s/it]g-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|██████████████████████████████████████████████████████▏ | 1256/1784 [1:37:49<29:55, 3.40s/it]g-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0586, 'learning_rate': 1.2429906542056074e-05, 'epoch': 0.7} 70%|██████████████████████████████████████████████████████▎ | 1257/1784 [1:37:53<30:18, 3.45s/it]g-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|██████████████████████████████████████████████████████▎ | 1257/1784 [1:37:53<30:18, 3.45s/it]g-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:09:10,163 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:09:10,163 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9164, 'learning_rate': 1.2383177570093457e-05, 'epoch': 0.71} 71%|██████████████████████████████████████████████████████▎ | 1259/1784 [1:38:00<30:29, 3.48s/it]g-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|██████████████████████████████████████████████████████▎ | 1259/1784 [1:38:00<30:29, 3.48s/it]g-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1645, 'learning_rate': 1.235981308411215e-05, 'epoch': 0.71} 71%|██████████████████████████████████████████████████████▍ | 1260/1784 [1:38:03<30:31, 3.50s/it]g-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|██████████████████████████████████████████████████████▍ | 1260/1784 [1:38:03<30:31, 3.50s/it]g-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:09:20,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:09:20,577 >> Could not estimate the number of tokens 
of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9794, 'learning_rate': 1.2313084112149534e-05, 'epoch': 0.71} 71%|██████████████████████████████████████████████████████▍ | 1262/1784 [1:38:10<30:03, 3.46s/it]g-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|██████████████████████████████████████████████████████▍ | 1262/1784 [1:38:10<30:03, 3.46s/it]g-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2441, 'learning_rate': 1.2289719626168224e-05, 'epoch': 0.71} 71%|██████████████████████████████████████████████████████▌ | 1263/1784 [1:38:13<29:55, 3.45s/it]g-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|██████████████████████████████████████████████████████▌ | 1263/1784 [1:38:13<29:55, 3.45s/it]g-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:09:30,816 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:09:30,816 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1159, 'learning_rate': 1.2242990654205608e-05, 'epoch': 0.71} 
71%|██████████████████████████████████████████████████████▌ | 1265/1784 [1:38:20<29:24, 3.40s/it]g-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|██████████████████████████████████████████████████████▌ | 1265/1784 [1:38:20<29:24, 3.40s/it]g-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.317, 'learning_rate': 1.22196261682243e-05, 'epoch': 0.71} 71%|██████████████████████████████████████████████████████▋ | 1266/1784 [1:38:23<29:20, 3.40s/it]g-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|██████████████████████████████████████████████████████▋ | 1266/1784 [1:38:23<29:20, 3.40s/it]g-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:09:40,937 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:09:40,937 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0619, 'learning_rate': 1.2172897196261683e-05, 'epoch': 0.71} 71%|██████████████████████████████████████████████████████▋ | 1268/1784 [1:38:30<29:10, 3.39s/it]g-point operations will not be computed-01 03:08:21,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
{'loss': 4.0484, 'learning_rate': 1.2149532710280374e-05, 'epoch': 0.71} 71%|██████████████████████████████████████████████████████▊ | 1269/1784 [1:38:34<29:00, 3.38s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:09:50,992 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0288, 'learning_rate': 1.2102803738317758e-05, 'epoch': 0.71} 71%|██████████████████████████████████████████████████████▊ | 1271/1784 [1:38:40<28:33, 3.34s/it] {'loss': 
4.0943, 'learning_rate': 1.207943925233645e-05, 'epoch': 0.71} 71%|██████████████████████████████████████████████████████▉ | 1272/1784 [1:38:43<28:14, 3.31s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:09:59,160 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|██████████████████████████████████████████████████████▉ | 1273/1784 [1:38:47<27:49, 3.27s/it] {'loss': 4.3845, 'learning_rate': 1.2032710280373833e-05, 'epoch': 0.71} 71%|██████████████████████████████████████████████████████▉ | 1274/1784 [1:38:50<27:37, 3.25s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:10:05,561 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|███████████████████████████████████████████████████████ | 
1275/1784 [1:38:53<27:34, 3.25s/it] {'loss': 4.1361, 'learning_rate': 1.1985981308411216e-05, 'epoch': 0.71} 72%|███████████████████████████████████████████████████████ | 1276/1784 [1:38:56<27:32, 3.25s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:10:13,607 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2566, 'learning_rate': 1.19392523364486e-05, 'epoch': 0.72} 72%|███████████████████████████████████████████████████████▏ | 1278/1784 [1:39:03<27:06, 3.21s/it] [WARNING|modeling_utils.py:388] 2022-03-01 
03:10:19,882 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3413, 'learning_rate': 1.1892523364485981e-05, 'epoch': 0.72} 72%|███████████████████████████████████████████████████████▏ | 1280/1784 [1:39:09<26:36, 3.17s/it] 72%|███████████████████████████████████████████████████████▎ | 1281/1784 [1:39:12<26:18, 
3.14s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:10:27,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|███████████████████████████████████████████████████████▎ | 1282/1784 [1:39:15<26:02, 3.11s/it] 72%|███████████████████████████████████████████████████████▍ | 1283/1784 [1:39:18<25:48, 3.09s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:10:33,696 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|███████████████████████████████████████████████████████▍ | 1284/1784 [1:39:21<25:21, 3.04s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:10:38,014 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1345, 'learning_rate': 1.1752336448598132e-05, 'epoch': 0.72} 72%|███████████████████████████████████████████████████████▌ | 1286/1784 [1:39:27<24:41, 2.97s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:10:42,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|███████████████████████████████████████████████████████▌ | 1287/1784 [1:39:30<24:20, 2.94s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:10:46,559 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1205, 
'learning_rate': 1.1682242990654205e-05, 'epoch': 0.72} 72%|███████████████████████████████████████████████████████▋ | 1289/1784 [1:39:35<23:20, 2.83s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:10:50,642 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|███████████████████████████████████████████████████████▋ | 1290/1784 [1:39:38<22:54, 2.78s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:10:54,452 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 
03:10:56,841 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:10:58,990 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:11:01,039 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:11:02,972 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:11:04,733 >> Could not estimate the 
number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:11:06,330 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0327, 'learning_rate': 1.147196261682243e-05, 'epoch': 0.73} [WARNING|modeling_utils.py:388] 2022-03-01 03:11:09,074 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 
03:11:10,836 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9934, 'learning_rate': 1.1401869158878504e-05, 'epoch': 0.73} [WARNING|modeling_utils.py:388] 2022-03-01 03:11:14,705 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2126, 'learning_rate': 1.1378504672897197e-05, 'epoch': 0.73} 73%|████████████████████████████████████████████████████████▏ | 1302/1784 [1:40:04<22:06, 2.75s/it] {'loss': 4.3675, 'learning_rate': 1.1355140186915887e-05, 'epoch': 0.73} 73%|████████████████████████████████████████████████████████▏ | 1303/1784 [1:40:08<24:19, 3.04s/it]
73%|████████████████████████████████████████████████████████▎ | 1304/1784 [1:40:12<25:47, 3.22s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:11:29,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0115, 'learning_rate': 1.1285046728971964e-05, 'epoch': 0.73} 73%|████████████████████████████████████████████████████████▎ | 1306/1784 [1:40:19<27:05, 3.40s/it] {'loss': 4.1536, 
'learning_rate': 1.1261682242990654e-05, 'epoch': 0.73} 73%|████████████████████████████████████████████████████████▍ | 1307/1784 [1:40:22<27:18, 3.43s/it] 73%|████████████████████████████████████████████████████████▍ | 1308/1784 [1:40:26<27:25, 3.46s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:11:43,498 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1187, 'learning_rate': 1.1191588785046729e-05, 'epoch': 0.73}
73%|████████████████████████████████████████████████████████▌ | 1310/1784 [1:40:33<27:30, 3.48s/it] 73%|████████████████████████████████████████████████████████▌ | 1311/1784 [1:40:36<27:14, 3.46s/it] {'loss': 4.2141, 'learning_rate': 1.1121495327102804e-05, 'epoch': 0.74} [WARNING|modeling_utils.py:388] 2022-03-01 
03:11:53,764 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|████████████████████████████████████████████████████████▋ | 1313/1784 [1:40:43<26:50, 3.42s/it] 74%|████████████████████████████████████████████████████████▋ | 1314/1784 [1:40:46<26:35, 3.40s/it] {'loss': 4.3324, 'learning_rate': 1.105140186915888e-05, 'epoch': 0.74} [WARNING|modeling_utils.py:388] 2022-03-01 03:12:03,879 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed 74%|████████████████████████████████████████████████████████▊ | 1316/1784 [1:40:53<26:20, 3.38s/it] 74%|████████████████████████████████████████████████████████▊ | 1317/1784 [1:40:56<26:17, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:12:12,301 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|████████████████████████████████████████████████████████▉ | 1318/1784 [1:41:00<26:08, 3.37s/it] {'loss': 4.567, 'learning_rate': 1.0981308411214953e-05, 'epoch': 0.74}
74%|████████████████████████████████████████████████████████▉ | 1319/1784 [1:41:03<25:49, 3.33s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:12:20,465 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.8993, 'learning_rate': 1.0934579439252336e-05, 'epoch': 0.74} 74%|█████████████████████████████████████████████████████████ | 1321/1784 [1:41:10<25:26, 3.30s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:12:26,977 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.09, 'learning_rate': 1.088785046728972e-05, 'epoch': 0.74} 74%|█████████████████████████████████████████████████████████ | 1323/1784 [1:41:16<25:03, 3.26s/it] {'loss': 4.0313, 'learning_rate': 1.0864485981308411e-05, 'epoch': 0.74} 74%|█████████████████████████████████████████████████████████▏ | 1324/1784 [1:41:19<24:38, 3.21s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:12:34,898 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|█████████████████████████████████████████████████████████▏ | 1325/1784 [1:41:22<24:28, 3.20s/it] {'loss': 4.1181, 'learning_rate': 1.0817757009345795e-05, 'epoch': 0.74}
74%|█████████████████████████████████████████████████████████▏ | 1326/1784 [1:41:25<24:12, 3.17s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:12:42,683 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.8988, 'learning_rate': 1.0771028037383178e-05, 'epoch': 0.74} 74%|█████████████████████████████████████████████████████████▎ | 1328/1784 [1:41:32<24:00, 3.16s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:12:48,914 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2302, 'learning_rate': 1.0724299065420561e-05, 'epoch': 0.74} 75%|█████████████████████████████████████████████████████████▍ | 1330/1784 [1:41:38<23:32, 3.11s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:12:55,005 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9233, 'learning_rate': 1.0677570093457945e-05, 'epoch': 0.75} 75%|█████████████████████████████████████████████████████████▍ | 1332/1784 [1:41:44<23:06, 3.07s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:13:01,034 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed-01 03:12:34,898 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:13:01,034 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:12:34,898 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1301, 'learning_rate': 1.0630841121495328e-05, 'epoch': 0.75} [WARNING|modeling_utils.py:388] 2022-03-01 03:13:01,034 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:12:34,898 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|█████████████████████████████████████████████████████████▌ | 1334/1784 [1:41:50<22:32, 3.01s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:13:05,441 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|█████████████████████████████████████████████████████████▌ | 1335/1784 [1:41:53<22:11, 2.97s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:13:05,441 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|█████████████████████████████████████████████████████████▌ | 1335/1784 [1:41:53<22:11, 2.97s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:13:05,441 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:13:09,641 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:13:05,441 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:13:09,641 >> Could not estimate the number of tokens of the input, floating-point operations 
will not be computed-01 03:13:05,441 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0593, 'learning_rate': 1.0560747663551402e-05, 'epoch': 0.75} [WARNING|modeling_utils.py:388] 2022-03-01 03:13:09,641 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:13:05,441 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|█████████████████████████████████████████████████████████▋ | 1337/1784 [1:41:58<21:31, 2.89s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:13:13,848 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|█████████████████████████████████████████████████████████▊ | 1338/1784 [1:42:01<21:04, 2.83s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:13:13,848 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|█████████████████████████████████████████████████████████▊ | 1338/1784 [1:42:01<21:04, 2.83s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:13:13,848 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:13:17,767 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:13:13,848 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:13:17,767 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:13:13,848 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4362, 'learning_rate': 1.0490654205607477e-05, 'epoch': 0.75} [WARNING|modeling_utils.py:388] 2022-03-01 03:13:17,767 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed-01 03:13:13,848 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|█████████████████████████████████████████████████████████▊ | 1340/1784 [1:42:06<20:03, 2.71s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:13:21,630 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|█████████████████████████████████████████████████████████▊ | 1340/1784 [1:42:06<20:03, 2.71s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:13:21,630 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|█████████████████████████████████████████████████████████▉ | 1341/1784 [1:42:09<19:29, 2.64s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:13:24,039 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|█████████████████████████████████████████████████████████▉ | 1341/1784 [1:42:09<19:29, 2.64s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:13:24,039 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|█████████████████████████████████████████████████████████▉ | 1342/1784 [1:42:11<18:42, 2.54s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:13:24,039 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|█████████████████████████████████████████████████████████▉ | 1342/1784 [1:42:11<18:42, 2.54s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:13:24,039 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:13:27,222 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:13:24,039 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:13:27,222 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:13:29,194 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:13:31,033 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:13:34,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:13:35,577 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1296, 'learning_rate': 1.02803738317757e-05, 'epoch': 0.76}
[WARNING|modeling_utils.py:388] 2022-03-01 03:13:36,798 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:13:38,496 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:13:42,515 >> Could not estimate the number of tokens of the input, floating-point
operations will not be computed
{'loss': 4.3112, 'learning_rate': 1.0210280373831776e-05, 'epoch': 0.76}
[WARNING|modeling_utils.py:388] 2022-03-01 03:13:46,218 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2557, 'learning_rate': 1.0186915887850467e-05, 'epoch': 0.76}
76%|██████████████████████████████████████████████████████████▍ | 1353/1784 [1:42:36<21:36, 3.01s/it]
{'loss': 4.2107, 'learning_rate': 1.0163551401869159e-05, 'epoch': 0.76}
76%|██████████████████████████████████████████████████████████▍ | 1354/1784 [1:42:39<22:49, 3.18s/it]
{'loss': 4.2676, 'learning_rate': 1.014018691588785e-05, 'epoch': 0.76}
76%|██████████████████████████████████████████████████████████▍ | 
1354/1784 [1:42:39<22:49, 3.18s/it]g-point operations will not be computed-01 03:13:24,039 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|██████████████████████████████████████████████████████████▍ | 1355/1784 [1:42:43<23:35, 3.30s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:13:58,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|██████████████████████████████████████████████████████████▌ | 1356/1784 [1:42:47<24:12, 3.39s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:13:58,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|██████████████████████████████████████████████████████████▌ | 1356/1784 [1:42:47<24:12, 3.39s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:13:58,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4405, 'learning_rate': 1.0093457943925234e-05, 'epoch': 0.76} 76%|██████████████████████████████████████████████████████████▌ | 1357/1784 [1:42:50<24:23, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:13:58,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|██████████████████████████████████████████████████████████▌ | 1357/1784 [1:42:50<24:23, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:13:58,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9069, 'learning_rate': 1.0070093457943926e-05, 'epoch': 0.76} 76%|██████████████████████████████████████████████████████████▌ | 1358/1784 [1:42:54<24:38, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:13:58,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|██████████████████████████████████████████████████████████▌ | 1358/1784 [1:42:54<24:38, 
3.47s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:13:58,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:14:11,248 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:13:58,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:14:11,248 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:13:58,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.364, 'learning_rate': 1.0023364485981309e-05, 'epoch': 0.76} 76%|██████████████████████████████████████████████████████████▋ | 1360/1784 [1:43:01<24:38, 3.49s/it]g-point operations will not be computed-01 03:13:58,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|██████████████████████████████████████████████████████████▋ | 1360/1784 [1:43:01<24:38, 3.49s/it]g-point operations will not be computed-01 03:13:58,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6242, 'learning_rate': 9.999999999999999e-06, 'epoch': 0.76} 76%|██████████████████████████████████████████████████████████▋ | 1361/1784 [1:43:04<24:30, 3.48s/it]g-point operations will not be computed-01 03:13:58,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|██████████████████████████████████████████████████████████▋ | 1361/1784 [1:43:04<24:30, 3.48s/it]g-point operations will not be computed-01 03:13:58,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.115, 'learning_rate': 9.976635514018693e-06, 'epoch': 0.76} 
76%|██████████████████████████████████████████████████████████▋ | 1361/1784 [1:43:04<24:30, 3.48s/it]g-point operations will not be computed-01 03:13:58,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|██████████████████████████████████████████████████████████▊ | 1362/1784 [1:43:08<24:28, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:14:23,431 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|██████████████████████████████████████████████████████████▊ | 1363/1784 [1:43:11<24:20, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:14:23,431 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|██████████████████████████████████████████████████████████▊ | 1363/1784 [1:43:11<24:20, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:14:23,431 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0141, 'learning_rate': 9.929906542056076e-06, 'epoch': 0.76} 76%|██████████████████████████████████████████████████████████▊ | 1364/1784 [1:43:14<24:13, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:14:23,431 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|██████████████████████████████████████████████████████████▊ | 1364/1784 [1:43:14<24:13, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:14:23,431 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4776, 'learning_rate': 9.906542056074766e-06, 'epoch': 0.76} 76%|██████████████████████████████████████████████████████████▊ | 1364/1784 [1:43:14<24:13, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:14:23,431 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 
77%|██████████████████████████████████████████████████████████▉ | 1365/1784 [1:43:18<24:02, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:14:33,679 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|██████████████████████████████████████████████████████████▉ | 1366/1784 [1:43:21<23:48, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:14:33,679 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|██████████████████████████████████████████████████████████▉ | 1366/1784 [1:43:21<23:48, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:14:33,679 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.8906, 'learning_rate': 9.85981308411215e-06, 'epoch': 0.77} 77%|███████████████████████████████████████████████████████████ | 1367/1784 [1:43:25<23:37, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:14:33,679 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|███████████████████████████████████████████████████████████ | 1367/1784 [1:43:25<23:37, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:14:33,679 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:14:42,035 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:14:33,679 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:14:42,035 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:14:33,679 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9848, 'learning_rate': 9.813084112149533e-06, 
'epoch': 0.77} 77%|███████████████████████████████████████████████████████████ | 1369/1784 [1:43:31<23:21, 3.38s/it]g-point operations will not be computed-01 03:14:33,679 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|███████████████████████████████████████████████████████████ | 1369/1784 [1:43:31<23:21, 3.38s/it]g-point operations will not be computed-01 03:14:33,679 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3488, 'learning_rate': 9.789719626168224e-06, 'epoch': 0.77} 77%|███████████████████████████████████████████████████████████▏ | 1370/1784 [1:43:35<23:07, 3.35s/it]g-point operations will not be computed-01 03:14:33,679 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|███████████████████████████████████████████████████████████▏ | 1370/1784 [1:43:35<23:07, 3.35s/it]g-point operations will not be computed-01 03:14:33,679 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:14:51,992 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:14:33,679 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:14:51,992 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:14:33,679 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.28, 'learning_rate': 9.742990654205608e-06, 'epoch': 0.77} 77%|███████████████████████████████████████████████████████████▏ | 1372/1784 [1:43:41<22:55, 3.34s/it]g-point operations will not be computed-01 03:14:33,679 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed 77%|███████████████████████████████████████████████████████████▏ | 1372/1784 [1:43:41<22:55, 3.34s/it]g-point operations will not be computed-01 03:14:33,679 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9477, 'learning_rate': 9.7196261682243e-06, 'epoch': 0.77} 77%|███████████████████████████████████████████████████████████▏ | 1372/1784 [1:43:41<22:55, 3.34s/it]g-point operations will not be computed-01 03:14:33,679 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|███████████████████████████████████████████████████████████▎ | 1373/1784 [1:43:44<22:40, 3.31s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:15:00,247 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|███████████████████████████████████████████████████████████▎ | 1374/1784 [1:43:48<22:32, 3.30s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:15:00,247 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|███████████████████████████████████████████████████████████▎ | 1374/1784 [1:43:48<22:32, 3.30s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:15:00,247 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2706, 'learning_rate': 9.672897196261681e-06, 'epoch': 0.77} 77%|███████████████████████████████████████████████████████████▎ | 1375/1784 [1:43:51<22:26, 3.29s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:15:00,247 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|███████████████████████████████████████████████████████████▎ | 1375/1784 [1:43:51<22:26, 3.29s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:15:00,247 >> Could not estimate the number of tokens of the input, floating-point operations will not be 
computed [WARNING|modeling_utils.py:388] 2022-03-01 03:15:08,356 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:15:00,247 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:15:08,356 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:15:00,247 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.8925, 'learning_rate': 9.626168224299065e-06, 'epoch': 0.77} 77%|███████████████████████████████████████████████████████████▍ | 1377/1784 [1:43:58<22:12, 3.27s/it]g-point operations will not be computed-01 03:15:00,247 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|███████████████████████████████████████████████████████████▍ | 1377/1784 [1:43:58<22:12, 3.27s/it]g-point operations will not be computed-01 03:15:00,247 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:15:14,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:15:00,247 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:15:14,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:15:00,247 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1056, 'learning_rate': 9.579439252336448e-06, 'epoch': 0.77} 77%|███████████████████████████████████████████████████████████▌ | 1379/1784 [1:44:04<21:41, 3.21s/it]g-point operations will not be computed-01 03:15:00,247 >> Could not estimate the number of tokens 
of the input, floating-point operations will not be computed
77%|███████████████████████████████████████████████████████████▌ | 1379/1784 [1:44:04<21:41, 3.21s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:15:21,085 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.0996, 'learning_rate': 9.532710280373831e-06, 'epoch': 0.77}
77%|███████████████████████████████████████████████████████████▌ | 1381/1784 [1:44:10<21:17, 3.17s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:15:27,314 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.3562, 'learning_rate': 9.485981308411215e-06, 'epoch': 0.77}
78%|███████████████████████████████████████████████████████████▋ | 1383/1784 [1:44:16<20:53, 3.12s/it]
{'loss': 4.0043, 'learning_rate': 9.462616822429907e-06, 'epoch': 0.78}
78%|███████████████████████████████████████████████████████████▋ | 1384/1784 [1:44:19<20:29, 3.07s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:15:34,864 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
78%|███████████████████████████████████████████████████████████▊ | 1385/1784 [1:44:22<20:08, 3.03s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:15:39,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2274, 'learning_rate': 9.392523364485982e-06, 'epoch': 0.78}
78%|███████████████████████████████████████████████████████████▊ | 1387/1784 [1:44:28<19:22, 2.93s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:15:43,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
78%|███████████████████████████████████████████████████████████▉ | 1388/1784 [1:44:31<19:09, 2.90s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:15:47,497 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1399, 'learning_rate': 9.322429906542057e-06, 'epoch': 0.78}
78%|███████████████████████████████████████████████████████████▉ | 1390/1784 [1:44:36<18:12, 2.77s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:15:51,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
78%|████████████████████████████████████████████████████████████ | 1391/1784 [1:44:39<17:43, 2.71s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:15:55,107 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:15:57,340 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:15:57,340 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:15:51,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:15:59,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:15:51,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:15:59,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:15:51,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:16:01,282 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:15:51,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:16:01,282 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:15:51,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:16:02,983 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:15:51,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:16:02,983 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:15:51,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 
2022-03-01 03:16:04,546 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:15:51,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:16:04,546 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:15:51,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.34, 'learning_rate': 9.135514018691589e-06, 'epoch': 0.78} [WARNING|modeling_utils.py:388] 2022-03-01 03:16:07,152 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:15:51,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:16:07,152 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:15:51,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:16:08,781 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:15:51,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:16:08,781 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:15:51,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4264, 'learning_rate': 9.065420560747664e-06, 'epoch': 0.78} [WARNING|modeling_utils.py:388] 2022-03-01 03:16:12,641 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:15:51,426 >> Could not estimate the number 
of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:16:12,641 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:15:51,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9775, 'learning_rate': 9.042056074766356e-06, 'epoch': 0.79} 79%|████████████████████████████████████████████████████████████▌ | 1402/1784 [1:45:02<17:11, 2.70s/it]g-point operations will not be computed-01 03:15:51,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|████████████████████████████████████████████████████████████▌ | 1402/1784 [1:45:02<17:11, 2.70s/it]g-point operations will not be computed-01 03:15:51,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:16:19,956 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:15:51,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:16:19,956 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:15:51,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0301, 'learning_rate': 8.995327102803739e-06, 'epoch': 0.79} 79%|████████████████████████████████████████████████████████████▌ | 1404/1784 [1:45:09<20:05, 3.17s/it]g-point operations will not be computed-01 03:15:51,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|████████████████████████████████████████████████████████████▌ | 1404/1784 [1:45:09<20:05, 3.17s/it]g-point operations will not be computed-01 
03:15:51,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9575, 'learning_rate': 8.971962616822429e-06, 'epoch': 0.79} 79%|████████████████████████████████████████████████████████████▋ | 1405/1784 [1:45:13<20:46, 3.29s/it]g-point operations will not be computed-01 03:15:51,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|████████████████████████████████████████████████████████████▋ | 1405/1784 [1:45:13<20:46, 3.29s/it]g-point operations will not be computed-01 03:15:51,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9296, 'learning_rate': 8.948598130841122e-06, 'epoch': 0.79} 79%|████████████████████████████████████████████████████████████▋ | 1406/1784 [1:45:17<21:07, 3.35s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:16:32,424 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|████████████████████████████████████████████████████████████▋ | 1406/1784 [1:45:17<21:07, 3.35s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:16:32,424 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4694, 'learning_rate': 8.925233644859812e-06, 'epoch': 0.79} 79%|████████████████████████████████████████████████████████████▋ | 1407/1784 [1:45:20<21:25, 3.41s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|████████████████████████████████████████████████████████████▊ | 1408/1784 [1:45:24<21:41, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|████████████████████████████████████████████████████████████▊ | 1408/1784 
[1:45:24<21:41, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9505, 'learning_rate': 8.878504672897196e-06, 'epoch': 0.79} 79%|████████████████████████████████████████████████████████████▊ | 1409/1784 [1:45:27<21:44, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|████████████████████████████████████████████████████████████▊ | 1409/1784 [1:45:27<21:44, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0672, 'learning_rate': 8.855140186915887e-06, 'epoch': 0.79} 79%|████████████████████████████████████████████████████████████▊ | 1410/1784 [1:45:31<21:38, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|████████████████████████████████████████████████████████████▊ | 1410/1784 [1:45:31<21:38, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:16:48,146 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:16:48,146 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9355, 'learning_rate': 8.80841121495327e-06, 
'epoch': 0.79} 79%|████████████████████████████████████████████████████████████▉ | 1412/1784 [1:45:37<21:23, 3.45s/it]g-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|████████████████████████████████████████████████████████████▉ | 1412/1784 [1:45:37<21:23, 3.45s/it]g-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9802, 'learning_rate': 8.785046728971963e-06, 'epoch': 0.79} 79%|████████████████████████████████████████████████████████████▉ | 1413/1784 [1:45:41<21:19, 3.45s/it]g-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|████████████████████████████████████████████████████████████▉ | 1413/1784 [1:45:41<21:19, 3.45s/it]g-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:16:58,480 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:16:58,480 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2815, 'learning_rate': 8.738317757009346e-06, 'epoch': 0.79} 79%|█████████████████████████████████████████████████████████████ | 1415/1784 [1:45:48<21:04, 3.43s/it]g-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed 79%|█████████████████████████████████████████████████████████████ | 1415/1784 [1:45:48<21:04, 3.43s/it]g-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2071, 'learning_rate': 8.714953271028038e-06, 'epoch': 0.79} 79%|█████████████████████████████████████████████████████████████ | 1415/1784 [1:45:48<21:04, 3.43s/it]g-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|█████████████████████████████████████████████████████████████ | 1416/1784 [1:45:51<20:51, 3.40s/it]g-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:17:08,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:17:08,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1243, 'learning_rate': 8.668224299065421e-06, 'epoch': 0.79} 79%|█████████████████████████████████████████████████████████████▏ | 1418/1784 [1:45:58<20:40, 3.39s/it]g-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|█████████████████████████████████████████████████████████████▏ | 1418/1784 [1:45:58<20:40, 3.39s/it]g-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of 
tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:17:15,314 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:17:15,314 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:17:15,314 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|█████████████████████████████████████████████████████████████▎ | 1420/1784 [1:46:05<20:23, 3.36s/it]g-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|█████████████████████████████████████████████████████████████▎ | 1420/1784 [1:46:05<20:23, 3.36s/it]g-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9652, 'learning_rate': 8.598130841121494e-06, 'epoch': 0.8} 80%|█████████████████████████████████████████████████████████████▎ | 1421/1784 [1:46:08<20:18, 3.36s/it]g-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|█████████████████████████████████████████████████████████████▎ | 1421/1784 [1:46:08<20:18, 3.36s/it]g-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of tokens of the input, 
floating-point operations will not be computed {'loss': 4.1442, 'learning_rate': 8.574766355140188e-06, 'epoch': 0.8} 80%|█████████████████████████████████████████████████████████████▎ | 1421/1784 [1:46:08<20:18, 3.36s/it]g-point operations will not be computed-01 03:16:36,021 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|█████████████████████████████████████████████████████████████▍ | 1422/1784 [1:46:11<20:02, 3.32s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:17:26,924 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|█████████████████████████████████████████████████████████████▍ | 1423/1784 [1:46:14<19:52, 3.30s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:17:26,924 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|█████████████████████████████████████████████████████████████▍ | 1423/1784 [1:46:14<19:52, 3.30s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:17:26,924 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:17:31,680 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:17:26,924 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:17:31,680 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:17:26,924 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0383, 'learning_rate': 8.504672897196261e-06, 'epoch': 0.8} 80%|█████████████████████████████████████████████████████████████▌ | 1425/1784 [1:46:21<19:22, 3.24s/it]g-point operations will not be computed-01 03:17:26,924 >> Could not estimate the number 
of tokens of the input, floating-point operations will not be computed 80%|█████████████████████████████████████████████████████████████▌ | 1425/1784 [1:46:21<19:22, 3.24s/it]g-point operations will not be computed-01 03:17:26,924 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2949, 'learning_rate': 8.481308411214953e-06, 'epoch': 0.8} 80%|█████████████████████████████████████████████████████████████▌ | 1426/1784 [1:46:24<19:05, 3.20s/it]g-point operations will not be computed-01 03:17:26,924 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|█████████████████████████████████████████████████████████████▌ | 1426/1784 [1:46:24<19:05, 3.20s/it]g-point operations will not be computed-01 03:17:26,924 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:17:41,054 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:17:26,924 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:17:41,054 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:17:26,924 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0419, 'learning_rate': 8.434579439252336e-06, 'epoch': 0.8} 80%|█████████████████████████████████████████████████████████████▋ | 1428/1784 [1:46:30<18:40, 3.15s/it]g-point operations will not be computed-01 03:17:26,924 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|█████████████████████████████████████████████████████████████▋ | 1428/1784 [1:46:30<18:40, 3.15s/it]g-point operations will not be computed-01 03:17:26,924 >> Could 
not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:17:47,250 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:17:26,924 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:17:47,250 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:17:26,924 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2055, 'learning_rate': 8.38785046728972e-06, 'epoch': 0.8} [WARNING|modeling_utils.py:388] 2022-03-01 03:17:47,250 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:17:26,924 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|█████████████████████████████████████████████████████████████▋ | 1430/1784 [1:46:36<18:28, 3.13s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:17:51,962 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|█████████████████████████████████████████████████████████████▋ | 1430/1784 [1:46:36<18:28, 3.13s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:17:51,962 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|█████████████████████████████████████████████████████████████▊ | 1431/1784 [1:46:39<18:19, 3.11s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:17:51,962 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|█████████████████████████████████████████████████████████████▊ | 1431/1784 [1:46:39<18:19, 3.11s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:17:51,962 >> Could not estimate the number of tokens 
of the input, floating-point operations will not be computed 80%|█████████████████████████████████████████████████████████████▊ | 1431/1784 [1:46:39<18:19, 3.11s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:17:51,962 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|█████████████████████████████████████████████████████████████▊ | 1432/1784 [1:46:42<18:06, 3.09s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:17:58,005 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|█████████████████████████████████████████████████████████████▊ | 1432/1784 [1:46:42<18:06, 3.09s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:17:58,005 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|█████████████████████████████████████████████████████████████▊ | 1433/1784 [1:46:45<17:46, 3.04s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:17:58,005 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:18:02,316 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:17:58,005 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:18:02,316 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:17:58,005 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0624, 'learning_rate': 8.271028037383177e-06, 'epoch': 0.8} [WARNING|modeling_utils.py:388] 2022-03-01 03:18:02,316 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:17:58,005 >> Could not estimate the number of tokens of the input, floating-point 
operations will not be computed 80%|█████████████████████████████████████████████████████████████▉ | 1435/1784 [1:46:51<17:13, 2.96s/it]g-point operations will not be computed-01 03:17:58,005 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|█████████████████████████████████████████████████████████████▉ | 1435/1784 [1:46:51<17:13, 2.96s/it]g-point operations will not be computed-01 03:17:58,005 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:18:07,970 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:17:58,005 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:18:10,686 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:17:58,005 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:18:10,686 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:17:58,005 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3315, 'learning_rate': 8.200934579439253e-06, 'epoch': 0.81} [WARNING|modeling_utils.py:388] 2022-03-01 03:18:10,686 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:17:58,005 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|██████████████████████████████████████████████████████████████ | 1438/1784 [1:46:59<16:09, 2.80s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:18:14,741 >> Could not estimate the number of tokens of the input, floating-point operations will 
not be computed 81%|██████████████████████████████████████████████████████████████ | 1438/1784 [1:46:59<16:09, 2.80s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:18:14,741 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|██████████████████████████████████████████████████████████████ | 1439/1784 [1:47:02<15:45, 2.74s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:18:18,431 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:18:20,812 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3416, 'learning_rate': 8.107476635514018e-06, 'epoch': 0.81} 81%|██████████████████████████████████████████████████████████████▏ | 1442/1784 [1:47:09<14:16, 2.50s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:18:24,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|██████████████████████████████████████████████████████████████▎ | 1443/1784 [1:47:11<13:40, 2.41s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:18:26,432 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|██████████████████████████████████████████████████████████████▎ | 1444/1784 [1:47:13<13:04, 2.31s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:18:28,443 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|██████████████████████████████████████████████████████████████▎ | 1445/1784 [1:47:15<12:22, 2.19s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:18:30,273 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|██████████████████████████████████████████████████████████████▍ | 1447/1784 [1:47:19<10:45, 1.92s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:18:31,933 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2311, 'learning_rate': 7.967289719626169e-06, 'epoch': 0.81} 81%|██████████████████████████████████████████████████████████████▍ | 1448/1784 [1:47:20<09:47, 1.75s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:18:34,676 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|██████████████████████████████████████████████████████████████▌ | 1450/1784 [1:47:23<08:56, 1.61s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:18:35,843 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:18:38,928 >>
Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|██████████████████████████████████████████████████████████████▋ | 1451/1784 [1:47:27<12:38, 2.28s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:18:42,628 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|██████████████████████████████████████████████████████████████▋ | 1452/1784 [1:47:30<14:58, 2.71s/it] 81%|██████████████████████████████████████████████████████████████▋ | 1453/1784 [1:47:34<16:31, 3.00s/it] 82%|██████████████████████████████████████████████████████████████▊ | 1454/1784 [1:47:38<17:29, 3.18s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:18:55,325 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0491, 'learning_rate': 7.7803738317757e-06, 'epoch': 0.82} 82%|██████████████████████████████████████████████████████████████▊ | 1456/1784 [1:47:45<18:38, 3.41s/it]
{'loss': 4.0949, 'learning_rate': 7.757009345794392e-06, 'epoch': 0.82} 82%|██████████████████████████████████████████████████████████████▉ | 1457/1784 [1:47:48<18:48, 3.45s/it] 82%|██████████████████████████████████████████████████████████████▉ | 1458/1784 [1:47:52<18:56, 3.49s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:19:09,640 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9979, 'learning_rate': 7.686915887850467e-06, 'epoch': 0.82} 82%|███████████████████████████████████████████████████████████████ | 1460/1784 [1:47:59<19:04, 3.53s/it] 82%|███████████████████████████████████████████████████████████████ | 1461/1784 [1:48:03<18:59,
3.53s/it] 82%|███████████████████████████████████████████████████████████████ | 1462/1784 [1:48:06<18:52, 3.52s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:19:21,995 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|███████████████████████████████████████████████████████████████▏ | 1463/1784 [1:48:10<18:39, 3.49s/it] {'loss': 4.154, 'learning_rate': 7.593457943925234e-06, 'epoch': 0.82} 82%|███████████████████████████████████████████████████████████████▏ | 1464/1784 [1:48:13<18:28, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:19:30,450 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3885, 'learning_rate': 7.546728971962617e-06, 'epoch': 0.82} 82%|███████████████████████████████████████████████████████████████▎ | 1466/1784 [1:48:20<18:06, 3.42s/it] {'loss': 3.8577, 'learning_rate': 7.523364485981308e-06, 'epoch': 0.82} 82%|███████████████████████████████████████████████████████████████▎ | 1467/1784 [1:48:23<18:02, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:19:40,596 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1424, 'learning_rate': 7.476635514018692e-06, 'epoch': 0.82}
82%|███████████████████████████████████████████████████████████████▍ | 1469/1784 [1:48:30<17:49, 3.40s/it] 82%|███████████████████████████████████████████████████████████████▍ | 1470/1784 [1:48:33<17:40, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:19:50,597 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.113, 'learning_rate': 7.406542056074766e-06, 'epoch': 0.82}
83%|███████████████████████████████████████████████████████████████▌ | 1472/1784 [1:48:40<17:20, 3.34s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:19:57,157 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0513, 'learning_rate': 7.3598130841121496e-06, 'epoch': 0.83} 83%|███████████████████████████████████████████████████████████████▌ | 1474/1784 [1:48:46<17:07, 3.32s/it] {'loss': 3.9866, 'learning_rate': 7.336448598130841e-06, 'epoch': 0.83} 83%|███████████████████████████████████████████████████████████████▋ | 1475/1784 [1:48:50<16:56, 3.29s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:20:05,410 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 83%|███████████████████████████████████████████████████████████████▋ | 1476/1784 [1:48:53<16:53, 3.29s/it] {'loss': 4.1526, 'learning_rate': 7.289719626168225e-06, 'epoch': 0.83} 83%|███████████████████████████████████████████████████████████████▋ | 1477/1784 [1:48:56<16:36, 3.25s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:20:13,310 >> Could not estimate the number of tokens of the input, floating-point operations will
not be computed {'loss': 3.8465, 'learning_rate': 7.242990654205607e-06, 'epoch': 0.83} 83%|███████████████████████████████████████████████████████████████▊ | 1479/1784 [1:49:02<16:11, 3.18s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:20:19,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9722, 'learning_rate': 7.196261682242991e-06, 'epoch': 0.83} 83%|███████████████████████████████████████████████████████████████▉ | 1481/1784 [1:49:09<15:50, 3.14s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:20:25,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1813, 'learning_rate': 7.149532710280374e-06, 'epoch': 0.83} 83%|████████████████████████████████████████████████████████████████ | 1483/1784 [1:49:15<15:28, 3.09s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:20:31,647 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.7479, 'learning_rate': 7.1028037383177574e-06, 'epoch': 0.83} 83%|████████████████████████████████████████████████████████████████ | 1485/1784
[1:49:20<14:55, 3.00s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:20:35,994 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 83%|████████████████████████████████████████████████████████████████▏ | 1486/1784 [1:49:23<14:37, 2.95s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:20:40,114 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2861, 'learning_rate': 7.032710280373832e-06, 'epoch': 0.83} 83%|████████████████████████████████████████████████████████████████▏ | 1488/1784 [1:49:29<14:01, 2.84s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:20:44,203 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 83%|████████████████████████████████████████████████████████████████▎ | 1489/1784 [1:49:31<13:37, 2.77s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:20:47,983 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:20:50,466 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5709, 'learning_rate': 6.9392523364485985e-06, 'epoch': 0.84} 84%|████████████████████████████████████████████████████████████████▍ | 1492/1784
[1:49:39<12:28, 2.56s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:20:54,107 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|████████████████████████████████████████████████████████████████▍ | 1493/1784 [1:49:41<12:06, 2.50s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:20:56,381 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|████████████████████████████████████████████████████████████████▍ | 1494/1784 [1:49:43<11:37, 2.41s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:20:58,462 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|████████████████████████████████████████████████████████████████▌ | 1495/1784 [1:49:45<10:57, 2.28s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:21:00,337 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|████████████████████████████████████████████████████████████████▌ | 1497/1784 [1:49:49<09:18, 1.95s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:21:01,986 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|████████████████████████████████████████████████████████████████▋ | 1498/1784 [1:49:50<08:27, 1.77s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:21:03,436 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5138, 'learning_rate': 6.7757009345794396e-06, 'epoch': 0.84} 84%|████████████████████████████████████████████████████████████████▋ | 1499/1784 [1:49:51<07:39, 1.61s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:21:05,932 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2366] 2022-03-01 03:21:07,172 >> Num examples = 2642 | 1500/1784 [1:49:53<07:47, 1.64s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in
`SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 1%|█ | 4/331 [00:06<09:58, 1.83s/it]
2%|█▎ | 5/331 [00:09<11:35, 2.13s/it] 2%|█▌ | 6/331 [00:12<12:38, 2.33s/it] 2%|█▊ | 7/331 [00:14<12:39, 2.34s/it] 2%|██ | 8/331 [00:17<13:06, 2.43s/it] 3%|██▎ | 9/331 [00:19<13:34, 2.53s/it] 3%|██▍ | 10/331 [00:23<14:30, 2.71s/it]
3%|██▋ | 11/331 [00:25<13:57, 2.62s/it] 4%|██▉ | 12/331 [00:28<13:46, 2.59s/it] 4%|███▏ | 13/331 [00:30<13:39, 2.58s/it] 4%|███▍ | 14/331 [00:33<13:28, 2.55s/it] 5%|███▋ | 15/331 [00:36<14:41, 2.79s/it]
5%|███▉ | 16/331 [00:39<15:32, 2.96s/it] 5%|████▏ | 17/331 [00:42<15:35, 2.98s/it] 5%|████▍ | 18/331 [00:45<14:23, 2.76s/it] 6%|████▋ | 19/331 [00:47<14:08, 2.72s/it] 6%|████▉ | 20/331 [00:49<13:19, 2.57s/it]
6%|█████▏ | 21/331 [00:52<13:47, 2.67s/it] 7%|█████▍ | 22/331 [00:56<14:58, 2.91s/it] 7%|█████▋ | 23/331 [01:00<16:14, 3.16s/it] 7%|█████▉ | 24/331 [01:03<17:03, 3.34s/it] 8%|██████▏ | 25/331 [01:06<16:23, 3.21s/it]
8%|██████▍ | 26/331 [01:09<15:13, 3.00s/it] 8%|██████▋ | 27/331 [01:12<15:21, 3.03s/it] 8%|██████▉ | 28/331 [01:15<14:52, 2.94s/it] 9%|███████▏ | 29/331 [01:17<14:25, 2.87s/it] 9%|███████▍ | 30/331 [01:20<13:48, 2.75s/it]
9%|███████▋ | 31/331 [01:22<13:16, 2.65s/it] 10%|███████▉ | 32/331 [01:25<12:57, 2.60s/it] 10%|████████▏ | 33/331 [01:27<12:56, 2.60s/it] 10%|████████▍ | 34/331 [01:30<12:49, 2.59s/it] 11%|████████▋ | 35/331 [01:33<13:03, 2.65s/it]
11%|████████▉ | 36/331 [01:36<13:45, 2.80s/it] 11%|█████████▏ | 37/331 [01:39<14:35, 2.98s/it] 11%|█████████▍ | 38/331 [01:42<14:51, 3.04s/it] 12%|█████████▋ | 39/331 [01:45<14:46, 3.04s/it] 12%|█████████▉ | 40/331 [01:47<13:29, 2.78s/it]
12%|██████████▏ | 41/331 [01:50<12:49, 2.65s/it] 13%|██████████▍ | 42/331 [01:53<13:42, 2.85s/it] 13%|██████████▋ | 43/331 [01:56<14:14, 2.97s/it] 13%|██████████▉ | 44/331 [02:00<14:40, 3.07s/it] 14%|███████████▏ | 45/331 [02:02<13:48, 2.90s/it]
14%|███████████▍ | 46/331 [02:04<12:44, 2.68s/it] 14%|███████████▋ | 47/331 [02:07<12:08, 2.56s/it] 15%|███████████▉ | 48/331 [02:09<12:22, 2.62s/it] 15%|████████████▏ | 49/331 [02:13<13:04, 2.78s/it] 15%|████████████▍ | 50/331 [02:15<13:00, 2.78s/it]
15%|████████████▋ | 51/331 [02:18<13:26, 2.88s/it] 16%|████████████▉ | 52/331 [02:21<12:56, 2.78s/it] 16%|█████████████▏ | 53/331 [02:24<12:58, 2.80s/it] 16%|█████████████▍ | 54/331 [02:26<12:19, 2.67s/it] 17%|█████████████▋ | 55/331 [02:30<13:14, 2.88s/it]
17%|█████████████▊ | 56/331 [02:32<13:04, 2.85s/it] 17%|██████████████ | 57/331 [02:35<12:36, 2.76s/it] 18%|██████████████▎ | 58/331 [02:38<13:06, 2.88s/it] 18%|██████████████▌ | 59/331 [02:40<12:22, 2.73s/it] 18%|██████████████▊ | 60/331 [02:43<11:59, 2.65s/it]
18%|███████████████ | 61/331 [02:46<12:20, 2.74s/it] 19%|███████████████▎ | 62/331 [02:49<12:22, 2.76s/it] 19%|███████████████▌ | 63/331 [02:52<13:23, 3.00s/it] 19%|███████████████▊ | 64/331 [02:55<12:52, 2.89s/it] 20%|████████████████ | 65/331 [02:58<12:40, 2.86s/it]
20%|████████████████▎ | 66/331 [03:02<13:56, 3.16s/it] 20%|████████████████▌ | 67/331 [03:05<14:39, 3.33s/it] 21%|████████████████▊ | 68/331 [03:09<14:48, 3.38s/it] 21%|█████████████████ | 69/331 [03:12<14:24, 3.30s/it] 21%|█████████████████▎ | 70/331 [03:15<14:08, 3.25s/it]
21%|█████████████████▌ | 71/331 [03:18<14:16, 3.30s/it] 22%|█████████████████▊ | 72/331 [03:22<14:13, 3.30s/it] 22%|██████████████████ | 73/331 [03:25<13:44, 3.19s/it] 22%|██████████████████▎ | 74/331 [03:28<13:24, 3.13s/it] 23%|██████████████████▌ | 75/331 [03:31<13:28, 3.16s/it]
23%|██████████████████▊ | 76/331 [03:34<12:49, 3.02s/it] 23%|███████████████████ | 77/331 [03:36<12:23, 2.93s/it] 24%|███████████████████▎ | 78/331 [03:39<11:54, 2.82s/it] 24%|███████████████████▌ | 79/331 [03:41<11:34, 2.75s/it] 24%|███████████████████▊ | 80/331 [03:44<11:26, 2.74s/it]
24%|████████████████████ | 81/331 [03:47<11:49, 2.84s/it] 25%|████████████████████▎ | 82/331 [03:50<11:35, 2.79s/it] 25%|████████████████████▌ | 83/331 [03:53<12:02, 2.91s/it] 25%|████████████████████▊ | 84/331 [03:57<12:48, 3.11s/it] 26%|█████████████████████ | 85/331 [03:59<11:55, 2.91s/it]
26%|█████████████████████▎ | 86/331 [04:03<12:34, 3.08s/it] 26%|█████████████████████▌ | 87/331 [04:05<12:08, 2.99s/it] 27%|█████████████████████▊ | 88/331 [04:08<11:48, 2.92s/it] 27%|██████████████████████ | 89/331 [04:10<10:56, 2.71s/it] 27%|██████████████████████▎ | 90/331 [04:13<10:22, 2.58s/it]
27%|██████████████████████▌ | 91/331 [04:16<10:48, 2.70s/it] 28%|██████████████████████▊ | 92/331 [04:18<10:02, 2.52s/it] 28%|███████████████████████ | 93/331 [04:20<10:14, 2.58s/it] 28%|███████████████████████▎ | 94/331 [04:23<10:36, 2.68s/it] 29%|███████████████████████▌ | 95/331 [04:26<10:45, 2.73s/it]
29%|███████████████████████▊ | 96/331 [04:29<10:50, 2.77s/it] 29%|████████████████████████ | 97/331 [04:31<10:18, 2.64s/it] 30%|████████████████████████▎ | 98/331 [04:34<10:36, 2.73s/it] 30%|████████████████████████▌ | 99/331 [04:37<10:35, 2.74s/it] 30%|████████████████████████▍ | 100/331 [04:40<10:10, 2.64s/it]
31%|████████████████████████▋ | 101/331 [04:42<10:02, 2.62s/it] 31%|████████████████████████▉ | 102/331 [04:45<10:49, 2.84s/it] 31%|█████████████████████████▏ | 103/331 [04:48<10:16, 2.70s/it] 31%|█████████████████████████▍ | 104/331 [04:50<10:09, 2.68s/it] 32%|█████████████████████████▋ | 105/331 [04:53<10:11, 2.71s/it]
32%|█████████████████████████▉ | 106/331 [04:56<10:15, 2.74s/it] 32%|██████████████████████████▏ | 107/331 [04:58<09:37, 2.58s/it] 33%|██████████████████████████▍ | 108/331 [05:01<09:28, 2.55s/it] 33%|██████████████████████████▋ | 109/331 [05:03<09:28, 2.56s/it] 33%|██████████████████████████▉ | 110/331 [05:06<09:54, 2.69s/it]
34%|███████████████████████████▏ | 111/331 [05:09<09:59, 2.72s/it] 34%|███████████████████████████▍ | 112/331 [05:12<09:58, 2.73s/it] 34%|███████████████████████████▋ | 113/331 [05:14<09:31, 2.62s/it] 34%|███████████████████████████▉ | 114/331 [05:17<09:32, 2.64s/it] 35%|████████████████████████████▏ | 115/331 [05:20<09:33, 2.65s/it]
35%|████████████████████████████▍ | 116/331 [05:22<09:47, 2.73s/it] 35%|████████████████████████████▋ | 117/331 [05:25<09:46, 2.74s/it] 36%|████████████████████████████▉ | 118/331 [05:28<09:26, 2.66s/it] 36%|█████████████████████████████ | 119/331 [05:30<09:25, 2.67s/it] 36%|█████████████████████████████▎ | 120/331 [05:33<09:28, 2.69s/it]
37%|█████████████████████████████▌ | 121/331 [05:36<09:53, 2.83s/it] 37%|█████████████████████████████▊ | 122/331 [05:39<09:38, 2.77s/it] 37%|██████████████████████████████ | 123/331 [05:42<10:18, 2.97s/it] 37%|██████████████████████████████▎ | 124/331 [05:45<10:14, 2.97s/it] 38%|██████████████████████████████▌ | 125/331 [05:49<10:41, 3.11s/it]
38%|██████████████████████████████▊ | 126/331 [05:52<10:43, 3.14s/it] 38%|███████████████████████████████ | 127/331 [05:56<11:04, 3.26s/it] 39%|███████████████████████████████▎ | 128/331 [05:59<11:02, 3.27s/it] 39%|███████████████████████████████▌ | 129/331 [06:02<10:48, 3.21s/it] 39%|███████████████████████████████▊ | 130/331 [06:05<10:56, 3.27s/it]
40%|████████████████████████████████ | 131/331 [06:09<11:09, 3.35s/it] 40%|████████████████████████████████▎ | 132/331 [06:12<10:31, 3.17s/it] 40%|████████████████████████████████▌ | 133/331 [06:14<09:53, 3.00s/it] 40%|████████████████████████████████▊ | 134/331 [06:17<09:34, 2.91s/it] 41%|█████████████████████████████████ | 135/331 [06:20<09:45, 2.99s/it]
41%|█████████████████████████████████▎ | 136/331 [06:23<09:57, 3.07s/it] 41%|█████████████████████████████████▌ | 137/331 [06:27<10:18, 3.19s/it] 42%|█████████████████████████████████▊ | 138/331 [06:30<10:30, 3.27s/it] 42%|██████████████████████████████████ | 139/331 [06:32<09:24, 2.94s/it] 42%|██████████████████████████████████▎ | 140/331 [06:36<10:07, 3.18s/it]
43%|██████████████████████████████████▌ | 141/331 [06:39<09:39, 3.05s/it] 43%|██████████████████████████████████▋ | 142/331 [06:42<09:21, 2.97s/it] 43%|██████████████████████████████████▉ | 143/331 [06:45<09:43, 3.10s/it] 44%|███████████████████████████████████▏ | 144/331 [06:48<09:15, 2.97s/it] 44%|███████████████████████████████████▍ | 145/331 [06:51<09:05, 2.93s/it]
44%|███████████████████████████████████▋ | 146/331 [06:54<09:32, 3.09s/it] 44%|███████████████████████████████████▉ | 147/331 [06:57<09:07, 2.98s/it] 45%|████████████████████████████████████▏ | 148/331 [06:59<08:36, 2.82s/it] 45%|████████████████████████████████████▍ | 149/331 [07:02<08:07, 2.68s/it] 45%|████████████████████████████████████▋ | 150/331 [07:05<08:25, 2.79s/it]
46%|████████████████████████████████████▉ | 151/331 [07:07<08:20, 2.78s/it] 46%|█████████████████████████████████████▏ | 152/331 [07:10<07:59, 2.68s/it] 46%|█████████████████████████████████████▍ | 153/331 [07:12<07:50, 2.64s/it] 47%|█████████████████████████████████████▋ | 154/331 [07:15<08:10, 2.77s/it] 47%|█████████████████████████████████████▉ | 155/331 [07:19<08:34, 2.92s/it]
47%|██████████████████████████████████████▏ | 156/331 [07:22<08:46, 3.01s/it] 47%|██████████████████████████████████████▍ | 157/331 [07:25<09:05, 3.14s/it] 48%|██████████████████████████████████████▋ | 158/331 [07:29<09:06, 3.16s/it] 48%|██████████████████████████████████████▉ | 159/331 [07:32<09:05, 3.17s/it] 48%|███████████████████████████████████████▏ | 160/331 [07:34<08:31, 2.99s/it]
49%|███████████████████████████████████████▍ | 161/331 [07:37<08:20, 2.95s/it] 49%|███████████████████████████████████████▋ | 162/331 [07:41<08:47, 3.12s/it] 49%|███████████████████████████████████████▉ | 163/331 [07:44<08:55, 3.19s/it] 50%|████████████████████████████████████████▏ | 164/331 [07:47<08:31, 3.06s/it] 50%|████████████████████████████████████████▍ | 165/331 [07:50<08:18, 3.00s/it]
50%|████████████████████████████████████████▌ | 166/331 [07:53<08:08, 2.96s/it] 50%|████████████████████████████████████████▊ | 167/331 [07:56<08:17, 3.03s/it] 51%|█████████████████████████████████████████ | 168/331 [07:58<07:50, 2.89s/it] 51%|█████████████████████████████████████████▎ | 169/331 [08:01<07:53, 2.92s/it] 51%|█████████████████████████████████████████▌ | 170/331 [08:04<07:29, 2.79s/it]
52%|█████████████████████████████████████████▊ | 171/331 [08:07<07:26, 2.79s/it] 52%|██████████████████████████████████████████ | 172/331 [08:09<07:05, 2.67s/it] 52%|██████████████████████████████████████████▎ | 173/331 [08:12<07:19, 2.78s/it] 53%|██████████████████████████████████████████▌ | 174/331 [08:15<07:02, 2.69s/it] 53%|██████████████████████████████████████████▊ | 175/331 [08:17<07:05, 2.73s/it]
53%|███████████████████████████████████████████ | 176/331 [08:20<06:51, 2.65s/it] 53%|███████████████████████████████████████████▎ | 177/331 [08:23<07:13, 2.82s/it] 54%|███████████████████████████████████████████▌ | 178/331 [08:26<07:39, 3.00s/it] 54%|███████████████████████████████████████████▊ | 179/331 [08:30<08:00, 3.16s/it] 54%|████████████████████████████████████████████ | 180/331 [08:33<07:53, 3.14s/it]
55%|████████████████████████████████████████████▎ | 181/331 [08:36<07:43, 3.09s/it] 55%|████████████████████████████████████████████▌ | 182/331 [08:38<07:08, 2.87s/it] 55%|████████████████████████████████████████████▊ | 183/331 [08:41<06:37, 2.68s/it] 56%|█████████████████████████████████████████████ | 184/331 [08:43<06:12, 2.53s/it] 56%|█████████████████████████████████████████████▎ | 185/331 [08:45<05:47, 2.38s/it]
56%|█████████████████████████████████████████████▌ | 186/331 [08:47<05:55, 2.45s/it] 56%|█████████████████████████████████████████████▊ | 187/331 [08:51<06:24, 2.67s/it] 57%|██████████████████████████████████████████████ | 188/331 [08:53<06:18, 2.65s/it] 57%|██████████████████████████████████████████████▎ | 189/331 [08:56<06:00, 2.54s/it] 57%|██████████████████████████████████████████████▍ | 190/331 [08:58<05:49, 2.48s/it]
58%|██████████████████████████████████████████████▋ | 191/331 [09:00<05:45, 2.47s/it] 58%|██████████████████████████████████████████████▉ | 192/331 [09:03<05:34, 2.41s/it] 58%|███████████████████████████████████████████████▏ | 193/331 [09:06<06:01, 2.62s/it] 59%|███████████████████████████████████████████████▍ | 194/331 [09:08<05:39, 2.48s/it] 59%|███████████████████████████████████████████████▋ | 195/331 [09:10<05:31, 2.44s/it]
59%|███████████████████████████████████████████████▉ | 196/331 [09:13<05:36, 2.49s/it] 60%|████████████████████████████████████████████████▏ | 197/331 [09:16<05:54, 2.65s/it] 60%|████████████████████████████████████████████████▍ | 198/331 [09:18<05:38, 2.55s/it] 60%|████████████████████████████████████████████████▋ | 199/331 [09:21<05:44, 2.61s/it] 60%|████████████████████████████████████████████████▉ | 200/331 [09:23<05:23, 2.47s/it]
61%|█████████████████████████████████████████████████▏ | 201/331 [09:25<05:17, 2.44s/it] 61%|█████████████████████████████████████████████████▍ | 202/331 [09:28<05:23, 2.51s/it] 61%|█████████████████████████████████████████████████▋ | 203/331 [09:31<05:23, 2.53s/it] 62%|█████████████████████████████████████████████████▉ | 204/331 [09:34<05:41, 2.69s/it]
62%|██████████████████████████████████████████████████▏ | 205/331 [09:37<05:47, 2.75s/it] 62%|██████████████████████████████████████████████████▍ | 206/331 [09:39<05:44, 2.76s/it] 63%|██████████████████████████████████████████████████▋ | 207/331 [09:43<05:59, 2.90s/it] 63%|██████████████████████████████████████████████████▉ | 208/331 [09:46<06:02, 2.94s/it] 63%|███████████████████████████████████████████████████▏ | 209/331 [09:48<05:33, 2.73s/it]
63%|███████████████████████████████████████████████████▍ | 210/331 [09:50<05:10, 2.57s/it] 64%|███████████████████████████████████████████████████▋ | 211/331 [09:53<05:12, 2.61s/it] 64%|███████████████████████████████████████████████████▉ | 212/331 [09:55<05:00, 2.53s/it] 64%|████████████████████████████████████████████████████ | 213/331 [09:58<05:02, 2.56s/it]
65%|████████████████████████████████████████████████████▎ | 214/331 [10:00<04:46, 2.45s/it] 65%|████████████████████████████████████████████████████▌ | 215/331 [10:02<04:34, 2.36s/it] 65%|████████████████████████████████████████████████████▊ | 216/331 [10:05<04:59, 2.60s/it] 66%|█████████████████████████████████████████████████████ | 217/331 [10:08<04:55, 2.59s/it] 66%|█████████████████████████████████████████████████████▎ | 218/331 [10:11<05:10, 2.75s/it]
66%|█████████████████████████████████████████████████████▌ | 219/331 [10:14<05:04, 2.72s/it] 66%|█████████████████████████████████████████████████████▊ | 220/331 [10:16<04:52, 2.64s/it] 67%|██████████████████████████████████████████████████████ | 221/331 [10:19<04:56, 2.70s/it] 67%|██████████████████████████████████████████████████████▎ | 222/331 [10:21<04:41, 2.58s/it]
67%|██████████████████████████████████████████████████████▌ | 223/331 [10:24<04:46, 2.66s/it] 68%|██████████████████████████████████████████████████████▊ | 224/331 [10:27<04:47, 2.69s/it] 68%|███████████████████████████████████████████████████████ | 225/331 [10:30<04:47, 2.71s/it] 68%|███████████████████████████████████████████████████████▎ | 226/331 [10:33<04:59, 2.85s/it] 69%|███████████████████████████████████████████████████████▌ | 227/331 [10:36<04:53, 2.82s/it]
69%|███████████████████████████████████████████████████████▊ | 228/331 [10:38<04:44, 2.77s/it] 69%|████████████████████████████████████████████████████████ | 229/331 [10:41<04:39, 2.74s/it] 69%|████████████████████████████████████████████████████████▎ | 230/331 [10:43<04:29, 2.67s/it] 70%|████████████████████████████████████████████████████████▌ | 231/331 [10:46<04:34, 2.75s/it]
70%|████████████████████████████████████████████████████████▊ | 232/331 [10:49<04:26, 2.70s/it] 70%|█████████████████████████████████████████████████████████ | 233/331 [10:52<04:33, 2.79s/it] 71%|█████████████████████████████████████████████████████████▎ | 234/331 [10:54<04:17, 2.66s/it] 71%|█████████████████████████████████████████████████████████▌ | 235/331 [10:57<04:08, 2.59s/it] 71%|█████████████████████████████████████████████████████████▊ | 236/331 [11:00<04:34, 2.89s/it]
72%|█████████████████████████████████████████████████████████▉ | 237/331 [11:04<04:48, 3.07s/it] 72%|██████████████████████████████████████████████████████████▏ | 238/331 [11:07<04:42, 3.04s/it] 72%|██████████████████████████████████████████████████████████▍ | 239/331 [11:10<04:43, 3.08s/it] 73%|██████████████████████████████████████████████████████████▋ | 240/331 [11:13<04:44, 3.12s/it]
73%|██████████████████████████████████████████████████████████▉ | 241/331 [11:17<04:51, 3.24s/it] 73%|███████████████████████████████████████████████████████████▏ | 242/331 [11:20<04:48, 3.24s/it] 73%|███████████████████████████████████████████████████████████▍ | 243/331 [11:23<04:45, 3.25s/it] 74%|███████████████████████████████████████████████████████████▋ | 244/331 [11:27<04:48, 3.32s/it] 74%|███████████████████████████████████████████████████████████▉ | 245/331 [11:30<04:35, 3.21s/it]
74%|████████████████████████████████████████████████████████████▏ | 246/331 [11:33<04:46, 3.37s/it] 75%|████████████████████████████████████████████████████████████▍ | 247/331 [11:36<04:34, 3.27s/it] 75%|████████████████████████████████████████████████████████████▋ | 248/331 [11:39<04:16, 3.09s/it] 75%|████████████████████████████████████████████████████████████▉ | 249/331 [11:41<03:54, 2.85s/it]
76%|█████████████████████████████████████████████████████████████▏ | 250/331 [11:44<03:40, 2.73s/it] 76%|█████████████████████████████████████████████████████████████▍ | 251/331 [11:47<03:42, 2.78s/it] 76%|█████████████████████████████████████████████████████████████▋ | 252/331 [11:49<03:29, 2.65s/it] 76%|█████████████████████████████████████████████████████████████▉ | 253/331 [11:52<03:38, 2.81s/it] 77%|██████████████████████████████████████████████████████████████▏ | 254/331 [11:55<03:31, 2.74s/it]
77%|██████████████████████████████████████████████████████████████▍ | 255/331 [11:58<03:35, 2.84s/it] 77%|██████████████████████████████████████████████████████████████▋ | 256/331 [12:00<03:28, 2.78s/it] 78%|██████████████████████████████████████████████████████████████▉ | 257/331 [12:04<03:32, 2.88s/it] 78%|███████████████████████████████████████████████████████████████▏ | 258/331 [12:06<03:18, 2.73s/it]
78%|███████████████████████████████████████████████████████████████▍ | 259/331 [12:09<03:15, 2.72s/it] 79%|███████████████████████████████████████████████████████████████▋ | 260/331 [12:12<03:17, 2.78s/it] 79%|███████████████████████████████████████████████████████████████▊ | 261/331 [12:14<03:03, 2.62s/it] 79%|████████████████████████████████████████████████████████████████ | 262/331 [12:16<03:01, 2.63s/it] 79%|████████████████████████████████████████████████████████████████▎ | 263/331 [12:20<03:08, 2.77s/it]
80%|████████████████████████████████████████████████████████████████▌ | 264/331 [12:22<03:01, 2.71s/it] 80%|████████████████████████████████████████████████████████████████▊ | 265/331 [12:25<02:55, 2.66s/it] 80%|█████████████████████████████████████████████████████████████████ | 266/331 [12:27<02:49, 2.60s/it] 81%|█████████████████████████████████████████████████████████████████▎ | 267/331 [12:30<02:57, 2.77s/it]
81%|█████████████████████████████████████████████████████████████████▌ | 268/331 [12:33<02:54, 2.77s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 81%|█████████████████████████████████████████████████████████████████▊ | 269/331 [12:36<03:02, 2.95s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 82%|██████████████████████████████████████████████████████████████████ | 270/331 [12:39<02:59, 2.95s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 82%|██████████████████████████████████████████████████████████████████▎ | 271/331 [12:43<03:01, 3.03s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 82%|██████████████████████████████████████████████████████████████████▌ | 272/331 [12:45<02:53, 2.93s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. 
If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 82%|██████████████████████████████████████████████████████████████████▊ | 273/331 [12:48<02:50, 2.93s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 83%|███████████████████████████████████████████████████████████████████ | 274/331 [12:52<02:54, 3.07s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 83%|███████████████████████████████████████████████████████████████████▎ | 275/331 [12:55<02:53, 3.11s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 83%|███████████████████████████████████████████████████████████████████▌ | 276/331 [12:57<02:41, 2.93s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
84%|███████████████████████████████████████████████████████████████████▊ | 277/331 [13:00<02:33, 2.85s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|████████████████████████████████████████████████████████████████████ | 278/331 [13:03<02:29, 2.81s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|████████████████████████████████████████████████████████████████████▎ | 279/331 [13:06<02:39, 3.06s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████████▌ | 280/331 [13:09<02:33, 3.02s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████████▊ | 281/331 [13:13<02:36, 3.13s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. 
If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████████ | 282/331 [13:16<02:33, 3.12s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████████▎ | 283/331 [13:19<02:34, 3.21s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|█████████████████████████████████████████████████████████████████████▍ | 284/331 [13:23<02:34, 3.29s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|█████████████████████████████████████████████████████████████████████▋ | 285/331 [13:26<02:32, 3.32s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
86%|█████████████████████████████████████████████████████████████████████▉ | 286/331 [13:29<02:29, 3.32s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|██████████████████████████████████████████████████████████████████████▏ | 287/331 [13:33<02:31, 3.43s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|██████████████████████████████████████████████████████████████████████▍ | 288/331 [13:36<02:27, 3.43s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|██████████████████████████████████████████████████████████████████████▋ | 289/331 [13:39<02:16, 3.24s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
88%|██████████████████████████████████████████████████████████████████████▉ | 290/331 [13:42<02:05, 3.06s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|███████████████████████████████████████████████████████████████████████▏ | 291/331 [13:44<01:55, 2.89s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|███████████████████████████████████████████████████████████████████████▍ | 292/331 [13:47<01:49, 2.80s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|███████████████████████████████████████████████████████████████████████▋ | 293/331 [13:50<01:46, 2.80s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
89%|███████████████████████████████████████████████████████████████████████▉ | 294/331 [13:52<01:38, 2.66s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|████████████████████████████████████████████████████████████████████████▏ | 295/331 [13:55<01:34, 2.62s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|████████████████████████████████████████████████████████████████████████▍ | 296/331 [13:57<01:28, 2.53s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|████████████████████████████████████████████████████████████████████████▋ | 297/331 [14:00<01:35, 2.82s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
90%|████████████████████████████████████████████████████████████████████████▉ | 298/331 [14:04<01:39, 3.03s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|█████████████████████████████████████████████████████████████████████████▏ | 299/331 [14:07<01:33, 2.91s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|█████████████████████████████████████████████████████████████████████████▍ | 300/331 [14:09<01:29, 2.88s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|█████████████████████████████████████████████████████████████████████████▋ | 301/331 [14:12<01:25, 2.84s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
91%|█████████████████████████████████████████████████████████████████████████▉ | 302/331 [14:15<01:21, 2.80s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 92%|██████████████████████████████████████████████████████████████████████████▏ | 303/331 [14:17<01:16, 2.73s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 92%|██████████████████████████████████████████████████████████████████████████▍ | 304/331 [14:20<01:16, 2.82s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 92%|██████████████████████████████████████████████████████████████████████████▋ | 305/331 [14:24<01:15, 2.91s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
92%|██████████████████████████████████████████████████████████████████████████▉ | 306/331 [14:27<01:17, 3.09s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|███████████████████████████████████████████████████████████████████████████▏ | 307/331 [14:31<01:18, 3.25s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|███████████████████████████████████████████████████████████████████████████▎ | 308/331 [14:35<01:19, 3.44s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|███████████████████████████████████████████████████████████████████████████▌ | 309/331 [14:38<01:16, 3.49s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
94%|███████████████████████████████████████████████████████████████████████████▊ | 310/331 [14:41<01:08, 3.26s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|████████████████████████████████████████████████████████████████████████████ | 311/331 [14:44<01:04, 3.25s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|████████████████████████████████████████████████████████████████████████████▎ | 312/331 [14:47<00:57, 3.03s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|████████████████████████████████████████████████████████████████████████████▌ | 313/331 [14:49<00:53, 2.95s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
95%|████████████████████████████████████████████████████████████████████████████▊ | 314/331 [14:53<00:50, 2.99s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|█████████████████████████████████████████████████████████████████████████████ | 315/331 [14:56<00:49, 3.07s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|█████████████████████████████████████████████████████████████████████████████▎ | 316/331 [14:59<00:46, 3.10s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|█████████████████████████████████████████████████████████████████████████████▌ | 317/331 [15:02<00:45, 3.21s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
96%|█████████████████████████████████████████████████████████████████████████████▊ | 318/331 [15:05<00:39, 3.03s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|██████████████████████████████████████████████████████████████████████████████ | 319/331 [15:08<00:34, 2.88s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|██████████████████████████████████████████████████████████████████████████████▎ | 320/331 [15:11<00:32, 2.94s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|██████████████████████████████████████████████████████████████████████████████▌ | 321/331 [15:14<00:29, 2.91s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
97%|██████████████████████████████████████████████████████████████████████████████▊ | 322/331 [15:17<00:27, 3.04s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|███████████████████████████████████████████████████████████████████████████████ | 323/331 [15:20<00:23, 2.95s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|███████████████████████████████████████████████████████████████████████████████▎ | 324/331 [15:23<00:21, 3.05s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|███████████████████████████████████████████████████████████████████████████████▌ | 325/331 [15:26<00:18, 3.08s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
98%|███████████████████████████████████████████████████████████████████████████████▊ | 326/331 [15:29<00:15, 3.12s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|████████████████████████████████████████████████████████████████████████████████ | 327/331 [15:32<00:12, 3.13s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|████████████████████████████████████████████████████████████████████████████████▎| 328/331 [15:36<00:09, 3.16s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|████████████████████████████████████████████████████████████████████████████████▌| 329/331 [15:39<00:06, 3.11s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
100%|████████████████████████████████████████████████████████████████████████████████▊| 330/331 [15:42<00:03, 3.29s/it][INFO|trainer.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|configuration_utils.py:438] 2022-03-01 03:36:54,520 >> Configuration saved in ./checkpoint-1500/config.json 03/01/2022 03:36:54 - INFO - datasets.metric - Removing /home/sanchit_huggingface_co/.cache/huggingface/metrics/wer/default/default_experiment-1-0.arrow [INFO|feature_extraction_utils.py:324] 2022-03-01 03:36:59,615 >> Configuration saved in ./checkpoint-1500/preprocessor_config.json
84%|█████████████████████████████████████████████████████████████▍ | 1501/1784 [2:07:48<25:26:22, 323.61s/it]config.jsoner.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|█████████████████████████████████████████████████████████████▍ | 1501/1784 [2:07:48<25:26:22, 323.61s/it]config.jsoner.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|█████████████████████████████████████████████████████████████▍ | 1501/1784 [2:07:48<25:26:22, 323.61s/it]config.jsoner.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|█████████████████████████████████████████████████████████████▍ | 1502/1784 [2:07:52<17:50:07, 227.69s/it]config.jsoner.py:560] 2022-03-01 03:21:07,167 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
84%|█████████████████████████████████████████████████████████████▌ | 1503/1784 [2:07:55<12:31:41, 160.50s/it]
84%|██████████████████████████████████████████████████████████████▍ | 1504/1784 [2:07:59<8:49:36, 113.49s/it]
84%|███████████████████████████████████████████████████████████████▎ | 1505/1784 [2:08:03<6:14:34, 80.55s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:39:20,616 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.055, 'learning_rate': 6.588785046728972e-06, 'epoch': 0.84}
84%|███████████████████████████████████████████████████████████████▎ | 1507/1784 [2:08:10<3:10:52, 41.34s/it]
85%|███████████████████████████████████████████████████████████████▍ | 1508/1784 [2:08:14<2:18:08, 30.03s/it]
85%|███████████████████████████████████████████████████████████████▍ | 1509/1784 [2:08:17<1:41:22, 22.12s/it]
85%|███████████████████████████████████████████████████████████████▍ | 1510/1784 [2:08:21<1:15:41, 16.58s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:39:38,801 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 3.9209, 'learning_rate': 6.47196261682243e-06, 'epoch': 0.85}
85%|█████████████████████████████████████████████████████████████████▎ | 1512/1784 [2:08:28<45:01, 9.93s/it]
85%|█████████████████████████████████████████████████████████████████▎ | 1513/1784 [2:08:32<36:03, 7.98s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:39:49,250 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.0394, 'learning_rate': 6.401869158878505e-06, 'epoch': 0.85}
85%|█████████████████████████████████████████████████████████████████▍ | 1515/1784 [2:08:39<25:27, 5.68s/it]
85%|█████████████████████████████████████████████████████████████████▍ | 1516/1784 [2:08:42<22:23, 5.01s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:39:59,644 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.0129, 'learning_rate': 6.331775700934579e-06, 'epoch': 0.85}
85%|█████████████████████████████████████████████████████████████████▌ | 1518/1784 [2:08:49<18:46, 4.24s/it]
{'loss': 4.2214, 'learning_rate': 6.308411214953271e-06, 'epoch': 0.85}
85%|█████████████████████████████████████████████████████████████████▌ | 1519/1784 [2:08:53<17:42, 4.01s/it]
85%|█████████████████████████████████████████████████████████████████▌ | 1520/1784 [2:08:56<16:53, 3.84s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:40:13,390 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.0637, 'learning_rate': 6.238317757009346e-06, 'epoch': 0.85}
85%|█████████████████████████████████████████████████████████████████▋ | 1522/1784 [2:09:03<15:41, 3.59s/it]
85%|█████████████████████████████████████████████████████████████████▋ | 1523/1784 [2:09:06<15:19, 3.52s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:40:21,863 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
85%|█████████████████████████████████████████████████████████████████▊ | 1524/1784 [2:09:09<15:04, 3.48s/it]
85%|█████████████████████████████████████████████████████████████████▊ | 1525/1784 [2:09:13<14:45, 3.42s/it]
86%|█████████████████████████████████████████████████████████████████▊ | 1526/1784 [2:09:16<14:25, 3.35s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:40:31,651 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
86%|█████████████████████████████████████████████████████████████████▉ | 1527/1784 [2:09:19<14:12, 3.32s/it]
86%|█████████████████████████████████████████████████████████████████▉ | 1528/1784 [2:09:22<13:56, 3.27s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:40:37,981 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
86%|█████████████████████████████████████████████████████████████████▉ | 1529/1784 [2:09:25<13:44, 3.23s/it]
86%|██████████████████████████████████████████████████████████████████ | 1530/1784 [2:09:29<13:30, 3.19s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:40:44,223 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
86%|██████████████████████████████████████████████████████████████████ | 1531/1784 [2:09:32<13:21, 3.17s/it]
86%|██████████████████████████████████████████████████████████████████ | 1532/1784 [2:09:35<13:14, 3.15s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:40:51,901 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|██████████████████████████████████████████████████████████████████▏ | 1534/1784 [2:09:41<12:52, 3.09s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:40:57,836 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
86%|██████████████████████████████████████████████████████████████████▎ | 1536/1784 [2:09:47<12:23, 3.00s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:41:02,202 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|██████████████████████████████████████████████████████████████████▎ | 1537/1784 [2:09:49<12:08, 2.95s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:41:02,202 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:41:06,344 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 86%|██████████████████████████████████████████████████████████████████▍ | 1539/1784 [2:09:55<11:38, 2.85s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:41:10,473 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
86%|██████████████████████████████████████████████████████████████████▍ | 1540/1784 [2:09:58<11:20, 2.79s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:41:10,473 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:41:14,296 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:41:16,724 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
86%|██████████████████████████████████████████████████████████████████▌ | 1543/1784 [2:10:05<10:09, 2.53s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:41:20,128 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|██████████████████████████████████████████████████████████████████▋ | 1544/1784 [2:10:07<09:35, 2.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:41:22,139 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.6037, 'learning_rate': 5.677570093457944e-06, 'epoch': 0.87} 87%|██████████████████████████████████████████████████████████████████▋ | 1546/1784 [2:10:11<08:26, 2.13s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:41:24,014 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|██████████████████████████████████████████████████████████████████▊ | 1547/1784 [2:10:12<07:46, 1.97s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:41:25,716 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
87%|██████████████████████████████████████████████████████████████████▊ | 1548/1784 [2:10:14<07:05, 1.80s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:41:28,591 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|██████████████████████████████████████████████████████████████████▉ | 1550/1784 [2:10:17<06:33, 1.68s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:41:29,861 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
87%|██████████████████████████████████████████████████████████████████▉ | 1550/1784 [2:10:17<06:33, 1.68s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:41:33,032 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 87%|██████████████████████████████████████████████████████████████████▉ | 1551/1784 [2:10:21<09:14, 2.38s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:41:33,032 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:41:38,760 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1302, 'learning_rate': 5.514018691588785e-06, 'epoch': 0.87} 87%|███████████████████████████████████████████████████████████████████ | 1553/1784 [2:10:28<11:59, 3.12s/it]
{'loss': 4.0421, 'learning_rate': 5.490654205607476e-06, 'epoch': 0.87} 87%|███████████████████████████████████████████████████████████████████ | 1554/1784 [2:10:32<12:40, 3.31s/it] {'loss': 3.9994, 'learning_rate': 5.467289719626168e-06, 'epoch': 0.87} 87%|███████████████████████████████████████████████████████████████████ | 1555/1784 [2:10:36<13:02, 3.42s/it]
{'loss': 4.3427, 'learning_rate': 5.44392523364486e-06, 'epoch': 0.87} 87%|███████████████████████████████████████████████████████████████████▏ | 1556/1784 [2:10:40<13:16, 3.49s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:41:57,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.18, 'learning_rate': 5.397196261682243e-06, 'epoch': 0.87} 87%|███████████████████████████████████████████████████████████████████▏ | 1558/1784 [2:10:47<13:24, 3.56s/it] {'loss': 4.2818, 'learning_rate': 5.373831775700935e-06, 'epoch': 0.87} 87%|███████████████████████████████████████████████████████████████████▎ | 1559/1784 [2:10:50<13:21, 3.56s/it]
{'loss': 3.9262, 'learning_rate': 5.350467289719626e-06, 'epoch': 0.87} 87%|███████████████████████████████████████████████████████████████████▎ | 1560/1784 [2:10:54<13:16, 3.56s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:42:09,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 88%|███████████████████████████████████████████████████████████████████▍ | 1561/1784 [2:10:57<13:09, 3.54s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:42:09,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1897, 'learning_rate': 5.303738317757009e-06, 'epoch': 0.88} 88%|███████████████████████████████████████████████████████████████████▍ | 1562/1784 [2:11:01<13:02, 3.52s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:42:09,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1049, 'learning_rate': 5.280373831775701e-06, 'epoch': 0.88} 88%|███████████████████████████████████████████████████████████████████▍ | 1563/1784 [2:11:04<12:53, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:42:09,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:42:21,956 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0656, 'learning_rate': 5.233644859813084e-06, 'epoch': 0.88} 88%|███████████████████████████████████████████████████████████████████▌ | 1565/1784 [2:11:11<12:44, 3.49s/it]
{'loss': 4.1985, 'learning_rate': 5.210280373831776e-06, 'epoch': 0.88} 88%|███████████████████████████████████████████████████████████████████▌ | 1566/1784 [2:11:15<12:38, 3.48s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:42:32,315 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 3.8778, 'learning_rate': 5.1635514018691585e-06, 'epoch': 0.88} [WARNING|modeling_utils.py:388] 2022-03-01 03:42:32,315 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 88%|███████████████████████████████████████████████████████████████████▋ | 1568/1784 [2:11:22<12:24, 3.45s/it] 88%|███████████████████████████████████████████████████████████████████▋ | 1569/1784 [2:11:25<12:17, 3.43s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:42:42,532 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9919, 'learning_rate': 5.093457943925234e-06, 'epoch': 0.88} 88%|███████████████████████████████████████████████████████████████████▊ | 1571/1784 [2:11:32<12:06, 3.41s/it]
{'loss': 4.0812, 'learning_rate': 5.070093457943925e-06, 'epoch': 0.88} 88%|███████████████████████████████████████████████████████████████████▊ | 1572/1784 [2:11:35<12:03, 3.41s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:42:51,069 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
88%|███████████████████████████████████████████████████████████████████▉ | 1573/1784 [2:11:39<11:58, 3.41s/it] 88%|███████████████████████████████████████████████████████████████████▉ | 1574/1784 [2:11:42<11:53, 3.40s/it]
88%|███████████████████████████████████████████████████████████████████▉ | 1575/1784 [2:11:45<11:47, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:43:01,138 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
88%|████████████████████████████████████████████████████████████████████ | 1576/1784 [2:11:49<11:36, 3.35s/it] 88%|████████████████████████████████████████████████████████████████████ | 1577/1784 [2:11:52<11:28, 3.33s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:43:09,292 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
89%|████████████████████████████████████████████████████████████████████▏ | 1579/1784 [2:11:58<11:12, 3.28s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:43:15,559 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
89%|████████████████████████████████████████████████████████████████████▏ | 1581/1784 [2:12:05<10:47, 3.19s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:43:21,722 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2072, 'learning_rate': 4.813084112149532e-06, 'epoch': 0.89} 89%|████████████████████████████████████████████████████████████████████▎ | 1583/1784 [2:12:11<10:26, 3.12s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:43:27,741 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.0697, 'learning_rate': 4.766355140186916e-06, 'epoch': 0.89} 89%|████████████████████████████████████████████████████████████████████▍ | 1585/1784 [2:12:17<10:02, 3.03s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:43:32,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 89%|████████████████████████████████████████████████████████████████████▍ | 1586/1784 [2:12:19<09:50, 2.98s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:43:36,319 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.008, 'learning_rate': 4.696261682242991e-06, 'epoch': 0.89} 89%|████████████████████████████████████████████████████████████████████▌ | 1588/1784 [2:12:25<09:20, 2.86s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:43:40,437 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
89%|████████████████████████████████████████████████████████████████████▌ | 1589/1784 [2:12:28<09:08, 2.81s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:43:44,324 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 3.962, 'learning_rate': 4.626168224299065e-06, 'epoch': 0.89} 89%|████████████████████████████████████████████████████████████████████▋ | 1591/1784 [2:12:33<08:40, 2.69s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:43:48,178 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 89%|████████████████████████████████████████████████████████████████████▋ | 1592/1784 [2:12:35<08:20, 2.61s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:43:50,518 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
89%|████████████████████████████████████████████████████████████████████▊ | 1593/1784 [2:12:37<07:59, 2.51s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:43:52,741 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 89%|████████████████████████████████████████████████████████████████████▊ | 1594/1784 [2:12:40<07:36, 2.40s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:43:54,738 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
89%|████████████████████████████████████████████████████████████████████▊ | 1595/1784 [2:12:42<07:04, 2.25s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:43:56,582 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 89%|████████████████████████████████████████████████████████████████████▉ | 1596/1784 [2:12:43<06:33, 2.09s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:43:58,200 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
90%|████████████████████████████████████████████████████████████████████▉ | 1598/1784 [2:12:46<05:29, 1.77s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:43:59,681 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4162, 'learning_rate': 4.439252336448598e-06, 'epoch': 0.9} 90%|█████████████████████████████████████████████████████████████████████ | 1599/1784 [2:12:47<04:59, 1.62s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:44:02,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
90%|█████████████████████████████████████████████████████████████████████ | 1600/1784 [2:12:49<05:06, 1.67s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:44:05,590 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
90%|█████████████████████████████████████████████████████████████████████ | 1601/1784 [2:12:53<07:17, 2.39s/it] 90%|█████████████████████████████████████████████████████████████████████▏ | 1602/1784 [2:12:57<08:26, 2.79s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:44:14,758 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0248, 'learning_rate': 4.3224299065420555e-06, 'epoch': 0.9} 90%|█████████████████████████████████████████████████████████████████████▏ | 1604/1784 [2:13:04<09:39, 3.22s/it]
{'loss': 4.1355, 'learning_rate': 4.299065420560747e-06, 'epoch': 0.9} 90%|█████████████████████████████████████████████████████████████████████▎ | 1605/1784 [2:13:08<09:54, 3.32s/it]
90%|█████████████████████████████████████████████████████████████████████▎ | 1606/1784 [2:13:11<10:05, 3.40s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:44:29,130 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0852, 'learning_rate': 4.228971962616822e-06, 'epoch': 0.9} 90%|█████████████████████████████████████████████████████████████████████▍ | 1608/1784 [2:13:19<10:09, 3.46s/it]
{'loss': 3.9907, 'learning_rate': 4.205607476635514e-06, 'epoch': 0.9} 90%|█████████████████████████████████████████████████████████████████████▍ | 1609/1784 [2:13:22<10:04, 3.45s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:44:39,521 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 3.945, 'learning_rate': 4.158878504672897e-06, 'epoch': 0.9} 90%|█████████████████████████████████████████████████████████████████████▌ | 1611/1784 [2:13:29<09:59, 3.47s/it] {'loss': 4.2846, 'learning_rate': 4.135514018691588e-06, 'epoch': 0.9}
90%|█████████████████████████████████████████████████████████████████████▌ | 1612/1784 [2:13:32<09:55, 3.46s/it] 90%|█████████████████████████████████████████████████████████████████████▌ | 1613/1784 [2:13:36<09:51, 3.46s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:44:53,344 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.8733, 'learning_rate': 4.065420560747663e-06, 'epoch': 0.9} 91%|█████████████████████████████████████████████████████████████████████▋ | 1615/1784 [2:13:43<09:35, 3.41s/it]
91%|█████████████████████████████████████████████████████████████████████▋ | 1616/1784 [2:13:46<09:28, 3.39s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:45:01,728 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 91%|█████████████████████████████████████████████████████████████████████▊ | 1617/1784 [2:13:49<09:25, 3.39s/it]
{'loss': 4.0991, 'learning_rate': 3.995327102803738e-06, 'epoch': 0.91} 91%|█████████████████████████████████████████████████████████████████████▊ | 1618/1784 [2:13:53<09:18, 3.37s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:45:09,970 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1337, 'learning_rate': 3.948598130841121e-06, 'epoch': 0.91} 91%|█████████████████████████████████████████████████████████████████████▉ | 1620/1784 [2:13:59<09:04, 3.32s/it] {'loss': 4.1297, 'learning_rate': 3.925233644859813e-06, 'epoch': 0.91}
91%|█████████████████████████████████████████████████████████████████████▉ | 1621/1784 [2:14:02<08:57, 3.30s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:45:18,170 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 91%|██████████████████████████████████████████████████████████████████████ | 1622/1784 [2:14:06<08:54, 3.30s/it] {'loss': 3.8608, 'learning_rate': 3.878504672897196e-06, 'epoch': 0.91}
91%|██████████████████████████████████████████████████████████████████████ | 1623/1784 [2:14:09<08:50, 3.30s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:45:26,290 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.9592, 'learning_rate': 3.831775700934579e-06, 'epoch': 0.91} 91%|██████████████████████████████████████████████████████████████████████▏ | 1625/1784 [2:14:15<08:35, 3.24s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:45:32,658 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2544, 'learning_rate': 3.785046728971962e-06, 'epoch': 0.91} 91%|██████████████████████████████████████████████████████████████████████▏ | 1627/1784 [2:14:22<08:26, 3.23s/it]
{'loss': 4.1496, 'learning_rate': 3.761682242990654e-06, 'epoch': 0.91} 91%|██████████████████████████████████████████████████████████████████████▎ | 1628/1784 [2:14:25<08:18, 3.19s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:45:40,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 91%|██████████████████████████████████████████████████████████████████████▎ | 1629/1784 [2:14:28<08:12, 3.18s/it]
{'loss': 4.2071, 'learning_rate': 3.7149532710280376e-06, 'epoch': 0.91} 91%|██████████████████████████████████████████████████████████████████████▎ | 1630/1784 [2:14:31<08:06, 3.16s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:45:46,856 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 91%|██████████████████████████████████████████████████████████████████████▍ | 1631/1784 [2:14:34<07:59, 3.13s/it]
{'loss': 4.1445, 'learning_rate': 3.6682242990654206e-06, 'epoch': 0.91} 91%|██████████████████████████████████████████████████████████████████████▍ | 1632/1784 [2:14:37<07:48, 3.08s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:45:52,872 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 92%|██████████████████████████████████████████████████████████████████████▍ | 1633/1784 [2:14:40<07:42, 3.06s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:45:57,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0224, 'learning_rate': 3.5981308411214953e-06, 'epoch': 0.92}
92%|██████████████████████████████████████████████████████████████████████▌ | 1635/1784 [2:14:46<07:19, 2.95s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:46:02,798 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.991, 'learning_rate': 3.5514018691588787e-06, 'epoch': 0.92}
92%|██████████████████████████████████████████████████████████████████████▋ | 1637/1784 [2:14:51<06:59, 2.86s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:46:06,897 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 92%|██████████████████████████████████████████████████████████████████████▋ | 1638/1784 [2:14:54<06:45, 2.78s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:46:09,493 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 92%|██████████████████████████████████████████████████████████████████████▋ | 1639/1784 [2:14:57<06:33, 2.72s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:46:13,140 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:46:15,453 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:46:17,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:46:19,620 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:46:21,484 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:46:23,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:46:24,599 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.5435, 'learning_rate': 3.3177570093457945e-06, 'epoch': 0.92}
[WARNING|modeling_utils.py:388] 2022-03-01 03:46:27,239 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.7151, 'learning_rate': 3.2710280373831774e-06, 'epoch': 0.92}
[WARNING|modeling_utils.py:388] 2022-03-01 03:46:28,428 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:46:30,056 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:46:33,931 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2053, 'learning_rate': 3.2009345794392525e-06, 'epoch': 0.93}
[WARNING|modeling_utils.py:388] 2022-03-01 03:46:37,657 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.0842, 'learning_rate': 3.177570093457944e-06, 'epoch': 0.93}
93%|███████████████████████████████████████████████████████████████████████▎ | 1653/1784 [2:15:27<06:26, 2.95s/it]
{'loss': 4.1126, 'learning_rate': 3.1542056074766355e-06, 'epoch': 0.93}
93%|███████████████████████████████████████████████████████████████████████▍ | 1654/1784 [2:15:31<06:48, 3.14s/it]
{'loss': 4.0547, 'learning_rate': 3.1308411214953272e-06, 'epoch': 0.93}
93%|███████████████████████████████████████████████████████████████████████▍ | 1655/1784 [2:15:34<07:03, 3.28s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:46:50,299 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
93%|███████████████████████████████████████████████████████████████████████▍ | 1656/1784 [2:15:38<07:09, 3.36s/it]
{'loss': 4.0489, 'learning_rate': 3.08411214953271e-06, 'epoch': 0.93}
93%|███████████████████████████████████████████████████████████████████████▌ | 1657/1784 [2:15:41<07:13, 3.42s/it]
{'loss': 4.1916, 'learning_rate': 3.060747663551402e-06, 'epoch': 0.93}
93%|███████████████████████████████████████████████████████████████████████▌ | 1658/1784 [2:15:45<07:16, 3.47s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:47:02,684 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 3.9667, 'learning_rate': 3.014018691588785e-06, 'epoch': 0.93}
93%|███████████████████████████████████████████████████████████████████████▋ | 1660/1784 [2:15:52<07:11, 3.48s/it]
{'loss': 4.108, 'learning_rate': 2.9906542056074766e-06, 'epoch': 0.93}
93%|███████████████████████████████████████████████████████████████████████▋ | 1661/1784 [2:15:55<07:05, 3.46s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:47:13,002 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1952, 'learning_rate': 2.94392523364486e-06, 'epoch': 0.93}
93%|███████████████████████████████████████████████████████████████████████▊ | 1663/1784 [2:16:02<06:57, 3.45s/it]
{'loss': 3.8044, 'learning_rate': 2.9205607476635513e-06, 'epoch': 0.93}
93%|███████████████████████████████████████████████████████████████████████▊ | 1664/1784 [2:16:06<06:53, 3.45s/it]
{'loss': 4.1709, 'learning_rate': 2.897196261682243e-06, 'epoch': 0.93}
93%|███████████████████████████████████████████████████████████████████████▊ | 1665/1784 [2:16:09<06:45, 3.41s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:47:24,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
93%|███████████████████████████████████████████████████████████████████████▉ | 1666/1784 [2:16:12<06:42, 3.41s/it]
{'loss': 4.0337, 'learning_rate': 2.850467289719626e-06, 'epoch': 0.93}
93%|███████████████████████████████████████████████████████████████████████▉ | 1667/1784 [2:16:16<06:34, 3.37s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:47:33,248 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.0033, 'learning_rate': 2.8037383177570094e-06, 'epoch': 0.93}
94%|████████████████████████████████████████████████████████████████████████ | 1669/1784 [2:16:22<06:24, 3.35s/it]
{'loss': 4.0537, 'learning_rate': 2.780373831775701e-06, 'epoch': 0.94}
94%|████████████████████████████████████████████████████████████████████████ | 1670/1784 [2:16:26<06:19, 3.33s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:47:41,533 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
94%|████████████████████████████████████████████████████████████████████████ | 1671/1784 [2:16:29<06:14, 3.32s/it]
{'loss': 4.0172, 'learning_rate': 2.733644859813084e-06, 'epoch': 0.94}
94%|████████████████████████████████████████████████████████████████████████▏ | 1672/1784 [2:16:32<06:07, 3.28s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:47:49,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 3.9845, 'learning_rate': 2.6869158878504674e-06, 'epoch': 0.94}
94%|████████████████████████████████████████████████████████████████████████▎ | 1674/1784 [2:16:39<05:57, 3.25s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:47:55,964 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.3764, 'learning_rate': 2.6401869158878504e-06, 'epoch': 0.94}
94%|████████████████████████████████████████████████████████████████████████▎ | 1676/1784 [2:16:45<05:46, 3.21s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:48:02,215 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:47:41,533 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-01 03:48:02,215 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:47:41,533 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 3.848, 'learning_rate': 2.5934579439252334e-06, 'epoch': 0.94} [WARNING|modeling_utils.py:388] 2022-03-01 03:48:02,215 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:47:41,533 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|████████████████████████████████████████████████████████████████████████▍ | 1678/1784 [2:16:51<05:34, 3.16s/it]g-point operations will not be computed-01 03:47:41,533 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
94%|████████████████████████████████████████████████████████████████████████▍ | 1678/1784 [2:16:51<05:34, 3.16s/it]g-point operations will not be computed-01 03:47:41,533 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-01 03:48:08,366 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:47:41,533 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-01 03:48:08,366 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:47:41,533 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-01 03:48:08,366 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:47:41,533 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
94%|████████████████████████████████████████████████████████████████████████▌ | 1680/1784 [2:16:57<05:22, 3.10s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:48:14,523 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 3.8142, 'learning_rate': 2.4999999999999998e-06, 'epoch': 0.94}
94%|████████████████████████████████████████████████████████████████████████▌ | 1682/1784 [2:17:03<05:14, 3.08s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:48:20,531 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.3228, 'learning_rate': 2.453271028037383e-06, 'epoch': 0.94}
94%|████████████████████████████████████████████████████████████████████████▋ | 1684/1784 [2:17:09<05:01, 3.01s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:48:26,358 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
95%|████████████████████████████████████████████████████████████████████████▊ | 1686/1784 [2:17:15<04:47, 2.93s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:48:30,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
95%|████████████████████████████████████████████████████████████████████████▊ | 1687/1784 [2:17:18<04:39, 2.89s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:48:34,654 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:48:37,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1392, 'learning_rate': 2.3130841121495325e-06, 'epoch': 0.95}
95%|████████████████████████████████████████████████████████████████████████▉ | 1690/1784 [2:17:26<04:11, 2.67s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:48:40,939 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
95%|████████████████████████████████████████████████████████████████████████▉ | 1691/1784 [2:17:28<03:58, 2.56s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:48:43,178 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
95%|█████████████████████████████████████████████████████████████████████████ | 1692/1784 [2:17:30<03:44, 2.44s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:48:45,295 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
95%|█████████████████████████████████████████████████████████████████████████ | 1693/1784 [2:17:32<03:31, 2.32s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:48:47,290 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
95%|█████████████████████████████████████████████████████████████████████████ | 1694/1784 [2:17:34<03:18, 2.21s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:48:49,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
95%|█████████████████████████████████████████████████████████████████████████▏ | 1696/1784 [2:17:37<02:48, 1.92s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:48:50,766 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.8174, 'learning_rate': 2.1495327102803736e-06, 'epoch': 0.95}
95%|█████████████████████████████████████████████████████████████████████████▏ | 1697/1784 [2:17:39<02:34, 1.78s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:48:52,278 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
95%|█████████████████████████████████████████████████████████████████████████▎ | 1699/1784 [2:17:41<02:08, 1.51s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:48:54,908 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
95%|█████████████████████████████████████████████████████████████████████████▎ | 1700/1784 [2:17:43<02:11, 1.56s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:48:56,108 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:48:59,147 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
95%|█████████████████████████████████████████████████████████████████████████▍ | 1701/1784 [2:17:47<03:06, 2.24s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:49:02,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
95%|█████████████████████████████████████████████████████████████████████████▍ | 1702/1784 [2:17:51<03:39, 2.68s/it]
95%|█████████████████████████████████████████████████████████████████████████▌ | 1703/1784 [2:17:54<04:00, 2.97s/it]
96%|█████████████████████████████████████████████████████████████████████████▌ | 1704/1784 [2:17:58<04:14, 3.18s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:49:15,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:49:02,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-01 03:49:15,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:49:02,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-01 03:49:15,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:49:02,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|█████████████████████████████████████████████████████████████████████████▋ | 1706/1784 [2:18:05<04:24, 3.39s/it]g-point operations will not be computed-01 03:49:02,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
96%|█████████████████████████████████████████████████████████████████████████▋ | 1706/1784 [2:18:05<04:24, 3.39s/it]g-point operations will not be computed-01 03:49:02,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|█████████████████████████████████████████████████████████████████████████▋ | 1706/1784 [2:18:05<04:24, 3.39s/it]g-point operations will not be computed-01 03:49:02,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|█████████████████████████████████████████████████████████████████████████▋ | 1707/1784 [2:18:09<04:25, 3.45s/it]g-point operations will not be computed-01 03:49:02,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|█████████████████████████████████████████████████████████████████████████▋ | 1707/1784 [2:18:09<04:25, 3.45s/it]g-point operations will not be computed-01 03:49:02,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
96%|█████████████████████████████████████████████████████████████████████████▋ | 1707/1784 [2:18:09<04:25, 3.45s/it]g-point operations will not be computed-01 03:49:02,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|█████████████████████████████████████████████████████████████████████████▋ | 1708/1784 [2:18:12<04:24, 3.48s/it]g-point operations will not be computed-01 03:49:02,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|█████████████████████████████████████████████████████████████████████████▋ | 1708/1784 [2:18:12<04:24, 3.48s/it]g-point operations will not be computed-01 03:49:02,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-01 03:49:29,877 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:49:02,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[WARNING|modeling_utils.py:388] 2022-03-01 03:49:29,877 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:49:02,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-01 03:49:29,877 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-01 03:49:02,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|█████████████████████████████████████████████████████████████████████████▊ | 1710/1784 [2:18:19<04:18, 3.50s/it]g-point operations will not be computed-01 03:49:02,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|█████████████████████████████████████████████████████████████████████████▊ | 1710/1784 [2:18:19<04:18, 3.50s/it]g-point operations will not be computed-01 03:49:02,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
96%|█████████████████████████████████████████████████████████████████████████▊ | 1711/1784 [2:18:23<04:14, 3.48s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:49:40,246 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1777, 'learning_rate': 1.7757009345794394e-06, 'epoch': 0.96}
96%|█████████████████████████████████████████████████████████████████████████▉ | 1713/1784 [2:18:30<04:05, 3.46s/it]
96%|█████████████████████████████████████████████████████████████████████████▉ | 1714/1784 [2:18:33<04:01, 3.45s/it]
96%|██████████████████████████████████████████████████████████████████████████ | 1715/1784 [2:18:37<03:59, 3.46s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:49:52,372 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
96%|██████████████████████████████████████████████████████████████████████████ | 1716/1784 [2:18:40<03:53, 3.44s/it]
96%|██████████████████████████████████████████████████████████████████████████ | 1717/1784 [2:18:43<03:50, 3.44s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:50:00,790 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.3736, 'learning_rate': 1.6355140186915887e-06, 'epoch': 0.96}
96%|██████████████████████████████████████████████████████████████████████████▏ | 1719/1784 [2:18:50<03:40, 3.40s/it]
96%|██████████████████████████████████████████████████████████████████████████▏ | 1720/1784 [2:18:53<03:35, 3.37s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:50:09,163 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
96%|██████████████████████████████████████████████████████████████████████████▎ | 1721/1784 [2:18:57<03:31, 3.35s/it]
{'loss': 4.1393, 'learning_rate': 1.5654205607476636e-06, 'epoch': 0.96}
97%|██████████████████████████████████████████████████████████████████████████▎ | 1722/1784 [2:19:00<03:25, 3.32s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:50:17,230 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.261, 'learning_rate': 1.5186915887850468e-06, 'epoch': 0.97}
97%|██████████████████████████████████████████████████████████████████████████▍ | 1724/1784 [2:19:06<03:16, 3.28s/it]
97%|██████████████████████████████████████████████████████████████████████████▍ | 1725/1784 [2:19:10<03:11, 3.25s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:50:26,851 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
97%|██████████████████████████████████████████████████████████████████████████▌ | 1727/1784 [2:19:16<03:04, 3.23s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:50:33,193 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2803, 'learning_rate': 1.4018691588785047e-06, 'epoch': 0.97}
97%|██████████████████████████████████████████████████████████████████████████▋ | 1729/1784 [2:19:22<02:54, 3.17s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:50:39,420 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.0993, 'learning_rate': 1.3551401869158879e-06, 'epoch': 0.97}
97%|██████████████████████████████████████████████████████████████████████████▋ | 1731/1784 [2:19:28<02:45, 3.12s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:50:45,486 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.0755, 'learning_rate': 1.308411214953271e-06, 'epoch': 0.97}
97%|██████████████████████████████████████████████████████████████████████████▊ | 1733/1784 [2:19:34<02:35, 3.06s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:50:51,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2421, 'learning_rate': 1.2616822429906543e-06, 'epoch': 0.97}
97%|██████████████████████████████████████████████████████████████████████████▉ | 1735/1784 [2:19:40<02:27, 3.00s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:50:55,876 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
97%|██████████████████████████████████████████████████████████████████████████▉ | 1736/1784 [2:19:43<02:22, 2.96s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:51:00,058 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 3.8363, 'learning_rate': 1.191588785046729e-06, 'epoch': 0.97}
97%|███████████████████████████████████████████████████████████████████████████ | 1738/1784 [2:19:49<02:10, 2.84s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:51:04,117 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
97%|███████████████████████████████████████████████████████████████████████████ | 1739/1784 [2:19:51<02:05, 2.79s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:51:07,904 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:51:10,317 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:51:12,611 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:51:14,799 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:51:16,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:51:18,777 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
[WARNING|modeling_utils.py:388] 2022-03-01 03:51:20,587 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:51:22,206 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.2018, 'learning_rate': 9.579439252336447e-07, 'epoch': 0.98} [WARNING|modeling_utils.py:388] 2022-03-01 03:51:24,954 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-01 03:51:26,601 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.0967, 'learning_rate': 8.878504672897197e-07, 'epoch': 0.98} [WARNING|modeling_utils.py:388] 2022-03-01 03:51:30,537 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 3.8762, 'learning_rate': 8.644859813084113e-07, 'epoch': 0.98} 98%|███████████████████████████████████████████████████████████████████████████▌ | 1752/1784 [2:20:20<01:28, 2.76s/it]
{'loss': 4.087, 'learning_rate': 8.411214953271028e-07, 'epoch': 0.98} 98%|███████████████████████████████████████████████████████████████████████████▋ | 1753/1784 [2:20:24<01:33, 3.02s/it] {'loss': 4.2492, 'learning_rate': 8.177570093457944e-07, 'epoch': 0.98} 98%|███████████████████████████████████████████████████████████████████████████▋ | 1754/1784 [2:20:27<01:35, 3.19s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:51:45,146 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.272, 'learning_rate': 7.710280373831776e-07, 'epoch': 0.98} 98%|███████████████████████████████████████████████████████████████████████████▊ | 1756/1784 [2:20:35<01:34, 3.39s/it]
{'loss': 3.8828, 'learning_rate': 7.476635514018691e-07, 'epoch': 0.98} 98%|███████████████████████████████████████████████████████████████████████████▊ | 1757/1784 [2:20:38<01:32, 3.43s/it] {'loss': 3.9841, 'learning_rate': 7.242990654205607e-07, 'epoch': 0.98} 99%|███████████████████████████████████████████████████████████████████████████▉ | 1758/1784 [2:20:42<01:29, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:51:57,416 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
 99%|███████████████████████████████████████████████████████████████████████████▉ | 1759/1784 [2:20:45<01:25, 3.43s/it] {'loss': 4.0956, 'learning_rate': 6.775700934579439e-07, 'epoch': 0.99} 99%|███████████████████████████████████████████████████████████████████████████▉ | 1760/1784 [2:20:48<01:21, 3.41s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:52:05,841 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2357, 'learning_rate': 6.308411214953271e-07, 'epoch': 0.99} 99%|████████████████████████████████████████████████████████████████████████████ | 1762/1784 [2:20:55<01:14, 3.39s/it]
{'loss': 3.8661, 'learning_rate': 6.074766355140187e-07, 'epoch': 0.99} 99%|████████████████████████████████████████████████████████████████████████████ | 1763/1784 [2:20:58<01:11, 3.38s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:52:15,872 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.233, 'learning_rate': 5.607476635514018e-07, 'epoch': 0.99} 99%|████████████████████████████████████████████████████████████████████████████▏| 1765/1784 [2:21:05<01:03, 3.34s/it] {'loss': 3.8625, 'learning_rate': 5.373831775700934e-07, 'epoch': 0.99} 99%|████████████████████████████████████████████████████████████████████████████▏| 1766/1784 [2:21:08<00:59, 3.31s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:52:24,099 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
 99%|████████████████████████████████████████████████████████████████████████████▎| 1767/1784 [2:21:12<00:56, 3.31s/it] {'loss': 3.9741, 'learning_rate': 4.906542056074766e-07, 'epoch': 0.99} 99%|████████████████████████████████████████████████████████████████████████████▎| 1768/1784 [2:21:15<00:52, 3.28s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:52:32,100 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0376, 'learning_rate': 4.4392523364485984e-07, 'epoch': 0.99} 99%|████████████████████████████████████████████████████████████████████████████▍| 1770/1784 [2:21:21<00:45, 3.22s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:52:38,351 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1271, 'learning_rate': 3.97196261682243e-07, 'epoch': 0.99} 99%|████████████████████████████████████████████████████████████████████████████▍| 1772/1784 [2:21:27<00:37, 3.15s/it]
[WARNING|modeling_utils.py:388] 2022-03-01 03:52:44,377 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0569, 'learning_rate': 3.5046728971962617e-07, 'epoch': 0.99} 99%|████████████████████████████████████████████████████████████████████████████▌| 1774/1784 [2:21:33<00:30, 3.02s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:52:48,709 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
 99%|████████████████████████████████████████████████████████████████████████████▌| 1775/1784 [2:21:36<00:26, 2.96s/it] [WARNING|modeling_utils.py:388] 2022-03-01 03:52:52,766 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.1338, 'learning_rate': 2.803738317757009e-07, 'epoch': 1.0} 100%|████████████████████████████████████████████████████████████████████████████▋| 1777/1784 [2:21:41<00:19, 2.79s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:52:56,639 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 100%|████████████████████████████████████████████████████████████████████████████▋| 1778/1784 [2:21:44<00:16, 2.68s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:52:58,889 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
100%|████████████████████████████████████████████████████████████████████████████▊| 1780/1784 [2:21:48<00:09, 2.31s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:53:00,869 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 100%|████████████████████████████████████████████████████████████████████████████▊| 1781/1784 [2:21:49<00:06, 2.09s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:53:02,572 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed
{'loss': 4.042, 'learning_rate': 1.8691588785046729e-07, 'epoch': 1.0} 100%|████████████████████████████████████████████████████████████████████████████▉| 1782/1784 [2:21:51<00:03, 1.89s/it][WARNING|modeling_utils.py:388] 2022-03-01 03:53:05,396 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed █| 1784/1784 [2:21:53<00:00, 1.50s/it][INFO|trainer.py:1492] 2022-03-01 03:53:07,115 >> [INFO|trainer.py:2114] 2022-03-01 03:53:07,116 >> Saving model checkpoint to ./
{'loss': 4.4125, 'learning_rate': 1.1682242990654206e-07, 'epoch': 1.0}
{'loss': 3.5579, 'learning_rate': 9.345794392523364e-08, 'epoch': 1.0}
[INFO|trainer.py:2114] 2022-03-01 03:53:24,709 >> Saving model checkpoint to ./
[INFO|modeling_utils.py:1081] 2022-03-01 03:53:41,008 >> Model weights saved in ./pytorch_model.bin
Upload file pytorch_model.bin: 0%| | 32.0k/2.99G [00:00
Upload file pytorch_model.bin: 1%|▍ | 31.7M/2.99G [00:02<03:07, 16.9MB/s]
Upload file pytorch_model.bin: 2%|█ | 68.8M/2.99G [00:04<02:47, 18.7MB/s]
Upload file pytorch_model.bin: 4%|█▋ | 110M/2.99G [00:06<02:32, 20.3MB/s]
Upload file pytorch_model.bin: 5%|██▎ | 150M/2.99G [00:08<02:27, 20.7MB/s]
Upload file pytorch_model.bin: 6%|██▉ | 190M/2.99G [00:10<02:25, 20.8MB/s]
Upload file pytorch_model.bin: 8%|███▌ | 230M/2.99G [00:12<02:22, 20.9MB/s]
Upload file wandb/run-20220301_013112-3e2necnj/run-3e2necnj.wandb: 100%|███████████| 52.9M/52.9M [00:15<00:00, 16.9MB/s]
07d31f4..9f862b3 main -> main
03/01/2022 03:57:48 - WARNING - huggingface_hub.repository - To https://huggingface.co/sanchit-gandhi/wav2vec2-gpt2-wandb-grid-search
Upload file wandb/run-20220301_013112-3e2necnj/run-3e2necnj.wandb: 100%|████████████| 52.9M/52.9M [02:31<00:00, 217kB/s]
[INFO|modelcard.py:460] 2022-03-01 03:57:51,296 >> Dropping the following result as it does not have all the necessary fields:
Upload file wandb/run-20220301_013112-3e2necnj/run-3e2necnj.wandb: 32%|███▌ | 17.1M/53.0M [00:01<00:02, 17.8MB/s]
03/01/2022 03:57:57 - WARNING - huggingface_hub.repository - To https://huggingface.co/sanchit-gandhi/wav2vec2-gpt2-wandb-grid-search
Upload file wandb/run-20220301_013112-3e2necnj/run-3e2necnj.wandb: 100%|███████████| 53.0M/53.0M [00:03<00:00, 18.5MB/s]
***** train metrics *****
  epoch                    =        1.0
  train_loss               =     4.2472
  train_runtime            = 2:21:54.91
  train_samples            =      28538
  train_samples_per_second =      3.352
  train_steps_per_second   =       0.21
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2369] 2022-03-01 03:58:00,549 >> Batch size = 8aluation *****███| 53.0M/53.0M [00:03<00:00, 18.5MB/s]ields:trainer.py:1492] 2022-03-01 03:53:07,115 >> 6,589 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 
[INFO|trainer.py:2114] 2022-03-01 04:16:28,516 >> Saving model checkpoint to ./
03/01/2022 04:16:28 - INFO - datasets.metric - Removing /home/sanchit_huggingface_co/.cache/huggingface/metrics/wer/default/default_experiment-1-0.arrow
***** eval metrics *****
  epoch                   =        1.0
  eval_loss               =     4.1887
  eval_runtime            = 0:18:27.96
  eval_samples            =       2642
  eval_samples_per_second =      2.385
  eval_steps_per_second   =      0.299
[INFO|modeling_utils.py:1081] 2022-03-01 04:16:44,802 >> Model weights saved in ./pytorch_model.bin
Upload file wandb/run-20220301_013112-3e2necnj/run-3e2necnj.wandb: 29%|███▏ | 15.7M/53.2M [00:01<00:02, 16.3MB/s]
Upload file wandb/run-20220301_013112-3e2necnj/run-3e2necnj.wandb: 100%|███████████| 53.2M/53.2M [00:03<00:00, 18.5MB/s]
03/01/2022 04:17:14 - WARNING - huggingface_hub.repository - To https://huggingface.co/sanchit-gandhi/wav2vec2-gpt2-wandb-grid-search
return ModelInfo(**d)f.finetuned_from)formers/src/transformers/modelcard.py", line 611, in from_trainercard31, in mainule>