0%| | 0/1784 [00:00> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9004, 'learning_rate': 0.0, 'epoch': 0.0} [WARNING|modeling_utils.py:388] 2022-03-03 00:45:25,905 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 0%| | 1/1784 [00:04<2:06:52, 4.27s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:45:27,846 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.028, 'learning_rate': 0.0, 'epoch': 0.0} [WARNING|modeling_utils.py:388] 2022-03-03 00:45:29,734 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 0%| | 2/1784 [00:08<1:59:06, 4.01s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:45:31,640 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:45:33,485 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.1215, 'learning_rate': 2e-06, 'epoch': 0.0} 0%|▏ | 3/1784 [00:12<1:58:52, 4.00s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:45:35,597 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6759, 'learning_rate': 4e-06, 'epoch': 0.0} [WARNING|modeling_utils.py:388] 2022-03-03 00:45:37,399 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 0%|▏ | 4/1784 [00:15<1:55:28, 3.89s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:45:39,346 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7525, 'learning_rate': 6e-06, 'epoch': 0.0} [WARNING|modeling_utils.py:388] 2022-03-03 00:45:41,156 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 0%|▏ | 5/1784 [00:19<1:53:57, 3.84s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:45:43,076 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7486, 'learning_rate': 8e-06, 'epoch': 0.0} [WARNING|modeling_utils.py:388] 2022-03-03 00:45:44,869 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 0%|▎ | 6/1784 [00:23<1:52:35, 3.80s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:45:46,761 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5801, 'learning_rate': 1e-05, 'epoch': 0.0} [WARNING|modeling_utils.py:388] 2022-03-03 00:45:48,914 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 0%|▎ | 7/1784 [00:27<1:55:33, 3.90s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:45:51,556 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6594, 'learning_rate': 1.2e-05, 'epoch': 0.0} [WARNING|modeling_utils.py:388] 2022-03-03 00:45:53,353 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 0%|▎ | 8/1784 [00:31<1:59:55, 4.05s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:45:55,231 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.831, 'learning_rate': 1.4e-05, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-03 00:45:56,985 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▍ | 9/1784 [00:35<1:55:58, 3.92s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:45:58,828 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:46:00,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▍ | 10/1784 [00:38<1:52:50, 3.82s/it] 1%|▍ | 10/1784 [00:38<1:52:50, 3.82s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:46:02,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.513, 'learning_rate': 1.8e-05, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-03 00:46:04,150 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▍ | 11/1784 [00:42<1:50:38, 3.74s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:46:05,979 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:46:07,663 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▌ | 12/1784 [00:46<1:48:30, 3.67s/it] 1%|▌ | 12/1784 [00:46<1:48:30, 3.67s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:46:09,484 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:46:11,191 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▌ | 13/1784 [00:49<1:47:08, 3.63s/it] 1%|▌ | 13/1784 [00:49<1:47:08, 3.63s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:46:13,030 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2523, 'learning_rate': 2.4e-05, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-03 00:46:14,664 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▌ | 14/1784 [00:53<1:45:40, 3.58s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:46:16,478 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:46:18,133 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 15/1784 [00:56<1:44:37, 3.55s/it] 1%|▋ | 15/1784 [00:56<1:44:37, 3.55s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:46:19,892 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:46:21,568 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 16/1784 [00:59<1:43:33, 3.51s/it] 1%|▋ | 16/1784 [00:59<1:43:33, 3.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:46:23,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4427, 'learning_rate': 3e-05, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-03 00:46:25,019 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▊ | 17/1784 [01:03<1:42:56, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:46:26,753 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:46:28,393 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▊ | 18/1784 [01:06<1:41:48, 3.46s/it] 1%|▊ | 18/1784 [01:06<1:41:48, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:46:30,135 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:46:31,775 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▊ | 19/1784 [01:10<1:41:04, 3.44s/it] 1%|▊ | 19/1784 [01:10<1:41:04, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:46:33,480 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:46:35,125 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▉ | 20/1784 [01:13<1:40:16, 3.41s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:46:36,870 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6023, 'learning_rate': 3.6e-05, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-03 00:46:38,474 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▉ | 21/1784 [01:16<1:39:38, 3.39s/it] 1%|▉ | 21/1784 [01:16<1:39:38, 3.39s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:46:40,257 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:46:41,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▉ | 22/1784 [01:20<1:39:35, 3.39s/it] 1%|▉ | 22/1784 [01:20<1:39:35, 3.39s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:46:43,606 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4214, 'learning_rate': 4.2000000000000004e-05, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-03 00:46:45,216 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|█ | 23/1784 [01:23<1:39:09, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:46:46,917 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:46:48,494 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|█ | 24/1784 [01:26<1:38:13, 3.35s/it] 1%|█ | 24/1784 [01:26<1:38:13, 3.35s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:46:50,185 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:46:51,737 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|█ | 25/1784 [01:30<1:37:15, 3.32s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:46:53,383 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3643, 'learning_rate': 4.6e-05, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-03 00:46:54,946 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|█▏ | 26/1784 [01:33<1:36:15, 3.28s/it] 1%|█▏ | 26/1784 [01:33<1:36:15, 3.28s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:46:56,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:46:58,173 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▏ | 27/1784 [01:36<1:35:41, 3.27s/it] 2%|█▏ | 27/1784 [01:36<1:35:41, 3.27s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:46:59,863 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.032, 'learning_rate': 5.2e-05, 'epoch': 0.02} [WARNING|modeling_utils.py:388] 2022-03-03 00:47:01,455 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▏ | 28/1784 [01:39<1:35:45, 3.27s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:03,121 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.341, 'learning_rate': 5.4e-05, 'epoch': 0.02} [WARNING|modeling_utils.py:388] 2022-03-03 00:47:04,725 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▎ | 29/1784 [01:43<1:35:40, 3.27s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:06,392 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:47:07,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▎ | 30/1784 [01:46<1:34:55, 3.25s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:09,562 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2808, 'learning_rate': 5.6e-05, 'epoch': 0.02} [WARNING|modeling_utils.py:388] 2022-03-03 00:47:11,085 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▎ | 31/1784 [01:49<1:34:11, 3.22s/it] 2%|█▎ | 31/1784 [01:49<1:34:11, 3.22s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:12,715 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:47:14,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▍ | 32/1784 [01:52<1:33:17, 3.19s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:15,799 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5311, 'learning_rate': 6e-05, 'epoch': 0.02} [WARNING|modeling_utils.py:388] 2022-03-03 00:47:17,243 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▍ | 33/1784 [01:55<1:31:47, 3.15s/it] 2%|█▍ | 33/1784 [01:55<1:31:47, 3.15s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:18,807 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:47:20,228 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▌ | 34/1784 [01:58<1:30:19, 3.10s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:21,763 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.107, 'learning_rate': 6.4e-05, 'epoch': 0.02} [WARNING|modeling_utils.py:388] 2022-03-03 00:47:23,172 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▌ | 35/1784 [02:01<1:28:56, 3.05s/it] 2%|█▌ | 35/1784 [02:01<1:28:56, 3.05s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:24,706 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:47:26,108 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▌ | 36/1784 [02:04<1:27:54, 3.02s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:27,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.674, 'learning_rate': 6.800000000000001e-05, 'epoch': 0.02} [WARNING|modeling_utils.py:388] 2022-03-03 00:47:28,933 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▋ | 37/1784 [02:07<1:26:10, 2.96s/it] 2%|█▋ | 37/1784 [02:07<1:26:10, 2.96s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:30,407 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:47:31,748 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▋ | 38/1784 [02:10<1:24:52, 2.92s/it] 2%|█▋ | 38/1784 [02:10<1:24:52, 2.92s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:33,155 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:47:34,467 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▋ | 39/1784 [02:12<1:23:05, 2.86s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:35,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5041, 'learning_rate': 7.4e-05, 'epoch': 0.02} [WARNING|modeling_utils.py:388] 2022-03-03 00:47:37,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 40/1784 [02:15<1:21:09, 2.79s/it] 2%|█▊ | 40/1784 [02:15<1:21:09, 2.79s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:38,469 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:47:39,695 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 41/1784 [02:18<1:19:17, 2.73s/it] 2%|█▊ | 41/1784 [02:18<1:19:17, 2.73s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:40,980 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:47:42,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 42/1784 [02:20<1:16:42, 2.64s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:43,378 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1578, 'learning_rate': 8e-05, 'epoch': 0.02} [WARNING|modeling_utils.py:388] 2022-03-03 00:47:44,451 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▉ | 43/1784 [02:22<1:13:51, 2.55s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:45,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4119, 'learning_rate': 8.2e-05, 'epoch': 0.02} {'loss': 4.2466, 'learning_rate': 8.400000000000001e-05, 'epoch': 0.02} [WARNING|modeling_utils.py:388] 2022-03-03 00:47:46,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▉ | 44/1784 [02:25<1:10:30, 2.43s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:47,695 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:47:48,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|█▉ | 45/1784 [02:27<1:06:42, 2.30s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:49,565 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:47:50,352 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5696, 'learning_rate': 8.599999999999999e-05, 'epoch': 0.03} 3%|██ | 46/1784 [02:28<1:01:46, 2.13s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:51,209 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:47:51,930 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▏ | 47/1784 [02:30<56:54, 1.97s/it] {'loss': 4.5456, 'learning_rate': 8.8e-05, 'epoch': 0.03} 3%|██▏ | 47/1784 [02:30<56:54, 1.97s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:52,706 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:47:53,337 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▏ | 48/1784 [02:31<52:01, 1.80s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:54,016 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8245, 'learning_rate': 9.2e-05, 'epoch': 0.03} [WARNING|modeling_utils.py:388] 2022-03-03 00:47:54,576 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▏ | 49/1784 [02:33<47:11, 1.63s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:55,219 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:47:56,276 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▎ | 50/1784 [02:34<47:43, 1.65s/it] 3%|██▎ | 50/1784 [02:34<47:43, 1.65s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:58,398 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▎ | 50/1784 [02:34<47:43, 1.65s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:47:58,398 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▎ | 51/1784 [02:38<1:07:56, 2.35s/it]g-point operations will not be computed-03 00:47:58,398 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▎ | 51/1784 [02:38<1:07:56, 2.35s/it]g-point operations will not be computed-03 00:47:58,398 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▎ | 51/1784 [02:38<1:07:56, 2.35s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:02,222 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▎ | 51/1784 [02:38<1:07:56, 2.35s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:02,222 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▎ | 52/1784 [02:42<1:20:27, 2.79s/it]g-point operations will not be computed-03 00:48:02,222 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▎ | 52/1784 [02:42<1:20:27, 2.79s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:06,006 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▎ | 52/1784 [02:42<1:20:27, 2.79s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:06,006 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▎ | 53/1784 [02:46<1:28:53, 3.08s/it]g-point operations will not be computed-03 00:48:06,006 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▎ | 53/1784 [02:46<1:28:53, 3.08s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:09,725 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▎ | 53/1784 [02:46<1:28:53, 3.08s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:09,725 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▍ | 54/1784 [02:49<1:34:09, 3.27s/it]g-point operations will not be computed-03 00:48:09,725 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▍ | 54/1784 [02:49<1:34:09, 3.27s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:13,410 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▍ | 55/1784 [02:53<1:37:30, 3.38s/it]g-point operations will not be computed-03 00:48:13,410 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▍ | 55/1784 [02:53<1:37:30, 3.38s/it]g-point operations will not be computed-03 00:48:13,410 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:48:18,807 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:48:17,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:48:18,807 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:48:17,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4468, 'learning_rate': 0.000108, 'epoch': 0.03} 3%|██▍ | 56/1784 [02:57<1:39:29, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:20,680 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:48:22,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▌ | 57/1784 [03:00<1:41:01, 3.51s/it] 3%|██▌ | 57/1784 [03:00<1:41:01, 3.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:24,277 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▌ | 58/1784 [03:04<1:41:32, 3.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:24,277 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▌ | 58/1784 [03:04<1:41:32, 3.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:24,277 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▌ | 58/1784 [03:04<1:41:32, 3.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:27,882 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▌ | 59/1784 [03:08<1:41:46, 3.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:27,882 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▌ | 59/1784 [03:08<1:41:46, 3.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:27,882 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▌ | 59/1784 [03:08<1:41:46, 3.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:31,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▌ | 59/1784 [03:08<1:41:46, 3.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:31,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▋ | 60/1784 [03:11<1:42:35, 3.57s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:35,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▋ | 61/1784 [03:15<1:42:06, 3.56s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:35,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▋ | 61/1784 [03:15<1:42:06, 3.56s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:35,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▋ | 61/1784 [03:15<1:42:06, 3.56s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:38,533 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▋ | 62/1784 [03:18<1:41:25, 3.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:38,533 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▋ | 62/1784 [03:18<1:41:25, 3.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:38,533 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▋ | 62/1784 [03:18<1:41:25, 3.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:42,043 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▊ | 63/1784 [03:22<1:41:18, 3.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:42,043 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▊ | 63/1784 [03:22<1:41:18, 3.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:42,043 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▊ | 64/1784 [03:25<1:41:01, 3.52s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:45,563 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▊ | 64/1784 [03:25<1:41:01, 3.52s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:45,563 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▊ | 64/1784 [03:25<1:41:01, 3.52s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:49,060 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▉ | 65/1784 [03:29<1:40:24, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:49,060 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▉ | 65/1784 [03:29<1:40:24, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:49,060 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▉ | 65/1784 [03:29<1:40:24, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:52,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▉ | 66/1784 [03:32<1:39:47, 3.49s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:52,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▉ | 66/1784 [03:32<1:39:47, 3.49s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:52,491 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▉ | 67/1784 [03:36<1:39:26, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:55,967 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▉ | 67/1784 [03:36<1:39:26, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:55,967 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▉ | 67/1784 [03:36<1:39:26, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:59,376 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 68/1784 [03:39<1:38:23, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:59,376 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 68/1784 [03:39<1:38:23, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:48:59,376 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 68/1784 [03:39<1:38:23, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:02,740 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 69/1784 [03:42<1:37:56, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:02,740 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 69/1784 [03:42<1:37:56, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:02,740 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 70/1784 [03:46<1:37:49, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:06,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 70/1784 [03:46<1:37:49, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:06,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 70/1784 [03:46<1:37:49, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:09,504 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 71/1784 [03:49<1:37:01, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:09,504 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 71/1784 [03:49<1:37:01, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:09,504 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 71/1784 [03:49<1:37:01, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:12,861 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 72/1784 [03:52<1:36:31, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:16,203 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 72/1784 [03:52<1:36:31, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:16,203 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 73/1784 [03:56<1:36:09, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:16,203 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 73/1784 [03:56<1:36:09, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:16,203 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 73/1784 [03:56<1:36:09, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:19,529 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▎ | 74/1784 [03:59<1:35:10, 3.34s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:19,529 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▎ | 74/1784 [03:59<1:35:10, 3.34s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:19,529 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▎ | 75/1784 [04:02<1:34:40, 3.32s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:22,786 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▎ | 75/1784 [04:02<1:34:40, 3.32s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:22,786 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▎ | 75/1784 [04:02<1:34:40, 3.32s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:26,062 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▎ | 76/1784 [04:06<1:34:10, 3.31s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:26,062 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▎ | 76/1784 [04:06<1:34:10, 3.31s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:26,062 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▎ | 76/1784 [04:06<1:34:10, 3.31s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:29,349 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▎ | 76/1784 [04:06<1:34:10, 3.31s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:29,349 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▍ | 77/1784 [04:09<1:33:17, 3.28s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:32,532 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▍ | 78/1784 [04:12<1:32:48, 3.26s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:32,532 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▍ | 78/1784 [04:12<1:32:48, 3.26s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:32,532 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▍ | 78/1784 [04:12<1:32:48, 3.26s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:35,734 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▍ | 79/1784 [04:15<1:31:47, 3.23s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:38,849 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▍ | 79/1784 [04:15<1:31:47, 3.23s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:38,849 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▌ | 80/1784 [04:18<1:30:56, 3.20s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:38,849 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▌ | 80/1784 [04:18<1:30:56, 3.20s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:38,849 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▌ | 80/1784 [04:18<1:30:56, 3.20s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:41,978 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▌ | 81/1784 [04:21<1:30:06, 3.17s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:45,050 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▌ | 81/1784 [04:21<1:30:06, 3.17s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:45,050 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▋ | 82/1784 [04:24<1:28:47, 3.13s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:45,050 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▋ | 82/1784 [04:24<1:28:47, 3.13s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:45,050 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▋ | 82/1784 [04:24<1:28:47, 3.13s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:48,105 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▋ | 82/1784 [04:24<1:28:47, 3.13s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:48,105 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▋ | 83/1784 [04:28<1:28:16, 3.11s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:51,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▋ | 83/1784 [04:28<1:28:16, 3.11s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:51,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▋ | 84/1784 [04:30<1:26:49, 3.06s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:51,084 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▋ | 84/1784 [04:30<1:26:49, 3.06s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:54,018 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▋ | 84/1784 [04:30<1:26:49, 3.06s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:54,018 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▊ | 85/1784 [04:33<1:24:55, 3.00s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:56,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▊ | 85/1784 [04:33<1:24:55, 3.00s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:56,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▊ | 86/1784 [04:36<1:23:30, 2.95s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:56,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▊ | 86/1784 [04:36<1:23:30, 2.95s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:56,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▊ | 87/1784 [04:39<1:22:22, 2.91s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:49:59,714 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▊ | 87/1784 [04:39<1:22:22, 2.91s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:02,453 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▊ | 87/1784 [04:39<1:22:22, 2.91s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:02,453 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▉ | 88/1784 [04:42<1:21:28, 2.88s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:05,229 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▉ | 88/1784 [04:42<1:21:28, 2.88s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:05,229 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▉ | 89/1784 [04:44<1:19:06, 2.80s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:05,229 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▉ | 89/1784 [04:44<1:19:06, 2.80s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:05,229 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▉ | 90/1784 [04:47<1:17:05, 2.73s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:07,817 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▉ | 90/1784 [04:47<1:17:05, 2.73s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:07,817 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████ | 91/1784 [04:49<1:14:12, 2.63s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:10,284 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████ | 92/1784 [04:52<1:11:23, 2.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:12,639 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████ | 92/1784 [04:52<1:11:23, 2.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:12,639 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████ | 93/1784 [04:54<1:08:10, 2.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:14,880 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████ | 93/1784 [04:54<1:08:10, 2.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:14,880 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▏ | 94/1784 [04:56<1:04:02, 2.27s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:16,927 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▏ | 94/1784 [04:56<1:04:02, 2.27s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:16,927 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8955, 'learning_rate': 0.000184, 'epoch': 0.05} 5%|████▎ | 95/1784 [04:57<59:26, 2.11s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:18,774 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▎ | 95/1784 [04:57<59:26, 2.11s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:18,774 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▎ | 96/1784 [04:59<55:08, 1.96s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:21,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▎ | 96/1784 [04:59<55:08, 1.96s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:21,941 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▍ | 98/1784 [05:02<46:19, 1.65s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:23,307 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▍ | 98/1784 [05:02<46:19, 1.65s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:23,307 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6766, 'learning_rate': 0.000192, 'epoch': 0.05} 6%|████▍ | 99/1784 [05:03<42:28, 1.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:25,705 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 99/1784 [05:03<42:28, 1.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:25,705 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 100/1784 [05:05<43:49, 1.56s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:25,705 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 100/1784 [05:05<43:49, 1.56s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:28,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 100/1784 [05:05<43:49, 1.56s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:28,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 101/1784 [05:09<1:03:00, 2.25s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:28,794 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 102/1784 [05:12<1:14:49, 2.67s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:32,493 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 102/1784 [05:12<1:14:49, 2.67s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:32,493 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 102/1784 [05:12<1:14:49, 2.67s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:36,116 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 103/1784 [05:16<1:22:45, 2.95s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:36,116 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 103/1784 [05:16<1:22:45, 2.95s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:36,116 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 103/1784 [05:16<1:22:45, 2.95s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:39,758 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 103/1784 [05:16<1:22:45, 2.95s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:39,758 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 104/1784 [05:19<1:28:38, 3.17s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:39,758 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 104/1784 [05:19<1:28:38, 3.17s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:43,380 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 104/1784 [05:19<1:28:38, 3.17s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:43,380 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 105/1784 [05:23<1:31:48, 3.28s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:43,380 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 105/1784 [05:23<1:31:48, 3.28s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:46,904 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 105/1784 [05:23<1:31:48, 3.28s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:46,904 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 106/1784 [05:27<1:34:01, 3.36s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:46,904 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 107/1784 [05:30<1:35:33, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:50,490 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 107/1784 [05:30<1:35:33, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:50,490 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 107/1784 [05:30<1:35:33, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:54,034 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 107/1784 [05:30<1:35:33, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:54,034 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 108/1784 [05:34<1:36:41, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:54,034 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 108/1784 [05:34<1:36:41, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:57,573 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 108/1784 [05:34<1:36:41, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:57,573 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 109/1784 [05:37<1:37:23, 3.49s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:50:57,573 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 110/1784 [05:41<1:37:44, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:01,135 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 110/1784 [05:41<1:37:44, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:01,135 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 110/1784 [05:41<1:37:44, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:04,670 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 111/1784 [05:44<1:37:39, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:04,670 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 111/1784 [05:44<1:37:39, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:04,670 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 111/1784 [05:44<1:37:39, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:08,103 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 111/1784 [05:44<1:37:39, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:08,103 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 112/1784 [05:48<1:37:02, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:08,103 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 112/1784 [05:48<1:37:02, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:11,507 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 112/1784 [05:48<1:37:02, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:11,507 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 113/1784 [05:51<1:35:59, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:14,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 113/1784 [05:51<1:35:59, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:14,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 114/1784 [05:55<1:36:08, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:14,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 114/1784 [05:55<1:36:08, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:18,360 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 114/1784 [05:55<1:36:08, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:18,360 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|█████ | 115/1784 [05:58<1:35:24, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:18,360 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:51:21,744 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:51:21,744 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████ | 116/1784 [06:01<1:34:54, 3.41s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:25,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████ | 116/1784 [06:01<1:34:54, 3.41s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:25,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████ | 117/1784 [06:05<1:34:53, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:25,118 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████ | 117/1784 [06:05<1:34:53, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:28,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████ | 117/1784 [06:05<1:34:53, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:28,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▏ | 118/1784 [06:08<1:34:00, 3.39s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:28,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▏ | 118/1784 [06:08<1:34:00, 3.39s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:31,836 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▏ | 118/1784 [06:08<1:34:00, 3.39s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:31,836 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▏ | 119/1784 [06:11<1:34:26, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:35,241 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▏ | 119/1784 [06:11<1:34:26, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:35,241 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▏ | 120/1784 [06:15<1:33:31, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:35,241 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▏ | 120/1784 [06:15<1:33:31, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:38,533 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▏ | 120/1784 [06:15<1:33:31, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:38,533 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 121/1784 [06:18<1:32:52, 3.35s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:41,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 122/1784 [06:21<1:33:00, 3.36s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:41,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 122/1784 [06:21<1:33:00, 3.36s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:41,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 122/1784 [06:21<1:33:00, 3.36s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:45,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 122/1784 [06:21<1:33:00, 3.36s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:45,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 123/1784 [06:25<1:31:47, 3.32s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:45,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 124/1784 [06:28<1:31:06, 3.29s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:48,443 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 124/1784 [06:28<1:31:06, 3.29s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:48,443 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 124/1784 [06:28<1:31:06, 3.29s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:51,640 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 124/1784 [06:28<1:31:06, 3.29s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:51,640 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 125/1784 [06:31<1:30:35, 3.28s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:51,640 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 125/1784 [06:31<1:30:35, 3.28s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:54,839 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 125/1784 [06:31<1:30:35, 3.28s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:54,839 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 126/1784 [06:34<1:29:43, 3.25s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:54,839 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 126/1784 [06:34<1:29:43, 3.25s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:54,839 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 127/1784 [06:37<1:29:07, 3.23s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:51:58,035 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 127/1784 [06:37<1:29:07, 3.23s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:01,179 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 127/1784 [06:37<1:29:07, 3.23s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:01,179 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 128/1784 [06:41<1:28:29, 3.21s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:01,179 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 128/1784 [06:41<1:28:29, 3.21s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:01,179 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 129/1784 [06:44<1:27:52, 3.19s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:04,337 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 129/1784 [06:44<1:27:52, 3.19s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:07,459 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 129/1784 [06:44<1:27:52, 3.19s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:07,459 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 130/1784 [06:47<1:26:51, 3.15s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:07,459 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 131/1784 [06:50<1:26:08, 3.13s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:10,517 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 131/1784 [06:50<1:26:08, 3.13s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:10,517 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 131/1784 [06:50<1:26:08, 3.13s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:13,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 131/1784 [06:50<1:26:08, 3.13s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:13,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▊ | 132/1784 [06:53<1:25:48, 3.12s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:13,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▊ | 133/1784 [06:56<1:25:35, 3.11s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:16,747 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▊ | 133/1784 [06:56<1:25:35, 3.11s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:16,747 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▊ | 133/1784 [06:56<1:25:35, 3.11s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:19,689 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▊ | 133/1784 [06:56<1:25:35, 3.11s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:19,689 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|█████▊ | 134/1784 [06:59<1:23:40, 3.04s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:22,584 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|█████▊ | 134/1784 [06:59<1:23:40, 3.04s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:22,584 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|█████▉ | 135/1784 [07:02<1:22:26, 3.00s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:22,584 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|█████▉ | 135/1784 [07:02<1:22:26, 3.00s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:25,483 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|█████▉ | 135/1784 [07:02<1:22:26, 3.00s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:25,483 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|█████▉ | 136/1784 [07:05<1:21:33, 2.97s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:28,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|█████▉ | 136/1784 [07:05<1:21:33, 2.97s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:28,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|█████▉ | 137/1784 [07:08<1:20:11, 2.92s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:28,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 138/1784 [07:10<1:18:49, 2.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:31,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 138/1784 [07:10<1:18:49, 2.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:31,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 138/1784 [07:10<1:18:49, 2.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:33,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 138/1784 [07:10<1:18:49, 2.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:33,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 139/1784 [07:13<1:17:00, 2.81s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:36,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 139/1784 [07:13<1:17:00, 2.81s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:36,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 140/1784 [07:16<1:14:57, 2.74s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:36,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████ | 140/1784 [07:16<1:14:57, 2.74s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:36,439 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 141/1784 [07:18<1:12:41, 2.65s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:38,966 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 142/1784 [07:20<1:10:02, 2.56s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:41,386 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 142/1784 [07:20<1:10:02, 2.56s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:41,386 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 143/1784 [07:23<1:07:18, 2.46s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:43,661 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 143/1784 [07:23<1:07:18, 2.46s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:43,661 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 144/1784 [07:25<1:03:38, 2.33s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:45,785 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 144/1784 [07:25<1:03:38, 2.33s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:45,785 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 145/1784 [07:27<1:00:02, 2.20s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:47,722 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 145/1784 [07:27<1:00:02, 2.20s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:47,722 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.2152, 'learning_rate': 0.00028599999999999996, 'epoch': 0.08} 8%|██████▌ | 146/1784 [07:28<56:37, 2.07s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:49,610 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▌ | 146/1784 [07:28<56:37, 2.07s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:49,610 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▌ | 147/1784 [07:30<51:57, 1.90s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:52,719 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▌ | 147/1784 [07:30<51:57, 1.90s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:52,719 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▋ | 149/1784 [07:32<43:37, 1.60s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:55,189 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▋ | 149/1784 [07:32<43:37, 1.60s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:55,189 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.2187, 'learning_rate': 0.000294, 'epoch': 0.08} 8%|██████▋ | 150/1784 [07:34<43:48, 1.61s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:55,189 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▋ | 150/1784 [07:34<43:48, 1.61s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:58,245 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▋ | 150/1784 [07:34<43:48, 1.61s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:58,245 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▌ | 151/1784 [07:38<1:01:53, 2.27s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:52:58,245 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▌ | 151/1784 [07:38<1:01:53, 2.27s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:01,950 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▌ | 151/1784 [07:38<1:01:53, 2.27s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:01,950 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▋ | 152/1784 [07:42<1:13:34, 2.70s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:01,950 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▋ | 152/1784 [07:42<1:13:34, 2.70s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:05,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▋ | 152/1784 [07:42<1:13:34, 2.70s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:05,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▋ | 153/1784 [07:45<1:21:41, 3.01s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:05,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▋ | 153/1784 [07:45<1:21:41, 3.01s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:09,341 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▋ | 153/1784 [07:45<1:21:41, 3.01s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:09,341 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▋ | 154/1784 [07:49<1:27:34, 3.22s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:13,056 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▋ | 154/1784 [07:49<1:27:34, 3.22s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:13,056 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▊ | 155/1784 [07:53<1:31:12, 3.36s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:13,056 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▊ | 155/1784 [07:53<1:31:12, 3.36s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:16,785 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▊ | 155/1784 [07:53<1:31:12, 3.36s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:16,785 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▊ | 156/1784 [07:56<1:33:34, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:16,785 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▊ | 156/1784 [07:56<1:33:34, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:20,341 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▊ | 156/1784 [07:56<1:33:34, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:20,341 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▊ | 157/1784 [08:00<1:34:35, 3.49s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:20,341 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▊ | 157/1784 [08:00<1:34:35, 3.49s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:23,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▉ | 158/1784 [08:04<1:35:09, 3.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:23,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▉ | 158/1784 [08:04<1:35:09, 3.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:23,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9038, 'learning_rate': 0.000312, 'epoch': 0.09} 9%|██████▉ | 159/1784 [08:07<1:35:20, 3.52s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:27,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▉ | 159/1784 [08:07<1:35:20, 3.52s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:31,033 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▉ | 159/1784 [08:07<1:35:20, 3.52s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:31,033 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▉ | 160/1784 [08:11<1:35:31, 3.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:31,033 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▉ | 160/1784 [08:11<1:35:31, 3.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:34,579 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|██████▉ | 160/1784 [08:11<1:35:31, 3.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:34,579 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 161/1784 [08:14<1:35:40, 3.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:34,579 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 161/1784 [08:14<1:35:40, 3.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:34,579 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 161/1784 [08:14<1:35:40, 3.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:34,579 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 162/1784 [08:18<1:35:20, 3.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:34,579 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 162/1784 [08:18<1:35:20, 3.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:34,579 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:53:43,274 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:53:34,579 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:53:43,274 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:53:34,579 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:53:43,274 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:53:34,579 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▏ | 164/1784 [08:25<1:34:19, 3.49s/it]g-point operations will not be computed-03 00:53:34,579 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▏ | 164/1784 [08:25<1:34:19, 3.49s/it]g-point operations will not be computed-03 00:53:34,579 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▏ | 164/1784 [08:25<1:34:19, 3.49s/it]g-point operations will not be computed-03 00:53:34,579 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▏ | 165/1784 [08:28<1:33:51, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:51,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▏ | 165/1784 [08:28<1:33:51, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:51,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▎ | 166/1784 [08:32<1:33:41, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:51,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▎ | 166/1784 [08:32<1:33:41, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:51,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▎ | 166/1784 [08:32<1:33:41, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:51,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▎ | 167/1784 [08:35<1:32:50, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:51,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▎ | 167/1784 [08:35<1:32:50, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:51,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▎ | 167/1784 [08:35<1:32:50, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:53:51,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▎ | 168/1784 [08:38<1:32:19, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▎ | 168/1784 [08:38<1:32:19, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▍ | 169/1784 [08:42<1:31:34, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▍ | 169/1784 [08:42<1:31:34, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▍ | 169/1784 [08:42<1:31:34, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▍ | 170/1784 [08:45<1:31:05, 3.39s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:54:10,458 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:54:10,458 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9869, 'learning_rate': 0.00033800000000000003, 'epoch': 0.1} [WARNING|modeling_utils.py:388] 2022-03-03 00:54:10,458 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▌ | 172/1784 [08:52<1:30:46, 3.38s/it]g-point operations will not be computed-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▌ | 172/1784 [08:52<1:30:46, 3.38s/it]g-point operations will not be computed-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▌ | 172/1784 [08:52<1:30:46, 3.38s/it]g-point operations will not be computed-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▌ | 173/1784 [08:55<1:30:11, 3.36s/it]g-point operations will not be computed-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▌ | 173/1784 [08:55<1:30:11, 3.36s/it]g-point operations will not be computed-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:54:20,420 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:54:20,420 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:54:20,420 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▋ | 175/1784 [09:02<1:28:52, 3.31s/it]g-point operations will not be computed-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:54:26,857 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:54:26,857 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6924, 'learning_rate': 0.000348, 'epoch': 0.1} [WARNING|modeling_utils.py:388] 2022-03-03 00:54:26,857 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▋ | 177/1784 [09:08<1:26:59, 3.25s/it]g-point operations will not be computed-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:54:33,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:54:33,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8362, 'learning_rate': 0.000352, 'epoch': 0.1} [WARNING|modeling_utils.py:388] 2022-03-03 00:54:33,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:54:36,292 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:54:36,292 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:54:36,292 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:54:02,122 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▊ | 180/1784 [09:17<1:24:09, 3.15s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:54:40,983 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▊ | 180/1784 [09:17<1:24:09, 3.15s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:54:40,983 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 181/1784 [09:20<1:24:03, 3.15s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:54:40,983 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 181/1784 [09:20<1:24:03, 3.15s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:54:40,983 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 181/1784 [09:20<1:24:03, 3.15s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:54:40,983 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 182/1784 [09:24<1:23:36, 3.13s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:54:47,218 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 182/1784 [09:24<1:23:36, 3.13s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:54:47,218 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████ | 183/1784 [09:27<1:23:08, 3.12s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:54:47,218 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████ | 183/1784 [09:27<1:23:08, 3.12s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:54:47,218 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████ | 183/1784 [09:27<1:23:08, 3.12s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:54:47,218 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████ | 184/1784 [09:30<1:21:41, 3.06s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:54:53,170 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████ | 184/1784 [09:30<1:21:41, 3.06s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:54:53,170 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████ | 185/1784 [09:33<1:20:50, 3.03s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:54:53,170 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:54:57,470 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:54:53,170 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:54:57,470 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:54:53,170 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.6795, 'learning_rate': 0.000368, 'epoch': 0.1} [WARNING|modeling_utils.py:388] 2022-03-03 00:54:57,470 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:54:53,170 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████▏ | 187/1784 [09:38<1:18:57, 2.97s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:55:01,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▏ | 188/1784 [09:41<1:18:07, 2.94s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:55:01,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▏ | 188/1784 [09:41<1:18:07, 2.94s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:55:01,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:55:05,958 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:01,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:55:05,958 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:01,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.2655, 'learning_rate': 0.000374, 'epoch': 0.11} [WARNING|modeling_utils.py:388] 2022-03-03 00:55:05,958 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:01,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▎ | 190/1784 [09:46<1:14:05, 2.79s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▎ | 190/1784 [09:46<1:14:05, 2.79s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▎ | 191/1784 [09:49<1:11:30, 2.69s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▎ | 191/1784 [09:49<1:11:30, 2.69s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:55:13,383 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:55:13,383 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:55:15,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:55:15,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:55:17,642 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:55:17,642 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:55:19,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:55:19,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:55:21,418 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:55:21,418 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:55:24,416 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:55:24,416 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9989, 'learning_rate': 0.00039200000000000004, 'epoch': 0.11} [WARNING|modeling_utils.py:388] 2022-03-03 00:55:25,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:55:25,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:55:27,311 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:55:27,311 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:55:27,311 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:55:31,154 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:55:31,154 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:55:31,154 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▊ | 202/1784 [10:13<1:11:54, 2.73s/it]g-point operations will not be computed-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▊ | 202/1784 [10:13<1:11:54, 2.73s/it]g-point operations will not be computed-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▊ | 202/1784 [10:13<1:11:54, 2.73s/it]g-point operations will not be computed-03 00:55:09,862 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 203/1784 [10:17<1:19:32, 3.02s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:55:40,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 204/1784 [10:20<1:24:16, 3.20s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:55:40,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 204/1784 [10:20<1:24:16, 3.20s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:55:40,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.4006, 'learning_rate': 0.000404, 'epoch': 0.11} 11%|████████▉ | 204/1784 [10:20<1:24:16, 3.20s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:55:40,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 205/1784 [10:24<1:26:54, 3.30s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:55:40,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 205/1784 [10:24<1:26:54, 3.30s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:55:40,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 205/1784 [10:24<1:26:54, 3.30s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:55:40,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████ | 206/1784 [10:27<1:28:43, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:55:40,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:55:52,826 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:40,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:55:52,826 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:55:40,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.0031, 'learning_rate': 0.00041, 'epoch': 0.12} 12%|█████████ | 208/1784 [10:34<1:31:04, 3.47s/it]g-point operations will not be computed-03 00:55:40,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████ | 208/1784 [10:34<1:31:04, 3.47s/it]g-point operations will not be computed-03 00:55:40,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.7179, 'learning_rate': 0.000412, 'epoch': 0.12} 12%|█████████ | 208/1784 [10:34<1:31:04, 3.47s/it]g-point operations will not be computed-03 00:55:40,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▏ | 209/1784 [10:38<1:31:14, 3.48s/it]g-point operations will not be computed-03 00:55:40,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▏ | 209/1784 [10:38<1:31:14, 3.48s/it]g-point operations will not be computed-03 00:55:40,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▏ | 209/1784 [10:38<1:31:14, 3.48s/it]g-point operations will not be computed-03 00:55:40,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▏ | 210/1784 [10:41<1:31:16, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:05,127 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▏ | 211/1784 [10:45<1:30:43, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:05,127 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▏ | 211/1784 [10:45<1:30:43, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:05,127 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9536, 'learning_rate': 0.00041799999999999997, 'epoch': 0.12} 12%|█████████▎ | 212/1784 [10:48<1:30:33, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:05,127 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▎ | 212/1784 [10:48<1:30:33, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:05,127 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.6715, 'learning_rate': 0.00042, 'epoch': 0.12} 12%|█████████▎ | 212/1784 [10:48<1:30:33, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:05,127 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▎ | 213/1784 [10:52<1:30:38, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:15,516 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▎ | 214/1784 [10:55<1:30:19, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:15,516 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▎ | 214/1784 [10:55<1:30:19, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:15,516 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7195, 'learning_rate': 0.000424, 'epoch': 0.12} 12%|█████████▍ | 215/1784 [10:58<1:29:56, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:15,516 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▍ | 215/1784 [10:58<1:29:56, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:15,516 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.8662, 'learning_rate': 0.000426, 'epoch': 0.12} 12%|█████████▍ | 215/1784 [10:58<1:29:56, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:15,516 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▍ | 216/1784 [11:02<1:29:49, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:25,708 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▍ | 217/1784 [11:05<1:29:16, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:25,708 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▍ | 217/1784 [11:05<1:29:16, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:25,708 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.5386, 'learning_rate': 0.00043, 'epoch': 0.12} 12%|█████████▍ | 217/1784 [11:05<1:29:16, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:25,708 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▌ | 218/1784 [11:09<1:29:14, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:25,708 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▌ | 218/1784 [11:09<1:29:14, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:25,708 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▌ | 218/1784 [11:09<1:29:14, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:25,708 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▌ | 219/1784 [11:12<1:28:43, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:35,832 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▌ | 220/1784 [11:15<1:27:53, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:35,832 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▌ | 220/1784 [11:15<1:27:53, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:35,832 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.9084, 'learning_rate': 0.000436, 'epoch': 0.12} 12%|█████████▋ | 221/1784 [11:19<1:27:12, 3.35s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:35,832 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▋ | 221/1784 [11:19<1:27:12, 3.35s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:56:35,832 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:56:44,072 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:56:35,832 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:56:44,072 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:56:35,832 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.9058, 'learning_rate': 0.00044, 'epoch': 0.12} 12%|█████████▊ | 223/1784 [11:25<1:26:57, 3.34s/it]g-point operations will not be computed-03 00:56:35,832 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▊ | 223/1784 [11:25<1:26:57, 3.34s/it]g-point operations will not be computed-03 00:56:35,832 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9857, 'learning_rate': 0.000442, 'epoch': 0.12} 12%|█████████▊ | 223/1784 [11:25<1:26:57, 3.34s/it]g-point operations will not be computed-03 00:56:35,832 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|█████████▊ | 224/1784 [11:29<1:26:55, 3.34s/it]g-point operations will not be computed-03 00:56:35,832 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:56:53,983 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:56:35,832 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:56:53,983 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:56:35,832 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.1024, 'learning_rate': 0.000446, 'epoch': 0.13} [WARNING|modeling_utils.py:388] 2022-03-03 00:56:53,983 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:56:35,832 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|█████████▉ | 226/1784 [11:35<1:25:44, 3.30s/it]g-point operations will not be computed-03 00:56:35,832 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|█████████▉ | 226/1784 [11:35<1:25:44, 3.30s/it]g-point operations will not be computed-03 00:56:35,832 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|█████████▉ | 226/1784 [11:35<1:25:44, 3.30s/it]g-point operations will not be computed-03 00:56:35,832 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|█████████▉ | 227/1784 [11:38<1:24:43, 3.26s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:02,106 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|█████████▉ | 228/1784 [11:42<1:23:59, 3.24s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:02,106 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|█████████▉ | 228/1784 [11:42<1:23:59, 3.24s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:02,106 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.799, 'learning_rate': 0.00045200000000000004, 'epoch': 0.13} 13%|█████████▉ | 228/1784 [11:42<1:23:59, 3.24s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:02,106 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 229/1784 [11:45<1:23:02, 3.20s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:08,370 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 230/1784 [11:48<1:22:07, 3.17s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:08,370 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 230/1784 [11:48<1:22:07, 3.17s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:08,370 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9686, 'learning_rate': 0.000456, 'epoch': 0.13} 13%|██████████ | 230/1784 [11:48<1:22:07, 3.17s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:08,370 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 231/1784 [11:51<1:21:40, 3.16s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:14,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▏ | 232/1784 [11:54<1:21:00, 3.13s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:14,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▏ | 232/1784 [11:54<1:21:00, 3.13s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:14,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.2565, 'learning_rate': 0.00046, 'epoch': 0.13} 13%|██████████▏ | 232/1784 [11:54<1:21:00, 3.13s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:14,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▏ | 233/1784 [11:57<1:19:25, 3.07s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:20,468 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▏ | 234/1784 [12:00<1:17:57, 3.02s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:20,468 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▏ | 234/1784 [12:00<1:17:57, 3.02s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:20,468 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:57:24,683 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:57:20,468 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:57:24,683 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:57:20,468 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:57:27,540 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:57:20,468 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:57:27,540 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:57:20,468 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.6232, 'learning_rate': 0.00046800000000000005, 'epoch': 0.13} 13%|██████████▎ | 237/1784 [12:08<1:13:45, 2.86s/it]g-point operations will not be computed-03 00:57:20,468 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▎ | 237/1784 [12:08<1:13:45, 2.86s/it]g-point operations will not be computed-03 00:57:20,468 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:57:32,914 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:57:20,468 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:57:32,914 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:57:20,468 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.3229, 'learning_rate': 0.000472, 'epoch': 0.13} 13%|██████████▍ | 239/1784 [12:13<1:10:28, 2.74s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:36,806 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▍ | 239/1784 [12:13<1:10:28, 2.74s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:36,806 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▍ | 240/1784 [12:16<1:08:26, 2.66s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:39,171 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▍ | 240/1784 [12:16<1:08:26, 2.66s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:39,171 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▌ | 241/1784 [12:18<1:05:12, 2.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:41,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▌ | 241/1784 [12:18<1:05:12, 2.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:41,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▌ | 242/1784 [12:20<1:01:45, 2.40s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:43,359 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▌ | 242/1784 [12:20<1:01:45, 2.40s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:43,359 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▉ | 243/1784 [12:22<58:02, 2.26s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:45,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▉ | 243/1784 [12:22<58:02, 2.26s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:45,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▉ | 244/1784 [12:24<54:19, 2.12s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:46,905 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▉ | 244/1784 [12:24<54:19, 2.12s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:46,905 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████ | 246/1784 [12:27<47:00, 1.83s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:48,501 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████ | 246/1784 [12:27<47:00, 1.83s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:48,501 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.3271, 'learning_rate': 0.000486, 'epoch': 0.14} 14%|███████████ | 247/1784 [12:28<43:34, 1.70s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:51,248 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████ | 247/1784 [12:28<43:34, 1.70s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:51,248 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▏ | 249/1784 [12:31<37:12, 1.45s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:53,597 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▏ | 249/1784 [12:31<37:12, 1.45s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:53,597 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.3764, 'learning_rate': 0.000492, 'epoch': 0.14} 14%|███████████▏ | 250/1784 [12:33<38:30, 1.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:53,597 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▏ | 250/1784 [12:33<38:30, 1.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:53,597 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▏ | 250/1784 [12:33<38:30, 1.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:56,683 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▎ | 251/1784 [12:36<56:55, 2.23s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:56,683 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▎ | 251/1784 [12:36<56:55, 2.23s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:57:56,683 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▎ | 251/1784 [12:36<56:55, 2.23s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:58:00,488 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████ | 252/1784 [12:40<1:08:40, 2.69s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:58:00,488 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████ | 252/1784 [12:40<1:08:40, 2.69s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:58:00,488 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.5569, 'learning_rate': 0.0005, 'epoch': 0.14} 14%|███████████ | 253/1784 [12:44<1:16:22, 2.99s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:58:00,488 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████ | 253/1784 [12:44<1:16:22, 2.99s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:58:00,488 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.1258, 'learning_rate': 0.0005020000000000001, 'epoch': 0.14} 14%|███████████ | 253/1784 [12:44<1:16:22, 2.99s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:58:00,488 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████ | 254/1784 [12:48<1:21:58, 3.21s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:58:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▏ | 255/1784 [12:51<1:25:30, 3.36s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:58:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▏ | 255/1784 [12:51<1:25:30, 3.36s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:58:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.2723, 'learning_rate': 0.000506, 'epoch': 0.14} 14%|███████████▏ | 256/1784 [12:55<1:27:03, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:58:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▏ | 256/1784 [12:55<1:27:03, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:58:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.2628, 'learning_rate': 0.000508, 'epoch': 0.14} 14%|███████████▏ | 257/1784 [12:59<1:28:22, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:58:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▏ | 257/1784 [12:59<1:28:22, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:58:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.2784, 'learning_rate': 0.00051, 'epoch': 0.14} 14%|███████████▏ | 257/1784 [12:59<1:28:22, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:58:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▎ | 258/1784 [13:02<1:28:57, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:58:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:58:27,708 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:58:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:58:27,708 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:58:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.4261, 'learning_rate': 0.000514, 'epoch': 0.15} 15%|███████████▎ | 260/1784 [13:09<1:29:46, 3.53s/it]g-point operations will not be computed-03 00:58:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▎ | 260/1784 [13:09<1:29:46, 3.53s/it]g-point operations will not be computed-03 00:58:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9251, 'learning_rate': 0.0005160000000000001, 'epoch': 0.15} 15%|███████████▍ | 261/1784 [13:13<1:29:29, 3.53s/it]g-point operations will not be computed-03 00:58:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▍ | 261/1784 [13:13<1:29:29, 3.53s/it]g-point operations will not be computed-03 00:58:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:58:38,217 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:58:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:58:38,217 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:58:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.8142, 'learning_rate': 0.0005200000000000001, 'epoch': 0.15} 15%|███████████▍ | 263/1784 [13:20<1:28:29, 3.49s/it]g-point operations will not be computed-03 00:58:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▍ | 263/1784 [13:20<1:28:29, 3.49s/it]g-point operations will not be computed-03 00:58:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.0686, 'learning_rate': 0.000522, 'epoch': 0.15} 15%|███████████▌ | 264/1784 [13:23<1:28:22, 3.49s/it]g-point operations will not be computed-03 00:58:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 264/1784 [13:23<1:28:22, 3.49s/it]g-point operations will not be computed-03 00:58:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.6381, 'learning_rate': 0.000524, 'epoch': 0.15} 15%|███████████▌ | 264/1784 [13:23<1:28:22, 3.49s/it]g-point operations will not be computed-03 00:58:11,608 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▌ | 265/1784 [13:26<1:27:38, 3.46s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:58:50,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▋ | 266/1784 [13:30<1:27:01, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:58:50,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▋ | 266/1784 [13:30<1:27:01, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:58:50,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8176, 'learning_rate': 0.000528, 'epoch': 0.15} 15%|███████████▋ | 267/1784 [13:33<1:26:42, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:58:50,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▋ | 267/1784 [13:33<1:26:42, 3.43s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:58:50,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:58:58,738 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:58:50,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:58:58,738 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:58:50,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.9445, 'learning_rate': 0.000532, 'epoch': 0.15} 15%|███████████▊ | 269/1784 [13:40<1:25:39, 3.39s/it]g-point operations will not be computed-03 00:58:50,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▊ | 269/1784 [13:40<1:25:39, 3.39s/it]g-point operations will not be computed-03 00:58:50,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.2645, 'learning_rate': 0.0005340000000000001, 'epoch': 0.15} 15%|███████████▊ | 270/1784 [13:43<1:25:41, 3.40s/it]g-point operations will not be computed-03 00:58:50,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▊ | 270/1784 [13:43<1:25:41, 3.40s/it]g-point operations will not be computed-03 00:58:50,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:59:08,861 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:58:50,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:59:08,861 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:58:50,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.3492, 'learning_rate': 0.0005380000000000001, 'epoch': 0.15} 15%|███████████▉ | 272/1784 [13:50<1:24:34, 3.36s/it]g-point operations will not be computed-03 00:58:50,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▉ | 272/1784 [13:50<1:24:34, 3.36s/it]g-point operations will not be computed-03 00:58:50,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.3108, 'learning_rate': 0.00054, 'epoch': 0.15} 15%|███████████▉ | 272/1784 [13:50<1:24:34, 3.36s/it]g-point operations will not be computed-03 00:58:50,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▉ | 273/1784 [13:53<1:24:05, 3.34s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:59:17,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▉ | 274/1784 [13:57<1:23:22, 3.31s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:59:17,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▉ | 274/1784 [13:57<1:23:22, 3.31s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:59:17,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.3335, 'learning_rate': 0.0005440000000000001, 'epoch': 0.15} 15%|████████████ | 275/1784 [14:00<1:22:27, 3.28s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:59:17,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████ | 275/1784 [14:00<1:22:27, 3.28s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:59:17,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:59:25,124 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:59:17,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:59:25,124 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:59:17,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.1695, 'learning_rate': 0.0005480000000000001, 'epoch': 0.15} 16%|████████████ | 277/1784 [14:06<1:21:56, 3.26s/it]g-point operations will not be computed-03 00:59:17,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████ | 277/1784 [14:06<1:21:56, 3.26s/it]g-point operations will not be computed-03 00:59:17,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.2355, 'learning_rate': 0.00055, 'epoch': 0.16} [WARNING|modeling_utils.py:388] 2022-03-03 00:59:31,561 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:59:17,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:59:31,561 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:59:17,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:59:31,561 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:59:17,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▏ | 279/1784 [14:13<1:20:28, 3.21s/it]g-point operations will not be computed-03 00:59:17,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:59:37,835 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:59:17,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:59:37,835 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:59:17,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:59:37,835 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:59:17,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▎ | 281/1784 [14:19<1:19:29, 3.17s/it]g-point operations will not be computed-03 00:59:17,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▎ | 281/1784 [14:19<1:19:29, 3.17s/it]g-point operations will not be computed-03 00:59:17,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.8305, 'learning_rate': 0.000558, 'epoch': 0.16} 16%|████████████▎ | 282/1784 [14:22<1:19:02, 3.16s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:59:45,676 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▎ | 282/1784 [14:22<1:19:02, 3.16s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:59:45,676 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▎ | 283/1784 [14:25<1:18:10, 3.12s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:59:45,676 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▎ | 283/1784 [14:25<1:18:10, 3.12s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:59:45,676 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.5372, 'learning_rate': 0.0005620000000000001, 'epoch': 0.16} 16%|████████████▍ | 284/1784 [14:28<1:17:08, 3.09s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:59:51,651 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▍ | 284/1784 [14:28<1:17:08, 3.09s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:59:51,651 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▍ | 285/1784 [14:31<1:15:56, 3.04s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:59:51,651 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▍ | 285/1784 [14:31<1:15:56, 3.04s/it][WARNING|modeling_utils.py:388] 2022-03-03 00:59:51,651 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:59:55,967 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:59:51,651 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 00:59:55,967 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:59:51,651 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.7623, 'learning_rate': 0.0005679999999999999, 'epoch': 0.16} 16%|████████████▌ | 287/1784 [14:37<1:13:43, 2.95s/it]g-point operations will not be computed-03 00:59:51,651 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▌ | 287/1784 [14:37<1:13:43, 2.95s/it]g-point operations will not be computed-03 00:59:51,651 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:00:01,523 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:59:51,651 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:00:01,523 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:59:51,651 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:00:04,191 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:59:51,651 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:00:04,191 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 00:59:51,651 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.0644, 'learning_rate': 0.000574, 'epoch': 0.16} 16%|████████████▋ | 290/1784 [14:45<1:07:51, 2.73s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:00:08,005 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▋ | 290/1784 [14:45<1:07:51, 2.73s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:00:08,005 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▋ | 291/1784 [14:47<1:05:28, 2.63s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▋ | 291/1784 [14:47<1:05:28, 2.63s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▊ | 292/1784 [14:49<1:03:20, 2.55s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▊ | 292/1784 [14:49<1:03:20, 2.55s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.9815, 'learning_rate': 0.00058, 'epoch': 0.16} [WARNING|modeling_utils.py:388] 2022-03-03 01:00:13,600 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:00:15,616 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:00:15,616 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:00:17,566 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:00:17,566 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.4079, 'learning_rate': 0.0005859999999999999, 'epoch': 0.17} [WARNING|modeling_utils.py:388] 2022-03-03 01:00:19,312 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:00:19,312 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.8535, 'learning_rate': 0.00059, 'epoch': 0.17} [WARNING|modeling_utils.py:388] 2022-03-03 01:00:22,365 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:00:22,365 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:00:23,665 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:00:23,665 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:00:25,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:00:25,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:00:25,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:00:29,216 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:00:32,971 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:00:32,971 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.825, 'learning_rate': 0.0006, 'epoch': 0.17} [WARNING|modeling_utils.py:388] 2022-03-03 01:00:32,971 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▏ | 303/1784 [15:15<1:14:55, 3.04s/it]g-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▏ | 303/1784 [15:15<1:14:55, 3.04s/it]g-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▏ | 303/1784 [15:15<1:14:55, 3.04s/it]g-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▎ | 304/1784 [15:18<1:20:04, 3.25s/it]g-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▎ | 304/1784 [15:18<1:20:04, 3.25s/it]g-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▎ | 304/1784 [15:18<1:20:04, 3.25s/it]g-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▎ | 305/1784 [15:22<1:23:41, 3.40s/it]g-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▎ | 305/1784 [15:22<1:23:41, 3.40s/it]g-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▎ | 305/1784 [15:22<1:23:41, 3.40s/it]g-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▍ | 306/1784 [15:26<1:25:40, 3.48s/it]g-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:00:51,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:00:51,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.5677, 'learning_rate': 0.00061, 'epoch': 0.17} [WARNING|modeling_utils.py:388] 2022-03-03 01:00:51,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▍ | 308/1784 [15:33<1:27:19, 3.55s/it]g-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▍ | 308/1784 [15:33<1:27:19, 3.55s/it]g-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▍ | 308/1784 [15:33<1:27:19, 3.55s/it]g-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▌ | 309/1784 [15:37<1:27:37, 3.56s/it]g-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▌ | 309/1784 [15:37<1:27:37, 3.56s/it]g-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▌ | 309/1784 [15:37<1:27:37, 3.56s/it]g-point operations will not be computed-03 01:00:10,367 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▌ | 310/1784 [15:40<1:27:18, 3.55s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:03,999 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▌ | 311/1784 [15:44<1:26:49, 3.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:03,999 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▌ | 311/1784 [15:44<1:26:49, 3.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:03,999 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.3806, 'learning_rate': 0.0006180000000000001, 'epoch': 0.17} 17%|█████████████▌ | 311/1784 [15:44<1:26:49, 3.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:03,999 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▋ | 312/1784 [15:47<1:26:41, 3.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:03,999 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▋ | 312/1784 [15:47<1:26:41, 3.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:03,999 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▋ | 312/1784 [15:47<1:26:41, 3.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:03,999 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▋ | 313/1784 [15:51<1:26:07, 3.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:03,999 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▋ | 313/1784 [15:51<1:26:07, 3.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:03,999 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▋ | 313/1784 [15:51<1:26:07, 3.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:03,999 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▋ | 314/1784 [15:54<1:25:38, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:17,975 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▋ | 314/1784 [15:54<1:25:38, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:17,975 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 315/1784 [15:58<1:25:45, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:17,975 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 315/1784 [15:58<1:25:45, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:17,975 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 315/1784 [15:58<1:25:45, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:17,975 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 316/1784 [16:01<1:25:02, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:17,975 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 316/1784 [16:01<1:25:02, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:17,975 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 316/1784 [16:01<1:25:02, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:17,975 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 317/1784 [16:04<1:25:03, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:28,340 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 317/1784 [16:04<1:25:03, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:28,340 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 318/1784 [16:08<1:24:47, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:28,340 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 318/1784 [16:08<1:24:47, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:28,340 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 318/1784 [16:08<1:24:47, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:28,340 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 319/1784 [16:11<1:24:05, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:28,340 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 319/1784 [16:11<1:24:05, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:28,340 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 319/1784 [16:11<1:24:05, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:28,340 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 320/1784 [16:15<1:23:51, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:28,340 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 320/1784 [16:15<1:23:51, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:28,340 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:01:40,140 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:01:28,340 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:01:40,140 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:01:28,340 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:01:40,140 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:01:28,340 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████ | 322/1784 [16:21<1:22:46, 3.40s/it]g-point operations will not be computed-03 01:01:28,340 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████ | 322/1784 [16:21<1:22:46, 3.40s/it]g-point operations will not be computed-03 01:01:28,340 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████ | 322/1784 [16:21<1:22:46, 3.40s/it]g-point operations will not be computed-03 01:01:28,340 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████ | 323/1784 [16:25<1:22:06, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:48,486 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████ | 323/1784 [16:25<1:22:06, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:48,486 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▏ | 324/1784 [16:28<1:20:57, 3.33s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:48,486 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▏ | 324/1784 [16:28<1:20:57, 3.33s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:48,486 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▏ | 325/1784 [16:31<1:20:06, 3.29s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:48,486 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▏ | 325/1784 [16:31<1:20:06, 3.29s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:01:48,486 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:01:56,531 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:01:48,486 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:01:56,531 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:01:48,486 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.7701, 'learning_rate': 0.000648, 'epoch': 0.18} 18%|██████████████▎ | 327/1784 [16:38<1:19:36, 3.28s/it]g-point operations will not be computed-03 01:01:48,486 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▎ | 327/1784 [16:38<1:19:36, 3.28s/it]g-point operations will not be computed-03 01:01:48,486 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:02:02,998 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:01:48,486 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:02:02,998 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:01:48,486 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.1997, 'learning_rate': 0.000652, 'epoch': 0.18} 18%|██████████████▍ | 329/1784 [16:44<1:18:29, 3.24s/it]g-point operations will not be computed-03 01:01:48,486 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▍ | 329/1784 [16:44<1:18:29, 3.24s/it]g-point operations will not be computed-03 01:01:48,486 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.2808, 'learning_rate': 0.0006540000000000001, 'epoch': 0.18} 18%|██████████████▍ | 329/1784 [16:44<1:18:29, 3.24s/it]g-point operations will not be computed-03 01:01:48,486 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▍ | 330/1784 [16:47<1:18:20, 3.23s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:02:11,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▍ | 331/1784 [16:50<1:17:12, 3.19s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:02:11,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▍ | 331/1784 [16:50<1:17:12, 3.19s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:02:11,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:02:15,574 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:11,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:02:15,574 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:11,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:02:15,574 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:11,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▌ | 333/1784 [16:57<1:15:35, 3.13s/it]g-point operations will not be computed-03 01:02:11,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▌ | 333/1784 [16:57<1:15:35, 3.13s/it]g-point operations will not be computed-03 01:02:11,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6766, 'learning_rate': 0.000662, 'epoch': 0.19} 19%|██████████████▌ | 333/1784 [16:57<1:15:35, 3.13s/it]g-point operations will not be computed-03 01:02:11,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▌ | 334/1784 [17:00<1:14:37, 3.09s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:02:23,170 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▋ | 335/1784 [17:02<1:13:21, 3.04s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:02:23,170 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▋ | 335/1784 [17:02<1:13:21, 3.04s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:02:23,170 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:02:27,458 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:23,170 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:02:27,458 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:23,170 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.0338, 'learning_rate': 0.0006680000000000001, 'epoch': 0.19} 19%|██████████████▋ | 337/1784 [17:08<1:11:11, 2.95s/it]g-point operations will not be computed-03 01:02:23,170 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▋ | 337/1784 [17:08<1:11:11, 2.95s/it]g-point operations will not be computed-03 01:02:23,170 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:02:33,116 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:23,170 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:02:33,116 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:23,170 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.4539, 'learning_rate': 0.0006720000000000001, 'epoch': 0.19} [WARNING|modeling_utils.py:388] 2022-03-03 01:02:33,116 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:23,170 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▊ | 339/1784 [17:14<1:08:56, 2.86s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:02:37,294 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▊ | 340/1784 [17:16<1:07:38, 2.81s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▊ | 340/1784 [17:16<1:07:38, 2.81s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▉ | 341/1784 [17:19<1:06:07, 2.75s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▉ | 341/1784 [17:19<1:06:07, 2.75s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:02:43,607 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:02:43,607 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:02:45,894 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:02:45,894 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:02:48,072 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:02:48,072 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:02:50,055 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:02:50,055 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:02:51,903 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:02:51,903 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.198, 'learning_rate': 0.0006879999999999999, 'epoch': 0.19} [WARNING|modeling_utils.py:388] 2022-03-03 01:02:53,571 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:02:56,341 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:02:56,341 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.5543, 'learning_rate': 0.000692, 'epoch': 0.2} {'loss': 5.9238, 'learning_rate': 0.000694, 'epoch': 0.2} [WARNING|modeling_utils.py:388] 2022-03-03 01:02:58,112 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:02:58,112 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:02:58,112 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:03:02,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:03:02,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:03:02,110 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 352/1784 [17:44<1:06:50, 2.80s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 352/1784 [17:44<1:06:50, 2.80s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 352/1784 [17:44<1:06:50, 2.80s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 353/1784 [17:48<1:13:24, 3.08s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 353/1784 [17:48<1:13:24, 3.08s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 353/1784 [17:48<1:13:24, 3.08s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 354/1784 [17:51<1:17:55, 3.27s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 354/1784 [17:51<1:17:55, 3.27s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 354/1784 [17:51<1:17:55, 3.27s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▌ | 355/1784 [17:55<1:21:01, 3.40s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▌ | 355/1784 [17:55<1:21:01, 3.40s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:03:20,667 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:03:20,667 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:03:20,667 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▌ | 357/1784 [18:02<1:23:36, 3.52s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▌ | 357/1784 [18:02<1:23:36, 3.52s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▌ | 357/1784 [18:02<1:23:36, 3.52s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▋ | 358/1784 [18:06<1:24:16, 3.55s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▋ | 358/1784 [18:06<1:24:16, 3.55s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▋ | 358/1784 [18:06<1:24:16, 3.55s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▋ | 359/1784 [18:09<1:24:43, 3.57s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▋ | 359/1784 [18:09<1:24:43, 3.57s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:03:35,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:03:35,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:03:35,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▊ | 361/1784 [18:17<1:25:03, 3.59s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▊ | 361/1784 [18:17<1:25:03, 3.59s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▊ | 361/1784 [18:17<1:25:03, 3.59s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▊ | 362/1784 [18:20<1:24:29, 3.56s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▊ | 362/1784 [18:20<1:24:29, 3.56s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▊ | 362/1784 [18:20<1:24:29, 3.56s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▊ | 363/1784 [18:24<1:23:47, 3.54s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:03:49,180 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:03:49,180 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.9611, 'learning_rate': 0.000724, 'epoch': 0.2} [WARNING|modeling_utils.py:388] 2022-03-03 01:03:49,180 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▉ | 365/1784 [18:31<1:23:02, 3.51s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▉ | 365/1784 [18:31<1:23:02, 3.51s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▉ | 365/1784 [18:31<1:23:02, 3.51s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 366/1784 [18:34<1:22:24, 3.49s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 366/1784 [18:34<1:22:24, 3.49s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:03:59,521 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:03:59,521 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:03:59,521 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 368/1784 [18:41<1:21:27, 3.45s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 368/1784 [18:41<1:21:27, 3.45s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████ | 368/1784 [18:41<1:21:27, 3.45s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 369/1784 [18:44<1:21:14, 3.44s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:04:09,781 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:04:09,781 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.649, 'learning_rate': 0.000736, 'epoch': 0.21} [WARNING|modeling_utils.py:388] 2022-03-03 01:04:09,781 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 371/1784 [18:51<1:20:10, 3.40s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:04:16,448 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:04:16,448 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.9999, 'learning_rate': 0.00074, 'epoch': 0.21} [WARNING|modeling_utils.py:388] 2022-03-03 01:04:16,448 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▎ | 373/1784 [18:58<1:18:52, 3.35s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▎ | 373/1784 [18:58<1:18:52, 3.35s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▎ | 373/1784 [18:58<1:18:52, 3.35s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▎ | 374/1784 [19:01<1:17:59, 3.32s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:04:26,217 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:04:26,217 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.1289, 'learning_rate': 0.000746, 'epoch': 0.21} [WARNING|modeling_utils.py:388] 2022-03-03 01:04:26,217 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▍ | 376/1784 [19:07<1:16:51, 3.28s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:04:32,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:04:32,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.5028, 'learning_rate': 0.00075, 'epoch': 0.21} [WARNING|modeling_utils.py:388] 2022-03-03 01:04:32,643 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▌ | 378/1784 [19:14<1:15:33, 3.22s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▌ | 378/1784 [19:14<1:15:33, 3.22s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▌ | 378/1784 [19:14<1:15:33, 3.22s/it]g-point operations will not be computed-03 01:02:39,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▌ | 379/1784 [19:17<1:15:03, 3.21s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:04:40,597 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▌ | 380/1784 [19:20<1:14:26, 3.18s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:04:40,597 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▌ | 380/1784 [19:20<1:14:26, 3.18s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:04:40,597 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.76, 'learning_rate': 0.000756, 'epoch': 0.21} 21%|████████████████▌ | 380/1784 [19:20<1:14:26, 3.18s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:04:40,597 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▋ | 381/1784 [19:23<1:14:08, 3.17s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:04:46,838 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▋ | 382/1784 [19:26<1:13:39, 3.15s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:04:46,838 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▋ | 382/1784 [19:26<1:13:39, 3.15s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:04:46,838 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.3031, 'learning_rate': 0.00076, 'epoch': 0.21} 21%|████████████████▋ | 382/1784 [19:26<1:13:39, 3.15s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:04:46,838 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▋ | 383/1784 [19:29<1:12:37, 3.11s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:04:52,853 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▋ | 383/1784 [19:29<1:12:37, 3.11s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:04:52,853 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|████████████████▊ | 384/1784 [19:32<1:11:13, 3.05s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:04:52,853 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:04:57,183 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:04:52,853 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:04:57,183 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:04:52,853 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.8042, 'learning_rate': 0.0007660000000000001, 'epoch': 0.22} [WARNING|modeling_utils.py:388] 2022-03-03 01:04:57,183 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:04:52,853 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|████████████████▉ | 386/1784 [19:38<1:09:43, 2.99s/it]g-point operations will not be computed-03 01:04:52,853 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:02,915 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:04:52,853 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:02,915 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:04:52,853 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:05,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:04:52,853 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:05,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:04:52,853 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.0925, 'learning_rate': 0.000772, 'epoch': 0.22} [WARNING|modeling_utils.py:388] 2022-03-03 01:05:05,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:04:52,853 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████ | 389/1784 [19:46<1:05:08, 2.80s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████ | 389/1784 [19:46<1:05:08, 2.80s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████ | 390/1784 [19:49<1:03:30, 2.73s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:13,398 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:13,398 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:15,688 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:15,688 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:17,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:17,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:19,856 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:19,856 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:21,665 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:21,665 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:23,311 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:23,311 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.1241, 'learning_rate': 0.0007880000000000001, 'epoch': 0.22} [WARNING|modeling_utils.py:388] 2022-03-03 01:05:26,030 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:26,030 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:27,249 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:27,249 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 10.8678, 'learning_rate': 0.0007940000000000001, 'epoch': 0.22} [WARNING|modeling_utils.py:388] 2022-03-03 01:05:28,930 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:28,930 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:28,930 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:32,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:32,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:05:32,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 402/1784 [20:15<1:02:49, 2.73s/it]g-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 402/1784 [20:15<1:02:49, 2.73s/it]g-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 402/1784 [20:15<1:02:49, 2.73s/it]g-point operations will not be computed-03 01:05:09,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▌ | 403/1784 [20:18<1:09:56, 3.04s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:05:42,252 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▋ | 404/1784 [20:22<1:14:11, 3.23s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:05:42,252 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▋ | 404/1784 [20:22<1:14:11, 3.23s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:05:42,252 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.8218, 'learning_rate': 0.000804, 'epoch': 0.23} 23%|█████████████████▋ | 405/1784 [20:26<1:17:35, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:05:42,252 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▋ | 405/1784 [20:26<1:17:35, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:05:42,252 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.3772, 'learning_rate': 0.0008060000000000001, 'epoch': 0.23} 23%|█████████████████▋ | 405/1784 [20:26<1:17:35, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:05:42,252 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▊ | 406/1784 [20:29<1:19:35, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:05:42,252 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▊ | 406/1784 [20:29<1:19:35, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:05:42,252 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▊ | 406/1784 [20:29<1:19:35, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:05:42,252 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▊ | 407/1784 [20:33<1:20:36, 3.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:05:42,252 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▊ | 407/1784 [20:33<1:20:36, 3.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:05:42,252 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▊ | 407/1784 [20:33<1:20:36, 3.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:05:42,252 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▊ | 408/1784 [20:37<1:20:50, 3.52s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:00,430 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▉ | 409/1784 [20:40<1:20:55, 3.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:00,430 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▉ | 409/1784 [20:40<1:20:55, 3.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:00,430 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.9483, 'learning_rate': 0.0008139999999999999, 'epoch': 0.23} 23%|█████████████████▉ | 409/1784 [20:40<1:20:55, 3.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:00,430 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▉ | 410/1784 [20:44<1:21:04, 3.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:00,430 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▉ | 410/1784 [20:44<1:21:04, 3.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:00,430 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▉ | 410/1784 [20:44<1:21:04, 3.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:00,430 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▉ | 411/1784 [20:47<1:20:44, 3.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:00,430 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:06:12,758 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:06:00,430 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:06:12,758 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:06:00,430 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.5031, 'learning_rate': 0.00082, 'epoch': 0.23} 23%|██████████████████ | 413/1784 [20:54<1:20:44, 3.53s/it]g-point operations will not be computed-03 01:06:00,430 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████ | 413/1784 [20:54<1:20:44, 3.53s/it]g-point operations will not be computed-03 01:06:00,430 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.3287, 'learning_rate': 0.0008219999999999999, 'epoch': 0.23} 23%|██████████████████ | 413/1784 [20:54<1:20:44, 3.53s/it]g-point operations will not be computed-03 01:06:00,430 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████ | 414/1784 [20:58<1:20:25, 3.52s/it]g-point operations will not be computed-03 01:06:00,430 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████ | 414/1784 [20:58<1:20:25, 3.52s/it]g-point operations will not be computed-03 01:06:00,430 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████ | 414/1784 [20:58<1:20:25, 3.52s/it]g-point operations will not be computed-03 01:06:00,430 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▏ | 415/1784 [21:01<1:20:23, 3.52s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:25,097 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▏ | 416/1784 [21:05<1:19:49, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:25,097 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▏ | 416/1784 [21:05<1:19:49, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:25,097 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.4594, 'learning_rate': 0.000828, 'epoch': 0.23} 23%|██████████████████▏ | 416/1784 [21:05<1:19:49, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:25,097 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▏ | 417/1784 [21:08<1:19:10, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:25,097 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▏ | 417/1784 [21:08<1:19:10, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:25,097 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▏ | 417/1784 [21:08<1:19:10, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:25,097 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▎ | 418/1784 [21:12<1:19:07, 3.48s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:35,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▎ | 419/1784 [21:15<1:18:29, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:35,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▎ | 419/1784 [21:15<1:18:29, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:35,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.57, 'learning_rate': 0.000834, 'epoch': 0.23} 23%|██████████████████▎ | 419/1784 [21:15<1:18:29, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:35,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▎ | 420/1784 [21:18<1:18:54, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:35,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▎ | 420/1784 [21:18<1:18:54, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:35,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▎ | 420/1784 [21:18<1:18:54, 3.47s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:35,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▍ | 421/1784 [21:22<1:18:23, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:35,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:06:47,302 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:06:35,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:06:47,302 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:06:35,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.8436, 'learning_rate': 0.00084, 'epoch': 0.24} [WARNING|modeling_utils.py:388] 2022-03-03 01:06:47,302 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:06:35,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▍ | 423/1784 [21:29<1:16:58, 3.39s/it]g-point operations will not be computed-03 01:06:35,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▍ | 423/1784 [21:29<1:16:58, 3.39s/it]g-point operations will not be computed-03 01:06:35,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▍ | 423/1784 [21:29<1:16:58, 3.39s/it]g-point operations will not be computed-03 01:06:35,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▌ | 424/1784 [21:32<1:16:29, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:55,685 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▌ | 425/1784 [21:35<1:16:01, 3.36s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:55,685 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▌ | 425/1784 [21:35<1:16:01, 3.36s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:55,685 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.5169, 'learning_rate': 0.000846, 'epoch': 0.24} 24%|██████████████████▌ | 425/1784 [21:35<1:16:01, 3.36s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:55,685 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▋ | 426/1784 [21:38<1:14:58, 3.31s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:06:55,685 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:07:03,699 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:06:55,685 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:07:03,699 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:06:55,685 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.98, 'learning_rate': 0.00085, 'epoch': 0.24} [WARNING|modeling_utils.py:388] 2022-03-03 01:07:03,699 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:06:55,685 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▋ | 428/1784 [21:45<1:13:19, 3.24s/it]g-point operations will not be computed-03 01:06:55,685 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:07:10,019 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:06:55,685 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:07:10,019 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:06:55,685 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.7028, 'learning_rate': 0.000854, 'epoch': 0.24} [WARNING|modeling_utils.py:388] 2022-03-03 01:07:10,019 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:06:55,685 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▊ | 430/1784 [21:51<1:12:13, 3.20s/it]g-point operations will not be computed-03 01:06:55,685 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:07:16,310 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:06:55,685 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:07:16,310 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:06:55,685 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.0355, 'learning_rate': 0.000858, 'epoch': 0.24} [WARNING|modeling_utils.py:388] 2022-03-03 01:07:16,310 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:06:55,685 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▉ | 432/1784 [21:57<1:11:09, 3.16s/it]g-point operations will not be computed-03 01:06:55,685 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:07:22,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:06:55,685 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:07:22,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:06:55,685 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.6191, 'learning_rate': 0.000862, 'epoch': 0.24} [WARNING|modeling_utils.py:388] 2022-03-03 01:07:22,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:06:55,685 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|██████████████████▉ | 434/1784 [22:03<1:09:13, 3.08s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:26,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████ | 435/1784 [22:06<1:07:59, 3.02s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:26,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████ | 435/1784 [22:06<1:07:59, 3.02s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:26,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:07:31,140 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:07:26,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:07:31,140 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:07:26,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.7428, 'learning_rate': 0.0008680000000000001, 'epoch': 0.24} [WARNING|modeling_utils.py:388] 2022-03-03 01:07:31,140 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:07:26,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████ | 437/1784 [22:12<1:05:15, 2.91s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:35,384 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████▏ | 438/1784 [22:15<1:04:27, 2.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:35,384 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████▏ | 438/1784 [22:15<1:04:27, 2.87s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:35,384 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:07:39,383 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:07:35,384 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:07:39,383 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:07:35,384 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.2282, 'learning_rate': 0.000874, 'epoch': 0.25} [WARNING|modeling_utils.py:388] 2022-03-03 01:07:39,383 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:07:35,384 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████▏ | 440/1784 [22:20<1:01:31, 2.75s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:43,278 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████▊ | 441/1784 [22:22<59:35, 2.66s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:45,616 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████▊ | 441/1784 [22:22<59:35, 2.66s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:45,616 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████▊ | 442/1784 [22:25<56:43, 2.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:47,789 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████▊ | 442/1784 [22:25<56:43, 2.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:47,789 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████▊ | 443/1784 [22:27<53:22, 2.39s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:49,761 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████▊ | 443/1784 [22:27<53:22, 2.39s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:49,761 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████▉ | 444/1784 [22:29<50:16, 2.25s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:51,623 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████▉ | 444/1784 [22:29<50:16, 2.25s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:51,623 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████▉ | 445/1784 [22:30<47:23, 2.12s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:53,394 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████▉ | 445/1784 [22:30<47:23, 2.12s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:53,394 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.5112, 'learning_rate': 0.0008860000000000001, 'epoch': 0.25} 25%|████████████████████ | 447/1784 [22:34<41:04, 1.84s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:54,970 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████ | 447/1784 [22:34<41:04, 1.84s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:54,970 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████ | 448/1784 [22:35<37:41, 1.69s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:57,690 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████ | 448/1784 [22:35<37:41, 1.69s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:57,690 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.0517, 'learning_rate': 0.000892, 'epoch': 0.25} 25%|████████████████████▏ | 449/1784 [22:36<34:45, 1.56s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:58,919 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████▏ | 449/1784 [22:36<34:45, 1.56s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:58,919 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████▏ | 450/1784 [22:38<36:02, 1.62s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:07:58,919 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████▏ | 451/1784 [22:42<51:28, 2.32s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:08:02,077 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████▏ | 451/1784 [22:42<51:28, 2.32s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:08:02,077 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████▏ | 451/1784 [22:42<51:28, 2.32s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████▊ | 452/1784 [22:46<1:01:32, 2.77s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████▊ | 452/1784 [22:46<1:01:32, 2.77s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.8886, 'learning_rate': 0.0009000000000000001, 'epoch': 0.25} 25%|███████████████████▊ | 453/1784 [22:49<1:07:44, 3.05s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████▊ | 453/1784 [22:49<1:07:44, 3.05s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.934, 'learning_rate': 0.000902, 'epoch': 0.25} 25%|███████████████████▊ | 454/1784 [22:53<1:12:34, 3.27s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|███████████████████▊ | 454/1784 [22:53<1:12:34, 3.27s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.5421, 'learning_rate': 0.0009040000000000001, 'epoch': 0.25} 26%|███████████████████▉ | 455/1784 [22:57<1:15:24, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|███████████████████▉ | 455/1784 [22:57<1:15:24, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.5831, 'learning_rate': 0.000906, 'epoch': 0.26} 26%|███████████████████▉ | 455/1784 [22:57<1:15:24, 3.40s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|███████████████████▉ | 456/1784 [23:01<1:17:28, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:08:26,393 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:08:26,393 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.0166, 'learning_rate': 0.00091, 'epoch': 0.26} 26%|████████████████████ | 458/1784 [23:08<1:19:26, 3.59s/it]g-point operations will not be computed-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████ | 458/1784 [23:08<1:19:26, 3.59s/it]g-point operations will not be computed-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.5779, 'learning_rate': 0.000912, 'epoch': 0.26} 26%|████████████████████ | 459/1784 [23:12<1:19:33, 3.60s/it]g-point operations will not be computed-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████ | 459/1784 [23:12<1:19:33, 3.60s/it]g-point operations will not be computed-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.4935, 'learning_rate': 0.0009140000000000001, 'epoch': 0.26} 26%|████████████████████ | 460/1784 [23:15<1:19:27, 3.60s/it]g-point operations will not be computed-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████ | 460/1784 [23:15<1:19:27, 3.60s/it]g-point operations will not be computed-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:08:40,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:08:40,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 7.7115, 'learning_rate': 0.0009180000000000001, 'epoch': 0.26} 26%|████████████████████▏ | 462/1784 [23:22<1:19:14, 3.60s/it]g-point operations will not be computed-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▏ | 462/1784 [23:22<1:19:14, 3.60s/it]g-point operations will not be computed-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.3225, 'learning_rate': 0.00092, 'epoch': 0.26} 26%|████████████████████▏ | 463/1784 [23:26<1:18:35, 3.57s/it]g-point operations will not be computed-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▏ | 463/1784 [23:26<1:18:35, 3.57s/it]g-point operations will not be computed-03 01:08:05,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.9773, 'learning_rate': 0.0009220000000000001, 'epoch': 0.26} 26%|████████████████████▎ | 464/1784 [23:29<1:17:58, 3.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:08:53,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▎ | 464/1784 [23:29<1:17:58, 3.54s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:08:53,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▎ | 465/1784 [23:33<1:17:34, 3.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:08:53,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▎ | 465/1784 [23:33<1:17:34, 3.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:08:53,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.67, 'learning_rate': 0.0009260000000000001, 'epoch': 0.26} 26%|████████████████████▎ | 466/1784 [23:36<1:17:01, 3.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:08:53,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▎ | 466/1784 [23:36<1:17:01, 3.51s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:08:53,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6142, 'learning_rate': 0.0009280000000000001, 'epoch': 0.26} 26%|████████████████████▍ | 467/1784 [23:40<1:16:45, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:08:53,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▍ | 467/1784 [23:40<1:16:45, 3.50s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:08:53,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:09:05,324 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:08:53,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:09:05,324 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:08:53,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.3521, 'learning_rate': 0.0009320000000000001, 'epoch': 0.26} 26%|████████████████████▌ | 469/1784 [23:47<1:16:11, 3.48s/it]g-point operations will not be computed-03 01:08:53,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▌ | 469/1784 [23:47<1:16:11, 3.48s/it]g-point operations will not be computed-03 01:08:53,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.3542, 'learning_rate': 0.000934, 'epoch': 0.26} 26%|████████████████████▌ | 470/1784 [23:50<1:15:33, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:09:13,944 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▌ | 470/1784 [23:50<1:15:33, 3.45s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:09:13,944 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▌ | 471/1784 [23:54<1:15:13, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:09:13,944 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▌ | 471/1784 [23:54<1:15:13, 3.44s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:09:13,944 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.0444, 'learning_rate': 0.0009379999999999999, 'epoch': 0.26} 26%|████████████████████▋ | 472/1784 [23:57<1:14:44, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:09:13,944 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▋ | 472/1784 [23:57<1:14:44, 3.42s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:09:13,944 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.4191, 'learning_rate': 0.00094, 'epoch': 0.26} 27%|████████████████████▋ | 473/1784 [24:00<1:14:08, 3.39s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:09:24,024 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▋ | 473/1784 [24:00<1:14:08, 3.39s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:09:24,024 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▋ | 474/1784 [24:04<1:13:45, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:09:24,024 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▋ | 474/1784 [24:04<1:13:45, 3.38s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:09:24,024 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.6918, 'learning_rate': 0.000944, 'epoch': 0.27} 27%|████████████████████▊ | 475/1784 [24:07<1:13:29, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:09:24,024 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▊ | 475/1784 [24:07<1:13:29, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:09:24,024 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.6161, 'learning_rate': 0.000946, 'epoch': 0.27} 27%|████████████████████▊ | 475/1784 [24:07<1:13:29, 3.37s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:09:24,024 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▊ | 476/1784 [24:10<1:12:40, 3.33s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:09:33,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▊ | 477/1784 [24:13<1:12:11, 3.31s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:09:33,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▊ | 477/1784 [24:13<1:12:11, 3.31s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:09:33,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.776, 'learning_rate': 0.00095, 'epoch': 0.27} 27%|████████████████████▉ | 478/1784 [24:17<1:11:26, 3.28s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:09:40,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▉ | 478/1784 [24:17<1:11:26, 3.28s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:09:40,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▉ | 479/1784 [24:20<1:10:54, 3.26s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:09:40,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▉ | 479/1784 [24:20<1:10:54, 3.26s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:09:40,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.8037, 'learning_rate': 0.000954, 'epoch': 0.27} 27%|████████████████████▉ | 480/1784 [24:23<1:10:45, 3.26s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:09:40,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|████████████████████▉ | 480/1784 [24:23<1:10:45, 3.26s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:09:40,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:09:48,332 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:09:40,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:09:48,332 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:09:40,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.6293, 'learning_rate': 0.000958, 'epoch': 0.27} 27%|█████████████████████ | 482/1784 [24:29<1:09:42, 3.21s/it]g-point operations will not be computed-03 01:09:40,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████ | 482/1784 [24:29<1:09:42, 3.21s/it]g-point operations will not be computed-03 01:09:40,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:09:54,597 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:09:40,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:09:54,597 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:09:40,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 6.0316, 'learning_rate': 0.000962, 'epoch': 0.27} [WARNING|modeling_utils.py:388] 2022-03-03 01:09:54,597 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:09:40,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▏ | 484/1784 [24:36<1:08:13, 3.15s/it]g-point operations will not be computed-03 01:09:40,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▏ | 484/1784 [24:36<1:08:13, 3.15s/it]g-point operations will not be computed-03 01:09:40,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:10:00,699 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:09:40,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:10:00,699 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:09:40,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:10:00,699 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:09:40,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▏ | 486/1784 [24:42<1:06:10, 3.06s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:10:05,144 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▎ | 487/1784 [24:44<1:04:58, 3.01s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:10:05,144 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▎ | 487/1784 [24:44<1:04:58, 3.01s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:10:05,144 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.478, 'learning_rate': 0.0009699999999999999, 'epoch': 0.27} 27%|█████████████████████▎ | 487/1784 [24:44<1:04:58, 3.01s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:10:05,144 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▎ | 488/1784 [24:47<1:03:39, 2.95s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:10:10,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▎ | 488/1784 [24:47<1:03:39, 2.95s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:10:10,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▍ | 489/1784 [24:50<1:02:18, 2.89s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:10:10,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:10:14,734 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:10:10,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-03 01:10:14,734 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-03 01:10:10,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.5472, 'learning_rate': 0.000976, 'epoch': 0.27} {'loss': 5.2288, 'learning_rate': 0.000978, 'epoch': 0.28} 28%|██████████████████████ | 491/1784 [24:55<59:14, 2.75s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:10:18,606 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████ | 491/1784 [24:55<59:14, 2.75s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:10:18,606 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████ | 492/1784 [24:58<57:00, 2.65s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:10:20,966 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████ | 492/1784 [24:58<57:00, 2.65s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:10:20,966 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████ | 493/1784 [25:00<54:32, 2.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:10:23,137 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████ | 493/1784 [25:00<54:32, 2.53s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:10:23,137 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▏ | 494/1784 [25:02<51:28, 2.39s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:10:25,127 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▏ | 494/1784 [25:02<51:28, 2.39s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:10:25,127 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▏ | 495/1784 [25:04<48:37, 2.26s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:10:26,995 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▏ | 495/1784 [25:04<48:37, 2.26s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:10:26,995 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.5781, 'learning_rate': 0.000988, 'epoch': 0.28} 28%|██████████████████████▎ | 497/1784 [25:07<42:10, 1.97s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:10:28,693 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▎ | 497/1784 [25:07<42:10, 1.97s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:10:28,693 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▎ | 498/1784 [25:09<38:42, 1.81s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:10:31,566 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▎ | 498/1784 [25:09<38:42, 1.81s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:10:31,566 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▍ | 499/1784 [25:10<35:21, 1.65s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:10:32,807 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▍ | 499/1784 [25:10<35:21, 1.65s/it][WARNING|modeling_utils.py:388] 2022-03-03 01:10:32,807 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2366] 2022-03-03 01:10:34,079 >> Num examples = 2642 | 500/1784 [25:12<35:59, 1.68s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2366] 2022-03-03 01:10:34,079 >> Num examples = 2642 | 500/1784 [25:12<35:59, 1.68s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2366] 2022-03-03 01:10:34,079 >> Num examples = 2642 | 500/1784 [25:12<35:59, 1.68s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2366] 2022-03-03 01:10:34,079 >> Num examples = 2642 | 500/1784 [25:12<35:59, 1.68s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 1%|█ | 4/331 [00:06<10:17, 1.89s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 2%|█▎ | 5/331 [00:09<11:51, 2.18s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 2%|█▌ | 6/331 [00:12<12:52, 2.38s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 2%|█▊ | 7/331 [00:15<13:05, 2.42s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 2%|██ | 8/331 [00:17<13:30, 2.51s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 3%|██▎ | 9/331 [00:20<14:07, 2.63s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 3%|██▍ | 10/331 [00:23<15:01, 2.81s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 3%|██▋ | 11/331 [00:26<14:30, 2.72s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 4%|██▉ | 12/331 [00:29<14:23, 2.71s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 4%|███▏ | 13/331 [00:31<14:11, 2.68s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 4%|███▍ | 14/331 [00:34<13:58, 2.65s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 5%|███▋ | 15/331 [00:37<15:10, 2.88s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 5%|███▉ | 16/331 [00:41<16:03, 3.06s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 5%|████▏ | 17/331 [00:44<16:13, 3.10s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 5%|████▍ | 18/331 [00:46<14:47, 2.83s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 6%|████▋ | 19/331 [00:49<14:36, 2.81s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 6%|████▉ | 20/331 [00:51<13:36, 2.63s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 6%|█████▏ | 21/331 [00:54<14:07, 2.73s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 7%|█████▍ | 22/331 [00:57<15:14, 2.96s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 7%|█████▋ | 23/331 [01:01<16:45, 3.26s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 7%|█████▉ | 24/331 [01:05<17:42, 3.46s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 8%|██████▏ | 25/331 [01:08<17:03, 3.35s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 8%|██████▍ | 26/331 [01:11<15:49, 3.11s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 8%|██████▋ | 27/331 [01:14<15:52, 3.13s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 8%|██████▉ | 28/331 [01:17<15:20, 3.04s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 9%|███████▏ | 29/331 [01:20<14:57, 2.97s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 9%|███████▍ | 30/331 [01:22<14:22, 2.86s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 9%|███████▋ | 31/331 [01:25<13:48, 2.76s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 10%|███████▉ | 32/331 [01:28<13:34, 2.72s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 10%|████████▏ | 33/331 [01:30<13:37, 2.74s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 10%|████████▍ | 34/331 [01:33<13:33, 2.74s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 11%|████████▋ | 35/331 [01:36<13:43, 2.78s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 11%|████████▉ | 36/331 [01:39<14:18, 2.91s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 11%|█████████▏ | 37/331 [01:43<14:59, 3.06s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 11%|█████████▍ | 38/331 [01:46<15:14, 3.12s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 12%|█████████▋ | 39/331 [01:49<15:17, 3.14s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 12%|█████████▉ | 40/331 [01:51<13:55, 2.87s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 12%|██████████▏ | 41/331 [01:54<13:13, 2.74s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 13%|██████████▍ | 42/331 [01:57<14:11, 2.95s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 13%|██████████▋ | 43/331 [02:01<14:49, 3.09s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 13%|██████████▉ | 44/331 [02:04<15:18, 3.20s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 14%|███████████▏ | 45/331 [02:07<14:29, 3.04s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 14%|███████████▍ | 46/331 [02:09<13:27, 2.83s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 14%|███████████▋ | 47/331 [02:11<12:33, 2.65s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 15%|███████████▉ | 48/331 [02:14<12:57, 2.75s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 15%|████████████▏ | 49/331 [02:17<13:35, 2.89s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 15%|████████████▍ | 50/331 [02:20<13:25, 2.87s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 15%|████████████▋ | 51/331 [02:23<13:45, 2.95s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 16%|████████████▉ | 52/331 [02:26<13:07, 2.82s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 16%|█████████████▏ | 53/331 [02:29<13:02, 2.82s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 16%|█████████████▍ | 54/331 [02:31<12:28, 2.70s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 17%|█████████████▋ | 55/331 [02:35<13:36, 2.96s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 17%|█████████████▊ | 56/331 [02:38<13:24, 2.93s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 17%|██████████████ | 57/331 [02:40<12:59, 2.85s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 18%|██████████████▎ | 58/331 [02:43<13:31, 2.97s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 18%|██████████████▌ | 59/331 [02:46<12:46, 2.82s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 18%|██████████████▊ | 60/331 [02:49<12:23, 2.74s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 18%|███████████████ | 61/331 [02:52<12:51, 2.86s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 19%|███████████████▎ | 62/331 [02:54<12:42, 2.83s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 19%|███████████████▌ | 63/331 [02:58<13:50, 3.10s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 19%|███████████████▊ | 64/331 [03:01<13:17, 2.99s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 20%|████████████████ | 65/331 [03:04<13:06, 2.96s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 20%|████████████████▎ | 66/331 [03:08<14:16, 3.23s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 20%|████████████████▌ | 67/331 [03:11<14:55, 3.39s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 21%|████████████████▊ | 68/331 [03:15<15:07, 3.45s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 21%|█████████████████ | 69/331 [03:18<14:45, 3.38s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 21%|█████████████████▎ | 70/331 [03:21<14:23, 3.31s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 21%|█████████████████▌ | 71/331 [03:25<14:29, 3.34s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 22%|█████████████████▊ | 72/331 [03:28<14:23, 3.34s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 22%|██████████████████ | 73/331 [03:31<13:56, 3.24s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 22%|██████████████████▎ | 74/331 [03:34<13:35, 3.17s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 23%|██████████████████▌ | 75/331 [03:37<13:44, 3.22s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 23%|██████████████████▊ | 76/331 [03:40<13:02, 3.07s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 23%|███████████████████ | 77/331 [03:43<12:41, 3.00s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 24%|███████████████████▎ | 78/331 [03:46<12:05, 2.87s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 24%|███████████████████▌ | 79/331 [03:48<11:44, 2.79s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 24%|███████████████████▊ | 80/331 [03:51<11:33, 2.76s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 24%|████████████████████ | 81/331 [03:54<12:01, 2.89s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 25%|████████████████████▎ | 82/331 [03:57<11:49, 2.85s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 25%|████████████████████▌ | 83/331 [04:00<12:08, 2.94s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 25%|████████████████████▊ | 84/331 [04:04<12:54, 3.13s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 26%|█████████████████████ | 85/331 [04:06<12:00, 2.93s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 26%|█████████████████████▎ | 86/331 [04:09<12:39, 3.10s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 26%|█████████████████████▌ | 87/331 [04:12<12:15, 3.01s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 27%|█████████████████████▊ | 88/331 [04:15<11:54, 2.94s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 27%|██████████████████████ | 89/331 [04:17<11:09, 2.77s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 27%|██████████████████████▎ | 90/331 [04:20<10:36, 2.64s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 27%|██████████████████████▌ | 91/331 [04:23<11:03, 2.77s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 28%|██████████████████████▊ | 92/331 [04:25<10:20, 2.60s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 28%|███████████████████████ | 93/331 [04:28<10:31, 2.65s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 28%|███████████████████████▎ | 94/331 [04:31<10:45, 2.72s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 29%|███████████████████████▌ | 95/331 [04:34<10:54, 2.77s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 29%|███████████████████████▊ | 96/331 [04:37<11:02, 2.82s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 29%|████████████████████████ | 97/331 [04:39<10:33, 2.71s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 30%|████████████████████████▎ | 98/331 [04:42<10:52, 2.80s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 30%|████████████████████████▌ | 99/331 [04:45<10:51, 2.81s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 30%|████████████████████████▍ | 100/331 [04:47<10:19, 2.68s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 31%|████████████████████████▋ | 101/331 [04:50<10:16, 2.68s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 31%|████████████████████████▉ | 102/331 [04:53<11:01, 2.89s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 31%|█████████████████████████▏ | 103/331 [04:56<10:33, 2.78s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 31%|█████████████████████████▍ | 104/331 [04:59<10:35, 2.80s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 32%|█████████████████████████▋ | 105/331 [05:02<10:41, 2.84s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 32%|█████████████████████████▉ | 106/331 [05:04<10:38, 2.84s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 32%|██████████████████████████▏ | 107/331 [05:07<09:55, 2.66s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 33%|██████████████████████████▍ | 108/331 [05:09<09:42, 2.61s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 33%|██████████████████████████▋ | 109/331 [05:12<09:39, 2.61s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 33%|██████████████████████████▉ | 110/331 [05:15<10:09, 2.76s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 34%|███████████████████████████▏ | 111/331 [05:18<10:15, 2.80s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 34%|███████████████████████████▍ | 112/331 [05:21<10:14, 2.80s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 34%|███████████████████████████▋ | 113/331 [05:23<09:39, 2.66s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 34%|███████████████████████████▉ | 114/331 [05:26<09:41, 2.68s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 35%|████████████████████████████▏ | 115/331 [05:28<09:38, 2.68s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 35%|████████████████████████████▍ | 116/331 [05:31<09:58, 2.79s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 35%|████████████████████████████▋ | 117/331 [05:34<09:56, 2.79s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 36%|████████████████████████████▉ | 118/331 [05:37<09:45, 2.75s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 36%|█████████████████████████████ | 119/331 [05:40<09:45, 2.76s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 36%|█████████████████████████████▎ | 120/331 [05:42<09:40, 2.75s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 37%|█████████████████████████████▌ | 121/331 [05:46<10:10, 2.91s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 37%|█████████████████████████████▊ | 122/331 [05:48<09:52, 2.83s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 37%|██████████████████████████████ | 123/331 [05:52<10:27, 3.02s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 37%|██████████████████████████████▎ | 124/331 [05:55<10:16, 2.98s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 38%|██████████████████████████████▌ | 125/331 [05:58<10:49, 3.15s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 38%|██████████████████████████████▊ | 126/331 [06:01<10:55, 3.20s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 38%|███████████████████████████████ | 127/331 [06:05<11:21, 3.34s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 39%|███████████████████████████████▎ | 128/331 [06:08<11:22, 3.36s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 39%|███████████████████████████████▌ | 129/331 [06:12<11:07, 3.30s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 39%|███████████████████████████████▊ | 130/331 [06:15<11:13, 3.35s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 40%|████████████████████████████████ | 131/331 [06:19<11:28, 3.44s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 40%|████████████████████████████████▎ | 132/331 [06:22<10:50, 3.27s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 40%|████████████████████████████████▌ | 133/331 [06:24<10:08, 3.08s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 40%|████████████████████████████████▊ | 134/331 [06:27<09:48, 2.99s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 41%|█████████████████████████████████ | 135/331 [06:30<09:53, 3.03s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 41%|█████████████████████████████████▎ | 136/331 [06:33<10:06, 3.11s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 41%|█████████████████████████████████▌ | 137/331 [06:37<10:27, 3.23s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 42%|█████████████████████████████████▊ | 138/331 [06:41<10:46, 3.35s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 42%|██████████████████████████████████ | 139/331 [06:43<09:38, 3.01s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 42%|██████████████████████████████████▎ | 140/331 [06:47<10:19, 3.25s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 43%|██████████████████████████████████▌ | 141/331 [06:49<09:49, 3.10s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 43%|██████████████████████████████████▋ | 142/331 [06:52<09:28, 3.01s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 43%|██████████████████████████████████▉ | 143/331 [06:56<09:52, 3.15s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 44%|███████████████████████████████████▏ | 144/331 [06:58<09:27, 3.04s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 44%|███████████████████████████████████▍ | 145/331 [07:01<09:16, 2.99s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 44%|███████████████████████████████████▋ | 146/331 [07:05<09:43, 3.16s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 44%|███████████████████████████████████▉ | 147/331 [07:08<09:22, 3.06s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 45%|████████████████████████████████████▏ | 148/331 [07:10<08:45, 2.87s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 45%|████████████████████████████████████▍ | 149/331 [07:13<08:16, 2.73s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 45%|████████████████████████████████████▋ | 150/331 [07:16<08:36, 2.85s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 46%|████████████████████████████████████▉ | 151/331 [07:18<08:26, 2.81s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 46%|█████████████████████████████████████▏ | 152/331 [07:21<08:03, 2.70s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 46%|█████████████████████████████████████▍ | 153/331 [07:23<07:57, 2.68s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 47%|█████████████████████████████████████▋ | 154/331 [07:27<08:22, 2.84s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 47%|█████████████████████████████████████▉ | 155/331 [07:30<08:47, 3.00s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 47%|██████████████████████████████████████▏ | 156/331 [07:33<08:57, 3.07s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 47%|██████████████████████████████████████▍ | 157/331 [07:37<09:14, 3.19s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 48%|██████████████████████████████████████▋ | 158/331 [07:40<09:17, 3.22s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 48%|██████████████████████████████████████▉ | 159/331 [07:43<09:23, 3.28s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 48%|███████████████████████████████████████▏ | 160/331 [07:46<08:51, 3.11s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 49%|███████████████████████████████████████▍ | 161/331 [07:49<08:39, 3.05s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 49%|███████████████████████████████████████▋ | 162/331 [07:53<09:05, 3.23s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 49%|███████████████████████████████████████▉ | 163/331 [07:56<09:06, 3.25s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 50%|████████████████████████████████████████▏ | 164/331 [07:59<08:38, 3.11s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 50%|████████████████████████████████████████▍ | 165/331 [08:02<08:25, 3.05s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 50%|████████████████████████████████████████▌ | 166/331 [08:04<08:09, 2.97s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 50%|████████████████████████████████████████▊ | 167/331 [08:08<08:16, 3.03s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 51%|█████████████████████████████████████████ | 168/331 [08:10<07:49, 2.88s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 51%|█████████████████████████████████████████▎ | 169/331 [08:13<07:56, 2.94s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 51%|█████████████████████████████████████████▌ | 170/331 [08:16<07:30, 2.80s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 52%|█████████████████████████████████████████▊ | 171/331 [08:19<07:27, 2.79s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 52%|██████████████████████████████████████████ | 172/331 [08:21<07:10, 2.71s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 52%|██████████████████████████████████████████▎ | 173/331 [08:24<07:23, 2.81s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 53%|██████████████████████████████████████████▌ | 174/331 [08:27<07:04, 2.71s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 53%|██████████████████████████████████████████▊ | 175/331 [08:29<07:08, 2.74s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 53%|███████████████████████████████████████████ | 176/331 [08:32<06:52, 2.66s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 53%|███████████████████████████████████████████▎ | 177/331 [08:35<07:14, 2.82s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 54%|███████████████████████████████████████████▌ | 178/331 [08:39<07:41, 3.01s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 54%|███████████████████████████████████████████▊ | 179/331 [08:42<08:01, 3.17s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 54%|████████████████████████████████████████████ | 180/331 [08:45<07:53, 3.13s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 55%|████████████████████████████████████████████▎ | 181/331 [08:48<07:46, 3.11s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 55%|████████████████████████████████████████████▌ | 182/331 [08:50<07:09, 2.88s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 55%|████████████████████████████████████████████▊ | 183/331 [08:53<06:37, 2.68s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 56%|█████████████████████████████████████████████ | 184/331 [08:55<06:11, 2.53s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 56%|█████████████████████████████████████████████▎ | 185/331 [08:57<05:46, 2.38s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 56%|█████████████████████████████████████████████▌ | 186/331 [09:00<05:57, 2.46s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 56%|█████████████████████████████████████████████▊ | 187/331 [09:03<06:26, 2.69s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 57%|██████████████████████████████████████████████ | 188/331 [09:06<06:26, 2.71s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 57%|██████████████████████████████████████████████▎ | 189/331 [09:08<06:07, 2.59s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 57%|██████████████████████████████████████████████▍ | 190/331 [09:10<05:52, 2.50s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 58%|██████████████████████████████████████████████▋ | 191/331 [09:13<05:51, 2.51s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 58%|██████████████████████████████████████████████▉ | 192/331 [09:15<05:41, 2.46s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 58%|███████████████████████████████████████████████▏ | 193/331 [09:18<06:10, 2.68s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 59%|███████████████████████████████████████████████▍ | 194/331 [09:20<05:49, 2.55s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 59%|███████████████████████████████████████████████▋ | 195/331 [09:23<05:43, 2.52s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 59%|███████████████████████████████████████████████▉ | 196/331 [09:26<05:48, 2.58s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 60%|████████████████████████████████████████████████▏ | 197/331 [09:29<06:06, 2.73s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 60%|████████████████████████████████████████████████▍ | 198/331 [09:31<05:48, 2.62s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 60%|████████████████████████████████████████████████▋ | 199/331 [09:34<05:54, 2.69s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 60%|████████████████████████████████████████████████▉ | 200/331 [09:36<05:36, 2.57s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 61%|█████████████████████████████████████████████████▏ | 201/331 [09:39<05:32, 2.56s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 61%|█████████████████████████████████████████████████▍ | 202/331 [09:42<05:41, 2.65s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 61%|█████████████████████████████████████████████████▋ | 203/331 [09:44<05:41, 2.67s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 62%|█████████████████████████████████████████████████▉ | 204/331 [09:48<06:01, 2.85s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 62%|██████████████████████████████████████████████████▏ | 205/331 [09:51<06:01, 2.87s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 62%|██████████████████████████████████████████████████▍ | 206/331 [09:53<05:52, 2.82s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 63%|██████████████████████████████████████████████████▋ | 207/331 [09:57<06:07, 2.96s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 63%|██████████████████████████████████████████████████▉ | 208/331 [10:00<06:12, 3.03s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 63%|███████████████████████████████████████████████████▏ | 209/331 [10:02<05:43, 2.82s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 63%|███████████████████████████████████████████████████▍ | 210/331 [10:04<05:20, 2.65s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 64%|███████████████████████████████████████████████████▋ | 211/331 [10:07<05:25, 2.71s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 64%|███████████████████████████████████████████████████▉ | 212/331 [10:09<05:09, 2.60s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 64%|████████████████████████████████████████████████████ | 213/331 [10:12<05:07, 2.61s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 65%|████████████████████████████████████████████████████▎ | 214/331 [10:14<04:49, 2.48s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 65%|████████████████████████████████████████████████████▌ | 215/331 [10:16<04:37, 2.39s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 65%|████████████████████████████████████████████████████▊ | 216/331 [10:20<05:06, 2.67s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 66%|█████████████████████████████████████████████████████ | 217/331 [10:22<05:05, 2.68s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 66%|█████████████████████████████████████████████████████▎ | 218/331 [10:26<05:19, 2.83s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 66%|█████████████████████████████████████████████████████▌ | 219/331 [10:28<05:15, 2.81s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 66%|█████████████████████████████████████████████████████▊ | 220/331 [10:31<05:01, 2.71s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 67%|██████████████████████████████████████████████████████ | 221/331 [10:34<05:03, 2.76s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 67%|██████████████████████████████████████████████████████▎ | 222/331 [10:36<04:50, 2.67s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 67%|██████████████████████████████████████████████████████▌ | 223/331 [10:39<04:52, 2.71s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 68%|██████████████████████████████████████████████████████▊ | 224/331 [10:42<04:52, 2.73s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 68%|███████████████████████████████████████████████████████ | 225/331 [10:45<04:47, 2.72s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 68%|███████████████████████████████████████████████████████▎ | 226/331 [10:48<04:59, 2.85s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 69%|███████████████████████████████████████████████████████▌ | 227/331 [10:50<04:53, 2.82s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 69%|███████████████████████████████████████████████████████▊ | 228/331 [10:53<04:46, 2.78s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 69%|████████████████████████████████████████████████████████ | 229/331 [10:56<04:41, 2.76s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 69%|████████████████████████████████████████████████████████▎ | 230/331 [10:58<04:32, 2.70s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 70%|████████████████████████████████████████████████████████▌ | 231/331 [11:01<04:38, 2.79s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 70%|████████████████████████████████████████████████████████▊ | 232/331 [11:04<04:31, 2.74s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 70%|█████████████████████████████████████████████████████████ | 233/331 [11:07<04:39, 2.85s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 71%|█████████████████████████████████████████████████████████▎ | 234/331 [11:10<04:25, 2.74s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 71%|█████████████████████████████████████████████████████████▌ | 235/331 [11:12<04:15, 2.66s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 71%|█████████████████████████████████████████████████████████▊ | 236/331 [11:16<04:42, 2.97s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 72%|█████████████████████████████████████████████████████████▉ | 237/331 [11:19<04:51, 3.10s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 72%|██████████████████████████████████████████████████████████▏ | 238/331 [11:22<04:49, 3.11s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 72%|██████████████████████████████████████████████████████████▍ | 239/331 [11:26<04:49, 3.15s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 73%|██████████████████████████████████████████████████████████▋ | 240/331 [11:29<04:52, 3.21s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 73%|██████████████████████████████████████████████████████████▉ | 241/331 [11:32<04:57, 3.30s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 73%|███████████████████████████████████████████████████████████▏ | 242/331 [11:36<04:55, 3.32s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 73%|███████████████████████████████████████████████████████████▍ | 243/331 [11:39<04:52, 3.33s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 74%|███████████████████████████████████████████████████████████▋ | 244/331 [11:43<04:58, 3.43s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 74%|███████████████████████████████████████████████████████████▉ | 245/331 [11:46<04:46, 3.34s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 74%|████████████████████████████████████████████████████████████▏ | 246/331 [11:50<04:55, 3.48s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 75%|████████████████████████████████████████████████████████████▍ | 247/331 [11:53<04:42, 3.36s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 75%|████████████████████████████████████████████████████████████▋ | 248/331 [11:55<04:21, 3.15s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 75%|████████████████████████████████████████████████████████████▉ | 249/331 [11:58<04:00, 2.93s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 76%|█████████████████████████████████████████████████████████████▏ | 250/331 [12:00<03:48, 2.82s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 76%|█████████████████████████████████████████████████████████████▍ | 251/331 [12:03<03:51, 2.89s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 76%|█████████████████████████████████████████████████████████████▋ | 252/331 [12:06<03:38, 2.77s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 76%|█████████████████████████████████████████████████████████████▉ | 253/331 [12:09<03:49, 2.94s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 77%|██████████████████████████████████████████████████████████████▏ | 254/331 [12:12<03:40, 2.87s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 77%|██████████████████████████████████████████████████████████████▍ | 255/331 [12:15<03:46, 2.99s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 77%|██████████████████████████████████████████████████████████████▋ | 256/331 [12:18<03:38, 2.91s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 78%|██████████████████████████████████████████████████████████████▉ | 257/331 [12:21<03:41, 2.99s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 78%|███████████████████████████████████████████████████████████████▏ | 258/331 [12:24<03:27, 2.84s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 78%|███████████████████████████████████████████████████████████████▍ | 259/331 [12:26<03:21, 2.80s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 79%|███████████████████████████████████████████████████████████████▋ | 260/331 [12:29<03:23, 2.86s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 79%|███████████████████████████████████████████████████████████████▊ | 261/331 [12:32<03:09, 2.71s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 79%|████████████████████████████████████████████████████████████████ | 262/331 [12:34<03:07, 2.72s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 79%|████████████████████████████████████████████████████████████████▎ | 263/331 [12:38<03:14, 2.87s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 80%|████████████████████████████████████████████████████████████████▌ | 264/331 [12:40<03:07, 2.80s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 80%|████████████████████████████████████████████████████████████████▊ | 265/331 [12:43<03:01, 2.75s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 80%|█████████████████████████████████████████████████████████████████ | 266/331 [12:46<02:54, 2.69s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 81%|█████████████████████████████████████████████████████████████████▎ | 267/331 [12:49<03:03, 2.87s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 81%|█████████████████████████████████████████████████████████████████▌ | 268/331 [12:52<03:00, 2.86s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 81%|█████████████████████████████████████████████████████████████████▊ | 269/331 [12:55<03:07, 3.02s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 82%|██████████████████████████████████████████████████████████████████ | 270/331 [12:58<03:02, 2.99s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 82%|██████████████████████████████████████████████████████████████████▎ | 271/331 [13:01<03:05, 3.10s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 82%|██████████████████████████████████████████████████████████████████▌ | 272/331 [13:04<02:56, 3.00s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 82%|██████████████████████████████████████████████████████████████████▊ | 273/331 [13:07<02:55, 3.03s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 83%|███████████████████████████████████████████████████████████████████ | 274/331 [13:11<03:01, 3.19s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 83%|███████████████████████████████████████████████████████████████████▎ | 275/331 [13:14<03:01, 3.23s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 83%|███████████████████████████████████████████████████████████████████▌ | 276/331 [13:17<02:47, 3.05s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|███████████████████████████████████████████████████████████████████▊ | 277/331 [13:19<02:39, 2.96s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|████████████████████████████████████████████████████████████████████ | 278/331 [13:22<02:35, 2.93s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|████████████████████████████████████████████████████████████████████▎ | 279/331 [13:26<02:45, 3.17s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████████▌ | 280/331 [13:29<02:37, 3.09s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████████▊ | 281/331 [13:32<02:39, 3.18s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████████ | 282/331 [13:35<02:35, 3.17s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████████▎ | 283/331 [13:39<02:35, 3.24s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|█████████████████████████████████████████████████████████████████████▍ | 284/331 [13:42<02:37, 3.35s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|█████████████████████████████████████████████████████████████████████▋ | 285/331 [13:46<02:36, 3.40s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|█████████████████████████████████████████████████████████████████████▉ | 286/331 [13:50<02:34, 3.44s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|██████████████████████████████████████████████████████████████████████▏ | 287/331 [13:53<02:35, 3.54s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|██████████████████████████████████████████████████████████████████████▍ | 288/331 [13:57<02:30, 3.50s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|██████████████████████████████████████████████████████████████████████▋ | 289/331 [14:00<02:18, 3.29s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|██████████████████████████████████████████████████████████████████████▉ | 290/331 [14:02<02:07, 3.10s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|███████████████████████████████████████████████████████████████████████▏ | 291/331 [14:05<01:58, 2.96s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|███████████████████████████████████████████████████████████████████████▍ | 292/331 [14:08<01:52, 2.88s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|███████████████████████████████████████████████████████████████████████▋ | 293/331 [14:10<01:48, 2.87s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|███████████████████████████████████████████████████████████████████████▉ | 294/331 [14:13<01:42, 2.76s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|████████████████████████████████████████████████████████████████████████▏ | 295/331 [14:15<01:36, 2.68s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|████████████████████████████████████████████████████████████████████████▍ | 296/331 [14:18<01:31, 2.61s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|████████████████████████████████████████████████████████████████████████▋ | 297/331 [14:21<01:39, 2.92s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|████████████████████████████████████████████████████████████████████████▉ | 298/331 [14:25<01:44, 3.16s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|█████████████████████████████████████████████████████████████████████████▏ | 299/331 [14:28<01:37, 3.04s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|█████████████████████████████████████████████████████████████████████████▍ | 300/331 [14:31<01:33, 3.00s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|█████████████████████████████████████████████████████████████████████████▋ | 301/331 [14:34<01:28, 2.95s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|█████████████████████████████████████████████████████████████████████████▉ | 302/331 [14:36<01:23, 2.89s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 92%|██████████████████████████████████████████████████████████████████████████▏ | 303/331 [14:39<01:18, 2.80s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 92%|██████████████████████████████████████████████████████████████████████████▍ | 304/331 [14:42<01:18, 2.90s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 92%|██████████████████████████████████████████████████████████████████████████▋ | 305/331 [14:45<01:18, 3.02s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 92%|██████████████████████████████████████████████████████████████████████████▉ | 306/331 [14:49<01:19, 3.19s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|███████████████████████████████████████████████████████████████████████████▏ | 307/331 [14:53<01:19, 3.32s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|███████████████████████████████████████████████████████████████████████████▎ | 308/331 [14:57<01:20, 3.51s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|███████████████████████████████████████████████████████████████████████████▌ | 309/331 [15:00<01:17, 3.54s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|███████████████████████████████████████████████████████████████████████████▊ | 310/331 [15:03<01:09, 3.32s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|████████████████████████████████████████████████████████████████████████████ | 311/331 [15:06<01:06, 3.31s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|████████████████████████████████████████████████████████████████████████████▎ | 312/331 [15:09<00:58, 3.10s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|████████████████████████████████████████████████████████████████████████████▌ | 313/331 [15:12<00:54, 3.04s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|████████████████████████████████████████████████████████████████████████████▊ | 314/331 [15:15<00:52, 3.09s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|█████████████████████████████████████████████████████████████████████████████ | 315/331 [15:18<00:51, 3.19s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|█████████████████████████████████████████████████████████████████████████████▎ | 316/331 [15:22<00:48, 3.22s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|█████████████████████████████████████████████████████████████████████████████▌ | 317/331 [15:25<00:47, 3.36s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|█████████████████████████████████████████████████████████████████████████████▊ | 318/331 [15:28<00:41, 3.18s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|██████████████████████████████████████████████████████████████████████████████ | 319/331 [15:31<00:36, 3.02s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|██████████████████████████████████████████████████████████████████████████████▎ | 320/331 [15:34<00:33, 3.06s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|██████████████████████████████████████████████████████████████████████████████▌ | 321/331 [15:37<00:30, 3.01s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|██████████████████████████████████████████████████████████████████████████████▊ | 322/331 [15:40<00:28, 3.16s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|███████████████████████████████████████████████████████████████████████████████ | 323/331 [15:43<00:24, 3.06s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|███████████████████████████████████████████████████████████████████████████████ | 323/331 [15:43<00:24, 3.06s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|███████████████████████████████████████████████████████████████████████████████ | 323/331 [15:43<00:24, 3.06s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|███████████████████████████████████████████████████████████████████████████████▌ | 325/331 [15:50<00:19, 3.22s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|███████████████████████████████████████████████████████████████████████████████▊ | 326/331 [15:53<00:16, 3.26s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|████████████████████████████████████████████████████████████████████████████████ | 327/331 [15:57<00:12, 3.25s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|████████████████████████████████████████████████████████████████████████████████▎| 328/331 [16:00<00:09, 3.26s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|████████████████████████████████████████████████████████████████████████████████▎| 328/331 [16:00<00:09, 3.26s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|████████████████████████████████████████████████████████████████████████████████▎| 328/331 [16:00<00:09, 3.26s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 100%|████████████████████████████████████████████████████████████████████████████████▊| 330/331 [16:07<00:03, 3.35s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 100%|█████████████████████████████████████████████████████████████████████████████████| 331/331 [16:08<00:00, 2.92s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 100%|█████████████████████████████████████████████████████████████████████████████████| 331/331 [16:08<00:00, 2.92s/it][INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 03/03/2022 01:26:46 - INFO - datasets.metric - Removing /home/sanchit_huggingface_co/.cache/huggingface/metrics/wer/default/default_experiment-1-0.arrow [INFO|configuration_utils.py:438] 2022-03-03 01:26:46,475 >> Configuration saved in ./checkpoint-500/config.json [INFO|trainer.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|feature_extraction_utils.py:324] 2022-03-03 01:27:02,906 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonner.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|feature_extraction_utils.py:324] 2022-03-03 01:27:02,906 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonner.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|feature_extraction_utils.py:324] 2022-03-03 01:27:02,906 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonner.py:560] 2022-03-03 01:10:34,077 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 03/03/2022 01:28:45 - WARNING - huggingface_hub.repository - Adding files tracked by Git LFS: ['wandb/run-20220302_214437-2u4nhnsf/run-2u4nhnsf.wandb', 'wandb/run-20220302_222605-10glutwr/run-10glutwr.wandb', 'wandb/run-20220302_233655-33dtvgaa/run-33dtvgaa.wandb', 'wandb/run-20220303_004520-25bnjrx1/run-25bnjrx1.wandb']. This may take a bit of time if the files are large.