0%| | 0/594 [00:00> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:02:22,067 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:02:24,742 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8182, 'learning_rate': 0.0, 'epoch': 0.0} [WARNING|modeling_utils.py:388] 2022-03-02 18:02:27,332 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 0%|▏ | 1/594 [00:11<1:50:26, 11.17s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:02:29,957 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:02:32,529 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:02:35,059 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 5.0028, 'learning_rate': 6e-07, 'epoch': 0.0} [WARNING|modeling_utils.py:388] 2022-03-02 18:02:37,734 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 0%|▎ | 2/594 [00:21<1:46:30, 10.79s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:02:40,394 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:02:42,920 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:02:45,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8689, 'learning_rate': 1.2e-06, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-02 18:02:47,969 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▍ | 3/594 [00:31<1:43:28, 10.51s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:02:50,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:02:53,125 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:02:55,631 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8344, 'learning_rate': 1.8e-06, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-02 18:02:58,177 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▌ | 4/594 [00:42<1:42:08, 10.39s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:03:00,843 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:03:03,369 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:03:05,853 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7712, 'learning_rate': 2.4e-06, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-02 18:03:08,381 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 5/594 [00:52<1:41:19, 10.32s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:03:10,956 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:03:13,466 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:03:15,947 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7584, 'learning_rate': 2.9999999999999997e-06, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-02 18:03:18,401 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▊ | 6/594 [01:02<1:40:08, 10.22s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:03:20,998 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:03:23,438 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:03:25,961 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8249, 'learning_rate': 3.6e-06, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-03-02 18:03:28,476 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▉ | 7/594 [01:12<1:39:30, 10.17s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:03:31,077 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:03:33,503 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:03:35,990 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:03:38,423 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6967, 'learning_rate': 4.2e-06, 'epoch': 0.01} 1%|█ | 8/594 [01:22<1:38:38, 10.10s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:03:41,016 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:03:43,456 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:03:45,917 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:03:48,400 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▏ | 9/594 [01:32<1:38:05, 10.06s/it] 2%|█▏ | 9/594 [01:32<1:38:05, 10.06s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:03:50,967 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:03:53,343 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:03:55,759 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:03:58,166 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▎ | 10/594 [01:42<1:37:02, 9.97s/it] 2%|█▎ | 10/594 [01:42<1:37:02, 9.97s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:04:00,725 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:04:03,092 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:04:05,513 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:04:07,879 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▍ | 11/594 [01:51<1:36:07, 9.89s/it] 2%|█▍ | 11/594 [01:51<1:36:07, 9.89s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:04:10,372 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:04:12,741 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:04:15,113 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.646, 'learning_rate': 6.599999999999999e-06, 'epoch': 0.02} [WARNING|modeling_utils.py:388] 2022-03-02 18:04:17,460 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▌ | 12/594 [02:01<1:35:01, 9.80s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:04:19,936 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:04:22,326 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:04:24,680 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:04:27,004 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 13/594 [02:10<1:34:07, 9.72s/it] 2%|█▊ | 13/594 [02:10<1:34:07, 9.72s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:04:29,484 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:04:31,858 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:04:34,196 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:04:36,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▉ | 14/594 [02:20<1:33:31, 9.68s/it] 2%|█▉ | 14/594 [02:20<1:33:31, 9.68s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:04:39,064 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:04:41,393 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:04:43,773 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:04:46,132 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3786, 'learning_rate': 8.4e-06, 'epoch': 0.03} 3%|██ | 15/594 [02:30<1:33:01, 9.64s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:04:48,514 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:04:50,838 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:04:53,092 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:04:55,368 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▏ | 16/594 [02:39<1:31:41, 9.52s/it] 3%|██▏ | 16/594 [02:39<1:31:41, 9.52s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:04:57,730 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:05:00,033 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:05:02,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:05:04,639 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▎ | 17/594 [02:48<1:30:48, 9.44s/it] 3%|██▎ | 17/594 [02:48<1:30:48, 9.44s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:05:07,051 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:05:09,324 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:05:11,676 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:05:13,971 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▍ | 18/594 [02:57<1:30:20, 9.41s/it] 3%|██▍ | 18/594 [02:57<1:30:20, 9.41s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:05:16,406 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:05:18,685 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:05:20,961 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:05:23,264 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▌ | 19/594 [03:07<1:29:50, 9.37s/it] 3%|██▌ | 19/594 [03:07<1:29:50, 9.37s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:05:25,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:05:27,874 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:05:30,148 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5417, 'learning_rate': 1.14e-05, 'epoch': 0.03} [WARNING|modeling_utils.py:388] 2022-03-02 18:05:32,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▋ | 20/594 [03:16<1:29:07, 9.32s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:05:34,784 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:05:37,089 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:05:39,358 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:05:41,639 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▊ | 21/594 [03:25<1:28:37, 9.28s/it] 4%|██▊ | 21/594 [03:25<1:28:37, 9.28s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:05:43,928 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:05:46,160 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:05:48,369 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4333, 'learning_rate': 1.26e-05, 'epoch': 0.04} [WARNING|modeling_utils.py:388] 2022-03-02 18:05:50,560 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▉ | 22/594 [03:34<1:27:26, 9.17s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:05:52,894 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:05:55,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:05:57,344 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4354, 'learning_rate': 1.3199999999999997e-05, 'epoch': 0.04} [WARNING|modeling_utils.py:388] 2022-03-02 18:05:59,548 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 23/594 [03:43<1:26:45, 9.12s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:06:01,880 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:06:04,089 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:06:06,307 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:06:08,478 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4696, 'learning_rate': 1.3799999999999998e-05, 'epoch': 0.04} 4%|███▏ | 24/594 [03:52<1:26:04, 9.06s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:06:10,762 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:06:12,904 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:06:15,103 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2463, 'learning_rate': 1.44e-05, 'epoch': 0.04} [WARNING|modeling_utils.py:388] 2022-03-02 18:06:17,799 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▎ | 25/594 [04:01<1:26:40, 9.14s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:06:20,151 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:06:22,315 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:06:20,151 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:06:24,453 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:06:20,151 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▌ | 26/594 [04:10<1:25:33, 9.04s/it]g-point operations will not be computed-02 18:06:20,151 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▌ | 26/594 [04:10<1:25:33, 9.04s/it]g-point operations will not be computed-02 18:06:20,151 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▌ | 26/594 [04:10<1:25:33, 9.04s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:06:28,853 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:06:30,992 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:06:28,853 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:06:33,147 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:06:28,853 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▋ | 27/594 [04:19<1:24:18, 8.92s/it]g-point operations will not be computed-02 18:06:28,853 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▋ | 27/594 [04:19<1:24:18, 8.92s/it]g-point operations will not be computed-02 18:06:28,853 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▋ | 27/594 [04:19<1:24:18, 8.92s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:06:37,431 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:06:39,557 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:06:37,431 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:06:41,682 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:06:37,431 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▊ | 28/594 [04:27<1:23:12, 8.82s/it]g-point operations will not be computed-02 18:06:37,431 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▊ | 28/594 [04:27<1:23:12, 8.82s/it]g-point operations will not be computed-02 18:06:37,431 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▊ | 28/594 [04:27<1:23:12, 8.82s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:06:46,092 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:06:48,220 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:06:46,092 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:06:50,315 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:06:46,092 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:06:50,315 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:06:46,092 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▉ | 29/594 [04:36<1:22:29, 8.76s/it]g-point operations will not be computed-02 18:06:46,092 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▉ | 29/594 [04:36<1:22:29, 8.76s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:06:54,716 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:06:56,797 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:06:54,716 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:06:58,915 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:06:54,716 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████ | 30/594 [04:44<1:21:39, 8.69s/it]g-point operations will not be computed-02 18:06:54,716 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████ | 30/594 [04:44<1:21:39, 8.69s/it]g-point operations will not be computed-02 18:06:54,716 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████ | 30/594 [04:44<1:21:39, 8.69s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:07:03,121 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:07:05,208 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:07:03,121 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:07:07,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:07:03,121 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▏ | 31/594 [04:53<1:20:19, 8.56s/it]g-point operations will not be computed-02 18:07:03,121 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▏ | 31/594 [04:53<1:20:19, 8.56s/it]g-point operations will not be computed-02 18:07:03,121 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▏ | 31/594 [04:53<1:20:19, 8.56s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:07:11,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:07:13,452 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:07:11,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:07:15,471 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:07:11,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▎ | 32/594 [05:01<1:19:19, 8.47s/it]g-point operations will not be computed-02 18:07:11,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▎ | 32/594 [05:01<1:19:19, 8.47s/it]g-point operations will not be computed-02 18:07:11,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▎ | 32/594 [05:01<1:19:19, 8.47s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:07:19,731 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:07:21,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:07:19,731 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:07:23,824 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:07:19,731 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 33/594 [05:09<1:18:51, 8.43s/it]g-point operations will not be computed-02 18:07:19,731 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 33/594 [05:09<1:18:51, 8.43s/it]g-point operations will not be computed-02 18:07:19,731 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 33/594 [05:09<1:18:51, 8.43s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:07:27,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:07:29,825 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:07:27,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:07:31,774 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:07:27,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:07:31,774 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:07:27,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 34/594 [05:17<1:17:05, 8.26s/it]g-point operations will not be computed-02 18:07:27,883 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 34/594 [05:17<1:17:05, 8.26s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:07:35,723 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:07:37,658 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:07:35,723 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:07:39,611 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:07:35,723 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:07:39,611 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:07:35,723 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 35/594 [05:25<1:15:42, 8.13s/it]g-point operations will not be computed-02 18:07:35,723 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 35/594 [05:25<1:15:42, 8.13s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:07:43,569 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:07:45,458 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:07:43,569 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:07:47,371 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:07:43,569 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:07:47,371 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:07:43,569 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 36/594 [05:33<1:14:30, 8.01s/it]g-point operations will not be computed-02 18:07:43,569 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 36/594 [05:33<1:14:30, 8.01s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:07:51,243 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:07:53,076 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:07:51,243 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:07:56,741 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:07:51,243 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:07:56,741 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:07:51,243 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2423, 'learning_rate': 2.1599999999999996e-05, 'epoch': 0.06} 6%|████▉ | 37/594 [05:40<1:12:54, 7.85s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:07:58,672 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:08:00,507 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:08:02,326 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:08:04,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|█████ | 38/594 [05:47<1:11:19, 7.70s/it] 6%|█████ | 38/594 [05:47<1:11:19, 7.70s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:08:05,998 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|█████ | 38/594 [05:47<1:11:19, 7.70s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:08:05,998 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 39/594 [05:55<1:09:34, 7.52s/it]g-point operations will not be computed-02 18:08:05,998 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 39/594 [05:55<1:09:34, 7.52s/it]g-point operations will not be computed-02 18:08:05,998 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 39/594 [05:55<1:09:34, 7.52s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:08:13,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 39/594 [05:55<1:09:34, 7.52s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:08:13,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:08:16,413 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:08:13,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:08:16,413 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:08:13,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 40/594 [06:01<1:07:40, 7.33s/it]g-point operations will not be computed-02 18:08:13,015 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 40/594 [06:01<1:07:40, 7.33s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:08:19,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:08:23,080 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:08:19,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 41/594 [06:08<1:05:26, 7.10s/it]g-point operations will not be computed-02 18:08:19,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 41/594 [06:08<1:05:26, 7.10s/it]g-point operations will not be computed-02 18:08:19,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 41/594 [06:08<1:05:26, 7.10s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:08:26,310 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:08:29,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:08:26,310 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 42/594 [06:14<1:02:53, 6.84s/it]g-point operations will not be computed-02 18:08:26,310 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 42/594 [06:14<1:02:53, 6.84s/it]g-point operations will not be computed-02 18:08:26,310 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 42/594 [06:14<1:02:53, 6.84s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:08:32,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:08:35,184 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:08:32,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▉ | 43/594 [06:20<59:33, 6.49s/it]g-point operations will not be computed-02 18:08:32,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▉ | 43/594 [06:20<59:33, 6.49s/it]g-point operations will not be computed-02 18:08:32,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▉ | 43/594 [06:20<59:33, 6.49s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:08:37,938 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|██████ | 44/594 [06:25<55:45, 6.08s/it]g-point operations will not be computed-02 18:08:37,938 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|██████ | 44/594 [06:25<55:45, 6.08s/it]g-point operations will not be computed-02 18:08:37,938 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|██████ | 44/594 [06:25<55:45, 6.08s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:08:42,953 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:08:45,210 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:08:42,953 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:08:45,210 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:08:42,953 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 45/594 [06:30<51:37, 5.64s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:08:47,429 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:08:49,459 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:08:47,429 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:08:49,459 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:08:47,429 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 46/594 [06:34<47:20, 5.18s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:08:51,431 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:08:53,210 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:08:51,431 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:08:53,210 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:08:51,431 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▍ | 47/594 [06:37<43:07, 4.73s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:08:55,000 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▋ | 48/594 [06:41<39:00, 4.29s/it]g-point operations will not be computed-02 18:08:55,000 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▋ | 48/594 [06:41<39:00, 4.29s/it]g-point operations will not be computed-02 18:08:55,000 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:08:59,547 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:08:58,167 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:08:59,547 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:08:58,167 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▊ | 49/594 [06:44<35:02, 3.86s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:09:00,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▉ | 50/594 [06:47<32:27, 3.58s/it]g-point operations will not be computed-02 18:09:00,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▉ | 50/594 [06:47<32:27, 3.58s/it]g-point operations will not be computed-02 18:09:00,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▉ | 50/594 [06:47<32:27, 3.58s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:09:06,023 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▉ | 50/594 [06:47<32:27, 3.58s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:09:06,023 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:09:11,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:09:06,023 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:09:11,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:09:06,023 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 51/594 [06:57<51:54, 5.74s/it]g-point operations will not be computed-02 18:09:06,023 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 51/594 [06:57<51:54, 5.74s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:09:16,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 51/594 [06:57<51:54, 5.74s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:09:16,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:09:21,547 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:09:16,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 18:09:16,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 18:09:16,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 52/594 [07:07<1:03:50, 7.07s/it]g-point operations will not be computed-02 18:09:16,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 52/594 [07:07<1:03:50, 7.07s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:09:26,718 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 52/594 [07:07<1:03:50, 7.07s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:09:26,718 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:09:31,772 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:09:26,718 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▏ | 53/594 [07:18<1:12:22, 8.03s/it]g-point operations will not be computed-02 18:09:26,718 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▏ | 53/594 [07:18<1:12:22, 8.03s/it]g-point operations will not be computed-02 18:09:26,718 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▏ | 53/594 [07:18<1:12:22, 8.03s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:09:36,930 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▏ | 53/594 [07:18<1:12:22, 8.03s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:09:36,930 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:09:41,949 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:09:36,930 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▎ | 54/594 [07:28<1:17:57, 8.66s/it]g-point operations will not be computed-02 18:09:36,930 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▎ | 54/594 [07:28<1:17:57, 8.66s/it]g-point operations will not be computed-02 18:09:36,930 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▎ | 54/594 [07:28<1:17:57, 8.66s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:09:47,013 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▎ | 54/594 [07:28<1:17:57, 8.66s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:09:47,013 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:09:51,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:09:47,013 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▍ | 55/594 [07:38<1:21:07, 9.03s/it]g-point operations will not be computed-02 18:09:47,013 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▍ | 55/594 [07:38<1:21:07, 9.03s/it]g-point operations will not be computed-02 18:09:47,013 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▍ | 55/594 [07:38<1:21:07, 9.03s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:09:56,984 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▍ | 55/594 [07:38<1:21:07, 9.03s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:09:56,984 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:10:01,863 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:09:56,984 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▌ | 56/594 [07:48<1:23:32, 9.32s/it]g-point operations will not be computed-02 18:09:56,984 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▌ | 56/594 [07:48<1:23:32, 9.32s/it]g-point operations will not be computed-02 18:09:56,984 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▌ | 56/594 [07:48<1:23:32, 9.32s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:10:06,923 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▌ | 56/594 [07:48<1:23:32, 9.32s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:10:06,923 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:10:11,887 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:10:06,923 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▋ | 57/594 [07:58<1:25:11, 9.52s/it]g-point operations will not be computed-02 18:10:06,923 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▋ | 57/594 [07:58<1:25:11, 9.52s/it]g-point operations will not be computed-02 18:10:06,923 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▋ | 57/594 [07:58<1:25:11, 9.52s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:10:16,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▋ | 57/594 [07:58<1:25:11, 9.52s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:10:16,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:10:21,753 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:10:16,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:10:21,753 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:10:16,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▊ | 58/594 [08:08<1:26:04, 9.64s/it]g-point operations will not be computed-02 18:10:16,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▊ | 58/594 [08:08<1:26:04, 9.64s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:10:26,736 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▊ | 58/594 [08:08<1:26:04, 9.64s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:10:26,736 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:10:31,601 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:10:26,736 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:10:31,601 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:10:26,736 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 59/594 [08:17<1:26:20, 9.68s/it]g-point operations will not be computed-02 18:10:26,736 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 59/594 [08:17<1:26:20, 9.68s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:10:36,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 59/594 [08:17<1:26:20, 9.68s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:10:36,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:10:41,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:10:36,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:10:41,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:10:36,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████ | 60/594 [08:27<1:26:30, 9.72s/it]g-point operations will not be computed-02 18:10:36,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████ | 60/594 [08:27<1:26:30, 9.72s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:10:46,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████ | 60/594 [08:27<1:26:30, 9.72s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:10:46,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:10:51,024 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:10:46,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:10:51,024 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:10:46,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████▏ | 61/594 [08:37<1:25:49, 9.66s/it]g-point operations will not be computed-02 18:10:46,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████▏ | 61/594 [08:37<1:25:49, 9.66s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:10:55,800 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████▏ | 61/594 [08:37<1:25:49, 9.66s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:10:55,800 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:11:00,487 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:10:55,800 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████▎ | 62/594 [08:46<1:25:10, 9.61s/it]g-point operations will not be computed-02 18:10:55,800 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████▎ | 62/594 [08:46<1:25:10, 9.61s/it]g-point operations will not be computed-02 18:10:55,800 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████▎ | 62/594 [08:46<1:25:10, 9.61s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:11:05,240 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████▎ | 62/594 [08:46<1:25:10, 9.61s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:11:05,240 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:11:10,020 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:11:05,240 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:11:10,020 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:11:05,240 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▍ | 63/594 [08:56<1:24:57, 9.60s/it]g-point operations will not be computed-02 18:11:05,240 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▍ | 63/594 [08:56<1:24:57, 9.60s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:11:14,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▍ | 63/594 [08:56<1:24:57, 9.60s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:11:14,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:11:19,480 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:11:14,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▌ | 64/594 [09:05<1:24:07, 9.52s/it]g-point operations will not be computed-02 18:11:14,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▌ | 64/594 [09:05<1:24:07, 9.52s/it]g-point operations will not be computed-02 18:11:14,821 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▌ | 64/594 [09:05<1:24:07, 9.52s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:11:24,146 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▌ | 64/594 [09:05<1:24:07, 9.52s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:11:24,146 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:11:28,785 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:11:24,146 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:11:28,785 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:11:24,146 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▊ | 65/594 [09:14<1:23:26, 9.47s/it]g-point operations will not be computed-02 18:11:24,146 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▊ | 65/594 [09:14<1:23:26, 9.47s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:11:33,506 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▊ | 65/594 [09:14<1:23:26, 9.47s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:11:33,506 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:11:38,157 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:11:33,506 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 66/594 [09:24<1:23:06, 9.44s/it]g-point operations will not be computed-02 18:11:33,506 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 66/594 [09:24<1:23:06, 9.44s/it]g-point operations will not be computed-02 18:11:33,506 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 66/594 [09:24<1:23:06, 9.44s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:11:42,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 66/594 [09:24<1:23:06, 9.44s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:11:42,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:11:47,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:11:42,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|█████████ | 67/594 [09:33<1:22:47, 9.43s/it]g-point operations will not be computed-02 18:11:42,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|█████████ | 67/594 [09:33<1:22:47, 9.43s/it]g-point operations will not be computed-02 18:11:42,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|█████████ | 67/594 [09:33<1:22:47, 9.43s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:11:52,330 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|█████████ | 67/594 [09:33<1:22:47, 9.43s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:11:52,330 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:11:56,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:11:52,330 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|█████████▏ | 68/594 [09:43<1:22:16, 9.38s/it]g-point operations will not be computed-02 18:11:52,330 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|█████████▏ | 68/594 [09:43<1:22:16, 9.38s/it]g-point operations will not be computed-02 18:11:52,330 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|█████████▏ | 68/594 [09:43<1:22:16, 9.38s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:12:01,552 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|█████████▏ | 68/594 [09:43<1:22:16, 9.38s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:12:01,552 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:12:06,048 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:12:01,552 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▎ | 69/594 [09:52<1:21:23, 9.30s/it]g-point operations will not be computed-02 18:12:01,552 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▎ | 69/594 [09:52<1:21:23, 9.30s/it]g-point operations will not be computed-02 18:12:01,552 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▎ | 69/594 [09:52<1:21:23, 9.30s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:12:10,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▎ | 69/594 [09:52<1:21:23, 9.30s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:12:10,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:12:15,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:12:10,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:12:15,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:12:10,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▍ | 70/594 [10:01<1:20:59, 9.27s/it]g-point operations will not be computed-02 18:12:10,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▍ | 70/594 [10:01<1:20:59, 9.27s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:12:19,815 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▍ | 70/594 [10:01<1:20:59, 9.27s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:12:19,815 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:12:24,341 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:12:19,815 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:12:24,341 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:12:19,815 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▌ | 71/594 [10:10<1:20:19, 9.22s/it]g-point operations will not be computed-02 18:12:19,815 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▌ | 71/594 [10:10<1:20:19, 9.22s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▌ | 71/594 [10:10<1:20:19, 9.22s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:12:33,389 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:12:33,389 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▋ | 72/594 [10:19<1:19:46, 9.17s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▋ | 72/594 [10:19<1:19:46, 9.17s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▋ | 72/594 [10:19<1:19:46, 9.17s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▋ | 72/594 [10:19<1:19:46, 9.17s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▊ | 73/594 [10:28<1:18:58, 9.10s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▊ | 73/594 [10:28<1:18:58, 9.10s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0847, 'learning_rate': 4.259999999999999e-05, 'epoch': 0.12} 12%|█████████▊ | 73/594 [10:28<1:18:58, 9.10s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▊ | 73/594 [10:28<1:18:58, 9.10s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▊ | 73/594 [10:28<1:18:58, 9.10s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▉ | 74/594 [10:37<1:18:15, 9.03s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▉ | 74/594 [10:37<1:18:15, 9.03s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▉ | 74/594 [10:37<1:18:15, 9.03s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▉ | 74/594 [10:37<1:18:15, 9.03s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 75/594 [10:46<1:18:57, 9.13s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 75/594 [10:46<1:18:57, 9.13s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2665, 'learning_rate': 4.3799999999999994e-05, 'epoch': 0.13} 13%|██████████ | 75/594 [10:46<1:18:57, 9.13s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 75/594 [10:46<1:18:57, 9.13s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▏ | 76/594 [10:55<1:17:48, 9.01s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▏ | 76/594 [10:55<1:17:48, 9.01s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.24, 'learning_rate': 4.4399999999999995e-05, 'epoch': 0.13} 13%|██████████▏ | 76/594 [10:55<1:17:48, 9.01s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▏ | 76/594 [10:55<1:17:48, 9.01s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▏ | 76/594 [10:55<1:17:48, 9.01s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▏ | 76/594 [10:55<1:17:48, 9.01s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.219, 'learning_rate': 4.4999999999999996e-05, 'epoch': 0.13} 13%|██████████▏ | 76/594 [10:55<1:17:48, 9.01s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▏ | 76/594 [10:55<1:17:48, 9.01s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▏ | 76/594 [10:55<1:17:48, 9.01s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▌ | 78/594 [11:12<1:15:40, 8.80s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▌ | 78/594 [11:12<1:15:40, 8.80s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1975, 'learning_rate': 4.56e-05, 'epoch': 0.13} 13%|██████████▌ | 78/594 [11:12<1:15:40, 8.80s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▌ | 78/594 [11:12<1:15:40, 8.80s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▋ | 79/594 [11:21<1:14:54, 8.73s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▋ | 79/594 [11:21<1:14:54, 8.73s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1354, 'learning_rate': 4.62e-05, 'epoch': 0.13} 13%|██████████▋ | 79/594 [11:21<1:14:54, 8.73s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▋ | 79/594 [11:21<1:14:54, 8.73s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▊ | 80/594 [11:29<1:13:52, 8.62s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▊ | 80/594 [11:29<1:13:52, 8.62s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3206, 'learning_rate': 4.68e-05, 'epoch': 0.13} 13%|██████████▊ | 80/594 [11:29<1:13:52, 8.62s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▊ | 80/594 [11:29<1:13:52, 8.62s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▉ | 81/594 [11:37<1:12:52, 8.52s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▉ | 81/594 [11:37<1:12:52, 8.52s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2772, 'learning_rate': 4.7399999999999993e-05, 'epoch': 0.14} 14%|██████████▉ | 81/594 [11:37<1:12:52, 8.52s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▉ | 81/594 [11:37<1:12:52, 8.52s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████ | 82/594 [11:46<1:11:56, 8.43s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████ | 82/594 [11:46<1:11:56, 8.43s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2269, 'learning_rate': 4.7999999999999994e-05, 'epoch': 0.14} 14%|███████████ | 82/594 [11:46<1:11:56, 8.43s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████ | 82/594 [11:46<1:11:56, 8.43s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▏ | 83/594 [11:54<1:11:16, 8.37s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▏ | 83/594 [11:54<1:11:16, 8.37s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3281, 'learning_rate': 4.8599999999999995e-05, 'epoch': 0.14} 14%|███████████▏ | 83/594 [11:54<1:11:16, 8.37s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▏ | 83/594 [11:54<1:11:16, 8.37s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▎ | 84/594 [12:02<1:09:49, 8.21s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▎ | 84/594 [12:02<1:09:49, 8.21s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.291, 'learning_rate': 4.9199999999999997e-05, 'epoch': 0.14} 14%|███████████▎ | 84/594 [12:02<1:09:49, 8.21s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▎ | 84/594 [12:02<1:09:49, 8.21s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▍ | 85/594 [12:09<1:08:44, 8.10s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▍ | 85/594 [12:09<1:08:44, 8.10s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2093, 'learning_rate': 4.98e-05, 'epoch': 0.14} 14%|███████████▍ | 85/594 [12:09<1:08:44, 8.10s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▍ | 85/594 [12:09<1:08:44, 8.10s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▌ | 86/594 [12:17<1:07:34, 7.98s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▌ | 86/594 [12:17<1:07:34, 7.98s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1592, 'learning_rate': 5.04e-05, 'epoch': 0.14} 14%|███████████▌ | 86/594 [12:17<1:07:34, 7.98s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:14:41,326 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:14:41,326 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:14:41,326 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4026, 'learning_rate': 5.1e-05, 'epoch': 0.15} [WARNING|modeling_utils.py:388] 2022-03-02 18:14:41,326 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▊ | 88/594 [12:32<1:04:27, 7.64s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▊ | 88/594 [12:32<1:04:27, 7.64s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2414, 'learning_rate': 5.1599999999999994e-05, 'epoch': 0.15} 15%|███████████▊ | 88/594 [12:32<1:04:27, 7.64s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:14:55,462 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:14:55,462 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.245, 'learning_rate': 5.2199999999999995e-05, 'epoch': 0.15} [WARNING|modeling_utils.py:388] 2022-03-02 18:14:55,462 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:14:55,462 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:14:55,462 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████ | 90/594 [12:46<1:00:37, 7.22s/it]g-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:05,385 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:05,385 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:05,385 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:12:28,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████▌ | 91/594 [12:52<57:53, 6.91s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:15:09,873 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████▌ | 91/594 [12:52<57:53, 6.91s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:15:09,873 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████▌ | 91/594 [12:52<57:53, 6.91s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:15:09,873 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████▌ | 91/594 [12:52<57:53, 6.91s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:15:09,873 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████▋ | 92/594 [12:57<54:54, 6.56s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:15:15,571 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████▋ | 92/594 [12:57<54:54, 6.56s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:15:15,571 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████▋ | 92/594 [12:57<54:54, 6.56s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:15:15,571 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:19,482 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:15,571 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:19,482 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:15,571 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:23,234 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:15,571 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:23,234 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:15,571 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▉ | 94/594 [13:08<48:28, 5.82s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▉ | 94/594 [13:08<48:28, 5.82s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▉ | 94/594 [13:08<48:28, 5.82s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:29,000 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:31,157 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:31,157 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:33,115 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:35,108 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:35,108 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:36,866 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:38,626 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:38,626 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:41,741 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:41,741 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:43,128 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:46,052 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:46,052 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.205, 'learning_rate': 5.88e-05, 'epoch': 0.17} [WARNING|modeling_utils.py:388] 2022-03-02 18:15:46,052 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:51,513 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:51,513 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:15:51,513 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▊ | 101/594 [13:40<46:52, 5.70s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▊ | 101/594 [13:40<46:52, 5.70s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▊ | 101/594 [13:40<46:52, 5.70s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▊ | 101/594 [13:40<46:52, 5.70s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▊ | 101/594 [13:40<46:52, 5.70s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▉ | 102/594 [13:50<58:02, 7.08s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▉ | 102/594 [13:50<58:02, 7.08s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▉ | 102/594 [13:50<58:02, 7.08s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▉ | 102/594 [13:50<58:02, 7.08s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▉ | 102/594 [13:50<58:02, 7.08s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▉ | 102/594 [13:50<58:02, 7.08s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2077, 'learning_rate': 6.0599999999999996e-05, 'epoch': 0.17} 17%|█████████████▉ | 102/594 [13:50<58:02, 7.08s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▉ | 102/594 [13:50<58:02, 7.08s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▉ | 102/594 [13:50<58:02, 7.08s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 104/594 [14:11<1:10:17, 8.61s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 104/594 [14:11<1:10:17, 8.61s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.13, 'learning_rate': 6.12e-05, 'epoch': 0.17} 18%|█████████████▊ | 104/594 [14:11<1:10:17, 8.61s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 104/594 [14:11<1:10:17, 8.61s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 105/594 [14:21<1:13:26, 9.01s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 105/594 [14:21<1:13:26, 9.01s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1796, 'learning_rate': 6.18e-05, 'epoch': 0.18} 18%|█████████████▉ | 105/594 [14:21<1:13:26, 9.01s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 105/594 [14:21<1:13:26, 9.01s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████ | 106/594 [14:30<1:15:38, 9.30s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████ | 106/594 [14:30<1:15:38, 9.30s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1452, 'learning_rate': 6.239999999999999e-05, 'epoch': 0.18} 18%|██████████████ | 106/594 [14:30<1:15:38, 9.30s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████ | 106/594 [14:30<1:15:38, 9.30s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████ | 106/594 [14:30<1:15:38, 9.30s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▏ | 107/594 [14:40<1:16:52, 9.47s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▏ | 107/594 [14:40<1:16:52, 9.47s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▏ | 107/594 [14:40<1:16:52, 9.47s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▏ | 107/594 [14:40<1:16:52, 9.47s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▏ | 107/594 [14:40<1:16:52, 9.47s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▎ | 108/594 [14:50<1:17:34, 9.58s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▎ | 108/594 [14:50<1:17:34, 9.58s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▎ | 108/594 [14:50<1:17:34, 9.58s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▎ | 108/594 [14:50<1:17:34, 9.58s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▎ | 108/594 [14:50<1:17:34, 9.58s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▎ | 108/594 [14:50<1:17:34, 9.58s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4238, 'learning_rate': 6.419999999999999e-05, 'epoch': 0.18} 18%|██████████████▎ | 108/594 [14:50<1:17:34, 9.58s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▎ | 108/594 [14:50<1:17:34, 9.58s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▎ | 108/594 [14:50<1:17:34, 9.58s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▋ | 110/594 [15:10<1:18:16, 9.70s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▋ | 110/594 [15:10<1:18:16, 9.70s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2019, 'learning_rate': 6.479999999999999e-05, 'epoch': 0.18} 19%|██████████████▋ | 110/594 [15:10<1:18:16, 9.70s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▋ | 110/594 [15:10<1:18:16, 9.70s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▊ | 111/594 [15:19<1:17:46, 9.66s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▊ | 111/594 [15:19<1:17:46, 9.66s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2325, 'learning_rate': 6.539999999999999e-05, 'epoch': 0.19} 19%|██████████████▊ | 111/594 [15:19<1:17:46, 9.66s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▊ | 111/594 [15:19<1:17:46, 9.66s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2623, 'learning_rate': 6.599999999999999e-05, 'epoch': 0.19} g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████ | 113/594 [15:38<1:16:41, 9.57s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████ | 113/594 [15:38<1:16:41, 9.57s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████ | 113/594 [15:38<1:16:41, 9.57s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████ | 113/594 [15:38<1:16:41, 9.57s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2787, 'learning_rate': 6.72e-05, 'epoch': 0.19} [WARNING|modeling_utils.py:388] 2022-03-02 18:18:09,117 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:18:09,117 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:18:09,117 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▎ | 115/594 [15:57<1:15:35, 9.47s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▎ | 115/594 [15:57<1:15:35, 9.47s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▎ | 115/594 [15:57<1:15:35, 9.47s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▎ | 115/594 [15:57<1:15:35, 9.47s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 116/594 [16:06<1:14:52, 9.40s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 116/594 [16:06<1:14:52, 9.40s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2513, 'learning_rate': 6.84e-05, 'epoch': 0.2} 20%|███████████████▍ | 116/594 [16:06<1:14:52, 9.40s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 116/594 [16:06<1:14:52, 9.40s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 116/594 [16:06<1:14:52, 9.40s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▌ | 117/594 [16:16<1:14:22, 9.35s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▌ | 117/594 [16:16<1:14:22, 9.35s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▌ | 117/594 [16:16<1:14:22, 9.35s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▌ | 117/594 [16:16<1:14:22, 9.35s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▋ | 118/594 [16:25<1:13:55, 9.32s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▋ | 118/594 [16:25<1:13:55, 9.32s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1303, 'learning_rate': 6.96e-05, 'epoch': 0.2} 20%|███████████████▋ | 118/594 [16:25<1:13:55, 9.32s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▋ | 118/594 [16:25<1:13:55, 9.32s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▋ | 118/594 [16:25<1:13:55, 9.32s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▊ | 119/594 [16:34<1:13:13, 9.25s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▊ | 119/594 [16:34<1:13:13, 9.25s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▊ | 119/594 [16:34<1:13:13, 9.25s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▊ | 119/594 [16:34<1:13:13, 9.25s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▉ | 120/594 [16:43<1:12:47, 9.21s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▉ | 120/594 [16:43<1:12:47, 9.21s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1342, 'learning_rate': 7.079999999999999e-05, 'epoch': 0.2} 20%|███████████████▉ | 120/594 [16:43<1:12:47, 9.21s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▉ | 120/594 [16:43<1:12:47, 9.21s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|████████████████ | 121/594 [16:52<1:12:10, 9.16s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|████████████████ | 121/594 [16:52<1:12:10, 9.16s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3001, 'learning_rate': 7.139999999999999e-05, 'epoch': 0.2} 20%|████████████████ | 121/594 [16:52<1:12:10, 9.16s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|████████████████ | 121/594 [16:52<1:12:10, 9.16s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|████████████████ | 121/594 [16:52<1:12:10, 9.16s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 122/594 [17:01<1:11:22, 9.07s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 122/594 [17:01<1:11:22, 9.07s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 122/594 [17:01<1:11:22, 9.07s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 122/594 [17:01<1:11:22, 9.07s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 122/594 [17:01<1:11:22, 9.07s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▎ | 123/594 [17:10<1:10:37, 9.00s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▎ | 123/594 [17:10<1:10:37, 9.00s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▎ | 123/594 [17:10<1:10:37, 9.00s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▎ | 123/594 [17:10<1:10:37, 9.00s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▎ | 123/594 [17:10<1:10:37, 9.00s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▍ | 124/594 [17:18<1:09:49, 8.91s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▍ | 124/594 [17:18<1:09:49, 8.91s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▍ | 124/594 [17:18<1:09:49, 8.91s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▍ | 124/594 [17:18<1:09:49, 8.91s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▌ | 125/594 [17:28<1:10:07, 8.97s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▌ | 125/594 [17:28<1:10:07, 8.97s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2877, 'learning_rate': 7.379999999999999e-05, 'epoch': 0.21} 21%|████████████████▌ | 125/594 [17:28<1:10:07, 8.97s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▌ | 125/594 [17:28<1:10:07, 8.97s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▌ | 125/594 [17:28<1:10:07, 8.97s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▌ | 125/594 [17:28<1:10:07, 8.97s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▊ | 126/594 [17:36<1:09:13, 8.88s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▊ | 126/594 [17:36<1:09:13, 8.88s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▊ | 126/594 [17:36<1:09:13, 8.88s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▊ | 126/594 [17:36<1:09:13, 8.88s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▊ | 126/594 [17:36<1:09:13, 8.88s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▉ | 127/594 [17:45<1:08:30, 8.80s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▉ | 127/594 [17:45<1:08:30, 8.80s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▉ | 127/594 [17:45<1:08:30, 8.80s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▉ | 127/594 [17:45<1:08:30, 8.80s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▉ | 127/594 [17:45<1:08:30, 8.80s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████ | 128/594 [17:53<1:07:26, 8.68s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████ | 128/594 [17:53<1:07:26, 8.68s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████ | 128/594 [17:53<1:07:26, 8.68s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████ | 128/594 [17:53<1:07:26, 8.68s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████ | 128/594 [17:53<1:07:26, 8.68s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▏ | 129/594 [18:02<1:06:27, 8.58s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▏ | 129/594 [18:02<1:06:27, 8.58s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▏ | 129/594 [18:02<1:06:27, 8.58s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▏ | 129/594 [18:02<1:06:27, 8.58s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▏ | 129/594 [18:02<1:06:27, 8.58s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▎ | 130/594 [18:10<1:05:51, 8.52s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▎ | 130/594 [18:10<1:05:51, 8.52s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▎ | 130/594 [18:10<1:05:51, 8.52s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▎ | 130/594 [18:10<1:05:51, 8.52s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▎ | 130/594 [18:10<1:05:51, 8.52s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▍ | 131/594 [18:18<1:05:09, 8.44s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▍ | 131/594 [18:18<1:05:09, 8.44s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▍ | 131/594 [18:18<1:05:09, 8.44s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▍ | 131/594 [18:18<1:05:09, 8.44s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▌ | 132/594 [18:26<1:04:16, 8.35s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▌ | 132/594 [18:26<1:04:16, 8.35s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2138, 'learning_rate': 7.8e-05, 'epoch': 0.22} 22%|█████████████████▌ | 132/594 [18:26<1:04:16, 8.35s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▌ | 132/594 [18:26<1:04:16, 8.35s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▌ | 132/594 [18:26<1:04:16, 8.35s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▋ | 133/594 [18:34<1:02:48, 8.18s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▋ | 133/594 [18:34<1:02:48, 8.18s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▋ | 133/594 [18:34<1:02:48, 8.18s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▋ | 133/594 [18:34<1:02:48, 8.18s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▋ | 133/594 [18:34<1:02:48, 8.18s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▊ | 134/594 [18:42<1:01:55, 8.08s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▊ | 134/594 [18:42<1:01:55, 8.08s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▊ | 134/594 [18:42<1:01:55, 8.08s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▊ | 134/594 [18:42<1:01:55, 8.08s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▊ | 134/594 [18:42<1:01:55, 8.08s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▉ | 135/594 [18:50<1:01:01, 7.98s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▉ | 135/594 [18:50<1:01:01, 7.98s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▉ | 135/594 [18:50<1:01:01, 7.98s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▉ | 135/594 [18:50<1:01:01, 7.98s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▉ | 135/594 [18:50<1:01:01, 7.98s/it]g-point operations will not be computed-02 18:15:25,710 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▌ | 136/594 [18:57<59:55, 7.85s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:21:15,884 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▌ | 136/594 [18:57<59:55, 7.85s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:21:15,884 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▌ | 136/594 [18:57<59:55, 7.85s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:21:15,884 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▋ | 137/594 [19:05<58:56, 7.74s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:21:15,884 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▋ | 137/594 [19:05<58:56, 7.74s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:21:15,884 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1802, 'learning_rate': 8.1e-05, 'epoch': 0.23} 23%|██████████████████▋ | 137/594 [19:05<58:56, 7.74s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:21:15,884 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▋ | 137/594 [19:05<58:56, 7.74s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:21:15,884 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▋ | 137/594 [19:05<58:56, 7.74s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:21:15,884 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▊ | 138/594 [19:12<57:37, 7.58s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:21:15,884 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▊ | 138/594 [19:12<57:37, 7.58s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:21:15,884 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:21:33,963 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:21:15,884 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▉ | 139/594 [19:19<56:21, 7.43s/it]g-point operations will not be computed-02 18:21:15,884 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▉ | 139/594 [19:19<56:21, 7.43s/it]g-point operations will not be computed-02 18:21:15,884 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2233, 'learning_rate': 8.22e-05, 'epoch': 0.23} 23%|██████████████████▉ | 139/594 [19:19<56:21, 7.43s/it]g-point operations will not be computed-02 18:21:15,884 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▉ | 139/594 [19:19<56:21, 7.43s/it]g-point operations will not be computed-02 18:21:15,884 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▉ | 139/594 [19:19<56:21, 7.43s/it]g-point operations will not be computed-02 18:21:15,884 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████ | 140/594 [19:26<54:56, 7.26s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:21:44,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████ | 140/594 [19:26<54:56, 7.26s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:21:44,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████ | 140/594 [19:26<54:56, 7.26s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:21:44,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████ | 140/594 [19:26<54:56, 7.26s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:21:44,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▏ | 141/594 [19:32<53:01, 7.02s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:21:44,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:21:52,102 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:21:44,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:21:52,102 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:21:44,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:21:52,102 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:21:44,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▎ | 142/594 [19:38<50:30, 6.71s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:21:56,483 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▎ | 142/594 [19:38<50:30, 6.71s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:21:56,483 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▎ | 142/594 [19:38<50:30, 6.71s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:21:56,483 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▎ | 142/594 [19:38<50:30, 6.71s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:21:56,483 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▌ | 143/594 [19:44<48:02, 6.39s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:22:04,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:22:04,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▋ | 144/594 [19:49<44:42, 5.96s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:22:08,005 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:22:10,202 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:22:10,202 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:22:12,405 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:22:14,509 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:22:14,509 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:22:16,437 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:22:18,160 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:22:18,160 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:22:19,871 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:22:19,871 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:22:22,822 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:22:24,112 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:22:24,112 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:22:26,999 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:22:26,999 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5903, 'learning_rate': 8.879999999999999e-05, 'epoch': 0.25} [WARNING|modeling_utils.py:388] 2022-03-02 18:22:26,999 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:22:32,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:22:32,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:22:32,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████▌ | 151/594 [20:21<41:20, 5.60s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████▌ | 151/594 [20:21<41:20, 5.60s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████▌ | 151/594 [20:21<41:20, 5.60s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████▌ | 151/594 [20:21<41:20, 5.60s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▋ | 152/594 [20:31<51:24, 6.98s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▋ | 152/594 [20:31<51:24, 6.98s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2278, 'learning_rate': 8.999999999999999e-05, 'epoch': 0.26} 26%|████████████████████▋ | 152/594 [20:31<51:24, 6.98s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▋ | 152/594 [20:31<51:24, 6.98s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▋ | 152/594 [20:31<51:24, 6.98s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▊ | 153/594 [20:41<58:04, 7.90s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▊ | 153/594 [20:41<58:04, 7.90s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▊ | 153/594 [20:41<58:04, 7.90s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▊ | 153/594 [20:41<58:04, 7.90s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▍ | 154/594 [20:51<1:02:26, 8.51s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▍ | 154/594 [20:51<1:02:26, 8.51s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.145, 'learning_rate': 9.12e-05, 'epoch': 0.26} 26%|████████████████████▍ | 154/594 [20:51<1:02:26, 8.51s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▍ | 154/594 [20:51<1:02:26, 8.51s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▌ | 155/594 [21:01<1:05:12, 8.91s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▌ | 155/594 [21:01<1:05:12, 8.91s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1925, 'learning_rate': 9.18e-05, 'epoch': 0.26} 26%|████████████████████▌ | 155/594 [21:01<1:05:12, 8.91s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▌ | 155/594 [21:01<1:05:12, 8.91s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▋ | 156/594 [21:11<1:06:57, 9.17s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▋ | 156/594 [21:11<1:06:57, 9.17s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1712, 'learning_rate': 9.24e-05, 'epoch': 0.26} [WARNING|modeling_utils.py:388] 2022-03-02 18:23:34,553 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:23:34,553 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:23:34,553 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:23:34,553 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.301, 'learning_rate': 9.3e-05, 'epoch': 0.26} [WARNING|modeling_utils.py:388] 2022-03-02 18:23:34,553 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:23:34,553 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████ | 158/594 [21:30<1:08:43, 9.46s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████ | 158/594 [21:30<1:08:43, 9.46s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1469, 'learning_rate': 9.36e-05, 'epoch': 0.27} 27%|█████████████████████ | 158/594 [21:30<1:08:43, 9.46s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████ | 158/594 [21:30<1:08:43, 9.46s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▏ | 159/594 [21:40<1:09:05, 9.53s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▏ | 159/594 [21:40<1:09:05, 9.53s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0775, 'learning_rate': 9.419999999999999e-05, 'epoch': 0.27} 27%|█████████████████████▏ | 159/594 [21:40<1:09:05, 9.53s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▏ | 159/594 [21:40<1:09:05, 9.53s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▏ | 159/594 [21:40<1:09:05, 9.53s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▎ | 160/594 [21:49<1:09:16, 9.58s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▎ | 160/594 [21:49<1:09:16, 9.58s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▎ | 160/594 [21:49<1:09:16, 9.58s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▎ | 160/594 [21:49<1:09:16, 9.58s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▍ | 161/594 [21:59<1:09:14, 9.60s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▍ | 161/594 [21:59<1:09:14, 9.60s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2729, 'learning_rate': 9.539999999999999e-05, 'epoch': 0.27} 27%|█████████████████████▍ | 161/594 [21:59<1:09:14, 9.60s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▍ | 161/594 [21:59<1:09:14, 9.60s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▍ | 161/594 [21:59<1:09:14, 9.60s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▌ | 162/594 [22:09<1:08:53, 9.57s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▌ | 162/594 [22:09<1:08:53, 9.57s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▌ | 162/594 [22:09<1:08:53, 9.57s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:24:34,647 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:24:34,647 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1282, 'learning_rate': 9.659999999999999e-05, 'epoch': 0.27} [WARNING|modeling_utils.py:388] 2022-03-02 18:24:34,647 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:24:34,647 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:24:34,647 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:24:34,647 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▊ | 164/594 [22:28<1:08:10, 9.51s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▊ | 164/594 [22:28<1:08:10, 9.51s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▊ | 164/594 [22:28<1:08:10, 9.51s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▊ | 164/594 [22:28<1:08:10, 9.51s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▊ | 164/594 [22:28<1:08:10, 9.51s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▉ | 165/594 [22:37<1:07:33, 9.45s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▉ | 165/594 [22:37<1:07:33, 9.45s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▉ | 165/594 [22:37<1:07:33, 9.45s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▉ | 165/594 [22:37<1:07:33, 9.45s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████ | 166/594 [22:46<1:07:12, 9.42s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████ | 166/594 [22:46<1:07:12, 9.42s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3345, 'learning_rate': 9.839999999999999e-05, 'epoch': 0.28} 28%|██████████████████████ | 166/594 [22:46<1:07:12, 9.42s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████ | 166/594 [22:46<1:07:12, 9.42s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████ | 166/594 [22:46<1:07:12, 9.42s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▏ | 167/594 [22:56<1:06:55, 9.40s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▏ | 167/594 [22:56<1:06:55, 9.40s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▏ | 167/594 [22:56<1:06:55, 9.40s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▏ | 167/594 [22:56<1:06:55, 9.40s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▎ | 168/594 [23:05<1:06:14, 9.33s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▎ | 168/594 [23:05<1:06:14, 9.33s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3872, 'learning_rate': 9.96e-05, 'epoch': 0.28} 28%|██████████████████████▎ | 168/594 [23:05<1:06:14, 9.33s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▎ | 168/594 [23:05<1:06:14, 9.33s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▎ | 168/594 [23:05<1:06:14, 9.33s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▍ | 169/594 [23:14<1:05:45, 9.28s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▍ | 169/594 [23:14<1:05:45, 9.28s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▍ | 169/594 [23:14<1:05:45, 9.28s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▍ | 169/594 [23:14<1:05:45, 9.28s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▌ | 170/594 [23:23<1:05:27, 9.26s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▌ | 170/594 [23:23<1:05:27, 9.26s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2435, 'learning_rate': 0.0001008, 'epoch': 0.29} 29%|██████████████████████▌ | 170/594 [23:23<1:05:27, 9.26s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▌ | 170/594 [23:23<1:05:27, 9.26s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▌ | 170/594 [23:23<1:05:27, 9.26s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▋ | 171/594 [23:32<1:04:57, 9.21s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▋ | 171/594 [23:32<1:04:57, 9.21s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▋ | 171/594 [23:32<1:04:57, 9.21s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▋ | 171/594 [23:32<1:04:57, 9.21s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▉ | 172/594 [23:41<1:04:15, 9.14s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▉ | 172/594 [23:41<1:04:15, 9.14s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1444, 'learning_rate': 0.000102, 'epoch': 0.29} 29%|██████████████████████▉ | 172/594 [23:41<1:04:15, 9.14s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▉ | 172/594 [23:41<1:04:15, 9.14s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████ | 173/594 [23:50<1:03:59, 9.12s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████ | 173/594 [23:50<1:03:59, 9.12s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1539, 'learning_rate': 0.0001026, 'epoch': 0.29} 29%|███████████████████████ | 173/594 [23:50<1:03:59, 9.12s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████ | 173/594 [23:50<1:03:59, 9.12s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████ | 173/594 [23:50<1:03:59, 9.12s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▏ | 174/594 [23:59<1:03:14, 9.03s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▏ | 174/594 [23:59<1:03:14, 9.03s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▏ | 174/594 [23:59<1:03:14, 9.03s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▏ | 174/594 [23:59<1:03:14, 9.03s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▏ | 174/594 [23:59<1:03:14, 9.03s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▎ | 175/594 [24:08<1:03:28, 9.09s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▎ | 175/594 [24:08<1:03:28, 9.09s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▎ | 175/594 [24:08<1:03:28, 9.09s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▎ | 175/594 [24:08<1:03:28, 9.09s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▎ | 175/594 [24:08<1:03:28, 9.09s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▍ | 176/594 [24:17<1:02:25, 8.96s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▍ | 176/594 [24:17<1:02:25, 8.96s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▍ | 176/594 [24:17<1:02:25, 8.96s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▍ | 176/594 [24:17<1:02:25, 8.96s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▌ | 177/594 [24:26<1:01:39, 8.87s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▌ | 177/594 [24:26<1:01:39, 8.87s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3348, 'learning_rate': 0.00010499999999999999, 'epoch': 0.3} 30%|███████████████████████▌ | 177/594 [24:26<1:01:39, 8.87s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:26:50,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:26:50,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1479, 'learning_rate': 0.00010559999999999998, 'epoch': 0.3} [WARNING|modeling_utils.py:388] 2022-03-02 18:26:50,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:26:50,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:26:50,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:26:50,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▊ | 179/594 [24:43<1:00:24, 8.73s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▊ | 179/594 [24:43<1:00:24, 8.73s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▊ | 179/594 [24:43<1:00:24, 8.73s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▊ | 179/594 [24:43<1:00:24, 8.73s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▊ | 179/594 [24:43<1:00:24, 8.73s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▌ | 180/594 [24:51<59:31, 8.63s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▌ | 180/594 [24:51<59:31, 8.63s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▌ | 180/594 [24:51<59:31, 8.63s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▌ | 180/594 [24:51<59:31, 8.63s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▌ | 180/594 [24:51<59:31, 8.63s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▋ | 181/594 [24:59<58:45, 8.54s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▋ | 181/594 [24:59<58:45, 8.54s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▋ | 181/594 [24:59<58:45, 8.54s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▋ | 181/594 [24:59<58:45, 8.54s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▋ | 181/594 [24:59<58:45, 8.54s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▊ | 182/594 [25:08<57:46, 8.41s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▊ | 182/594 [25:08<57:46, 8.41s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▊ | 182/594 [25:08<57:46, 8.41s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▊ | 182/594 [25:08<57:46, 8.41s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▊ | 182/594 [25:08<57:46, 8.41s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▉ | 183/594 [25:16<57:17, 8.36s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▉ | 183/594 [25:16<57:17, 8.36s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▉ | 183/594 [25:16<57:17, 8.36s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▉ | 183/594 [25:16<57:17, 8.36s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▉ | 183/594 [25:16<57:17, 8.36s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████ | 184/594 [25:24<56:08, 8.21s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████ | 184/594 [25:24<56:08, 8.21s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████ | 184/594 [25:24<56:08, 8.21s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████ | 184/594 [25:24<56:08, 8.21s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████ | 184/594 [25:24<56:08, 8.21s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▏ | 185/594 [25:32<55:07, 8.09s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▏ | 185/594 [25:32<55:07, 8.09s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▏ | 185/594 [25:32<55:07, 8.09s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▏ | 185/594 [25:32<55:07, 8.09s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▏ | 185/594 [25:32<55:07, 8.09s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▎ | 186/594 [25:39<54:00, 7.94s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▎ | 186/594 [25:39<54:00, 7.94s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:01,360 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:01,360 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▌ | 187/594 [25:47<52:56, 7.81s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▌ | 187/594 [25:47<52:56, 7.81s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▌ | 187/594 [25:47<52:56, 7.81s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▌ | 187/594 [25:47<52:56, 7.81s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▌ | 187/594 [25:47<52:56, 7.81s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|█████████████████████████▋ | 188/594 [25:54<51:53, 7.67s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|█████████████████████████▋ | 188/594 [25:54<51:53, 7.67s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|█████████████████████████▋ | 188/594 [25:54<51:53, 7.67s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:17,541 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:17,541 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4813, 'learning_rate': 0.00011219999999999999, 'epoch': 0.32} [WARNING|modeling_utils.py:388] 2022-03-02 18:28:17,541 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:17,541 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:17,541 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|█████████████████████████▉ | 190/594 [26:08<48:49, 7.25s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:27,524 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:27,524 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:27,524 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|██████████████████████████ | 191/594 [26:14<46:47, 6.97s/it]g-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:33,759 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:33,759 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:33,759 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:22:02,017 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|██████████████████████████▏ | 192/594 [26:20<44:43, 6.68s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|██████████████████████████▏ | 192/594 [26:20<44:43, 6.68s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:41,993 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:41,993 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6878, 'learning_rate': 0.0001146, 'epoch': 0.32} [WARNING|modeling_utils.py:388] 2022-03-02 18:28:45,849 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:45,849 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|██████████████████████████▍ | 194/594 [26:30<39:31, 5.93s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:49,520 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:51,788 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:51,788 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:53,979 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:55,938 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:55,938 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:57,901 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:57,901 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:28:59,622 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:29:01,420 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:29:01,420 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:29:04,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:29:04,421 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:29:05,674 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:29:08,506 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:29:08,506 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5335, 'learning_rate': 0.0001188, 'epoch': 0.34} [WARNING|modeling_utils.py:388] 2022-03-02 18:29:08,506 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:29:14,058 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:29:14,058 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:29:14,058 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▍ | 201/594 [27:03<37:13, 5.68s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▍ | 201/594 [27:03<37:13, 5.68s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▍ | 201/594 [27:03<37:13, 5.68s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▍ | 201/594 [27:03<37:13, 5.68s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▍ | 201/594 [27:03<37:13, 5.68s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▌ | 202/594 [27:13<46:04, 7.05s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▌ | 202/594 [27:13<46:04, 7.05s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▌ | 202/594 [27:13<46:04, 7.05s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▌ | 202/594 [27:13<46:04, 7.05s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▋ | 203/594 [27:23<51:40, 7.93s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▋ | 203/594 [27:23<51:40, 7.93s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3993, 'learning_rate': 0.00012059999999999999, 'epoch': 0.34} 34%|███████████████████████████▋ | 203/594 [27:23<51:40, 7.93s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▋ | 203/594 [27:23<51:40, 7.93s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▊ | 204/594 [27:33<55:29, 8.54s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▊ | 204/594 [27:33<55:29, 8.54s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1615, 'learning_rate': 0.00012119999999999999, 'epoch': 0.34} 34%|███████████████████████████▊ | 204/594 [27:33<55:29, 8.54s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▊ | 204/594 [27:33<55:29, 8.54s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▉ | 205/594 [27:43<58:05, 8.96s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▉ | 205/594 [27:43<58:05, 8.96s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.226, 'learning_rate': 0.00012179999999999999, 'epoch': 0.34} 35%|███████████████████████████▉ | 205/594 [27:43<58:05, 8.96s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▉ | 205/594 [27:43<58:05, 8.96s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|████████████████████████████ | 206/594 [27:53<59:46, 9.24s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|████████████████████████████ | 206/594 [27:53<59:46, 9.24s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1952, 'learning_rate': 0.0001224, 'epoch': 0.35} 35%|████████████████████████████ | 206/594 [27:53<59:46, 9.24s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|████████████████████████████ | 206/594 [27:53<59:46, 9.24s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|████████████████████████████ | 206/594 [27:53<59:46, 9.24s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▌ | 207/594 [28:02<1:00:40, 9.41s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▌ | 207/594 [28:02<1:00:40, 9.41s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▌ | 207/594 [28:02<1:00:40, 9.41s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▌ | 207/594 [28:02<1:00:40, 9.41s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▋ | 208/594 [28:12<1:01:20, 9.53s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▋ | 208/594 [28:12<1:01:20, 9.53s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3538, 'learning_rate': 0.0001236, 'epoch': 0.35} 35%|███████████████████████████▋ | 208/594 [28:12<1:01:20, 9.53s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▋ | 208/594 [28:12<1:01:20, 9.53s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▊ | 209/594 [28:22<1:01:27, 9.58s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▊ | 209/594 [28:22<1:01:27, 9.58s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.265, 'learning_rate': 0.00012419999999999998, 'epoch': 0.35} 35%|███████████████████████████▊ | 209/594 [28:22<1:01:27, 9.58s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▊ | 209/594 [28:22<1:01:27, 9.58s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▉ | 210/594 [28:32<1:01:27, 9.60s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▉ | 210/594 [28:32<1:01:27, 9.60s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3468, 'learning_rate': 0.00012479999999999997, 'epoch': 0.35} 35%|███████████████████████████▉ | 210/594 [28:32<1:01:27, 9.60s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▉ | 210/594 [28:32<1:01:27, 9.60s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▉ | 210/594 [28:32<1:01:27, 9.60s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3613, 'learning_rate': 0.00012539999999999999, 'epoch': 0.35} 35%|███████████████████████████▉ | 210/594 [28:32<1:01:27, 9.60s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▉ | 210/594 [28:32<1:01:27, 9.60s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▉ | 210/594 [28:32<1:01:27, 9.60s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▉ | 210/594 [28:32<1:01:27, 9.60s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▉ | 210/594 [28:32<1:01:27, 9.60s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▏ | 212/594 [28:51<1:01:08, 9.60s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▏ | 212/594 [28:51<1:01:08, 9.60s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▏ | 212/594 [28:51<1:01:08, 9.60s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▏ | 212/594 [28:51<1:01:08, 9.60s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▏ | 212/594 [28:51<1:01:08, 9.60s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▎ | 213/594 [29:00<1:00:45, 9.57s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▎ | 213/594 [29:00<1:00:45, 9.57s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▎ | 213/594 [29:00<1:00:45, 9.57s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▎ | 213/594 [29:00<1:00:45, 9.57s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▍ | 214/594 [29:10<1:00:21, 9.53s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▍ | 214/594 [29:10<1:00:21, 9.53s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3405, 'learning_rate': 0.00012719999999999997, 'epoch': 0.36} 36%|████████████████████████████▍ | 214/594 [29:10<1:00:21, 9.53s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▍ | 214/594 [29:10<1:00:21, 9.53s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▍ | 214/594 [29:10<1:00:21, 9.53s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▌ | 215/594 [29:19<1:00:05, 9.51s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▌ | 215/594 [29:19<1:00:05, 9.51s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▌ | 215/594 [29:19<1:00:05, 9.51s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▌ | 215/594 [29:19<1:00:05, 9.51s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▌ | 215/594 [29:19<1:00:05, 9.51s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▌ | 215/594 [29:19<1:00:05, 9.51s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.277, 'learning_rate': 0.00012839999999999998, 'epoch': 0.36} 36%|████████████████████████████▌ | 215/594 [29:19<1:00:05, 9.51s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▌ | 215/594 [29:19<1:00:05, 9.51s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▌ | 215/594 [29:19<1:00:05, 9.51s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▌ | 217/594 [29:37<58:30, 9.31s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▌ | 217/594 [29:37<58:30, 9.31s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▌ | 217/594 [29:37<58:30, 9.31s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▌ | 217/594 [29:37<58:30, 9.31s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▋ | 218/594 [29:47<57:54, 9.24s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▋ | 218/594 [29:47<57:54, 9.24s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3264, 'learning_rate': 0.00012959999999999998, 'epoch': 0.37} 37%|█████████████████████████████▋ | 218/594 [29:47<57:54, 9.24s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▋ | 218/594 [29:47<57:54, 9.24s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▊ | 219/594 [29:56<57:35, 9.21s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▊ | 219/594 [29:56<57:35, 9.21s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4251, 'learning_rate': 0.0001302, 'epoch': 0.37} 37%|█████████████████████████████▊ | 219/594 [29:56<57:35, 9.21s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▊ | 219/594 [29:56<57:35, 9.21s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████ | 220/594 [30:05<57:06, 9.16s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████ | 220/594 [30:05<57:06, 9.16s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4401, 'learning_rate': 0.00013079999999999998, 'epoch': 0.37} 37%|██████████████████████████████ | 220/594 [30:05<57:06, 9.16s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████ | 220/594 [30:05<57:06, 9.16s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████▏ | 221/594 [30:14<56:38, 9.11s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████▏ | 221/594 [30:14<56:38, 9.11s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2936, 'learning_rate': 0.0001314, 'epoch': 0.37} 37%|██████████████████████████████▏ | 221/594 [30:14<56:38, 9.11s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████▏ | 221/594 [30:14<56:38, 9.11s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████▏ | 221/594 [30:14<56:38, 9.11s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████▎ | 222/594 [30:23<56:15, 9.07s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████▎ | 222/594 [30:23<56:15, 9.07s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████▎ | 222/594 [30:23<56:15, 9.07s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████▎ | 222/594 [30:23<56:15, 9.07s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▍ | 223/594 [30:31<55:31, 8.98s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▍ | 223/594 [30:31<55:31, 8.98s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3694, 'learning_rate': 0.0001326, 'epoch': 0.37} 38%|██████████████████████████████▍ | 223/594 [30:31<55:31, 8.98s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▍ | 223/594 [30:31<55:31, 8.98s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▌ | 224/594 [30:40<54:47, 8.88s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▌ | 224/594 [30:40<54:47, 8.88s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:33:01,120 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:33:01,120 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:33:01,120 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:33:01,120 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:33:01,120 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5794, 'learning_rate': 0.0001338, 'epoch': 0.38} [WARNING|modeling_utils.py:388] 2022-03-02 18:33:01,120 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:33:01,120 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▊ | 226/594 [30:58<54:23, 8.87s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▊ | 226/594 [30:58<54:23, 8.87s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2256, 'learning_rate': 0.0001344, 'epoch': 0.38} 38%|██████████████████████████████▊ | 226/594 [30:58<54:23, 8.87s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▊ | 226/594 [30:58<54:23, 8.87s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▊ | 226/594 [30:58<54:23, 8.87s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▉ | 227/594 [31:06<53:34, 8.76s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▉ | 227/594 [31:06<53:34, 8.76s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▉ | 227/594 [31:06<53:34, 8.76s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▉ | 227/594 [31:06<53:34, 8.76s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▉ | 227/594 [31:06<53:34, 8.76s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|███████████████████████████████ | 228/594 [31:15<52:56, 8.68s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|███████████████████████████████ | 228/594 [31:15<52:56, 8.68s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|███████████████████████████████ | 228/594 [31:15<52:56, 8.68s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|███████████████████████████████ | 228/594 [31:15<52:56, 8.68s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4223, 'learning_rate': 0.0001362, 'epoch': 0.39} g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▎ | 230/594 [31:32<51:36, 8.51s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▎ | 230/594 [31:32<51:36, 8.51s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.384, 'learning_rate': 0.0001368, 'epoch': 0.39} 39%|███████████████████████████████▎ | 230/594 [31:32<51:36, 8.51s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▎ | 230/594 [31:32<51:36, 8.51s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▌ | 231/594 [31:40<50:52, 8.41s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▌ | 231/594 [31:40<50:52, 8.41s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1981, 'learning_rate': 0.0001374, 'epoch': 0.39} 39%|███████████████████████████████▌ | 231/594 [31:40<50:52, 8.41s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▌ | 231/594 [31:40<50:52, 8.41s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▋ | 232/594 [31:48<50:12, 8.32s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▋ | 232/594 [31:48<50:12, 8.32s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2797, 'learning_rate': 0.000138, 'epoch': 0.39} 39%|███████████████████████████████▋ | 232/594 [31:48<50:12, 8.32s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▋ | 232/594 [31:48<50:12, 8.32s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▋ | 232/594 [31:48<50:12, 8.32s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▊ | 233/594 [31:56<49:34, 8.24s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▊ | 233/594 [31:56<49:34, 8.24s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▊ | 233/594 [31:56<49:34, 8.24s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▊ | 233/594 [31:56<49:34, 8.24s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▊ | 233/594 [31:56<49:34, 8.24s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▉ | 234/594 [32:04<48:50, 8.14s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▉ | 234/594 [32:04<48:50, 8.14s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▉ | 234/594 [32:04<48:50, 8.14s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▉ | 234/594 [32:04<48:50, 8.14s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▉ | 234/594 [32:04<48:50, 8.14s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████ | 235/594 [32:12<48:08, 8.04s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████ | 235/594 [32:12<48:08, 8.04s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████ | 235/594 [32:12<48:08, 8.04s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████ | 235/594 [32:12<48:08, 8.04s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████ | 235/594 [32:12<48:08, 8.04s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▏ | 236/594 [32:19<47:06, 7.90s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▏ | 236/594 [32:19<47:06, 7.90s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:34:41,313 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:34:41,313 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▎ | 237/594 [32:26<45:47, 7.70s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▎ | 237/594 [32:26<45:47, 7.70s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▎ | 237/594 [32:26<45:47, 7.70s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▎ | 237/594 [32:26<45:47, 7.70s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▎ | 237/594 [32:26<45:47, 7.70s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▍ | 238/594 [32:34<44:38, 7.52s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▍ | 238/594 [32:34<44:38, 7.52s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:34:55,518 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:34:55,518 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▌ | 239/594 [32:41<43:44, 7.39s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▌ | 239/594 [32:41<43:44, 7.39s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▌ | 239/594 [32:41<43:44, 7.39s/it]g-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:03,982 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:03,982 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3545, 'learning_rate': 0.00014279999999999997, 'epoch': 0.4} [WARNING|modeling_utils.py:388] 2022-03-02 18:35:03,982 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:03,982 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:03,982 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:28:38,065 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|████████████████████████████████▊ | 241/594 [32:54<40:35, 6.90s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:35:11,840 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|████████████████████████████████▊ | 241/594 [32:54<40:35, 6.90s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:35:11,840 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:16,195 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:11,840 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:16,195 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:11,840 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4465, 'learning_rate': 0.00014399999999999998, 'epoch': 0.41} [WARNING|modeling_utils.py:388] 2022-03-02 18:35:16,195 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:11,840 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:16,195 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:11,840 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:21,801 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:11,840 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:21,801 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:11,840 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:25,772 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:11,840 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:25,772 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:11,840 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|█████████████████████████████████▎ | 244/594 [33:10<34:51, 5.98s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:30,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:30,570 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|█████████████████████████████████▍ | 245/594 [33:15<32:31, 5.59s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:33,895 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:33,895 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:35,857 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:37,807 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:37,807 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:39,565 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:42,757 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:42,757 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:44,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:44,253 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:46,857 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:48,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:48,445 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6152, 'learning_rate': 0.00014879999999999998, 'epoch': 0.42} [WARNING|modeling_utils.py:388] 2022-03-02 18:35:53,914 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:53,914 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:59,052 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:59,052 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4272, 'learning_rate': 0.0001494, 'epoch': 0.42} [WARNING|modeling_utils.py:388] 2022-03-02 18:35:59,052 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:59,052 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:35:59,052 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 42%|██████████████████████████████████▎ | 252/594 [33:53<39:56, 7.01s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 42%|██████████████████████████████████▎ | 252/594 [33:53<39:56, 7.01s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.582, 'learning_rate': 0.00015, 'epoch': 0.42} 42%|██████████████████████████████████▎ | 252/594 [33:53<39:56, 7.01s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 42%|██████████████████████████████████▎ | 252/594 [33:53<39:56, 7.01s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▌ | 253/594 [34:03<44:56, 7.91s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▌ | 253/594 [34:03<44:56, 7.91s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7457, 'learning_rate': 0.00015059999999999997, 'epoch': 0.43} 43%|██████████████████████████████████▌ | 253/594 [34:03<44:56, 7.91s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▌ | 253/594 [34:03<44:56, 7.91s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▋ | 254/594 [34:13<48:24, 8.54s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▋ | 254/594 [34:13<48:24, 8.54s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4816, 'learning_rate': 0.0001512, 'epoch': 0.43} 43%|██████████████████████████████████▋ | 254/594 [34:13<48:24, 8.54s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▋ | 254/594 [34:13<48:24, 8.54s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▊ | 255/594 [34:23<50:37, 8.96s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▊ | 255/594 [34:23<50:37, 8.96s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8845, 'learning_rate': 0.00015179999999999998, 'epoch': 0.43} 43%|██████████████████████████████████▊ | 255/594 [34:23<50:37, 8.96s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▊ | 255/594 [34:23<50:37, 8.96s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▉ | 256/594 [34:32<51:55, 9.22s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▉ | 256/594 [34:32<51:55, 9.22s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8351, 'learning_rate': 0.0001524, 'epoch': 0.43} 43%|██████████████████████████████████▉ | 256/594 [34:32<51:55, 9.22s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▉ | 256/594 [34:32<51:55, 9.22s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|███████████████████████████████████ | 257/594 [34:42<52:37, 9.37s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|███████████████████████████████████ | 257/594 [34:42<52:37, 9.37s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5857, 'learning_rate': 0.00015299999999999998, 'epoch': 0.43} 43%|███████████████████████████████████ | 257/594 [34:42<52:37, 9.37s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|███████████████████████████████████ | 257/594 [34:42<52:37, 9.37s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|███████████████████████████████████▏ | 258/594 [34:52<53:04, 9.48s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|███████████████████████████████████▏ | 258/594 [34:52<53:04, 9.48s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.389, 'learning_rate': 0.0001536, 'epoch': 0.43} 43%|███████████████████████████████████▏ | 258/594 [34:52<53:04, 9.48s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|███████████████████████████████████▏ | 258/594 [34:52<53:04, 9.48s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▎ | 259/594 [35:02<53:34, 9.60s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▎ | 259/594 [35:02<53:34, 9.60s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4282, 'learning_rate': 0.00015419999999999998, 'epoch': 0.44} 44%|███████████████████████████████████▎ | 259/594 [35:02<53:34, 9.60s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▎ | 259/594 [35:02<53:34, 9.60s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▍ | 260/594 [35:11<53:36, 9.63s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▍ | 260/594 [35:11<53:36, 9.63s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.647, 'learning_rate': 0.0001548, 'epoch': 0.44} 44%|███████████████████████████████████▍ | 260/594 [35:11<53:36, 9.63s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▍ | 260/594 [35:11<53:36, 9.63s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▍ | 260/594 [35:11<53:36, 9.63s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▍ | 260/594 [35:11<53:36, 9.63s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▌ | 261/594 [35:21<53:06, 9.57s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▌ | 261/594 [35:21<53:06, 9.57s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▌ | 261/594 [35:21<53:06, 9.57s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▌ | 261/594 [35:21<53:06, 9.57s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▌ | 261/594 [35:21<53:06, 9.57s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▋ | 262/594 [35:30<52:45, 9.53s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▋ | 262/594 [35:30<52:45, 9.53s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▋ | 262/594 [35:30<52:45, 9.53s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▋ | 262/594 [35:30<52:45, 9.53s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▊ | 263/594 [35:40<52:30, 9.52s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▊ | 263/594 [35:40<52:30, 9.52s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8514, 'learning_rate': 0.00015659999999999998, 'epoch': 0.44} 44%|███████████████████████████████████▊ | 263/594 [35:40<52:30, 9.52s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▊ | 263/594 [35:40<52:30, 9.52s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▊ | 263/594 [35:40<52:30, 9.52s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|████████████████████████████████████ | 264/594 [35:49<52:03, 9.47s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|████████████████████████████████████ | 264/594 [35:49<52:03, 9.47s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|████████████████████████████████████ | 264/594 [35:49<52:03, 9.47s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|████████████████████████████████████ | 264/594 [35:49<52:03, 9.47s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|████████████████████████████████████ | 264/594 [35:49<52:03, 9.47s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▏ | 265/594 [35:58<51:39, 9.42s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▏ | 265/594 [35:58<51:39, 9.42s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▏ | 265/594 [35:58<51:39, 9.42s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▏ | 265/594 [35:58<51:39, 9.42s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▏ | 265/594 [35:58<51:39, 9.42s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▎ | 266/594 [36:08<51:10, 9.36s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▎ | 266/594 [36:08<51:10, 9.36s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▎ | 266/594 [36:08<51:10, 9.36s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▎ | 266/594 [36:08<51:10, 9.36s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▎ | 266/594 [36:08<51:10, 9.36s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▍ | 267/594 [36:17<50:38, 9.29s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▍ | 267/594 [36:17<50:38, 9.29s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▍ | 267/594 [36:17<50:38, 9.29s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▍ | 267/594 [36:17<50:38, 9.29s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▍ | 267/594 [36:17<50:38, 9.29s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▍ | 267/594 [36:17<50:38, 9.29s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4451, 'learning_rate': 0.0001596, 'epoch': 0.45} 45%|████████████████████████████████████▍ | 267/594 [36:17<50:38, 9.29s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▍ | 267/594 [36:17<50:38, 9.29s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▍ | 267/594 [36:17<50:38, 9.29s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▍ | 267/594 [36:17<50:38, 9.29s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▋ | 269/594 [36:35<49:46, 9.19s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▋ | 269/594 [36:35<49:46, 9.19s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▋ | 269/594 [36:35<49:46, 9.19s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▋ | 269/594 [36:35<49:46, 9.19s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▊ | 270/594 [36:44<49:23, 9.15s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▊ | 270/594 [36:44<49:23, 9.15s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4475, 'learning_rate': 0.0001608, 'epoch': 0.45} 45%|████████████████████████████████████▊ | 270/594 [36:44<49:23, 9.15s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▊ | 270/594 [36:44<49:23, 9.15s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▊ | 270/594 [36:44<49:23, 9.15s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|████████████████████████████████████▉ | 271/594 [36:53<48:49, 9.07s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|████████████████████████████████████▉ | 271/594 [36:53<48:49, 9.07s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|████████████████████████████████████▉ | 271/594 [36:53<48:49, 9.07s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|████████████████████████████████████▉ | 271/594 [36:53<48:49, 9.07s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████ | 272/594 [37:02<48:28, 9.03s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████ | 272/594 [37:02<48:28, 9.03s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6113, 'learning_rate': 0.000162, 'epoch': 0.46} 46%|█████████████████████████████████████ | 272/594 [37:02<48:28, 9.03s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████ | 272/594 [37:02<48:28, 9.03s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▏ | 273/594 [37:11<47:59, 8.97s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▏ | 273/594 [37:11<47:59, 8.97s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7242, 'learning_rate': 0.0001626, 'epoch': 0.46} 46%|█████████████████████████████████████▏ | 273/594 [37:11<47:59, 8.97s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▏ | 273/594 [37:11<47:59, 8.97s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▎ | 274/594 [37:20<47:38, 8.93s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▎ | 274/594 [37:20<47:38, 8.93s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6371, 'learning_rate': 0.0001632, 'epoch': 0.46} 46%|█████████████████████████████████████▎ | 274/594 [37:20<47:38, 8.93s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▎ | 274/594 [37:20<47:38, 8.93s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▌ | 275/594 [37:29<48:08, 9.05s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▌ | 275/594 [37:29<48:08, 9.05s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4794, 'learning_rate': 0.0001638, 'epoch': 0.46} 46%|█████████████████████████████████████▌ | 275/594 [37:29<48:08, 9.05s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▌ | 275/594 [37:29<48:08, 9.05s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▋ | 276/594 [37:38<47:27, 8.95s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▋ | 276/594 [37:38<47:27, 8.95s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3902, 'learning_rate': 0.0001644, 'epoch': 0.46} 46%|█████████████████████████████████████▋ | 276/594 [37:38<47:27, 8.95s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▋ | 276/594 [37:38<47:27, 8.95s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▋ | 276/594 [37:38<47:27, 8.95s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|█████████████████████████████████████▊ | 277/594 [37:46<46:45, 8.85s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|█████████████████████████████████████▊ | 277/594 [37:46<46:45, 8.85s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|█████████████████████████████████████▊ | 277/594 [37:46<46:45, 8.85s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|█████████████████████████████████████▊ | 277/594 [37:46<46:45, 8.85s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|█████████████████████████████████████▉ | 278/594 [37:55<46:04, 8.75s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|█████████████████████████████████████▉ | 278/594 [37:55<46:04, 8.75s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5436, 'learning_rate': 0.0001656, 'epoch': 0.47} 47%|█████████████████████████████████████▉ | 278/594 [37:55<46:04, 8.75s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|█████████████████████████████████████▉ | 278/594 [37:55<46:04, 8.75s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████ | 279/594 [38:03<45:26, 8.66s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████ | 279/594 [38:03<45:26, 8.66s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4553, 'learning_rate': 0.0001662, 'epoch': 0.47} 47%|██████████████████████████████████████ | 279/594 [38:03<45:26, 8.66s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████ | 279/594 [38:03<45:26, 8.66s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▏ | 280/594 [38:12<45:00, 8.60s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▏ | 280/594 [38:12<45:00, 8.60s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5571, 'learning_rate': 0.0001668, 'epoch': 0.47} 47%|██████████████████████████████████████▏ | 280/594 [38:12<45:00, 8.60s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▏ | 280/594 [38:12<45:00, 8.60s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▎ | 281/594 [38:20<44:22, 8.51s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▎ | 281/594 [38:20<44:22, 8.51s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5108, 'learning_rate': 0.0001674, 'epoch': 0.47} 47%|██████████████████████████████████████▎ | 281/594 [38:20<44:22, 8.51s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▎ | 281/594 [38:20<44:22, 8.51s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▎ | 281/594 [38:20<44:22, 8.51s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▍ | 282/594 [38:28<43:39, 8.39s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▍ | 282/594 [38:28<43:39, 8.39s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▍ | 282/594 [38:28<43:39, 8.39s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▍ | 282/594 [38:28<43:39, 8.39s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▍ | 282/594 [38:28<43:39, 8.39s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▌ | 283/594 [38:36<43:03, 8.31s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▌ | 283/594 [38:36<43:03, 8.31s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▌ | 283/594 [38:36<43:03, 8.31s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▌ | 283/594 [38:36<43:03, 8.31s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▌ | 283/594 [38:36<43:03, 8.31s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▋ | 284/594 [38:44<42:24, 8.21s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▋ | 284/594 [38:44<42:24, 8.21s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▋ | 284/594 [38:44<42:24, 8.21s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▋ | 284/594 [38:44<42:24, 8.21s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▋ | 284/594 [38:44<42:24, 8.21s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▊ | 285/594 [38:52<41:34, 8.07s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▊ | 285/594 [38:52<41:34, 8.07s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▊ | 285/594 [38:52<41:34, 8.07s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▊ | 285/594 [38:52<41:34, 8.07s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▊ | 285/594 [38:52<41:34, 8.07s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████ | 286/594 [38:59<40:45, 7.94s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████ | 286/594 [38:59<40:45, 7.94s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████ | 286/594 [38:59<40:45, 7.94s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████ | 286/594 [38:59<40:45, 7.94s/it]g-point operations will not be computed-02 18:35:28,283 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7449, 'learning_rate': 0.00017099999999999998, 'epoch': 0.48} 48%|███████████████████████████████████████▏ | 287/594 [39:07<39:52, 7.79s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████▏ | 287/594 [39:07<39:52, 7.79s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████▏ | 287/594 [39:07<39:52, 7.79s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████▏ | 287/594 [39:07<39:52, 7.79s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████▎ | 288/594 [39:14<38:57, 7.64s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████▎ | 288/594 [39:14<38:57, 7.64s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████▎ | 288/594 [39:14<38:57, 7.64s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:41:37,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:41:37,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3885, 'learning_rate': 0.00017219999999999998, 'epoch': 0.49} [WARNING|modeling_utils.py:388] 2022-03-02 18:41:37,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:41:37,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:41:37,793 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▌ | 290/594 [39:28<36:37, 7.23s/it]g-point operations will not be computed-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:41:47,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:41:47,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:41:47,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▋ | 291/594 [39:34<35:19, 7.00s/it]g-point operations will not be computed-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:41:54,115 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:41:54,115 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:41:54,115 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▊ | 292/594 [39:40<33:39, 6.69s/it]g-point operations will not be computed-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:41:59,856 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:41:59,856 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:41:59,856 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:41:25,477 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▉ | 293/594 [39:46<32:00, 6.38s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:42:04,051 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▉ | 293/594 [39:46<32:00, 6.38s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:42:04,051 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▉ | 293/594 [39:46<32:00, 6.38s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:42:04,051 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:42:07,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:42:04,051 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:42:10,347 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:42:04,051 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:42:12,648 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:42:04,051 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:42:12,648 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:42:04,051 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4993, 'learning_rate': 0.00017579999999999996, 'epoch': 0.5} [WARNING|modeling_utils.py:388] 2022-03-02 18:42:15,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:42:04,051 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:42:15,909 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:42:04,051 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▎ | 296/594 [40:00<26:01, 5.24s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:42:17,978 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▌ | 297/594 [40:04<23:46, 4.80s/it]g-point operations will not be computed-02 18:42:17,978 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▌ | 297/594 [40:04<23:46, 4.80s/it]g-point operations will not be computed-02 18:42:17,978 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▌ | 297/594 [40:04<23:46, 4.80s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:42:21,669 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▌ | 297/594 [40:04<23:46, 4.80s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:42:21,669 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▋ | 298/594 [40:07<21:36, 4.38s/it]g-point operations will not be computed-02 18:42:21,669 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:42:26,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:42:24,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:42:26,271 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:42:24,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▊ | 299/594 [40:10<19:14, 3.91s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:42:27,618 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▊ | 299/594 [40:10<19:14, 3.91s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:42:27,618 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|████████████████████████████████████████▉ | 300/594 [40:13<17:43, 3.62s/it]g-point operations will not be computed-02 18:42:27,618 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|████████████████████████████████████████▉ | 300/594 [40:13<17:43, 3.62s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|████████████████████████████████████████▉ | 300/594 [40:13<17:43, 3.62s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:42:37,861 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:42:37,861 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████ | 301/594 [40:24<27:47, 5.69s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████ | 301/594 [40:24<27:47, 5.69s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████ | 301/594 [40:24<27:47, 5.69s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████ | 301/594 [40:24<27:47, 5.69s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████ | 301/594 [40:24<27:47, 5.69s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▏ | 302/594 [40:34<34:11, 7.03s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▏ | 302/594 [40:34<34:11, 7.03s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▏ | 302/594 [40:34<34:11, 7.03s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▏ | 302/594 [40:34<34:11, 7.03s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▏ | 302/594 [40:34<34:11, 7.03s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▎ | 303/594 [40:44<38:19, 7.90s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▎ | 303/594 [40:44<38:19, 7.90s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▎ | 303/594 [40:44<38:19, 7.90s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▎ | 303/594 [40:44<38:19, 7.90s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▎ | 303/594 [40:44<38:19, 7.90s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▍ | 304/594 [40:54<41:06, 8.51s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▍ | 304/594 [40:54<41:06, 8.51s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▍ | 304/594 [40:54<41:06, 8.51s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▍ | 304/594 [40:54<41:06, 8.51s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▍ | 304/594 [40:54<41:06, 8.51s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▌ | 305/594 [41:04<42:55, 8.91s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▌ | 305/594 [41:04<42:55, 8.91s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▌ | 305/594 [41:04<42:55, 8.91s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▌ | 305/594 [41:04<42:55, 8.91s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▌ | 305/594 [41:04<42:55, 8.91s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|█████████████████████████████████████████▋ | 306/594 [41:14<44:10, 9.20s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|█████████████████████████████████████████▋ | 306/594 [41:14<44:10, 9.20s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|█████████████████████████████████████████▋ | 306/594 [41:14<44:10, 9.20s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|█████████████████████████████████████████▋ | 306/594 [41:14<44:10, 9.20s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|█████████████████████████████████████████▊ | 307/594 [41:23<44:58, 9.40s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|█████████████████████████████████████████▊ | 307/594 [41:23<44:58, 9.40s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6516, 'learning_rate': 0.00018299999999999998, 'epoch': 0.52} 52%|█████████████████████████████████████████▊ | 307/594 [41:23<44:58, 9.40s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|█████████████████████████████████████████▊ | 307/594 [41:23<44:58, 9.40s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████ | 308/594 [41:33<45:15, 9.49s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████ | 308/594 [41:33<45:15, 9.49s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7071, 'learning_rate': 0.0001836, 'epoch': 0.52} 52%|██████████████████████████████████████████ | 308/594 [41:33<45:15, 9.49s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████ | 308/594 [41:33<45:15, 9.49s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████ | 308/594 [41:33<45:15, 9.49s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▏ | 309/594 [41:43<45:13, 9.52s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▏ | 309/594 [41:43<45:13, 9.52s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▏ | 309/594 [41:43<45:13, 9.52s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▏ | 309/594 [41:43<45:13, 9.52s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▏ | 309/594 [41:43<45:13, 9.52s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▎ | 310/594 [41:52<45:19, 9.58s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▎ | 310/594 [41:52<45:19, 9.58s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▎ | 310/594 [41:52<45:19, 9.58s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▎ | 310/594 [41:52<45:19, 9.58s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▍ | 311/594 [42:02<45:23, 9.62s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▍ | 311/594 [42:02<45:23, 9.62s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2875, 'learning_rate': 0.00018539999999999998, 'epoch': 0.52} 52%|██████████████████████████████████████████▍ | 311/594 [42:02<45:23, 9.62s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▍ | 311/594 [42:02<45:23, 9.62s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▌ | 312/594 [42:12<45:13, 9.62s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▌ | 312/594 [42:12<45:13, 9.62s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5176, 'learning_rate': 0.000186, 'epoch': 0.52} 53%|██████████████████████████████████████████▌ | 312/594 [42:12<45:13, 9.62s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▌ | 312/594 [42:12<45:13, 9.62s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▌ | 312/594 [42:12<45:13, 9.62s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▋ | 313/594 [42:21<44:53, 9.59s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▋ | 313/594 [42:21<44:53, 9.59s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▋ | 313/594 [42:21<44:53, 9.59s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▋ | 313/594 [42:21<44:53, 9.59s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▊ | 314/594 [42:31<44:32, 9.54s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▊ | 314/594 [42:31<44:32, 9.54s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6015, 'learning_rate': 0.0001872, 'epoch': 0.53} 53%|██████████████████████████████████████████▊ | 314/594 [42:31<44:32, 9.54s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▊ | 314/594 [42:31<44:32, 9.54s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▊ | 314/594 [42:31<44:32, 9.54s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▉ | 315/594 [42:40<44:11, 9.51s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▉ | 315/594 [42:40<44:11, 9.51s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▉ | 315/594 [42:40<44:11, 9.51s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▉ | 315/594 [42:40<44:11, 9.51s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████ | 316/594 [42:49<43:33, 9.40s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████ | 316/594 [42:49<43:33, 9.40s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6336, 'learning_rate': 0.00018839999999999997, 'epoch': 0.53} 53%|███████████████████████████████████████████ | 316/594 [42:49<43:33, 9.40s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████ | 316/594 [42:49<43:33, 9.40s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████ | 316/594 [42:49<43:33, 9.40s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████▏ | 317/594 [42:59<43:14, 9.37s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████▏ | 317/594 [42:59<43:14, 9.37s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████▏ | 317/594 [42:59<43:14, 9.37s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████▏ | 317/594 [42:59<43:14, 9.37s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▎ | 318/594 [43:08<42:54, 9.33s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▎ | 318/594 [43:08<42:54, 9.33s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8962, 'learning_rate': 0.00018959999999999997, 'epoch': 0.53} 54%|███████████████████████████████████████████▎ | 318/594 [43:08<42:54, 9.33s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▎ | 318/594 [43:08<42:54, 9.33s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▌ | 319/594 [43:17<42:33, 9.29s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▌ | 319/594 [43:17<42:33, 9.29s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5626, 'learning_rate': 0.0001902, 'epoch': 0.54} 54%|███████████████████████████████████████████▌ | 319/594 [43:17<42:33, 9.29s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▌ | 319/594 [43:17<42:33, 9.29s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▋ | 320/594 [43:26<42:09, 9.23s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▋ | 320/594 [43:26<42:09, 9.23s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5467, 'learning_rate': 0.00019079999999999998, 'epoch': 0.54} 54%|███████████████████████████████████████████▋ | 320/594 [43:26<42:09, 9.23s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▋ | 320/594 [43:26<42:09, 9.23s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▊ | 321/594 [43:35<41:48, 9.19s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▊ | 321/594 [43:35<41:48, 9.19s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2766, 'learning_rate': 0.0001914, 'epoch': 0.54} 54%|███████████████████████████████████████████▊ | 321/594 [43:35<41:48, 9.19s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▊ | 321/594 [43:35<41:48, 9.19s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▊ | 321/594 [43:35<41:48, 9.19s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▉ | 322/594 [43:44<41:27, 9.14s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▉ | 322/594 [43:44<41:27, 9.14s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▉ | 322/594 [43:44<41:27, 9.14s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▉ | 322/594 [43:44<41:27, 9.14s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▉ | 322/594 [43:44<41:27, 9.14s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|████████████████████████████████████████████ | 323/594 [43:53<40:50, 9.04s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|████████████████████████████████████████████ | 323/594 [43:53<40:50, 9.04s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|████████████████████████████████████████████ | 323/594 [43:53<40:50, 9.04s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|████████████████████████████████████████████ | 323/594 [43:53<40:50, 9.04s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|████████████████████████████████████████████ | 323/594 [43:53<40:50, 9.04s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|████████████████████████████████████████████ | 323/594 [43:53<40:50, 9.04s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3947, 'learning_rate': 0.00019319999999999998, 'epoch': 0.54} 54%|████████████████████████████████████████████ | 323/594 [43:53<40:50, 9.04s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|████████████████████████████████████████████ | 323/594 [43:53<40:50, 9.04s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|████████████████████████████████████████████ | 323/594 [43:53<40:50, 9.04s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|████████████████████████████████████████████ | 323/594 [43:53<40:50, 9.04s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▎ | 325/594 [44:11<40:42, 9.08s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▎ | 325/594 [44:11<40:42, 9.08s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▎ | 325/594 [44:11<40:42, 9.08s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▎ | 325/594 [44:11<40:42, 9.08s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▍ | 326/594 [44:20<40:07, 8.98s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▍ | 326/594 [44:20<40:07, 8.98s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4117, 'learning_rate': 0.00019439999999999998, 'epoch': 0.55} 55%|████████████████████████████████████████████▍ | 326/594 [44:20<40:07, 8.98s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▍ | 326/594 [44:20<40:07, 8.98s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▌ | 327/594 [44:28<39:25, 8.86s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▌ | 327/594 [44:28<39:25, 8.86s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7301, 'learning_rate': 0.000195, 'epoch': 0.55} 55%|████████████████████████████████████████████▌ | 327/594 [44:28<39:25, 8.86s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:46:53,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:46:53,627 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:46:57,951 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:46:57,951 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:46:57,951 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▊ | 329/594 [44:45<38:20, 8.68s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▊ | 329/594 [44:45<38:20, 8.68s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▊ | 329/594 [44:45<38:20, 8.68s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▊ | 329/594 [44:45<38:20, 8.68s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▊ | 329/594 [44:45<38:20, 8.68s/it]g-point operations will not be computed-02 18:42:32,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████ | 330/594 [44:54<37:49, 8.60s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████ | 330/594 [44:54<37:49, 8.60s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████ | 330/594 [44:54<37:49, 8.60s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████ | 330/594 [44:54<37:49, 8.60s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▏ | 331/594 [45:02<36:56, 8.43s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▏ | 331/594 [45:02<36:56, 8.43s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▏ | 331/594 [45:02<36:56, 8.43s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▏ | 331/594 [45:02<36:56, 8.43s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▏ | 331/594 [45:02<36:56, 8.43s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▎ | 332/594 [45:10<36:16, 8.31s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▎ | 332/594 [45:10<36:16, 8.31s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▎ | 332/594 [45:10<36:16, 8.31s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▎ | 332/594 [45:10<36:16, 8.31s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▎ | 332/594 [45:10<36:16, 8.31s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▍ | 333/594 [45:18<35:39, 8.20s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▍ | 333/594 [45:18<35:39, 8.20s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▍ | 333/594 [45:18<35:39, 8.20s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▍ | 333/594 [45:18<35:39, 8.20s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▍ | 333/594 [45:18<35:39, 8.20s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▌ | 334/594 [45:26<35:03, 8.09s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▌ | 334/594 [45:26<35:03, 8.09s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▌ | 334/594 [45:26<35:03, 8.09s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▌ | 334/594 [45:26<35:03, 8.09s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▌ | 334/594 [45:26<35:03, 8.09s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▋ | 335/594 [45:33<34:29, 7.99s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▋ | 335/594 [45:33<34:29, 7.99s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▋ | 335/594 [45:33<34:29, 7.99s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▋ | 335/594 [45:33<34:29, 7.99s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▋ | 335/594 [45:33<34:29, 7.99s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|█████████████████████████████████████████████▊ | 336/594 [45:41<33:48, 7.86s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|█████████████████████████████████████████████▊ | 336/594 [45:41<33:48, 7.86s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:48:03,297 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:48:03,297 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|█████████████████████████████████████████████▉ | 337/594 [45:49<33:09, 7.74s/it]g-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|█████████████████████████████████████████████▉ | 337/594 [45:49<33:09, 7.74s/it]g-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|█████████████████████████████████████████████▉ | 337/594 [45:49<33:09, 7.74s/it]g-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|█████████████████████████████████████████████▉ | 337/594 [45:49<33:09, 7.74s/it]g-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|█████████████████████████████████████████████▉ | 337/594 [45:49<33:09, 7.74s/it]g-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|██████████████████████████████████████████████ | 338/594 [45:56<32:30, 7.62s/it]g-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|██████████████████████████████████████████████ | 338/594 [45:56<32:30, 7.62s/it]g-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:48:17,660 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|██████████████████████████████████████████████▏ | 339/594 [46:03<31:26, 7.40s/it]g-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|██████████████████████████████████████████████▏ | 339/594 [46:03<31:26, 7.40s/it]g-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4543, 'learning_rate': 0.0002022, 'epoch': 0.57} 57%|██████████████████████████████████████████████▏ | 339/594 [46:03<31:26, 7.40s/it]g-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:48:25,795 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:48:25,795 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6159, 'learning_rate': 0.0002028, 'epoch': 0.57} [WARNING|modeling_utils.py:388] 2022-03-02 18:48:25,795 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:48:31,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:48:31,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4517, 'learning_rate': 0.00020339999999999998, 'epoch': 0.57} [WARNING|modeling_utils.py:388] 2022-03-02 18:48:36,172 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 58%|██████████████████████████████████████████████▋ | 342/594 [46:21<27:11, 6.47s/it]g-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 58%|██████████████████████████████████████████████▋ | 342/594 [46:21<27:11, 6.47s/it]g-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:48:40,303 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:48:40,303 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:48:40,303 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:47:12,613 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 58%|██████████████████████████████████████████████▊ | 343/594 [46:26<25:30, 6.10s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:48:46,383 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:48:46,383 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 58%|██████████████████████████████████████████████▉ | 344/594 [46:31<23:39, 5.68s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:48:49,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:48:49,704 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:48:51,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:48:53,647 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:48:53,647 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:48:55,415 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:48:57,215 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:48:57,215 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:49:00,444 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:49:01,884 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:49:01,884 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:49:04,552 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:49:04,552 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:49:05,796 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:49:05,796 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:49:07,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:49:07,374 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:49:12,684 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:49:12,684 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:49:12,684 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:49:17,707 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:49:17,707 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:49:17,707 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:49:17,707 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:49:17,707 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|████████████████████████████████████████████████ | 352/594 [47:11<27:31, 6.82s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|████████████████████████████████████████████████ | 352/594 [47:11<27:31, 6.82s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|████████████████████████████████████████████████ | 352/594 [47:11<27:31, 6.82s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|████████████████████████████████████████████████ | 352/594 [47:11<27:31, 6.82s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|████████████████████████████████████████████████ | 352/594 [47:11<27:31, 6.82s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|████████████████████████████████████████████████▏ | 353/594 [47:21<31:14, 7.78s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|████████████████████████████████████████████████▏ | 353/594 [47:21<31:14, 7.78s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|████████████████████████████████████████████████▏ | 353/594 [47:21<31:14, 7.78s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|████████████████████████████████████████████████▏ | 353/594 [47:21<31:14, 7.78s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▎ | 354/594 [47:31<33:41, 8.42s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▎ | 354/594 [47:31<33:41, 8.42s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4975, 'learning_rate': 0.00021119999999999996, 'epoch': 0.6} 60%|████████████████████████████████████████████████▎ | 354/594 [47:31<33:41, 8.42s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▎ | 354/594 [47:31<33:41, 8.42s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▍ | 355/594 [47:41<35:10, 8.83s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▍ | 355/594 [47:41<35:10, 8.83s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4032, 'learning_rate': 0.00021179999999999997, 'epoch': 0.6} 60%|████████████████████████████████████████████████▍ | 355/594 [47:41<35:10, 8.83s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▍ | 355/594 [47:41<35:10, 8.83s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▌ | 356/594 [47:51<36:07, 9.11s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▌ | 356/594 [47:51<36:07, 9.11s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3613, 'learning_rate': 0.00021239999999999996, 'epoch': 0.6} 60%|████████████████████████████████████████████████▌ | 356/594 [47:51<36:07, 9.11s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▌ | 356/594 [47:51<36:07, 9.11s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▋ | 357/594 [48:00<36:49, 9.32s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▋ | 357/594 [48:00<36:49, 9.32s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3984, 'learning_rate': 0.00021299999999999997, 'epoch': 0.6} 60%|████████████████████████████████████████████████▋ | 357/594 [48:00<36:49, 9.32s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▋ | 357/594 [48:00<36:49, 9.32s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▊ | 358/594 [48:10<37:03, 9.42s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▊ | 358/594 [48:10<37:03, 9.42s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7058, 'learning_rate': 0.00021359999999999996, 'epoch': 0.6} 60%|████████████████████████████████████████████████▊ | 358/594 [48:10<37:03, 9.42s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▊ | 358/594 [48:10<37:03, 9.42s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▉ | 359/594 [48:20<37:01, 9.45s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▉ | 359/594 [48:20<37:01, 9.45s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3473, 'learning_rate': 0.00021419999999999998, 'epoch': 0.6} 60%|████████████████████████████████████████████████▉ | 359/594 [48:20<37:01, 9.45s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▉ | 359/594 [48:20<37:01, 9.45s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▉ | 359/594 [48:20<37:01, 9.45s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████ | 360/594 [48:29<36:58, 9.48s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████ | 360/594 [48:29<36:58, 9.48s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████ | 360/594 [48:29<36:58, 9.48s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████ | 360/594 [48:29<36:58, 9.48s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6861, 'learning_rate': 0.00021539999999999998, 'epoch': 0.61} g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▎ | 362/594 [48:48<36:29, 9.44s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▎ | 362/594 [48:48<36:29, 9.44s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▎ | 362/594 [48:48<36:29, 9.44s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▎ | 362/594 [48:48<36:29, 9.44s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7367, 'learning_rate': 0.00021659999999999998, 'epoch': 0.61} g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▋ | 364/594 [49:07<35:58, 9.39s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▋ | 364/594 [49:07<35:58, 9.39s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▋ | 364/594 [49:07<35:58, 9.39s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▋ | 364/594 [49:07<35:58, 9.39s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▊ | 365/594 [49:17<37:00, 9.70s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▊ | 365/594 [49:17<37:00, 9.70s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7612, 'learning_rate': 0.00021779999999999998, 'epoch': 0.61} 61%|█████████████████████████████████████████████████▊ | 365/594 [49:17<37:00, 9.70s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▊ | 365/594 [49:17<37:00, 9.70s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|█████████████████████████████████████████████████▉ | 366/594 [49:27<36:37, 9.64s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|█████████████████████████████████████████████████▉ | 366/594 [49:27<36:37, 9.64s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.447, 'learning_rate': 0.00021839999999999997, 'epoch': 0.62} 62%|█████████████████████████████████████████████████▉ | 366/594 [49:27<36:37, 9.64s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|█████████████████████████████████████████████████▉ | 366/594 [49:27<36:37, 9.64s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|█████████████████████████████████████████████████▉ | 366/594 [49:27<36:37, 9.64s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████ | 367/594 [49:36<35:48, 9.46s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████ | 367/594 [49:36<35:48, 9.46s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████ | 367/594 [49:36<35:48, 9.46s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████ | 367/594 [49:36<35:48, 9.46s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▏ | 368/594 [49:45<35:09, 9.34s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▏ | 368/594 [49:45<35:09, 9.34s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3665, 'learning_rate': 0.00021959999999999997, 'epoch': 0.62} 62%|██████████████████████████████████████████████████▏ | 368/594 [49:45<35:09, 9.34s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▏ | 368/594 [49:45<35:09, 9.34s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▏ | 368/594 [49:45<35:09, 9.34s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▎ | 369/594 [49:54<34:38, 9.24s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▎ | 369/594 [49:54<34:38, 9.24s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▎ | 369/594 [49:54<34:38, 9.24s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▎ | 369/594 [49:54<34:38, 9.24s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▎ | 369/594 [49:54<34:38, 9.24s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▍ | 370/594 [50:03<34:09, 9.15s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▍ | 370/594 [50:03<34:09, 9.15s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▍ | 370/594 [50:03<34:09, 9.15s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▍ | 370/594 [50:03<34:09, 9.15s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▍ | 370/594 [50:03<34:09, 9.15s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▌ | 371/594 [50:11<33:41, 9.07s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▌ | 371/594 [50:11<33:41, 9.07s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▌ | 371/594 [50:11<33:41, 9.07s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▌ | 371/594 [50:11<33:41, 9.07s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|██████████████████████████████████████████████████▋ | 372/594 [50:20<33:23, 9.02s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|██████████████████████████████████████████████████▋ | 372/594 [50:20<33:23, 9.02s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4241, 'learning_rate': 0.00022199999999999998, 'epoch': 0.63} [WARNING|modeling_utils.py:388] 2022-03-02 18:52:43,678 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|██████████████████████████████████████████████████▊ | 373/594 [50:29<33:00, 8.96s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|██████████████████████████████████████████████████▊ | 373/594 [50:29<33:00, 8.96s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3496, 'learning_rate': 0.0002226, 'epoch': 0.63} [WARNING|modeling_utils.py:388] 2022-03-02 18:52:52,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:52:52,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████ | 374/594 [50:38<32:36, 8.89s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████ | 374/594 [50:38<32:36, 8.89s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████ | 374/594 [50:38<32:36, 8.89s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████ | 374/594 [50:38<32:36, 8.89s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▏ | 375/594 [50:47<32:53, 9.01s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▏ | 375/594 [50:47<32:53, 9.01s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5995, 'learning_rate': 0.0002238, 'epoch': 0.63} 63%|███████████████████████████████████████████████████▏ | 375/594 [50:47<32:53, 9.01s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▏ | 375/594 [50:47<32:53, 9.01s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5976, 'learning_rate': 0.00022439999999999998, 'epoch': 0.63} [WARNING|modeling_utils.py:388] 2022-03-02 18:53:16,839 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:53:16,839 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:53:16,839 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▍ | 377/594 [51:05<31:54, 8.82s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▍ | 377/594 [51:05<31:54, 8.82s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▍ | 377/594 [51:05<31:54, 8.82s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▍ | 377/594 [51:05<31:54, 8.82s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▍ | 377/594 [51:05<31:54, 8.82s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▌ | 378/594 [51:13<31:26, 8.74s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▌ | 378/594 [51:13<31:26, 8.74s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▌ | 378/594 [51:13<31:26, 8.74s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▌ | 378/594 [51:13<31:26, 8.74s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▋ | 379/594 [51:21<30:53, 8.62s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▋ | 379/594 [51:21<30:53, 8.62s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5382, 'learning_rate': 0.00022619999999999997, 'epoch': 0.64} 64%|███████████████████████████████████████████████████▋ | 379/594 [51:21<30:53, 8.62s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▋ | 379/594 [51:21<30:53, 8.62s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▊ | 380/594 [51:30<30:24, 8.52s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▊ | 380/594 [51:30<30:24, 8.52s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5339, 'learning_rate': 0.00022679999999999998, 'epoch': 0.64} 64%|███████████████████████████████████████████████████▊ | 380/594 [51:30<30:24, 8.52s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▊ | 380/594 [51:30<30:24, 8.52s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▉ | 381/594 [51:38<29:57, 8.44s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▉ | 381/594 [51:38<29:57, 8.44s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6268, 'learning_rate': 0.00022739999999999997, 'epoch': 0.64} 64%|███████████████████████████████████████████████████▉ | 381/594 [51:38<29:57, 8.44s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▉ | 381/594 [51:38<29:57, 8.44s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|████████████████████████████████████████████████████ | 382/594 [51:46<29:23, 8.32s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|████████████████████████████████████████████████████ | 382/594 [51:46<29:23, 8.32s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7323, 'learning_rate': 0.00022799999999999999, 'epoch': 0.64} 64%|████████████████████████████████████████████████████ | 382/594 [51:46<29:23, 8.32s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|████████████████████████████████████████████████████ | 382/594 [51:46<29:23, 8.32s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|████████████████████████████████████████████████████▏ | 383/594 [51:54<29:02, 8.26s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|████████████████████████████████████████████████████▏ | 383/594 [51:54<29:02, 8.26s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7925, 'learning_rate': 0.00022859999999999997, 'epoch': 0.64} 64%|████████████████████████████████████████████████████▏ | 383/594 [51:54<29:02, 8.26s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|████████████████████████████████████████████████████▏ | 383/594 [51:54<29:02, 8.26s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▎ | 384/594 [52:02<28:33, 8.16s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▎ | 384/594 [52:02<28:33, 8.16s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5124, 'learning_rate': 0.0002292, 'epoch': 0.65} 65%|████████████████████████████████████████████████████▎ | 384/594 [52:02<28:33, 8.16s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▎ | 384/594 [52:02<28:33, 8.16s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▌ | 385/594 [52:10<28:00, 8.04s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▌ | 385/594 [52:10<28:00, 8.04s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.706, 'learning_rate': 0.00022979999999999997, 'epoch': 0.65} 65%|████████████████████████████████████████████████████▌ | 385/594 [52:10<28:00, 8.04s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▌ | 385/594 [52:10<28:00, 8.04s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▋ | 386/594 [52:17<27:25, 7.91s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▋ | 386/594 [52:17<27:25, 7.91s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8178, 'learning_rate': 0.0002304, 'epoch': 0.65} [WARNING|modeling_utils.py:388] 2022-03-02 18:54:39,546 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▊ | 387/594 [52:25<26:45, 7.76s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▊ | 387/594 [52:25<26:45, 7.76s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.981, 'learning_rate': 0.00023099999999999998, 'epoch': 0.65} 65%|████████████████████████████████████████████████████▊ | 387/594 [52:25<26:45, 7.76s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▊ | 387/594 [52:25<26:45, 7.76s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▉ | 388/594 [52:32<26:11, 7.63s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▉ | 388/594 [52:32<26:11, 7.63s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4796, 'learning_rate': 0.0002316, 'epoch': 0.65} 65%|████████████████████████████████████████████████████▉ | 388/594 [52:32<26:11, 7.63s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▉ | 388/594 [52:32<26:11, 7.63s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▉ | 388/594 [52:32<26:11, 7.63s/it]g-point operations will not be computed-02 18:48:44,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|█████████████████████████████████████████████████████ | 389/594 [52:39<25:29, 7.46s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|█████████████████████████████████████████████████████ | 389/594 [52:39<25:29, 7.46s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|█████████████████████████████████████████████████████ | 389/594 [52:39<25:29, 7.46s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████▏ | 390/594 [52:46<24:39, 7.25s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████▏ | 390/594 [52:46<24:39, 7.25s/it][WARNING|modeling_utils.py:388] 2022-03-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:05,828 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:05,828 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████▎ | 391/594 [52:52<23:36, 6.98s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████▎ | 391/594 [52:52<23:36, 6.98s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:12,016 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:12,016 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████▍ | 392/594 [52:58<22:32, 6.69s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████▍ | 392/594 [52:58<22:32, 6.69s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:17,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:17,830 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:20,501 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:20,501 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:24,507 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████▋ | 394/594 [53:09<20:04, 6.02s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████▋ | 394/594 [53:09<20:04, 6.02s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:28,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:30,473 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:30,473 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:32,681 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:34,699 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:34,699 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:36,726 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:38,523 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:38,523 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:40,381 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:40,381 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:42,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:43,771 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:43,771 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:46,633 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:46,633 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:48,372 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:48,372 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:54,082 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:54,082 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 18:55:54,082 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▋ | 401/594 [53:43<19:22, 6.02s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▋ | 401/594 [53:43<19:22, 6.02s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▋ | 401/594 [53:43<19:22, 6.02s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▋ | 401/594 [53:43<19:22, 6.02s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▋ | 401/594 [53:43<19:22, 6.02s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▊ | 402/594 [53:54<23:50, 7.45s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▊ | 402/594 [53:54<23:50, 7.45s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▊ | 402/594 [53:54<23:50, 7.45s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▊ | 402/594 [53:54<23:50, 7.45s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▊ | 402/594 [53:54<23:50, 7.45s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▉ | 403/594 [54:04<26:32, 8.34s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▉ | 403/594 [54:04<26:32, 8.34s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▉ | 403/594 [54:04<26:32, 8.34s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▉ | 403/594 [54:04<26:32, 8.34s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▉ | 403/594 [54:04<26:32, 8.34s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████ | 404/594 [54:14<28:14, 8.92s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████ | 404/594 [54:14<28:14, 8.92s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████ | 404/594 [54:14<28:14, 8.92s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████ | 404/594 [54:14<28:14, 8.92s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████ | 404/594 [54:14<28:14, 8.92s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████▏ | 405/594 [54:25<29:25, 9.34s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████▏ | 405/594 [54:25<29:25, 9.34s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████▏ | 405/594 [54:25<29:25, 9.34s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████▏ | 405/594 [54:25<29:25, 9.34s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████▏ | 405/594 [54:25<29:25, 9.34s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████▎ | 406/594 [54:35<30:15, 9.66s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████▎ | 406/594 [54:35<30:15, 9.66s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████▎ | 406/594 [54:35<30:15, 9.66s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████▎ | 406/594 [54:35<30:15, 9.66s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████▎ | 406/594 [54:35<30:15, 9.66s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▌ | 407/594 [54:45<30:36, 9.82s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▌ | 407/594 [54:45<30:36, 9.82s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▌ | 407/594 [54:45<30:36, 9.82s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▌ | 407/594 [54:45<30:36, 9.82s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▌ | 407/594 [54:45<30:36, 9.82s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▋ | 408/594 [54:56<30:53, 9.97s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▋ | 408/594 [54:56<30:53, 9.97s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▋ | 408/594 [54:56<30:53, 9.97s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▋ | 408/594 [54:56<30:53, 9.97s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▋ | 408/594 [54:56<30:53, 9.97s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▊ | 409/594 [55:06<30:52, 10.01s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▊ | 409/594 [55:06<30:52, 10.01s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▊ | 409/594 [55:06<30:52, 10.01s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▊ | 409/594 [55:06<30:52, 10.01s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▊ | 409/594 [55:06<30:52, 10.01s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▉ | 410/594 [55:16<31:12, 10.18s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▉ | 410/594 [55:16<31:12, 10.18s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▉ | 410/594 [55:16<31:12, 10.18s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▉ | 410/594 [55:16<31:12, 10.18s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▉ | 410/594 [55:16<31:12, 10.18s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|████████████████████████████████████████████████████████ | 411/594 [55:26<30:39, 10.05s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|████████████████████████████████████████████████████████ | 411/594 [55:26<30:39, 10.05s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|████████████████████████████████████████████████████████ | 411/594 [55:26<30:39, 10.05s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|████████████████████████████████████████████████████████ | 411/594 [55:26<30:39, 10.05s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|████████████████████████████████████████████████████████ | 411/594 [55:26<30:39, 10.05s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|████████████████████████████████████████████████████████▏ | 412/594 [55:36<30:13, 9.96s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|████████████████████████████████████████████████████████▏ | 412/594 [55:36<30:13, 9.96s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|████████████████████████████████████████████████████████▏ | 412/594 [55:36<30:13, 9.96s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|████████████████████████████████████████████████████████▏ | 412/594 [55:36<30:13, 9.96s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▎ | 413/594 [55:46<29:56, 9.92s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▎ | 413/594 [55:46<29:56, 9.92s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4157, 'learning_rate': 0.0002466, 'epoch': 0.69} 70%|████████████████████████████████████████████████████████▎ | 413/594 [55:46<29:56, 9.92s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▎ | 413/594 [55:46<29:56, 9.92s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▍ | 414/594 [55:56<29:47, 9.93s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▍ | 414/594 [55:56<29:47, 9.93s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5498, 'learning_rate': 0.0002472, 'epoch': 0.7} 70%|████████████████████████████████████████████████████████▍ | 414/594 [55:56<29:47, 9.93s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▍ | 414/594 [55:56<29:47, 9.93s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▍ | 414/594 [55:56<29:47, 9.93s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▌ | 415/594 [56:05<29:11, 9.78s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▌ | 415/594 [56:05<29:11, 9.78s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▌ | 415/594 [56:05<29:11, 9.78s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▌ | 415/594 [56:05<29:11, 9.78s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▌ | 415/594 [56:05<29:11, 9.78s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▌ | 415/594 [56:05<29:11, 9.78s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4638, 'learning_rate': 0.00024839999999999997, 'epoch': 0.7} 70%|████████████████████████████████████████████████████████▌ | 415/594 [56:05<29:11, 9.78s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▌ | 415/594 [56:05<29:11, 9.78s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▌ | 415/594 [56:05<29:11, 9.78s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▊ | 417/594 [56:24<28:28, 9.65s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▊ | 417/594 [56:24<28:28, 9.65s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4486, 'learning_rate': 0.000249, 'epoch': 0.7} 70%|████████████████████████████████████████████████████████▊ | 417/594 [56:24<28:28, 9.65s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▊ | 417/594 [56:24<28:28, 9.65s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|█████████████████████████████████████████████████████████ | 418/594 [56:34<28:32, 9.73s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|█████████████████████████████████████████████████████████ | 418/594 [56:34<28:32, 9.73s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1511, 'learning_rate': 0.00024959999999999994, 'epoch': 0.7} 70%|█████████████████████████████████████████████████████████ | 418/594 [56:34<28:32, 9.73s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|█████████████████████████████████████████████████████████ | 418/594 [56:34<28:32, 9.73s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|█████████████████████████████████████████████████████████ | 418/594 [56:34<28:32, 9.73s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▏ | 419/594 [56:44<28:14, 9.68s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▏ | 419/594 [56:44<28:14, 9.68s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▏ | 419/594 [56:44<28:14, 9.68s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▏ | 419/594 [56:44<28:14, 9.68s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▎ | 420/594 [56:53<27:53, 9.62s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▎ | 420/594 [56:53<27:53, 9.62s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.502, 'learning_rate': 0.00025079999999999997, 'epoch': 0.71} 71%|█████████████████████████████████████████████████████████▎ | 420/594 [56:53<27:53, 9.62s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▎ | 420/594 [56:53<27:53, 9.62s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▎ | 420/594 [56:53<27:53, 9.62s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▍ | 421/594 [57:03<27:31, 9.55s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▍ | 421/594 [57:03<27:31, 9.55s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▍ | 421/594 [57:03<27:31, 9.55s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▍ | 421/594 [57:03<27:31, 9.55s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▍ | 421/594 [57:03<27:31, 9.55s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▌ | 422/594 [57:12<27:25, 9.56s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▌ | 422/594 [57:12<27:25, 9.56s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▌ | 422/594 [57:12<27:25, 9.56s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▌ | 422/594 [57:12<27:25, 9.56s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▌ | 422/594 [57:12<27:25, 9.56s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▋ | 423/594 [57:21<26:57, 9.46s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▋ | 423/594 [57:21<26:57, 9.46s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▋ | 423/594 [57:21<26:57, 9.46s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▋ | 423/594 [57:21<26:57, 9.46s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▊ | 424/594 [57:31<26:49, 9.47s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▊ | 424/594 [57:31<26:49, 9.47s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4421, 'learning_rate': 0.0002532, 'epoch': 0.71} 71%|█████████████████████████████████████████████████████████▊ | 424/594 [57:31<26:49, 9.47s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▊ | 424/594 [57:31<26:49, 9.47s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▊ | 424/594 [57:31<26:49, 9.47s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|█████████████████████████████████████████████████████████▉ | 425/594 [57:40<26:33, 9.43s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|█████████████████████████████████████████████████████████▉ | 425/594 [57:40<26:33, 9.43s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|█████████████████████████████████████████████████████████▉ | 425/594 [57:40<26:33, 9.43s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|█████████████████████████████████████████████████████████▉ | 425/594 [57:40<26:33, 9.43s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████ | 426/594 [57:49<25:53, 9.25s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████ | 426/594 [57:49<25:53, 9.25s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4904, 'learning_rate': 0.00025439999999999995, 'epoch': 0.72} 72%|██████████████████████████████████████████████████████████ | 426/594 [57:49<25:53, 9.25s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████ | 426/594 [57:49<25:53, 9.25s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▏ | 427/594 [57:58<25:18, 9.09s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▏ | 427/594 [57:58<25:18, 9.09s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2661, 'learning_rate': 0.00025499999999999996, 'epoch': 0.72} 72%|██████████████████████████████████████████████████████████▏ | 427/594 [57:58<25:18, 9.09s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▏ | 427/594 [57:58<25:18, 9.09s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▎ | 428/594 [58:06<24:45, 8.95s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▎ | 428/594 [58:06<24:45, 8.95s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2795, 'learning_rate': 0.0002556, 'epoch': 0.72} 72%|██████████████████████████████████████████████████████████▎ | 428/594 [58:06<24:45, 8.95s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▎ | 428/594 [58:06<24:45, 8.95s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▎ | 428/594 [58:06<24:45, 8.95s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▌ | 429/594 [58:15<24:14, 8.81s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▌ | 429/594 [58:15<24:14, 8.81s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▌ | 429/594 [58:15<24:14, 8.81s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▌ | 429/594 [58:15<24:14, 8.81s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▌ | 429/594 [58:15<24:14, 8.81s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▋ | 430/594 [58:23<23:46, 8.70s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▋ | 430/594 [58:23<23:46, 8.70s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▋ | 430/594 [58:23<23:46, 8.70s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▋ | 430/594 [58:23<23:46, 8.70s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|██████████████████████████████████████████████████████████▊ | 431/594 [58:32<23:23, 8.61s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|██████████████████████████████████████████████████████████▊ | 431/594 [58:32<23:23, 8.61s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7066, 'learning_rate': 0.00025739999999999997, 'epoch': 0.72} 73%|██████████████████████████████████████████████████████████▊ | 431/594 [58:32<23:23, 8.61s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|██████████████████████████████████████████████████████████▊ | 431/594 [58:32<23:23, 8.61s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|██████████████████████████████████████████████████████████▉ | 432/594 [58:40<23:00, 8.52s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|██████████████████████████████████████████████████████████▉ | 432/594 [58:40<23:00, 8.52s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.22, 'learning_rate': 0.000258, 'epoch': 0.73} 73%|██████████████████████████████████████████████████████████▉ | 432/594 [58:40<23:00, 8.52s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|██████████████████████████████████████████████████████████▉ | 432/594 [58:40<23:00, 8.52s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████ | 433/594 [58:48<22:38, 8.44s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████ | 433/594 [58:48<22:38, 8.44s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4888, 'learning_rate': 0.0002586, 'epoch': 0.73} 73%|███████████████████████████████████████████████████████████ | 433/594 [58:48<22:38, 8.44s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████ | 433/594 [58:48<22:38, 8.44s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▏ | 434/594 [58:56<22:11, 8.32s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▏ | 434/594 [58:56<22:11, 8.32s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5165, 'learning_rate': 0.00025919999999999996, 'epoch': 0.73} 73%|███████████████████████████████████████████████████████████▏ | 434/594 [58:56<22:11, 8.32s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▏ | 434/594 [58:56<22:11, 8.32s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▏ | 434/594 [58:56<22:11, 8.32s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▎ | 435/594 [59:04<21:44, 8.20s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▎ | 435/594 [59:04<21:44, 8.20s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▎ | 435/594 [59:04<21:44, 8.20s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▎ | 435/594 [59:04<21:44, 8.20s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▎ | 435/594 [59:04<21:44, 8.20s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▍ | 436/594 [59:12<21:20, 8.11s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▍ | 436/594 [59:12<21:20, 8.11s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▍ | 436/594 [59:12<21:20, 8.11s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▍ | 436/594 [59:12<21:20, 8.11s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▍ | 436/594 [59:12<21:20, 8.11s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▌ | 437/594 [59:20<20:48, 7.95s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▌ | 437/594 [59:20<20:48, 7.95s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▌ | 437/594 [59:20<20:48, 7.95s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▌ | 437/594 [59:20<20:48, 7.95s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▌ | 437/594 [59:20<20:48, 7.95s/it]g-point operations will not be computed-02 18:54:57,586 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▋ | 438/594 [59:27<20:24, 7.85s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▋ | 438/594 [59:27<20:24, 7.85s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▋ | 438/594 [59:27<20:24, 7.85s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▋ | 438/594 [59:27<20:24, 7.85s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▊ | 439/594 [59:35<19:51, 7.69s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▊ | 439/594 [59:35<19:51, 7.69s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▊ | 439/594 [59:35<19:51, 7.69s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▊ | 439/594 [59:35<19:51, 7.69s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▊ | 439/594 [59:35<19:51, 7.69s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|████████████████████████████████████████████████████████████ | 440/594 [59:42<19:16, 7.51s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:02:01,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:02:01,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:02:01,827 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|████████████████████████████████████████████████████████████▏ | 441/594 [59:48<18:29, 7.25s/it]g-point operations will not be computed-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:02:08,209 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:02:08,209 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:02:08,209 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|████████████████████████████████████████████████████████████▎ | 442/594 [59:55<17:32, 6.92s/it]g-point operations will not be computed-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:02:14,113 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:02:16,797 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:02:16,797 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8587, 'learning_rate': 0.0002646, 'epoch': 0.74} [WARNING|modeling_utils.py:388] 2022-03-02 19:02:20,766 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|███████████████████████████████████████████████████████████ | 444/594 [1:00:05<15:22, 6.15s/it]g-point operations will not be computed-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|███████████████████████████████████████████████████████████ | 444/594 [1:00:05<15:22, 6.15s/it]g-point operations will not be computed-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:02:24,547 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:02:26,849 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:02:26,849 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3795, 'learning_rate': 0.00026579999999999996, 'epoch': 0.75} [WARNING|modeling_utils.py:388] 2022-03-02 19:02:30,177 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:02:30,177 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:01:45,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|███████████████████████████████████████████████████████████▎ | 446/594 [1:00:15<13:07, 5.32s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:02:32,294 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:02:34,187 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:32,294 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:02:34,187 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:32,294 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|███████████████████████████████████████████████████████████▍ | 447/594 [1:00:18<12:00, 4.90s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:02:36,074 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|███████████████████████████████████████████████████████████▌ | 448/594 [1:00:22<10:50, 4.45s/it]g-point operations will not be computed-02 19:02:36,074 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|███████████████████████████████████████████████████████████▌ | 448/594 [1:00:22<10:50, 4.45s/it]g-point operations will not be computed-02 19:02:36,074 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:02:40,715 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:39,359 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:02:40,715 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:39,359 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:02:43,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:42,074 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|███████████████████████████████████████████████████████████▊ | 450/594 [1:00:28<08:47, 3.66s/it]g-point operations will not be computed-02 19:02:42,074 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|███████████████████████████████████████████████████████████▊ | 450/594 [1:00:28<08:47, 3.66s/it]g-point operations will not be computed-02 19:02:42,074 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|███████████████████████████████████████████████████████████▊ | 450/594 [1:00:28<08:47, 3.66s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|███████████████████████████████████████████████████████████▊ | 450/594 [1:00:28<08:47, 3.66s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:02:52,680 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|███████████████████████████████████████████████████████████▉ | 451/594 [1:00:39<13:58, 5.86s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|███████████████████████████████████████████████████████████▉ | 451/594 [1:00:39<13:58, 5.86s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4935, 'learning_rate': 0.0002694, 'epoch': 0.76} 76%|███████████████████████████████████████████████████████████▉ | 451/594 [1:00:39<13:58, 5.86s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|███████████████████████████████████████████████████████████▉ | 451/594 [1:00:39<13:58, 5.86s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████ | 452/594 [1:00:49<16:55, 7.15s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████ | 452/594 [1:00:49<16:55, 7.15s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6275, 'learning_rate': 0.00027, 'epoch': 0.76} 76%|████████████████████████████████████████████████████████████ | 452/594 [1:00:49<16:55, 7.15s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████ | 452/594 [1:00:49<16:55, 7.15s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████▏ | 453/594 [1:00:59<18:51, 8.02s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████▏ | 453/594 [1:00:59<18:51, 8.02s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4756, 'learning_rate': 0.00027059999999999996, 'epoch': 0.76} 76%|████████████████████████████████████████████████████████████▏ | 453/594 [1:00:59<18:51, 8.02s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████▏ | 453/594 [1:00:59<18:51, 8.02s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████▍ | 454/594 [1:01:09<20:03, 8.59s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████▍ | 454/594 [1:01:09<20:03, 8.59s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7369, 'learning_rate': 0.0002712, 'epoch': 0.76} 76%|████████████████████████████████████████████████████████████▍ | 454/594 [1:01:09<20:03, 8.59s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████▍ | 454/594 [1:01:09<20:03, 8.59s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6098, 'learning_rate': 0.0002718, 'epoch': 0.77} g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▋ | 456/594 [1:01:29<21:18, 9.26s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▋ | 456/594 [1:01:29<21:18, 9.26s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6067, 'learning_rate': 0.0002724, 'epoch': 0.77} 77%|████████████████████████████████████████████████████████████▋ | 456/594 [1:01:29<21:18, 9.26s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▋ | 456/594 [1:01:29<21:18, 9.26s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▋ | 456/594 [1:01:29<21:18, 9.26s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▊ | 457/594 [1:01:38<21:29, 9.41s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▊ | 457/594 [1:01:38<21:29, 9.41s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▊ | 457/594 [1:01:38<21:29, 9.41s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▊ | 457/594 [1:01:38<21:29, 9.41s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▊ | 457/594 [1:01:38<21:29, 9.41s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▉ | 458/594 [1:01:48<21:36, 9.54s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▉ | 458/594 [1:01:48<21:36, 9.54s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▉ | 458/594 [1:01:48<21:36, 9.54s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▉ | 458/594 [1:01:48<21:36, 9.54s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|█████████████████████████████████████████████████████████████ | 459/594 [1:01:58<21:34, 9.59s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|█████████████████████████████████████████████████████████████ | 459/594 [1:01:58<21:34, 9.59s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7952, 'learning_rate': 0.0002742, 'epoch': 0.77} 77%|█████████████████████████████████████████████████████████████ | 459/594 [1:01:58<21:34, 9.59s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|█████████████████████████████████████████████████████████████ | 459/594 [1:01:58<21:34, 9.59s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|█████████████████████████████████████████████████████████████▏ | 460/594 [1:02:08<21:27, 9.61s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|█████████████████████████████████████████████████████████████▏ | 460/594 [1:02:08<21:27, 9.61s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5183, 'learning_rate': 0.0002748, 'epoch': 0.77} 77%|█████████████████████████████████████████████████████████████▏ | 460/594 [1:02:08<21:27, 9.61s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|█████████████████████████████████████████████████████████████▏ | 460/594 [1:02:08<21:27, 9.61s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▎ | 461/594 [1:02:17<21:20, 9.63s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▎ | 461/594 [1:02:17<21:20, 9.63s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8535, 'learning_rate': 0.00027539999999999997, 'epoch': 0.78} 78%|█████████████████████████████████████████████████████████████▎ | 461/594 [1:02:17<21:20, 9.63s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▎ | 461/594 [1:02:17<21:20, 9.63s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▎ | 461/594 [1:02:17<21:20, 9.63s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▍ | 462/594 [1:02:27<21:11, 9.64s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▍ | 462/594 [1:02:27<21:11, 9.64s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▍ | 462/594 [1:02:27<21:11, 9.64s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▍ | 462/594 [1:02:27<21:11, 9.64s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▌ | 463/594 [1:02:36<20:55, 9.59s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▌ | 463/594 [1:02:36<20:55, 9.59s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7839, 'learning_rate': 0.0002766, 'epoch': 0.78} 78%|█████████████████████████████████████████████████████████████▌ | 463/594 [1:02:36<20:55, 9.59s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▌ | 463/594 [1:02:36<20:55, 9.59s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▋ | 464/594 [1:02:46<20:41, 9.55s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▋ | 464/594 [1:02:46<20:41, 9.55s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6477, 'learning_rate': 0.0002772, 'epoch': 0.78} 78%|█████████████████████████████████████████████████████████████▋ | 464/594 [1:02:46<20:41, 9.55s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▋ | 464/594 [1:02:46<20:41, 9.55s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▊ | 465/594 [1:02:55<20:27, 9.52s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▊ | 465/594 [1:02:55<20:27, 9.52s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8635, 'learning_rate': 0.0002778, 'epoch': 0.78} 78%|█████████████████████████████████████████████████████████████▊ | 465/594 [1:02:55<20:27, 9.52s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▊ | 465/594 [1:02:55<20:27, 9.52s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▉ | 466/594 [1:03:05<20:13, 9.48s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▉ | 466/594 [1:03:05<20:13, 9.48s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4758, 'learning_rate': 0.0002784, 'epoch': 0.78} 78%|█████████████████████████████████████████████████████████████▉ | 466/594 [1:03:05<20:13, 9.48s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▉ | 466/594 [1:03:05<20:13, 9.48s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▉ | 466/594 [1:03:05<20:13, 9.48s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████ | 467/594 [1:03:14<19:52, 9.39s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████ | 467/594 [1:03:14<19:52, 9.39s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████ | 467/594 [1:03:14<19:52, 9.39s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████ | 467/594 [1:03:14<19:52, 9.39s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▏ | 468/594 [1:03:23<19:39, 9.36s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▏ | 468/594 [1:03:23<19:39, 9.36s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4608, 'learning_rate': 0.00027959999999999997, 'epoch': 0.79} 79%|██████████████████████████████████████████████████████████████▏ | 468/594 [1:03:23<19:39, 9.36s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▏ | 468/594 [1:03:23<19:39, 9.36s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▏ | 468/594 [1:03:23<19:39, 9.36s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▏ | 468/594 [1:03:23<19:39, 9.36s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7147, 'learning_rate': 0.0002802, 'epoch': 0.79} 79%|██████████████████████████████████████████████████████████████▏ | 468/594 [1:03:23<19:39, 9.36s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▏ | 468/594 [1:03:23<19:39, 9.36s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▏ | 468/594 [1:03:23<19:39, 9.36s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▌ | 470/594 [1:03:42<19:09, 9.27s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▌ | 470/594 [1:03:42<19:09, 9.27s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3855, 'learning_rate': 0.0002808, 'epoch': 0.79} 79%|██████████████████████████████████████████████████████████████▌ | 470/594 [1:03:42<19:09, 9.27s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▌ | 470/594 [1:03:42<19:09, 9.27s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▌ | 470/594 [1:03:42<19:09, 9.27s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▋ | 471/594 [1:03:51<18:50, 9.19s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▋ | 471/594 [1:03:51<18:50, 9.19s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▋ | 471/594 [1:03:51<18:50, 9.19s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▋ | 471/594 [1:03:51<18:50, 9.19s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▋ | 471/594 [1:03:51<18:50, 9.19s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▊ | 472/594 [1:04:00<18:41, 9.19s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▊ | 472/594 [1:04:00<18:41, 9.19s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▊ | 472/594 [1:04:00<18:41, 9.19s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▊ | 472/594 [1:04:00<18:41, 9.19s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▊ | 472/594 [1:04:00<18:41, 9.19s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|██████████████████████████████████████████████████████████████▉ | 473/594 [1:04:09<18:25, 9.14s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|██████████████████████████████████████████████████████████████▉ | 473/594 [1:04:09<18:25, 9.14s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|██████████████████████████████████████████████████████████████▉ | 473/594 [1:04:09<18:25, 9.14s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:06:34,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:06:34,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6407, 'learning_rate': 0.00028319999999999994, 'epoch': 0.8} [WARNING|modeling_utils.py:388] 2022-03-02 19:06:34,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:06:34,328 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▏ | 475/594 [1:04:27<18:15, 9.21s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▏ | 475/594 [1:04:27<18:15, 9.21s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4258, 'learning_rate': 0.00028379999999999996, 'epoch': 0.8} 80%|███████████████████████████████████████████████████████████████▏ | 475/594 [1:04:27<18:15, 9.21s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▏ | 475/594 [1:04:27<18:15, 9.21s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▏ | 475/594 [1:04:27<18:15, 9.21s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▎ | 476/594 [1:04:36<17:58, 9.14s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▎ | 476/594 [1:04:36<17:58, 9.14s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▎ | 476/594 [1:04:36<17:58, 9.14s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▎ | 476/594 [1:04:36<17:58, 9.14s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▎ | 476/594 [1:04:36<17:58, 9.14s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▍ | 477/594 [1:04:45<17:35, 9.02s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▍ | 477/594 [1:04:45<17:35, 9.02s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▍ | 477/594 [1:04:45<17:35, 9.02s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▍ | 477/594 [1:04:45<17:35, 9.02s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▌ | 478/594 [1:04:54<17:12, 8.90s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▌ | 478/594 [1:04:54<17:12, 8.90s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5687, 'learning_rate': 0.00028559999999999995, 'epoch': 0.8} 80%|███████████████████████████████████████████████████████████████▌ | 478/594 [1:04:54<17:12, 8.90s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▌ | 478/594 [1:04:54<17:12, 8.90s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▌ | 478/594 [1:04:54<17:12, 8.90s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▋ | 479/594 [1:05:02<16:55, 8.83s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▋ | 479/594 [1:05:02<16:55, 8.83s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▋ | 479/594 [1:05:02<16:55, 8.83s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▋ | 479/594 [1:05:02<16:55, 8.83s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▊ | 480/594 [1:05:11<16:35, 8.73s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▊ | 480/594 [1:05:11<16:35, 8.73s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3326, 'learning_rate': 0.0002868, 'epoch': 0.81} 81%|███████████████████████████████████████████████████████████████▊ | 480/594 [1:05:11<16:35, 8.73s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▊ | 480/594 [1:05:11<16:35, 8.73s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▉ | 481/594 [1:05:19<16:12, 8.60s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▉ | 481/594 [1:05:19<16:12, 8.60s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9138, 'learning_rate': 0.00028739999999999994, 'epoch': 0.81} 81%|███████████████████████████████████████████████████████████████▉ | 481/594 [1:05:19<16:12, 8.60s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▉ | 481/594 [1:05:19<16:12, 8.60s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████ | 482/594 [1:05:27<15:53, 8.51s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████ | 482/594 [1:05:27<15:53, 8.51s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.992, 'learning_rate': 0.00028799999999999995, 'epoch': 0.81} 81%|████████████████████████████████████████████████████████████████ | 482/594 [1:05:27<15:53, 8.51s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████ | 482/594 [1:05:27<15:53, 8.51s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████▏ | 483/594 [1:05:36<15:39, 8.46s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████▏ | 483/594 [1:05:36<15:39, 8.46s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7403, 'learning_rate': 0.00028859999999999997, 'epoch': 0.81} 81%|████████████████████████████████████████████████████████████████▏ | 483/594 [1:05:36<15:39, 8.46s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████▏ | 483/594 [1:05:36<15:39, 8.46s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████▎ | 484/594 [1:05:44<15:15, 8.33s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████▎ | 484/594 [1:05:44<15:15, 8.33s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7293, 'learning_rate': 0.0002892, 'epoch': 0.81} 81%|████████████████████████████████████████████████████████████████▎ | 484/594 [1:05:44<15:15, 8.33s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████▎ | 484/594 [1:05:44<15:15, 8.33s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▌ | 485/594 [1:05:52<14:56, 8.22s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▌ | 485/594 [1:05:52<14:56, 8.22s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5899, 'learning_rate': 0.00028979999999999994, 'epoch': 0.82} 82%|████████████████████████████████████████████████████████████████▌ | 485/594 [1:05:52<14:56, 8.22s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▌ | 485/594 [1:05:52<14:56, 8.22s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▋ | 486/594 [1:05:59<14:29, 8.05s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▋ | 486/594 [1:05:59<14:29, 8.05s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4421, 'learning_rate': 0.00029039999999999996, 'epoch': 0.82} 82%|████████████████████████████████████████████████████████████████▋ | 486/594 [1:05:59<14:29, 8.05s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▋ | 486/594 [1:05:59<14:29, 8.05s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▊ | 487/594 [1:06:07<14:07, 7.92s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▊ | 487/594 [1:06:07<14:07, 7.92s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6241, 'learning_rate': 0.00029099999999999997, 'epoch': 0.82} [WARNING|modeling_utils.py:388] 2022-03-02 19:08:29,157 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:08:29,157 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▉ | 488/594 [1:06:14<13:43, 7.76s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▉ | 488/594 [1:06:14<13:43, 7.76s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▉ | 488/594 [1:06:14<13:43, 7.76s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▉ | 488/594 [1:06:14<13:43, 7.76s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▉ | 488/594 [1:06:14<13:43, 7.76s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|█████████████████████████████████████████████████████████████████ | 489/594 [1:06:21<13:14, 7.57s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|█████████████████████████████████████████████████████████████████ | 489/594 [1:06:21<13:14, 7.57s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:08:43,301 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:08:43,301 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|█████████████████████████████████████████████████████████████████▏ | 490/594 [1:06:28<12:46, 7.37s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|█████████████████████████████████████████████████████████████████▏ | 490/594 [1:06:28<12:46, 7.37s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|█████████████████████████████████████████████████████████████████▏ | 490/594 [1:06:28<12:46, 7.37s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|█████████████████████████████████████████████████████████████████▏ | 490/594 [1:06:28<12:46, 7.37s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:08:51,564 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:08:51,564 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:08:51,564 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:08:57,788 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:08:57,788 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7008, 'learning_rate': 0.000294, 'epoch': 0.83} [WARNING|modeling_utils.py:388] 2022-03-02 19:08:57,788 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:09:03,814 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:09:03,814 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3374, 'learning_rate': 0.00029459999999999995, 'epoch': 0.83} [WARNING|modeling_utils.py:388] 2022-03-02 19:09:07,997 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:09:07,997 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 83%|█████████████████████████████████████████████████████████████████▋ | 494/594 [1:06:53<10:26, 6.26s/it]g-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:09:11,835 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:09:11,835 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:09:11,835 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:02:47,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 83%|█████████████████████████████████████████████████████████████████▊ | 495/594 [1:06:58<09:40, 5.86s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:09:15,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:09:17,583 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:09:15,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:09:17,583 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:09:15,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|█████████████████████████████████████████████████████████████████▉ | 496/594 [1:07:02<08:51, 5.42s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:09:19,692 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:09:21,564 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:09:19,692 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:09:21,564 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:09:19,692 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|██████████████████████████████████████████████████████████████████ | 497/594 [1:07:06<08:01, 4.96s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:09:23,453 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|██████████████████████████████████████████████████████████████████▏ | 498/594 [1:07:09<07:10, 4.49s/it]g-point operations will not be computed-02 19:09:23,453 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|██████████████████████████████████████████████████████████████████▏ | 498/594 [1:07:09<07:10, 4.49s/it]g-point operations will not be computed-02 19:09:23,453 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:09:28,080 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:09:26,687 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-03-02 19:09:28,080 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:09:26,687 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|██████████████████████████████████████████████████████████████████▎ | 499/594 [1:07:12<06:20, 4.01s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:09:29,508 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2366] 2022-03-02 19:09:31,887 >> Num examples = 2642▍ | 500/594 [1:07:15<05:47, 3.70s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2366] 2022-03-02 19:09:31,887 >> Num examples = 2642▍ | 500/594 [1:07:15<05:47, 3.70s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.7669, 'learning_rate': 0.0002988, 'epoch': 0.84} [INFO|trainer.py:2366] 2022-03-02 19:09:31,887 >> Num examples = 2642▍ | 500/594 [1:07:15<05:47, 3.70s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2366] 2022-03-02 19:09:31,887 >> Num examples = 2642▍ | 500/594 [1:07:15<05:47, 3.70s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2366] 2022-03-02 19:09:31,887 >> Num examples = 2642▍ | 500/594 [1:07:15<05:47, 3.70s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 2%|█▉ | 5/221 [00:11<09:31, 2.64s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 3%|██▎ | 6/221 [00:15<10:15, 2.86s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 3%|██▋ | 7/221 [00:18<11:15, 3.16s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 3%|██▋ | 7/221 [00:18<11:15, 3.16s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 3%|██▋ | 7/221 [00:18<11:15, 3.16s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 4%|███▍ | 9/221 [00:24<10:52, 3.08s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 5%|███▋ | 10/221 [00:28<11:45, 3.34s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 5%|████ | 11/221 [00:33<12:42, 3.63s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 5%|████▍ | 12/221 [00:35<11:52, 3.41s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 6%|████▊ | 13/221 [00:39<11:28, 3.31s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 6%|█████▏ | 14/221 [00:42<11:29, 3.33s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 7%|█████▌ | 15/221 [00:47<12:54, 3.76s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 7%|█████▉ | 16/221 [00:51<13:48, 4.04s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 8%|██████▎ | 17/221 [00:55<13:13, 3.89s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 8%|██████▋ | 18/221 [00:59<12:57, 3.83s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 9%|███████ | 19/221 [01:02<12:15, 3.64s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 9%|███████▍ | 20/221 [01:05<11:39, 3.48s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 10%|███████▊ | 21/221 [01:08<10:57, 3.29s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 10%|████████▏ | 22/221 [01:11<10:45, 3.24s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 10%|████████▌ | 23/221 [01:14<10:31, 3.19s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 11%|████████▉ | 24/221 [01:18<11:00, 3.35s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 11%|█████████▎ | 25/221 [01:22<11:29, 3.52s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 12%|█████████▋ | 26/221 [01:25<11:44, 3.61s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 12%|██████████ | 27/221 [01:28<10:41, 3.31s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 13%|██████████▍ | 28/221 [01:32<11:13, 3.49s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 13%|██████████▊ | 29/221 [01:36<11:57, 3.74s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 14%|███████████▏ | 30/221 [01:39<11:10, 3.51s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 14%|███████████▌ | 31/221 [01:42<10:11, 3.22s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 14%|███████████▊ | 32/221 [01:45<10:06, 3.21s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 15%|████████████▏ | 33/221 [01:49<10:34, 3.38s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 15%|████████████▌ | 34/221 [01:52<10:40, 3.43s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 16%|████████████▉ | 35/221 [01:55<10:15, 3.31s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 16%|█████████████▎ | 36/221 [01:59<10:08, 3.29s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 17%|█████████████▋ | 37/221 [02:03<10:58, 3.58s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 17%|██████████████ | 38/221 [02:06<10:18, 3.38s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 18%|██████████████▍ | 39/221 [02:10<10:39, 3.51s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 18%|██████████████▊ | 40/221 [02:12<10:02, 3.33s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 19%|███████████████▏ | 41/221 [02:16<10:16, 3.42s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 19%|███████████████▌ | 42/221 [02:21<11:10, 3.75s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 19%|███████████████▉ | 43/221 [02:24<10:39, 3.59s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 20%|████████████████▎ | 44/221 [02:29<11:41, 3.97s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 20%|████████████████▋ | 45/221 [02:33<12:12, 4.16s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 21%|█████████████████ | 46/221 [02:37<12:03, 4.14s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 21%|█████████████████▍ | 47/221 [02:41<11:50, 4.08s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 22%|█████████████████▊ | 48/221 [02:45<11:46, 4.08s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 22%|██████████████████▏ | 49/221 [02:49<11:19, 3.95s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 23%|██████████████████▌ | 50/221 [02:53<11:18, 3.97s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 23%|██████████████████▉ | 51/221 [02:56<10:37, 3.75s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 24%|███████████████████▎ | 52/221 [02:59<09:57, 3.53s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 24%|███████████████████▋ | 53/221 [03:02<09:23, 3.36s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 24%|████████████████████ | 54/221 [03:06<09:38, 3.47s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 25%|████████████████████▍ | 55/221 [03:10<09:40, 3.50s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 25%|████████████████████▊ | 56/221 [03:14<10:26, 3.79s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 26%|█████████████████████▏ | 57/221 [03:18<10:30, 3.84s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 26%|█████████████████████▌ | 58/221 [03:21<10:04, 3.71s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 27%|█████████████████████▉ | 59/221 [03:25<09:34, 3.55s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 27%|██████████████████████▎ | 60/221 [03:27<08:46, 3.27s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 28%|██████████████████████▋ | 61/221 [03:31<08:57, 3.36s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 28%|███████████████████████ | 62/221 [03:34<08:40, 3.27s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 29%|███████████████████████▍ | 63/221 [03:37<08:42, 3.30s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 29%|███████████████████████▋ | 64/221 [03:41<08:41, 3.32s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 29%|████████████████████████ | 65/221 [03:44<08:35, 3.30s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 30%|████████████████████████▍ | 66/221 [03:47<08:41, 3.37s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 30%|████████████████████████▊ | 67/221 [03:50<08:07, 3.16s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 31%|█████████████████████████▏ | 68/221 [03:54<08:48, 3.45s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 31%|█████████████████████████▌ | 69/221 [03:57<08:26, 3.34s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 32%|█████████████████████████▉ | 70/221 [04:01<08:21, 3.32s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 32%|██████████████████████████▎ | 71/221 [04:04<08:09, 3.27s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 32%|██████████████████████████▎ | 71/221 [04:04<08:09, 3.27s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 32%|██████████████████████████▎ | 71/221 [04:04<08:09, 3.27s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 33%|███████████████████████████ | 73/221 [04:10<07:57, 3.23s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 33%|███████████████████████████▍ | 74/221 [04:13<07:53, 3.22s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 34%|███████████████████████████▊ | 75/221 [04:16<07:45, 3.19s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 34%|████████████████████████████▏ | 76/221 [04:19<07:40, 3.17s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 35%|████████████████████████████▌ | 77/221 [04:22<07:35, 3.17s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 35%|████████████████████████████▉ | 78/221 [04:26<07:46, 3.26s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 36%|█████████████████████████████▎ | 79/221 [04:29<07:29, 3.17s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 36%|█████████████████████████████▋ | 80/221 [04:32<07:28, 3.18s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 37%|██████████████████████████████ | 81/221 [04:36<07:52, 3.37s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 37%|██████████████████████████████▍ | 82/221 [04:40<08:25, 3.64s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 38%|██████████████████████████████▊ | 83/221 [04:45<08:49, 3.84s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 38%|███████████████████████████████▏ | 84/221 [04:49<08:52, 3.88s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 38%|███████████████████████████████▌ | 85/221 [04:53<09:13, 4.07s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 39%|███████████████████████████████▉ | 86/221 [04:57<08:55, 3.97s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 39%|████████████████████████████████▎ | 87/221 [05:01<09:11, 4.12s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 40%|████████████████████████████████▋ | 88/221 [05:05<08:36, 3.89s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 40%|█████████████████████████████████ | 89/221 [05:08<08:07, 3.70s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 41%|█████████████████████████████████▍ | 90/221 [05:11<08:03, 3.69s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 41%|█████████████████████████████████▊ | 91/221 [05:16<08:17, 3.83s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 42%|██████████████████████████████████▏ | 92/221 [05:20<08:29, 3.95s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 42%|██████████████████████████████████▌ | 93/221 [05:24<08:36, 4.04s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 43%|██████████████████████████████████▉ | 94/221 [05:28<08:24, 3.97s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 43%|███████████████████████████████████▏ | 95/221 [05:32<08:22, 3.99s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 43%|███████████████████████████████████▌ | 96/221 [05:36<08:12, 3.94s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 44%|███████████████████████████████████▉ | 97/221 [05:40<08:24, 4.07s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 44%|████████████████████████████████████▎ | 98/221 [05:44<08:09, 3.98s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 45%|████████████████████████████████████▋ | 99/221 [05:47<07:23, 3.64s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 45%|████████████████████████████████████▋ | 100/221 [05:50<07:20, 3.64s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 46%|█████████████████████████████████████ | 101/221 [05:53<06:56, 3.47s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 46%|█████████████████████████████████████▍ | 102/221 [05:56<06:35, 3.33s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 47%|█████████████████████████████████████▊ | 103/221 [06:00<06:52, 3.49s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 47%|██████████████████████████████████████ | 104/221 [06:04<07:06, 3.64s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 48%|██████████████████████████████████████▍ | 105/221 [06:09<07:30, 3.88s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 48%|██████████████████████████████████████▊ | 106/221 [06:13<07:28, 3.90s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 48%|███████████████████████████████████████▏ | 107/221 [06:16<06:56, 3.66s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 49%|███████████████████████████████████████▌ | 108/221 [06:20<07:15, 3.85s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 49%|███████████████████████████████████████▉ | 109/221 [06:24<07:16, 3.90s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 50%|████████████████████████████████████████▎ | 110/221 [06:27<06:54, 3.73s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 50%|████████████████████████████████████████▋ | 111/221 [06:31<06:36, 3.60s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 51%|█████████████████████████████████████████ | 112/221 [06:35<06:39, 3.66s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 51%|█████████████████████████████████████████▍ | 113/221 [06:38<06:34, 3.65s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 52%|█████████████████████████████████████████▊ | 114/221 [06:41<06:19, 3.55s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 52%|██████████████████████████████████████████▏ | 115/221 [06:45<06:12, 3.52s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 52%|██████████████████████████████████████████▌ | 116/221 [06:48<05:57, 3.41s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 53%|██████████████████████████████████████████▉ | 117/221 [06:51<05:50, 3.37s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 53%|██████████████████████████████████████████▉ | 117/221 [06:51<05:50, 3.37s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 53%|██████████████████████████████████████████▉ | 117/221 [06:51<05:50, 3.37s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 54%|███████████████████████████████████████████▌ | 119/221 [06:59<06:21, 3.74s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 54%|███████████████████████████████████████████▉ | 120/221 [07:04<06:35, 3.92s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 55%|████████████████████████████████████████████▎ | 121/221 [07:07<06:21, 3.81s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 55%|████████████████████████████████████████████▋ | 122/221 [07:10<05:37, 3.41s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 56%|█████████████████████████████████████████████ | 123/221 [07:12<05:05, 3.12s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 56%|█████████████████████████████████████████████▍ | 124/221 [07:15<05:02, 3.11s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 57%|█████████████████████████████████████████████▊ | 125/221 [07:19<05:19, 3.33s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 57%|██████████████████████████████████████████████▏ | 126/221 [07:22<05:02, 3.18s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 57%|██████████████████████████████████████████████▌ | 127/221 [07:25<04:50, 3.09s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 58%|██████████████████████████████████████████████▉ | 128/221 [07:28<04:33, 2.94s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 58%|███████████████████████████████████████████████▎ | 129/221 [07:31<04:51, 3.17s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 59%|███████████████████████████████████████████████▋ | 130/221 [07:34<04:34, 3.01s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 59%|████████████████████████████████████████████████ | 131/221 [07:38<04:48, 3.20s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 60%|████████████████████████████████████████████████▍ | 132/221 [07:40<04:29, 3.03s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 60%|████████████████████████████████████████████████▋ | 133/221 [07:43<04:30, 3.07s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 61%|█████████████████████████████████████████████████ | 134/221 [07:46<04:18, 2.97s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 61%|█████████████████████████████████████████████████▍ | 135/221 [07:49<04:23, 3.06s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 62%|█████████████████████████████████████████████████▊ | 136/221 [07:53<04:36, 3.26s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 62%|██████████████████████████████████████████████████▏ | 137/221 [07:56<04:38, 3.31s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 62%|██████████████████████████████████████████████████▌ | 138/221 [08:00<04:47, 3.46s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 63%|██████████████████████████████████████████████████▉ | 139/221 [08:04<04:48, 3.51s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 63%|███████████████████████████████████████████████████▎ | 140/221 [08:06<04:18, 3.19s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 64%|███████████████████████████████████████████████████▋ | 141/221 [08:10<04:14, 3.18s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 64%|████████████████████████████████████████████████████ | 142/221 [08:13<04:07, 3.13s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 65%|████████████████████████████████████████████████████▍ | 143/221 [08:15<03:47, 2.91s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 65%|████████████████████████████████████████████████████▊ | 144/221 [08:19<04:04, 3.17s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 66%|█████████████████████████████████████████████████████▏ | 145/221 [08:22<03:58, 3.14s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 66%|█████████████████████████████████████████████████████▌ | 146/221 [08:25<04:08, 3.31s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 67%|█████████████████████████████████████████████████████▉ | 147/221 [08:28<03:53, 3.15s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 67%|██████████████████████████████████████████████████████▏ | 148/221 [08:32<03:52, 3.19s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 67%|██████████████████████████████████████████████████████▌ | 149/221 [08:35<03:52, 3.23s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 68%|██████████████████████████████████████████████████████▉ | 150/221 [08:38<03:47, 3.21s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 68%|███████████████████████████████████████████████████████▎ | 151/221 [08:42<03:56, 3.38s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 69%|███████████████████████████████████████████████████████▋ | 152/221 [08:45<03:49, 3.33s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 69%|████████████████████████████████████████████████████████ | 153/221 [08:48<03:42, 3.27s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 70%|████████████████████████████████████████████████████████▍ | 154/221 [08:52<03:43, 3.34s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 70%|████████████████████████████████████████████████████████▊ | 155/221 [08:55<03:41, 3.36s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 71%|█████████████████████████████████████████████████████████▏ | 156/221 [08:59<03:42, 3.42s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 71%|█████████████████████████████████████████████████████████▌ | 157/221 [09:01<03:26, 3.22s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 71%|█████████████████████████████████████████████████████████▉ | 158/221 [09:06<03:49, 3.65s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 72%|██████████████████████████████████████████████████████████▎ | 159/221 [09:10<03:47, 3.68s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 72%|██████████████████████████████████████████████████████████▎ | 159/221 [09:10<03:47, 3.68s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 72%|██████████████████████████████████████████████████████████▎ | 159/221 [09:10<03:47, 3.68s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 73%|███████████████████████████████████████████████████████████ | 161/221 [09:18<03:55, 3.92s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 73%|███████████████████████████████████████████████████████████▍ | 162/221 [09:22<03:51, 3.92s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 74%|███████████████████████████████████████████████████████████▋ | 163/221 [09:26<03:54, 4.05s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 74%|████████████████████████████████████████████████████████████ | 164/221 [09:31<03:59, 4.20s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 75%|████████████████████████████████████████████████████████████▍ | 165/221 [09:34<03:44, 4.01s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 75%|████████████████████████████████████████████████████████████▊ | 166/221 [09:38<03:25, 3.73s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 76%|█████████████████████████████████████████████████████████████▏ | 167/221 [09:41<03:18, 3.67s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 76%|█████████████████████████████████████████████████████████████▌ | 168/221 [09:44<03:05, 3.49s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 76%|█████████████████████████████████████████████████████████████▉ | 169/221 [09:48<03:06, 3.59s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 77%|██████████████████████████████████████████████████████████████▎ | 170/221 [09:52<03:06, 3.65s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 77%|██████████████████████████████████████████████████████████████▋ | 171/221 [09:55<03:03, 3.66s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 78%|███████████████████████████████████████████████████████████████ | 172/221 [09:59<02:54, 3.57s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 78%|███████████████████████████████████████████████████████████████▍ | 173/221 [10:02<02:49, 3.54s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 79%|███████████████████████████████████████████████████████████████▊ | 174/221 [10:05<02:40, 3.41s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 79%|████████████████████████████████████████████████████████████████▏ | 175/221 [10:08<02:32, 3.32s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 80%|████████████████████████████████████████████████████████████████▌ | 176/221 [10:12<02:36, 3.47s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 80%|████████████████████████████████████████████████████████████████▊ | 177/221 [10:15<02:24, 3.29s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 81%|█████████████████████████████████████████████████████████████████▏ | 178/221 [10:19<02:27, 3.44s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 81%|█████████████████████████████████████████████████████████████████▌ | 179/221 [10:22<02:22, 3.39s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 81%|█████████████████████████████████████████████████████████████████▉ | 180/221 [10:26<02:28, 3.62s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 82%|██████████████████████████████████████████████████████████████████▎ | 181/221 [10:30<02:28, 3.71s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 82%|██████████████████████████████████████████████████████████████████▋ | 182/221 [10:34<02:23, 3.69s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 83%|███████████████████████████████████████████████████████████████████ | 183/221 [10:38<02:26, 3.86s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 83%|███████████████████████████████████████████████████████████████████▍ | 184/221 [10:42<02:21, 3.83s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|███████████████████████████████████████████████████████████████████▊ | 185/221 [10:45<02:11, 3.66s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|████████████████████████████████████████████████████████████████████▏ | 186/221 [10:50<02:16, 3.89s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████████▌ | 187/221 [10:53<02:07, 3.76s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████████▉ | 188/221 [10:57<02:07, 3.87s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|█████████████████████████████████████████████████████████████████████▎ | 189/221 [11:01<02:05, 3.93s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|█████████████████████████████████████████████████████████████████████▋ | 190/221 [11:06<02:06, 4.08s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|██████████████████████████████████████████████████████████████████████ | 191/221 [11:10<02:06, 4.23s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|██████████████████████████████████████████████████████████████████████▎ | 192/221 [11:15<02:02, 4.24s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|██████████████████████████████████████████████████████████████████████▋ | 193/221 [11:18<01:50, 3.94s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|███████████████████████████████████████████████████████████████████████ | 194/221 [11:21<01:39, 3.70s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|███████████████████████████████████████████████████████████████████████▍ | 195/221 [11:24<01:31, 3.51s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|███████████████████████████████████████████████████████████████████████▊ | 196/221 [11:27<01:25, 3.42s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|████████████████████████████████████████████████████████████████████████▏ | 197/221 [11:30<01:17, 3.24s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|████████████████████████████████████████████████████████████████████████▌ | 198/221 [11:34<01:21, 3.53s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|████████████████████████████████████████████████████████████████████████▉ | 199/221 [11:39<01:23, 3.80s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|█████████████████████████████████████████████████████████████████████████▎ | 200/221 [11:42<01:17, 3.67s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|█████████████████████████████████████████████████████████████████████████▋ | 201/221 [11:45<01:10, 3.54s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|██████████████████████████████████████████████████████████████████████████ | 202/221 [11:48<01:02, 3.31s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 92%|██████████████████████████████████████████████████████████████████████████▍ | 203/221 [11:52<01:01, 3.40s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 92%|██████████████████████████████████████████████████████████████████████████▊ | 204/221 [11:56<01:02, 3.70s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|███████████████████████████████████████████████████████████████████████████▏ | 205/221 [12:01<01:04, 4.02s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|███████████████████████████████████████████████████████████████████████████▌ | 206/221 [12:05<01:02, 4.18s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|███████████████████████████████████████████████████████████████████████████▊ | 207/221 [12:09<00:54, 3.93s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|████████████████████████████████████████████████████████████████████████████▏ | 208/221 [12:12<00:50, 3.86s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|████████████████████████████████████████████████████████████████████████████▌ | 209/221 [12:16<00:43, 3.66s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|████████████████████████████████████████████████████████████████████████████▉ | 210/221 [12:20<00:41, 3.75s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|█████████████████████████████████████████████████████████████████████████████▎ | 211/221 [12:24<00:39, 3.93s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|█████████████████████████████████████████████████████████████████████████████▋ | 212/221 [12:28<00:34, 3.83s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|██████████████████████████████████████████████████████████████████████████████ | 213/221 [12:31<00:28, 3.58s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|██████████████████████████████████████████████████████████████████████████████▍ | 214/221 [12:34<00:24, 3.56s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|██████████████████████████████████████████████████████████████████████████████▊ | 215/221 [12:38<00:22, 3.74s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|███████████████████████████████████████████████████████████████████████████████▏ | 216/221 [12:42<00:19, 3.86s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|███████████████████████████████████████████████████████████████████████████████▌ | 217/221 [12:46<00:15, 3.87s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|███████████████████████████████████████████████████████████████████████████████▉ | 218/221 [12:50<00:11, 3.83s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|████████████████████████████████████████████████████████████████████████████████▎| 219/221 [12:54<00:07, 3.81s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 100%|████████████████████████████████████████████████████████████████████████████████▋| 220/221 [12:58<00:03, 4.00s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 100%|████████████████████████████████████████████████████████████████████████████████▋| 220/221 [12:58<00:03, 4.00s/it][INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 03/02/2022 19:22:36 - INFO - datasets.metric - Removing /home/sanchit_huggingface_co/.cache/huggingface/metrics/wer/default/default_experiment-1-0.arrow [INFO|configuration_utils.py:438] 2022-03-02 19:22:36,240 >> Configuration saved in ./checkpoint-500/config.json [INFO|trainer.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|feature_extraction_utils.py:324] 2022-03-02 19:22:41,354 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|feature_extraction_utils.py:324] 2022-03-02 19:22:41,354 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|feature_extraction_utils.py:324] 2022-03-02 19:22:41,354 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 03/02/2022 19:24:13 - WARNING - huggingface_hub.repository - Adding files tracked by Git LFS: ['wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb']. This may take a bit of time if the files are large. [INFO|feature_extraction_utils.py:324] 2022-03-02 19:22:41,354 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|feature_extraction_utils.py:324] 2022-03-02 19:22:41,354 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|feature_extraction_utils.py:324] 2022-03-02 19:22:41,354 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|feature_extraction_utils.py:324] 2022-03-02 19:22:41,354 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|feature_extraction_utils.py:324] 2022-03-02 19:22:41,354 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|████████████████████████████████████████████████████████████████ | 501/594 [1:22:35<7:11:51, 278.62s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|████████████████████████████████████████████████████████████████ | 501/594 [1:22:35<7:11:51, 278.62s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|████████████████████████████████████████████████████████████████ | 501/594 [1:22:35<7:11:51, 278.62s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|████████████████████████████████████████████████████████████████ | 501/594 [1:22:35<7:11:51, 278.62s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████▏ | 502/594 [1:22:46<5:04:00, 198.27s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████▏ | 502/594 [1:22:46<5:04:00, 198.27s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.6392, 'learning_rate': 0.0003, 'epoch': 0.84} 85%|████████████████████████████████████████████████████████████████▏ | 502/594 [1:22:46<5:04:00, 198.27s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████▏ | 502/594 [1:22:46<5:04:00, 198.27s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████▎ | 503/594 [1:22:57<3:35:20, 141.98s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████▎ | 503/594 [1:22:57<3:35:20, 141.98s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.7407, 'learning_rate': 0.00029680851063829784, 'epoch': 0.85} 85%|████████████████████████████████████████████████████████████████▎ | 503/594 [1:22:57<3:35:20, 141.98s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████▎ | 503/594 [1:22:57<3:35:20, 141.98s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████▍ | 504/594 [1:23:07<2:33:46, 102.52s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████▍ | 504/594 [1:23:07<2:33:46, 102.52s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.9224, 'learning_rate': 0.0002936170212765957, 'epoch': 0.85} 85%|████████████████████████████████████████████████████████████████▍ | 504/594 [1:23:07<2:33:46, 102.52s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████▍ | 504/594 [1:23:07<2:33:46, 102.52s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.7926, 'learning_rate': 0.0002904255319148936, 'epoch': 0.85} onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▌ | 506/594 [1:23:28<1:21:27, 55.54s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▌ | 506/594 [1:23:28<1:21:27, 55.54s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 5.0733, 'learning_rate': 0.0002872340425531915, 'epoch': 0.85} 85%|█████████████████████████████████████████████████████████████████▌ | 506/594 [1:23:28<1:21:27, 55.54s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▌ | 506/594 [1:23:28<1:21:27, 55.54s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▌ | 506/594 [1:23:28<1:21:27, 55.54s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▋ | 507/594 [1:23:38<1:00:50, 41.96s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▋ | 507/594 [1:23:38<1:00:50, 41.96s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▋ | 507/594 [1:23:38<1:00:50, 41.96s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▋ | 507/594 [1:23:38<1:00:50, 41.96s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▋ | 507/594 [1:23:38<1:00:50, 41.96s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▋ | 507/594 [1:23:38<1:00:50, 41.96s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.7175, 'learning_rate': 0.0002808510638297872, 'epoch': 0.85} 85%|█████████████████████████████████████████████████████████████████▋ | 507/594 [1:23:38<1:00:50, 41.96s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▋ | 507/594 [1:23:38<1:00:50, 41.96s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▋ | 507/594 [1:23:38<1:00:50, 41.96s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▋ | 507/594 [1:23:38<1:00:50, 41.96s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▋ | 509/594 [1:23:59<36:36, 25.84s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▋ | 509/594 [1:23:59<36:36, 25.84s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▋ | 509/594 [1:23:59<36:36, 25.84s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▋ | 509/594 [1:23:59<36:36, 25.84s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▊ | 510/594 [1:24:09<29:28, 21.05s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▊ | 510/594 [1:24:09<29:28, 21.05s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.7577, 'learning_rate': 0.000274468085106383, 'epoch': 0.86} 86%|███████████████████████████████████████████████████████████████████▊ | 510/594 [1:24:09<29:28, 21.05s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▊ | 510/594 [1:24:09<29:28, 21.05s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▉ | 511/594 [1:24:19<24:26, 17.67s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▉ | 511/594 [1:24:19<24:26, 17.67s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.7992, 'learning_rate': 0.00027127659574468084, 'epoch': 0.86} 86%|███████████████████████████████████████████████████████████████████▉ | 511/594 [1:24:19<24:26, 17.67s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▉ | 511/594 [1:24:19<24:26, 17.67s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|████████████████████████████████████████████████████████████████████ | 512/594 [1:24:29<21:07, 15.46s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|████████████████████████████████████████████████████████████████████ | 512/594 [1:24:29<21:07, 15.46s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.5604, 'learning_rate': 0.0002680851063829787, 'epoch': 0.86} 86%|████████████████████████████████████████████████████████████████████ | 512/594 [1:24:29<21:07, 15.46s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|████████████████████████████████████████████████████████████████████ | 512/594 [1:24:29<21:07, 15.46s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|████████████████████████████████████████████████████████████████████ | 512/594 [1:24:29<21:07, 15.46s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|████████████████████████████████████████████████████████████████████▏ | 513/594 [1:24:39<18:32, 13.73s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|████████████████████████████████████████████████████████████████████▏ | 513/594 [1:24:39<18:32, 13.73s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|████████████████████████████████████████████████████████████████████▏ | 513/594 [1:24:39<18:32, 13.73s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|████████████████████████████████████████████████████████████████████▏ | 513/594 [1:24:39<18:32, 13.73s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|████████████████████████████████████████████████████████████████████▏ | 513/594 [1:24:39<18:32, 13.73s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▎ | 514/594 [1:24:48<16:46, 12.59s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▎ | 514/594 [1:24:48<16:46, 12.59s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▎ | 514/594 [1:24:48<16:46, 12.59s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▎ | 514/594 [1:24:48<16:46, 12.59s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▍ | 515/594 [1:24:58<15:24, 11.70s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▍ | 515/594 [1:24:58<15:24, 11.70s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.6264, 'learning_rate': 0.0002585106382978723, 'epoch': 0.87} 87%|████████████████████████████████████████████████████████████████████▍ | 515/594 [1:24:58<15:24, 11.70s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▍ | 515/594 [1:24:58<15:24, 11.70s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▍ | 515/594 [1:24:58<15:24, 11.70s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▋ | 516/594 [1:25:07<14:17, 10.99s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▋ | 516/594 [1:25:07<14:17, 10.99s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▋ | 516/594 [1:25:07<14:17, 10.99s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▋ | 516/594 [1:25:07<14:17, 10.99s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▊ | 517/594 [1:25:17<13:25, 10.46s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▊ | 517/594 [1:25:17<13:25, 10.46s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.5963, 'learning_rate': 0.00025212765957446806, 'epoch': 0.87} 87%|████████████████████████████████████████████████████████████████████▊ | 517/594 [1:25:17<13:25, 10.46s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▊ | 517/594 [1:25:17<13:25, 10.46s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▊ | 517/594 [1:25:17<13:25, 10.46s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▉ | 518/594 [1:25:26<12:44, 10.06s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▉ | 518/594 [1:25:26<12:44, 10.06s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▉ | 518/594 [1:25:26<12:44, 10.06s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▉ | 518/594 [1:25:26<12:44, 10.06s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|█████████████████████████████████████████████████████████████████████ | 519/594 [1:25:35<12:15, 9.81s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|█████████████████████████████████████████████████████████████████████ | 519/594 [1:25:35<12:15, 9.81s/it]onfig.jsonner.py:560] 2022-03-02 19:09:31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.5033, 'learning_rate': 0.00024574468085106384, 'epoch': 0.87} [WARNING|modeling_utils.py:388] 2022-03-02 19:27:56,169 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:27:56,169 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:27:56,169 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▏ | 520/594 [1:25:44<11:49, 9.59s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▏ | 520/594 [1:25:44<11:49, 9.59s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▏ | 520/594 [1:25:44<11:49, 9.59s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▏ | 520/594 [1:25:44<11:49, 9.59s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▎ | 521/594 [1:25:53<11:31, 9.48s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▎ | 521/594 [1:25:53<11:31, 9.48s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.6696, 'learning_rate': 0.00023936170212765956, 'epoch': 0.88} 88%|█████████████████████████████████████████████████████████████████████▎ | 521/594 [1:25:53<11:31, 9.48s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▎ | 521/594 [1:25:53<11:31, 9.48s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▍ | 522/594 [1:26:02<11:12, 9.34s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▍ | 522/594 [1:26:02<11:12, 9.34s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.8345, 'learning_rate': 0.00023617021276595742, 'epoch': 0.88} 88%|█████████████████████████████████████████████████████████████████████▍ | 522/594 [1:26:02<11:12, 9.34s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:28:27,876 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:28:27,876 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.3613, 'learning_rate': 0.00023297872340425529, 'epoch': 0.88} [WARNING|modeling_utils.py:388] 2022-03-02 19:28:27,876 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:28:27,876 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▋ | 524/594 [1:26:20<10:38, 9.13s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▋ | 524/594 [1:26:20<10:38, 9.13s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.1583, 'learning_rate': 0.00022978723404255317, 'epoch': 0.88} 88%|█████████████████████████████████████████████████████████████████████▋ | 524/594 [1:26:20<10:38, 9.13s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▋ | 524/594 [1:26:20<10:38, 9.13s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▋ | 524/594 [1:26:20<10:38, 9.13s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▊ | 525/594 [1:26:30<10:35, 9.21s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▊ | 525/594 [1:26:30<10:35, 9.21s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▊ | 525/594 [1:26:30<10:35, 9.21s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▊ | 525/594 [1:26:30<10:35, 9.21s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|█████████████████████████████████████████████████████████████████████▉ | 526/594 [1:26:38<10:20, 9.12s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|█████████████████████████████████████████████████████████████████████▉ | 526/594 [1:26:38<10:20, 9.12s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.7744, 'learning_rate': 0.0002234042553191489, 'epoch': 0.88} 89%|█████████████████████████████████████████████████████████████████████▉ | 526/594 [1:26:38<10:20, 9.12s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|█████████████████████████████████████████████████████████████████████▉ | 526/594 [1:26:38<10:20, 9.12s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████ | 527/594 [1:26:47<10:04, 9.03s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████ | 527/594 [1:26:47<10:04, 9.03s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.6864, 'learning_rate': 0.00022021276595744679, 'epoch': 0.89} 89%|██████████████████████████████████████████████████████████████████████ | 527/594 [1:26:47<10:04, 9.03s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:29:12,585 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:29:12,585 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:29:12,585 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.4407, 'learning_rate': 0.00021702127659574468, 'epoch': 0.89} [WARNING|modeling_utils.py:388] 2022-03-02 19:29:12,585 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:29:12,585 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████▎ | 529/594 [1:27:05<09:33, 8.82s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████▎ | 529/594 [1:27:05<09:33, 8.82s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.6176, 'learning_rate': 0.00021382978723404254, 'epoch': 0.89} 89%|██████████████████████████████████████████████████████████████████████▎ | 529/594 [1:27:05<09:33, 8.82s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████▎ | 529/594 [1:27:05<09:33, 8.82s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████▍ | 530/594 [1:27:13<09:19, 8.74s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████▍ | 530/594 [1:27:13<09:19, 8.74s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.3369, 'learning_rate': 0.0002106382978723404, 'epoch': 0.89} 89%|██████████████████████████████████████████████████████████████████████▍ | 530/594 [1:27:13<09:19, 8.74s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████▍ | 530/594 [1:27:13<09:19, 8.74s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████▌ | 531/594 [1:27:22<09:05, 8.66s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████▌ | 531/594 [1:27:22<09:05, 8.66s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.4068, 'learning_rate': 0.0002074468085106383, 'epoch': 0.89} 89%|██████████████████████████████████████████████████████████████████████▌ | 531/594 [1:27:22<09:05, 8.66s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████▌ | 531/594 [1:27:22<09:05, 8.66s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|██████████████████████████████████████████████████████████████████████▊ | 532/594 [1:27:30<08:54, 8.61s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|██████████████████████████████████████████████████████████████████████▊ | 532/594 [1:27:30<08:54, 8.61s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.7584, 'learning_rate': 0.00020425531914893615, 'epoch': 0.89} 90%|██████████████████████████████████████████████████████████████████████▊ | 532/594 [1:27:30<08:54, 8.61s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|██████████████████████████████████████████████████████████████████████▊ | 532/594 [1:27:30<08:54, 8.61s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|██████████████████████████████████████████████████████████████████████▉ | 533/594 [1:27:38<08:38, 8.50s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|██████████████████████████████████████████████████████████████████████▉ | 533/594 [1:27:38<08:38, 8.50s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.6745, 'learning_rate': 0.00020106382978723404, 'epoch': 0.9} 90%|██████████████████████████████████████████████████████████████████████▉ | 533/594 [1:27:38<08:38, 8.50s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|██████████████████████████████████████████████████████████████████████▉ | 533/594 [1:27:38<08:38, 8.50s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|██████████████████████████████████████████████████████████████████████▉ | 533/594 [1:27:38<08:38, 8.50s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████ | 534/594 [1:27:46<08:22, 8.37s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████ | 534/594 [1:27:46<08:22, 8.37s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████ | 534/594 [1:27:46<08:22, 8.37s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████ | 534/594 [1:27:46<08:22, 8.37s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████ | 534/594 [1:27:46<08:22, 8.37s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████▏ | 535/594 [1:27:54<08:04, 8.22s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████▏ | 535/594 [1:27:54<08:04, 8.22s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████▏ | 535/594 [1:27:54<08:04, 8.22s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████▏ | 535/594 [1:27:54<08:04, 8.22s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████▏ | 535/594 [1:27:54<08:04, 8.22s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████▎ | 536/594 [1:28:02<07:47, 8.07s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████▎ | 536/594 [1:28:02<07:47, 8.07s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████▎ | 536/594 [1:28:02<07:47, 8.07s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████▎ | 536/594 [1:28:02<07:47, 8.07s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████▎ | 536/594 [1:28:02<07:47, 8.07s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████▍ | 537/594 [1:28:10<07:31, 7.91s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:30:29,958 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:30:29,958 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:30:29,958 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|███████████████████████████████████████████████████████████████████████▌ | 538/594 [1:28:17<07:15, 7.78s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|███████████████████████████████████████████████████████████████████████▌ | 538/594 [1:28:17<07:15, 7.78s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|███████████████████████████████████████████████████████████████████████▌ | 538/594 [1:28:17<07:15, 7.78s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|███████████████████████████████████████████████████████████████████████▌ | 538/594 [1:28:17<07:15, 7.78s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|███████████████████████████████████████████████████████████████████████▌ | 538/594 [1:28:17<07:15, 7.78s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|███████████████████████████████████████████████████████████████████████▋ | 539/594 [1:28:24<06:57, 7.60s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:30:44,310 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:30:44,310 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:30:44,310 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|███████████████████████████████████████████████████████████████████████▊ | 540/594 [1:28:31<06:39, 7.40s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|███████████████████████████████████████████████████████████████████████▊ | 540/594 [1:28:31<06:39, 7.40s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:30:52,612 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|███████████████████████████████████████████████████████████████████████▉ | 541/594 [1:28:38<06:17, 7.13s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|███████████████████████████████████████████████████████████████████████▉ | 541/594 [1:28:38<06:17, 7.13s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.6673, 'learning_rate': 0.000175531914893617, 'epoch': 0.91} 91%|███████████████████████████████████████████████████████████████████████▉ | 541/594 [1:28:38<06:17, 7.13s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:00,395 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:00,395 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.7293, 'learning_rate': 0.0001723404255319149, 'epoch': 0.91} [WARNING|modeling_utils.py:388] 2022-03-02 19:31:04,826 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|████████████████████████████████████████████████████████████████████████▏ | 543/594 [1:28:50<05:33, 6.55s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|████████████████████████████████████████████████████████████████████████▏ | 543/594 [1:28:50<05:33, 6.55s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.4276, 'learning_rate': 0.00016914893617021274, 'epoch': 0.91} [WARNING|modeling_utils.py:388] 2022-03-02 19:31:10,236 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:10,236 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 92%|████████████████████████████████████████████████████████████████████████▎ | 544/594 [1:28:55<05:07, 6.15s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:13,961 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:13,961 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:16,197 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:18,398 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:18,398 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:20,343 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:22,326 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:22,326 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:24,082 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:25,828 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:25,828 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:28,975 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:28,975 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:30,298 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:31,587 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:31,587 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:33,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:33,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:38,824 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:38,824 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:31:38,824 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▎ | 551/594 [1:29:27<04:07, 5.77s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▎ | 551/594 [1:29:27<04:07, 5.77s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▎ | 551/594 [1:29:27<04:07, 5.77s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▎ | 551/594 [1:29:27<04:07, 5.77s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▎ | 551/594 [1:29:27<04:07, 5.77s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▍ | 552/594 [1:29:38<04:57, 7.09s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▍ | 552/594 [1:29:38<04:57, 7.09s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▍ | 552/594 [1:29:38<04:57, 7.09s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▍ | 552/594 [1:29:38<04:57, 7.09s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▌ | 553/594 [1:29:48<05:26, 7.98s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▌ | 553/594 [1:29:48<05:26, 7.98s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.5965, 'learning_rate': 0.0001372340425531915, 'epoch': 0.93} 93%|█████████████████████████████████████████████████████████████████████████▌ | 553/594 [1:29:48<05:26, 7.98s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▌ | 553/594 [1:29:48<05:26, 7.98s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▌ | 553/594 [1:29:48<05:26, 7.98s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▋ | 554/594 [1:29:58<05:45, 8.63s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▋ | 554/594 [1:29:58<05:45, 8.63s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▋ | 554/594 [1:29:58<05:45, 8.63s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▋ | 554/594 [1:29:58<05:45, 8.63s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▊ | 555/594 [1:30:08<05:51, 9.02s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▊ | 555/594 [1:30:08<05:51, 9.02s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.4921, 'learning_rate': 0.00013085106382978724, 'epoch': 0.93} 93%|█████████████████████████████████████████████████████████████████████████▊ | 555/594 [1:30:08<05:51, 9.02s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▊ | 555/594 [1:30:08<05:51, 9.02s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|█████████████████████████████████████████████████████████████████████████▉ | 556/594 [1:30:18<05:57, 9.40s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|█████████████████████████████████████████████████████████████████████████▉ | 556/594 [1:30:18<05:57, 9.40s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.547, 'learning_rate': 0.0001276595744680851, 'epoch': 0.93} 94%|█████████████████████████████████████████████████████████████████████████▉ | 556/594 [1:30:18<05:57, 9.40s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|█████████████████████████████████████████████████████████████████████████▉ | 556/594 [1:30:18<05:57, 9.40s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████ | 557/594 [1:30:28<05:55, 9.59s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████ | 557/594 [1:30:28<05:55, 9.59s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.6281, 'learning_rate': 0.00012446808510638296, 'epoch': 0.94} 94%|██████████████████████████████████████████████████████████████████████████ | 557/594 [1:30:28<05:55, 9.59s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████ | 557/594 [1:30:28<05:55, 9.59s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▏ | 558/594 [1:30:38<05:48, 9.67s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▏ | 558/594 [1:30:38<05:48, 9.67s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.5267, 'learning_rate': 0.00012127659574468084, 'epoch': 0.94} 94%|██████████████████████████████████████████████████████████████████████████▏ | 558/594 [1:30:38<05:48, 9.67s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▏ | 558/594 [1:30:38<05:48, 9.67s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▎ | 559/594 [1:30:48<05:40, 9.73s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▎ | 559/594 [1:30:48<05:40, 9.73s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.3407, 'learning_rate': 0.00011808510638297871, 'epoch': 0.94} 94%|██████████████████████████████████████████████████████████████████████████▎ | 559/594 [1:30:48<05:40, 9.73s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▎ | 559/594 [1:30:48<05:40, 9.73s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▎ | 559/594 [1:30:48<05:40, 9.73s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▎ | 559/594 [1:30:48<05:40, 9.73s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.2849, 'learning_rate': 0.00011489361702127659, 'epoch': 0.94} 94%|██████████████████████████████████████████████████████████████████████████▎ | 559/594 [1:30:48<05:40, 9.73s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▎ | 559/594 [1:30:48<05:40, 9.73s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▎ | 559/594 [1:30:48<05:40, 9.73s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▌ | 561/594 [1:31:07<05:18, 9.64s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▌ | 561/594 [1:31:07<05:18, 9.64s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.7138, 'learning_rate': 0.00011170212765957445, 'epoch': 0.94} 94%|██████████████████████████████████████████████████████████████████████████▌ | 561/594 [1:31:07<05:18, 9.64s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▌ | 561/594 [1:31:07<05:18, 9.64s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|██████████████████████████████████████████████████████████████████████████▋ | 562/594 [1:31:17<05:08, 9.64s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|██████████████████████████████████████████████████████████████████████████▋ | 562/594 [1:31:17<05:08, 9.64s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.3811, 'learning_rate': 0.00010851063829787234, 'epoch': 0.94} 95%|██████████████████████████████████████████████████████████████████████████▋ | 562/594 [1:31:17<05:08, 9.64s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|██████████████████████████████████████████████████████████████████████████▋ | 562/594 [1:31:17<05:08, 9.64s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|██████████████████████████████████████████████████████████████████████████▋ | 562/594 [1:31:17<05:08, 9.64s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|██████████████████████████████████████████████████████████████████████████▉ | 563/594 [1:31:26<04:56, 9.56s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|██████████████████████████████████████████████████████████████████████████▉ | 563/594 [1:31:26<04:56, 9.56s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|██████████████████████████████████████████████████████████████████████████▉ | 563/594 [1:31:26<04:56, 9.56s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|██████████████████████████████████████████████████████████████████████████▉ | 563/594 [1:31:26<04:56, 9.56s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|██████████████████████████████████████████████████████████████████████████▉ | 563/594 [1:31:26<04:56, 9.56s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████ | 564/594 [1:31:35<04:44, 9.47s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████ | 564/594 [1:31:35<04:44, 9.47s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████ | 564/594 [1:31:35<04:44, 9.47s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████ | 564/594 [1:31:35<04:44, 9.47s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████ | 564/594 [1:31:35<04:44, 9.47s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▏ | 565/594 [1:31:44<04:33, 9.44s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▏ | 565/594 [1:31:44<04:33, 9.44s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▏ | 565/594 [1:31:44<04:33, 9.44s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▏ | 565/594 [1:31:44<04:33, 9.44s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▏ | 565/594 [1:31:44<04:33, 9.44s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▎ | 566/594 [1:31:54<04:21, 9.35s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▎ | 566/594 [1:31:54<04:21, 9.35s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▎ | 566/594 [1:31:54<04:21, 9.35s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▎ | 566/594 [1:31:54<04:21, 9.35s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▍ | 567/594 [1:32:03<04:11, 9.30s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▍ | 567/594 [1:32:03<04:11, 9.30s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.4626, 'learning_rate': 9.25531914893617e-05, 'epoch': 0.95} 95%|███████████████████████████████████████████████████████████████████████████▍ | 567/594 [1:32:03<04:11, 9.30s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▍ | 567/594 [1:32:03<04:11, 9.30s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▍ | 567/594 [1:32:03<04:11, 9.30s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▌ | 568/594 [1:32:12<04:00, 9.27s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▌ | 568/594 [1:32:12<04:00, 9.27s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▌ | 568/594 [1:32:12<04:00, 9.27s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▌ | 568/594 [1:32:12<04:00, 9.27s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▋ | 569/594 [1:32:21<03:51, 9.24s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▋ | 569/594 [1:32:21<03:51, 9.24s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.3564, 'learning_rate': 8.617021276595745e-05, 'epoch': 0.96} 96%|███████████████████████████████████████████████████████████████████████████▋ | 569/594 [1:32:21<03:51, 9.24s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▋ | 569/594 [1:32:21<03:51, 9.24s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▋ | 569/594 [1:32:21<03:51, 9.24s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▊ | 570/594 [1:32:30<03:40, 9.17s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▊ | 570/594 [1:32:30<03:40, 9.17s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▊ | 570/594 [1:32:30<03:40, 9.17s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▊ | 570/594 [1:32:30<03:40, 9.17s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▉ | 571/594 [1:32:39<03:29, 9.12s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▉ | 571/594 [1:32:39<03:29, 9.12s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.5387, 'learning_rate': 7.978723404255319e-05, 'epoch': 0.96} 96%|███████████████████████████████████████████████████████████████████████████▉ | 571/594 [1:32:39<03:29, 9.12s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▉ | 571/594 [1:32:39<03:29, 9.12s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|████████████████████████████████████████████████████████████████████████████ | 572/594 [1:32:48<03:18, 9.04s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|████████████████████████████████████████████████████████████████████████████ | 572/594 [1:32:48<03:18, 9.04s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.4192, 'learning_rate': 7.659574468085105e-05, 'epoch': 0.96} 96%|████████████████████████████████████████████████████████████████████████████ | 572/594 [1:32:48<03:18, 9.04s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|████████████████████████████████████████████████████████████████████████████ | 572/594 [1:32:48<03:18, 9.04s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|████████████████████████████████████████████████████████████████████████████ | 572/594 [1:32:48<03:18, 9.04s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|████████████████████████████████████████████████████████████████████████████▏ | 573/594 [1:32:57<03:07, 8.94s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|████████████████████████████████████████████████████████████████████████████▏ | 573/594 [1:32:57<03:07, 8.94s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|████████████████████████████████████████████████████████████████████████████▏ | 573/594 [1:32:57<03:07, 8.94s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|████████████████████████████████████████████████████████████████████████████▏ | 573/594 [1:32:57<03:07, 8.94s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|████████████████████████████████████████████████████████████████████████████▏ | 573/594 [1:32:57<03:07, 8.94s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▎ | 574/594 [1:33:06<02:57, 8.88s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▎ | 574/594 [1:33:06<02:57, 8.88s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▎ | 574/594 [1:33:06<02:57, 8.88s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▎ | 574/594 [1:33:06<02:57, 8.88s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▎ | 574/594 [1:33:06<02:57, 8.88s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▎ | 574/594 [1:33:06<02:57, 8.88s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.3714, 'learning_rate': 6.702127659574467e-05, 'epoch': 0.97} 97%|████████████████████████████████████████████████████████████████████████████▎ | 574/594 [1:33:06<02:57, 8.88s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▎ | 574/594 [1:33:06<02:57, 8.88s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:35:39,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:35:39,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.4526, 'learning_rate': 6.382978723404255e-05, 'epoch': 0.97} [WARNING|modeling_utils.py:388] 2022-03-02 19:35:39,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:35:39,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:35:39,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:35:39,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.4474, 'learning_rate': 6.063829787234042e-05, 'epoch': 0.97} [WARNING|modeling_utils.py:388] 2022-03-02 19:35:52,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:35:52,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:35:52,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▊ | 578/594 [1:33:40<02:18, 8.64s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▊ | 578/594 [1:33:40<02:18, 8.64s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▊ | 578/594 [1:33:40<02:18, 8.64s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▊ | 578/594 [1:33:40<02:18, 8.64s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|█████████████████████████████████████████████████████████████████████████████ | 579/594 [1:33:48<02:08, 8.53s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|█████████████████████████████████████████████████████████████████████████████ | 579/594 [1:33:48<02:08, 8.53s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.3285, 'learning_rate': 5.425531914893617e-05, 'epoch': 0.97} 97%|█████████████████████████████████████████████████████████████████████████████ | 579/594 [1:33:48<02:08, 8.53s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|█████████████████████████████████████████████████████████████████████████████ | 579/594 [1:33:48<02:08, 8.53s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▏ | 580/594 [1:33:57<01:58, 8.47s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▏ | 580/594 [1:33:57<01:58, 8.47s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.3842, 'learning_rate': 5.106382978723404e-05, 'epoch': 0.98} 98%|█████████████████████████████████████████████████████████████████████████████▏ | 580/594 [1:33:57<01:58, 8.47s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▏ | 580/594 [1:33:57<01:58, 8.47s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▏ | 580/594 [1:33:57<01:58, 8.47s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▎ | 581/594 [1:34:05<01:48, 8.36s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▎ | 581/594 [1:34:05<01:48, 8.36s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▎ | 581/594 [1:34:05<01:48, 8.36s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▎ | 581/594 [1:34:05<01:48, 8.36s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▎ | 581/594 [1:34:05<01:48, 8.36s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▍ | 582/594 [1:34:13<01:38, 8.18s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▍ | 582/594 [1:34:13<01:38, 8.18s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▍ | 582/594 [1:34:13<01:38, 8.18s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▍ | 582/594 [1:34:13<01:38, 8.18s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▍ | 582/594 [1:34:13<01:38, 8.18s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▌ | 583/594 [1:34:20<01:28, 8.03s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▌ | 583/594 [1:34:20<01:28, 8.03s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▌ | 583/594 [1:34:20<01:28, 8.03s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▌ | 583/594 [1:34:20<01:28, 8.03s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▌ | 583/594 [1:34:20<01:28, 8.03s/it]g-point operations will not be computed31,884 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▋ | 584/594 [1:34:28<01:18, 7.83s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:36:46,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▋ | 584/594 [1:34:28<01:18, 7.83s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:36:46,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▋ | 584/594 [1:34:28<01:18, 7.83s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:36:46,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▋ | 584/594 [1:34:28<01:18, 7.83s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:36:46,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▊ | 585/594 [1:34:35<01:08, 7.59s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:36:46,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▊ | 585/594 [1:34:35<01:08, 7.59s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:36:46,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:36:56,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:36:46,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:36:56,414 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:36:46,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|█████████████████████████████████████████████████████████████████████████████▉ | 586/594 [1:34:41<00:58, 7.30s/it]g-point operations will not be computed-02 19:36:46,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|█████████████████████████████████████████████████████████████████████████████▉ | 586/594 [1:34:41<00:58, 7.30s/it]g-point operations will not be computed-02 19:36:46,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:37:02,641 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:36:46,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:37:02,641 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:36:46,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|██████████████████████████████████████████████████████████████████████████████ | 587/594 [1:34:48<00:48, 6.95s/it]g-point operations will not be computed-02 19:36:46,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|██████████████████████████████████████████████████████████████████████████████ | 587/594 [1:34:48<00:48, 6.95s/it]g-point operations will not be computed-02 19:36:46,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:37:08,474 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:36:46,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:37:08,474 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:36:46,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|██████████████████████████████████████████████████████████████████████████████▏| 588/594 [1:34:53<00:39, 6.58s/it]g-point operations will not be computed-02 19:36:46,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:37:12,509 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:36:46,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:37:15,067 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:36:46,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:37:15,067 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:36:46,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.4791, 'learning_rate': 2.234042553191489e-05, 'epoch': 0.99} [WARNING|modeling_utils.py:388] 2022-03-02 19:37:18,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:36:46,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:37:18,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:36:46,188 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|██████████████████████████████████████████████████████████████████████████████▍| 590/594 [1:35:03<00:23, 5.76s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:37:21,025 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:37:23,074 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:37:21,025 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:37:23,074 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:37:21,025 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|██████████████████████████████████████████████████████████████████████████████▌| 591/594 [1:35:07<00:15, 5.29s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:37:25,112 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:37:26,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:37:25,112 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:37:26,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:37:25,112 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 100%|██████████████████████████████████████████████████████████████████████████████▋| 592/594 [1:35:11<00:09, 4.85s/it][WARNING|modeling_utils.py:388] 2022-03-02 19:37:28,817 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 100%|██████████████████████████████████████████████████████████████████████████████▊| 593/594 [1:35:15<00:04, 4.37s/it]g-point operations will not be computed-02 19:37:28,817 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 100%|██████████████████████████████████████████████████████████████████████████████▊| 593/594 [1:35:15<00:04, 4.37s/it]g-point operations will not be computed-02 19:37:28,817 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:37:33,238 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:37:31,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-03-02 19:37:33,238 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-02 19:37:31,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.4898, 'learning_rate': 6.382978723404255e-06, 'epoch': 1.0} [INFO|trainer.py:2114] 2022-03-02 19:37:33,976 >> Saving model checkpoint to ./=)███| 594/594 [1:35:17<00:00, 3.88s/it][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2114] 2022-03-02 19:37:50,389 >> Saving model checkpoint to ./ ./pytorch_model.bin:17<00:00, 3.88s/it][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|modeling_utils.py:1081] 2022-03-02 19:38:06,575 >> Model weights saved in ./pytorch_model.bin:17<00:00, 3.88s/it][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file pytorch_model.bin: 0%|▏ | 13.8M/2.99G [00:01<03:41, 14.4MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file pytorch_model.bin: 2%|▊ | 52.4M/2.99G [00:03<02:46, 19.0MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file pytorch_model.bin: 3%|█▍ | 94.8M/2.99G [00:05<02:28, 21.0MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file pytorch_model.bin: 4%|██▏ | 137M/2.99G [00:07<02:23, 21.4MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file pytorch_model.bin: 6%|██▊ | 178M/2.99G [00:09<02:20, 21.6MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file pytorch_model.bin: 7%|███▍ | 219M/2.99G [00:11<02:18, 21.6MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file pytorch_model.bin: 7%|███▍ | 219M/2.99G [00:11<02:18, 21.6MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file pytorch_model.bin: 7%|███▍ | 219M/2.99G [00:11<02:18, 21.6MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file pytorch_model.bin: 7%|███▍ | 219M/2.99G [00:11<02:18, 21.6MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|███████████| 34.4M/34.4M [00:19<00:00, 16.2MB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 03/02/2022 19:42:23 - WARNING - huggingface_hub.repository - To https://huggingface.co/sanchit-gandhi/wav2vec2-gpt2-wandb-grid-search Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 100%|████████████| 34.4M/34.4M [02:38<00:00, 153kB/s][INFO|trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|modelcard.py:460] 2022-03-02 19:42:26,793 >> Dropping the following result as it does not have all the necessary fields:trainer.py:1492] 2022-03-02 19:37:33,975 >> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 0%| | 32.0k/34.5M [00:00> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 0%| | 32.0k/34.5M [00:00> 1,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 03/02/2022 19:42:32 - WARNING - huggingface_hub.repository - To https://huggingface.co/sanchit-gandhi/wav2vec2-gpt2-wandb-grid-search Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 52%|█████▋ | 17.8M/34.5M [00:01<00:00, 18.6MB/s]To https://huggingface.co/sanchit-gandhi/wav2vec2-gpt2-wandb-grid-searchimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2366] 2022-03-02 19:42:35,380 >> Num examples = 2642in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message.ut_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2366] 2022-03-02 19:42:35,380 >> Num examples = 2642in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message.ut_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. ***** train metrics ***** epoch = 1.0 train_loss = 4.4789 train_runtime = 1:35:19.37 train_samples = 28538 train_samples_per_second = 4.99 train_steps_per_second = 0.104 0%| | 0/221 [00:00> Saving model checkpoint to ./ | 3/221 [00:06<08:53, 2.45s/it] argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message.ut_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2114] 2022-03-02 19:58:07,253 >> Saving model checkpoint to ./ | 3/221 [00:06<08:53, 2.45s/it] argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message.ut_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 03/02/2022 19:58:07 - INFO - datasets.metric - Removing /home/sanchit_huggingface_co/.cache/huggingface/metrics/wer/default/default_experiment-1-0.arrow ***** eval metrics ***** epoch = 1.0 eval_loss = 4.5154 eval_runtime = 0:15:31.87 eval_samples = 2642 eval_samples_per_second = 2.835 eval_steps_per_second = 0.237 [INFO|modeling_utils.py:1081] 2022-03-02 19:58:23,568 >> Model weights saved in ./pytorch_model.bin:06<08:53, 2.45s/it] argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message.ut_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220302_180214-gd4yxtv7/run-gd4yxtv7.wandb: 0%| | 32.0k/34.6M [00:00 main0214-gd4yxtv7/run-gd4yxtv7.wandb: 0%| | 32.0k/34.6M [00:00ent in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message.ut_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. return ModelInfo(**d)f.finetuned_from)formers/src/transformers/modelcard.py", line 611, in from_trainercard31, in mainule>ent in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message.ut_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. return ModelInfo(**d)f.finetuned_from)formers/src/transformers/modelcard.py", line 611, in from_trainercard31, in mainule>ent in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message.ut_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message.