0%| | 0/594 [00:00> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:50:46,670 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:50:49,561 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:50:52,198 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 0%|▏ | 1/594 [00:11<1:55:24, 11.68s/it] 0%|▏ | 1/594 [00:11<1:55:24, 11.68s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:50:54,875 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:50:57,470 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:50:59,977 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:51:02,669 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 0%|▎ | 2/594 [00:22<1:49:02, 11.05s/it] 0%|▎ | 2/594 [00:22<1:49:02, 11.05s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:51:05,412 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:51:07,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:51:10,534 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:51:13,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▍ | 3/594 [00:32<1:45:36, 10.72s/it] 1%|▍ | 3/594 [00:32<1:45:36, 10.72s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:51:15,785 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:51:18,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:51:20,906 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8238, 'learning_rate': 6.000000000000001e-08, 'epoch': 0.01} [WARNING|modeling_utils.py:388] 2022-02-28 18:51:23,451 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▌ | 4/594 [00:42<1:44:01, 10.58s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:51:26,127 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:51:28,676 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:51:31,156 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:51:33,727 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▋ | 5/594 [00:53<1:42:47, 10.47s/it] 1%|▋ | 5/594 [00:53<1:42:47, 10.47s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:51:36,325 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:51:38,859 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:51:41,371 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:51:43,903 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▊ | 6/594 [01:03<1:41:37, 10.37s/it] 1%|▊ | 6/594 [01:03<1:41:37, 10.37s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:51:46,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:51:49,006 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:51:51,554 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:51:54,061 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|▉ | 7/594 [01:13<1:40:46, 10.30s/it] 1%|▉ | 7/594 [01:13<1:40:46, 10.30s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:51:56,668 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:51:59,163 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:52:01,684 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:52:04,163 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 1%|█ | 8/594 [01:23<1:39:59, 10.24s/it] 1%|█ | 8/594 [01:23<1:39:59, 10.24s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:52:06,783 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:52:09,190 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:52:11,715 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:52:14,228 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▏ | 9/594 [01:33<1:39:17, 10.18s/it] 2%|█▏ | 9/594 [01:33<1:39:17, 10.18s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:52:16,835 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:52:19,298 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:52:21,782 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:52:24,293 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▎ | 10/594 [01:43<1:38:46, 10.15s/it] 2%|█▎ | 10/594 [01:43<1:38:46, 10.15s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:52:26,900 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:52:29,307 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:52:31,782 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:52:34,202 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▍ | 11/594 [01:53<1:37:53, 10.07s/it] 2%|█▍ | 11/594 [01:53<1:37:53, 10.07s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:52:36,744 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:52:39,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:52:41,595 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:52:44,023 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8817, 'learning_rate': 2.2e-07, 'epoch': 0.02} 2%|█▌ | 12/594 [02:03<1:36:58, 10.00s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:52:46,561 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:52:48,998 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:52:51,386 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8089, 'learning_rate': 2.4000000000000003e-07, 'epoch': 0.02} [WARNING|modeling_utils.py:388] 2022-02-28 18:52:53,790 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▊ | 13/594 [02:13<1:36:08, 9.93s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:52:56,344 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:52:58,769 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:53:01,174 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7724, 'learning_rate': 2.6e-07, 'epoch': 0.02} [WARNING|modeling_utils.py:388] 2022-02-28 18:53:03,603 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 2%|█▉ | 14/594 [02:23<1:35:37, 9.89s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:53:06,078 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:53:08,450 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:53:10,881 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.662, 'learning_rate': 2.8e-07, 'epoch': 0.03} [WARNING|modeling_utils.py:388] 2022-02-28 18:53:13,272 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██ | 15/594 [02:32<1:34:49, 9.83s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:53:15,778 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:53:18,146 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:53:20,501 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:53:22,845 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▏ | 16/594 [02:42<1:33:55, 9.75s/it] 3%|██▏ | 16/594 [02:42<1:33:55, 9.75s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:53:25,269 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:53:27,621 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:53:30,025 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8698, 'learning_rate': 3.2e-07, 'epoch': 0.03} [WARNING|modeling_utils.py:388] 2022-02-28 18:53:32,360 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▎ | 17/594 [02:51<1:33:04, 9.68s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:53:34,806 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:53:37,131 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:53:39,501 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:53:41,851 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9494, 'learning_rate': 3.4000000000000003e-07, 'epoch': 0.03} 3%|██▍ | 18/594 [03:01<1:32:22, 9.62s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:53:44,308 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:53:46,648 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:53:49,001 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:53:51,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 3%|██▌ | 19/594 [03:10<1:31:51, 9.58s/it] 3%|██▌ | 19/594 [03:10<1:31:51, 9.58s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:53:53,707 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:53:55,992 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:53:58,255 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:54:00,594 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9558, 'learning_rate': 3.8e-07, 'epoch': 0.03} 3%|██▋ | 20/594 [03:20<1:30:43, 9.48s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:54:02,936 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:54:05,235 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:54:07,495 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:54:09,801 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▊ | 21/594 [03:29<1:29:46, 9.40s/it] 4%|██▊ | 21/594 [03:29<1:29:46, 9.40s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:54:12,142 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:54:14,431 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:54:16,703 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.8373, 'learning_rate': 4.2000000000000006e-07, 'epoch': 0.04} [WARNING|modeling_utils.py:388] 2022-02-28 18:54:18,929 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|██▉ | 22/594 [03:38<1:28:50, 9.32s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:54:21,304 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:54:23,556 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:54:25,803 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:54:28,025 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███ | 23/594 [03:47<1:28:03, 9.25s/it] 4%|███ | 23/594 [03:47<1:28:03, 9.25s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:54:30,379 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:54:32,564 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:54:34,773 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.9537, 'learning_rate': 4.6000000000000004e-07, 'epoch': 0.04} [WARNING|modeling_utils.py:388] 2022-02-28 18:54:36,967 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▏ | 24/594 [03:56<1:27:00, 9.16s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:54:39,311 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:54:41,469 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:54:43,657 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7198, 'learning_rate': 4.800000000000001e-07, 'epoch': 0.04} [WARNING|modeling_utils.py:388] 2022-02-28 18:54:46,385 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▎ | 25/594 [04:05<1:27:35, 9.24s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:54:48,760 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:54:50,952 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:54:48,760 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:54:53,120 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:54:48,760 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:54:53,120 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:54:48,760 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▌ | 26/594 [04:14<1:26:24, 9.13s/it]g-point operations will not be computed-28 18:54:48,760 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 4%|███▌ | 26/594 [04:14<1:26:24, 9.13s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:54:57,527 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:54:59,658 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:54:57,527 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:55:01,836 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:54:57,527 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:55:01,836 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:54:57,527 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▋ | 27/594 [04:23<1:25:05, 9.00s/it]g-point operations will not be computed-28 18:54:57,527 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▋ | 27/594 [04:23<1:25:05, 9.00s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:55:06,201 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:55:08,382 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:55:06,201 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:55:10,554 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:55:06,201 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▊ | 28/594 [04:32<1:24:08, 8.92s/it]g-point operations will not be computed-28 18:55:06,201 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▊ | 28/594 [04:32<1:24:08, 8.92s/it]g-point operations will not be computed-28 18:55:06,201 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|███▊ | 28/594 [04:32<1:24:08, 8.92s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:55:14,966 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:55:17,067 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:55:14,966 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:55:21,321 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:55:14,966 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:55:21,321 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:55:14,966 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7345, 'learning_rate': 5.4e-07, 'epoch': 0.05} 5%|███▉ | 29/594 [04:40<1:23:00, 8.81s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:55:23,516 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:55:25,639 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:55:27,746 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:55:29,820 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████ | 30/594 [04:49<1:22:07, 8.74s/it] 5%|████ | 30/594 [04:49<1:22:07, 8.74s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:55:31,967 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████ | 30/594 [04:49<1:22:07, 8.74s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:55:31,967 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:55:36,095 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:55:31,967 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▏ | 31/594 [04:57<1:20:54, 8.62s/it]g-point operations will not be computed-28 18:55:31,967 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▏ | 31/594 [04:57<1:20:54, 8.62s/it]g-point operations will not be computed-28 18:55:31,967 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▏ | 31/594 [04:57<1:20:54, 8.62s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:55:40,355 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▏ | 31/594 [04:57<1:20:54, 8.62s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:55:40,355 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:55:44,359 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:55:40,355 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▎ | 32/594 [05:05<1:19:34, 8.49s/it]g-point operations will not be computed-28 18:55:40,355 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▎ | 32/594 [05:05<1:19:34, 8.49s/it]g-point operations will not be computed-28 18:55:40,355 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▎ | 32/594 [05:05<1:19:34, 8.49s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:55:48,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 5%|████▎ | 32/594 [05:05<1:19:34, 8.49s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:55:48,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:55:52,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:55:48,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 33/594 [05:14<1:18:43, 8.42s/it]g-point operations will not be computed-28 18:55:48,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 33/594 [05:14<1:18:43, 8.42s/it]g-point operations will not be computed-28 18:55:48,535 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 33/594 [05:14<1:18:43, 8.42s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:55:56,688 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▍ | 33/594 [05:14<1:18:43, 8.42s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:55:56,688 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:56:00,646 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:55:56,688 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 34/594 [05:22<1:17:18, 8.28s/it]g-point operations will not be computed-28 18:55:56,688 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 34/594 [05:22<1:17:18, 8.28s/it]g-point operations will not be computed-28 18:55:56,688 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 34/594 [05:22<1:17:18, 8.28s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:56:04,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▌ | 34/594 [05:22<1:17:18, 8.28s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:56:04,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:56:08,508 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:56:04,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:56:08,508 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:56:04,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 35/594 [05:29<1:15:56, 8.15s/it]g-point operations will not be computed-28 18:56:04,609 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 35/594 [05:29<1:15:56, 8.15s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:56:12,451 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▋ | 35/594 [05:29<1:15:56, 8.15s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:56:12,451 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:56:16,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:56:12,451 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:56:16,262 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:56:12,451 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 36/594 [05:37<1:14:36, 8.02s/it]g-point operations will not be computed-28 18:56:12,451 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 36/594 [05:37<1:14:36, 8.02s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:56:20,131 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▊ | 36/594 [05:37<1:14:36, 8.02s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:56:20,131 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:56:23,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:56:20,131 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 37/594 [05:45<1:12:58, 7.86s/it]g-point operations will not be computed-28 18:56:20,131 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 37/594 [05:45<1:12:58, 7.86s/it]g-point operations will not be computed-28 18:56:20,131 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|████▉ | 37/594 [05:45<1:12:58, 7.86s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:56:27,563 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:56:31,153 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:56:27,563 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:56:31,153 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:56:27,563 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|█████ | 38/594 [05:52<1:11:12, 7.68s/it]g-point operations will not be computed-28 18:56:27,563 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|█████ | 38/594 [05:52<1:11:12, 7.68s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:56:34,816 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 6%|█████ | 38/594 [05:52<1:11:12, 7.68s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:56:34,816 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:56:38,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:56:34,816 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:56:38,322 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:56:34,816 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 39/594 [05:59<1:09:33, 7.52s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:56:41,867 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▎ | 39/594 [05:59<1:09:33, 7.52s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:56:41,867 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:56:45,289 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:56:41,867 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:56:45,289 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:56:41,867 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 40/594 [06:06<1:07:48, 7.34s/it]g-point operations will not be computed-28 18:56:41,867 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▍ | 40/594 [06:06<1:07:48, 7.34s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:56:48,714 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:56:51,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:56:48,714 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:56:51,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:56:48,714 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 41/594 [06:13<1:05:22, 7.09s/it]g-point operations will not be computed-28 18:56:48,714 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▌ | 41/594 [06:13<1:05:22, 7.09s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:56:55,154 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:56:58,201 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:56:55,154 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:56:58,201 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:56:55,154 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 42/594 [06:19<1:02:54, 6.84s/it]g-point operations will not be computed-28 18:56:55,154 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▋ | 42/594 [06:19<1:02:54, 6.84s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:57:01,281 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:57:04,098 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:57:01,281 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:57:04,098 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:57:01,281 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▉ | 43/594 [06:24<59:42, 6.50s/it]g-point operations will not be computed-28 18:57:01,281 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|█████▉ | 43/594 [06:24<59:42, 6.50s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:57:06,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:57:09,368 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:57:06,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:57:09,368 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:57:06,855 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 7%|██████ | 44/594 [06:30<55:51, 6.09s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:57:11,856 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:57:14,139 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:57:11,856 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:57:14,139 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:57:11,856 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▏ | 45/594 [06:34<51:43, 5.65s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:57:16,372 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:57:18,373 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:57:16,372 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:57:18,373 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:57:16,372 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▎ | 46/594 [06:38<47:23, 5.19s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:57:20,339 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:57:22,115 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:57:20,339 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:57:22,115 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:57:20,339 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▍ | 47/594 [06:42<43:06, 4.73s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:57:23,899 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▋ | 48/594 [06:45<38:57, 4.28s/it]g-point operations will not be computed-28 18:57:23,899 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▋ | 48/594 [06:45<38:57, 4.28s/it]g-point operations will not be computed-28 18:57:23,899 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:57:28,405 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:57:27,048 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:57:28,405 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:57:27,048 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▊ | 49/594 [06:48<35:02, 3.86s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:57:29,790 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▊ | 49/594 [06:48<35:02, 3.86s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:57:29,790 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▉ | 50/594 [06:51<32:15, 3.56s/it]g-point operations will not be computed-28 18:57:29,790 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▉ | 50/594 [06:51<32:15, 3.56s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:57:34,917 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 8%|██████▉ | 50/594 [06:51<32:15, 3.56s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:57:34,917 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:57:40,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:57:34,917 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 51/594 [07:02<52:08, 5.76s/it]g-point operations will not be computed-28 18:57:34,917 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 51/594 [07:02<52:08, 5.76s/it]g-point operations will not be computed-28 18:57:34,917 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 51/594 [07:02<52:08, 5.76s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:57:45,507 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 51/594 [07:02<52:08, 5.76s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:57:45,507 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:57:50,642 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:57:45,507 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 18:57:45,507 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 18:57:45,507 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 52/594 [07:12<1:04:22, 7.13s/it]g-point operations will not be computed-28 18:57:45,507 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 52/594 [07:12<1:04:22, 7.13s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:57:55,828 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████ | 52/594 [07:12<1:04:22, 7.13s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:57:55,828 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:58:00,966 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:57:55,828 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▏ | 53/594 [07:23<1:13:01, 8.10s/it]g-point operations will not be computed-28 18:57:55,828 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▏ | 53/594 [07:23<1:13:01, 8.10s/it]g-point operations will not be computed-28 18:57:55,828 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▏ | 53/594 [07:23<1:13:01, 8.10s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:58:06,193 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▏ | 53/594 [07:23<1:13:01, 8.10s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:58:06,193 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:58:11,230 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:58:06,193 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▎ | 54/594 [07:33<1:18:36, 8.73s/it]g-point operations will not be computed-28 18:58:06,193 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▎ | 54/594 [07:33<1:18:36, 8.73s/it]g-point operations will not be computed-28 18:58:06,193 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▎ | 54/594 [07:33<1:18:36, 8.73s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:58:16,310 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▎ | 54/594 [07:33<1:18:36, 8.73s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:58:16,310 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:58:21,339 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:58:16,310 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▍ | 55/594 [07:43<1:22:06, 9.14s/it]g-point operations will not be computed-28 18:58:16,310 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▍ | 55/594 [07:43<1:22:06, 9.14s/it]g-point operations will not be computed-28 18:58:16,310 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▍ | 55/594 [07:43<1:22:06, 9.14s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:58:26,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▍ | 55/594 [07:43<1:22:06, 9.14s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:58:26,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:58:31,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:58:26,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▌ | 56/594 [07:53<1:24:17, 9.40s/it]g-point operations will not be computed-28 18:58:26,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▌ | 56/594 [07:53<1:24:17, 9.40s/it]g-point operations will not be computed-28 18:58:26,426 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▌ | 56/594 [07:53<1:24:17, 9.40s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:58:36,397 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 9%|███████▌ | 56/594 [07:53<1:24:17, 9.40s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:58:36,397 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:58:41,362 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:58:36,397 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▋ | 57/594 [08:03<1:25:42, 9.58s/it]g-point operations will not be computed-28 18:58:36,397 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▋ | 57/594 [08:03<1:25:42, 9.58s/it]g-point operations will not be computed-28 18:58:36,397 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▋ | 57/594 [08:03<1:25:42, 9.58s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:58:46,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▋ | 57/594 [08:03<1:25:42, 9.58s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:58:46,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:58:51,222 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:58:46,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:58:51,222 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:58:46,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▊ | 58/594 [08:13<1:26:22, 9.67s/it]g-point operations will not be computed-28 18:58:46,348 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▊ | 58/594 [08:13<1:26:22, 9.67s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:58:56,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▊ | 58/594 [08:13<1:26:22, 9.67s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:58:56,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:59:01,073 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:58:56,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 59/594 [08:23<1:26:38, 9.72s/it]g-point operations will not be computed-28 18:58:56,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 59/594 [08:23<1:26:38, 9.72s/it]g-point operations will not be computed-28 18:58:56,224 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 59/594 [08:23<1:26:38, 9.72s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:59:06,074 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|███████▉ | 59/594 [08:23<1:26:38, 9.72s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:59:06,074 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:59:10,994 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:59:06,074 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████ | 60/594 [08:32<1:26:52, 9.76s/it]g-point operations will not be computed-28 18:59:06,074 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████ | 60/594 [08:32<1:26:52, 9.76s/it]g-point operations will not be computed-28 18:59:06,074 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████ | 60/594 [08:32<1:26:52, 9.76s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:59:15,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████ | 60/594 [08:32<1:26:52, 9.76s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:59:15,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:59:20,642 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:59:15,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:59:20,642 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:59:15,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████▏ | 61/594 [08:42<1:26:16, 9.71s/it]g-point operations will not be computed-28 18:59:15,943 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████▏ | 61/594 [08:42<1:26:16, 9.71s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:59:25,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████▏ | 61/594 [08:42<1:26:16, 9.71s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:59:25,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:59:30,180 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:59:25,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 18:59:25,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 18:59:25,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████▎ | 62/594 [08:52<1:25:38, 9.66s/it]g-point operations will not be computed-28 18:59:25,454 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████▎ | 62/594 [08:52<1:25:38, 9.66s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:59:34,962 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 10%|████████▎ | 62/594 [08:52<1:25:38, 9.66s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:59:34,962 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:59:39,692 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:59:34,962 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▍ | 63/594 [09:01<1:25:13, 9.63s/it]g-point operations will not be computed-28 18:59:34,962 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▍ | 63/594 [09:01<1:25:13, 9.63s/it]g-point operations will not be computed-28 18:59:34,962 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▍ | 63/594 [09:01<1:25:13, 9.63s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▍ | 63/594 [09:01<1:25:13, 9.63s/it][WARNING|modeling_utils.py:388] 2022-02-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:59:49,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 18:59:49,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▌ | 64/594 [09:11<1:24:31, 9.57s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▌ | 64/594 [09:11<1:24:31, 9.57s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▌ | 64/594 [09:11<1:24:31, 9.57s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▌ | 64/594 [09:11<1:24:31, 9.57s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▊ | 65/594 [09:20<1:23:40, 9.49s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▊ | 65/594 [09:20<1:23:40, 9.49s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5554, 'learning_rate': 1.2400000000000002e-06, 'epoch': 0.11} 11%|████████▊ | 65/594 [09:20<1:23:40, 9.49s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▊ | 65/594 [09:20<1:23:40, 9.49s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 66/594 [09:29<1:23:21, 9.47s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 66/594 [09:29<1:23:21, 9.47s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4926, 'learning_rate': 1.26e-06, 'epoch': 0.11} 11%|████████▉ | 66/594 [09:29<1:23:21, 9.47s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|████████▉ | 66/594 [09:29<1:23:21, 9.47s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|█████████ | 67/594 [09:39<1:22:54, 9.44s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|█████████ | 67/594 [09:39<1:22:54, 9.44s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4971, 'learning_rate': 1.28e-06, 'epoch': 0.11} 11%|█████████ | 67/594 [09:39<1:22:54, 9.44s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|█████████ | 67/594 [09:39<1:22:54, 9.44s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|█████████▏ | 68/594 [09:48<1:22:19, 9.39s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|█████████▏ | 68/594 [09:48<1:22:19, 9.39s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4337, 'learning_rate': 1.3e-06, 'epoch': 0.11} 11%|█████████▏ | 68/594 [09:48<1:22:19, 9.39s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 11%|█████████▏ | 68/594 [09:48<1:22:19, 9.39s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▎ | 69/594 [09:57<1:21:40, 9.33s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▎ | 69/594 [09:57<1:21:40, 9.33s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5496, 'learning_rate': 1.32e-06, 'epoch': 0.12} [WARNING|modeling_utils.py:388] 2022-02-28 19:00:45,042 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:00:45,042 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▍ | 70/594 [10:06<1:21:14, 9.30s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▍ | 70/594 [10:06<1:21:14, 9.30s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▍ | 70/594 [10:06<1:21:14, 9.30s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▍ | 70/594 [10:06<1:21:14, 9.30s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▌ | 71/594 [10:15<1:20:37, 9.25s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▌ | 71/594 [10:15<1:20:37, 9.25s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6406, 'learning_rate': 1.3600000000000001e-06, 'epoch': 0.12} 12%|█████████▌ | 71/594 [10:15<1:20:37, 9.25s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▌ | 71/594 [10:15<1:20:37, 9.25s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▌ | 71/594 [10:15<1:20:37, 9.25s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▋ | 72/594 [10:25<1:19:57, 9.19s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▋ | 72/594 [10:25<1:19:57, 9.19s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▋ | 72/594 [10:25<1:19:57, 9.19s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▋ | 72/594 [10:25<1:19:57, 9.19s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▊ | 73/594 [10:33<1:18:46, 9.07s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▊ | 73/594 [10:33<1:18:46, 9.07s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3985, 'learning_rate': 1.4000000000000001e-06, 'epoch': 0.12} 12%|█████████▊ | 73/594 [10:33<1:18:46, 9.07s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▊ | 73/594 [10:33<1:18:46, 9.07s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▉ | 74/594 [10:42<1:17:45, 8.97s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▉ | 74/594 [10:42<1:17:45, 8.97s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.582, 'learning_rate': 1.42e-06, 'epoch': 0.12} 12%|█████████▉ | 74/594 [10:42<1:17:45, 8.97s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▉ | 74/594 [10:42<1:17:45, 8.97s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 12%|█████████▉ | 74/594 [10:42<1:17:45, 8.97s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 75/594 [10:51<1:18:19, 9.05s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 75/594 [10:51<1:18:19, 9.05s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 75/594 [10:51<1:18:19, 9.05s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 75/594 [10:51<1:18:19, 9.05s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████ | 75/594 [10:51<1:18:19, 9.05s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▏ | 76/594 [11:00<1:17:08, 8.93s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▏ | 76/594 [11:00<1:17:08, 8.93s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▏ | 76/594 [11:00<1:17:08, 8.93s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▏ | 76/594 [11:00<1:17:08, 8.93s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▏ | 76/594 [11:00<1:17:08, 8.93s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▎ | 77/594 [11:09<1:16:11, 8.84s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▎ | 77/594 [11:09<1:16:11, 8.84s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▎ | 77/594 [11:09<1:16:11, 8.84s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▎ | 77/594 [11:09<1:16:11, 8.84s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▌ | 78/594 [11:17<1:15:20, 8.76s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▌ | 78/594 [11:17<1:15:20, 8.76s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5355, 'learning_rate': 1.5e-06, 'epoch': 0.13} 13%|██████████▌ | 78/594 [11:17<1:15:20, 8.76s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▌ | 78/594 [11:17<1:15:20, 8.76s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▌ | 78/594 [11:17<1:15:20, 8.76s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▋ | 79/594 [11:26<1:14:30, 8.68s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▋ | 79/594 [11:26<1:14:30, 8.68s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▋ | 79/594 [11:26<1:14:30, 8.68s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▋ | 79/594 [11:26<1:14:30, 8.68s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▊ | 80/594 [11:34<1:13:31, 8.58s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▊ | 80/594 [11:34<1:13:31, 8.58s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5652, 'learning_rate': 1.54e-06, 'epoch': 0.13} 13%|██████████▊ | 80/594 [11:34<1:13:31, 8.58s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 13%|██████████▊ | 80/594 [11:34<1:13:31, 8.58s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▉ | 81/594 [11:42<1:12:39, 8.50s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▉ | 81/594 [11:42<1:12:39, 8.50s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5097, 'learning_rate': 1.56e-06, 'epoch': 0.14} 14%|██████████▉ | 81/594 [11:42<1:12:39, 8.50s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▉ | 81/594 [11:42<1:12:39, 8.50s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|██████████▉ | 81/594 [11:42<1:12:39, 8.50s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████ | 82/594 [11:51<1:11:54, 8.43s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████ | 82/594 [11:51<1:11:54, 8.43s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████ | 82/594 [11:51<1:11:54, 8.43s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████ | 82/594 [11:51<1:11:54, 8.43s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▏ | 83/594 [11:59<1:10:47, 8.31s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▏ | 83/594 [11:59<1:10:47, 8.31s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5395, 'learning_rate': 1.6000000000000001e-06, 'epoch': 0.14} 14%|███████████▏ | 83/594 [11:59<1:10:47, 8.31s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▏ | 83/594 [11:59<1:10:47, 8.31s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▏ | 83/594 [11:59<1:10:47, 8.31s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▎ | 84/594 [12:07<1:09:41, 8.20s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▎ | 84/594 [12:07<1:09:41, 8.20s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▎ | 84/594 [12:07<1:09:41, 8.20s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▎ | 84/594 [12:07<1:09:41, 8.20s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▎ | 84/594 [12:07<1:09:41, 8.20s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▍ | 85/594 [12:14<1:08:38, 8.09s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▍ | 85/594 [12:14<1:08:38, 8.09s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▍ | 85/594 [12:14<1:08:38, 8.09s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▍ | 85/594 [12:14<1:08:38, 8.09s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▍ | 85/594 [12:14<1:08:38, 8.09s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▌ | 86/594 [12:22<1:07:41, 7.99s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▌ | 86/594 [12:22<1:07:41, 7.99s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▌ | 86/594 [12:22<1:07:41, 7.99s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▌ | 86/594 [12:22<1:07:41, 7.99s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 14%|███████████▌ | 86/594 [12:22<1:07:41, 7.99s/it]g-point operations will not be computed-28 18:59:44,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▋ | 87/594 [12:30<1:06:13, 7.84s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:03:12,506 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▋ | 87/594 [12:30<1:06:13, 7.84s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:03:12,506 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▋ | 87/594 [12:30<1:06:13, 7.84s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:03:12,506 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▋ | 87/594 [12:30<1:06:13, 7.84s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:03:12,506 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▊ | 88/594 [12:37<1:04:24, 7.64s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:03:12,506 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▊ | 88/594 [12:37<1:04:24, 7.64s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:03:12,506 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|███████████▊ | 88/594 [12:37<1:04:24, 7.64s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:03:12,506 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:03:24,725 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:03:12,506 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:03:24,725 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:03:12,506 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5239, 'learning_rate': 1.72e-06, 'epoch': 0.15} [WARNING|modeling_utils.py:388] 2022-02-28 19:03:24,725 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:03:12,506 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:03:24,725 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:03:12,506 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:03:24,725 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:03:12,506 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████ | 90/594 [12:50<1:00:23, 7.19s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:03:33,012 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████ | 90/594 [12:50<1:00:23, 7.19s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:03:33,012 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████ | 90/594 [12:50<1:00:23, 7.19s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:03:33,012 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████ | 90/594 [12:50<1:00:23, 7.19s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:03:33,012 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████▌ | 91/594 [12:57<57:40, 6.88s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:03:39,025 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 15%|████████████▌ | 91/594 [12:57<57:40, 6.88s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:03:39,025 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:03:43,165 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:03:39,025 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:03:43,165 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:03:39,025 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.6208, 'learning_rate': 1.7800000000000001e-06, 'epoch': 0.15} [WARNING|modeling_utils.py:388] 2022-02-28 19:03:47,199 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:03:39,025 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:03:47,199 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:03:39,025 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|████████████▊ | 93/594 [13:07<51:19, 6.15s/it]g-point operations will not be computed-28 19:03:39,025 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:03:50,997 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:03:39,025 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:03:53,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:03:39,025 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:03:53,350 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:03:39,025 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:03:55,676 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:03:39,025 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:03:55,676 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:03:39,025 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:03:55,676 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:03:39,025 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|█████████████ | 95/594 [13:17<44:46, 5.38s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:03:58,954 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:04:00,927 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:03:58,954 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:04:00,927 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:03:58,954 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|█████████████▎ | 96/594 [13:21<41:20, 4.98s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:04:02,945 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|█████████████▍ | 97/594 [13:25<38:09, 4.61s/it]g-point operations will not be computed-28 19:04:02,945 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|█████████████▍ | 97/594 [13:25<38:09, 4.61s/it]g-point operations will not be computed-28 19:04:02,945 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|█████████████▍ | 97/594 [13:25<38:09, 4.61s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:04:06,555 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|█████████████▍ | 97/594 [13:25<38:09, 4.61s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:04:06,555 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 16%|█████████████▌ | 98/594 [13:28<34:45, 4.20s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:04:09,693 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▋ | 99/594 [13:31<31:17, 3.79s/it]g-point operations will not be computed-28 19:04:09,693 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▋ | 99/594 [13:31<31:17, 3.79s/it]g-point operations will not be computed-28 19:04:09,693 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:04:13,546 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:04:12,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:04:13,546 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:04:12,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▋ | 100/594 [13:34<29:02, 3.53s/it]g-point operations will not be computed-28 19:04:12,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▋ | 100/594 [13:34<29:02, 3.53s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▋ | 100/594 [13:34<29:02, 3.53s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:04:22,714 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:04:22,714 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▊ | 101/594 [13:44<46:29, 5.66s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▊ | 101/594 [13:44<46:29, 5.66s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▊ | 101/594 [13:44<46:29, 5.66s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▊ | 101/594 [13:44<46:29, 5.66s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▉ | 102/594 [13:54<57:35, 7.02s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▉ | 102/594 [13:54<57:35, 7.02s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3413, 'learning_rate': 1.98e-06, 'epoch': 0.17} 17%|█████████████▉ | 102/594 [13:54<57:35, 7.02s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▉ | 102/594 [13:54<57:35, 7.02s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▉ | 102/594 [13:54<57:35, 7.02s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▋ | 103/594 [14:04<1:04:47, 7.92s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▋ | 103/594 [14:04<1:04:47, 7.92s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▋ | 103/594 [14:04<1:04:47, 7.92s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 17%|█████████████▋ | 103/594 [14:04<1:04:47, 7.92s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 104/594 [14:15<1:09:55, 8.56s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 104/594 [14:15<1:09:55, 8.56s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2955, 'learning_rate': 2.02e-06, 'epoch': 0.17} 18%|█████████████▊ | 104/594 [14:15<1:09:55, 8.56s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 104/594 [14:15<1:09:55, 8.56s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▊ | 104/594 [14:15<1:09:55, 8.56s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 105/594 [14:25<1:13:27, 9.01s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 105/594 [14:25<1:13:27, 9.01s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 105/594 [14:25<1:13:27, 9.01s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 105/594 [14:25<1:13:27, 9.01s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|█████████████▉ | 105/594 [14:25<1:13:27, 9.01s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████ | 106/594 [14:35<1:15:46, 9.32s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████ | 106/594 [14:35<1:15:46, 9.32s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████ | 106/594 [14:35<1:15:46, 9.32s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████ | 106/594 [14:35<1:15:46, 9.32s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████ | 106/594 [14:35<1:15:46, 9.32s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▏ | 107/594 [14:45<1:16:57, 9.48s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▏ | 107/594 [14:45<1:16:57, 9.48s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▏ | 107/594 [14:45<1:16:57, 9.48s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▏ | 107/594 [14:45<1:16:57, 9.48s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▎ | 108/594 [14:54<1:17:26, 9.56s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▎ | 108/594 [14:54<1:17:26, 9.56s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4975, 'learning_rate': 2.1000000000000002e-06, 'epoch': 0.18} 18%|██████████████▎ | 108/594 [14:54<1:17:26, 9.56s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▎ | 108/594 [14:54<1:17:26, 9.56s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▍ | 109/594 [15:04<1:17:56, 9.64s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▍ | 109/594 [15:04<1:17:56, 9.64s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5738, 'learning_rate': 2.12e-06, 'epoch': 0.18} 18%|██████████████▍ | 109/594 [15:04<1:17:56, 9.64s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 18%|██████████████▍ | 109/594 [15:04<1:17:56, 9.64s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▋ | 110/594 [15:14<1:18:08, 9.69s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▋ | 110/594 [15:14<1:18:08, 9.69s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3586, 'learning_rate': 2.1400000000000003e-06, 'epoch': 0.18} 19%|██████████████▋ | 110/594 [15:14<1:18:08, 9.69s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▋ | 110/594 [15:14<1:18:08, 9.69s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▋ | 110/594 [15:14<1:18:08, 9.69s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▊ | 111/594 [15:24<1:17:54, 9.68s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▊ | 111/594 [15:24<1:17:54, 9.68s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▊ | 111/594 [15:24<1:17:54, 9.68s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▊ | 111/594 [15:24<1:17:54, 9.68s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▉ | 112/594 [15:33<1:17:13, 9.61s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▉ | 112/594 [15:33<1:17:13, 9.61s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4207, 'learning_rate': 2.1800000000000003e-06, 'epoch': 0.19} 19%|██████████████▉ | 112/594 [15:33<1:17:13, 9.61s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|██████████████▉ | 112/594 [15:33<1:17:13, 9.61s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████ | 113/594 [15:43<1:16:49, 9.58s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████ | 113/594 [15:43<1:16:49, 9.58s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4198, 'learning_rate': 2.2e-06, 'epoch': 0.19} 19%|███████████████ | 113/594 [15:43<1:16:49, 9.58s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████ | 113/594 [15:43<1:16:49, 9.58s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████ | 113/594 [15:43<1:16:49, 9.58s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▏ | 114/594 [15:52<1:16:15, 9.53s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▏ | 114/594 [15:52<1:16:15, 9.53s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▏ | 114/594 [15:52<1:16:15, 9.53s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▏ | 114/594 [15:52<1:16:15, 9.53s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▎ | 115/594 [16:01<1:15:44, 9.49s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▎ | 115/594 [16:01<1:15:44, 9.49s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3794, 'learning_rate': 2.24e-06, 'epoch': 0.19} 19%|███████████████▎ | 115/594 [16:01<1:15:44, 9.49s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▎ | 115/594 [16:01<1:15:44, 9.49s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 19%|███████████████▎ | 115/594 [16:01<1:15:44, 9.49s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 116/594 [16:11<1:15:07, 9.43s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 116/594 [16:11<1:15:07, 9.43s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 116/594 [16:11<1:15:07, 9.43s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 116/594 [16:11<1:15:07, 9.43s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▍ | 116/594 [16:11<1:15:07, 9.43s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▌ | 117/594 [16:20<1:14:31, 9.37s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▌ | 117/594 [16:20<1:14:31, 9.37s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▌ | 117/594 [16:20<1:14:31, 9.37s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▌ | 117/594 [16:20<1:14:31, 9.37s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▋ | 118/594 [16:29<1:13:42, 9.29s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▋ | 118/594 [16:29<1:13:42, 9.29s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2504, 'learning_rate': 2.3000000000000004e-06, 'epoch': 0.2} 20%|███████████████▋ | 118/594 [16:29<1:13:42, 9.29s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▋ | 118/594 [16:29<1:13:42, 9.29s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▋ | 118/594 [16:29<1:13:42, 9.29s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▊ | 119/594 [16:38<1:12:55, 9.21s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▊ | 119/594 [16:38<1:12:55, 9.21s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▊ | 119/594 [16:38<1:12:55, 9.21s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▊ | 119/594 [16:38<1:12:55, 9.21s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▊ | 119/594 [16:38<1:12:55, 9.21s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▉ | 120/594 [16:47<1:12:20, 9.16s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▉ | 120/594 [16:47<1:12:20, 9.16s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▉ | 120/594 [16:47<1:12:20, 9.16s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▉ | 120/594 [16:47<1:12:20, 9.16s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|███████████████▉ | 120/594 [16:47<1:12:20, 9.16s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|████████████████ | 121/594 [16:56<1:11:59, 9.13s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|████████████████ | 121/594 [16:56<1:11:59, 9.13s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|████████████████ | 121/594 [16:56<1:11:59, 9.13s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|████████████████ | 121/594 [16:56<1:11:59, 9.13s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 20%|████████████████ | 121/594 [16:56<1:11:59, 9.13s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 122/594 [17:05<1:11:08, 9.04s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 122/594 [17:05<1:11:08, 9.04s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 122/594 [17:05<1:11:08, 9.04s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▏ | 122/594 [17:05<1:11:08, 9.04s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▎ | 123/594 [17:14<1:10:34, 8.99s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▎ | 123/594 [17:14<1:10:34, 8.99s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.315, 'learning_rate': 2.4000000000000003e-06, 'epoch': 0.21} 21%|████████████████▎ | 123/594 [17:14<1:10:34, 8.99s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▎ | 123/594 [17:14<1:10:34, 8.99s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▍ | 124/594 [17:23<1:09:51, 8.92s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▍ | 124/594 [17:23<1:09:51, 8.92s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3808, 'learning_rate': 2.42e-06, 'epoch': 0.21} 21%|████████████████▍ | 124/594 [17:23<1:09:51, 8.92s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▍ | 124/594 [17:23<1:09:51, 8.92s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▌ | 125/594 [17:32<1:10:19, 9.00s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▌ | 125/594 [17:32<1:10:19, 9.00s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▌ | 125/594 [17:32<1:10:19, 9.00s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3912, 'learning_rate': 2.4400000000000004e-06, 'epoch': 0.21} 21%|████████████████▌ | 125/594 [17:32<1:10:19, 9.00s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▌ | 125/594 [17:32<1:10:19, 9.00s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▌ | 125/594 [17:32<1:10:19, 9.00s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▊ | 126/594 [17:40<1:09:38, 8.93s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▊ | 126/594 [17:40<1:09:38, 8.93s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▊ | 126/594 [17:40<1:09:38, 8.93s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▊ | 126/594 [17:40<1:09:38, 8.93s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▊ | 126/594 [17:40<1:09:38, 8.93s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▉ | 127/594 [17:49<1:08:51, 8.85s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▉ | 127/594 [17:49<1:08:51, 8.85s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▉ | 127/594 [17:49<1:08:51, 8.85s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▉ | 127/594 [17:49<1:08:51, 8.85s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 21%|████████████████▉ | 127/594 [17:49<1:08:51, 8.85s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████ | 128/594 [17:58<1:07:39, 8.71s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████ | 128/594 [17:58<1:07:39, 8.71s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████ | 128/594 [17:58<1:07:39, 8.71s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████ | 128/594 [17:58<1:07:39, 8.71s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▏ | 129/594 [18:06<1:06:59, 8.64s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▏ | 129/594 [18:06<1:06:59, 8.64s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3792, 'learning_rate': 2.52e-06, 'epoch': 0.22} 22%|█████████████████▏ | 129/594 [18:06<1:06:59, 8.64s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▏ | 129/594 [18:06<1:06:59, 8.64s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▎ | 130/594 [18:14<1:06:09, 8.56s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▎ | 130/594 [18:14<1:06:09, 8.56s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4051, 'learning_rate': 2.5400000000000002e-06, 'epoch': 0.22} 22%|█████████████████▎ | 130/594 [18:14<1:06:09, 8.56s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▎ | 130/594 [18:14<1:06:09, 8.56s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▍ | 131/594 [18:23<1:05:36, 8.50s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▍ | 131/594 [18:23<1:05:36, 8.50s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3441, 'learning_rate': 2.56e-06, 'epoch': 0.22} 22%|█████████████████▍ | 131/594 [18:23<1:05:36, 8.50s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▍ | 131/594 [18:23<1:05:36, 8.50s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▌ | 132/594 [18:31<1:04:59, 8.44s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▌ | 132/594 [18:31<1:04:59, 8.44s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2955, 'learning_rate': 2.5800000000000003e-06, 'epoch': 0.22} 22%|█████████████████▌ | 132/594 [18:31<1:04:59, 8.44s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▌ | 132/594 [18:31<1:04:59, 8.44s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▌ | 132/594 [18:31<1:04:59, 8.44s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▋ | 133/594 [18:39<1:03:39, 8.29s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▋ | 133/594 [18:39<1:03:39, 8.29s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▋ | 133/594 [18:39<1:03:39, 8.29s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▋ | 133/594 [18:39<1:03:39, 8.29s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 22%|█████████████████▋ | 133/594 [18:39<1:03:39, 8.29s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▊ | 134/594 [18:47<1:02:45, 8.19s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▊ | 134/594 [18:47<1:02:45, 8.19s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▊ | 134/594 [18:47<1:02:45, 8.19s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▊ | 134/594 [18:47<1:02:45, 8.19s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▊ | 134/594 [18:47<1:02:45, 8.19s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▉ | 135/594 [18:55<1:01:51, 8.09s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▉ | 135/594 [18:55<1:01:51, 8.09s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▉ | 135/594 [18:55<1:01:51, 8.09s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▉ | 135/594 [18:55<1:01:51, 8.09s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|█████████████████▉ | 135/594 [18:55<1:01:51, 8.09s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████ | 136/594 [19:02<1:00:48, 7.97s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████ | 136/594 [19:02<1:00:48, 7.97s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████ | 136/594 [19:02<1:00:48, 7.97s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:09:50,908 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:09:50,908 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3315, 'learning_rate': 2.68e-06, 'epoch': 0.23} [WARNING|modeling_utils.py:388] 2022-02-28 19:09:50,908 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:09:50,908 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:09:50,908 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▊ | 138/594 [19:17<58:30, 7.70s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▊ | 138/594 [19:17<58:30, 7.70s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▊ | 138/594 [19:17<58:30, 7.70s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▊ | 138/594 [19:17<58:30, 7.70s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▊ | 138/594 [19:17<58:30, 7.70s/it]g-point operations will not be computed-28 19:04:17,435 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▉ | 139/594 [19:25<57:08, 7.53s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:10:07,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▉ | 139/594 [19:25<57:08, 7.53s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:10:07,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▉ | 139/594 [19:25<57:08, 7.53s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:10:07,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 23%|██████████████████▉ | 139/594 [19:25<57:08, 7.53s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:10:07,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████ | 140/594 [19:31<55:44, 7.37s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:10:07,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████ | 140/594 [19:31<55:44, 7.37s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:10:07,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:10:17,434 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:07,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▏ | 141/594 [19:38<53:46, 7.12s/it]g-point operations will not be computed-28 19:10:07,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▏ | 141/594 [19:38<53:46, 7.12s/it]g-point operations will not be computed-28 19:10:07,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.453, 'learning_rate': 2.7600000000000003e-06, 'epoch': 0.24} 24%|███████████████████▏ | 141/594 [19:38<53:46, 7.12s/it]g-point operations will not be computed-28 19:10:07,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▏ | 141/594 [19:38<53:46, 7.12s/it]g-point operations will not be computed-28 19:10:07,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:10:25,095 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:07,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:10:25,095 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:07,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:10:29,475 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:07,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▌ | 143/594 [19:50<48:46, 6.49s/it]g-point operations will not be computed-28 19:10:07,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▌ | 143/594 [19:50<48:46, 6.49s/it]g-point operations will not be computed-28 19:10:07,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:10:33,468 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:07,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:10:33,468 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:07,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:10:33,468 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:07,338 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▋ | 144/594 [19:55<45:19, 6.04s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:10:37,143 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:10:39,431 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:37,143 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:10:39,431 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:37,143 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▊ | 145/594 [20:00<42:16, 5.65s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▊ | 145/594 [20:00<42:16, 5.65s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 24%|███████████████████▊ | 145/594 [20:00<42:16, 5.65s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:10:44,819 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:10:46,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:10:46,755 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:10:48,498 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:10:51,728 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:10:51,728 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:10:53,232 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:10:53,232 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:10:54,567 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:10:57,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:10:57,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7168, 'learning_rate': 2.9400000000000002e-06, 'epoch': 0.25} [WARNING|modeling_utils.py:388] 2022-02-28 19:10:57,615 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:11:03,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:11:03,136 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████▌ | 151/594 [20:27<42:27, 5.75s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████▌ | 151/594 [20:27<42:27, 5.75s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3009, 'learning_rate': 2.96e-06, 'epoch': 0.25} 25%|████████████████████▌ | 151/594 [20:27<42:27, 5.75s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 25%|████████████████████▌ | 151/594 [20:27<42:27, 5.75s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▋ | 152/594 [20:38<52:20, 7.11s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▋ | 152/594 [20:38<52:20, 7.11s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2353, 'learning_rate': 2.9800000000000003e-06, 'epoch': 0.26} 26%|████████████████████▋ | 152/594 [20:38<52:20, 7.11s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▋ | 152/594 [20:38<52:20, 7.11s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▋ | 152/594 [20:38<52:20, 7.11s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▋ | 152/594 [20:38<52:20, 7.11s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2273, 'learning_rate': 3e-06, 'epoch': 0.26} 26%|████████████████████▋ | 152/594 [20:38<52:20, 7.11s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▋ | 152/594 [20:38<52:20, 7.11s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▋ | 152/594 [20:38<52:20, 7.11s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▍ | 154/594 [20:58<1:02:46, 8.56s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▍ | 154/594 [20:58<1:02:46, 8.56s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1507, 'learning_rate': 3.0200000000000003e-06, 'epoch': 0.26} 26%|████████████████████▍ | 154/594 [20:58<1:02:46, 8.56s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▍ | 154/594 [20:58<1:02:46, 8.56s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▌ | 155/594 [21:08<1:05:40, 8.98s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▌ | 155/594 [21:08<1:05:40, 8.98s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1878, 'learning_rate': 3.04e-06, 'epoch': 0.26} 26%|████████████████████▌ | 155/594 [21:08<1:05:40, 8.98s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▌ | 155/594 [21:08<1:05:40, 8.98s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▋ | 156/594 [21:17<1:07:28, 9.24s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▋ | 156/594 [21:17<1:07:28, 9.24s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1888, 'learning_rate': 3.0600000000000003e-06, 'epoch': 0.26} 26%|████████████████████▋ | 156/594 [21:17<1:07:28, 9.24s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▋ | 156/594 [21:17<1:07:28, 9.24s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▋ | 156/594 [21:17<1:07:28, 9.24s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▉ | 157/594 [21:27<1:08:24, 9.39s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▉ | 157/594 [21:27<1:08:24, 9.39s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▉ | 157/594 [21:27<1:08:24, 9.39s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 26%|████████████████████▉ | 157/594 [21:27<1:08:24, 9.39s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████ | 158/594 [21:37<1:08:37, 9.44s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████ | 158/594 [21:37<1:08:37, 9.44s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1796, 'learning_rate': 3.1000000000000004e-06, 'epoch': 0.27} 27%|█████████████████████ | 158/594 [21:37<1:08:37, 9.44s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████ | 158/594 [21:37<1:08:37, 9.44s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▏ | 159/594 [21:46<1:09:09, 9.54s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▏ | 159/594 [21:46<1:09:09, 9.54s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1439, 'learning_rate': 3.12e-06, 'epoch': 0.27} 27%|█████████████████████▏ | 159/594 [21:46<1:09:09, 9.54s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▏ | 159/594 [21:46<1:09:09, 9.54s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▎ | 160/594 [21:56<1:09:16, 9.58s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▎ | 160/594 [21:56<1:09:16, 9.58s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2612, 'learning_rate': 3.1400000000000004e-06, 'epoch': 0.27} 27%|█████████████████████▎ | 160/594 [21:56<1:09:16, 9.58s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▎ | 160/594 [21:56<1:09:16, 9.58s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3082, 'learning_rate': 3.1600000000000002e-06, 'epoch': 0.27} g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▌ | 162/594 [22:15<1:08:53, 9.57s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▌ | 162/594 [22:15<1:08:53, 9.57s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▌ | 162/594 [22:15<1:08:53, 9.57s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▌ | 162/594 [22:15<1:08:53, 9.57s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▋ | 163/594 [22:25<1:08:33, 9.54s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▋ | 163/594 [22:25<1:08:33, 9.54s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2179, 'learning_rate': 3.2000000000000003e-06, 'epoch': 0.27} 27%|█████████████████████▋ | 163/594 [22:25<1:08:33, 9.54s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▋ | 163/594 [22:25<1:08:33, 9.54s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 27%|█████████████████████▋ | 163/594 [22:25<1:08:33, 9.54s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▊ | 164/594 [22:34<1:07:56, 9.48s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▊ | 164/594 [22:34<1:07:56, 9.48s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▊ | 164/594 [22:34<1:07:56, 9.48s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▊ | 164/594 [22:34<1:07:56, 9.48s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▊ | 164/594 [22:34<1:07:56, 9.48s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▉ | 165/594 [22:43<1:07:16, 9.41s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▉ | 165/594 [22:43<1:07:16, 9.41s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▉ | 165/594 [22:43<1:07:16, 9.41s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|█████████████████████▉ | 165/594 [22:43<1:07:16, 9.41s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████ | 166/594 [22:53<1:06:55, 9.38s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████ | 166/594 [22:53<1:06:55, 9.38s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3925, 'learning_rate': 3.2600000000000006e-06, 'epoch': 0.28} 28%|██████████████████████ | 166/594 [22:53<1:06:55, 9.38s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████ | 166/594 [22:53<1:06:55, 9.38s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████ | 166/594 [22:53<1:06:55, 9.38s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████ | 166/594 [22:53<1:06:55, 9.38s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2303, 'learning_rate': 3.2800000000000004e-06, 'epoch': 0.28} 28%|██████████████████████ | 166/594 [22:53<1:06:55, 9.38s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████ | 166/594 [22:53<1:06:55, 9.38s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████ | 166/594 [22:53<1:06:55, 9.38s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▎ | 168/594 [23:11<1:05:55, 9.29s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▎ | 168/594 [23:11<1:05:55, 9.29s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4668, 'learning_rate': 3.3000000000000006e-06, 'epoch': 0.28} 28%|██████████████████████▎ | 168/594 [23:11<1:05:55, 9.29s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▎ | 168/594 [23:11<1:05:55, 9.29s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▍ | 169/594 [23:20<1:05:32, 9.25s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▍ | 169/594 [23:20<1:05:32, 9.25s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2762, 'learning_rate': 3.3200000000000004e-06, 'epoch': 0.28} 28%|██████████████████████▍ | 169/594 [23:20<1:05:32, 9.25s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 28%|██████████████████████▍ | 169/594 [23:20<1:05:32, 9.25s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▌ | 170/594 [23:29<1:05:07, 9.22s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▌ | 170/594 [23:29<1:05:07, 9.22s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3279, 'learning_rate': 3.3400000000000006e-06, 'epoch': 0.29} 29%|██████████████████████▌ | 170/594 [23:29<1:05:07, 9.22s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▌ | 170/594 [23:29<1:05:07, 9.22s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▋ | 171/594 [23:38<1:04:28, 9.15s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|██████████████████████▋ | 171/594 [23:38<1:04:28, 9.15s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1924, 'learning_rate': 3.3600000000000004e-06, 'epoch': 0.29} 29%|██████████████████████▋ | 171/594 [23:38<1:04:28, 9.15s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:14:28,192 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:14:28,192 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1893, 'learning_rate': 3.3800000000000007e-06, 'epoch': 0.29} [WARNING|modeling_utils.py:388] 2022-02-28 19:14:28,192 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:14:28,192 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:14:28,192 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████ | 173/594 [23:56<1:03:24, 9.04s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████ | 173/594 [23:56<1:03:24, 9.04s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████ | 173/594 [23:56<1:03:24, 9.04s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████ | 173/594 [23:56<1:03:24, 9.04s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████ | 173/594 [23:56<1:03:24, 9.04s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▏ | 174/594 [24:05<1:02:48, 8.97s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▏ | 174/594 [24:05<1:02:48, 8.97s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▏ | 174/594 [24:05<1:02:48, 8.97s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▏ | 174/594 [24:05<1:02:48, 8.97s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▏ | 174/594 [24:05<1:02:48, 8.97s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▎ | 175/594 [24:14<1:03:18, 9.07s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▎ | 175/594 [24:14<1:03:18, 9.07s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▎ | 175/594 [24:14<1:03:18, 9.07s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▎ | 175/594 [24:14<1:03:18, 9.07s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 29%|███████████████████████▎ | 175/594 [24:14<1:03:18, 9.07s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▍ | 176/594 [24:23<1:02:21, 8.95s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▍ | 176/594 [24:23<1:02:21, 8.95s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▍ | 176/594 [24:23<1:02:21, 8.95s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▍ | 176/594 [24:23<1:02:21, 8.95s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▍ | 176/594 [24:23<1:02:21, 8.95s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▌ | 177/594 [24:32<1:01:39, 8.87s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▌ | 177/594 [24:32<1:01:39, 8.87s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▌ | 177/594 [24:32<1:01:39, 8.87s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▌ | 177/594 [24:32<1:01:39, 8.87s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▋ | 178/594 [24:40<1:00:47, 8.77s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▋ | 178/594 [24:40<1:00:47, 8.77s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:15:25,542 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:15:25,542 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▊ | 179/594 [24:49<1:00:14, 8.71s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▊ | 179/594 [24:49<1:00:14, 8.71s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3358, 'learning_rate': 3.52e-06, 'epoch': 0.3} 30%|███████████████████████▊ | 179/594 [24:49<1:00:14, 8.71s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|███████████████████████▊ | 179/594 [24:49<1:00:14, 8.71s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▌ | 180/594 [24:57<59:33, 8.63s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▌ | 180/594 [24:57<59:33, 8.63s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1873, 'learning_rate': 3.54e-06, 'epoch': 0.3} 30%|████████████████████████▌ | 180/594 [24:57<59:33, 8.63s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▌ | 180/594 [24:57<59:33, 8.63s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▋ | 181/594 [25:06<58:48, 8.54s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▋ | 181/594 [25:06<58:48, 8.54s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2644, 'learning_rate': 3.5600000000000002e-06, 'epoch': 0.3} 30%|████████████████████████▋ | 181/594 [25:06<58:48, 8.54s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 30%|████████████████████████▋ | 181/594 [25:06<58:48, 8.54s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▊ | 182/594 [25:14<58:02, 8.45s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▊ | 182/594 [25:14<58:02, 8.45s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2642, 'learning_rate': 3.58e-06, 'epoch': 0.31} 31%|████████████████████████▊ | 182/594 [25:14<58:02, 8.45s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▊ | 182/594 [25:14<58:02, 8.45s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▊ | 182/594 [25:14<58:02, 8.45s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▉ | 183/594 [25:22<57:13, 8.35s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▉ | 183/594 [25:22<57:13, 8.35s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▉ | 183/594 [25:22<57:13, 8.35s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▉ | 183/594 [25:22<57:13, 8.35s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|████████████████████████▉ | 183/594 [25:22<57:13, 8.35s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████ | 184/594 [25:30<56:02, 8.20s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████ | 184/594 [25:30<56:02, 8.20s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████ | 184/594 [25:30<56:02, 8.20s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████ | 184/594 [25:30<56:02, 8.20s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████ | 184/594 [25:30<56:02, 8.20s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▏ | 185/594 [25:37<54:54, 8.05s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▏ | 185/594 [25:37<54:54, 8.05s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▏ | 185/594 [25:37<54:54, 8.05s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▏ | 185/594 [25:37<54:54, 8.05s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▏ | 185/594 [25:37<54:54, 8.05s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▎ | 186/594 [25:45<53:51, 7.92s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▎ | 186/594 [25:45<53:51, 7.92s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:16:31,678 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:16:31,678 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▌ | 187/594 [25:53<52:50, 7.79s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▌ | 187/594 [25:53<52:50, 7.79s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▌ | 187/594 [25:53<52:50, 7.79s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▌ | 187/594 [25:53<52:50, 7.79s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 31%|█████████████████████████▌ | 187/594 [25:53<52:50, 7.79s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|█████████████████████████▋ | 188/594 [26:00<51:34, 7.62s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|█████████████████████████▋ | 188/594 [26:00<51:34, 7.62s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|█████████████████████████▋ | 188/594 [26:00<51:34, 7.62s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:16:47,783 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:16:47,783 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5345, 'learning_rate': 3.7200000000000004e-06, 'epoch': 0.32} [WARNING|modeling_utils.py:388] 2022-02-28 19:16:47,783 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:16:47,783 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:16:47,783 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|█████████████████████████▉ | 190/594 [26:14<48:44, 7.24s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:16:57,745 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:16:57,745 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:16:57,745 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|██████████████████████████ | 191/594 [26:20<46:38, 6.94s/it]g-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:03,863 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:03,863 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:03,863 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:10:41,768 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|██████████████████████████▏ | 192/594 [26:26<44:28, 6.64s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|██████████████████████████▏ | 192/594 [26:26<44:28, 6.64s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 32%|██████████████████████████▏ | 192/594 [26:26<44:28, 6.64s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:12,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:12,158 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:16,085 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:16,085 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 33%|██████████████████████████▍ | 194/594 [26:36<39:36, 5.94s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:19,773 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:22,076 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:22,076 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:24,311 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:26,291 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:26,291 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:28,275 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:28,275 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:29,991 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:31,718 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:31,718 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:34,727 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:34,727 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:35,973 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:38,840 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:38,840 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.7293, 'learning_rate': 3.94e-06, 'epoch': 0.34} [WARNING|modeling_utils.py:388] 2022-02-28 19:17:38,840 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:44,280 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:44,280 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:17:44,280 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▍ | 201/594 [27:09<37:14, 5.69s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▍ | 201/594 [27:09<37:14, 5.69s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▍ | 201/594 [27:09<37:14, 5.69s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▍ | 201/594 [27:09<37:14, 5.69s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▍ | 201/594 [27:09<37:14, 5.69s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▌ | 202/594 [27:19<46:06, 7.06s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▌ | 202/594 [27:19<46:06, 7.06s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▌ | 202/594 [27:19<46:06, 7.06s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▌ | 202/594 [27:19<46:06, 7.06s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▌ | 202/594 [27:19<46:06, 7.06s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▋ | 203/594 [27:29<51:49, 7.95s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▋ | 203/594 [27:29<51:49, 7.95s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▋ | 203/594 [27:29<51:49, 7.95s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▋ | 203/594 [27:29<51:49, 7.95s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▋ | 203/594 [27:29<51:49, 7.95s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▊ | 204/594 [27:39<55:39, 8.56s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▊ | 204/594 [27:39<55:39, 8.56s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▊ | 204/594 [27:39<55:39, 8.56s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 34%|███████████████████████████▊ | 204/594 [27:39<55:39, 8.56s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▉ | 205/594 [27:49<58:07, 8.97s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▉ | 205/594 [27:49<58:07, 8.97s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2201, 'learning_rate': 4.04e-06, 'epoch': 0.34} 35%|███████████████████████████▉ | 205/594 [27:49<58:07, 8.97s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▉ | 205/594 [27:49<58:07, 8.97s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|████████████████████████████ | 206/594 [27:59<59:36, 9.22s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|████████████████████████████ | 206/594 [27:59<59:36, 9.22s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1575, 'learning_rate': 4.060000000000001e-06, 'epoch': 0.35} 35%|████████████████████████████ | 206/594 [27:59<59:36, 9.22s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|████████████████████████████ | 206/594 [27:59<59:36, 9.22s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▌ | 207/594 [28:08<1:00:24, 9.37s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▌ | 207/594 [28:08<1:00:24, 9.37s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2356, 'learning_rate': 4.08e-06, 'epoch': 0.35} 35%|███████████████████████████▌ | 207/594 [28:08<1:00:24, 9.37s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▌ | 207/594 [28:08<1:00:24, 9.37s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▌ | 207/594 [28:08<1:00:24, 9.37s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▋ | 208/594 [28:18<1:01:06, 9.50s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▋ | 208/594 [28:18<1:01:06, 9.50s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▋ | 208/594 [28:18<1:01:06, 9.50s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▋ | 208/594 [28:18<1:01:06, 9.50s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▊ | 209/594 [28:28<1:01:32, 9.59s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▊ | 209/594 [28:28<1:01:32, 9.59s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2149, 'learning_rate': 4.12e-06, 'epoch': 0.35} 35%|███████████████████████████▊ | 209/594 [28:28<1:01:32, 9.59s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▊ | 209/594 [28:28<1:01:32, 9.59s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▉ | 210/594 [28:38<1:01:31, 9.61s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▉ | 210/594 [28:38<1:01:31, 9.61s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3318, 'learning_rate': 4.14e-06, 'epoch': 0.35} 35%|███████████████████████████▉ | 210/594 [28:38<1:01:31, 9.61s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 35%|███████████████████████████▉ | 210/594 [28:38<1:01:31, 9.61s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████ | 211/594 [28:47<1:01:16, 9.60s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████ | 211/594 [28:47<1:01:16, 9.60s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.299, 'learning_rate': 4.16e-06, 'epoch': 0.35} 36%|████████████████████████████ | 211/594 [28:47<1:01:16, 9.60s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████ | 211/594 [28:47<1:01:16, 9.60s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▏ | 212/594 [28:57<1:00:50, 9.56s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▏ | 212/594 [28:57<1:00:50, 9.56s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2981, 'learning_rate': 4.18e-06, 'epoch': 0.36} 36%|████████████████████████████▏ | 212/594 [28:57<1:00:50, 9.56s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▏ | 212/594 [28:57<1:00:50, 9.56s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▏ | 212/594 [28:57<1:00:50, 9.56s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▎ | 213/594 [29:06<1:00:25, 9.51s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▎ | 213/594 [29:06<1:00:25, 9.51s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▎ | 213/594 [29:06<1:00:25, 9.51s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▎ | 213/594 [29:06<1:00:25, 9.51s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▍ | 214/594 [29:15<1:00:04, 9.49s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▍ | 214/594 [29:15<1:00:04, 9.49s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.353, 'learning_rate': 4.22e-06, 'epoch': 0.36} 36%|████████████████████████████▍ | 214/594 [29:15<1:00:04, 9.49s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▍ | 214/594 [29:15<1:00:04, 9.49s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|████████████████████████████▍ | 214/594 [29:15<1:00:04, 9.49s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|█████████████████████████████▎ | 215/594 [29:25<59:47, 9.47s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|█████████████████████████████▎ | 215/594 [29:25<59:47, 9.47s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|█████████████████████████████▎ | 215/594 [29:25<59:47, 9.47s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|█████████████████████████████▎ | 215/594 [29:25<59:47, 9.47s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|█████████████████████████████▎ | 215/594 [29:25<59:47, 9.47s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|█████████████████████████████▍ | 216/594 [29:34<59:20, 9.42s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|█████████████████████████████▍ | 216/594 [29:34<59:20, 9.42s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|█████████████████████████████▍ | 216/594 [29:34<59:20, 9.42s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|█████████████████████████████▍ | 216/594 [29:34<59:20, 9.42s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 36%|█████████████████████████████▍ | 216/594 [29:34<59:20, 9.42s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▌ | 217/594 [29:43<58:44, 9.35s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▌ | 217/594 [29:43<58:44, 9.35s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▌ | 217/594 [29:43<58:44, 9.35s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▌ | 217/594 [29:43<58:44, 9.35s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▌ | 217/594 [29:43<58:44, 9.35s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▋ | 218/594 [29:53<58:27, 9.33s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▋ | 218/594 [29:53<58:27, 9.33s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▋ | 218/594 [29:53<58:27, 9.33s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▋ | 218/594 [29:53<58:27, 9.33s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▊ | 219/594 [30:02<58:05, 9.30s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▊ | 219/594 [30:02<58:05, 9.30s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2177, 'learning_rate': 4.32e-06, 'epoch': 0.37} 37%|█████████████████████████████▊ | 219/594 [30:02<58:05, 9.30s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▊ | 219/594 [30:02<58:05, 9.30s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|█████████████████████████████▊ | 219/594 [30:02<58:05, 9.30s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████ | 220/594 [30:11<57:43, 9.26s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████ | 220/594 [30:11<57:43, 9.26s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████ | 220/594 [30:11<57:43, 9.26s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████ | 220/594 [30:11<57:43, 9.26s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████ | 220/594 [30:11<57:43, 9.26s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████▏ | 221/594 [30:20<57:05, 9.18s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████▏ | 221/594 [30:20<57:05, 9.18s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████▏ | 221/594 [30:20<57:05, 9.18s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████▏ | 221/594 [30:20<57:05, 9.18s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████▏ | 221/594 [30:20<57:05, 9.18s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 37%|██████████████████████████████▏ | 221/594 [30:20<57:05, 9.18s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3275, 'learning_rate': 4.38e-06, 'epoch': 0.37} [WARNING|modeling_utils.py:388] 2022-02-28 19:21:16,620 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:21:16,620 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▍ | 223/594 [30:38<55:54, 9.04s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▍ | 223/594 [30:38<55:54, 9.04s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▍ | 223/594 [30:38<55:54, 9.04s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▍ | 223/594 [30:38<55:54, 9.04s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▍ | 223/594 [30:38<55:54, 9.04s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▌ | 224/594 [30:47<55:11, 8.95s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▌ | 224/594 [30:47<55:11, 8.95s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▌ | 224/594 [30:47<55:11, 8.95s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▌ | 224/594 [30:47<55:11, 8.95s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▌ | 224/594 [30:47<55:11, 8.95s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▋ | 225/594 [30:56<55:26, 9.02s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▋ | 225/594 [30:56<55:26, 9.02s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▋ | 225/594 [30:56<55:26, 9.02s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▋ | 225/594 [30:56<55:26, 9.02s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▋ | 225/594 [30:56<55:26, 9.02s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▊ | 226/594 [31:05<54:49, 8.94s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▊ | 226/594 [31:05<54:49, 8.94s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▊ | 226/594 [31:05<54:49, 8.94s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▊ | 226/594 [31:05<54:49, 8.94s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▊ | 226/594 [31:05<54:49, 8.94s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▉ | 227/594 [31:13<54:02, 8.83s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▉ | 227/594 [31:13<54:02, 8.83s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▉ | 227/594 [31:13<54:02, 8.83s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|██████████████████████████████▉ | 227/594 [31:13<54:02, 8.83s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|███████████████████████████████ | 228/594 [31:22<53:23, 8.75s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|███████████████████████████████ | 228/594 [31:22<53:23, 8.75s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2403, 'learning_rate': 4.5e-06, 'epoch': 0.38} 38%|███████████████████████████████ | 228/594 [31:22<53:23, 8.75s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 38%|███████████████████████████████ | 228/594 [31:22<53:23, 8.75s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3736, 'learning_rate': 4.520000000000001e-06, 'epoch': 0.39} g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▎ | 230/594 [31:38<51:58, 8.57s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▎ | 230/594 [31:38<51:58, 8.57s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3332, 'learning_rate': 4.540000000000001e-06, 'epoch': 0.39} 39%|███████████████████████████████▎ | 230/594 [31:38<51:58, 8.57s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▎ | 230/594 [31:38<51:58, 8.57s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▌ | 231/594 [31:47<51:09, 8.46s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▌ | 231/594 [31:47<51:09, 8.46s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2064, 'learning_rate': 4.56e-06, 'epoch': 0.39} 39%|███████████████████████████████▌ | 231/594 [31:47<51:09, 8.46s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▌ | 231/594 [31:47<51:09, 8.46s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▋ | 232/594 [31:55<50:29, 8.37s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▋ | 232/594 [31:55<50:29, 8.37s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2761, 'learning_rate': 4.58e-06, 'epoch': 0.39} 39%|███████████████████████████████▋ | 232/594 [31:55<50:29, 8.37s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▋ | 232/594 [31:55<50:29, 8.37s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▊ | 233/594 [32:03<49:56, 8.30s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▊ | 233/594 [32:03<49:56, 8.30s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3082, 'learning_rate': 4.600000000000001e-06, 'epoch': 0.39} 39%|███████████████████████████████▊ | 233/594 [32:03<49:56, 8.30s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▊ | 233/594 [32:03<49:56, 8.30s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▉ | 234/594 [32:11<49:19, 8.22s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▉ | 234/594 [32:11<49:19, 8.22s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2818, 'learning_rate': 4.620000000000001e-06, 'epoch': 0.39} 39%|███████████████████████████████▉ | 234/594 [32:11<49:19, 8.22s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 39%|███████████████████████████████▉ | 234/594 [32:11<49:19, 8.22s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████ | 235/594 [32:19<48:17, 8.07s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████ | 235/594 [32:19<48:17, 8.07s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4403, 'learning_rate': 4.6400000000000005e-06, 'epoch': 0.4} 40%|████████████████████████████████ | 235/594 [32:19<48:17, 8.07s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████ | 235/594 [32:19<48:17, 8.07s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▏ | 236/594 [32:26<47:11, 7.91s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▏ | 236/594 [32:26<47:11, 7.91s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2043, 'learning_rate': 4.66e-06, 'epoch': 0.4} [WARNING|modeling_utils.py:388] 2022-02-28 19:23:12,832 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▎ | 237/594 [32:34<46:13, 7.77s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▎ | 237/594 [32:34<46:13, 7.77s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2007, 'learning_rate': 4.680000000000001e-06, 'epoch': 0.4} 40%|████████████████████████████████▎ | 237/594 [32:34<46:13, 7.77s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▎ | 237/594 [32:34<46:13, 7.77s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▍ | 238/594 [32:41<45:03, 7.59s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▍ | 238/594 [32:41<45:03, 7.59s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2285, 'learning_rate': 4.7e-06, 'epoch': 0.4} 40%|████████████████████████████████▍ | 238/594 [32:41<45:03, 7.59s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:23:28,927 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:23:28,927 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1442, 'learning_rate': 4.7200000000000005e-06, 'epoch': 0.4} [WARNING|modeling_utils.py:388] 2022-02-28 19:23:28,927 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:23:28,927 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:23:28,927 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 40%|████████████████████████████████▋ | 240/594 [32:55<42:41, 7.24s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:23:38,893 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:23:38,893 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:23:38,893 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|████████████████████████████████▊ | 241/594 [33:01<40:46, 6.93s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:23:45,039 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:23:45,039 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:23:45,039 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|█████████████████████████████████ | 242/594 [33:07<39:04, 6.66s/it]g-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:23:50,904 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:23:50,904 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:23:50,904 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:17:08,204 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|█████████████████████████████████▏ | 243/594 [33:13<37:19, 6.38s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|█████████████████████████████████▏ | 243/594 [33:13<37:19, 6.38s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 41%|█████████████████████████████████▏ | 243/594 [33:13<37:19, 6.38s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:23:58,884 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:24:01,330 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:24:01,330 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:24:03,597 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:24:05,801 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:24:05,801 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:24:07,775 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:24:09,761 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:24:09,761 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:24:11,524 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:24:13,256 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:24:13,256 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:24:14,787 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:24:14,787 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:24:17,647 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:24:18,975 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:24:18,975 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:24:20,614 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:24:20,614 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:24:26,074 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:24:26,074 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:24:26,074 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 42%|██████████████████████████████████▏ | 251/594 [33:50<32:32, 5.69s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 42%|██████████████████████████████████▏ | 251/594 [33:50<32:32, 5.69s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 42%|██████████████████████████████████▏ | 251/594 [33:50<32:32, 5.69s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 42%|██████████████████████████████████▏ | 251/594 [33:50<32:32, 5.69s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 42%|██████████████████████████████████▏ | 251/594 [33:50<32:32, 5.69s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 42%|██████████████████████████████████▎ | 252/594 [34:00<40:05, 7.03s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 42%|██████████████████████████████████▎ | 252/594 [34:00<40:05, 7.03s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 42%|██████████████████████████████████▎ | 252/594 [34:00<40:05, 7.03s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 42%|██████████████████████████████████▎ | 252/594 [34:00<40:05, 7.03s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 42%|██████████████████████████████████▎ | 252/594 [34:00<40:05, 7.03s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▌ | 253/594 [34:10<44:53, 7.90s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▌ | 253/594 [34:10<44:53, 7.90s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▌ | 253/594 [34:10<44:53, 7.90s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▌ | 253/594 [34:10<44:53, 7.90s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▌ | 253/594 [34:10<44:53, 7.90s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▋ | 254/594 [34:20<48:13, 8.51s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▋ | 254/594 [34:20<48:13, 8.51s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▋ | 254/594 [34:20<48:13, 8.51s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▋ | 254/594 [34:20<48:13, 8.51s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▊ | 255/594 [34:30<50:21, 8.91s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▊ | 255/594 [34:30<50:21, 8.91s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2932, 'learning_rate': 5.04e-06, 'epoch': 0.43} 43%|██████████████████████████████████▊ | 255/594 [34:30<50:21, 8.91s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▊ | 255/594 [34:30<50:21, 8.91s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▉ | 256/594 [34:40<51:40, 9.17s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▉ | 256/594 [34:40<51:40, 9.17s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2446, 'learning_rate': 5.060000000000001e-06, 'epoch': 0.43} 43%|██████████████████████████████████▉ | 256/594 [34:40<51:40, 9.17s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▉ | 256/594 [34:40<51:40, 9.17s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|██████████████████████████████████▉ | 256/594 [34:40<51:40, 9.17s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|███████████████████████████████████ | 257/594 [34:50<52:25, 9.33s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|███████████████████████████████████ | 257/594 [34:50<52:25, 9.33s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|███████████████████████████████████ | 257/594 [34:50<52:25, 9.33s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|███████████████████████████████████ | 257/594 [34:50<52:25, 9.33s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|███████████████████████████████████▏ | 258/594 [34:59<53:02, 9.47s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|███████████████████████████████████▏ | 258/594 [34:59<53:02, 9.47s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.203, 'learning_rate': 5.1e-06, 'epoch': 0.43} 43%|███████████████████████████████████▏ | 258/594 [34:59<53:02, 9.47s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 43%|███████████████████████████████████▏ | 258/594 [34:59<53:02, 9.47s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▎ | 259/594 [35:09<53:24, 9.56s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▎ | 259/594 [35:09<53:24, 9.56s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2629, 'learning_rate': 5.12e-06, 'epoch': 0.44} 44%|███████████████████████████████████▎ | 259/594 [35:09<53:24, 9.56s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▎ | 259/594 [35:09<53:24, 9.56s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▎ | 259/594 [35:09<53:24, 9.56s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▍ | 260/594 [35:19<53:05, 9.54s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▍ | 260/594 [35:19<53:05, 9.54s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▍ | 260/594 [35:19<53:05, 9.54s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▍ | 260/594 [35:19<53:05, 9.54s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▍ | 260/594 [35:19<53:05, 9.54s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▌ | 261/594 [35:28<52:38, 9.48s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▌ | 261/594 [35:28<52:38, 9.48s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▌ | 261/594 [35:28<52:38, 9.48s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▌ | 261/594 [35:28<52:38, 9.48s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▌ | 261/594 [35:28<52:38, 9.48s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▋ | 262/594 [35:37<52:10, 9.43s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▋ | 262/594 [35:37<52:10, 9.43s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▋ | 262/594 [35:37<52:10, 9.43s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▋ | 262/594 [35:37<52:10, 9.43s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▋ | 262/594 [35:37<52:10, 9.43s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▊ | 263/594 [35:47<51:58, 9.42s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▊ | 263/594 [35:47<51:58, 9.42s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▊ | 263/594 [35:47<51:58, 9.42s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▊ | 263/594 [35:47<51:58, 9.42s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|███████████████████████████████████▊ | 263/594 [35:47<51:58, 9.42s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|████████████████████████████████████ | 264/594 [35:56<51:40, 9.40s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|████████████████████████████████████ | 264/594 [35:56<51:40, 9.40s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|████████████████████████████████████ | 264/594 [35:56<51:40, 9.40s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 44%|████████████████████████████████████ | 264/594 [35:56<51:40, 9.40s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▏ | 265/594 [36:05<51:25, 9.38s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▏ | 265/594 [36:05<51:25, 9.38s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1044, 'learning_rate': 5.240000000000001e-06, 'epoch': 0.45} 45%|████████████████████████████████████▏ | 265/594 [36:05<51:25, 9.38s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▏ | 265/594 [36:05<51:25, 9.38s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▏ | 265/594 [36:05<51:25, 9.38s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▎ | 266/594 [36:15<51:05, 9.34s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▎ | 266/594 [36:15<51:05, 9.34s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▎ | 266/594 [36:15<51:05, 9.34s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▎ | 266/594 [36:15<51:05, 9.34s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▍ | 267/594 [36:24<50:39, 9.29s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▍ | 267/594 [36:24<50:39, 9.29s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2392, 'learning_rate': 5.28e-06, 'epoch': 0.45} 45%|████████████████████████████████████▍ | 267/594 [36:24<50:39, 9.29s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▍ | 267/594 [36:24<50:39, 9.29s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▍ | 267/594 [36:24<50:39, 9.29s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▌ | 268/594 [36:33<50:11, 9.24s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▌ | 268/594 [36:33<50:11, 9.24s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▌ | 268/594 [36:33<50:11, 9.24s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▌ | 268/594 [36:33<50:11, 9.24s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▋ | 269/594 [36:42<49:59, 9.23s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▋ | 269/594 [36:42<49:59, 9.23s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2911, 'learning_rate': 5.320000000000001e-06, 'epoch': 0.45} 45%|████████████████████████████████████▋ | 269/594 [36:42<49:59, 9.23s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▋ | 269/594 [36:42<49:59, 9.23s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▋ | 269/594 [36:42<49:59, 9.23s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 45%|████████████████████████████████████▋ | 269/594 [36:42<49:59, 9.23s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2804, 'learning_rate': 5.3400000000000005e-06, 'epoch': 0.45} 45%|████████████████████████████████████▋ | 269/594 [36:42<49:59, 9.23s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:27:38,895 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|████████████████████████████████████▉ | 271/594 [37:00<49:07, 9.12s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|████████████████████████████████████▉ | 271/594 [37:00<49:07, 9.12s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2093, 'learning_rate': 5.36e-06, 'epoch': 0.46} 46%|████████████████████████████████████▉ | 271/594 [37:00<49:07, 9.12s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|████████████████████████████████████▉ | 271/594 [37:00<49:07, 9.12s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████ | 272/594 [37:09<48:42, 9.08s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████ | 272/594 [37:09<48:42, 9.08s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3397, 'learning_rate': 5.380000000000001e-06, 'epoch': 0.46} 46%|█████████████████████████████████████ | 272/594 [37:09<48:42, 9.08s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████ | 272/594 [37:09<48:42, 9.08s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████ | 272/594 [37:09<48:42, 9.08s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2085, 'learning_rate': 5.400000000000001e-06, 'epoch': 0.46} 46%|█████████████████████████████████████ | 272/594 [37:09<48:42, 9.08s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:28:03,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:28:03,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:28:03,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:28:03,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2158, 'learning_rate': 5.420000000000001e-06, 'epoch': 0.46} [WARNING|modeling_utils.py:388] 2022-02-28 19:28:03,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:28:03,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:28:03,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:28:03,578 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▌ | 275/594 [37:36<48:15, 9.08s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▌ | 275/594 [37:36<48:15, 9.08s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▌ | 275/594 [37:36<48:15, 9.08s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▌ | 275/594 [37:36<48:15, 9.08s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▋ | 276/594 [37:45<47:19, 8.93s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▋ | 276/594 [37:45<47:19, 8.93s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2118, 'learning_rate': 5.460000000000001e-06, 'epoch': 0.46} 46%|█████████████████████████████████████▋ | 276/594 [37:45<47:19, 8.93s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 46%|█████████████████████████████████████▋ | 276/594 [37:45<47:19, 8.93s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3481, 'learning_rate': 5.480000000000001e-06, 'epoch': 0.47} g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:28:42,759 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:28:42,759 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1363, 'learning_rate': 5.500000000000001e-06, 'epoch': 0.47} [WARNING|modeling_utils.py:388] 2022-02-28 19:28:49,035 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████ | 279/594 [38:10<45:14, 8.62s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████ | 279/594 [38:10<45:14, 8.62s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3209, 'learning_rate': 5.5200000000000005e-06, 'epoch': 0.47} 47%|██████████████████████████████████████ | 279/594 [38:10<45:14, 8.62s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████ | 279/594 [38:10<45:14, 8.62s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▏ | 280/594 [38:19<44:52, 8.57s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▏ | 280/594 [38:19<44:52, 8.57s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3781, 'learning_rate': 5.540000000000001e-06, 'epoch': 0.47} 47%|██████████████████████████████████████▏ | 280/594 [38:19<44:52, 8.57s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▏ | 280/594 [38:19<44:52, 8.57s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▎ | 281/594 [38:27<44:13, 8.48s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▎ | 281/594 [38:27<44:13, 8.48s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3857, 'learning_rate': 5.560000000000001e-06, 'epoch': 0.47} 47%|██████████████████████████████████████▎ | 281/594 [38:27<44:13, 8.48s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▎ | 281/594 [38:27<44:13, 8.48s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▍ | 282/594 [38:35<43:31, 8.37s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▍ | 282/594 [38:35<43:31, 8.37s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2594, 'learning_rate': 5.580000000000001e-06, 'epoch': 0.47} 47%|██████████████████████████████████████▍ | 282/594 [38:35<43:31, 8.37s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▍ | 282/594 [38:35<43:31, 8.37s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 47%|██████████████████████████████████████▍ | 282/594 [38:35<43:31, 8.37s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▌ | 283/594 [38:43<42:55, 8.28s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▌ | 283/594 [38:43<42:55, 8.28s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▌ | 283/594 [38:43<42:55, 8.28s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▌ | 283/594 [38:43<42:55, 8.28s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▌ | 283/594 [38:43<42:55, 8.28s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▋ | 284/594 [38:51<42:13, 8.17s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▋ | 284/594 [38:51<42:13, 8.17s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▋ | 284/594 [38:51<42:13, 8.17s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▋ | 284/594 [38:51<42:13, 8.17s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▋ | 284/594 [38:51<42:13, 8.17s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▊ | 285/594 [38:59<41:32, 8.07s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▊ | 285/594 [38:59<41:32, 8.07s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▊ | 285/594 [38:59<41:32, 8.07s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▊ | 285/594 [38:59<41:32, 8.07s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|██████████████████████████████████████▊ | 285/594 [38:59<41:32, 8.07s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████ | 286/594 [39:07<40:47, 7.95s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████ | 286/594 [39:07<40:47, 7.95s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:29:53,124 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████▏ | 287/594 [39:14<39:55, 7.80s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████▏ | 287/594 [39:14<39:55, 7.80s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.5148, 'learning_rate': 5.68e-06, 'epoch': 0.48} 48%|███████████████████████████████████████▏ | 287/594 [39:14<39:55, 7.80s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████▏ | 287/594 [39:14<39:55, 7.80s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████▏ | 287/594 [39:14<39:55, 7.80s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████▎ | 288/594 [39:21<39:00, 7.65s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████▎ | 288/594 [39:21<39:00, 7.65s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████▎ | 288/594 [39:21<39:00, 7.65s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████▎ | 288/594 [39:21<39:00, 7.65s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 48%|███████████████████████████████████████▎ | 288/594 [39:21<39:00, 7.65s/it]g-point operations will not be computed-28 19:23:55,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▍ | 289/594 [39:28<37:49, 7.44s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:30:11,010 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▍ | 289/594 [39:28<37:49, 7.44s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:30:11,010 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▍ | 289/594 [39:28<37:49, 7.44s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:30:11,010 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▍ | 289/594 [39:28<37:49, 7.44s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:30:11,010 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▌ | 290/594 [39:35<36:37, 7.23s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:30:11,010 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:30:19,239 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:30:11,010 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:30:19,239 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:30:11,010 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:30:19,239 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:30:11,010 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▋ | 291/594 [39:41<35:19, 6.99s/it]g-point operations will not be computed-28 19:30:11,010 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:30:25,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:30:11,010 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:30:25,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:30:11,010 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:30:25,525 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:30:11,010 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▊ | 292/594 [39:47<33:41, 6.69s/it]g-point operations will not be computed-28 19:30:11,010 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:30:31,288 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:30:11,010 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:30:31,288 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:30:11,010 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:30:31,288 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:30:11,010 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▉ | 293/594 [39:53<32:00, 6.38s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:30:35,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▉ | 293/594 [39:53<32:00, 6.38s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:30:35,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 49%|███████████████████████████████████████▉ | 293/594 [39:53<32:00, 6.38s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:30:35,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:30:39,300 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:30:35,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:30:41,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:30:35,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:30:41,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:30:35,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:30:41,811 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:30:35,464 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▏ | 295/594 [40:03<28:16, 5.67s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:30:45,301 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:30:47,355 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:30:45,301 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:30:47,355 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:30:45,301 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▎ | 296/594 [40:07<26:08, 5.26s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:30:49,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:30:51,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:30:49,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:30:51,336 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:30:49,479 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▌ | 297/594 [40:11<23:52, 4.82s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:30:53,187 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▌ | 297/594 [40:11<23:52, 4.82s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:30:53,187 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 50%|████████████████████████████████████████▋ | 298/594 [40:15<21:39, 4.39s/it]g-point operations will not be computed-28 19:30:53,187 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:30:57,780 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:30:56,440 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:30:57,780 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:30:56,440 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:31:00,248 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:30:59,107 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:31:00,248 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:30:59,107 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|████████████████████████████████████████▉ | 300/594 [40:20<17:45, 3.62s/it]g-point operations will not be computed-28 19:30:59,107 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|████████████████████████████████████████▉ | 300/594 [40:20<17:45, 3.62s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|████████████████████████████████████████▉ | 300/594 [40:20<17:45, 3.62s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:31:09,318 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████ | 301/594 [40:31<27:43, 5.68s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████ | 301/594 [40:31<27:43, 5.68s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3267, 'learning_rate': 5.9600000000000005e-06, 'epoch': 0.51} 51%|█████████████████████████████████████████ | 301/594 [40:31<27:43, 5.68s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████ | 301/594 [40:31<27:43, 5.68s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████ | 301/594 [40:31<27:43, 5.68s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▏ | 302/594 [40:41<33:57, 6.98s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▏ | 302/594 [40:41<33:57, 6.98s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▏ | 302/594 [40:41<33:57, 6.98s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▏ | 302/594 [40:41<33:57, 6.98s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▏ | 302/594 [40:41<33:57, 6.98s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▎ | 303/594 [40:51<38:06, 7.86s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▎ | 303/594 [40:51<38:06, 7.86s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▎ | 303/594 [40:51<38:06, 7.86s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▎ | 303/594 [40:51<38:06, 7.86s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▎ | 303/594 [40:51<38:06, 7.86s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▍ | 304/594 [41:01<40:48, 8.44s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▍ | 304/594 [41:01<40:48, 8.44s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▍ | 304/594 [41:01<40:48, 8.44s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▍ | 304/594 [41:01<40:48, 8.44s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▌ | 305/594 [41:10<42:39, 8.86s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▌ | 305/594 [41:10<42:39, 8.86s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3256, 'learning_rate': 6.040000000000001e-06, 'epoch': 0.51} 51%|█████████████████████████████████████████▌ | 305/594 [41:10<42:39, 8.86s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 51%|█████████████████████████████████████████▌ | 305/594 [41:10<42:39, 8.86s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|█████████████████████████████████████████▋ | 306/594 [41:20<43:44, 9.11s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|█████████████████████████████████████████▋ | 306/594 [41:20<43:44, 9.11s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.313, 'learning_rate': 6.0600000000000004e-06, 'epoch': 0.51} 52%|█████████████████████████████████████████▋ | 306/594 [41:20<43:44, 9.11s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|█████████████████████████████████████████▋ | 306/594 [41:20<43:44, 9.11s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|█████████████████████████████████████████▊ | 307/594 [41:30<44:20, 9.27s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|█████████████████████████████████████████▊ | 307/594 [41:30<44:20, 9.27s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1031, 'learning_rate': 6.08e-06, 'epoch': 0.52} 52%|█████████████████████████████████████████▊ | 307/594 [41:30<44:20, 9.27s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|█████████████████████████████████████████▊ | 307/594 [41:30<44:20, 9.27s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████ | 308/594 [41:39<44:36, 9.36s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████ | 308/594 [41:39<44:36, 9.36s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3191, 'learning_rate': 6.1e-06, 'epoch': 0.52} 52%|██████████████████████████████████████████ | 308/594 [41:39<44:36, 9.36s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████ | 308/594 [41:39<44:36, 9.36s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▏ | 309/594 [41:49<44:51, 9.44s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▏ | 309/594 [41:49<44:51, 9.44s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2111, 'learning_rate': 6.120000000000001e-06, 'epoch': 0.52} 52%|██████████████████████████████████████████▏ | 309/594 [41:49<44:51, 9.44s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▏ | 309/594 [41:49<44:51, 9.44s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▏ | 309/594 [41:49<44:51, 9.44s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▎ | 310/594 [41:58<44:47, 9.46s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▎ | 310/594 [41:58<44:47, 9.46s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▎ | 310/594 [41:58<44:47, 9.46s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▎ | 310/594 [41:58<44:47, 9.46s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▎ | 310/594 [41:58<44:47, 9.46s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▍ | 311/594 [42:08<44:40, 9.47s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▍ | 311/594 [42:08<44:40, 9.47s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▍ | 311/594 [42:08<44:40, 9.47s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▍ | 311/594 [42:08<44:40, 9.47s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 52%|██████████████████████████████████████████▍ | 311/594 [42:08<44:40, 9.47s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▌ | 312/594 [42:17<44:31, 9.47s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▌ | 312/594 [42:17<44:31, 9.47s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▌ | 312/594 [42:17<44:31, 9.47s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▌ | 312/594 [42:17<44:31, 9.47s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▋ | 313/594 [42:27<44:11, 9.44s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▋ | 313/594 [42:27<44:11, 9.44s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2251, 'learning_rate': 6.200000000000001e-06, 'epoch': 0.53} 53%|██████████████████████████████████████████▋ | 313/594 [42:27<44:11, 9.44s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▋ | 313/594 [42:27<44:11, 9.44s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▋ | 313/594 [42:27<44:11, 9.44s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▊ | 314/594 [42:36<43:45, 9.38s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▊ | 314/594 [42:36<43:45, 9.38s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▊ | 314/594 [42:36<43:45, 9.38s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▊ | 314/594 [42:36<43:45, 9.38s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▉ | 315/594 [42:45<43:28, 9.35s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▉ | 315/594 [42:45<43:28, 9.35s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2471, 'learning_rate': 6.24e-06, 'epoch': 0.53} 53%|██████████████████████████████████████████▉ | 315/594 [42:45<43:28, 9.35s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▉ | 315/594 [42:45<43:28, 9.35s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|██████████████████████████████████████████▉ | 315/594 [42:45<43:28, 9.35s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████ | 316/594 [42:54<43:05, 9.30s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████ | 316/594 [42:54<43:05, 9.30s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████ | 316/594 [42:54<43:05, 9.30s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████ | 316/594 [42:54<43:05, 9.30s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████ | 316/594 [42:54<43:05, 9.30s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████▏ | 317/594 [43:04<42:43, 9.25s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████▏ | 317/594 [43:04<42:43, 9.25s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████▏ | 317/594 [43:04<42:43, 9.25s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████▏ | 317/594 [43:04<42:43, 9.25s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 53%|███████████████████████████████████████████▏ | 317/594 [43:04<42:43, 9.25s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▎ | 318/594 [43:13<42:14, 9.18s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▎ | 318/594 [43:13<42:14, 9.18s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▎ | 318/594 [43:13<42:14, 9.18s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▎ | 318/594 [43:13<42:14, 9.18s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▎ | 318/594 [43:13<42:14, 9.18s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▌ | 319/594 [43:22<41:41, 9.10s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▌ | 319/594 [43:22<41:41, 9.10s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▌ | 319/594 [43:22<41:41, 9.10s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▌ | 319/594 [43:22<41:41, 9.10s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▋ | 320/594 [43:31<41:22, 9.06s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▋ | 320/594 [43:31<41:22, 9.06s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.265, 'learning_rate': 6.34e-06, 'epoch': 0.54} 54%|███████████████████████████████████████████▋ | 320/594 [43:31<41:22, 9.06s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▋ | 320/594 [43:31<41:22, 9.06s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▋ | 320/594 [43:31<41:22, 9.06s/it]g-point operations will not be computed-28 19:31:04,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▊ | 321/594 [43:39<40:57, 9.00s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▊ | 321/594 [43:39<40:57, 9.00s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▊ | 321/594 [43:39<40:57, 9.00s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▉ | 322/594 [43:48<40:36, 8.96s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▉ | 322/594 [43:48<40:36, 8.96s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1692, 'learning_rate': 6.380000000000001e-06, 'epoch': 0.54} 54%|███████████████████████████████████████████▉ | 322/594 [43:48<40:36, 8.96s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|███████████████████████████████████████████▉ | 322/594 [43:48<40:36, 8.96s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|████████████████████████████████████████████ | 323/594 [43:57<39:59, 8.85s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|████████████████████████████████████████████ | 323/594 [43:57<39:59, 8.85s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1871, 'learning_rate': 6.4000000000000006e-06, 'epoch': 0.54} 54%|████████████████████████████████████████████ | 323/594 [43:57<39:59, 8.85s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 54%|████████████████████████████████████████████ | 323/594 [43:57<39:59, 8.85s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▏ | 324/594 [44:06<39:34, 8.79s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▏ | 324/594 [44:06<39:34, 8.79s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2543, 'learning_rate': 6.42e-06, 'epoch': 0.54} 55%|████████████████████████████████████████████▏ | 324/594 [44:06<39:34, 8.79s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▏ | 324/594 [44:06<39:34, 8.79s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▎ | 325/594 [44:15<40:04, 8.94s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▎ | 325/594 [44:15<40:04, 8.94s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2072, 'learning_rate': 6.440000000000001e-06, 'epoch': 0.55} 55%|████████████████████████████████████████████▎ | 325/594 [44:15<40:04, 8.94s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▎ | 325/594 [44:15<40:04, 8.94s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▍ | 326/594 [44:23<39:31, 8.85s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▍ | 326/594 [44:23<39:31, 8.85s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1326, 'learning_rate': 6.460000000000001e-06, 'epoch': 0.55} 55%|████████████████████████████████████████████▍ | 326/594 [44:23<39:31, 8.85s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▍ | 326/594 [44:23<39:31, 8.85s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▌ | 327/594 [44:32<38:42, 8.70s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▌ | 327/594 [44:32<38:42, 8.70s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.4151, 'learning_rate': 6.480000000000001e-06, 'epoch': 0.55} 55%|████████████████████████████████████████████▌ | 327/594 [44:32<38:42, 8.70s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▌ | 327/594 [44:32<38:42, 8.70s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▌ | 327/594 [44:32<38:42, 8.70s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▋ | 328/594 [44:40<38:08, 8.61s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▋ | 328/594 [44:40<38:08, 8.61s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▋ | 328/594 [44:40<38:08, 8.61s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▋ | 328/594 [44:40<38:08, 8.61s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▋ | 328/594 [44:40<38:08, 8.61s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▊ | 329/594 [44:49<37:40, 8.53s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▊ | 329/594 [44:49<37:40, 8.53s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▊ | 329/594 [44:49<37:40, 8.53s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▊ | 329/594 [44:49<37:40, 8.53s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 55%|████████████████████████████████████████████▊ | 329/594 [44:49<37:40, 8.53s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████ | 330/594 [44:57<37:15, 8.47s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████ | 330/594 [44:57<37:15, 8.47s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████ | 330/594 [44:57<37:15, 8.47s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████ | 330/594 [44:57<37:15, 8.47s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████ | 330/594 [44:57<37:15, 8.47s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▏ | 331/594 [45:05<36:36, 8.35s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▏ | 331/594 [45:05<36:36, 8.35s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▏ | 331/594 [45:05<36:36, 8.35s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▏ | 331/594 [45:05<36:36, 8.35s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▏ | 331/594 [45:05<36:36, 8.35s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▎ | 332/594 [45:13<36:07, 8.27s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▎ | 332/594 [45:13<36:07, 8.27s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▎ | 332/594 [45:13<36:07, 8.27s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▎ | 332/594 [45:13<36:07, 8.27s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▎ | 332/594 [45:13<36:07, 8.27s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▍ | 333/594 [45:21<35:30, 8.16s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▍ | 333/594 [45:21<35:30, 8.16s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▍ | 333/594 [45:21<35:30, 8.16s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▍ | 333/594 [45:21<35:30, 8.16s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▍ | 333/594 [45:21<35:30, 8.16s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▌ | 334/594 [45:29<34:54, 8.05s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▌ | 334/594 [45:29<34:54, 8.05s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▌ | 334/594 [45:29<34:54, 8.05s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▌ | 334/594 [45:29<34:54, 8.05s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▌ | 334/594 [45:29<34:54, 8.05s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▋ | 335/594 [45:36<34:19, 7.95s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▋ | 335/594 [45:36<34:19, 7.95s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▋ | 335/594 [45:36<34:19, 7.95s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▋ | 335/594 [45:36<34:19, 7.95s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 56%|█████████████████████████████████████████████▋ | 335/594 [45:36<34:19, 7.95s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|█████████████████████████████████████████████▊ | 336/594 [45:44<33:47, 7.86s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:36:28,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:36:28,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:36:28,865 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|█████████████████████████████████████████████▉ | 337/594 [45:52<33:11, 7.75s/it]g-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|█████████████████████████████████████████████▉ | 337/594 [45:52<33:11, 7.75s/it]g-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|█████████████████████████████████████████████▉ | 337/594 [45:52<33:11, 7.75s/it]g-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|█████████████████████████████████████████████▉ | 337/594 [45:52<33:11, 7.75s/it]g-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|█████████████████████████████████████████████▉ | 337/594 [45:52<33:11, 7.75s/it]g-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|██████████████████████████████████████████████ | 338/594 [45:59<32:28, 7.61s/it]g-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:36:43,370 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:36:43,370 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|██████████████████████████████████████████████▏ | 339/594 [46:06<31:29, 7.41s/it]g-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 57%|██████████████████████████████████████████████▏ | 339/594 [46:06<31:29, 7.41s/it]g-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2594, 'learning_rate': 6.720000000000001e-06, 'epoch': 0.57} 57%|██████████████████████████████████████████████▏ | 339/594 [46:06<31:29, 7.41s/it]g-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:36:53,151 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:36:53,151 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.287, 'learning_rate': 6.740000000000001e-06, 'epoch': 0.57} [WARNING|modeling_utils.py:388] 2022-02-28 19:36:53,151 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:36:59,244 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:36:59,244 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2912, 'learning_rate': 6.760000000000001e-06, 'epoch': 0.57} [WARNING|modeling_utils.py:388] 2022-02-28 19:37:03,602 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:37:03,602 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 58%|██████████████████████████████████████████████▋ | 342/594 [46:24<27:13, 6.48s/it]g-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:37:07,705 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:37:07,705 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:37:07,705 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:34:22,619 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 58%|██████████████████████████████████████████████▊ | 343/594 [46:29<25:29, 6.09s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:37:11,470 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:37:13,781 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:11,470 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:37:13,781 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:11,470 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 58%|██████████████████████████████████████████████▉ | 344/594 [46:34<23:39, 5.68s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:37:16,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:37:18,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:16,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:37:18,094 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:16,101 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 58%|███████████████████████████████████████████████ | 345/594 [46:38<21:42, 5.23s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:37:20,132 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:37:22,008 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:20,132 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:37:22,008 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:20,132 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 58%|███████████████████████████████████████████████▏ | 346/594 [46:42<19:53, 4.81s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:37:23,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:37:25,580 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:23,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:37:25,580 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:23,854 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 58%|███████████████████████████████████████████████▎ | 347/594 [46:45<18:09, 4.41s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:37:27,293 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 58%|███████████████████████████████████████████████▎ | 347/594 [46:45<18:09, 4.41s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:37:27,293 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|███████████████████████████████████████████████▍ | 348/594 [46:48<16:26, 4.01s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:37:30,230 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|███████████████████████████████████████████████▌ | 349/594 [46:51<14:42, 3.60s/it]g-point operations will not be computed-28 19:37:30,230 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|███████████████████████████████████████████████▌ | 349/594 [46:51<14:42, 3.60s/it]g-point operations will not be computed-28 19:37:30,230 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:37:33,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:32,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:37:33,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:32,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|███████████████████████████████████████████████▋ | 350/594 [46:54<13:44, 3.38s/it]g-point operations will not be computed-28 19:37:32,804 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|███████████████████████████████████████████████▋ | 350/594 [46:54<13:44, 3.38s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|███████████████████████████████████████████████▋ | 350/594 [46:54<13:44, 3.38s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:37:42,882 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:37:42,882 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|███████████████████████████████████████████████▊ | 351/594 [47:04<22:16, 5.50s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|███████████████████████████████████████████████▊ | 351/594 [47:04<22:16, 5.50s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|███████████████████████████████████████████████▊ | 351/594 [47:04<22:16, 5.50s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|███████████████████████████████████████████████▊ | 351/594 [47:04<22:16, 5.50s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|███████████████████████████████████████████████▊ | 351/594 [47:04<22:16, 5.50s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|████████████████████████████████████████████████ | 352/594 [47:15<27:46, 6.89s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|████████████████████████████████████████████████ | 352/594 [47:15<27:46, 6.89s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|████████████████████████████████████████████████ | 352/594 [47:15<27:46, 6.89s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|████████████████████████████████████████████████ | 352/594 [47:15<27:46, 6.89s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|████████████████████████████████████████████████ | 352/594 [47:15<27:46, 6.89s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|████████████████████████████████████████████████▏ | 353/594 [47:25<31:24, 7.82s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|████████████████████████████████████████████████▏ | 353/594 [47:25<31:24, 7.82s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|████████████████████████████████████████████████▏ | 353/594 [47:25<31:24, 7.82s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|████████████████████████████████████████████████▏ | 353/594 [47:25<31:24, 7.82s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|████████████████████████████████████████████████▏ | 353/594 [47:25<31:24, 7.82s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|████████████████████████████████████████████████▏ | 353/594 [47:25<31:24, 7.82s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1032, 'learning_rate': 7.0200000000000006e-06, 'epoch': 0.6} 59%|████████████████████████████████████████████████▏ | 353/594 [47:25<31:24, 7.82s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|████████████████████████████████████████████████▏ | 353/594 [47:25<31:24, 7.82s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 59%|████████████████████████████████████████████████▏ | 353/594 [47:25<31:24, 7.82s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▍ | 355/594 [47:44<35:13, 8.84s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▍ | 355/594 [47:44<35:13, 8.84s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1884, 'learning_rate': 7.04e-06, 'epoch': 0.6} 60%|████████████████████████████████████████████████▍ | 355/594 [47:44<35:13, 8.84s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▍ | 355/594 [47:44<35:13, 8.84s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▌ | 356/594 [47:54<36:12, 9.13s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▌ | 356/594 [47:54<36:12, 9.13s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2002, 'learning_rate': 7.06e-06, 'epoch': 0.6} 60%|████████████████████████████████████████████████▌ | 356/594 [47:54<36:12, 9.13s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▌ | 356/594 [47:54<36:12, 9.13s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▋ | 357/594 [48:04<36:48, 9.32s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▋ | 357/594 [48:04<36:48, 9.32s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2122, 'learning_rate': 7.08e-06, 'epoch': 0.6} 60%|████████████████████████████████████████████████▋ | 357/594 [48:04<36:48, 9.32s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▋ | 357/594 [48:04<36:48, 9.32s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▊ | 358/594 [48:14<37:13, 9.46s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▊ | 358/594 [48:14<37:13, 9.46s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1464, 'learning_rate': 7.100000000000001e-06, 'epoch': 0.6} 60%|████████████████████████████████████████████████▊ | 358/594 [48:14<37:13, 9.46s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▊ | 358/594 [48:14<37:13, 9.46s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▉ | 359/594 [48:23<37:19, 9.53s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▉ | 359/594 [48:23<37:19, 9.53s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1748, 'learning_rate': 7.1200000000000004e-06, 'epoch': 0.6} 60%|████████████████████████████████████████████████▉ | 359/594 [48:23<37:19, 9.53s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▉ | 359/594 [48:23<37:19, 9.53s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 60%|████████████████████████████████████████████████▉ | 359/594 [48:23<37:19, 9.53s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████ | 360/594 [48:33<37:14, 9.55s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████ | 360/594 [48:33<37:14, 9.55s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████ | 360/594 [48:33<37:14, 9.55s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████ | 360/594 [48:33<37:14, 9.55s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████ | 360/594 [48:33<37:14, 9.55s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▏ | 361/594 [48:42<36:52, 9.49s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▏ | 361/594 [48:42<36:52, 9.49s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▏ | 361/594 [48:42<36:52, 9.49s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▏ | 361/594 [48:42<36:52, 9.49s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▏ | 361/594 [48:42<36:52, 9.49s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▎ | 362/594 [48:52<36:28, 9.43s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▎ | 362/594 [48:52<36:28, 9.43s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▎ | 362/594 [48:52<36:28, 9.43s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▎ | 362/594 [48:52<36:28, 9.43s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▌ | 363/594 [49:01<36:11, 9.40s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▌ | 363/594 [49:01<36:11, 9.40s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2577, 'learning_rate': 7.2000000000000005e-06, 'epoch': 0.61} 61%|█████████████████████████████████████████████████▌ | 363/594 [49:01<36:11, 9.40s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▌ | 363/594 [49:01<36:11, 9.40s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▌ | 363/594 [49:01<36:11, 9.40s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▋ | 364/594 [49:10<35:58, 9.38s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▋ | 364/594 [49:10<35:58, 9.38s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▋ | 364/594 [49:10<35:58, 9.38s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▋ | 364/594 [49:10<35:58, 9.38s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▋ | 364/594 [49:10<35:58, 9.38s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▊ | 365/594 [49:19<35:41, 9.35s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▊ | 365/594 [49:19<35:41, 9.35s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▊ | 365/594 [49:19<35:41, 9.35s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▊ | 365/594 [49:19<35:41, 9.35s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 61%|█████████████████████████████████████████████████▊ | 365/594 [49:19<35:41, 9.35s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|█████████████████████████████████████████████████▉ | 366/594 [49:29<35:18, 9.29s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|█████████████████████████████████████████████████▉ | 366/594 [49:29<35:18, 9.29s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|█████████████████████████████████████████████████▉ | 366/594 [49:29<35:18, 9.29s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|█████████████████████████████████████████████████▉ | 366/594 [49:29<35:18, 9.29s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████ | 367/594 [49:38<34:55, 9.23s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████ | 367/594 [49:38<34:55, 9.23s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.179, 'learning_rate': 7.280000000000001e-06, 'epoch': 0.62} 62%|██████████████████████████████████████████████████ | 367/594 [49:38<34:55, 9.23s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████ | 367/594 [49:38<34:55, 9.23s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████ | 367/594 [49:38<34:55, 9.23s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████ | 367/594 [49:38<34:55, 9.23s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▏ | 368/594 [49:47<34:37, 9.19s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▏ | 368/594 [49:47<34:37, 9.19s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▏ | 368/594 [49:47<34:37, 9.19s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▏ | 368/594 [49:47<34:37, 9.19s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▎ | 369/594 [49:56<34:24, 9.18s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▎ | 369/594 [49:56<34:24, 9.18s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2615, 'learning_rate': 7.32e-06, 'epoch': 0.62} 62%|██████████████████████████████████████████████████▎ | 369/594 [49:56<34:24, 9.18s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▎ | 369/594 [49:56<34:24, 9.18s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▎ | 369/594 [49:56<34:24, 9.18s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▍ | 370/594 [50:05<34:02, 9.12s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▍ | 370/594 [50:05<34:02, 9.12s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▍ | 370/594 [50:05<34:02, 9.12s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▍ | 370/594 [50:05<34:02, 9.12s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▌ | 371/594 [50:14<33:40, 9.06s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▌ | 371/594 [50:14<33:40, 9.06s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0527, 'learning_rate': 7.360000000000001e-06, 'epoch': 0.62} 62%|██████████████████████████████████████████████████▌ | 371/594 [50:14<33:40, 9.06s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 62%|██████████████████████████████████████████████████▌ | 371/594 [50:14<33:40, 9.06s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|██████████████████████████████████████████████████▋ | 372/594 [50:23<33:26, 9.04s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|██████████████████████████████████████████████████▋ | 372/594 [50:23<33:26, 9.04s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2189, 'learning_rate': 7.3800000000000005e-06, 'epoch': 0.63} [WARNING|modeling_utils.py:388] 2022-02-28 19:41:10,528 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:41:10,528 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:41:10,528 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0695, 'learning_rate': 7.4e-06, 'epoch': 0.63} [WARNING|modeling_utils.py:388] 2022-02-28 19:41:10,528 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:41:10,528 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:41:10,528 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████ | 374/594 [50:41<32:46, 8.94s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████ | 374/594 [50:41<32:46, 8.94s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0884, 'learning_rate': 7.420000000000001e-06, 'epoch': 0.63} 63%|███████████████████████████████████████████████████ | 374/594 [50:41<32:46, 8.94s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████ | 374/594 [50:41<32:46, 8.94s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▏ | 375/594 [50:50<33:07, 9.07s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▏ | 375/594 [50:50<33:07, 9.07s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2513, 'learning_rate': 7.440000000000001e-06, 'epoch': 0.63} 63%|███████████████████████████████████████████████████▏ | 375/594 [50:50<33:07, 9.07s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▏ | 375/594 [50:50<33:07, 9.07s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▎ | 376/594 [50:59<32:35, 8.97s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▎ | 376/594 [50:59<32:35, 8.97s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1336, 'learning_rate': 7.4600000000000006e-06, 'epoch': 0.63} 63%|███████████████████████████████████████████████████▎ | 376/594 [50:59<32:35, 8.97s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▎ | 376/594 [50:59<32:35, 8.97s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▍ | 377/594 [51:07<32:03, 8.86s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▍ | 377/594 [51:07<32:03, 8.86s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2286, 'learning_rate': 7.48e-06, 'epoch': 0.63} 63%|███████████████████████████████████████████████████▍ | 377/594 [51:07<32:03, 8.86s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▍ | 377/594 [51:07<32:03, 8.86s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 63%|███████████████████████████████████████████████████▍ | 377/594 [51:07<32:03, 8.86s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▌ | 378/594 [51:16<31:36, 8.78s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▌ | 378/594 [51:16<31:36, 8.78s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▌ | 378/594 [51:16<31:36, 8.78s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▌ | 378/594 [51:16<31:36, 8.78s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▋ | 379/594 [51:24<31:06, 8.68s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▋ | 379/594 [51:24<31:06, 8.68s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1134, 'learning_rate': 7.520000000000001e-06, 'epoch': 0.64} 64%|███████████████████████████████████████████████████▋ | 379/594 [51:24<31:06, 8.68s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▋ | 379/594 [51:24<31:06, 8.68s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▊ | 380/594 [51:33<30:36, 8.58s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▊ | 380/594 [51:33<30:36, 8.58s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.337, 'learning_rate': 7.540000000000001e-06, 'epoch': 0.64} 64%|███████████████████████████████████████████████████▊ | 380/594 [51:33<30:36, 8.58s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▊ | 380/594 [51:33<30:36, 8.58s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▉ | 381/594 [51:41<30:09, 8.50s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▉ | 381/594 [51:41<30:09, 8.50s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2637, 'learning_rate': 7.5600000000000005e-06, 'epoch': 0.64} 64%|███████████████████████████████████████████████████▉ | 381/594 [51:41<30:09, 8.50s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|███████████████████████████████████████████████████▉ | 381/594 [51:41<30:09, 8.50s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|████████████████████████████████████████████████████ | 382/594 [51:49<29:40, 8.40s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|████████████████████████████████████████████████████ | 382/594 [51:49<29:40, 8.40s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1276, 'learning_rate': 7.58e-06, 'epoch': 0.64} 64%|████████████████████████████████████████████████████ | 382/594 [51:49<29:40, 8.40s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|████████████████████████████████████████████████████ | 382/594 [51:49<29:40, 8.40s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|████████████████████████████████████████████████████▏ | 383/594 [51:57<29:13, 8.31s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|████████████████████████████████████████████████████▏ | 383/594 [51:57<29:13, 8.31s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0296, 'learning_rate': 7.600000000000001e-06, 'epoch': 0.64} 64%|████████████████████████████████████████████████████▏ | 383/594 [51:57<29:13, 8.31s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 64%|████████████████████████████████████████████████████▏ | 383/594 [51:57<29:13, 8.31s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▎ | 384/594 [52:05<28:46, 8.22s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▎ | 384/594 [52:05<28:46, 8.22s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2207, 'learning_rate': 7.620000000000001e-06, 'epoch': 0.65} 65%|████████████████████████████████████████████████████▎ | 384/594 [52:05<28:46, 8.22s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▎ | 384/594 [52:05<28:46, 8.22s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▎ | 384/594 [52:05<28:46, 8.22s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▌ | 385/594 [52:13<28:16, 8.12s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▌ | 385/594 [52:13<28:16, 8.12s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▌ | 385/594 [52:13<28:16, 8.12s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▌ | 385/594 [52:13<28:16, 8.12s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▌ | 385/594 [52:13<28:16, 8.12s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▋ | 386/594 [52:21<27:40, 7.98s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▋ | 386/594 [52:21<27:40, 7.98s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▋ | 386/594 [52:21<27:40, 7.98s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▋ | 386/594 [52:21<27:40, 7.98s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▋ | 386/594 [52:21<27:40, 7.98s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▊ | 387/594 [52:28<27:04, 7.85s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:43:13,097 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:43:13,097 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:43:13,097 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▉ | 388/594 [52:36<26:29, 7.72s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▉ | 388/594 [52:36<26:29, 7.72s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▉ | 388/594 [52:36<26:29, 7.72s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▉ | 388/594 [52:36<26:29, 7.72s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|████████████████████████████████████████████████████▉ | 388/594 [52:36<26:29, 7.72s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|█████████████████████████████████████████████████████ | 389/594 [52:43<25:42, 7.53s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 65%|█████████████████████████████████████████████████████ | 389/594 [52:43<25:42, 7.53s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:43:28,958 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:43:28,958 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████▏ | 390/594 [52:50<24:52, 7.32s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████▏ | 390/594 [52:50<24:52, 7.32s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:43:35,448 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████▎ | 391/594 [52:56<23:48, 7.04s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████▎ | 391/594 [52:56<23:48, 7.04s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3634, 'learning_rate': 7.76e-06, 'epoch': 0.66} [WARNING|modeling_utils.py:388] 2022-02-28 19:43:41,524 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████▍ | 392/594 [53:02<22:39, 6.73s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████▍ | 392/594 [53:02<22:39, 6.73s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2915, 'learning_rate': 7.78e-06, 'epoch': 0.66} [WARNING|modeling_utils.py:388] 2022-02-28 19:43:47,300 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:43:47,300 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████▌ | 393/594 [53:08<21:25, 6.40s/it]g-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:43:51,311 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:43:53,754 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:43:53,754 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3662, 'learning_rate': 7.820000000000001e-06, 'epoch': 0.66} [WARNING|modeling_utils.py:388] 2022-02-28 19:43:57,246 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:43:57,246 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:37:37,752 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 66%|█████████████████████████████████████████████████████▊ | 395/594 [53:17<18:30, 5.58s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:43:59,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:44:01,412 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:43:59,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:44:01,412 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:43:59,425 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|██████████████████████████████████████████████████████ | 396/594 [53:21<16:54, 5.13s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:44:03,427 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|██████████████████████████████████████████████████████▏ | 397/594 [53:25<15:25, 4.70s/it]g-point operations will not be computed-28 19:44:03,427 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|██████████████████████████████████████████████████████▏ | 397/594 [53:25<15:25, 4.70s/it]g-point operations will not be computed-28 19:44:03,427 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|██████████████████████████████████████████████████████▏ | 397/594 [53:25<15:25, 4.70s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:44:07,032 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|██████████████████████████████████████████████████████▏ | 397/594 [53:25<15:25, 4.70s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:44:07,032 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|██████████████████████████████████████████████████████▎ | 398/594 [53:28<13:58, 4.28s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:44:10,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 19:44:10,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 19:44:10,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:44:14,066 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:44:12,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:44:14,066 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:44:12,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|██████████████████████████████████████████████████████▌ | 400/594 [53:34<11:32, 3.57s/it]g-point operations will not be computed-28 19:44:12,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|██████████████████████████████████████████████████████▌ | 400/594 [53:34<11:32, 3.57s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 67%|██████████████████████████████████████████████████████▌ | 400/594 [53:34<11:32, 3.57s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:44:23,057 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▋ | 401/594 [53:45<18:06, 5.63s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▋ | 401/594 [53:45<18:06, 5.63s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2421, 'learning_rate': 7.960000000000002e-06, 'epoch': 0.67} 68%|██████████████████████████████████████████████████████▋ | 401/594 [53:45<18:06, 5.63s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▋ | 401/594 [53:45<18:06, 5.63s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▊ | 402/594 [53:55<22:23, 7.00s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▊ | 402/594 [53:55<22:23, 7.00s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1492, 'learning_rate': 7.980000000000002e-06, 'epoch': 0.68} 68%|██████████████████████████████████████████████████████▊ | 402/594 [53:55<22:23, 7.00s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▊ | 402/594 [53:55<22:23, 7.00s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▉ | 403/594 [54:05<25:10, 7.91s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▉ | 403/594 [54:05<25:10, 7.91s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.211, 'learning_rate': 8.000000000000001e-06, 'epoch': 0.68} 68%|██████████████████████████████████████████████████████▉ | 403/594 [54:05<25:10, 7.91s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▉ | 403/594 [54:05<25:10, 7.91s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▉ | 403/594 [54:05<25:10, 7.91s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▉ | 403/594 [54:05<25:10, 7.91s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1921, 'learning_rate': 8.020000000000001e-06, 'epoch': 0.68} 68%|██████████████████████████████████████████████████████▉ | 403/594 [54:05<25:10, 7.91s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▉ | 403/594 [54:05<25:10, 7.91s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|██████████████████████████████████████████████████████▉ | 403/594 [54:05<25:10, 7.91s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████▏ | 405/594 [54:24<27:47, 8.82s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████▏ | 405/594 [54:24<27:47, 8.82s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1333, 'learning_rate': 8.040000000000001e-06, 'epoch': 0.68} 68%|███████████████████████████████████████████████████████▏ | 405/594 [54:24<27:47, 8.82s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████▏ | 405/594 [54:24<27:47, 8.82s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████▎ | 406/594 [54:34<28:29, 9.09s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████▎ | 406/594 [54:34<28:29, 9.09s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1795, 'learning_rate': 8.06e-06, 'epoch': 0.68} 68%|███████████████████████████████████████████████████████▎ | 406/594 [54:34<28:29, 9.09s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 68%|███████████████████████████████████████████████████████▎ | 406/594 [54:34<28:29, 9.09s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▌ | 407/594 [54:44<28:57, 9.29s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▌ | 407/594 [54:44<28:57, 9.29s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1875, 'learning_rate': 8.08e-06, 'epoch': 0.68} 69%|███████████████████████████████████████████████████████▌ | 407/594 [54:44<28:57, 9.29s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▌ | 407/594 [54:44<28:57, 9.29s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▋ | 408/594 [54:53<29:07, 9.39s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▋ | 408/594 [54:53<29:07, 9.39s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1004, 'learning_rate': 8.1e-06, 'epoch': 0.69} 69%|███████████████████████████████████████████████████████▋ | 408/594 [54:53<29:07, 9.39s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▋ | 408/594 [54:53<29:07, 9.39s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▊ | 409/594 [55:03<29:06, 9.44s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▊ | 409/594 [55:03<29:06, 9.44s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1633, 'learning_rate': 8.120000000000002e-06, 'epoch': 0.69} 69%|███████████████████████████████████████████████████████▊ | 409/594 [55:03<29:06, 9.44s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▊ | 409/594 [55:03<29:06, 9.44s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▉ | 410/594 [55:12<28:59, 9.46s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▉ | 410/594 [55:12<28:59, 9.46s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1125, 'learning_rate': 8.14e-06, 'epoch': 0.69} 69%|███████████████████████████████████████████████████████▉ | 410/594 [55:12<28:59, 9.46s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▉ | 410/594 [55:12<28:59, 9.46s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▉ | 410/594 [55:12<28:59, 9.46s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▉ | 410/594 [55:12<28:59, 9.46s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1233, 'learning_rate': 8.16e-06, 'epoch': 0.69} 69%|███████████████████████████████████████████████████████▉ | 410/594 [55:12<28:59, 9.46s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▉ | 410/594 [55:12<28:59, 9.46s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|███████████████████████████████████████████████████████▉ | 410/594 [55:12<28:59, 9.46s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|████████████████████████████████████████████████████████▏ | 412/594 [55:31<28:39, 9.45s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|████████████████████████████████████████████████████████▏ | 412/594 [55:31<28:39, 9.45s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2896, 'learning_rate': 8.18e-06, 'epoch': 0.69} 69%|████████████████████████████████████████████████████████▏ | 412/594 [55:31<28:39, 9.45s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|████████████████████████████████████████████████████████▏ | 412/594 [55:31<28:39, 9.45s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 69%|████████████████████████████████████████████████████████▏ | 412/594 [55:31<28:39, 9.45s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▎ | 413/594 [55:41<28:26, 9.43s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▎ | 413/594 [55:41<28:26, 9.43s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▎ | 413/594 [55:41<28:26, 9.43s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▎ | 413/594 [55:41<28:26, 9.43s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▎ | 413/594 [55:41<28:26, 9.43s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▍ | 414/594 [55:50<28:09, 9.39s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▍ | 414/594 [55:50<28:09, 9.39s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▍ | 414/594 [55:50<28:09, 9.39s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▍ | 414/594 [55:50<28:09, 9.39s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▌ | 415/594 [55:59<27:48, 9.32s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▌ | 415/594 [55:59<27:48, 9.32s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2061, 'learning_rate': 8.24e-06, 'epoch': 0.7} 70%|████████████████████████████████████████████████████████▌ | 415/594 [55:59<27:48, 9.32s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▌ | 415/594 [55:59<27:48, 9.32s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▌ | 415/594 [55:59<27:48, 9.32s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▋ | 416/594 [56:08<27:34, 9.29s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▋ | 416/594 [56:08<27:34, 9.29s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▋ | 416/594 [56:08<27:34, 9.29s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▋ | 416/594 [56:08<27:34, 9.29s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▋ | 416/594 [56:08<27:34, 9.29s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▋ | 416/594 [56:08<27:34, 9.29s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1973, 'learning_rate': 8.28e-06, 'epoch': 0.7} 70%|████████████████████████████████████████████████████████▋ | 416/594 [56:08<27:34, 9.29s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▋ | 416/594 [56:08<27:34, 9.29s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|████████████████████████████████████████████████████████▋ | 416/594 [56:08<27:34, 9.29s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|█████████████████████████████████████████████████████████ | 418/594 [56:27<26:59, 9.20s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|█████████████████████████████████████████████████████████ | 418/594 [56:27<26:59, 9.20s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|█████████████████████████████████████████████████████████ | 418/594 [56:27<26:59, 9.20s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 70%|█████████████████████████████████████████████████████████ | 418/594 [56:27<26:59, 9.20s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▏ | 419/594 [56:36<26:39, 9.14s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▏ | 419/594 [56:36<26:39, 9.14s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2931, 'learning_rate': 8.32e-06, 'epoch': 0.7} 71%|█████████████████████████████████████████████████████████▏ | 419/594 [56:36<26:39, 9.14s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▏ | 419/594 [56:36<26:39, 9.14s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▏ | 419/594 [56:36<26:39, 9.14s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▎ | 420/594 [56:45<26:22, 9.10s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▎ | 420/594 [56:45<26:22, 9.10s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▎ | 420/594 [56:45<26:22, 9.10s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▎ | 420/594 [56:45<26:22, 9.10s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▎ | 420/594 [56:45<26:22, 9.10s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▍ | 421/594 [56:54<26:04, 9.04s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▍ | 421/594 [56:54<26:04, 9.04s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▍ | 421/594 [56:54<26:04, 9.04s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▍ | 421/594 [56:54<26:04, 9.04s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▌ | 422/594 [57:03<25:52, 9.03s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▌ | 422/594 [57:03<25:52, 9.03s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1124, 'learning_rate': 8.380000000000001e-06, 'epoch': 0.71} 71%|█████████████████████████████████████████████████████████▌ | 422/594 [57:03<25:52, 9.03s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▌ | 422/594 [57:03<25:52, 9.03s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▌ | 422/594 [57:03<25:52, 9.03s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▋ | 423/594 [57:11<25:36, 8.99s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▋ | 423/594 [57:11<25:36, 8.99s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▋ | 423/594 [57:11<25:36, 8.99s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▋ | 423/594 [57:11<25:36, 8.99s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▊ | 424/594 [57:20<25:16, 8.92s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▊ | 424/594 [57:20<25:16, 8.92s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1725, 'learning_rate': 8.42e-06, 'epoch': 0.71} 71%|█████████████████████████████████████████████████████████▊ | 424/594 [57:20<25:16, 8.92s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 71%|█████████████████████████████████████████████████████████▊ | 424/594 [57:20<25:16, 8.92s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|█████████████████████████████████████████████████████████▉ | 425/594 [57:30<25:26, 9.03s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|█████████████████████████████████████████████████████████▉ | 425/594 [57:30<25:26, 9.03s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.166, 'learning_rate': 8.44e-06, 'epoch': 0.71} 72%|█████████████████████████████████████████████████████████▉ | 425/594 [57:30<25:26, 9.03s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|█████████████████████████████████████████████████████████▉ | 425/594 [57:30<25:26, 9.03s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|█████████████████████████████████████████████████████████▉ | 425/594 [57:30<25:26, 9.03s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████ | 426/594 [57:38<25:01, 8.94s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████ | 426/594 [57:38<25:01, 8.94s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████ | 426/594 [57:38<25:01, 8.94s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████ | 426/594 [57:38<25:01, 8.94s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████ | 426/594 [57:38<25:01, 8.94s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▏ | 427/594 [57:47<24:35, 8.84s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▏ | 427/594 [57:47<24:35, 8.84s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▏ | 427/594 [57:47<24:35, 8.84s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▏ | 427/594 [57:47<24:35, 8.84s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▎ | 428/594 [57:55<24:07, 8.72s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▎ | 428/594 [57:55<24:07, 8.72s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1219, 'learning_rate': 8.5e-06, 'epoch': 0.72} 72%|██████████████████████████████████████████████████████████▎ | 428/594 [57:55<24:07, 8.72s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▎ | 428/594 [57:55<24:07, 8.72s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▌ | 429/594 [58:04<23:44, 8.63s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▌ | 429/594 [58:04<23:44, 8.63s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.237, 'learning_rate': 8.52e-06, 'epoch': 0.72} 72%|██████████████████████████████████████████████████████████▌ | 429/594 [58:04<23:44, 8.63s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▌ | 429/594 [58:04<23:44, 8.63s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▋ | 430/594 [58:12<23:28, 8.59s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▋ | 430/594 [58:12<23:28, 8.59s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1834, 'learning_rate': 8.540000000000001e-06, 'epoch': 0.72} 72%|██████████████████████████████████████████████████████████▋ | 430/594 [58:12<23:28, 8.59s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 72%|██████████████████████████████████████████████████████████▋ | 430/594 [58:12<23:28, 8.59s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|██████████████████████████████████████████████████████████▊ | 431/594 [58:20<23:03, 8.49s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|██████████████████████████████████████████████████████████▊ | 431/594 [58:20<23:03, 8.49s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2492, 'learning_rate': 8.560000000000001e-06, 'epoch': 0.72} 73%|██████████████████████████████████████████████████████████▊ | 431/594 [58:20<23:03, 8.49s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|██████████████████████████████████████████████████████████▊ | 431/594 [58:20<23:03, 8.49s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|██████████████████████████████████████████████████████████▊ | 431/594 [58:20<23:03, 8.49s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|██████████████████████████████████████████████████████████▉ | 432/594 [58:29<22:44, 8.42s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|██████████████████████████████████████████████████████████▉ | 432/594 [58:29<22:44, 8.42s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|██████████████████████████████████████████████████████████▉ | 432/594 [58:29<22:44, 8.42s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|██████████████████████████████████████████████████████████▉ | 432/594 [58:29<22:44, 8.42s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|██████████████████████████████████████████████████████████▉ | 432/594 [58:29<22:44, 8.42s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████ | 433/594 [58:37<22:35, 8.42s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████ | 433/594 [58:37<22:35, 8.42s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████ | 433/594 [58:37<22:35, 8.42s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████ | 433/594 [58:37<22:35, 8.42s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████ | 433/594 [58:37<22:35, 8.42s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▏ | 434/594 [58:45<22:20, 8.38s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▏ | 434/594 [58:45<22:20, 8.38s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▏ | 434/594 [58:45<22:20, 8.38s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▏ | 434/594 [58:45<22:20, 8.38s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▏ | 434/594 [58:45<22:20, 8.38s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▎ | 435/594 [58:53<21:56, 8.28s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▎ | 435/594 [58:53<21:56, 8.28s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▎ | 435/594 [58:53<21:56, 8.28s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▎ | 435/594 [58:53<21:56, 8.28s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▎ | 435/594 [58:53<21:56, 8.28s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▍ | 436/594 [59:02<21:53, 8.31s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▍ | 436/594 [59:02<21:53, 8.31s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▍ | 436/594 [59:02<21:53, 8.31s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 73%|███████████████████████████████████████████████████████████▍ | 436/594 [59:02<21:53, 8.31s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▌ | 437/594 [59:10<21:43, 8.31s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▌ | 437/594 [59:10<21:43, 8.31s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0674, 'learning_rate': 8.68e-06, 'epoch': 0.73} 74%|███████████████████████████████████████████████████████████▌ | 437/594 [59:10<21:43, 8.31s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▌ | 437/594 [59:10<21:43, 8.31s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▌ | 437/594 [59:10<21:43, 8.31s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▋ | 438/594 [59:18<21:16, 8.18s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▋ | 438/594 [59:18<21:16, 8.18s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▋ | 438/594 [59:18<21:16, 8.18s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▋ | 438/594 [59:18<21:16, 8.18s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▋ | 438/594 [59:18<21:16, 8.18s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▊ | 439/594 [59:25<20:37, 7.98s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▊ | 439/594 [59:25<20:37, 7.98s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▊ | 439/594 [59:25<20:37, 7.98s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▊ | 439/594 [59:25<20:37, 7.98s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|███████████████████████████████████████████████████████████▊ | 439/594 [59:25<20:37, 7.98s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|████████████████████████████████████████████████████████████ | 440/594 [59:33<20:07, 7.84s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:50:17,607 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:50:17,607 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:50:17,607 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|████████████████████████████████████████████████████████████▏ | 441/594 [59:40<19:22, 7.60s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|████████████████████████████████████████████████████████████▏ | 441/594 [59:40<19:22, 7.60s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:50:25,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:50:25,910 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|████████████████████████████████████████████████████████████▎ | 442/594 [59:46<18:22, 7.25s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 74%|████████████████████████████████████████████████████████████▎ | 442/594 [59:46<18:22, 7.25s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:50:31,874 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:50:31,874 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|████████████████████████████████████████████████████████████▍ | 443/594 [59:52<17:10, 6.83s/it]g-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:50:36,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:50:36,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:50:36,126 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:44:17,925 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|████████████████████████████████████████████████████████████▌ | 444/594 [59:58<16:02, 6.42s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:50:42,480 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:50:42,480 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 75%|███████████████████████████████████████████████████████████▏ | 445/594 [1:00:03<14:49, 5.97s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:50:46,076 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:50:48,242 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:50:48,242 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:50:50,381 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:50:52,312 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:50:52,312 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:50:54,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:50:54,212 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:50:55,868 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:50:57,452 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:50:57,452 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:51:00,194 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:51:00,194 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:51:01,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:51:01,916 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:51:07,706 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:51:07,706 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:51:07,706 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|███████████████████████████████████████████████████████████▉ | 451/594 [1:00:32<14:16, 5.99s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|███████████████████████████████████████████████████████████▉ | 451/594 [1:00:32<14:16, 5.99s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|███████████████████████████████████████████████████████████▉ | 451/594 [1:00:32<14:16, 5.99s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|███████████████████████████████████████████████████████████▉ | 451/594 [1:00:32<14:16, 5.99s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|███████████████████████████████████████████████████████████▉ | 451/594 [1:00:32<14:16, 5.99s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████ | 452/594 [1:00:43<17:24, 7.35s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████ | 452/594 [1:00:43<17:24, 7.35s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████ | 452/594 [1:00:43<17:24, 7.35s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████ | 452/594 [1:00:43<17:24, 7.35s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████ | 452/594 [1:00:43<17:24, 7.35s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████▏ | 453/594 [1:00:53<19:19, 8.22s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████▏ | 453/594 [1:00:53<19:19, 8.22s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████▏ | 453/594 [1:00:53<19:19, 8.22s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████▏ | 453/594 [1:00:53<19:19, 8.22s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████▏ | 453/594 [1:00:53<19:19, 8.22s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████▍ | 454/594 [1:01:03<20:34, 8.81s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████▍ | 454/594 [1:01:03<20:34, 8.81s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████▍ | 454/594 [1:01:03<20:34, 8.81s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████▍ | 454/594 [1:01:03<20:34, 8.81s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 76%|████████████████████████████████████████████████████████████▍ | 454/594 [1:01:03<20:34, 8.81s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▌ | 455/594 [1:01:13<21:20, 9.21s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▌ | 455/594 [1:01:13<21:20, 9.21s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▌ | 455/594 [1:01:13<21:20, 9.21s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▌ | 455/594 [1:01:13<21:20, 9.21s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▌ | 455/594 [1:01:13<21:20, 9.21s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▌ | 455/594 [1:01:13<21:20, 9.21s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.152, 'learning_rate': 9.060000000000001e-06, 'epoch': 0.77} 77%|████████████████████████████████████████████████████████████▌ | 455/594 [1:01:13<21:20, 9.21s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▌ | 455/594 [1:01:13<21:20, 9.21s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▌ | 455/594 [1:01:13<21:20, 9.21s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2426, 'learning_rate': 9.080000000000001e-06, 'epoch': 0.77} g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▉ | 458/594 [1:01:44<22:16, 9.83s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▉ | 458/594 [1:01:44<22:16, 9.83s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▉ | 458/594 [1:01:44<22:16, 9.83s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|████████████████████████████████████████████████████████████▉ | 458/594 [1:01:44<22:16, 9.83s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|█████████████████████████████████████████████████████████████ | 459/594 [1:01:54<22:12, 9.87s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|█████████████████████████████████████████████████████████████ | 459/594 [1:01:54<22:12, 9.87s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1891, 'learning_rate': 9.12e-06, 'epoch': 0.77} 77%|█████████████████████████████████████████████████████████████ | 459/594 [1:01:54<22:12, 9.87s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|█████████████████████████████████████████████████████████████ | 459/594 [1:01:54<22:12, 9.87s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|█████████████████████████████████████████████████████████████▏ | 460/594 [1:02:04<22:07, 9.90s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|█████████████████████████████████████████████████████████████▏ | 460/594 [1:02:04<22:07, 9.90s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2, 'learning_rate': 9.14e-06, 'epoch': 0.77} 77%|█████████████████████████████████████████████████████████████▏ | 460/594 [1:02:04<22:07, 9.90s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 77%|█████████████████████████████████████████████████████████████▏ | 460/594 [1:02:04<22:07, 9.90s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▎ | 461/594 [1:02:14<22:02, 9.95s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▎ | 461/594 [1:02:14<22:02, 9.95s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1147, 'learning_rate': 9.16e-06, 'epoch': 0.78} 78%|█████████████████████████████████████████████████████████████▎ | 461/594 [1:02:14<22:02, 9.95s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▎ | 461/594 [1:02:14<22:02, 9.95s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▍ | 462/594 [1:02:23<21:47, 9.91s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▍ | 462/594 [1:02:23<21:47, 9.91s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1357, 'learning_rate': 9.180000000000002e-06, 'epoch': 0.78} 78%|█████████████████████████████████████████████████████████████▍ | 462/594 [1:02:23<21:47, 9.91s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▍ | 462/594 [1:02:23<21:47, 9.91s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▌ | 463/594 [1:02:33<21:37, 9.91s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▌ | 463/594 [1:02:33<21:37, 9.91s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1446, 'learning_rate': 9.200000000000002e-06, 'epoch': 0.78} 78%|█████████████████████████████████████████████████████████████▌ | 463/594 [1:02:33<21:37, 9.91s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▌ | 463/594 [1:02:33<21:37, 9.91s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▌ | 463/594 [1:02:33<21:37, 9.91s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▋ | 464/594 [1:02:43<21:24, 9.88s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▋ | 464/594 [1:02:43<21:24, 9.88s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▋ | 464/594 [1:02:43<21:24, 9.88s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▋ | 464/594 [1:02:43<21:24, 9.88s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▊ | 465/594 [1:02:53<21:03, 9.79s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▊ | 465/594 [1:02:53<21:03, 9.79s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2335, 'learning_rate': 9.240000000000001e-06, 'epoch': 0.78} 78%|█████████████████████████████████████████████████████████████▊ | 465/594 [1:02:53<21:03, 9.79s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▊ | 465/594 [1:02:53<21:03, 9.79s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▉ | 466/594 [1:03:02<20:44, 9.72s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 78%|█████████████████████████████████████████████████████████████▉ | 466/594 [1:03:02<20:44, 9.72s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1698, 'learning_rate': 9.260000000000001e-06, 'epoch': 0.78} 78%|█████████████████████████████████████████████████████████████▉ | 466/594 [1:03:02<20:44, 9.72s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:53:52,614 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:53:52,614 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1785, 'learning_rate': 9.280000000000001e-06, 'epoch': 0.79} [WARNING|modeling_utils.py:388] 2022-02-28 19:53:52,614 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:53:52,614 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:53:52,614 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:53:52,614 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▏ | 468/594 [1:03:21<20:01, 9.53s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▏ | 468/594 [1:03:21<20:01, 9.53s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▏ | 468/594 [1:03:21<20:01, 9.53s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▏ | 468/594 [1:03:21<20:01, 9.53s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1838, 'learning_rate': 9.32e-06, 'epoch': 0.79} g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:54:20,266 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:54:20,266 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1536, 'learning_rate': 9.340000000000002e-06, 'epoch': 0.79} [WARNING|modeling_utils.py:388] 2022-02-28 19:54:20,266 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:54:20,266 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:54:20,266 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▋ | 471/594 [1:03:49<19:07, 9.33s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▋ | 471/594 [1:03:49<19:07, 9.33s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.081, 'learning_rate': 9.360000000000002e-06, 'epoch': 0.79} 79%|██████████████████████████████████████████████████████████████▋ | 471/594 [1:03:49<19:07, 9.33s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▋ | 471/594 [1:03:49<19:07, 9.33s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▋ | 471/594 [1:03:49<19:07, 9.33s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▊ | 472/594 [1:03:58<18:57, 9.32s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▊ | 472/594 [1:03:58<18:57, 9.32s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▊ | 472/594 [1:03:58<18:57, 9.32s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 79%|██████████████████████████████████████████████████████████████▊ | 472/594 [1:03:58<18:57, 9.32s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|██████████████████████████████████████████████████████████████▉ | 473/594 [1:04:07<18:43, 9.29s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|██████████████████████████████████████████████████████████████▉ | 473/594 [1:04:07<18:43, 9.29s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1796, 'learning_rate': 9.4e-06, 'epoch': 0.8} 80%|██████████████████████████████████████████████████████████████▉ | 473/594 [1:04:07<18:43, 9.29s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|██████████████████████████████████████████████████████████████▉ | 473/594 [1:04:07<18:43, 9.29s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████ | 474/594 [1:04:16<18:22, 9.19s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████ | 474/594 [1:04:16<18:22, 9.19s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.0716, 'learning_rate': 9.42e-06, 'epoch': 0.8} 80%|███████████████████████████████████████████████████████████████ | 474/594 [1:04:16<18:22, 9.19s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████ | 474/594 [1:04:16<18:22, 9.19s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▏ | 475/594 [1:04:26<18:28, 9.31s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▏ | 475/594 [1:04:26<18:28, 9.31s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1163, 'learning_rate': 9.440000000000001e-06, 'epoch': 0.8} 80%|███████████████████████████████████████████████████████████████▏ | 475/594 [1:04:26<18:28, 9.31s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▏ | 475/594 [1:04:26<18:28, 9.31s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▏ | 475/594 [1:04:26<18:28, 9.31s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▎ | 476/594 [1:04:35<18:05, 9.20s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▎ | 476/594 [1:04:35<18:05, 9.20s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▎ | 476/594 [1:04:35<18:05, 9.20s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▎ | 476/594 [1:04:35<18:05, 9.20s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▎ | 476/594 [1:04:35<18:05, 9.20s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 80%|███████████████████████████████████████████████████████████████▎ | 476/594 [1:04:35<18:05, 9.20s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1631, 'learning_rate': 9.48e-06, 'epoch': 0.8} 80%|███████████████████████████████████████████████████████████████▎ | 476/594 [1:04:35<18:05, 9.20s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:55:30,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:55:30,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:55:30,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.1923, 'learning_rate': 9.5e-06, 'epoch': 0.8} [WARNING|modeling_utils.py:388] 2022-02-28 19:55:30,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:55:30,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:55:30,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:55:30,765 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▋ | 479/594 [1:05:01<16:55, 8.83s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▋ | 479/594 [1:05:01<16:55, 8.83s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▋ | 479/594 [1:05:01<16:55, 8.83s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▋ | 479/594 [1:05:01<16:55, 8.83s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▋ | 479/594 [1:05:01<16:55, 8.83s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▊ | 480/594 [1:05:09<16:36, 8.74s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▊ | 480/594 [1:05:09<16:36, 8.74s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▊ | 480/594 [1:05:09<16:36, 8.74s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▊ | 480/594 [1:05:09<16:36, 8.74s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▊ | 480/594 [1:05:09<16:36, 8.74s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▉ | 481/594 [1:05:17<16:10, 8.59s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▉ | 481/594 [1:05:17<16:10, 8.59s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▉ | 481/594 [1:05:17<16:10, 8.59s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▉ | 481/594 [1:05:17<16:10, 8.59s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|███████████████████████████████████████████████████████████████▉ | 481/594 [1:05:17<16:10, 8.59s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████ | 482/594 [1:05:26<15:49, 8.48s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████ | 482/594 [1:05:26<15:49, 8.48s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████ | 482/594 [1:05:26<15:49, 8.48s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████ | 482/594 [1:05:26<15:49, 8.48s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████ | 482/594 [1:05:26<15:49, 8.48s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████▏ | 483/594 [1:05:34<15:36, 8.43s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████▏ | 483/594 [1:05:34<15:36, 8.43s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████▏ | 483/594 [1:05:34<15:36, 8.43s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████▏ | 483/594 [1:05:34<15:36, 8.43s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████▏ | 483/594 [1:05:34<15:36, 8.43s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████▎ | 484/594 [1:05:42<15:15, 8.32s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████▎ | 484/594 [1:05:42<15:15, 8.32s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████▎ | 484/594 [1:05:42<15:15, 8.32s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████▎ | 484/594 [1:05:42<15:15, 8.32s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 81%|████████████████████████████████████████████████████████████████▎ | 484/594 [1:05:42<15:15, 8.32s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▌ | 485/594 [1:05:50<14:54, 8.21s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▌ | 485/594 [1:05:50<14:54, 8.21s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▌ | 485/594 [1:05:50<14:54, 8.21s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▌ | 485/594 [1:05:50<14:54, 8.21s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▌ | 485/594 [1:05:50<14:54, 8.21s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▋ | 486/594 [1:05:58<14:28, 8.04s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▋ | 486/594 [1:05:58<14:28, 8.04s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▋ | 486/594 [1:05:58<14:28, 8.04s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▋ | 486/594 [1:05:58<14:28, 8.04s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▋ | 486/594 [1:05:58<14:28, 8.04s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▊ | 487/594 [1:06:05<14:06, 7.92s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▊ | 487/594 [1:06:05<14:06, 7.92s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|████████████████████████████████████████████████████████████████▊ | 487/594 [1:06:05<14:06, 7.92s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:56:53,593 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:56:53,593 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.2906, 'learning_rate': 9.7e-06, 'epoch': 0.82} [WARNING|modeling_utils.py:388] 2022-02-28 19:56:53,593 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:56:53,593 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:56:53,593 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|█████████████████████████████████████████████████████████████████ | 489/594 [1:06:20<13:17, 7.60s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|█████████████████████████████████████████████████████████████████ | 489/594 [1:06:20<13:17, 7.60s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:57:05,989 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:57:05,989 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|█████████████████████████████████████████████████████████████████▏ | 490/594 [1:06:27<12:49, 7.40s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|█████████████████████████████████████████████████████████████████▏ | 490/594 [1:06:27<12:49, 7.40s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 82%|█████████████████████████████████████████████████████████████████▏ | 490/594 [1:06:27<12:49, 7.40s/it]g-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:57:14,266 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:57:14,266 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3359, 'learning_rate': 9.760000000000001e-06, 'epoch': 0.83} [WARNING|modeling_utils.py:388] 2022-02-28 19:57:14,266 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:57:14,266 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:57:14,266 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:50:40,086 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 83%|█████████████████████████████████████████████████████████████████▍ | 492/594 [1:06:39<11:40, 6.87s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:57:22,058 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 83%|█████████████████████████████████████████████████████████████████▍ | 492/594 [1:06:39<11:40, 6.87s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:57:22,058 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:57:26,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:57:22,058 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:57:26,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:57:22,058 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.036, 'learning_rate': 9.800000000000001e-06, 'epoch': 0.83} [WARNING|modeling_utils.py:388] 2022-02-28 19:57:30,539 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:57:22,058 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:57:30,539 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:57:22,058 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 83%|█████████████████████████████████████████████████████████████████▋ | 494/594 [1:06:51<10:23, 6.24s/it]g-point operations will not be computed-28 19:57:22,058 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:57:34,369 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:57:22,058 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:57:36,760 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:57:22,058 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:57:36,760 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:57:22,058 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed {'loss': 4.3611, 'learning_rate': 9.84e-06, 'epoch': 0.83} [WARNING|modeling_utils.py:388] 2022-02-28 19:57:40,166 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:57:22,058 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:57:40,166 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:57:22,058 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|█████████████████████████████████████████████████████████████████▉ | 496/594 [1:07:00<08:50, 5.42s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:57:42,261 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:57:44,184 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:57:42,261 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:57:44,184 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:57:42,261 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|██████████████████████████████████████████████████████████████████ | 497/594 [1:07:04<08:02, 4.97s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:57:46,100 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|██████████████████████████████████████████████████████████████████▏ | 498/594 [1:07:08<07:12, 4.50s/it]g-point operations will not be computed-28 19:57:46,100 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|██████████████████████████████████████████████████████████████████▏ | 498/594 [1:07:08<07:12, 4.50s/it]g-point operations will not be computed-28 19:57:46,100 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:57:50,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:57:49,343 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [WARNING|modeling_utils.py:388] 2022-02-28 19:57:50,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 19:57:49,343 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed 84%|██████████████████████████████████████████████████████████████████▎ | 499/594 [1:07:10<06:21, 4.02s/it][WARNING|modeling_utils.py:388] 2022-02-28 19:57:52,143 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed [INFO|trainer.py:2366] 2022-02-28 19:57:54,586 >> Num examples = 2642▍ | 500/594 [1:07:14<05:49, 3.71s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2366] 2022-02-28 19:57:54,586 >> Num examples = 2642▍ | 500/594 [1:07:14<05:49, 3.71s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.4108, 'learning_rate': 9.940000000000001e-06, 'epoch': 0.84} [INFO|trainer.py:2366] 2022-02-28 19:57:54,586 >> Num examples = 2642▍ | 500/594 [1:07:14<05:49, 3.71s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2366] 2022-02-28 19:57:54,586 >> Num examples = 2642▍ | 500/594 [1:07:14<05:49, 3.71s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2366] 2022-02-28 19:57:54,586 >> Num examples = 2642▍ | 500/594 [1:07:14<05:49, 3.71s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 2%|█▉ | 5/221 [00:11<09:37, 2.67s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 3%|██▎ | 6/221 [00:15<10:18, 2.88s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 3%|██▋ | 7/221 [00:18<11:15, 3.16s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 4%|███ | 8/221 [00:21<11:01, 3.11s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 4%|███▍ | 9/221 [00:24<10:51, 3.07s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 5%|███▋ | 10/221 [00:28<11:48, 3.36s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 5%|████ | 11/221 [00:33<12:41, 3.63s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 5%|████▍ | 12/221 [00:36<11:57, 3.43s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 6%|████▊ | 13/221 [00:39<11:36, 3.35s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 6%|█████▏ | 14/221 [00:42<11:38, 3.37s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 7%|█████▌ | 15/221 [00:47<13:03, 3.80s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 7%|█████▉ | 16/221 [00:52<13:56, 4.08s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 8%|██████▎ | 17/221 [00:55<13:15, 3.90s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 8%|██████▋ | 18/221 [00:59<13:04, 3.86s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 9%|███████ | 19/221 [01:02<12:26, 3.70s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 9%|███████▍ | 20/221 [01:06<11:51, 3.54s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 10%|███████▊ | 21/221 [01:08<11:05, 3.33s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 10%|████████▏ | 22/221 [01:11<10:50, 3.27s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 10%|████████▌ | 23/221 [01:15<10:37, 3.22s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 11%|████████▉ | 24/221 [01:18<11:08, 3.39s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 11%|█████████▎ | 25/221 [01:22<11:39, 3.57s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 12%|█████████▋ | 26/221 [01:26<11:53, 3.66s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 12%|██████████ | 27/221 [01:29<10:51, 3.36s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 13%|██████████▍ | 28/221 [01:33<11:22, 3.53s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 13%|██████████▊ | 29/221 [01:37<12:05, 3.78s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 14%|███████████▏ | 30/221 [01:40<11:19, 3.56s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 14%|███████████▌ | 31/221 [01:43<10:21, 3.27s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 14%|███████████▊ | 32/221 [01:46<10:16, 3.26s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 15%|████████████▏ | 33/221 [01:50<10:45, 3.43s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 15%|████████████▌ | 34/221 [01:53<10:49, 3.47s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 16%|████████████▉ | 35/221 [01:56<10:19, 3.33s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 16%|█████████████▎ | 36/221 [02:00<10:07, 3.28s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 17%|█████████████▋ | 37/221 [02:04<10:56, 3.57s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 17%|██████████████ | 38/221 [02:07<10:23, 3.41s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 18%|██████████████▍ | 39/221 [02:11<10:36, 3.50s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 18%|██████████████▊ | 40/221 [02:14<10:01, 3.32s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 19%|███████████████▏ | 41/221 [02:17<10:14, 3.41s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 19%|███████████████▌ | 42/221 [02:22<11:10, 3.75s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 19%|███████████████▉ | 43/221 [02:25<10:38, 3.59s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 20%|████████████████▎ | 44/221 [02:30<11:41, 3.96s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 20%|████████████████▋ | 45/221 [02:34<12:12, 4.16s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 21%|█████████████████ | 46/221 [02:38<12:05, 4.15s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 21%|█████████████████▍ | 47/221 [02:42<11:53, 4.10s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 22%|█████████████████▊ | 48/221 [02:47<11:46, 4.09s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 22%|██████████████████▏ | 49/221 [02:50<11:21, 3.96s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 22%|██████████████████▏ | 49/221 [02:50<11:21, 3.96s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 22%|██████████████████▏ | 49/221 [02:50<11:21, 3.96s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 23%|██████████████████▉ | 51/221 [02:57<10:39, 3.76s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 24%|███████████████████▎ | 52/221 [03:00<09:55, 3.53s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 24%|███████████████████▋ | 53/221 [03:03<09:21, 3.34s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 24%|████████████████████ | 54/221 [03:07<09:38, 3.46s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 25%|████████████████████▍ | 55/221 [03:11<09:42, 3.51s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 25%|████████████████████▊ | 56/221 [03:15<10:29, 3.82s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 26%|█████████████████████▏ | 57/221 [03:19<10:31, 3.85s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 26%|█████████████████████▌ | 58/221 [03:23<10:10, 3.74s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 27%|█████████████████████▉ | 59/221 [03:26<09:38, 3.57s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 27%|██████████████████████▎ | 60/221 [03:28<08:49, 3.29s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 28%|██████████████████████▋ | 61/221 [03:32<08:59, 3.37s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 28%|███████████████████████ | 62/221 [03:35<08:40, 3.27s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 29%|███████████████████████▍ | 63/221 [03:39<08:45, 3.33s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 29%|███████████████████████▋ | 64/221 [03:42<08:40, 3.32s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 29%|████████████████████████ | 65/221 [03:45<08:36, 3.31s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 30%|████████████████████████▍ | 66/221 [03:49<08:44, 3.38s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 30%|████████████████████████▊ | 67/221 [03:51<08:11, 3.19s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 31%|█████████████████████████▏ | 68/221 [03:56<08:52, 3.48s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 31%|█████████████████████████▌ | 69/221 [03:59<08:30, 3.36s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 32%|█████████████████████████▉ | 70/221 [04:02<08:24, 3.34s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 32%|██████████████████████████▎ | 71/221 [04:05<08:14, 3.30s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 33%|██████████████████████████▋ | 72/221 [04:08<07:46, 3.13s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 33%|███████████████████████████ | 73/221 [04:12<08:04, 3.28s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 33%|███████████████████████████▍ | 74/221 [04:15<08:00, 3.27s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 34%|███████████████████████████▊ | 75/221 [04:18<07:56, 3.26s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 34%|████████████████████████████▏ | 76/221 [04:21<07:50, 3.24s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 35%|████████████████████████████▌ | 77/221 [04:24<07:44, 3.22s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 35%|████████████████████████████▉ | 78/221 [04:28<07:50, 3.29s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 36%|█████████████████████████████▎ | 79/221 [04:31<07:34, 3.20s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 36%|█████████████████████████████▋ | 80/221 [04:34<07:31, 3.20s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 37%|██████████████████████████████ | 81/221 [04:38<07:54, 3.39s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 37%|██████████████████████████████▍ | 82/221 [04:42<08:26, 3.64s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 38%|██████████████████████████████▊ | 83/221 [04:46<08:51, 3.85s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 38%|███████████████████████████████▏ | 84/221 [04:50<08:53, 3.89s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 38%|███████████████████████████████▌ | 85/221 [04:55<09:13, 4.07s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 39%|███████████████████████████████▉ | 86/221 [04:59<08:55, 3.97s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 39%|████████████████████████████████▎ | 87/221 [05:03<09:12, 4.12s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 40%|████████████████████████████████▋ | 88/221 [05:06<08:38, 3.90s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 40%|█████████████████████████████████ | 89/221 [05:10<08:07, 3.69s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 41%|█████████████████████████████████▍ | 90/221 [05:13<08:04, 3.70s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 41%|█████████████████████████████████▊ | 91/221 [05:18<08:17, 3.82s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 42%|██████████████████████████████████▏ | 92/221 [05:22<08:31, 3.97s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 42%|██████████████████████████████████▌ | 93/221 [05:26<08:37, 4.04s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 43%|██████████████████████████████████▉ | 94/221 [05:30<08:22, 3.96s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 43%|███████████████████████████████████▏ | 95/221 [05:34<08:19, 3.96s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 43%|███████████████████████████████████▌ | 96/221 [05:38<08:08, 3.91s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 44%|███████████████████████████████████▉ | 97/221 [05:42<08:21, 4.04s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 44%|████████████████████████████████████▎ | 98/221 [05:46<08:07, 3.97s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 45%|████████████████████████████████████▋ | 99/221 [05:49<07:22, 3.63s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 45%|████████████████████████████████████▋ | 100/221 [05:52<07:18, 3.62s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 46%|█████████████████████████████████████ | 101/221 [05:55<06:57, 3.48s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 46%|█████████████████████████████████████▍ | 102/221 [05:58<06:37, 3.34s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 47%|█████████████████████████████████████▊ | 103/221 [06:02<06:51, 3.49s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 47%|██████████████████████████████████████ | 104/221 [06:06<07:03, 3.62s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 48%|██████████████████████████████████████▍ | 105/221 [06:10<07:27, 3.85s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 48%|██████████████████████████████████████▊ | 106/221 [06:14<07:27, 3.89s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 48%|███████████████████████████████████████▏ | 107/221 [06:18<06:55, 3.64s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 49%|███████████████████████████████████████▌ | 108/221 [06:22<07:13, 3.84s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 49%|███████████████████████████████████████▉ | 109/221 [06:26<07:15, 3.89s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 50%|████████████████████████████████████████▎ | 110/221 [06:29<06:54, 3.73s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 50%|████████████████████████████████████████▋ | 111/221 [06:33<06:37, 3.61s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 51%|█████████████████████████████████████████ | 112/221 [06:36<06:39, 3.66s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 51%|█████████████████████████████████████████▍ | 113/221 [06:40<06:34, 3.65s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 52%|█████████████████████████████████████████▊ | 114/221 [06:43<06:19, 3.55s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 52%|██████████████████████████████████████████▏ | 115/221 [06:47<06:14, 3.53s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 52%|██████████████████████████████████████████▌ | 116/221 [06:50<05:59, 3.42s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 53%|██████████████████████████████████████████▉ | 117/221 [06:53<05:52, 3.39s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 53%|███████████████████████████████████████████▏ | 118/221 [06:57<06:00, 3.50s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 54%|███████████████████████████████████████████▌ | 119/221 [07:01<06:20, 3.73s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 54%|███████████████████████████████████████████▉ | 120/221 [07:06<06:35, 3.92s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 55%|████████████████████████████████████████████▎ | 121/221 [07:09<06:20, 3.81s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 55%|████████████████████████████████████████████▋ | 122/221 [07:12<05:37, 3.41s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 56%|█████████████████████████████████████████████ | 123/221 [07:14<05:05, 3.12s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 56%|█████████████████████████████████████████████▍ | 124/221 [07:17<05:00, 3.10s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 57%|█████████████████████████████████████████████▊ | 125/221 [07:21<05:18, 3.32s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 57%|██████████████████████████████████████████████▏ | 126/221 [07:24<05:01, 3.17s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 57%|██████████████████████████████████████████████▌ | 127/221 [07:27<04:49, 3.08s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 58%|██████████████████████████████████████████████▉ | 128/221 [07:29<04:33, 2.94s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 58%|███████████████████████████████████████████████▎ | 129/221 [07:33<04:50, 3.16s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 59%|███████████████████████████████████████████████▋ | 130/221 [07:36<04:34, 3.02s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 59%|████████████████████████████████████████████████ | 131/221 [07:39<04:47, 3.20s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 60%|████████████████████████████████████████████████▍ | 132/221 [07:42<04:28, 3.01s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 60%|████████████████████████████████████████████████▋ | 133/221 [07:45<04:28, 3.05s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 61%|█████████████████████████████████████████████████ | 134/221 [07:48<04:16, 2.95s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 61%|█████████████████████████████████████████████████▍ | 135/221 [07:51<04:20, 3.03s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 62%|█████████████████████████████████████████████████▊ | 136/221 [07:55<04:37, 3.26s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 62%|██████████████████████████████████████████████████▏ | 137/221 [07:58<04:40, 3.34s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 62%|██████████████████████████████████████████████████▌ | 138/221 [08:02<04:50, 3.50s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 63%|██████████████████████████████████████████████████▉ | 139/221 [08:06<04:52, 3.57s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 63%|███████████████████████████████████████████████████▎ | 140/221 [08:08<04:22, 3.24s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 64%|███████████████████████████████████████████████████▋ | 141/221 [08:11<04:18, 3.23s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 64%|████████████████████████████████████████████████████ | 142/221 [08:14<04:08, 3.14s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 65%|████████████████████████████████████████████████████▍ | 143/221 [08:17<03:47, 2.92s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 65%|████████████████████████████████████████████████████▊ | 144/221 [08:21<04:05, 3.19s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 66%|█████████████████████████████████████████████████████▏ | 145/221 [08:24<04:01, 3.18s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 66%|█████████████████████████████████████████████████████▌ | 146/221 [08:28<04:10, 3.34s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 67%|█████████████████████████████████████████████████████▉ | 147/221 [08:30<03:54, 3.17s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 67%|██████████████████████████████████████████████████████▏ | 148/221 [08:34<03:53, 3.19s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 67%|██████████████████████████████████████████████████████▌ | 149/221 [08:37<03:53, 3.25s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 68%|██████████████████████████████████████████████████████▉ | 150/221 [08:40<03:49, 3.23s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 68%|███████████████████████████████████████████████████████▎ | 151/221 [08:44<03:57, 3.39s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 69%|███████████████████████████████████████████████████████▋ | 152/221 [08:47<03:49, 3.33s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 69%|████████████████████████████████████████████████████████ | 153/221 [08:50<03:43, 3.28s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 70%|████████████████████████████████████████████████████████▍ | 154/221 [08:54<03:44, 3.35s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 70%|████████████████████████████████████████████████████████▊ | 155/221 [08:57<03:42, 3.37s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 71%|█████████████████████████████████████████████████████████▏ | 156/221 [09:01<03:40, 3.40s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 71%|█████████████████████████████████████████████████████████▌ | 157/221 [09:03<03:24, 3.19s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 71%|█████████████████████████████████████████████████████████▉ | 158/221 [09:08<03:47, 3.62s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 72%|██████████████████████████████████████████████████████████▎ | 159/221 [09:12<03:45, 3.64s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 72%|██████████████████████████████████████████████████████████▋ | 160/221 [09:16<03:50, 3.77s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 73%|███████████████████████████████████████████████████████████ | 161/221 [09:20<03:53, 3.89s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 73%|███████████████████████████████████████████████████████████▍ | 162/221 [09:24<03:50, 3.90s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 74%|███████████████████████████████████████████████████████████▋ | 163/221 [09:28<03:54, 4.05s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 74%|████████████████████████████████████████████████████████████ | 164/221 [09:33<04:00, 4.22s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 75%|████████████████████████████████████████████████████████████▍ | 165/221 [09:36<03:45, 4.03s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 75%|████████████████████████████████████████████████████████████▊ | 166/221 [09:40<03:26, 3.75s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 76%|█████████████████████████████████████████████████████████████▏ | 167/221 [09:43<03:18, 3.67s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 76%|█████████████████████████████████████████████████████████████▌ | 168/221 [09:46<03:04, 3.49s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 76%|█████████████████████████████████████████████████████████████▉ | 169/221 [09:50<03:05, 3.57s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 77%|██████████████████████████████████████████████████████████████▎ | 170/221 [09:54<03:06, 3.65s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 77%|██████████████████████████████████████████████████████████████▋ | 171/221 [09:57<03:03, 3.68s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 78%|███████████████████████████████████████████████████████████████ | 172/221 [10:01<02:54, 3.57s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 78%|███████████████████████████████████████████████████████████████▍ | 173/221 [10:04<02:49, 3.54s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 79%|███████████████████████████████████████████████████████████████▊ | 174/221 [10:07<02:40, 3.41s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 79%|████████████████████████████████████████████████████████████████▏ | 175/221 [10:10<02:32, 3.31s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 80%|████████████████████████████████████████████████████████████████▌ | 176/221 [10:14<02:36, 3.47s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 80%|████████████████████████████████████████████████████████████████▊ | 177/221 [10:17<02:25, 3.31s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 81%|█████████████████████████████████████████████████████████████████▏ | 178/221 [10:21<02:28, 3.45s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 81%|█████████████████████████████████████████████████████████████████▌ | 179/221 [10:24<02:22, 3.39s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 81%|█████████████████████████████████████████████████████████████████▉ | 180/221 [10:28<02:28, 3.62s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 82%|██████████████████████████████████████████████████████████████████▎ | 181/221 [10:32<02:28, 3.71s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 82%|██████████████████████████████████████████████████████████████████▋ | 182/221 [10:36<02:24, 3.70s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 83%|███████████████████████████████████████████████████████████████████ | 183/221 [10:40<02:27, 3.89s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 83%|███████████████████████████████████████████████████████████████████▍ | 184/221 [10:44<02:22, 3.84s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|███████████████████████████████████████████████████████████████████▊ | 185/221 [10:47<02:11, 3.65s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|████████████████████████████████████████████████████████████████████▏ | 186/221 [10:52<02:16, 3.89s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████████▌ | 187/221 [10:55<02:07, 3.76s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████████▉ | 188/221 [10:59<02:07, 3.87s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|█████████████████████████████████████████████████████████████████████▎ | 189/221 [11:03<02:05, 3.93s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|█████████████████████████████████████████████████████████████████████▋ | 190/221 [11:08<02:06, 4.09s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|██████████████████████████████████████████████████████████████████████ | 191/221 [11:12<02:07, 4.25s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|██████████████████████████████████████████████████████████████████████▎ | 192/221 [11:17<02:03, 4.26s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|██████████████████████████████████████████████████████████████████████▋ | 193/221 [11:20<01:50, 3.94s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|███████████████████████████████████████████████████████████████████████ | 194/221 [11:23<01:39, 3.68s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|███████████████████████████████████████████████████████████████████████▍ | 195/221 [11:26<01:30, 3.49s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|███████████████████████████████████████████████████████████████████████▊ | 196/221 [11:29<01:25, 3.43s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|████████████████████████████████████████████████████████████████████████▏ | 197/221 [11:32<01:17, 3.23s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|████████████████████████████████████████████████████████████████████████▌ | 198/221 [11:36<01:20, 3.51s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|████████████████████████████████████████████████████████████████████████▉ | 199/221 [11:41<01:23, 3.79s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|█████████████████████████████████████████████████████████████████████████▎ | 200/221 [11:44<01:17, 3.68s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|█████████████████████████████████████████████████████████████████████████▋ | 201/221 [11:47<01:11, 3.56s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|██████████████████████████████████████████████████████████████████████████ | 202/221 [11:50<01:03, 3.33s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 92%|██████████████████████████████████████████████████████████████████████████▍ | 203/221 [11:54<01:01, 3.41s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 92%|██████████████████████████████████████████████████████████████████████████▊ | 204/221 [11:58<01:02, 3.70s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|███████████████████████████████████████████████████████████████████████████▏ | 205/221 [12:03<01:04, 4.01s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|███████████████████████████████████████████████████████████████████████████▌ | 206/221 [12:07<01:02, 4.19s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|███████████████████████████████████████████████████████████████████████████▊ | 207/221 [12:11<00:55, 3.95s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|████████████████████████████████████████████████████████████████████████████▏ | 208/221 [12:15<00:50, 3.89s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|████████████████████████████████████████████████████████████████████████████▌ | 209/221 [12:18<00:44, 3.70s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|████████████████████████████████████████████████████████████████████████████▉ | 210/221 [12:22<00:41, 3.80s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|█████████████████████████████████████████████████████████████████████████████▎ | 211/221 [12:26<00:39, 3.97s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|█████████████████████████████████████████████████████████████████████████████▋ | 212/221 [12:30<00:35, 3.89s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|██████████████████████████████████████████████████████████████████████████████ | 213/221 [12:33<00:29, 3.63s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|██████████████████████████████████████████████████████████████████████████████▍ | 214/221 [12:37<00:25, 3.61s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|██████████████████████████████████████████████████████████████████████████████▊ | 215/221 [12:41<00:22, 3.77s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|███████████████████████████████████████████████████████████████████████████████▏ | 216/221 [12:45<00:19, 3.88s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|███████████████████████████████████████████████████████████████████████████████▌ | 217/221 [12:49<00:15, 3.89s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|███████████████████████████████████████████████████████████████████████████████▉ | 218/221 [12:53<00:11, 3.88s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|████████████████████████████████████████████████████████████████████████████████▎| 219/221 [12:56<00:07, 3.88s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 100%|████████████████████████████████████████████████████████████████████████████████▋| 220/221 [13:01<00:04, 4.08s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 100%|█████████████████████████████████████████████████████████████████████████████████| 221/221 [13:03<00:00, 3.39s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 100%|█████████████████████████████████████████████████████████████████████████████████| 221/221 [13:03<00:00, 3.39s/it][INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 02/28/2022 20:11:01 - INFO - datasets.metric - Removing /home/sanchit_huggingface_co/.cache/huggingface/metrics/wer/default/default_experiment-1-0.arrow [INFO|configuration_utils.py:438] 2022-02-28 20:11:01,435 >> Configuration saved in ./checkpoint-500/config.json [INFO|trainer.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|feature_extraction_utils.py:324] 2022-02-28 20:11:06,749 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|feature_extraction_utils.py:324] 2022-02-28 20:11:06,749 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|feature_extraction_utils.py:324] 2022-02-28 20:11:06,749 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|feature_extraction_utils.py:324] 2022-02-28 20:11:06,749 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 02/28/2022 20:12:38 - WARNING - huggingface_hub.repository - Adding files tracked by Git LFS: ['wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb']. This may take a bit of time if the files are large. [INFO|feature_extraction_utils.py:324] 2022-02-28 20:11:06,749 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|feature_extraction_utils.py:324] 2022-02-28 20:11:06,749 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|feature_extraction_utils.py:324] 2022-02-28 20:11:06,749 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|feature_extraction_utils.py:324] 2022-02-28 20:11:06,749 >> Configuration saved in ./checkpoint-500/preprocessor_config.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|████████████████████████████████████████████████████████████████ | 501/594 [1:22:38<7:13:46, 279.86s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|████████████████████████████████████████████████████████████████ | 501/594 [1:22:38<7:13:46, 279.86s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|████████████████████████████████████████████████████████████████ | 501/594 [1:22:38<7:13:46, 279.86s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|████████████████████████████████████████████████████████████████ | 501/594 [1:22:38<7:13:46, 279.86s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 84%|████████████████████████████████████████████████████████████████ | 501/594 [1:22:38<7:13:46, 279.86s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████▏ | 502/594 [1:22:48<5:05:19, 199.13s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████▏ | 502/594 [1:22:48<5:05:19, 199.13s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████▏ | 502/594 [1:22:48<5:05:19, 199.13s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████▏ | 502/594 [1:22:48<5:05:19, 199.13s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████▎ | 503/594 [1:22:59<3:36:16, 142.60s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████▎ | 503/594 [1:22:59<3:36:16, 142.60s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.1053, 'learning_rate': 1e-05, 'epoch': 0.85} 85%|████████████████████████████████████████████████████████████████▎ | 503/594 [1:22:59<3:36:16, 142.60s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████▎ | 503/594 [1:22:59<3:36:16, 142.60s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████▎ | 503/594 [1:22:59<3:36:16, 142.60s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████▍ | 504/594 [1:23:10<2:34:25, 102.95s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████▍ | 504/594 [1:23:10<2:34:25, 102.95s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████▍ | 504/594 [1:23:10<2:34:25, 102.95s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|████████████████████████████████████████████████████████████████▍ | 504/594 [1:23:10<2:34:25, 102.95s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▍ | 505/594 [1:23:20<1:51:27, 75.14s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▍ | 505/594 [1:23:20<1:51:27, 75.14s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.1617, 'learning_rate': 9.787234042553192e-06, 'epoch': 0.85} 85%|█████████████████████████████████████████████████████████████████▍ | 505/594 [1:23:20<1:51:27, 75.14s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▍ | 505/594 [1:23:20<1:51:27, 75.14s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▍ | 505/594 [1:23:20<1:51:27, 75.14s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▌ | 506/594 [1:23:30<1:21:44, 55.73s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▌ | 506/594 [1:23:30<1:21:44, 55.73s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▌ | 506/594 [1:23:30<1:21:44, 55.73s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▌ | 506/594 [1:23:30<1:21:44, 55.73s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▌ | 506/594 [1:23:30<1:21:44, 55.73s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▌ | 506/594 [1:23:30<1:21:44, 55.73s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.1643, 'learning_rate': 9.574468085106385e-06, 'epoch': 0.85} 85%|█████████████████████████████████████████████████████████████████▌ | 506/594 [1:23:30<1:21:44, 55.73s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▌ | 506/594 [1:23:30<1:21:44, 55.73s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▌ | 506/594 [1:23:30<1:21:44, 55.73s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▌ | 506/594 [1:23:30<1:21:44, 55.73s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▌ | 506/594 [1:23:30<1:21:44, 55.73s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.158, 'learning_rate': 9.46808510638298e-06, 'epoch': 0.85} 85%|█████████████████████████████████████████████████████████████████▌ | 506/594 [1:23:30<1:21:44, 55.73s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 85%|█████████████████████████████████████████████████████████████████▌ | 506/594 [1:23:30<1:21:44, 55.73s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▋ | 509/594 [1:24:01<36:30, 25.77s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▋ | 509/594 [1:24:01<36:30, 25.77s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.0941, 'learning_rate': 9.361702127659576e-06, 'epoch': 0.86} 86%|███████████████████████████████████████████████████████████████████▋ | 509/594 [1:24:01<36:30, 25.77s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▋ | 509/594 [1:24:01<36:30, 25.77s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▊ | 510/594 [1:24:11<29:26, 21.03s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▊ | 510/594 [1:24:11<29:26, 21.03s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.17, 'learning_rate': 9.255319148936171e-06, 'epoch': 0.86} 86%|███████████████████████████████████████████████████████████████████▊ | 510/594 [1:24:11<29:26, 21.03s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▊ | 510/594 [1:24:11<29:26, 21.03s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▊ | 510/594 [1:24:11<29:26, 21.03s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▉ | 511/594 [1:24:21<24:29, 17.70s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▉ | 511/594 [1:24:21<24:29, 17.70s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▉ | 511/594 [1:24:21<24:29, 17.70s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▉ | 511/594 [1:24:21<24:29, 17.70s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▉ | 511/594 [1:24:21<24:29, 17.70s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▉ | 511/594 [1:24:21<24:29, 17.70s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.1385, 'learning_rate': 9.042553191489362e-06, 'epoch': 0.86} 86%|███████████████████████████████████████████████████████████████████▉ | 511/594 [1:24:21<24:29, 17.70s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▉ | 511/594 [1:24:21<24:29, 17.70s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|███████████████████████████████████████████████████████████████████▉ | 511/594 [1:24:21<24:29, 17.70s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|████████████████████████████████████████████████████████████████████▏ | 513/594 [1:24:40<18:26, 13.66s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|████████████████████████████████████████████████████████████████████▏ | 513/594 [1:24:40<18:26, 13.66s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.1332, 'learning_rate': 8.936170212765958e-06, 'epoch': 0.86} 86%|████████████████████████████████████████████████████████████████████▏ | 513/594 [1:24:40<18:26, 13.66s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|████████████████████████████████████████████████████████████████████▏ | 513/594 [1:24:40<18:26, 13.66s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 86%|████████████████████████████████████████████████████████████████████▏ | 513/594 [1:24:40<18:26, 13.66s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▎ | 514/594 [1:24:50<16:42, 12.53s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▎ | 514/594 [1:24:50<16:42, 12.53s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▎ | 514/594 [1:24:50<16:42, 12.53s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▎ | 514/594 [1:24:50<16:42, 12.53s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▎ | 514/594 [1:24:50<16:42, 12.53s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▍ | 515/594 [1:25:00<15:22, 11.68s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▍ | 515/594 [1:25:00<15:22, 11.68s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▍ | 515/594 [1:25:00<15:22, 11.68s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▍ | 515/594 [1:25:00<15:22, 11.68s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▋ | 516/594 [1:25:09<14:19, 11.02s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▋ | 516/594 [1:25:09<14:19, 11.02s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.1385, 'learning_rate': 8.617021276595746e-06, 'epoch': 0.87} 87%|████████████████████████████████████████████████████████████████████▋ | 516/594 [1:25:09<14:19, 11.02s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▋ | 516/594 [1:25:09<14:19, 11.02s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▊ | 517/594 [1:25:19<13:30, 10.52s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▊ | 517/594 [1:25:19<13:30, 10.52s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.1791, 'learning_rate': 8.510638297872341e-06, 'epoch': 0.87} 87%|████████████████████████████████████████████████████████████████████▊ | 517/594 [1:25:19<13:30, 10.52s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|████████████████████████████████████████████████████████████████████▊ | 517/594 [1:25:19<13:30, 10.52s/it]onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. onfig.jsonner.py:560] 2022-02-28 19:57:54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.2385, 'learning_rate': 8.404255319148937e-06, 'epoch': 0.87} [WARNING|modeling_utils.py:388] 2022-02-28 20:16:13,515 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:16:13,515 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|█████████████████████████████████████████████████████████████████████ | 519/594 [1:25:37<12:22, 9.89s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|█████████████████████████████████████████████████████████████████████ | 519/594 [1:25:37<12:22, 9.89s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.0488, 'learning_rate': 8.297872340425532e-06, 'epoch': 0.87} 87%|█████████████████████████████████████████████████████████████████████ | 519/594 [1:25:37<12:22, 9.89s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 87%|█████████████████████████████████████████████████████████████████████ | 519/594 [1:25:37<12:22, 9.89s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▏ | 520/594 [1:25:46<11:56, 9.68s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▏ | 520/594 [1:25:46<11:56, 9.68s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.1701, 'learning_rate': 8.191489361702128e-06, 'epoch': 0.87} 88%|█████████████████████████████████████████████████████████████████████▏ | 520/594 [1:25:46<11:56, 9.68s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▏ | 520/594 [1:25:46<11:56, 9.68s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▏ | 520/594 [1:25:46<11:56, 9.68s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▏ | 520/594 [1:25:46<11:56, 9.68s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.1984, 'learning_rate': 8.085106382978723e-06, 'epoch': 0.88} 88%|█████████████████████████████████████████████████████████████████████▏ | 520/594 [1:25:46<11:56, 9.68s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▏ | 520/594 [1:25:46<11:56, 9.68s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▏ | 520/594 [1:25:46<11:56, 9.68s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▏ | 520/594 [1:25:46<11:56, 9.68s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▍ | 522/594 [1:26:05<11:15, 9.39s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▍ | 522/594 [1:26:05<11:15, 9.39s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▍ | 522/594 [1:26:05<11:15, 9.39s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▍ | 522/594 [1:26:05<11:15, 9.39s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▌ | 523/594 [1:26:14<11:00, 9.31s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▌ | 523/594 [1:26:14<11:00, 9.31s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.0696, 'learning_rate': 7.872340425531916e-06, 'epoch': 0.88} 88%|█████████████████████████████████████████████████████████████████████▌ | 523/594 [1:26:14<11:00, 9.31s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▌ | 523/594 [1:26:14<11:00, 9.31s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▌ | 523/594 [1:26:14<11:00, 9.31s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▋ | 524/594 [1:26:23<10:44, 9.21s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:17:08,082 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:17:08,082 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▊ | 525/594 [1:26:32<10:37, 9.24s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▊ | 525/594 [1:26:32<10:37, 9.24s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.0892, 'learning_rate': 7.659574468085107e-06, 'epoch': 0.88} 88%|█████████████████████████████████████████████████████████████████████▊ | 525/594 [1:26:32<10:37, 9.24s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 88%|█████████████████████████████████████████████████████████████████████▊ | 525/594 [1:26:32<10:37, 9.24s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.0812, 'learning_rate': 7.553191489361703e-06, 'epoch': 0.88} g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████ | 527/594 [1:26:49<10:02, 8.99s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████ | 527/594 [1:26:49<10:02, 8.99s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:17:34,791 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:17:34,791 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:17:34,791 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████▏ | 528/594 [1:26:58<09:46, 8.88s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████▏ | 528/594 [1:26:58<09:46, 8.88s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:17:45,409 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████▎ | 529/594 [1:27:06<09:27, 8.73s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████▎ | 529/594 [1:27:06<09:27, 8.73s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.056, 'learning_rate': 7.234042553191491e-06, 'epoch': 0.89} 89%|██████████████████████████████████████████████████████████████████████▎ | 529/594 [1:27:06<09:27, 8.73s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████▎ | 529/594 [1:27:06<09:27, 8.73s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████▍ | 530/594 [1:27:15<09:13, 8.66s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████▍ | 530/594 [1:27:15<09:13, 8.66s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.0824, 'learning_rate': 7.127659574468085e-06, 'epoch': 0.89} 89%|██████████████████████████████████████████████████████████████████████▍ | 530/594 [1:27:15<09:13, 8.66s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████▍ | 530/594 [1:27:15<09:13, 8.66s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████▌ | 531/594 [1:27:23<08:59, 8.57s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████▌ | 531/594 [1:27:23<08:59, 8.57s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.0396, 'learning_rate': 7.021276595744682e-06, 'epoch': 0.89} 89%|██████████████████████████████████████████████████████████████████████▌ | 531/594 [1:27:23<08:59, 8.57s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 89%|██████████████████████████████████████████████████████████████████████▌ | 531/594 [1:27:23<08:59, 8.57s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|██████████████████████████████████████████████████████████████████████▊ | 532/594 [1:27:32<08:48, 8.52s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|██████████████████████████████████████████████████████████████████████▊ | 532/594 [1:27:32<08:48, 8.52s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.2931, 'learning_rate': 6.914893617021278e-06, 'epoch': 0.89} 90%|██████████████████████████████████████████████████████████████████████▊ | 532/594 [1:27:32<08:48, 8.52s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|██████████████████████████████████████████████████████████████████████▊ | 532/594 [1:27:32<08:48, 8.52s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|██████████████████████████████████████████████████████████████████████▉ | 533/594 [1:27:40<08:32, 8.40s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|██████████████████████████████████████████████████████████████████████▉ | 533/594 [1:27:40<08:32, 8.40s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.1061, 'learning_rate': 6.808510638297873e-06, 'epoch': 0.9} 90%|██████████████████████████████████████████████████████████████████████▉ | 533/594 [1:27:40<08:32, 8.40s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|██████████████████████████████████████████████████████████████████████▉ | 533/594 [1:27:40<08:32, 8.40s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████ | 534/594 [1:27:48<08:18, 8.30s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████ | 534/594 [1:27:48<08:18, 8.30s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.124, 'learning_rate': 6.702127659574469e-06, 'epoch': 0.9} 90%|███████████████████████████████████████████████████████████████████████ | 534/594 [1:27:48<08:18, 8.30s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████ | 534/594 [1:27:48<08:18, 8.30s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████▏ | 535/594 [1:27:56<08:03, 8.19s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████▏ | 535/594 [1:27:56<08:03, 8.19s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.149, 'learning_rate': 6.595744680851064e-06, 'epoch': 0.9} 90%|███████████████████████████████████████████████████████████████████████▏ | 535/594 [1:27:56<08:03, 8.19s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████▏ | 535/594 [1:27:56<08:03, 8.19s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████▎ | 536/594 [1:28:04<07:46, 8.05s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████▎ | 536/594 [1:28:04<07:46, 8.05s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.0031, 'learning_rate': 6.48936170212766e-06, 'epoch': 0.9} 90%|███████████████████████████████████████████████████████████████████████▎ | 536/594 [1:28:04<07:46, 8.05s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████▎ | 536/594 [1:28:04<07:46, 8.05s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████▍ | 537/594 [1:28:11<07:30, 7.91s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 90%|███████████████████████████████████████████████████████████████████████▍ | 537/594 [1:28:11<07:30, 7.91s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.2344, 'learning_rate': 6.382978723404256e-06, 'epoch': 0.9} [WARNING|modeling_utils.py:388] 2022-02-28 20:18:57,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:18:57,735 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|███████████████████████████████████████████████████████████████████████▌ | 538/594 [1:28:19<07:14, 7.77s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|███████████████████████████████████████████████████████████████████████▌ | 538/594 [1:28:19<07:14, 7.77s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|███████████████████████████████████████████████████████████████████████▌ | 538/594 [1:28:19<07:14, 7.77s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|███████████████████████████████████████████████████████████████████████▌ | 538/594 [1:28:19<07:14, 7.77s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|███████████████████████████████████████████████████████████████████████▌ | 538/594 [1:28:19<07:14, 7.77s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|███████████████████████████████████████████████████████████████████████▋ | 539/594 [1:28:26<06:56, 7.58s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:10,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:10,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:10,214 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|███████████████████████████████████████████████████████████████████████▊ | 540/594 [1:28:33<06:38, 7.38s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|███████████████████████████████████████████████████████████████████████▊ | 540/594 [1:28:33<06:38, 7.38s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:18,503 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|███████████████████████████████████████████████████████████████████████▉ | 541/594 [1:28:39<06:16, 7.10s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|███████████████████████████████████████████████████████████████████████▉ | 541/594 [1:28:39<06:16, 7.10s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.151, 'learning_rate': 5.957446808510638e-06, 'epoch': 0.91} 91%|███████████████████████████████████████████████████████████████████████▉ | 541/594 [1:28:39<06:16, 7.10s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:26,237 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:26,237 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.1377, 'learning_rate': 5.851063829787235e-06, 'epoch': 0.91} [WARNING|modeling_utils.py:388] 2022-02-28 20:19:30,659 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:30,659 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|████████████████████████████████████████████████████████████████████████▏ | 543/594 [1:28:51<05:32, 6.52s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 91%|████████████████████████████████████████████████████████████████████████▏ | 543/594 [1:28:51<05:32, 6.52s/it]g-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:35,995 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:35,995 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed54,583 >> The following columns in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 92%|████████████████████████████████████████████████████████████████████████▎ | 544/594 [1:28:56<05:05, 6.12s/it][WARNING|modeling_utils.py:388] 2022-02-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 92%|████████████████████████████████████████████████████████████████████████▎ | 544/594 [1:28:56<05:05, 6.12s/it][WARNING|modeling_utils.py:388] 2022-02-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 92%|████████████████████████████████████████████████████████████████████████▎ | 544/594 [1:28:56<05:05, 6.12s/it][WARNING|modeling_utils.py:388] 2022-02-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:41,993 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:44,191 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:44,191 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:46,165 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:48,108 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:48,108 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:49,852 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:51,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:51,617 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:54,701 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:54,701 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:56,030 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:57,323 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:57,323 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:58,948 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:19:58,948 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:20:04,521 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:20:04,521 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:20:04,521 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▎ | 551/594 [1:29:29<04:07, 5.75s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▎ | 551/594 [1:29:29<04:07, 5.75s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▎ | 551/594 [1:29:29<04:07, 5.75s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▎ | 551/594 [1:29:29<04:07, 5.75s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▎ | 551/594 [1:29:29<04:07, 5.75s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▍ | 552/594 [1:29:39<04:56, 7.07s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▍ | 552/594 [1:29:39<04:56, 7.07s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▍ | 552/594 [1:29:39<04:56, 7.07s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▍ | 552/594 [1:29:39<04:56, 7.07s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▍ | 552/594 [1:29:39<04:56, 7.07s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▌ | 553/594 [1:29:49<05:26, 7.96s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▌ | 553/594 [1:29:49<05:26, 7.96s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▌ | 553/594 [1:29:49<05:26, 7.96s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▌ | 553/594 [1:29:49<05:26, 7.96s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▌ | 553/594 [1:29:49<05:26, 7.96s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▋ | 554/594 [1:29:59<05:43, 8.58s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▋ | 554/594 [1:29:59<05:43, 8.58s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▋ | 554/594 [1:29:59<05:43, 8.58s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▋ | 554/594 [1:29:59<05:43, 8.58s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▊ | 555/594 [1:30:09<05:49, 8.97s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▊ | 555/594 [1:30:09<05:49, 8.97s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.0749, 'learning_rate': 4.468085106382979e-06, 'epoch': 0.93} 93%|█████████████████████████████████████████████████████████████████████████▊ | 555/594 [1:30:09<05:49, 8.97s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 93%|█████████████████████████████████████████████████████████████████████████▊ | 555/594 [1:30:09<05:49, 8.97s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|█████████████████████████████████████████████████████████████████████████▉ | 556/594 [1:30:19<05:50, 9.23s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|█████████████████████████████████████████████████████████████████████████▉ | 556/594 [1:30:19<05:50, 9.23s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.106, 'learning_rate': 4.361702127659575e-06, 'epoch': 0.93} 94%|█████████████████████████████████████████████████████████████████████████▉ | 556/594 [1:30:19<05:50, 9.23s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|█████████████████████████████████████████████████████████████████████████▉ | 556/594 [1:30:19<05:50, 9.23s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████ | 557/594 [1:30:28<05:47, 9.40s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████ | 557/594 [1:30:28<05:47, 9.40s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.1974, 'learning_rate': 4.255319148936171e-06, 'epoch': 0.94} 94%|██████████████████████████████████████████████████████████████████████████ | 557/594 [1:30:28<05:47, 9.40s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████ | 557/594 [1:30:28<05:47, 9.40s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████ | 557/594 [1:30:28<05:47, 9.40s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▏ | 558/594 [1:30:38<05:42, 9.52s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▏ | 558/594 [1:30:38<05:42, 9.52s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▏ | 558/594 [1:30:38<05:42, 9.52s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▏ | 558/594 [1:30:38<05:42, 9.52s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▏ | 558/594 [1:30:38<05:42, 9.52s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▏ | 558/594 [1:30:38<05:42, 9.52s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.0609, 'learning_rate': 4.042553191489362e-06, 'epoch': 0.94} 94%|██████████████████████████████████████████████████████████████████████████▏ | 558/594 [1:30:38<05:42, 9.52s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▏ | 558/594 [1:30:38<05:42, 9.52s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▏ | 558/594 [1:30:38<05:42, 9.52s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▍ | 560/594 [1:30:57<05:24, 9.55s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▍ | 560/594 [1:30:57<05:24, 9.55s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.1074, 'learning_rate': 3.936170212765958e-06, 'epoch': 0.94} 94%|██████████████████████████████████████████████████████████████████████████▍ | 560/594 [1:30:57<05:24, 9.55s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▍ | 560/594 [1:30:57<05:24, 9.55s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▌ | 561/594 [1:31:07<05:15, 9.56s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▌ | 561/594 [1:31:07<05:15, 9.56s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.4338, 'learning_rate': 3.8297872340425535e-06, 'epoch': 0.94} 94%|██████████████████████████████████████████████████████████████████████████▌ | 561/594 [1:31:07<05:15, 9.56s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 94%|██████████████████████████████████████████████████████████████████████████▌ | 561/594 [1:31:07<05:15, 9.56s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|██████████████████████████████████████████████████████████████████████████▋ | 562/594 [1:31:17<05:06, 9.56s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|██████████████████████████████████████████████████████████████████████████▋ | 562/594 [1:31:17<05:06, 9.56s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.041, 'learning_rate': 3.723404255319149e-06, 'epoch': 0.94} 95%|██████████████████████████████████████████████████████████████████████████▋ | 562/594 [1:31:17<05:06, 9.56s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|██████████████████████████████████████████████████████████████████████████▋ | 562/594 [1:31:17<05:06, 9.56s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|██████████████████████████████████████████████████████████████████████████▉ | 563/594 [1:31:26<04:53, 9.47s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|██████████████████████████████████████████████████████████████████████████▉ | 563/594 [1:31:26<04:53, 9.47s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.0901, 'learning_rate': 3.6170212765957453e-06, 'epoch': 0.95} 95%|██████████████████████████████████████████████████████████████████████████▉ | 563/594 [1:31:26<04:53, 9.47s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|██████████████████████████████████████████████████████████████████████████▉ | 563/594 [1:31:26<04:53, 9.47s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|██████████████████████████████████████████████████████████████████████████▉ | 563/594 [1:31:26<04:53, 9.47s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████ | 564/594 [1:31:35<04:41, 9.39s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████ | 564/594 [1:31:35<04:41, 9.39s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████ | 564/594 [1:31:35<04:41, 9.39s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████ | 564/594 [1:31:35<04:41, 9.39s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████ | 564/594 [1:31:35<04:41, 9.39s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▏ | 565/594 [1:31:44<04:32, 9.39s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▏ | 565/594 [1:31:44<04:32, 9.39s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▏ | 565/594 [1:31:44<04:32, 9.39s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▏ | 565/594 [1:31:44<04:32, 9.39s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▎ | 566/594 [1:31:54<04:20, 9.30s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▎ | 566/594 [1:31:54<04:20, 9.30s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.0428, 'learning_rate': 3.297872340425532e-06, 'epoch': 0.95} 95%|███████████████████████████████████████████████████████████████████████████▎ | 566/594 [1:31:54<04:20, 9.30s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▎ | 566/594 [1:31:54<04:20, 9.30s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▎ | 566/594 [1:31:54<04:20, 9.30s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▎ | 566/594 [1:31:54<04:20, 9.30s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.1384, 'learning_rate': 3.191489361702128e-06, 'epoch': 0.95} 95%|███████████████████████████████████████████████████████████████████████████▎ | 566/594 [1:31:54<04:20, 9.30s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▎ | 566/594 [1:31:54<04:20, 9.30s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 95%|███████████████████████████████████████████████████████████████████████████▎ | 566/594 [1:31:54<04:20, 9.30s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▌ | 568/594 [1:32:12<03:58, 9.19s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▌ | 568/594 [1:32:12<03:58, 9.19s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.0906, 'learning_rate': 3.0851063829787237e-06, 'epoch': 0.96} 96%|███████████████████████████████████████████████████████████████████████████▌ | 568/594 [1:32:12<03:58, 9.19s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▌ | 568/594 [1:32:12<03:58, 9.19s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▌ | 568/594 [1:32:12<03:58, 9.19s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▌ | 568/594 [1:32:12<03:58, 9.19s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 3.9581, 'learning_rate': 2.978723404255319e-06, 'epoch': 0.96} [WARNING|modeling_utils.py:388] 2022-02-28 20:23:06,229 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:23:06,229 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:23:06,229 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▊ | 570/594 [1:32:30<03:37, 9.08s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▊ | 570/594 [1:32:30<03:37, 9.08s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▊ | 570/594 [1:32:30<03:37, 9.08s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▊ | 570/594 [1:32:30<03:37, 9.08s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▊ | 570/594 [1:32:30<03:37, 9.08s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▉ | 571/594 [1:32:38<03:27, 9.00s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▉ | 571/594 [1:32:38<03:27, 9.00s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▉ | 571/594 [1:32:38<03:27, 9.00s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|███████████████████████████████████████████████████████████████████████████▉ | 571/594 [1:32:38<03:27, 9.00s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|████████████████████████████████████████████████████████████████████████████ | 572/594 [1:32:47<03:16, 8.94s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|████████████████████████████████████████████████████████████████████████████ | 572/594 [1:32:47<03:16, 8.94s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.1715, 'learning_rate': 2.6595744680851065e-06, 'epoch': 0.96} 96%|████████████████████████████████████████████████████████████████████████████ | 572/594 [1:32:47<03:16, 8.94s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|████████████████████████████████████████████████████████████████████████████ | 572/594 [1:32:47<03:16, 8.94s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|████████████████████████████████████████████████████████████████████████████ | 572/594 [1:32:47<03:16, 8.94s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 96%|████████████████████████████████████████████████████████████████████████████ | 572/594 [1:32:47<03:16, 8.94s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.2385, 'learning_rate': 2.553191489361702e-06, 'epoch': 0.96} [WARNING|modeling_utils.py:388] 2022-02-28 20:23:41,323 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:23:41,323 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.1462, 'learning_rate': 2.446808510638298e-06, 'epoch': 0.97} [WARNING|modeling_utils.py:388] 2022-02-28 20:23:49,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:23:49,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:23:49,922 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▍ | 575/594 [1:33:14<02:49, 8.90s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▍ | 575/594 [1:33:14<02:49, 8.90s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▍ | 575/594 [1:33:14<02:49, 8.90s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▍ | 575/594 [1:33:14<02:49, 8.90s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▍ | 575/594 [1:33:14<02:49, 8.90s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▌ | 576/594 [1:33:22<02:38, 8.80s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▌ | 576/594 [1:33:22<02:38, 8.80s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▌ | 576/594 [1:33:22<02:38, 8.80s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▌ | 576/594 [1:33:22<02:38, 8.80s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▌ | 576/594 [1:33:22<02:38, 8.80s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▋ | 577/594 [1:33:31<02:27, 8.69s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▋ | 577/594 [1:33:31<02:27, 8.69s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▋ | 577/594 [1:33:31<02:27, 8.69s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▋ | 577/594 [1:33:31<02:27, 8.69s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▋ | 577/594 [1:33:31<02:27, 8.69s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▊ | 578/594 [1:33:39<02:17, 8.62s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▊ | 578/594 [1:33:39<02:17, 8.62s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▊ | 578/594 [1:33:39<02:17, 8.62s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|████████████████████████████████████████████████████████████████████████████▊ | 578/594 [1:33:39<02:17, 8.62s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|█████████████████████████████████████████████████████████████████████████████ | 579/594 [1:33:48<02:08, 8.55s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 97%|█████████████████████████████████████████████████████████████████████████████ | 579/594 [1:33:48<02:08, 8.55s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:24:32,712 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:24:32,712 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▏ | 580/594 [1:33:56<01:58, 8.46s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▏ | 580/594 [1:33:56<01:58, 8.46s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.2692, 'learning_rate': 1.8085106382978727e-06, 'epoch': 0.98} 98%|█████████████████████████████████████████████████████████████████████████████▏ | 580/594 [1:33:56<01:58, 8.46s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▏ | 580/594 [1:33:56<01:58, 8.46s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▎ | 581/594 [1:34:04<01:48, 8.32s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▎ | 581/594 [1:34:04<01:48, 8.32s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.3664, 'learning_rate': 1.7021276595744682e-06, 'epoch': 0.98} 98%|█████████████████████████████████████████████████████████████████████████████▎ | 581/594 [1:34:04<01:48, 8.32s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▎ | 581/594 [1:34:04<01:48, 8.32s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▎ | 581/594 [1:34:04<01:48, 8.32s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▍ | 582/594 [1:34:12<01:37, 8.16s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▍ | 582/594 [1:34:12<01:37, 8.16s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▍ | 582/594 [1:34:12<01:37, 8.16s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▍ | 582/594 [1:34:12<01:37, 8.16s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▍ | 582/594 [1:34:12<01:37, 8.16s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▌ | 583/594 [1:34:19<01:27, 8.00s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▌ | 583/594 [1:34:19<01:27, 8.00s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▌ | 583/594 [1:34:19<01:27, 8.00s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▌ | 583/594 [1:34:19<01:27, 8.00s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▌ | 583/594 [1:34:19<01:27, 8.00s/it]g-point operations will not be computed-28 20:19:38,544 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▋ | 584/594 [1:34:27<01:18, 7.81s/it][WARNING|modeling_utils.py:388] 2022-02-28 20:25:09,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▋ | 584/594 [1:34:27<01:18, 7.81s/it][WARNING|modeling_utils.py:388] 2022-02-28 20:25:09,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▋ | 584/594 [1:34:27<01:18, 7.81s/it][WARNING|modeling_utils.py:388] 2022-02-28 20:25:09,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▋ | 584/594 [1:34:27<01:18, 7.81s/it][WARNING|modeling_utils.py:388] 2022-02-28 20:25:09,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▊ | 585/594 [1:34:34<01:08, 7.59s/it][WARNING|modeling_utils.py:388] 2022-02-28 20:25:09,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▊ | 585/594 [1:34:34<01:08, 7.59s/it][WARNING|modeling_utils.py:388] 2022-02-28 20:25:09,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▊ | 585/594 [1:34:34<01:08, 7.59s/it][WARNING|modeling_utils.py:388] 2022-02-28 20:25:09,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 98%|█████████████████████████████████████████████████████████████████████████████▊ | 585/594 [1:34:34<01:08, 7.59s/it][WARNING|modeling_utils.py:388] 2022-02-28 20:25:09,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:25:21,288 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:25:09,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:25:21,288 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:25:09,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:25:25,937 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:25:09,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:25:25,937 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:25:09,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|██████████████████████████████████████████████████████████████████████████████ | 587/594 [1:34:46<00:48, 6.94s/it]g-point operations will not be computed-28 20:25:09,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|██████████████████████████████████████████████████████████████████████████████ | 587/594 [1:34:46<00:48, 6.94s/it]g-point operations will not be computed-28 20:25:09,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:25:31,800 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:25:09,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:25:31,800 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:25:09,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|██████████████████████████████████████████████████████████████████████████████▏| 588/594 [1:34:52<00:39, 6.59s/it]g-point operations will not be computed-28 20:25:09,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:25:35,889 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:25:09,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:25:35,889 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:25:09,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:25:35,889 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:25:09,472 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|██████████████████████████████████████████████████████████████████████████████▎| 589/594 [1:34:57<00:30, 6.18s/it][WARNING|modeling_utils.py:388] 2022-02-28 20:25:39,706 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:25:42,038 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:25:39,706 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:25:42,038 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:25:39,706 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|██████████████████████████████████████████████████████████████████████████████▍| 590/594 [1:35:02<00:23, 5.77s/it][WARNING|modeling_utils.py:388] 2022-02-28 20:25:44,344 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:25:46,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:25:44,344 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:25:46,388 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:25:44,344 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 99%|██████████████████████████████████████████████████████████████████████████████▌| 591/594 [1:35:06<00:15, 5.29s/it][WARNING|modeling_utils.py:388] 2022-02-28 20:25:48,416 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:25:50,261 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:25:48,416 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:25:50,261 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:25:48,416 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 100%|██████████████████████████████████████████████████████████████████████████████▋| 592/594 [1:35:10<00:09, 4.83s/it][WARNING|modeling_utils.py:388] 2022-02-28 20:25:52,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 100%|██████████████████████████████████████████████████████████████████████████████▊| 593/594 [1:35:13<00:04, 4.35s/it]g-point operations will not be computed-28 20:25:52,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 100%|██████████████████████████████████████████████████████████████████████████████▊| 593/594 [1:35:13<00:04, 4.35s/it]g-point operations will not be computed-28 20:25:52,075 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:25:56,492 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:25:55,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [WARNING|modeling_utils.py:388] 2022-02-28 20:25:56,492 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed-28 20:25:55,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. {'loss': 4.4907, 'learning_rate': 3.1914893617021275e-07, 'epoch': 1.0} [INFO|trainer.py:2114] 2022-02-28 20:25:57,230 >> Saving model checkpoint to ./=)███| 594/594 [1:35:16<00:00, 3.88s/it][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|trainer.py:2114] 2022-02-28 20:26:13,677 >> Saving model checkpoint to ./ ./pytorch_model.bin:16<00:00, 3.88s/it][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|modeling_utils.py:1081] 2022-02-28 20:26:29,910 >> Model weights saved in ./pytorch_model.bin:16<00:00, 3.88s/it][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file pytorch_model.bin: 0%| | 32.0k/2.99G [00:00> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file pytorch_model.bin: 1%|▍ | 30.9M/2.99G [00:02<03:12, 16.5MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file pytorch_model.bin: 2%|█ | 68.5M/2.99G [00:04<02:48, 18.6MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file pytorch_model.bin: 4%|█▋ | 108M/2.99G [00:06<02:37, 19.7MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file pytorch_model.bin: 5%|██▎ | 146M/2.99G [00:08<02:32, 20.0MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file pytorch_model.bin: 6%|██▉ | 187M/2.99G [00:10<02:24, 20.8MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file pytorch_model.bin: 6%|██▉ | 187M/2.99G [00:10<02:24, 20.8MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file pytorch_model.bin: 6%|██▉ | 187M/2.99G [00:10<02:24, 20.8MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file pytorch_model.bin: 6%|██▉ | 187M/2.99G [00:10<02:24, 20.8MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file pytorch_model.bin: 6%|██▉ | 187M/2.99G [00:10<02:24, 20.8MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 7bf7ab8..3ed2d83 main -> main5039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 7bf7ab8..3ed2d83 main -> main5039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.6M/35.6M [00:18<00:00, 15.9MB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 02/28/2022 20:30:44 - WARNING - huggingface_hub.repository - To https://huggingface.co/sanchit-gandhi/wav2vec2-gpt2-wandb-grid-search Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|████████████| 35.6M/35.6M [02:34<00:00, 165kB/s][INFO|trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|modelcard.py:460] 2022-02-28 20:30:47,706 >> Dropping the following result as it does not have all the necessary fields:trainer.py:1492] 2022-02-28 20:25:57,228 >> 5,181 >> Could not estimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 48%|█████▏ | 16.9M/35.6M [00:01<00:01, 17.7MB/s]To https://huggingface.co/sanchit-gandhi/wav2vec2-gpt2-wandb-grid-searchimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 48%|█████▏ | 16.9M/35.6M [00:01<00:01, 17.7MB/s]To https://huggingface.co/sanchit-gandhi/wav2vec2-gpt2-wandb-grid-searchimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 02/28/2022 20:30:53 - WARNING - huggingface_hub.repository - To https://huggingface.co/sanchit-gandhi/wav2vec2-gpt2-wandb-grid-search 3ed2d83..40e6505 main -> main Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 48%|█████▏ | 16.9M/35.6M [00:01<00:01, 17.7MB/s]To https://huggingface.co/sanchit-gandhi/wav2vec2-gpt2-wandb-grid-searchimate the number of tokens of the input, floating-point operations will not be computed.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. ***** train metrics ***** epoch = 1.0 train_loss = 4.3367 train_runtime = 1:35:18.43 train_samples = 28538 train_samples_per_second = 4.991 train_steps_per_second = 0.104 [INFO|trainer.py:2366] 2022-02-28 20:30:56,095 >> Num examples = 2642in the evaluation set don't have a corresponding argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message.ut_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 0%| | 0/221 [00:00> Saving model checkpoint to ./ | 2/221 [00:03<05:45, 1.58s/it] argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message.ut_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. [INFO|modeling_utils.py:1081] 2022-02-28 20:46:48,871 >> Model weights saved in ./pytorch_model.bin:03<05:45, 1.58s/it] argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message.ut_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 41%|████▍ | 14.6M/35.7M [00:01<00:01, 15.2MB/s] argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message.ut_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 41%|████▍ | 14.6M/35.7M [00:01<00:01, 15.2MB/s] argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message.ut_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. 02/28/2022 20:47:18 - WARNING - huggingface_hub.repository - To https://huggingface.co/sanchit-gandhi/wav2vec2-gpt2-wandb-grid-search Upload file wandb/run-20220228_185039-2l3jouo4/run-2l3jouo4.wandb: 100%|███████████| 35.7M/35.7M [00:02<00:00, 18.7MB/s] argument in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message.ut_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. return ModelInfo(**d)f.finetuned_from)formers/src/transformers/modelcard.py", line 611, in from_trainercard31, in mainule>ent in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message.ut_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. return ModelInfo(**d)f.finetuned_from)formers/src/transformers/modelcard.py", line 611, in from_trainercard31, in mainule>ent in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message.ut_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message. return ModelInfo(**d)f.finetuned_from)formers/src/transformers/modelcard.py", line 611, in from_trainercard31, in mainule>ent in `SpeechEncoderDecoderModel.forward` and have been ignored: input_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message.ut_length. If input_length are not expected by `SpeechEncoderDecoderModel.forward`, you can safely ignore this message.