2024/01/01 22:38:49 - mmengine - INFO - Iter(train) [ 500/640000] base_lr: 2.0000e-04 lr: 2.0000e-05 eta: 11 days, 5:24:02 time: 1.5166 data_time: 0.0114 memory: 25685 grad_norm: 33.0434 loss: 6.2737 caption_loss_cls: 12.5696 detection_loss_cls: 1.6176 detection_loss_reg: 0.9343 semantic_segmentation_loss_cls: 0.2450 grounding_loss_reg: 11.4400 instance_segmentation_loss_cls: 1.0438 instance_segmentation_loss_reg: 0.9658 instance_segmentation_loss_poly: 3.9015 2024/01/01 22:51:19 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/01 22:51:19 - mmengine - INFO - Iter(train) [ 1000/640000] base_lr: 2.0000e-04 lr: 2.0000e-05 eta: 11 days, 3:37:28 time: 1.5077 data_time: 0.0109 memory: 25685 grad_norm: 23.1533 loss: 5.0888 caption_loss_cls: 9.4701 detection_loss_cls: 0.9446 detection_loss_reg: 0.7794 semantic_segmentation_loss_cls: 0.1739 grounding_loss_reg: 9.5128 instance_segmentation_loss_cls: 0.7035 instance_segmentation_loss_reg: 0.7910 instance_segmentation_loss_poly: 3.0479 2024/01/01 23:03:54 - mmengine - INFO - Iter(train) [ 1500/640000] base_lr: 2.0000e-04 lr: 2.0000e-05 eta: 11 days, 3:34:26 time: 1.5086 data_time: 0.0108 memory: 25685 grad_norm: 17.6954 loss: 4.4624 caption_loss_cls: 8.0333 detection_loss_cls: 0.6946 detection_loss_reg: 0.7212 semantic_segmentation_loss_cls: 0.1363 grounding_loss_reg: 8.6129 instance_segmentation_loss_cls: 0.5395 instance_segmentation_loss_reg: 0.7208 instance_segmentation_loss_poly: 2.6850 2024/01/01 23:16:53 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/01 23:16:53 - mmengine - INFO - Iter(train) [ 2000/640000] base_lr: 2.0000e-04 lr: 2.0000e-05 eta: 11 days, 5:34:12 time: 1.5211 data_time: 0.0109 memory: 25685 grad_norm: 14.6185 loss: 4.0903 caption_loss_cls: 7.1540 detection_loss_cls: 0.5592 detection_loss_reg: 0.6734 semantic_segmentation_loss_cls: 0.1155 grounding_loss_reg: 8.0990 instance_segmentation_loss_cls: 0.4318 instance_segmentation_loss_reg: 0.6744 instance_segmentation_loss_poly: 2.4185 2024/01/01 23:16:53 - mmengine - INFO - Saving checkpoint at 2000 iterations 2024/01/01 23:30:09 - mmengine - INFO - Iter(train) [ 2500/640000] base_lr: 1.9999e-04 lr: 1.9999e-05 eta: 11 days, 7:51:16 time: 1.5352 data_time: 0.0211 memory: 25685 grad_norm: 12.5598 loss: 3.8558 caption_loss_cls: 6.6163 detection_loss_cls: 0.4667 detection_loss_reg: 0.6441 semantic_segmentation_loss_cls: 0.1012 grounding_loss_reg: 7.7361 instance_segmentation_loss_cls: 0.3723 instance_segmentation_loss_reg: 0.6402 instance_segmentation_loss_poly: 2.2595 2024/01/01 23:43:06 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/01 23:43:06 - mmengine - INFO - Iter(train) [ 3000/640000] base_lr: 1.9999e-04 lr: 1.9999e-05 eta: 11 days, 8:13:25 time: 1.5385 data_time: 0.0195 memory: 25685 grad_norm: 11.0927 loss: 3.6406 caption_loss_cls: 6.2271 detection_loss_cls: 0.3971 detection_loss_reg: 0.6141 semantic_segmentation_loss_cls: 0.0899 grounding_loss_reg: 7.4892 instance_segmentation_loss_cls: 0.3317 instance_segmentation_loss_reg: 0.6187 instance_segmentation_loss_poly: 2.1436 2024/01/01 23:55:54 - mmengine - INFO - Iter(train) [ 3500/640000] base_lr: 1.9999e-04 lr: 1.9999e-05 eta: 11 days, 7:57:26 time: 1.5382 data_time: 0.0182 memory: 25685 grad_norm: 9.9919 loss: 3.4821 caption_loss_cls: 5.9094 detection_loss_cls: 0.3506 detection_loss_reg: 0.5930 semantic_segmentation_loss_cls: 0.0806 grounding_loss_reg: 7.2719 instance_segmentation_loss_cls: 0.3003 instance_segmentation_loss_reg: 0.5994 instance_segmentation_loss_poly: 2.0556 2024/01/02 00:08:55 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 00:08:55 - mmengine - INFO - Iter(train) [ 4000/640000] base_lr: 1.9998e-04 lr: 1.9998e-05 eta: 11 days, 8:14:24 time: 1.5410 data_time: 0.0173 memory: 25685 grad_norm: 9.1000 loss: 3.3253 caption_loss_cls: 5.6507 detection_loss_cls: 0.3172 detection_loss_reg: 0.5730 semantic_segmentation_loss_cls: 0.0723 grounding_loss_reg: 7.1102 instance_segmentation_loss_cls: 0.2751 instance_segmentation_loss_reg: 0.5886 instance_segmentation_loss_poly: 1.9959 2024/01/02 00:08:55 - mmengine - INFO - Saving checkpoint at 4000 iterations 2024/01/02 00:21:53 - mmengine - INFO - Iter(train) [ 4500/640000] base_lr: 1.9998e-04 lr: 1.9998e-05 eta: 11 days, 8:18:33 time: 1.5458 data_time: 0.0234 memory: 25685 grad_norm: 5.3651 loss: 2.8592 caption_loss_cls: 5.4213 detection_loss_cls: 0.2880 detection_loss_reg: 0.5596 semantic_segmentation_loss_cls: 0.0673 grounding_loss_reg: 6.9553 instance_segmentation_loss_cls: 0.2568 instance_segmentation_loss_reg: 0.5855 instance_segmentation_loss_poly: 1.9652 2024/01/02 00:35:04 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 00:35:04 - mmengine - INFO - Iter(train) [ 5000/640000] base_lr: 1.9997e-04 lr: 1.9997e-05 eta: 11 days, 8:47:36 time: 1.5562 data_time: 0.0234 memory: 25685 grad_norm: 4.0557 loss: 2.6573 caption_loss_cls: 5.2657 detection_loss_cls: 0.2647 detection_loss_reg: 0.5519 semantic_segmentation_loss_cls: 0.0629 grounding_loss_reg: 6.8333 instance_segmentation_loss_cls: 0.2395 instance_segmentation_loss_reg: 0.5718 instance_segmentation_loss_poly: 1.9069 2024/01/02 00:48:47 - mmengine - INFO - Iter(train) [ 5500/640000] base_lr: 1.9996e-04 lr: 1.9996e-05 eta: 11 days, 10:11:42 time: 1.5734 data_time: 0.0237 memory: 25685 grad_norm: 3.5501 loss: 2.5471 caption_loss_cls: 5.1347 detection_loss_cls: 0.2488 detection_loss_reg: 0.5443 semantic_segmentation_loss_cls: 0.0596 grounding_loss_reg: 6.7323 instance_segmentation_loss_cls: 0.2202 instance_segmentation_loss_reg: 0.5538 instance_segmentation_loss_poly: 1.8377 2024/01/02 01:01:39 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 01:01:39 - mmengine - INFO - Iter(train) [ 6000/640000] base_lr: 1.9996e-04 lr: 1.9996e-05 eta: 11 days, 9:48:27 time: 1.5716 data_time: 0.0236 memory: 25685 grad_norm: 3.2403 loss: 2.4700 caption_loss_cls: 4.9812 detection_loss_cls: 0.2342 detection_loss_reg: 0.5422 semantic_segmentation_loss_cls: 0.0566 grounding_loss_reg: 6.6311 instance_segmentation_loss_cls: 0.2092 instance_segmentation_loss_reg: 0.5491 instance_segmentation_loss_poly: 1.8086 2024/01/02 01:01:39 - mmengine - INFO - Saving checkpoint at 6000 iterations 2024/01/02 01:14:56 - mmengine - INFO - Iter(train) [ 6500/640000] base_lr: 1.9995e-04 lr: 1.9995e-05 eta: 11 days, 10:06:21 time: 1.5717 data_time: 0.0233 memory: 25685 grad_norm: 3.0500 loss: 2.3861 caption_loss_cls: 4.8710 detection_loss_cls: 0.2199 detection_loss_reg: 0.5281 semantic_segmentation_loss_cls: 0.0540 grounding_loss_reg: 6.5381 instance_segmentation_loss_cls: 0.1993 instance_segmentation_loss_reg: 0.5403 instance_segmentation_loss_poly: 1.7710 2024/01/02 01:28:04 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 01:28:04 - mmengine - INFO - Iter(train) [ 7000/640000] base_lr: 1.9994e-04 lr: 1.9994e-05 eta: 11 days, 10:08:13 time: 1.5745 data_time: 0.0234 memory: 25685 grad_norm: 2.9244 loss: 2.3349 caption_loss_cls: 4.7694 detection_loss_cls: 0.2083 detection_loss_reg: 0.5228 semantic_segmentation_loss_cls: 0.0518 grounding_loss_reg: 6.4398 instance_segmentation_loss_cls: 0.1897 instance_segmentation_loss_reg: 0.5305 instance_segmentation_loss_poly: 1.7287 2024/01/02 01:40:36 - mmengine - INFO - Iter(train) [ 7500/640000] base_lr: 1.9993e-04 lr: 1.9993e-05 eta: 11 days, 9:15:13 time: 1.5702 data_time: 0.0234 memory: 25685 grad_norm: 2.8625 loss: 2.3061 caption_loss_cls: 4.6567 detection_loss_cls: 0.1997 detection_loss_reg: 0.5203 semantic_segmentation_loss_cls: 0.0499 grounding_loss_reg: 6.3569 instance_segmentation_loss_cls: 0.1828 instance_segmentation_loss_reg: 0.5266 instance_segmentation_loss_poly: 1.7111 2024/01/02 01:53:49 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 01:53:49 - mmengine - INFO - Iter(train) [ 8000/640000] base_lr: 1.9992e-04 lr: 1.9992e-05 eta: 11 days, 9:23:22 time: 1.5736 data_time: 0.0234 memory: 25685 grad_norm: 2.8339 loss: 2.2754 caption_loss_cls: 4.5615 detection_loss_cls: 0.1904 detection_loss_reg: 0.5112 semantic_segmentation_loss_cls: 0.0480 grounding_loss_reg: 6.2871 instance_segmentation_loss_cls: 0.1751 instance_segmentation_loss_reg: 0.5179 instance_segmentation_loss_poly: 1.6790 2024/01/02 01:53:49 - mmengine - INFO - Saving checkpoint at 8000 iterations 2024/01/02 02:07:14 - mmengine - INFO - Iter(train) [ 8500/640000] base_lr: 1.9991e-04 lr: 1.9991e-05 eta: 11 days, 9:42:42 time: 1.5803 data_time: 0.0235 memory: 25685 grad_norm: 2.8029 loss: 2.2441 caption_loss_cls: 4.5020 detection_loss_cls: 0.1833 detection_loss_reg: 0.5096 semantic_segmentation_loss_cls: 0.0465 grounding_loss_reg: 6.2036 instance_segmentation_loss_cls: 0.1689 instance_segmentation_loss_reg: 0.5148 instance_segmentation_loss_poly: 1.6613 2024/01/02 02:20:11 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 02:20:11 - mmengine - INFO - Iter(train) [ 9000/640000] base_lr: 1.9990e-04 lr: 1.9990e-05 eta: 11 days, 9:25:40 time: 1.5767 data_time: 0.0236 memory: 25685 grad_norm: 2.8035 loss: 2.2228 caption_loss_cls: 4.4150 detection_loss_cls: 0.1772 detection_loss_reg: 0.5058 semantic_segmentation_loss_cls: 0.0453 grounding_loss_reg: 6.1349 instance_segmentation_loss_cls: 0.1630 instance_segmentation_loss_reg: 0.5116 instance_segmentation_loss_poly: 1.6408 2024/01/02 02:33:11 - mmengine - INFO - Iter(train) [ 9500/640000] base_lr: 1.9989e-04 lr: 1.9989e-05 eta: 11 days, 9:12:41 time: 1.5658 data_time: 0.0234 memory: 25685 grad_norm: 2.8062 loss: 2.1915 caption_loss_cls: 4.3586 detection_loss_cls: 0.1717 detection_loss_reg: 0.5069 semantic_segmentation_loss_cls: 0.0441 grounding_loss_reg: 6.0706 instance_segmentation_loss_cls: 0.1578 instance_segmentation_loss_reg: 0.5077 instance_segmentation_loss_poly: 1.6251 2024/01/02 02:46:04 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 02:46:04 - mmengine - INFO - Iter(train) [ 10000/640000] base_lr: 1.9988e-04 lr: 1.9988e-05 eta: 11 days, 8:52:16 time: 1.5660 data_time: 0.0234 memory: 25685 grad_norm: 2.8024 loss: 2.1573 caption_loss_cls: 4.3004 detection_loss_cls: 0.1662 detection_loss_reg: 0.5013 semantic_segmentation_loss_cls: 0.0429 grounding_loss_reg: 5.9928 instance_segmentation_loss_cls: 0.1524 instance_segmentation_loss_reg: 0.5018 instance_segmentation_loss_poly: 1.6009 2024/01/02 02:46:04 - mmengine - INFO - Saving checkpoint at 10000 iterations 2024/01/02 02:59:29 - mmengine - INFO - Iter(train) [ 10500/640000] base_lr: 1.9987e-04 lr: 1.9987e-05 eta: 11 days, 9:04:36 time: 1.5682 data_time: 0.0235 memory: 25685 grad_norm: 2.7639 loss: 2.1169 caption_loss_cls: 4.2461 detection_loss_cls: 0.1618 detection_loss_reg: 0.5024 semantic_segmentation_loss_cls: 0.0414 grounding_loss_reg: 5.9452 instance_segmentation_loss_cls: 0.1480 instance_segmentation_loss_reg: 0.4978 instance_segmentation_loss_poly: 1.5823 2024/01/02 03:12:25 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 03:12:25 - mmengine - INFO - Iter(train) [ 11000/640000] base_lr: 1.9985e-04 lr: 1.9985e-05 eta: 11 days, 8:47:02 time: 1.5650 data_time: 0.0244 memory: 25685 grad_norm: 2.7669 loss: 2.1001 caption_loss_cls: 4.2006 detection_loss_cls: 0.1575 detection_loss_reg: 0.4985 semantic_segmentation_loss_cls: 0.0402 grounding_loss_reg: 5.8884 instance_segmentation_loss_cls: 0.1447 instance_segmentation_loss_reg: 0.4976 instance_segmentation_loss_poly: 1.5757 2024/01/02 03:24:59 - mmengine - INFO - Iter(train) [ 11500/640000] base_lr: 1.9984e-04 lr: 1.9984e-05 eta: 11 days, 8:09:32 time: 1.5657 data_time: 0.0244 memory: 25685 grad_norm: 2.7860 loss: 2.0791 caption_loss_cls: 4.1493 detection_loss_cls: 0.1535 detection_loss_reg: 0.4969 semantic_segmentation_loss_cls: 0.0393 grounding_loss_reg: 5.8287 instance_segmentation_loss_cls: 0.1416 instance_segmentation_loss_reg: 0.4965 instance_segmentation_loss_poly: 1.5665 2024/01/02 03:38:03 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 03:38:03 - mmengine - INFO - Iter(train) [ 12000/640000] base_lr: 1.9983e-04 lr: 1.9983e-05 eta: 11 days, 8:01:05 time: 1.5635 data_time: 0.0245 memory: 25685 grad_norm: 2.8107 loss: 2.0797 caption_loss_cls: 4.1074 detection_loss_cls: 0.1499 detection_loss_reg: 0.4944 semantic_segmentation_loss_cls: 0.0384 grounding_loss_reg: 5.7714 instance_segmentation_loss_cls: 0.1381 instance_segmentation_loss_reg: 0.4957 instance_segmentation_loss_poly: 1.5577 2024/01/02 03:38:03 - mmengine - INFO - Saving checkpoint at 12000 iterations 2024/01/02 03:50:56 - mmengine - INFO - Iter(train) [ 12500/640000] base_lr: 1.9981e-04 lr: 1.9981e-05 eta: 11 days, 7:42:10 time: 1.5554 data_time: 0.0244 memory: 25685 grad_norm: 2.8161 loss: 2.0580 caption_loss_cls: 4.0755 detection_loss_cls: 0.1461 detection_loss_reg: 0.4919 semantic_segmentation_loss_cls: 0.0376 grounding_loss_reg: 5.7236 instance_segmentation_loss_cls: 0.1357 instance_segmentation_loss_reg: 0.4938 instance_segmentation_loss_poly: 1.5482 2024/01/02 04:04:01 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 04:04:01 - mmengine - INFO - Iter(train) [ 13000/640000] base_lr: 1.9980e-04 lr: 1.9980e-05 eta: 11 days, 7:34:10 time: 1.5576 data_time: 0.0244 memory: 25685 grad_norm: 2.7797 loss: 2.0305 caption_loss_cls: 4.0433 detection_loss_cls: 0.1418 detection_loss_reg: 0.4908 semantic_segmentation_loss_cls: 0.0368 grounding_loss_reg: 5.6830 instance_segmentation_loss_cls: 0.1336 instance_segmentation_loss_reg: 0.4950 instance_segmentation_loss_poly: 1.5449 2024/01/02 04:16:39 - mmengine - INFO - Iter(train) [ 13500/640000] base_lr: 1.9978e-04 lr: 1.9978e-05 eta: 11 days, 7:04:06 time: 1.5520 data_time: 0.0244 memory: 25685 grad_norm: 2.7891 loss: 2.0227 caption_loss_cls: 4.0018 detection_loss_cls: 0.1390 detection_loss_reg: 0.4882 semantic_segmentation_loss_cls: 0.0360 grounding_loss_reg: 5.6347 instance_segmentation_loss_cls: 0.1316 instance_segmentation_loss_reg: 0.4963 instance_segmentation_loss_poly: 1.5454 2024/01/02 04:29:19 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 04:29:19 - mmengine - INFO - Iter(train) [ 14000/640000] base_lr: 1.9976e-04 lr: 1.9976e-05 eta: 11 days, 6:36:55 time: 1.5487 data_time: 0.0244 memory: 25685 grad_norm: 2.7944 loss: 2.0063 caption_loss_cls: 3.9520 detection_loss_cls: 0.1363 detection_loss_reg: 0.4855 semantic_segmentation_loss_cls: 0.0357 grounding_loss_reg: 5.5892 instance_segmentation_loss_cls: 0.1287 instance_segmentation_loss_reg: 0.4925 instance_segmentation_loss_poly: 1.5310 2024/01/02 04:29:19 - mmengine - INFO - Saving checkpoint at 14000 iterations 2024/01/02 04:42:39 - mmengine - INFO - Iter(train) [ 14500/640000] base_lr: 1.9975e-04 lr: 1.9975e-05 eta: 11 days, 6:39:31 time: 1.5474 data_time: 0.0245 memory: 25685 grad_norm: 2.8188 loss: 2.0091 caption_loss_cls: 3.9194 detection_loss_cls: 0.1337 detection_loss_reg: 0.4841 semantic_segmentation_loss_cls: 0.0350 grounding_loss_reg: 5.5552 instance_segmentation_loss_cls: 0.1263 instance_segmentation_loss_reg: 0.4920 instance_segmentation_loss_poly: 1.5242 2024/01/02 04:55:54 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 04:55:54 - mmengine - INFO - Iter(train) [ 15000/640000] base_lr: 1.9973e-04 lr: 1.9973e-05 eta: 11 days, 6:38:18 time: 1.5524 data_time: 0.0237 memory: 25685 grad_norm: 2.8160 loss: 2.0085 caption_loss_cls: 3.8891 detection_loss_cls: 0.1314 detection_loss_reg: 0.4837 semantic_segmentation_loss_cls: 0.0344 grounding_loss_reg: 5.5243 instance_segmentation_loss_cls: 0.1240 instance_segmentation_loss_reg: 0.4909 instance_segmentation_loss_poly: 1.5153 2024/01/02 05:08:33 - mmengine - INFO - Iter(train) [ 15500/640000] base_lr: 1.9971e-04 lr: 1.9971e-05 eta: 11 days, 6:11:04 time: 1.5535 data_time: 0.0236 memory: 25685 grad_norm: 2.7792 loss: 1.9651 caption_loss_cls: 3.8518 detection_loss_cls: 0.1292 detection_loss_reg: 0.4813 semantic_segmentation_loss_cls: 0.0338 grounding_loss_reg: 5.4842 instance_segmentation_loss_cls: 0.1217 instance_segmentation_loss_reg: 0.4863 instance_segmentation_loss_poly: 1.5015 2024/01/02 05:21:35 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 05:21:35 - mmengine - INFO - Iter(train) [ 16000/640000] base_lr: 1.9969e-04 lr: 1.9969e-05 eta: 11 days, 6:00:43 time: 1.5530 data_time: 0.0236 memory: 25685 grad_norm: 2.7389 loss: 1.9467 caption_loss_cls: 3.8260 detection_loss_cls: 0.1272 detection_loss_reg: 0.4817 semantic_segmentation_loss_cls: 0.0331 grounding_loss_reg: 5.4539 instance_segmentation_loss_cls: 0.1198 instance_segmentation_loss_reg: 0.4863 instance_segmentation_loss_poly: 1.4951 2024/01/02 05:21:35 - mmengine - INFO - Saving checkpoint at 16000 iterations 2024/01/02 05:34:56 - mmengine - INFO - Iter(train) [ 16500/640000] base_lr: 1.9967e-04 lr: 1.9967e-05 eta: 11 days, 6:01:25 time: 1.5600 data_time: 0.0237 memory: 25685 grad_norm: 2.7474 loss: 1.9286 caption_loss_cls: 3.8002 detection_loss_cls: 0.1248 detection_loss_reg: 0.4808 semantic_segmentation_loss_cls: 0.0327 grounding_loss_reg: 5.4149 instance_segmentation_loss_cls: 0.1183 instance_segmentation_loss_reg: 0.4877 instance_segmentation_loss_poly: 1.4954 2024/01/02 05:47:27 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 05:47:27 - mmengine - INFO - Iter(train) [ 17000/640000] base_lr: 1.9965e-04 lr: 1.9965e-05 eta: 11 days, 5:30:57 time: 1.5514 data_time: 0.0236 memory: 25685 grad_norm: 2.7813 loss: 1.9122 caption_loss_cls: 3.7758 detection_loss_cls: 0.1228 detection_loss_reg: 0.4792 semantic_segmentation_loss_cls: 0.0321 grounding_loss_reg: 5.3838 instance_segmentation_loss_cls: 0.1166 instance_segmentation_loss_reg: 0.4847 instance_segmentation_loss_poly: 1.4846 2024/01/02 06:00:03 - mmengine - INFO - Iter(train) [ 17500/640000] base_lr: 1.9963e-04 lr: 1.9963e-05 eta: 11 days, 5:04:27 time: 1.5510 data_time: 0.0236 memory: 25685 grad_norm: 2.7766 loss: 1.9020 caption_loss_cls: 3.7514 detection_loss_cls: 0.1214 detection_loss_reg: 0.4792 semantic_segmentation_loss_cls: 0.0316 grounding_loss_reg: 5.3555 instance_segmentation_loss_cls: 0.1148 instance_segmentation_loss_reg: 0.4819 instance_segmentation_loss_poly: 1.4747 2024/01/02 06:12:56 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 06:12:56 - mmengine - INFO - Iter(train) [ 18000/640000] base_lr: 1.9961e-04 lr: 1.9961e-05 eta: 11 days, 4:48:23 time: 1.5542 data_time: 0.0236 memory: 25685 grad_norm: 2.7451 loss: 1.8886 caption_loss_cls: 3.7229 detection_loss_cls: 0.1198 detection_loss_reg: 0.4778 semantic_segmentation_loss_cls: 0.0312 grounding_loss_reg: 5.3274 instance_segmentation_loss_cls: 0.1133 instance_segmentation_loss_reg: 0.4817 instance_segmentation_loss_poly: 1.4721 2024/01/02 06:12:56 - mmengine - INFO - Saving checkpoint at 18000 iterations 2024/01/02 06:26:07 - mmengine - INFO - Iter(train) [ 18500/640000] base_lr: 1.9959e-04 lr: 1.9959e-05 eta: 11 days, 4:42:40 time: 1.5519 data_time: 0.0236 memory: 25685 grad_norm: 2.7961 loss: 1.8855 caption_loss_cls: 3.7058 detection_loss_cls: 0.1182 detection_loss_reg: 0.4775 semantic_segmentation_loss_cls: 0.0307 grounding_loss_reg: 5.2936 instance_segmentation_loss_cls: 0.1118 instance_segmentation_loss_reg: 0.4805 instance_segmentation_loss_poly: 1.4642 2024/01/02 06:38:39 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 06:38:39 - mmengine - INFO - Iter(train) [ 19000/640000] base_lr: 1.9957e-04 lr: 1.9957e-05 eta: 11 days, 4:15:49 time: 1.5412 data_time: 0.0235 memory: 25685 grad_norm: 2.8273 loss: 1.8748 caption_loss_cls: 3.6794 detection_loss_cls: 0.1170 detection_loss_reg: 0.4775 semantic_segmentation_loss_cls: 0.0303 grounding_loss_reg: 5.2621 instance_segmentation_loss_cls: 0.1106 instance_segmentation_loss_reg: 0.4806 instance_segmentation_loss_poly: 1.4622 2024/01/02 06:52:01 - mmengine - INFO - Iter(train) [ 19500/640000] base_lr: 1.9954e-04 lr: 1.9954e-05 eta: 11 days, 4:15:43 time: 1.5521 data_time: 0.0237 memory: 25685 grad_norm: 2.8052 loss: 1.8775 caption_loss_cls: 3.6660 detection_loss_cls: 0.1154 detection_loss_reg: 0.4755 semantic_segmentation_loss_cls: 0.0299 grounding_loss_reg: 5.2409 instance_segmentation_loss_cls: 0.1087 instance_segmentation_loss_reg: 0.4776 instance_segmentation_loss_poly: 1.4524 2024/01/02 07:04:37 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 07:04:37 - mmengine - INFO - Iter(train) [ 20000/640000] base_lr: 1.9952e-04 lr: 1.9952e-05 eta: 11 days, 3:51:15 time: 1.5454 data_time: 0.0236 memory: 25685 grad_norm: 2.8638 loss: 1.8835 caption_loss_cls: 3.4349 detection_loss_cls: 0.1139 detection_loss_reg: 0.4741 semantic_segmentation_loss_cls: 0.0243 grounding_loss_reg: 5.2132 instance_segmentation_loss_cls: 0.1076 instance_segmentation_loss_reg: 0.4772 instance_segmentation_loss_poly: 1.4492 2024/01/02 07:04:37 - mmengine - INFO - Saving checkpoint at 20000 iterations 2024/01/02 07:16:07 - mmengine - INFO - Evaluating bbox... 2024/01/02 07:17:02 - mmengine - INFO - bbox_mAP_copypaste: 0.327 0.512 0.360 0.216 0.390 0.407 2024/01/02 07:17:02 - mmengine - INFO - Evaluating segm... 2024/01/02 07:18:13 - mmengine - INFO - segm_mAP_copypaste: 0.174 0.379 0.141 0.075 0.224 0.301 2024/01/02 07:20:22 - mmengine - INFO - Evaluating bbox... 2024/01/02 07:21:19 - mmengine - INFO - bbox_mAP_copypaste: 0.328 0.513 0.361 0.216 0.390 0.411 2024/01/02 07:26:37 - mmengine - INFO - per class results: 2024/01/02 07:26:37 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 73.48 | 84.41 | | building | 78.91 | 93.96 | | sky | 92.3 | 96.21 | | floor | 79.8 | 86.49 | | tree | 70.09 | 85.94 | | ceiling | 82.44 | 93.52 | | road | 79.51 | 88.16 | | bed | 85.51 | 93.36 | | windowpane | 59.54 | 77.05 | | grass | 64.88 | 76.73 | | cabinet | 51.14 | 79.39 | | sidewalk | 60.18 | 82.68 | | person | 77.03 | 84.45 | | earth | 37.62 | 55.93 | | door | 40.66 | 54.35 | | table | 52.06 | 69.25 | | mountain | 52.89 | 79.0 | | plant | 46.15 | 53.86 | | curtain | 69.94 | 89.35 | | chair | 52.61 | 67.49 | | car | 79.89 | 90.65 | | water | 45.8 | 56.7 | | painting | 59.65 | 90.05 | | sofa | 54.21 | 86.78 | | shelf | 23.34 | 28.32 | | house | 37.11 | 52.3 | | sea | 48.16 | 90.75 | | mirror | 45.61 | 80.05 | | rug | 65.35 | 81.49 | | field | 21.49 | 29.07 | | armchair | 10.79 | 11.84 | | seat | 62.93 | 76.49 | | fence | 37.24 | 48.3 | | desk | 30.27 | 43.49 | | rock | 28.57 | 33.31 | | wardrobe | 30.79 | 33.41 | | lamp | 52.89 | 67.63 | | bathtub | 61.5 | 86.75 | | railing | 33.85 | 42.17 | | cushion | 44.86 | 54.72 | | base | 14.46 | 15.79 | | box | 17.74 | 21.08 | | column | 43.01 | 48.34 | | signboard | 24.96 | 29.3 | | chest of drawers | 19.61 | 29.06 | | counter | 22.89 | 31.71 | | sand | 33.86 | 38.88 | | sink | 56.68 | 63.72 | | skyscraper | 56.54 | 76.6 | | fireplace | 62.95 | 76.4 | | refrigerator | 63.36 | 72.03 | | grandstand | 46.29 | 54.0 | | path | 13.84 | 15.63 | | stairs | 25.83 | 28.75 | | runway | 67.18 | 76.45 | | case | 31.68 | 53.47 | | pool table | 85.64 | 95.78 | | pillow | 50.02 | 61.96 | | screen door | 24.67 | 25.33 | | stairway | 28.97 | 55.19 | | river | 21.08 | 26.35 | | bridge | 56.75 | 87.01 | | bookcase | 24.37 | 63.25 | | blind | 36.67 | 41.58 | | coffee table | 46.8 | 59.17 | | toilet | 78.77 | 85.26 | | flower | 28.96 | 40.0 | | book | 34.83 | 56.92 | | hill | 0.0 | 0.0 | | bench | 51.29 | 61.22 | | countertop | 24.98 | 25.93 | | stove | 57.73 | 66.92 | | palm | 47.01 | 68.47 | | kitchen island | 27.21 | 78.16 | | computer | 65.08 | 85.85 | | swivel chair | 34.57 | 58.95 | | boat | 39.17 | 41.09 | | bar | 21.16 | 31.54 | | arcade machine | 36.75 | 96.52 | | hovel | 0.0 | 0.0 | | bus | 84.63 | 87.99 | | towel | 51.26 | 60.74 | | light | 46.4 | 59.76 | | truck | 29.49 | 45.71 | | tower | 0.61 | 0.64 | | chandelier | 52.17 | 59.09 | | awning | 17.73 | 19.09 | | streetlight | 16.74 | 19.41 | | booth | 28.21 | 29.33 | | television receiver | 55.28 | 64.97 | | airplane | 53.74 | 62.91 | | dirt track | 0.0 | 0.0 | | apparel | 20.65 | 38.29 | | pole | 18.3 | 26.96 | | land | 0.0 | 0.0 | | bannister | 5.44 | 6.24 | | escalator | 48.67 | 71.45 | | ottoman | 37.36 | 59.67 | | bottle | 19.51 | 24.36 | | buffet | 22.7 | 24.38 | | poster | 15.32 | 21.95 | | stage | 3.06 | 3.8 | | van | 26.62 | 31.27 | | ship | 4.09 | 5.69 | | fountain | 31.65 | 35.85 | | conveyer belt | 55.86 | 62.71 | | canopy | 9.31 | 9.55 | | washer | 55.31 | 63.85 | | plaything | 17.06 | 18.73 | | swimming pool | 43.58 | 73.43 | | stool | 27.54 | 39.12 | | barrel | 49.05 | 50.87 | | basket | 20.76 | 22.4 | | waterfall | 61.41 | 84.95 | | tent | 60.32 | 98.6 | | bag | 17.37 | 21.04 | | minibike | 67.46 | 77.2 | | cradle | 55.82 | 97.83 | | oven | 23.12 | 40.28 | | ball | 38.58 | 66.66 | | food | 44.51 | 48.95 | | step | 1.85 | 2.11 | | tank | 30.33 | 35.73 | | trade name | 1.99 | 2.03 | | microwave | 73.37 | 81.61 | | pot | 38.91 | 51.06 | | animal | 55.01 | 60.14 | | bicycle | 50.82 | 64.02 | | lake | 0.0 | 0.0 | | dishwasher | 37.33 | 47.12 | | screen | 48.54 | 88.01 | | blanket | 6.72 | 7.55 | | sculpture | 38.32 | 47.05 | | hood | 39.59 | 43.28 | | sconce | 5.17 | 5.43 | | vase | 28.77 | 38.82 | | traffic light | 12.33 | 13.71 | | tray | 6.1 | 20.48 | | ashcan | 8.82 | 9.18 | | fan | 42.29 | 48.84 | | pier | 27.28 | 30.44 | | crt screen | 4.66 | 5.83 | | plate | 35.54 | 44.84 | | monitor | 58.42 | 76.69 | | bulletin board | 3.86 | 5.1 | | shower | 0.31 | 7.78 | | radiator | 26.44 | 27.27 | | glass | 10.0 | 10.93 | | clock | 28.86 | 48.04 | | flag | 41.33 | 61.65 | +---------------------+-------+-------+ 2024/01/02 07:26:50 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.3280 coco/bbox_mAP_50: 0.5130 coco/bbox_mAP_75: 0.3610 coco/bbox_mAP_s: 0.2160 coco/bbox_mAP_m: 0.3900 coco/bbox_mAP_l: 0.4110 coco/segm_mAP: 0.1740 coco/segm_mAP_50: 0.3790 coco/segm_mAP_75: 0.1410 coco/segm_mAP_s: 0.0750 coco/segm_mAP_m: 0.2240 coco/segm_mAP_l: 0.3010 Bleu_1: 0.6737 Bleu_2: 0.5005 Bleu_3: 0.3588 Bleu_4: 0.2527 METEOR: 0.2242 ROUGE_L: 0.5023 CIDEr: 0.7641 SPICE: 0.1627 aAcc: 79.5400 mIoU: 38.6900 mAcc: 50.2700 visual-grounding/miou: 0.5134 visual-grounding/acc: 0.5713 data_time: 0.0249 time: 1.9202 2024/01/02 07:39:38 - mmengine - INFO - Iter(train) [ 20500/640000] base_lr: 1.9949e-04 lr: 1.9949e-05 eta: 11 days, 3:34:30 time: 1.5377 data_time: 0.0182 memory: 34776 grad_norm: 2.8228 loss: 1.8687 caption_loss_cls: 3.3218 detection_loss_cls: 0.0878 detection_loss_reg: 0.4647 semantic_segmentation_loss_cls: 0.0227 grounding_loss_reg: 5.0837 instance_segmentation_loss_cls: 0.0882 instance_segmentation_loss_reg: 0.4661 instance_segmentation_loss_poly: 1.3933 2024/01/02 07:51:54 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 07:51:54 - mmengine - INFO - Iter(train) [ 21000/640000] base_lr: 1.9947e-04 lr: 1.9947e-05 eta: 11 days, 3:01:30 time: 1.5342 data_time: 0.0184 memory: 25684 grad_norm: 2.8486 loss: 1.8879 caption_loss_cls: 3.2689 detection_loss_cls: 0.0745 detection_loss_reg: 0.4562 semantic_segmentation_loss_cls: 0.0210 grounding_loss_reg: 4.9640 instance_segmentation_loss_cls: 0.0768 instance_segmentation_loss_reg: 0.4611 instance_segmentation_loss_poly: 1.3665 2024/01/02 08:04:37 - mmengine - INFO - Iter(train) [ 21500/640000] base_lr: 1.9944e-04 lr: 1.9944e-05 eta: 11 days, 2:42:01 time: 1.5359 data_time: 0.0186 memory: 25684 grad_norm: 2.8566 loss: 1.8681 caption_loss_cls: 3.2300 detection_loss_cls: 0.0701 detection_loss_reg: 0.4524 semantic_segmentation_loss_cls: 0.0202 grounding_loss_reg: 4.8828 instance_segmentation_loss_cls: 0.0719 instance_segmentation_loss_reg: 0.4548 instance_segmentation_loss_poly: 1.3411 2024/01/02 08:17:36 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 08:17:36 - mmengine - INFO - Iter(train) [ 22000/640000] base_lr: 1.9942e-04 lr: 1.9942e-05 eta: 11 days, 2:30:14 time: 1.5374 data_time: 0.0188 memory: 25684 grad_norm: 2.8446 loss: 1.8552 caption_loss_cls: 3.1883 detection_loss_cls: 0.0678 detection_loss_reg: 0.4500 semantic_segmentation_loss_cls: 0.0194 grounding_loss_reg: 4.8256 instance_segmentation_loss_cls: 0.0692 instance_segmentation_loss_reg: 0.4520 instance_segmentation_loss_poly: 1.3267 2024/01/02 08:17:36 - mmengine - INFO - Saving checkpoint at 22000 iterations 2024/01/02 08:30:48 - mmengine - INFO - Iter(train) [ 22500/640000] base_lr: 1.9939e-04 lr: 1.9939e-05 eta: 11 days, 2:24:39 time: 1.5378 data_time: 0.0201 memory: 25684 grad_norm: 2.7917 loss: 1.8286 caption_loss_cls: 3.1534 detection_loss_cls: 0.0662 detection_loss_reg: 0.4473 semantic_segmentation_loss_cls: 0.0190 grounding_loss_reg: 4.7690 instance_segmentation_loss_cls: 0.0675 instance_segmentation_loss_reg: 0.4497 instance_segmentation_loss_poly: 1.3180 2024/01/02 08:43:01 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 08:43:01 - mmengine - INFO - Iter(train) [ 23000/640000] base_lr: 1.9936e-04 lr: 1.9936e-05 eta: 11 days, 1:52:02 time: 1.5327 data_time: 0.0202 memory: 25684 grad_norm: 2.7917 loss: 1.8222 caption_loss_cls: 3.1186 detection_loss_cls: 0.0649 detection_loss_reg: 0.4456 semantic_segmentation_loss_cls: 0.0185 grounding_loss_reg: 4.7131 instance_segmentation_loss_cls: 0.0656 instance_segmentation_loss_reg: 0.4469 instance_segmentation_loss_poly: 1.3076 2024/01/02 08:55:21 - mmengine - INFO - Iter(train) [ 23500/640000] base_lr: 1.9934e-04 lr: 1.9934e-05 eta: 11 days, 1:23:40 time: 1.5173 data_time: 0.0202 memory: 25684 grad_norm: 2.9251 loss: 1.8218 caption_loss_cls: 3.0913 detection_loss_cls: 0.0642 detection_loss_reg: 0.4450 semantic_segmentation_loss_cls: 0.0183 grounding_loss_reg: 4.6579 instance_segmentation_loss_cls: 0.0641 instance_segmentation_loss_reg: 0.4443 instance_segmentation_loss_poly: 1.2982 2024/01/02 09:07:53 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 09:07:53 - mmengine - INFO - Iter(train) [ 24000/640000] base_lr: 1.9931e-04 lr: 1.9931e-05 eta: 11 days, 1:00:49 time: 1.5163 data_time: 0.0204 memory: 25684 grad_norm: 2.9329 loss: 1.8108 caption_loss_cls: 3.0670 detection_loss_cls: 0.0635 detection_loss_reg: 0.4437 semantic_segmentation_loss_cls: 0.0180 grounding_loss_reg: 4.6070 instance_segmentation_loss_cls: 0.0630 instance_segmentation_loss_reg: 0.4416 instance_segmentation_loss_poly: 1.2872 2024/01/02 09:07:53 - mmengine - INFO - Saving checkpoint at 24000 iterations 2024/01/02 09:21:03 - mmengine - INFO - Iter(train) [ 24500/640000] base_lr: 1.9928e-04 lr: 1.9928e-05 eta: 11 days, 0:54:40 time: 1.5214 data_time: 0.0267 memory: 25684 grad_norm: 2.9643 loss: 1.7847 caption_loss_cls: 3.0378 detection_loss_cls: 0.0625 detection_loss_reg: 0.4410 semantic_segmentation_loss_cls: 0.0178 grounding_loss_reg: 4.5605 instance_segmentation_loss_cls: 0.0621 instance_segmentation_loss_reg: 0.4382 instance_segmentation_loss_poly: 1.2769 2024/01/02 09:34:08 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 09:34:08 - mmengine - INFO - Iter(train) [ 25000/640000] base_lr: 1.9925e-04 lr: 1.9925e-05 eta: 11 days, 0:45:50 time: 1.5334 data_time: 0.0268 memory: 25684 grad_norm: 2.9549 loss: 1.7558 caption_loss_cls: 3.0108 detection_loss_cls: 0.0616 detection_loss_reg: 0.4396 semantic_segmentation_loss_cls: 0.0176 grounding_loss_reg: 4.5215 instance_segmentation_loss_cls: 0.0610 instance_segmentation_loss_reg: 0.4354 instance_segmentation_loss_poly: 1.2650 2024/01/02 09:46:21 - mmengine - INFO - Iter(train) [ 25500/640000] base_lr: 1.9922e-04 lr: 1.9922e-05 eta: 11 days, 0:16:09 time: 1.5260 data_time: 0.0269 memory: 25684 grad_norm: 3.0117 loss: 1.7755 caption_loss_cls: 2.9943 detection_loss_cls: 0.0607 detection_loss_reg: 0.4374 semantic_segmentation_loss_cls: 0.0176 grounding_loss_reg: 4.4772 instance_segmentation_loss_cls: 0.0606 instance_segmentation_loss_reg: 0.4369 instance_segmentation_loss_poly: 1.2658 2024/01/02 09:58:24 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 09:58:24 - mmengine - INFO - Iter(train) [ 26000/640000] base_lr: 1.9919e-04 lr: 1.9919e-05 eta: 10 days, 23:42:49 time: 1.5119 data_time: 0.0267 memory: 25684 grad_norm: 3.0935 loss: 1.7891 caption_loss_cls: 2.9728 detection_loss_cls: 0.0600 detection_loss_reg: 0.4337 semantic_segmentation_loss_cls: 0.0174 grounding_loss_reg: 4.4337 instance_segmentation_loss_cls: 0.0599 instance_segmentation_loss_reg: 0.4361 instance_segmentation_loss_poly: 1.2602 2024/01/02 09:58:24 - mmengine - INFO - Saving checkpoint at 26000 iterations 2024/01/02 10:11:05 - mmengine - INFO - Iter(train) [ 26500/640000] base_lr: 1.9916e-04 lr: 1.9916e-05 eta: 10 days, 23:25:37 time: 1.5043 data_time: 0.0261 memory: 25684 grad_norm: 3.1549 loss: 1.8022 caption_loss_cls: 2.9578 detection_loss_cls: 0.0597 detection_loss_reg: 0.4350 semantic_segmentation_loss_cls: 0.0172 grounding_loss_reg: 4.3914 instance_segmentation_loss_cls: 0.0595 instance_segmentation_loss_reg: 0.4347 instance_segmentation_loss_poly: 1.2562 2024/01/02 10:23:53 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 10:23:53 - mmengine - INFO - Iter(train) [ 27000/640000] base_lr: 1.9912e-04 lr: 1.9912e-05 eta: 10 days, 23:10:39 time: 1.5130 data_time: 0.0262 memory: 25684 grad_norm: 3.1466 loss: 1.7834 caption_loss_cls: 2.9470 detection_loss_cls: 0.0591 detection_loss_reg: 0.4328 semantic_segmentation_loss_cls: 0.0171 grounding_loss_reg: 4.3571 instance_segmentation_loss_cls: 0.0587 instance_segmentation_loss_reg: 0.4316 instance_segmentation_loss_poly: 1.2474 2024/01/02 10:37:13 - mmengine - INFO - Iter(train) [ 27500/640000] base_lr: 1.9909e-04 lr: 1.9909e-05 eta: 10 days, 23:08:02 time: 1.5280 data_time: 0.0261 memory: 25684 grad_norm: 3.0784 loss: 1.7816 caption_loss_cls: 2.9258 detection_loss_cls: 0.0587 detection_loss_reg: 0.4327 semantic_segmentation_loss_cls: 0.0170 grounding_loss_reg: 4.3261 instance_segmentation_loss_cls: 0.0580 instance_segmentation_loss_reg: 0.4290 instance_segmentation_loss_poly: 1.2404 2024/01/02 10:49:58 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 10:49:58 - mmengine - INFO - Iter(train) [ 28000/640000] base_lr: 1.9906e-04 lr: 1.9906e-05 eta: 10 days, 22:52:09 time: 1.5313 data_time: 0.0262 memory: 25684 grad_norm: 3.0647 loss: 1.7744 caption_loss_cls: 2.9113 detection_loss_cls: 0.0580 detection_loss_reg: 0.4318 semantic_segmentation_loss_cls: 0.0169 grounding_loss_reg: 4.2988 instance_segmentation_loss_cls: 0.0573 instance_segmentation_loss_reg: 0.4273 instance_segmentation_loss_poly: 1.2326 2024/01/02 10:49:58 - mmengine - INFO - Saving checkpoint at 28000 iterations 2024/01/02 11:03:24 - mmengine - INFO - Iter(train) [ 28500/640000] base_lr: 1.9902e-04 lr: 1.9902e-05 eta: 10 days, 22:50:57 time: 1.5352 data_time: 0.0264 memory: 25684 grad_norm: 3.0814 loss: 1.7787 caption_loss_cls: 2.9009 detection_loss_cls: 0.0574 detection_loss_reg: 0.4303 semantic_segmentation_loss_cls: 0.0167 grounding_loss_reg: 4.2764 instance_segmentation_loss_cls: 0.0572 instance_segmentation_loss_reg: 0.4298 instance_segmentation_loss_poly: 1.2346 2024/01/02 11:16:01 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 11:16:01 - mmengine - INFO - Iter(train) [ 29000/640000] base_lr: 1.9899e-04 lr: 1.9899e-05 eta: 10 days, 22:32:10 time: 1.5282 data_time: 0.0263 memory: 25684 grad_norm: 3.1369 loss: 1.7761 caption_loss_cls: 2.8862 detection_loss_cls: 0.0566 detection_loss_reg: 0.4268 semantic_segmentation_loss_cls: 0.0164 grounding_loss_reg: 4.2507 instance_segmentation_loss_cls: 0.0566 instance_segmentation_loss_reg: 0.4275 instance_segmentation_loss_poly: 1.2268 2024/01/02 11:28:42 - mmengine - INFO - Iter(train) [ 29500/640000] base_lr: 1.9895e-04 lr: 1.9895e-05 eta: 10 days, 22:15:01 time: 1.5352 data_time: 0.0263 memory: 25684 grad_norm: 3.1037 loss: 1.7538 caption_loss_cls: 2.8778 detection_loss_cls: 0.0562 detection_loss_reg: 0.4259 semantic_segmentation_loss_cls: 0.0163 grounding_loss_reg: 4.2234 instance_segmentation_loss_cls: 0.0561 instance_segmentation_loss_reg: 0.4253 instance_segmentation_loss_poly: 1.2203 2024/01/02 11:42:10 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 11:42:10 - mmengine - INFO - Iter(train) [ 30000/640000] base_lr: 1.9892e-04 lr: 1.9892e-05 eta: 10 days, 22:13:40 time: 1.5565 data_time: 0.0266 memory: 25684 grad_norm: 3.2057 loss: 1.7310 caption_loss_cls: 2.8630 detection_loss_cls: 0.0558 detection_loss_reg: 0.4244 semantic_segmentation_loss_cls: 0.0162 grounding_loss_reg: 4.2038 instance_segmentation_loss_cls: 0.0558 instance_segmentation_loss_reg: 0.4235 instance_segmentation_loss_poly: 1.2131 2024/01/02 11:42:10 - mmengine - INFO - Saving checkpoint at 30000 iterations 2024/01/02 11:54:58 - mmengine - INFO - Iter(train) [ 30500/640000] base_lr: 1.9888e-04 lr: 1.9888e-05 eta: 10 days, 21:59:01 time: 1.5581 data_time: 0.0266 memory: 25684 grad_norm: 3.3371 loss: 1.7183 caption_loss_cls: 2.8455 detection_loss_cls: 0.0555 detection_loss_reg: 0.4231 semantic_segmentation_loss_cls: 0.0162 grounding_loss_reg: 4.1795 instance_segmentation_loss_cls: 0.0555 instance_segmentation_loss_reg: 0.4234 instance_segmentation_loss_poly: 1.2113 2024/01/02 12:07:53 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 12:07:53 - mmengine - INFO - Iter(train) [ 31000/640000] base_lr: 1.9884e-04 lr: 1.9884e-05 eta: 10 days, 21:46:27 time: 1.5600 data_time: 0.0267 memory: 25684 grad_norm: 3.4716 loss: 1.7161 caption_loss_cls: 2.8374 detection_loss_cls: 0.0554 detection_loss_reg: 0.4236 semantic_segmentation_loss_cls: 0.0161 grounding_loss_reg: 4.1559 instance_segmentation_loss_cls: 0.0551 instance_segmentation_loss_reg: 0.4221 instance_segmentation_loss_poly: 1.2069 2024/01/02 12:20:01 - mmengine - INFO - Iter(train) [ 31500/640000] base_lr: 1.9881e-04 lr: 1.9881e-05 eta: 10 days, 21:18:46 time: 1.5418 data_time: 0.0267 memory: 25684 grad_norm: 3.5161 loss: 1.7277 caption_loss_cls: 2.8286 detection_loss_cls: 0.0550 detection_loss_reg: 0.4223 semantic_segmentation_loss_cls: 0.0159 grounding_loss_reg: 4.1352 instance_segmentation_loss_cls: 0.0548 instance_segmentation_loss_reg: 0.4217 instance_segmentation_loss_poly: 1.2039 2024/01/02 12:33:18 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 12:33:18 - mmengine - INFO - Iter(train) [ 32000/640000] base_lr: 1.9877e-04 lr: 1.9877e-05 eta: 10 days, 21:13:36 time: 1.5499 data_time: 0.0267 memory: 25684 grad_norm: 3.4883 loss: 1.7009 caption_loss_cls: 2.8153 detection_loss_cls: 0.0548 detection_loss_reg: 0.4200 semantic_segmentation_loss_cls: 0.0159 grounding_loss_reg: 4.1163 instance_segmentation_loss_cls: 0.0540 instance_segmentation_loss_reg: 0.4179 instance_segmentation_loss_poly: 1.1928 2024/01/02 12:33:18 - mmengine - INFO - Saving checkpoint at 32000 iterations 2024/01/02 12:46:12 - mmengine - INFO - Iter(train) [ 32500/640000] base_lr: 1.9873e-04 lr: 1.9873e-05 eta: 10 days, 21:00:42 time: 1.5418 data_time: 0.0265 memory: 25684 grad_norm: 3.4832 loss: 1.7062 caption_loss_cls: 2.7962 detection_loss_cls: 0.0545 detection_loss_reg: 0.4193 semantic_segmentation_loss_cls: 0.0158 grounding_loss_reg: 4.0969 instance_segmentation_loss_cls: 0.0540 instance_segmentation_loss_reg: 0.4178 instance_segmentation_loss_poly: 1.1916 2024/01/02 12:58:45 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 12:58:45 - mmengine - INFO - Iter(train) [ 33000/640000] base_lr: 1.9869e-04 lr: 1.9869e-05 eta: 10 days, 20:41:33 time: 1.5407 data_time: 0.0266 memory: 25684 grad_norm: 3.4334 loss: 1.7222 caption_loss_cls: 2.7875 detection_loss_cls: 0.0541 detection_loss_reg: 0.4177 semantic_segmentation_loss_cls: 0.0157 grounding_loss_reg: 4.0816 instance_segmentation_loss_cls: 0.0535 instance_segmentation_loss_reg: 0.4160 instance_segmentation_loss_poly: 1.1864 2024/01/02 13:11:24 - mmengine - INFO - Iter(train) [ 33500/640000] base_lr: 1.9865e-04 lr: 1.9865e-05 eta: 10 days, 20:24:26 time: 1.5402 data_time: 0.0267 memory: 25684 grad_norm: 3.4067 loss: 1.7251 caption_loss_cls: 2.7899 detection_loss_cls: 0.0539 detection_loss_reg: 0.4158 semantic_segmentation_loss_cls: 0.0155 grounding_loss_reg: 4.0670 instance_segmentation_loss_cls: 0.0529 instance_segmentation_loss_reg: 0.4139 instance_segmentation_loss_poly: 1.1789 2024/01/02 13:23:53 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 13:23:53 - mmengine - INFO - Iter(train) [ 34000/640000] base_lr: 1.9861e-04 lr: 1.9861e-05 eta: 10 days, 20:04:26 time: 1.5256 data_time: 0.0266 memory: 25684 grad_norm: 3.2784 loss: 1.7320 caption_loss_cls: 2.7811 detection_loss_cls: 0.0538 detection_loss_reg: 0.4151 semantic_segmentation_loss_cls: 0.0154 grounding_loss_reg: 4.0496 instance_segmentation_loss_cls: 0.0527 instance_segmentation_loss_reg: 0.4143 instance_segmentation_loss_poly: 1.1777 2024/01/02 13:23:53 - mmengine - INFO - Saving checkpoint at 34000 iterations 2024/01/02 13:36:46 - mmengine - INFO - Iter(train) [ 34500/640000] base_lr: 1.9857e-04 lr: 1.9857e-05 eta: 10 days, 19:51:50 time: 1.5269 data_time: 0.0265 memory: 25684 grad_norm: 3.1385 loss: 1.7302 caption_loss_cls: 2.7731 detection_loss_cls: 0.0532 detection_loss_reg: 0.4124 semantic_segmentation_loss_cls: 0.0154 grounding_loss_reg: 4.0257 instance_segmentation_loss_cls: 0.0525 instance_segmentation_loss_reg: 0.4134 instance_segmentation_loss_poly: 1.1749 2024/01/02 13:49:16 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 13:49:16 - mmengine - INFO - Iter(train) [ 35000/640000] base_lr: 1.9853e-04 lr: 1.9853e-05 eta: 10 days, 19:32:31 time: 1.5207 data_time: 0.0264 memory: 25684 grad_norm: 3.0161 loss: 1.7235 caption_loss_cls: 2.7736 detection_loss_cls: 0.0530 detection_loss_reg: 0.4121 semantic_segmentation_loss_cls: 0.0153 grounding_loss_reg: 4.0079 instance_segmentation_loss_cls: 0.0521 instance_segmentation_loss_reg: 0.4112 instance_segmentation_loss_poly: 1.1707 2024/01/02 14:01:54 - mmengine - INFO - Iter(train) [ 35500/640000] base_lr: 1.9849e-04 lr: 1.9849e-05 eta: 10 days, 19:15:36 time: 1.5283 data_time: 0.0265 memory: 25684 grad_norm: 2.9516 loss: 1.6997 caption_loss_cls: 2.7652 detection_loss_cls: 0.0526 detection_loss_reg: 0.4121 semantic_segmentation_loss_cls: 0.0152 grounding_loss_reg: 3.9912 instance_segmentation_loss_cls: 0.0521 instance_segmentation_loss_reg: 0.4117 instance_segmentation_loss_poly: 1.1714 2024/01/02 14:14:27 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 14:14:27 - mmengine - INFO - Iter(train) [ 36000/640000] base_lr: 1.9844e-04 lr: 1.9844e-05 eta: 10 days, 18:57:17 time: 1.5170 data_time: 0.0265 memory: 25684 grad_norm: 2.9732 loss: 1.7217 caption_loss_cls: 2.7567 detection_loss_cls: 0.0523 detection_loss_reg: 0.4095 semantic_segmentation_loss_cls: 0.0152 grounding_loss_reg: 3.9722 instance_segmentation_loss_cls: 0.0523 instance_segmentation_loss_reg: 0.4145 instance_segmentation_loss_poly: 1.1776 2024/01/02 14:14:27 - mmengine - INFO - Saving checkpoint at 36000 iterations 2024/01/02 14:27:36 - mmengine - INFO - Iter(train) [ 36500/640000] base_lr: 1.9840e-04 lr: 1.9840e-05 eta: 10 days, 18:49:12 time: 1.5210 data_time: 0.0266 memory: 25684 grad_norm: 2.9517 loss: 1.7134 caption_loss_cls: 2.7545 detection_loss_cls: 0.0520 detection_loss_reg: 0.4053 semantic_segmentation_loss_cls: 0.0152 grounding_loss_reg: 3.9562 instance_segmentation_loss_cls: 0.0517 instance_segmentation_loss_reg: 0.4091 instance_segmentation_loss_poly: 1.1656 2024/01/02 14:40:34 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 14:40:34 - mmengine - INFO - Iter(train) [ 37000/640000] base_lr: 1.9836e-04 lr: 1.9836e-05 eta: 10 days, 18:38:05 time: 1.5273 data_time: 0.0266 memory: 25684 grad_norm: 2.9346 loss: 1.6926 caption_loss_cls: 2.7512 detection_loss_cls: 0.0516 detection_loss_reg: 0.4031 semantic_segmentation_loss_cls: 0.0151 grounding_loss_reg: 3.9427 instance_segmentation_loss_cls: 0.0514 instance_segmentation_loss_reg: 0.4080 instance_segmentation_loss_poly: 1.1599 2024/01/02 14:53:03 - mmengine - INFO - Iter(train) [ 37500/640000] base_lr: 1.9831e-04 lr: 1.9831e-05 eta: 10 days, 18:19:01 time: 1.5248 data_time: 0.0266 memory: 25684 grad_norm: 2.9413 loss: 1.6903 caption_loss_cls: 2.7439 detection_loss_cls: 0.0513 detection_loss_reg: 0.4023 semantic_segmentation_loss_cls: 0.0151 grounding_loss_reg: 3.9312 instance_segmentation_loss_cls: 0.0514 instance_segmentation_loss_reg: 0.4091 instance_segmentation_loss_poly: 1.1617 2024/01/02 15:05:26 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 15:05:26 - mmengine - INFO - Iter(train) [ 38000/640000] base_lr: 1.9827e-04 lr: 1.9827e-05 eta: 10 days, 17:58:22 time: 1.5232 data_time: 0.0264 memory: 25684 grad_norm: 2.9234 loss: 1.6810 caption_loss_cls: 2.7404 detection_loss_cls: 0.0509 detection_loss_reg: 0.4007 semantic_segmentation_loss_cls: 0.0150 grounding_loss_reg: 3.9205 instance_segmentation_loss_cls: 0.0514 instance_segmentation_loss_reg: 0.4093 instance_segmentation_loss_poly: 1.1604 2024/01/02 15:05:26 - mmengine - INFO - Saving checkpoint at 38000 iterations 2024/01/02 15:18:49 - mmengine - INFO - Iter(train) [ 38500/640000] base_lr: 1.9822e-04 lr: 1.9822e-05 eta: 10 days, 17:53:55 time: 1.5307 data_time: 0.0266 memory: 25684 grad_norm: 2.8902 loss: 1.6686 caption_loss_cls: 2.7313 detection_loss_cls: 0.0508 detection_loss_reg: 0.4005 semantic_segmentation_loss_cls: 0.0150 grounding_loss_reg: 3.9047 instance_segmentation_loss_cls: 0.0510 instance_segmentation_loss_reg: 0.4077 instance_segmentation_loss_poly: 1.1553 2024/01/02 15:31:05 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 15:31:05 - mmengine - INFO - Iter(train) [ 39000/640000] base_lr: 1.9817e-04 lr: 1.9817e-05 eta: 10 days, 17:31:49 time: 1.5271 data_time: 0.0266 memory: 25684 grad_norm: 2.8814 loss: 1.6811 caption_loss_cls: 2.7221 detection_loss_cls: 0.0504 detection_loss_reg: 0.3987 semantic_segmentation_loss_cls: 0.0149 grounding_loss_reg: 3.8896 instance_segmentation_loss_cls: 0.0508 instance_segmentation_loss_reg: 0.4062 instance_segmentation_loss_poly: 1.1520 2024/01/02 15:43:56 - mmengine - INFO - Iter(train) [ 39500/640000] base_lr: 1.9813e-04 lr: 1.9813e-05 eta: 10 days, 17:18:55 time: 1.5304 data_time: 0.0266 memory: 25684 grad_norm: 2.8458 loss: 1.6726 caption_loss_cls: 2.7168 detection_loss_cls: 0.0502 detection_loss_reg: 0.3998 semantic_segmentation_loss_cls: 0.0149 grounding_loss_reg: 3.8742 instance_segmentation_loss_cls: 0.0508 instance_segmentation_loss_reg: 0.4063 instance_segmentation_loss_poly: 1.1492 2024/01/02 15:56:43 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 15:56:43 - mmengine - INFO - Iter(train) [ 40000/640000] base_lr: 1.9808e-04 lr: 1.9808e-05 eta: 10 days, 17:04:54 time: 1.5339 data_time: 0.0266 memory: 25684 grad_norm: 2.8333 loss: 1.6668 caption_loss_cls: 2.7112 detection_loss_cls: 0.0500 detection_loss_reg: 0.3997 semantic_segmentation_loss_cls: 0.0149 grounding_loss_reg: 3.8628 instance_segmentation_loss_cls: 0.0510 instance_segmentation_loss_reg: 0.4061 instance_segmentation_loss_poly: 1.1478 2024/01/02 15:56:43 - mmengine - INFO - Saving checkpoint at 40000 iterations 2024/01/02 16:08:23 - mmengine - INFO - Evaluating bbox... 2024/01/02 16:09:19 - mmengine - INFO - bbox_mAP_copypaste: 0.394 0.577 0.437 0.267 0.452 0.493 2024/01/02 16:09:19 - mmengine - INFO - Evaluating segm... 2024/01/02 16:10:30 - mmengine - INFO - segm_mAP_copypaste: 0.231 0.447 0.217 0.108 0.281 0.368 2024/01/02 16:12:39 - mmengine - INFO - Evaluating bbox... 2024/01/02 16:13:36 - mmengine - INFO - bbox_mAP_copypaste: 0.394 0.577 0.437 0.267 0.451 0.492 2024/01/02 16:19:13 - mmengine - INFO - per class results: 2024/01/02 16:19:13 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 76.57 | 88.3 | | building | 80.47 | 90.85 | | sky | 92.45 | 95.3 | | floor | 79.99 | 88.23 | | tree | 70.28 | 90.53 | | ceiling | 83.81 | 91.73 | | road | 79.22 | 88.69 | | bed | 86.4 | 94.11 | | windowpane | 62.48 | 79.71 | | grass | 61.17 | 68.1 | | cabinet | 58.01 | 73.32 | | sidewalk | 59.29 | 74.08 | | person | 78.64 | 88.37 | | earth | 39.68 | 54.5 | | door | 46.05 | 56.25 | | table | 56.1 | 79.76 | | mountain | 50.74 | 57.37 | | plant | 43.31 | 49.0 | | curtain | 71.45 | 86.17 | | chair | 51.13 | 63.34 | | car | 79.9 | 92.4 | | water | 58.73 | 77.11 | | painting | 70.12 | 83.15 | | sofa | 64.57 | 85.41 | | shelf | 38.02 | 51.86 | | house | 45.96 | 63.27 | | sea | 68.12 | 90.25 | | mirror | 63.56 | 71.01 | | rug | 63.75 | 68.95 | | field | 29.96 | 64.76 | | armchair | 36.1 | 47.49 | | seat | 63.76 | 80.35 | | fence | 35.59 | 75.45 | | desk | 36.65 | 58.9 | | rock | 37.67 | 84.83 | | wardrobe | 38.79 | 43.76 | | lamp | 56.87 | 72.16 | | bathtub | 61.75 | 87.78 | | railing | 27.5 | 38.45 | | cushion | 51.9 | 59.72 | | base | 25.43 | 35.62 | | box | 26.0 | 41.17 | | column | 49.87 | 66.21 | | signboard | 30.42 | 38.57 | | chest of drawers | 33.94 | 67.81 | | counter | 27.73 | 41.83 | | sand | 37.65 | 50.3 | | sink | 63.43 | 73.82 | | skyscraper | 49.5 | 60.5 | | fireplace | 67.5 | 84.04 | | refrigerator | 66.7 | 77.98 | | grandstand | 49.25 | 53.0 | | path | 18.68 | 23.26 | | stairs | 33.67 | 50.3 | | runway | 73.11 | 80.3 | | case | 40.15 | 66.25 | | pool table | 87.6 | 95.29 | | pillow | 50.19 | 62.31 | | screen door | 67.21 | 84.12 | | stairway | 38.49 | 55.07 | | river | 18.53 | 24.23 | | bridge | 54.72 | 89.77 | | bookcase | 27.62 | 44.42 | | blind | 34.58 | 36.52 | | coffee table | 41.82 | 46.7 | | toilet | 74.77 | 86.12 | | flower | 29.79 | 35.29 | | book | 41.22 | 59.07 | | hill | 1.09 | 1.11 | | bench | 44.1 | 61.76 | | countertop | 52.99 | 70.18 | | stove | 69.86 | 76.28 | | palm | 42.02 | 57.85 | | kitchen island | 29.07 | 47.08 | | computer | 55.64 | 61.3 | | swivel chair | 33.66 | 70.67 | | boat | 55.11 | 87.73 | | bar | 33.26 | 45.56 | | arcade machine | 61.74 | 66.77 | | hovel | 51.22 | 67.29 | | bus | 88.41 | 92.42 | | towel | 52.41 | 58.17 | | light | 38.97 | 45.5 | | truck | 38.33 | 57.69 | | tower | 2.45 | 2.82 | | chandelier | 59.11 | 71.12 | | awning | 26.89 | 34.74 | | streetlight | 21.85 | 46.76 | | booth | 33.63 | 36.55 | | television receiver | 59.02 | 73.76 | | airplane | 59.83 | 71.61 | | dirt track | 9.8 | 18.18 | | apparel | 24.14 | 42.94 | | pole | 18.94 | 29.02 | | land | 0.69 | 1.11 | | bannister | 8.33 | 26.91 | | escalator | 22.78 | 29.81 | | ottoman | 42.42 | 53.11 | | bottle | 22.86 | 29.73 | | buffet | 34.96 | 42.85 | | poster | 23.68 | 24.96 | | stage | 6.85 | 16.02 | | van | 26.87 | 29.11 | | ship | 3.1 | 3.16 | | fountain | 24.64 | 29.38 | | conveyer belt | 59.36 | 89.96 | | canopy | 44.55 | 54.74 | | washer | 61.62 | 73.71 | | plaything | 23.24 | 34.52 | | swimming pool | 41.65 | 43.81 | | stool | 25.51 | 28.34 | | barrel | 38.43 | 52.57 | | basket | 26.87 | 34.45 | | waterfall | 56.33 | 79.18 | | tent | 87.07 | 98.07 | | bag | 15.97 | 17.11 | | minibike | 71.28 | 81.2 | | cradle | 77.39 | 94.69 | | oven | 22.19 | 56.65 | | ball | 48.95 | 72.44 | | food | 52.29 | 64.05 | | step | 6.01 | 6.43 | | tank | 39.73 | 49.09 | | trade name | 3.34 | 3.38 | | microwave | 44.18 | 48.54 | | pot | 43.81 | 57.57 | | animal | 61.76 | 67.76 | | bicycle | 50.36 | 72.04 | | lake | 1.15 | 1.46 | | dishwasher | 49.4 | 63.4 | | screen | 62.58 | 69.73 | | blanket | 20.11 | 27.57 | | sculpture | 40.12 | 76.76 | | hood | 57.65 | 63.68 | | sconce | 29.2 | 46.29 | | vase | 31.24 | 44.65 | | traffic light | 26.34 | 39.36 | | tray | 5.11 | 9.46 | | ashcan | 32.43 | 49.42 | | fan | 47.08 | 54.61 | | pier | 28.57 | 40.58 | | crt screen | 2.05 | 6.86 | | plate | 42.45 | 54.24 | | monitor | 48.4 | 56.98 | | bulletin board | 32.25 | 37.21 | | shower | 0.0 | 0.0 | | radiator | 37.11 | 74.27 | | glass | 11.14 | 12.27 | | clock | 18.78 | 22.13 | | flag | 29.61 | 32.18 | +---------------------+-------+-------+ 2024/01/02 16:19:26 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.3940 coco/bbox_mAP_50: 0.5770 coco/bbox_mAP_75: 0.4370 coco/bbox_mAP_s: 0.2670 coco/bbox_mAP_m: 0.4510 coco/bbox_mAP_l: 0.4920 coco/segm_mAP: 0.2310 coco/segm_mAP_50: 0.4470 coco/segm_mAP_75: 0.2170 coco/segm_mAP_s: 0.1080 coco/segm_mAP_m: 0.2810 coco/segm_mAP_l: 0.3680 Bleu_1: 0.6881 Bleu_2: 0.5112 Bleu_3: 0.3654 Bleu_4: 0.2571 METEOR: 0.2357 ROUGE_L: 0.5068 CIDEr: 0.8330 SPICE: 0.1697 aAcc: 81.0700 mIoU: 43.6800 mAcc: 56.1900 visual-grounding/miou: 0.6080 visual-grounding/acc: 0.6657 data_time: 0.0121 time: 1.9114 2024/01/02 16:31:49 - mmengine - INFO - Iter(train) [ 40500/640000] base_lr: 1.9803e-04 lr: 1.9803e-05 eta: 10 days, 16:45:51 time: 1.5232 data_time: 0.0204 memory: 34774 grad_norm: 2.8424 loss: 1.6694 caption_loss_cls: 2.7083 detection_loss_cls: 0.0500 detection_loss_reg: 0.3994 semantic_segmentation_loss_cls: 0.0146 grounding_loss_reg: 3.8458 instance_segmentation_loss_cls: 0.0507 instance_segmentation_loss_reg: 0.4038 instance_segmentation_loss_poly: 1.1404 2024/01/02 16:44:14 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 16:44:14 - mmengine - INFO - Iter(train) [ 41000/640000] base_lr: 1.9798e-04 lr: 1.9798e-05 eta: 10 days, 16:26:40 time: 1.5149 data_time: 0.0203 memory: 25682 grad_norm: 2.8608 loss: 1.6718 caption_loss_cls: 2.7040 detection_loss_cls: 0.0498 detection_loss_reg: 0.3988 semantic_segmentation_loss_cls: 0.0146 grounding_loss_reg: 3.8299 instance_segmentation_loss_cls: 0.0507 instance_segmentation_loss_reg: 0.4027 instance_segmentation_loss_poly: 1.1392 2024/01/02 16:56:42 - mmengine - INFO - Iter(train) [ 41500/640000] base_lr: 1.9793e-04 lr: 1.9793e-05 eta: 10 days, 16:08:22 time: 1.5146 data_time: 0.0202 memory: 25682 grad_norm: 2.8760 loss: 1.6566 caption_loss_cls: 2.6998 detection_loss_cls: 0.0498 detection_loss_reg: 0.3987 semantic_segmentation_loss_cls: 0.0146 grounding_loss_reg: 3.8172 instance_segmentation_loss_cls: 0.0505 instance_segmentation_loss_reg: 0.4018 instance_segmentation_loss_poly: 1.1362 2024/01/02 17:09:03 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 17:09:03 - mmengine - INFO - Iter(train) [ 42000/640000] base_lr: 1.9788e-04 lr: 1.9788e-05 eta: 10 days, 15:48:43 time: 1.5144 data_time: 0.0203 memory: 25682 grad_norm: 2.8684 loss: 1.6686 caption_loss_cls: 2.6992 detection_loss_cls: 0.0495 detection_loss_reg: 0.3950 semantic_segmentation_loss_cls: 0.0145 grounding_loss_reg: 3.8086 instance_segmentation_loss_cls: 0.0504 instance_segmentation_loss_reg: 0.4017 instance_segmentation_loss_poly: 1.1339 2024/01/02 17:09:03 - mmengine - INFO - Saving checkpoint at 42000 iterations 2024/01/02 17:22:14 - mmengine - INFO - Iter(train) [ 42500/640000] base_lr: 1.9783e-04 lr: 1.9783e-05 eta: 10 days, 15:40:40 time: 1.5112 data_time: 0.0204 memory: 25682 grad_norm: 2.8788 loss: 1.6769 caption_loss_cls: 2.6962 detection_loss_cls: 0.0492 detection_loss_reg: 0.3934 semantic_segmentation_loss_cls: 0.0144 grounding_loss_reg: 3.7946 instance_segmentation_loss_cls: 0.0503 instance_segmentation_loss_reg: 0.4015 instance_segmentation_loss_poly: 1.1334 2024/01/02 17:35:01 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 17:35:01 - mmengine - INFO - Iter(train) [ 43000/640000] base_lr: 1.9778e-04 lr: 1.9778e-05 eta: 10 days, 15:27:11 time: 1.5191 data_time: 0.0205 memory: 25682 grad_norm: 2.9036 loss: 1.6743 caption_loss_cls: 2.6928 detection_loss_cls: 0.0487 detection_loss_reg: 0.3906 semantic_segmentation_loss_cls: 0.0144 grounding_loss_reg: 3.7830 instance_segmentation_loss_cls: 0.0499 instance_segmentation_loss_reg: 0.3996 instance_segmentation_loss_poly: 1.1298 2024/01/02 17:47:47 - mmengine - INFO - Iter(train) [ 43500/640000] base_lr: 1.9773e-04 lr: 1.9773e-05 eta: 10 days, 15:13:20 time: 1.5177 data_time: 0.0204 memory: 25682 grad_norm: 2.9076 loss: 1.6737 caption_loss_cls: 2.6956 detection_loss_cls: 0.0487 detection_loss_reg: 0.3931 semantic_segmentation_loss_cls: 0.0143 grounding_loss_reg: 3.7769 instance_segmentation_loss_cls: 0.0500 instance_segmentation_loss_reg: 0.3993 instance_segmentation_loss_poly: 1.1271 2024/01/02 18:00:41 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 18:00:41 - mmengine - INFO - Iter(train) [ 44000/640000] base_lr: 1.9768e-04 lr: 1.9768e-05 eta: 10 days, 15:01:20 time: 1.5195 data_time: 0.0204 memory: 25682 grad_norm: 2.9195 loss: 1.6702 caption_loss_cls: 2.7015 detection_loss_cls: 0.0484 detection_loss_reg: 0.3910 semantic_segmentation_loss_cls: 0.0142 grounding_loss_reg: 3.7703 instance_segmentation_loss_cls: 0.0500 instance_segmentation_loss_reg: 0.4003 instance_segmentation_loss_poly: 1.1289 2024/01/02 18:00:41 - mmengine - INFO - Saving checkpoint at 44000 iterations 2024/01/02 18:13:59 - mmengine - INFO - Iter(train) [ 44500/640000] base_lr: 1.9762e-04 lr: 1.9762e-05 eta: 10 days, 14:54:39 time: 1.5324 data_time: 0.0266 memory: 25682 grad_norm: 2.9446 loss: 1.6543 caption_loss_cls: 2.7008 detection_loss_cls: 0.0483 detection_loss_reg: 0.3914 semantic_segmentation_loss_cls: 0.0142 grounding_loss_reg: 3.7663 instance_segmentation_loss_cls: 0.0501 instance_segmentation_loss_reg: 0.4002 instance_segmentation_loss_poly: 1.1279 2024/01/02 18:26:36 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 18:26:36 - mmengine - INFO - Iter(train) [ 45000/640000] base_lr: 1.9757e-04 lr: 1.9757e-05 eta: 10 days, 14:38:46 time: 1.5354 data_time: 0.0266 memory: 25682 grad_norm: 2.9758 loss: 1.6637 caption_loss_cls: 2.6976 detection_loss_cls: 0.0484 detection_loss_reg: 0.3923 semantic_segmentation_loss_cls: 0.0142 grounding_loss_reg: 3.7561 instance_segmentation_loss_cls: 0.0503 instance_segmentation_loss_reg: 0.4005 instance_segmentation_loss_poly: 1.1273 2024/01/02 18:39:36 - mmengine - INFO - Iter(train) [ 45500/640000] base_lr: 1.9752e-04 lr: 1.9752e-05 eta: 10 days, 14:28:04 time: 1.5435 data_time: 0.0268 memory: 25682 grad_norm: 2.9615 loss: 1.6654 caption_loss_cls: 2.6944 detection_loss_cls: 0.0482 detection_loss_reg: 0.3908 semantic_segmentation_loss_cls: 0.0140 grounding_loss_reg: 3.7445 instance_segmentation_loss_cls: 0.0502 instance_segmentation_loss_reg: 0.3995 instance_segmentation_loss_poly: 1.1242 2024/01/02 18:52:09 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 18:52:09 - mmengine - INFO - Iter(train) [ 46000/640000] base_lr: 1.9746e-04 lr: 1.9746e-05 eta: 10 days, 14:11:22 time: 1.5462 data_time: 0.0268 memory: 25682 grad_norm: 2.9676 loss: 1.6462 caption_loss_cls: 2.6927 detection_loss_cls: 0.0482 detection_loss_reg: 0.3900 semantic_segmentation_loss_cls: 0.0140 grounding_loss_reg: 3.7341 instance_segmentation_loss_cls: 0.0501 instance_segmentation_loss_reg: 0.3984 instance_segmentation_loss_poly: 1.1220 2024/01/02 18:52:09 - mmengine - INFO - Saving checkpoint at 46000 iterations 2024/01/02 19:05:29 - mmengine - INFO - Iter(train) [ 46500/640000] base_lr: 1.9741e-04 lr: 1.9741e-05 eta: 10 days, 14:05:02 time: 1.5488 data_time: 0.0266 memory: 25682 grad_norm: 2.9264 loss: 1.6333 caption_loss_cls: 2.6948 detection_loss_cls: 0.0481 detection_loss_reg: 0.3917 semantic_segmentation_loss_cls: 0.0139 grounding_loss_reg: 3.7222 instance_segmentation_loss_cls: 0.0500 instance_segmentation_loss_reg: 0.3968 instance_segmentation_loss_poly: 1.1172 2024/01/02 19:18:18 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 19:18:18 - mmengine - INFO - Iter(train) [ 47000/640000] base_lr: 1.9735e-04 lr: 1.9735e-05 eta: 10 days, 13:51:40 time: 1.5490 data_time: 0.0266 memory: 25682 grad_norm: 2.8671 loss: 1.6203 caption_loss_cls: 2.6913 detection_loss_cls: 0.0483 detection_loss_reg: 0.3939 semantic_segmentation_loss_cls: 0.0138 grounding_loss_reg: 3.7100 instance_segmentation_loss_cls: 0.0501 instance_segmentation_loss_reg: 0.3991 instance_segmentation_loss_poly: 1.1201 2024/01/02 19:30:56 - mmengine - INFO - Iter(train) [ 47500/640000] base_lr: 1.9729e-04 lr: 1.9729e-05 eta: 10 days, 13:36:14 time: 1.5470 data_time: 0.0267 memory: 25682 grad_norm: 2.8887 loss: 1.6353 caption_loss_cls: 2.6950 detection_loss_cls: 0.0485 detection_loss_reg: 0.3949 semantic_segmentation_loss_cls: 0.0138 grounding_loss_reg: 3.6962 instance_segmentation_loss_cls: 0.0502 instance_segmentation_loss_reg: 0.3999 instance_segmentation_loss_poly: 1.1229 2024/01/02 19:44:11 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 19:44:11 - mmengine - INFO - Iter(train) [ 48000/640000] base_lr: 1.9724e-04 lr: 1.9724e-05 eta: 10 days, 13:28:27 time: 1.5523 data_time: 0.0268 memory: 25682 grad_norm: 2.8689 loss: 1.6395 caption_loss_cls: 2.6906 detection_loss_cls: 0.0482 detection_loss_reg: 0.3916 semantic_segmentation_loss_cls: 0.0137 grounding_loss_reg: 3.6876 instance_segmentation_loss_cls: 0.0503 instance_segmentation_loss_reg: 0.4011 instance_segmentation_loss_poly: 1.1239 2024/01/02 19:44:11 - mmengine - INFO - Saving checkpoint at 48000 iterations 2024/01/02 19:57:25 - mmengine - INFO - Iter(train) [ 48500/640000] base_lr: 1.9718e-04 lr: 1.9718e-05 eta: 10 days, 13:20:21 time: 1.5514 data_time: 0.0268 memory: 25682 grad_norm: 2.8386 loss: 1.6605 caption_loss_cls: 2.6986 detection_loss_cls: 0.0481 detection_loss_reg: 0.3902 semantic_segmentation_loss_cls: 0.0137 grounding_loss_reg: 3.6752 instance_segmentation_loss_cls: 0.0502 instance_segmentation_loss_reg: 0.4003 instance_segmentation_loss_poly: 1.1210 2024/01/02 20:10:31 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 20:10:31 - mmengine - INFO - Iter(train) [ 49000/640000] base_lr: 1.9712e-04 lr: 1.9712e-05 eta: 10 days, 13:10:37 time: 1.5588 data_time: 0.0268 memory: 25682 grad_norm: 2.7675 loss: 1.6346 caption_loss_cls: 2.7005 detection_loss_cls: 0.0481 detection_loss_reg: 0.3902 semantic_segmentation_loss_cls: 0.0137 grounding_loss_reg: 3.6695 instance_segmentation_loss_cls: 0.0501 instance_segmentation_loss_reg: 0.3998 instance_segmentation_loss_poly: 1.1186 2024/01/02 20:23:27 - mmengine - INFO - Iter(train) [ 49500/640000] base_lr: 1.9706e-04 lr: 1.9706e-05 eta: 10 days, 12:58:34 time: 1.5576 data_time: 0.0266 memory: 25682 grad_norm: 2.7352 loss: 1.6120 caption_loss_cls: 2.6968 detection_loss_cls: 0.0482 detection_loss_reg: 0.3912 semantic_segmentation_loss_cls: 0.0137 grounding_loss_reg: 3.6606 instance_segmentation_loss_cls: 0.0499 instance_segmentation_loss_reg: 0.3989 instance_segmentation_loss_poly: 1.1187 2024/01/02 20:36:24 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 20:36:24 - mmengine - INFO - Iter(train) [ 50000/640000] base_lr: 1.9700e-04 lr: 1.9700e-05 eta: 10 days, 12:46:55 time: 1.5637 data_time: 0.0268 memory: 25682 grad_norm: 2.7663 loss: 1.6113 caption_loss_cls: 2.6923 detection_loss_cls: 0.0480 detection_loss_reg: 0.3895 semantic_segmentation_loss_cls: 0.0137 grounding_loss_reg: 3.6490 instance_segmentation_loss_cls: 0.0500 instance_segmentation_loss_reg: 0.4008 instance_segmentation_loss_poly: 1.1227 2024/01/02 20:36:24 - mmengine - INFO - Saving checkpoint at 50000 iterations 2024/01/02 20:49:44 - mmengine - INFO - Iter(train) [ 50500/640000] base_lr: 1.9694e-04 lr: 1.9694e-05 eta: 10 days, 12:39:42 time: 1.5636 data_time: 0.0268 memory: 25682 grad_norm: 2.7800 loss: 1.6181 caption_loss_cls: 2.6987 detection_loss_cls: 0.0480 detection_loss_reg: 0.3895 semantic_segmentation_loss_cls: 0.0136 grounding_loss_reg: 3.6428 instance_segmentation_loss_cls: 0.0499 instance_segmentation_loss_reg: 0.4001 instance_segmentation_loss_poly: 1.1204 2024/01/02 21:01:57 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 21:01:57 - mmengine - INFO - Iter(train) [ 51000/640000] base_lr: 1.9688e-04 lr: 1.9688e-05 eta: 10 days, 12:19:19 time: 1.5546 data_time: 0.0268 memory: 25682 grad_norm: 2.8429 loss: 1.6179 caption_loss_cls: 2.6860 detection_loss_cls: 0.0476 detection_loss_reg: 0.3869 semantic_segmentation_loss_cls: 0.0135 grounding_loss_reg: 3.6321 instance_segmentation_loss_cls: 0.0499 instance_segmentation_loss_reg: 0.4009 instance_segmentation_loss_poly: 1.1202 2024/01/02 21:14:40 - mmengine - INFO - Iter(train) [ 51500/640000] base_lr: 1.9682e-04 lr: 1.9682e-05 eta: 10 days, 12:04:54 time: 1.5558 data_time: 0.0268 memory: 25682 grad_norm: 2.8464 loss: 1.6030 caption_loss_cls: 2.6805 detection_loss_cls: 0.0473 detection_loss_reg: 0.3850 semantic_segmentation_loss_cls: 0.0135 grounding_loss_reg: 3.6325 instance_segmentation_loss_cls: 0.0497 instance_segmentation_loss_reg: 0.4005 instance_segmentation_loss_poly: 1.1165 2024/01/02 21:27:24 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 21:27:24 - mmengine - INFO - Iter(train) [ 52000/640000] base_lr: 1.9676e-04 lr: 1.9676e-05 eta: 10 days, 11:50:53 time: 1.5483 data_time: 0.0266 memory: 25682 grad_norm: 2.8567 loss: 1.5927 caption_loss_cls: 2.6778 detection_loss_cls: 0.0471 detection_loss_reg: 0.3848 semantic_segmentation_loss_cls: 0.0135 grounding_loss_reg: 3.6225 instance_segmentation_loss_cls: 0.0496 instance_segmentation_loss_reg: 0.4009 instance_segmentation_loss_poly: 1.1175 2024/01/02 21:27:25 - mmengine - INFO - Saving checkpoint at 52000 iterations 2024/01/02 21:40:16 - mmengine - INFO - Iter(train) [ 52500/640000] base_lr: 1.9670e-04 lr: 1.9670e-05 eta: 10 days, 11:38:05 time: 1.5426 data_time: 0.0266 memory: 25682 grad_norm: 2.8431 loss: 1.5722 caption_loss_cls: 2.6750 detection_loss_cls: 0.0471 detection_loss_reg: 0.3858 semantic_segmentation_loss_cls: 0.0135 grounding_loss_reg: 3.6143 instance_segmentation_loss_cls: 0.0493 instance_segmentation_loss_reg: 0.3995 instance_segmentation_loss_poly: 1.1137 2024/01/02 21:53:15 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 21:53:15 - mmengine - INFO - Iter(train) [ 53000/640000] base_lr: 1.9663e-04 lr: 1.9663e-05 eta: 10 days, 11:26:44 time: 1.5407 data_time: 0.0266 memory: 25682 grad_norm: 2.8564 loss: 1.5810 caption_loss_cls: 2.6718 detection_loss_cls: 0.0471 detection_loss_reg: 0.3861 semantic_segmentation_loss_cls: 0.0134 grounding_loss_reg: 3.6037 instance_segmentation_loss_cls: 0.0492 instance_segmentation_loss_reg: 0.3986 instance_segmentation_loss_poly: 1.1113 2024/01/02 22:05:53 - mmengine - INFO - Iter(train) [ 53500/640000] base_lr: 1.9657e-04 lr: 1.9657e-05 eta: 10 days, 11:11:29 time: 1.5364 data_time: 0.0267 memory: 25682 grad_norm: 2.8751 loss: 1.5963 caption_loss_cls: 2.6657 detection_loss_cls: 0.0467 detection_loss_reg: 0.3839 semantic_segmentation_loss_cls: 0.0134 grounding_loss_reg: 3.5957 instance_segmentation_loss_cls: 0.0488 instance_segmentation_loss_reg: 0.3965 instance_segmentation_loss_poly: 1.1062 2024/01/02 22:18:41 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240101_222309 2024/01/02 22:18:41 - mmengine - INFO - Iter(train) [ 54000/640000] base_lr: 1.9651e-04 lr: 1.9651e-05 eta: 10 days, 10:58:05 time: 1.5341 data_time: 0.0265 memory: 25682 grad_norm: 2.8082 loss: 1.5804 caption_loss_cls: 2.6655 detection_loss_cls: 0.0467 detection_loss_reg: 0.3845 semantic_segmentation_loss_cls: 0.0133 grounding_loss_reg: 3.5835 instance_segmentation_loss_cls: 0.0485 instance_segmentation_loss_reg: 0.3934 instance_segmentation_loss_poly: 1.0992 2024/01/02 22:18:41 - mmengine - INFO - Saving checkpoint at 54000 iterations 2024/01/03 01:23:05 - mmengine - INFO - Iter(train) [ 54500/640000] base_lr: 1.9644e-04 lr: 1.9644e-05 eta: 9 days, 8:02:48 time: 1.5062 data_time: 0.0191 memory: 25630 grad_norm: 2.7826 loss: 1.5564 caption_loss_cls: 2.6667 detection_loss_cls: 0.0467 detection_loss_reg: 0.3854 semantic_segmentation_loss_cls: 0.0133 grounding_loss_reg: 3.5696 instance_segmentation_loss_cls: 0.0483 instance_segmentation_loss_reg: 0.3921 instance_segmentation_loss_poly: 1.0953 2024/01/03 01:34:20 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 01:34:20 - mmengine - INFO - Iter(train) [ 55000/640000] base_lr: 1.9638e-04 lr: 1.9638e-05 eta: 9 days, 5:38:37 time: 1.4920 data_time: 0.0187 memory: 25630 grad_norm: 2.7377 loss: 1.5690 caption_loss_cls: 2.6603 detection_loss_cls: 0.0468 detection_loss_reg: 0.3878 semantic_segmentation_loss_cls: 0.0132 grounding_loss_reg: 3.5647 instance_segmentation_loss_cls: 0.0483 instance_segmentation_loss_reg: 0.3930 instance_segmentation_loss_poly: 1.0961 2024/01/03 01:45:41 - mmengine - INFO - Iter(train) [ 55500/640000] base_lr: 1.9631e-04 lr: 1.9631e-05 eta: 9 days, 5:20:49 time: 1.4715 data_time: 0.0182 memory: 25630 grad_norm: 2.7205 loss: 1.5606 caption_loss_cls: 2.6527 detection_loss_cls: 0.0466 detection_loss_reg: 0.3871 semantic_segmentation_loss_cls: 0.0132 grounding_loss_reg: 3.5565 instance_segmentation_loss_cls: 0.0477 instance_segmentation_loss_reg: 0.3889 instance_segmentation_loss_poly: 1.0863 2024/01/03 01:57:29 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 01:57:29 - mmengine - INFO - Iter(train) [ 56000/640000] base_lr: 1.9625e-04 lr: 1.9625e-05 eta: 9 days, 7:16:50 time: 1.4572 data_time: 0.0178 memory: 25630 grad_norm: 2.7216 loss: 1.5637 caption_loss_cls: 2.6482 detection_loss_cls: 0.0465 detection_loss_reg: 0.3852 semantic_segmentation_loss_cls: 0.0131 grounding_loss_reg: 3.5488 instance_segmentation_loss_cls: 0.0476 instance_segmentation_loss_reg: 0.3884 instance_segmentation_loss_poly: 1.0849 2024/01/03 01:57:29 - mmengine - INFO - Saving checkpoint at 56000 iterations 2024/01/03 02:09:30 - mmengine - INFO - Iter(train) [ 56500/640000] base_lr: 1.9618e-04 lr: 1.9618e-05 eta: 9 days, 9:11:37 time: 1.4446 data_time: 0.0169 memory: 25630 grad_norm: 2.7187 loss: 1.5675 caption_loss_cls: 2.6423 detection_loss_cls: 0.0465 detection_loss_reg: 0.3852 semantic_segmentation_loss_cls: 0.0130 grounding_loss_reg: 3.5391 instance_segmentation_loss_cls: 0.0472 instance_segmentation_loss_reg: 0.3863 instance_segmentation_loss_poly: 1.0813 2024/01/03 02:21:21 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 02:21:21 - mmengine - INFO - Iter(train) [ 57000/640000] base_lr: 1.9611e-04 lr: 1.9611e-05 eta: 9 days, 9:52:01 time: 1.4275 data_time: 0.0164 memory: 25630 grad_norm: 2.6836 loss: 1.5598 caption_loss_cls: 2.6349 detection_loss_cls: 0.0465 detection_loss_reg: 0.3859 semantic_segmentation_loss_cls: 0.0130 grounding_loss_reg: 3.5339 instance_segmentation_loss_cls: 0.0470 instance_segmentation_loss_reg: 0.3857 instance_segmentation_loss_poly: 1.0809 2024/01/03 02:32:57 - mmengine - INFO - Iter(train) [ 57500/640000] base_lr: 1.9604e-04 lr: 1.9604e-05 eta: 9 days, 9:36:51 time: 1.4120 data_time: 0.0160 memory: 25630 grad_norm: 2.6667 loss: 1.5594 caption_loss_cls: 2.6313 detection_loss_cls: 0.0468 detection_loss_reg: 0.3867 semantic_segmentation_loss_cls: 0.0129 grounding_loss_reg: 3.5227 instance_segmentation_loss_cls: 0.0469 instance_segmentation_loss_reg: 0.3852 instance_segmentation_loss_poly: 1.0796 2024/01/03 02:44:55 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 02:44:55 - mmengine - INFO - Iter(train) [ 58000/640000] base_lr: 1.9597e-04 lr: 1.9597e-05 eta: 9 days, 10:16:14 time: 1.3996 data_time: 0.0156 memory: 25630 grad_norm: 2.6569 loss: 1.5496 caption_loss_cls: 2.6251 detection_loss_cls: 0.0467 detection_loss_reg: 0.3871 semantic_segmentation_loss_cls: 0.0129 grounding_loss_reg: 3.5150 instance_segmentation_loss_cls: 0.0466 instance_segmentation_loss_reg: 0.3839 instance_segmentation_loss_poly: 1.0766 2024/01/03 02:44:55 - mmengine - INFO - Saving checkpoint at 58000 iterations 2024/01/03 02:56:47 - mmengine - INFO - Iter(train) [ 58500/640000] base_lr: 1.9591e-04 lr: 1.9591e-05 eta: 9 days, 10:31:10 time: 1.4055 data_time: 0.0218 memory: 25630 grad_norm: 2.6876 loss: 1.5748 caption_loss_cls: 2.6238 detection_loss_cls: 0.0467 detection_loss_reg: 0.3878 semantic_segmentation_loss_cls: 0.0128 grounding_loss_reg: 3.5115 instance_segmentation_loss_cls: 0.0465 instance_segmentation_loss_reg: 0.3835 instance_segmentation_loss_poly: 1.0750 2024/01/03 03:08:50 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 03:08:50 - mmengine - INFO - Iter(train) [ 59000/640000] base_lr: 1.9584e-04 lr: 1.9584e-05 eta: 9 days, 11:00:08 time: 1.4172 data_time: 0.0218 memory: 25630 grad_norm: 2.6400 loss: 1.5446 caption_loss_cls: 2.6209 detection_loss_cls: 0.0465 detection_loss_reg: 0.3865 semantic_segmentation_loss_cls: 0.0128 grounding_loss_reg: 3.5050 instance_segmentation_loss_cls: 0.0463 instance_segmentation_loss_reg: 0.3821 instance_segmentation_loss_poly: 1.0714 2024/01/03 03:21:34 - mmengine - INFO - Iter(train) [ 59500/640000] base_lr: 1.9577e-04 lr: 1.9577e-05 eta: 9 days, 12:36:26 time: 1.4381 data_time: 0.0220 memory: 25630 grad_norm: 2.6129 loss: 1.5433 caption_loss_cls: 2.6172 detection_loss_cls: 0.0465 detection_loss_reg: 0.3863 semantic_segmentation_loss_cls: 0.0127 grounding_loss_reg: 3.4957 instance_segmentation_loss_cls: 0.0457 instance_segmentation_loss_reg: 0.3791 instance_segmentation_loss_poly: 1.0666 2024/01/03 03:33:14 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 03:33:14 - mmengine - INFO - Iter(train) [ 60000/640000] base_lr: 1.9569e-04 lr: 1.9569e-05 eta: 9 days, 12:09:38 time: 1.4361 data_time: 0.0219 memory: 25630 grad_norm: 2.6162 loss: 1.5354 caption_loss_cls: 2.6198 detection_loss_cls: 0.0462 detection_loss_reg: 0.3830 semantic_segmentation_loss_cls: 0.0126 grounding_loss_reg: 3.4866 instance_segmentation_loss_cls: 0.0453 instance_segmentation_loss_reg: 0.3768 instance_segmentation_loss_poly: 1.0617 2024/01/03 03:33:14 - mmengine - INFO - Saving checkpoint at 60000 iterations 2024/01/03 03:44:40 - mmengine - INFO - Evaluating bbox... 2024/01/03 03:45:37 - mmengine - INFO - bbox_mAP_copypaste: 0.424 0.609 0.471 0.276 0.480 0.537 2024/01/03 03:45:37 - mmengine - INFO - Evaluating segm... 2024/01/03 03:46:50 - mmengine - INFO - segm_mAP_copypaste: 0.260 0.504 0.244 0.128 0.306 0.409 2024/01/03 03:49:00 - mmengine - INFO - Evaluating bbox... 2024/01/03 03:49:57 - mmengine - INFO - bbox_mAP_copypaste: 0.424 0.609 0.472 0.276 0.480 0.536 2024/01/03 03:55:36 - mmengine - INFO - per class results: 2024/01/03 03:55:36 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 77.3 | 87.99 | | building | 79.36 | 91.26 | | sky | 92.69 | 97.7 | | floor | 81.45 | 88.13 | | tree | 69.32 | 81.16 | | ceiling | 83.98 | 89.43 | | road | 83.01 | 87.22 | | bed | 87.22 | 95.16 | | windowpane | 61.38 | 82.89 | | grass | 67.36 | 77.87 | | cabinet | 59.68 | 74.18 | | sidewalk | 64.63 | 78.6 | | person | 80.61 | 93.25 | | earth | 38.38 | 61.32 | | door | 49.85 | 67.98 | | table | 55.1 | 78.52 | | mountain | 55.2 | 81.97 | | plant | 47.91 | 54.94 | | curtain | 74.17 | 85.94 | | chair | 55.46 | 75.37 | | car | 80.22 | 93.8 | | water | 49.86 | 57.31 | | painting | 66.99 | 88.62 | | sofa | 62.14 | 90.76 | | shelf | 41.24 | 61.44 | | house | 40.46 | 72.07 | | sea | 61.37 | 93.94 | | mirror | 64.13 | 70.58 | | rug | 68.98 | 82.94 | | field | 22.14 | 25.3 | | armchair | 34.64 | 44.96 | | seat | 56.63 | 62.02 | | fence | 42.14 | 59.51 | | desk | 36.64 | 55.01 | | rock | 28.61 | 34.81 | | wardrobe | 42.92 | 55.81 | | lamp | 58.6 | 82.38 | | bathtub | 77.73 | 89.97 | | railing | 32.88 | 50.37 | | cushion | 51.78 | 59.43 | | base | 21.42 | 34.47 | | box | 18.07 | 20.36 | | column | 49.93 | 52.55 | | signboard | 30.72 | 36.94 | | chest of drawers | 44.67 | 52.25 | | counter | 23.72 | 40.4 | | sand | 37.6 | 50.4 | | sink | 67.25 | 81.12 | | skyscraper | 47.78 | 58.33 | | fireplace | 68.82 | 90.85 | | refrigerator | 70.44 | 80.26 | | grandstand | 43.83 | 73.89 | | path | 22.62 | 35.34 | | stairs | 24.74 | 27.28 | | runway | 73.56 | 86.76 | | case | 38.25 | 61.69 | | pool table | 91.73 | 94.24 | | pillow | 53.44 | 64.16 | | screen door | 57.02 | 83.47 | | stairway | 28.99 | 52.92 | | river | 27.45 | 48.84 | | bridge | 41.89 | 45.69 | | bookcase | 33.38 | 50.71 | | blind | 35.89 | 38.45 | | coffee table | 60.15 | 74.15 | | toilet | 79.65 | 89.03 | | flower | 31.61 | 46.24 | | book | 42.5 | 74.66 | | hill | 2.54 | 2.82 | | bench | 56.38 | 66.66 | | countertop | 51.69 | 77.39 | | stove | 64.38 | 86.62 | | palm | 50.24 | 71.73 | | kitchen island | 28.13 | 42.48 | | computer | 71.62 | 85.1 | | swivel chair | 13.83 | 14.31 | | boat | 74.61 | 88.14 | | bar | 29.42 | 34.25 | | arcade machine | 36.41 | 36.96 | | hovel | 8.04 | 8.29 | | bus | 91.44 | 93.99 | | towel | 55.92 | 66.91 | | light | 51.13 | 60.13 | | truck | 36.75 | 57.39 | | tower | 13.64 | 19.05 | | chandelier | 56.91 | 66.01 | | awning | 31.95 | 52.78 | | streetlight | 27.79 | 50.79 | | booth | 31.62 | 32.37 | | television receiver | 65.55 | 78.58 | | airplane | 59.49 | 65.44 | | dirt track | 0.78 | 0.86 | | apparel | 27.44 | 41.8 | | pole | 17.39 | 22.59 | | land | 7.52 | 16.59 | | bannister | 9.48 | 22.59 | | escalator | 4.66 | 4.68 | | ottoman | 42.29 | 79.15 | | bottle | 25.37 | 34.32 | | buffet | 42.3 | 64.86 | | poster | 11.66 | 14.61 | | stage | 7.03 | 7.93 | | van | 5.45 | 5.73 | | ship | 7.57 | 8.2 | | fountain | 30.33 | 30.95 | | conveyer belt | 55.94 | 57.54 | | canopy | 37.17 | 44.91 | | washer | 61.93 | 65.27 | | plaything | 36.35 | 54.17 | | swimming pool | 44.51 | 75.38 | | stool | 42.18 | 52.53 | | barrel | 12.93 | 22.64 | | basket | 33.23 | 51.69 | | waterfall | 62.7 | 68.76 | | tent | 89.0 | 97.51 | | bag | 22.68 | 27.87 | | minibike | 72.64 | 84.3 | | cradle | 73.73 | 97.06 | | oven | 5.61 | 5.93 | | ball | 41.97 | 68.55 | | food | 49.39 | 54.72 | | step | 0.94 | 1.03 | | tank | 45.2 | 48.33 | | trade name | 30.18 | 62.72 | | microwave | 80.81 | 96.04 | | pot | 40.09 | 47.66 | | animal | 65.45 | 72.97 | | bicycle | 57.64 | 83.48 | | lake | 3.1 | 4.15 | | dishwasher | 57.36 | 65.28 | | screen | 64.92 | 75.43 | | blanket | 10.08 | 11.69 | | sculpture | 51.98 | 58.53 | | hood | 56.45 | 68.06 | | sconce | 34.12 | 42.88 | | vase | 40.56 | 50.19 | | traffic light | 28.38 | 40.54 | | tray | 2.36 | 3.15 | | ashcan | 29.95 | 38.5 | | fan | 49.34 | 55.04 | | pier | 34.68 | 43.42 | | crt screen | 3.84 | 9.83 | | plate | 45.61 | 66.1 | | monitor | 9.29 | 9.61 | | bulletin board | 30.37 | 47.84 | | shower | 1.05 | 1.1 | | radiator | 47.74 | 49.77 | | glass | 14.54 | 15.37 | | clock | 21.43 | 28.27 | | flag | 40.72 | 44.86 | +---------------------+-------+-------+ 2024/01/03 03:55:58 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.4240 coco/bbox_mAP_50: 0.6090 coco/bbox_mAP_75: 0.4720 coco/bbox_mAP_s: 0.2760 coco/bbox_mAP_m: 0.4800 coco/bbox_mAP_l: 0.5360 coco/segm_mAP: 0.2600 coco/segm_mAP_50: 0.5040 coco/segm_mAP_75: 0.2440 coco/segm_mAP_s: 0.1280 coco/segm_mAP_m: 0.3060 coco/segm_mAP_l: 0.4090 Bleu_1: 0.7104 Bleu_2: 0.5347 Bleu_3: 0.3888 Bleu_4: 0.2786 METEOR: 0.2413 ROUGE_L: 0.5187 CIDEr: 0.8896 SPICE: 0.1767 aAcc: 81.6600 mIoU: 44.2700 mAcc: 55.9400 visual-grounding/miou: 0.6695 visual-grounding/acc: 0.7402 data_time: 0.0289 time: 1.9266 2024/01/03 04:07:19 - mmengine - INFO - Iter(train) [ 60500/640000] base_lr: 1.9562e-04 lr: 1.9562e-05 eta: 9 days, 11:30:44 time: 1.4284 data_time: 0.0178 memory: 34723 grad_norm: 2.6205 loss: 1.5363 caption_loss_cls: 2.6175 detection_loss_cls: 0.0459 detection_loss_reg: 0.3819 semantic_segmentation_loss_cls: 0.0126 grounding_loss_reg: 3.4783 instance_segmentation_loss_cls: 0.0453 instance_segmentation_loss_reg: 0.3768 instance_segmentation_loss_poly: 1.0614 2024/01/03 04:19:02 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 04:19:02 - mmengine - INFO - Iter(train) [ 61000/640000] base_lr: 1.9555e-04 lr: 1.9555e-05 eta: 9 days, 11:12:55 time: 1.4263 data_time: 0.0180 memory: 25631 grad_norm: 2.6434 loss: 1.5329 caption_loss_cls: 2.6114 detection_loss_cls: 0.0460 detection_loss_reg: 0.3829 semantic_segmentation_loss_cls: 0.0126 grounding_loss_reg: 3.4725 instance_segmentation_loss_cls: 0.0449 instance_segmentation_loss_reg: 0.3744 instance_segmentation_loss_poly: 1.0543 2024/01/03 04:30:14 - mmengine - INFO - Iter(train) [ 61500/640000] base_lr: 1.9548e-04 lr: 1.9548e-05 eta: 9 days, 10:16:57 time: 1.4202 data_time: 0.0182 memory: 25631 grad_norm: 2.6604 loss: 1.5421 caption_loss_cls: 2.6087 detection_loss_cls: 0.0459 detection_loss_reg: 0.3824 semantic_segmentation_loss_cls: 0.0126 grounding_loss_reg: 3.4702 instance_segmentation_loss_cls: 0.0450 instance_segmentation_loss_reg: 0.3765 instance_segmentation_loss_poly: 1.0593 2024/01/03 04:41:54 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 04:41:54 - mmengine - INFO - Iter(train) [ 62000/640000] base_lr: 1.9540e-04 lr: 1.9540e-05 eta: 9 days, 10:00:20 time: 1.4157 data_time: 0.0185 memory: 25631 grad_norm: 2.6650 loss: 1.5704 caption_loss_cls: 2.6095 detection_loss_cls: 0.0457 detection_loss_reg: 0.3830 semantic_segmentation_loss_cls: 0.0126 grounding_loss_reg: 3.4650 instance_segmentation_loss_cls: 0.0450 instance_segmentation_loss_reg: 0.3756 instance_segmentation_loss_poly: 1.0575 2024/01/03 04:41:54 - mmengine - INFO - Saving checkpoint at 62000 iterations 2024/01/03 04:53:51 - mmengine - INFO - Iter(train) [ 62500/640000] base_lr: 1.9533e-04 lr: 1.9533e-05 eta: 9 days, 10:03:37 time: 1.4169 data_time: 0.0202 memory: 25631 grad_norm: 2.6732 loss: 1.5694 caption_loss_cls: 2.6054 detection_loss_cls: 0.0457 detection_loss_reg: 0.3838 semantic_segmentation_loss_cls: 0.0125 grounding_loss_reg: 3.4570 instance_segmentation_loss_cls: 0.0449 instance_segmentation_loss_reg: 0.3757 instance_segmentation_loss_poly: 1.0556 2024/01/03 05:05:24 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 05:05:24 - mmengine - INFO - Iter(train) [ 63000/640000] base_lr: 1.9526e-04 lr: 1.9526e-05 eta: 9 days, 9:38:47 time: 1.4095 data_time: 0.0204 memory: 25631 grad_norm: 2.6966 loss: 1.5782 caption_loss_cls: 2.6055 detection_loss_cls: 0.0456 detection_loss_reg: 0.3832 semantic_segmentation_loss_cls: 0.0125 grounding_loss_reg: 3.4468 instance_segmentation_loss_cls: 0.0449 instance_segmentation_loss_reg: 0.3769 instance_segmentation_loss_poly: 1.0558 2024/01/03 05:16:59 - mmengine - INFO - Iter(train) [ 63500/640000] base_lr: 1.9518e-04 lr: 1.9518e-05 eta: 9 days, 9:18:42 time: 1.3922 data_time: 0.0204 memory: 25631 grad_norm: 2.7139 loss: 1.5734 caption_loss_cls: 2.5893 detection_loss_cls: 0.0456 detection_loss_reg: 0.3824 semantic_segmentation_loss_cls: 0.0125 grounding_loss_reg: 3.4397 instance_segmentation_loss_cls: 0.0450 instance_segmentation_loss_reg: 0.3780 instance_segmentation_loss_poly: 1.0573 2024/01/03 05:28:34 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 05:28:34 - mmengine - INFO - Iter(train) [ 64000/640000] base_lr: 1.9511e-04 lr: 1.9511e-05 eta: 9 days, 8:58:01 time: 1.3909 data_time: 0.0208 memory: 25631 grad_norm: 2.7415 loss: 1.5803 caption_loss_cls: 2.5842 detection_loss_cls: 0.0455 detection_loss_reg: 0.3823 semantic_segmentation_loss_cls: 0.0125 grounding_loss_reg: 3.4269 instance_segmentation_loss_cls: 0.0448 instance_segmentation_loss_reg: 0.3757 instance_segmentation_loss_poly: 1.0527 2024/01/03 05:28:34 - mmengine - INFO - Saving checkpoint at 64000 iterations 2024/01/03 05:40:48 - mmengine - INFO - Iter(train) [ 64500/640000] base_lr: 1.9503e-04 lr: 1.9503e-05 eta: 9 days, 9:15:04 time: 1.4020 data_time: 0.0259 memory: 25631 grad_norm: 2.7263 loss: 1.5684 caption_loss_cls: 2.5844 detection_loss_cls: 0.0457 detection_loss_reg: 0.3828 semantic_segmentation_loss_cls: 0.0124 grounding_loss_reg: 3.4199 instance_segmentation_loss_cls: 0.0448 instance_segmentation_loss_reg: 0.3776 instance_segmentation_loss_poly: 1.0556 2024/01/03 05:52:17 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 05:52:17 - mmengine - INFO - Iter(train) [ 65000/640000] base_lr: 1.9495e-04 lr: 1.9495e-05 eta: 9 days, 8:49:59 time: 1.3987 data_time: 0.0259 memory: 25631 grad_norm: 2.7428 loss: 1.5676 caption_loss_cls: 2.5755 detection_loss_cls: 0.0456 detection_loss_reg: 0.3821 semantic_segmentation_loss_cls: 0.0124 grounding_loss_reg: 3.4133 instance_segmentation_loss_cls: 0.0445 instance_segmentation_loss_reg: 0.3767 instance_segmentation_loss_poly: 1.0533 2024/01/03 06:03:24 - mmengine - INFO - Iter(train) [ 65500/640000] base_lr: 1.9488e-04 lr: 1.9488e-05 eta: 9 days, 8:07:06 time: 1.3974 data_time: 0.0258 memory: 25631 grad_norm: 2.7670 loss: 1.5563 caption_loss_cls: 2.5729 detection_loss_cls: 0.0456 detection_loss_reg: 0.3820 semantic_segmentation_loss_cls: 0.0124 grounding_loss_reg: 3.4018 instance_segmentation_loss_cls: 0.0441 instance_segmentation_loss_reg: 0.3752 instance_segmentation_loss_poly: 1.0502 2024/01/03 06:15:14 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 06:15:14 - mmengine - INFO - Iter(train) [ 66000/640000] base_lr: 1.9480e-04 lr: 1.9480e-05 eta: 9 days, 8:01:16 time: 1.3997 data_time: 0.0258 memory: 25631 grad_norm: 2.7818 loss: 1.5510 caption_loss_cls: 2.5766 detection_loss_cls: 0.0457 detection_loss_reg: 0.3839 semantic_segmentation_loss_cls: 0.0124 grounding_loss_reg: 3.3980 instance_segmentation_loss_cls: 0.0440 instance_segmentation_loss_reg: 0.3746 instance_segmentation_loss_poly: 1.0485 2024/01/03 06:15:14 - mmengine - INFO - Saving checkpoint at 66000 iterations 2024/01/03 06:26:42 - mmengine - INFO - Iter(train) [ 66500/640000] base_lr: 1.9472e-04 lr: 1.9472e-05 eta: 9 days, 7:39:10 time: 1.3927 data_time: 0.0253 memory: 25631 grad_norm: 2.8045 loss: 1.5481 caption_loss_cls: 2.5629 detection_loss_cls: 0.0456 detection_loss_reg: 0.3819 semantic_segmentation_loss_cls: 0.0123 grounding_loss_reg: 3.3932 instance_segmentation_loss_cls: 0.0439 instance_segmentation_loss_reg: 0.3754 instance_segmentation_loss_poly: 1.0506 2024/01/03 06:38:27 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 06:38:27 - mmengine - INFO - Iter(train) [ 67000/640000] base_lr: 1.9464e-04 lr: 1.9464e-05 eta: 9 days, 7:29:24 time: 1.3958 data_time: 0.0253 memory: 25631 grad_norm: 2.7837 loss: 1.5327 caption_loss_cls: 2.5609 detection_loss_cls: 0.0454 detection_loss_reg: 0.3788 semantic_segmentation_loss_cls: 0.0123 grounding_loss_reg: 3.3885 instance_segmentation_loss_cls: 0.0438 instance_segmentation_loss_reg: 0.3752 instance_segmentation_loss_poly: 1.0508 2024/01/03 06:49:47 - mmengine - INFO - Iter(train) [ 67500/640000] base_lr: 1.9456e-04 lr: 1.9456e-05 eta: 9 days, 7:01:41 time: 1.3917 data_time: 0.0253 memory: 25631 grad_norm: 2.7746 loss: 1.5375 caption_loss_cls: 2.5530 detection_loss_cls: 0.0451 detection_loss_reg: 0.3767 semantic_segmentation_loss_cls: 0.0123 grounding_loss_reg: 3.3815 instance_segmentation_loss_cls: 0.0438 instance_segmentation_loss_reg: 0.3765 instance_segmentation_loss_poly: 1.0514 2024/01/03 07:01:04 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 07:01:04 - mmengine - INFO - Iter(train) [ 68000/640000] base_lr: 1.9448e-04 lr: 1.9448e-05 eta: 9 days, 6:33:52 time: 1.3875 data_time: 0.0253 memory: 25631 grad_norm: 2.7536 loss: 1.5442 caption_loss_cls: 2.5451 detection_loss_cls: 0.0451 detection_loss_reg: 0.3783 semantic_segmentation_loss_cls: 0.0122 grounding_loss_reg: 3.3771 instance_segmentation_loss_cls: 0.0435 instance_segmentation_loss_reg: 0.3751 instance_segmentation_loss_poly: 1.0480 2024/01/03 07:01:04 - mmengine - INFO - Saving checkpoint at 68000 iterations 2024/01/03 07:13:13 - mmengine - INFO - Iter(train) [ 68500/640000] base_lr: 1.9440e-04 lr: 1.9440e-05 eta: 9 days, 6:40:31 time: 1.3860 data_time: 0.0251 memory: 25631 grad_norm: 2.7695 loss: 1.5505 caption_loss_cls: 2.5374 detection_loss_cls: 0.0451 detection_loss_reg: 0.3780 semantic_segmentation_loss_cls: 0.0122 grounding_loss_reg: 3.3731 instance_segmentation_loss_cls: 0.0437 instance_segmentation_loss_reg: 0.3759 instance_segmentation_loss_poly: 1.0504 2024/01/03 07:25:01 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 07:25:01 - mmengine - INFO - Iter(train) [ 69000/640000] base_lr: 1.9432e-04 lr: 1.9432e-05 eta: 9 days, 6:32:57 time: 1.3906 data_time: 0.0252 memory: 25631 grad_norm: 2.7699 loss: 1.5625 caption_loss_cls: 2.5372 detection_loss_cls: 0.0454 detection_loss_reg: 0.3810 semantic_segmentation_loss_cls: 0.0122 grounding_loss_reg: 3.3709 instance_segmentation_loss_cls: 0.0434 instance_segmentation_loss_reg: 0.3748 instance_segmentation_loss_poly: 1.0482 2024/01/03 07:36:22 - mmengine - INFO - Iter(train) [ 69500/640000] base_lr: 1.9424e-04 lr: 1.9424e-05 eta: 9 days, 6:08:39 time: 1.3943 data_time: 0.0253 memory: 25631 grad_norm: 2.7479 loss: 1.5676 caption_loss_cls: 2.5415 detection_loss_cls: 0.0449 detection_loss_reg: 0.3784 semantic_segmentation_loss_cls: 0.0121 grounding_loss_reg: 3.3666 instance_segmentation_loss_cls: 0.0436 instance_segmentation_loss_reg: 0.3769 instance_segmentation_loss_poly: 1.0514 2024/01/03 07:48:08 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 07:48:08 - mmengine - INFO - Iter(train) [ 70000/640000] base_lr: 1.9415e-04 lr: 1.9415e-05 eta: 9 days, 6:00:17 time: 1.3935 data_time: 0.0253 memory: 25631 grad_norm: 2.7492 loss: 1.5568 caption_loss_cls: 2.5442 detection_loss_cls: 0.0451 detection_loss_reg: 0.3784 semantic_segmentation_loss_cls: 0.0121 grounding_loss_reg: 3.3631 instance_segmentation_loss_cls: 0.0436 instance_segmentation_loss_reg: 0.3761 instance_segmentation_loss_poly: 1.0489 2024/01/03 07:48:08 - mmengine - INFO - Saving checkpoint at 70000 iterations 2024/01/03 08:00:21 - mmengine - INFO - Iter(train) [ 70500/640000] base_lr: 1.9407e-04 lr: 1.9407e-05 eta: 9 days, 6:06:52 time: 1.4045 data_time: 0.0254 memory: 25631 grad_norm: 2.7430 loss: 1.5557 caption_loss_cls: 2.5422 detection_loss_cls: 0.0450 detection_loss_reg: 0.3777 semantic_segmentation_loss_cls: 0.0122 grounding_loss_reg: 3.3575 instance_segmentation_loss_cls: 0.0434 instance_segmentation_loss_reg: 0.3758 instance_segmentation_loss_poly: 1.0491 2024/01/03 08:11:42 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 08:11:42 - mmengine - INFO - Iter(train) [ 71000/640000] base_lr: 1.9399e-04 lr: 1.9399e-05 eta: 9 days, 5:43:17 time: 1.3985 data_time: 0.0253 memory: 25631 grad_norm: 2.7950 loss: 1.5592 caption_loss_cls: 2.5505 detection_loss_cls: 0.0447 detection_loss_reg: 0.3754 semantic_segmentation_loss_cls: 0.0122 grounding_loss_reg: 3.3514 instance_segmentation_loss_cls: 0.0433 instance_segmentation_loss_reg: 0.3761 instance_segmentation_loss_poly: 1.0518 2024/01/03 08:23:02 - mmengine - INFO - Iter(train) [ 71500/640000] base_lr: 1.9390e-04 lr: 1.9390e-05 eta: 9 days, 5:19:52 time: 1.3986 data_time: 0.0251 memory: 25631 grad_norm: 2.8107 loss: 1.5593 caption_loss_cls: 2.5508 detection_loss_cls: 0.0448 detection_loss_reg: 0.3760 semantic_segmentation_loss_cls: 0.0121 grounding_loss_reg: 3.3460 instance_segmentation_loss_cls: 0.0431 instance_segmentation_loss_reg: 0.3735 instance_segmentation_loss_poly: 1.0462 2024/01/03 08:34:39 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 08:34:39 - mmengine - INFO - Iter(train) [ 72000/640000] base_lr: 1.9382e-04 lr: 1.9382e-05 eta: 9 days, 5:06:06 time: 1.4034 data_time: 0.0251 memory: 25631 grad_norm: 2.7802 loss: 1.5368 caption_loss_cls: 2.5433 detection_loss_cls: 0.0449 detection_loss_reg: 0.3765 semantic_segmentation_loss_cls: 0.0121 grounding_loss_reg: 3.3409 instance_segmentation_loss_cls: 0.0431 instance_segmentation_loss_reg: 0.3729 instance_segmentation_loss_poly: 1.0442 2024/01/03 08:34:39 - mmengine - INFO - Saving checkpoint at 72000 iterations 2024/01/03 08:46:41 - mmengine - INFO - Iter(train) [ 72500/640000] base_lr: 1.9373e-04 lr: 1.9373e-05 eta: 9 days, 5:05:26 time: 1.4019 data_time: 0.0251 memory: 25631 grad_norm: 2.7795 loss: 1.5393 caption_loss_cls: 2.5404 detection_loss_cls: 0.0448 detection_loss_reg: 0.3768 semantic_segmentation_loss_cls: 0.0121 grounding_loss_reg: 3.3369 instance_segmentation_loss_cls: 0.0432 instance_segmentation_loss_reg: 0.3730 instance_segmentation_loss_poly: 1.0449 2024/01/03 08:57:50 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 08:57:50 - mmengine - INFO - Iter(train) [ 73000/640000] base_lr: 1.9365e-04 lr: 1.9365e-05 eta: 9 days, 4:37:41 time: 1.3922 data_time: 0.0250 memory: 25631 grad_norm: 2.7872 loss: 1.5272 caption_loss_cls: 2.5310 detection_loss_cls: 0.0445 detection_loss_reg: 0.3745 semantic_segmentation_loss_cls: 0.0120 grounding_loss_reg: 3.3328 instance_segmentation_loss_cls: 0.0430 instance_segmentation_loss_reg: 0.3720 instance_segmentation_loss_poly: 1.0428 2024/01/03 09:09:48 - mmengine - INFO - Iter(train) [ 73500/640000] base_lr: 1.9356e-04 lr: 1.9356e-05 eta: 9 days, 4:34:35 time: 1.4015 data_time: 0.0251 memory: 25631 grad_norm: 2.7327 loss: 1.5085 caption_loss_cls: 2.5259 detection_loss_cls: 0.0447 detection_loss_reg: 0.3761 semantic_segmentation_loss_cls: 0.0121 grounding_loss_reg: 3.3297 instance_segmentation_loss_cls: 0.0429 instance_segmentation_loss_reg: 0.3710 instance_segmentation_loss_poly: 1.0402 2024/01/03 09:21:04 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 09:21:04 - mmengine - INFO - Iter(train) [ 74000/640000] base_lr: 1.9347e-04 lr: 1.9347e-05 eta: 9 days, 4:11:17 time: 1.3939 data_time: 0.0250 memory: 25631 grad_norm: 2.7440 loss: 1.5238 caption_loss_cls: 2.5280 detection_loss_cls: 0.0447 detection_loss_reg: 0.3776 semantic_segmentation_loss_cls: 0.0120 grounding_loss_reg: 3.3296 instance_segmentation_loss_cls: 0.0428 instance_segmentation_loss_reg: 0.3700 instance_segmentation_loss_poly: 1.0385 2024/01/03 09:21:04 - mmengine - INFO - Saving checkpoint at 74000 iterations 2024/01/03 09:33:12 - mmengine - INFO - Iter(train) [ 74500/640000] base_lr: 1.9339e-04 lr: 1.9339e-05 eta: 9 days, 4:12:01 time: 1.3925 data_time: 0.0250 memory: 25631 grad_norm: 2.6898 loss: 1.5141 caption_loss_cls: 2.5223 detection_loss_cls: 0.0444 detection_loss_reg: 0.3749 semantic_segmentation_loss_cls: 0.0120 grounding_loss_reg: 3.3297 instance_segmentation_loss_cls: 0.0431 instance_segmentation_loss_reg: 0.3723 instance_segmentation_loss_poly: 1.0441 2024/01/03 09:44:24 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 09:44:24 - mmengine - INFO - Iter(train) [ 75000/640000] base_lr: 1.9330e-04 lr: 1.9330e-05 eta: 9 days, 3:47:16 time: 1.3903 data_time: 0.0250 memory: 25631 grad_norm: 2.6892 loss: 1.5182 caption_loss_cls: 2.5261 detection_loss_cls: 0.0440 detection_loss_reg: 0.3721 semantic_segmentation_loss_cls: 0.0120 grounding_loss_reg: 3.3217 instance_segmentation_loss_cls: 0.0432 instance_segmentation_loss_reg: 0.3726 instance_segmentation_loss_poly: 1.0430 2024/01/03 09:56:02 - mmengine - INFO - Iter(train) [ 75500/640000] base_lr: 1.9321e-04 lr: 1.9321e-05 eta: 9 days, 3:34:58 time: 1.3950 data_time: 0.0251 memory: 25631 grad_norm: 2.7185 loss: 1.5215 caption_loss_cls: 2.5283 detection_loss_cls: 0.0439 detection_loss_reg: 0.3701 semantic_segmentation_loss_cls: 0.0120 grounding_loss_reg: 3.3210 instance_segmentation_loss_cls: 0.0430 instance_segmentation_loss_reg: 0.3711 instance_segmentation_loss_poly: 1.0403 2024/01/03 10:07:55 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 10:07:55 - mmengine - INFO - Iter(train) [ 76000/640000] base_lr: 1.9312e-04 lr: 1.9312e-05 eta: 9 days, 3:28:26 time: 1.3989 data_time: 0.0251 memory: 25631 grad_norm: 2.7068 loss: 1.5179 caption_loss_cls: 2.5302 detection_loss_cls: 0.0439 detection_loss_reg: 0.3722 semantic_segmentation_loss_cls: 0.0120 grounding_loss_reg: 3.3171 instance_segmentation_loss_cls: 0.0431 instance_segmentation_loss_reg: 0.3715 instance_segmentation_loss_poly: 1.0406 2024/01/03 10:07:55 - mmengine - INFO - Saving checkpoint at 76000 iterations 2024/01/03 10:20:01 - mmengine - INFO - Iter(train) [ 76500/640000] base_lr: 1.9303e-04 lr: 1.9303e-05 eta: 9 days, 3:27:31 time: 1.3999 data_time: 0.0253 memory: 25631 grad_norm: 2.6846 loss: 1.5027 caption_loss_cls: 2.5259 detection_loss_cls: 0.0440 detection_loss_reg: 0.3729 semantic_segmentation_loss_cls: 0.0120 grounding_loss_reg: 3.3129 instance_segmentation_loss_cls: 0.0433 instance_segmentation_loss_reg: 0.3733 instance_segmentation_loss_poly: 1.0429 2024/01/03 10:31:06 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 10:31:06 - mmengine - INFO - Iter(train) [ 77000/640000] base_lr: 1.9294e-04 lr: 1.9294e-05 eta: 9 days, 3:01:18 time: 1.3990 data_time: 0.0253 memory: 25631 grad_norm: 2.6689 loss: 1.5081 caption_loss_cls: 2.5232 detection_loss_cls: 0.0440 detection_loss_reg: 0.3738 semantic_segmentation_loss_cls: 0.0120 grounding_loss_reg: 3.3078 instance_segmentation_loss_cls: 0.0434 instance_segmentation_loss_reg: 0.3740 instance_segmentation_loss_poly: 1.0438 2024/01/03 10:42:28 - mmengine - INFO - Iter(train) [ 77500/640000] base_lr: 1.9285e-04 lr: 1.9285e-05 eta: 9 days, 2:42:15 time: 1.3899 data_time: 0.0251 memory: 25631 grad_norm: 2.7273 loss: 1.5151 caption_loss_cls: 2.5239 detection_loss_cls: 0.0436 detection_loss_reg: 0.3718 semantic_segmentation_loss_cls: 0.0120 grounding_loss_reg: 3.3027 instance_segmentation_loss_cls: 0.0434 instance_segmentation_loss_reg: 0.3724 instance_segmentation_loss_poly: 1.0406 2024/01/03 10:53:53 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 10:53:53 - mmengine - INFO - Iter(train) [ 78000/640000] base_lr: 1.9276e-04 lr: 1.9276e-05 eta: 9 days, 2:24:35 time: 1.3920 data_time: 0.0252 memory: 25631 grad_norm: 2.7152 loss: 1.5196 caption_loss_cls: 2.5297 detection_loss_cls: 0.0433 detection_loss_reg: 0.3696 semantic_segmentation_loss_cls: 0.0120 grounding_loss_reg: 3.2972 instance_segmentation_loss_cls: 0.0434 instance_segmentation_loss_reg: 0.3732 instance_segmentation_loss_poly: 1.0407 2024/01/03 10:53:53 - mmengine - INFO - Saving checkpoint at 78000 iterations 2024/01/03 11:05:56 - mmengine - INFO - Iter(train) [ 78500/640000] base_lr: 1.9267e-04 lr: 1.9267e-05 eta: 9 days, 2:21:53 time: 1.3909 data_time: 0.0251 memory: 25631 grad_norm: 2.7200 loss: 1.5066 caption_loss_cls: 2.5254 detection_loss_cls: 0.0431 detection_loss_reg: 0.3687 semantic_segmentation_loss_cls: 0.0120 grounding_loss_reg: 3.2906 instance_segmentation_loss_cls: 0.0434 instance_segmentation_loss_reg: 0.3731 instance_segmentation_loss_poly: 1.0404 2024/01/03 11:18:03 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 11:18:03 - mmengine - INFO - Iter(train) [ 79000/640000] base_lr: 1.9257e-04 lr: 1.9257e-05 eta: 9 days, 2:20:22 time: 1.4047 data_time: 0.0253 memory: 25631 grad_norm: 2.6636 loss: 1.4961 caption_loss_cls: 2.5208 detection_loss_cls: 0.0433 detection_loss_reg: 0.3697 semantic_segmentation_loss_cls: 0.0121 grounding_loss_reg: 3.2843 instance_segmentation_loss_cls: 0.0434 instance_segmentation_loss_reg: 0.3732 instance_segmentation_loss_poly: 1.0419 2024/01/03 11:29:10 - mmengine - INFO - Iter(train) [ 79500/640000] base_lr: 1.9248e-04 lr: 1.9248e-05 eta: 9 days, 1:56:27 time: 1.3968 data_time: 0.0252 memory: 25631 grad_norm: 2.6623 loss: 1.5045 caption_loss_cls: 2.5200 detection_loss_cls: 0.0430 detection_loss_reg: 0.3676 semantic_segmentation_loss_cls: 0.0120 grounding_loss_reg: 3.2852 instance_segmentation_loss_cls: 0.0434 instance_segmentation_loss_reg: 0.3726 instance_segmentation_loss_poly: 1.0399 2024/01/03 11:40:03 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 11:40:03 - mmengine - INFO - Iter(train) [ 80000/640000] base_lr: 1.9239e-04 lr: 1.9239e-05 eta: 9 days, 1:27:53 time: 1.3820 data_time: 0.0251 memory: 25631 grad_norm: 2.7204 loss: 1.5297 caption_loss_cls: 2.5127 detection_loss_cls: 0.0431 detection_loss_reg: 0.3682 semantic_segmentation_loss_cls: 0.0119 grounding_loss_reg: 3.2828 instance_segmentation_loss_cls: 0.0435 instance_segmentation_loss_reg: 0.3740 instance_segmentation_loss_poly: 1.0419 2024/01/03 11:40:03 - mmengine - INFO - Saving checkpoint at 80000 iterations 2024/01/03 11:51:54 - mmengine - INFO - Evaluating bbox... 2024/01/03 11:52:53 - mmengine - INFO - bbox_mAP_copypaste: 0.442 0.622 0.487 0.289 0.494 0.566 2024/01/03 11:52:53 - mmengine - INFO - Evaluating segm... 2024/01/03 11:54:03 - mmengine - INFO - segm_mAP_copypaste: 0.276 0.515 0.264 0.137 0.323 0.427 2024/01/03 11:56:12 - mmengine - INFO - Evaluating bbox... 2024/01/03 11:57:10 - mmengine - INFO - bbox_mAP_copypaste: 0.440 0.620 0.485 0.289 0.493 0.564 2024/01/03 12:02:35 - mmengine - INFO - per class results: 2024/01/03 12:02:35 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 77.76 | 87.25 | | building | 81.47 | 94.19 | | sky | 92.05 | 98.39 | | floor | 82.04 | 88.65 | | tree | 69.98 | 79.72 | | ceiling | 84.77 | 94.26 | | road | 83.15 | 91.77 | | bed | 88.6 | 95.79 | | windowpane | 60.77 | 70.25 | | grass | 60.3 | 79.74 | | cabinet | 56.52 | 78.22 | | sidewalk | 62.49 | 70.84 | | person | 79.98 | 90.36 | | earth | 35.71 | 46.08 | | door | 53.09 | 73.13 | | table | 57.18 | 69.04 | | mountain | 54.69 | 79.31 | | plant | 43.37 | 47.02 | | curtain | 72.45 | 78.42 | | chair | 57.38 | 74.74 | | car | 83.11 | 90.99 | | water | 51.82 | 66.16 | | painting | 68.58 | 88.83 | | sofa | 64.07 | 79.65 | | shelf | 37.85 | 50.31 | | house | 43.1 | 52.48 | | sea | 52.3 | 78.68 | | mirror | 67.06 | 74.67 | | rug | 67.09 | 76.03 | | field | 28.66 | 57.26 | | armchair | 43.78 | 66.75 | | seat | 65.83 | 84.07 | | fence | 45.38 | 66.08 | | desk | 41.0 | 54.72 | | rock | 33.14 | 41.63 | | wardrobe | 46.51 | 66.78 | | lamp | 59.57 | 71.97 | | bathtub | 75.55 | 86.3 | | railing | 37.31 | 50.06 | | cushion | 55.53 | 79.11 | | base | 29.98 | 34.36 | | box | 26.35 | 31.95 | | column | 52.68 | 70.16 | | signboard | 35.2 | 45.6 | | chest of drawers | 46.72 | 65.44 | | counter | 18.11 | 22.45 | | sand | 33.96 | 71.1 | | sink | 69.09 | 80.15 | | skyscraper | 44.32 | 53.73 | | fireplace | 69.85 | 90.67 | | refrigerator | 71.93 | 88.96 | | grandstand | 37.74 | 81.13 | | path | 23.16 | 35.05 | | stairs | 36.81 | 62.71 | | runway | 70.07 | 88.07 | | case | 44.86 | 64.75 | | pool table | 84.94 | 97.29 | | pillow | 55.3 | 64.77 | | screen door | 2.53 | 2.53 | | stairway | 42.77 | 50.77 | | river | 22.79 | 32.61 | | bridge | 62.71 | 69.86 | | bookcase | 31.18 | 56.76 | | blind | 45.22 | 56.5 | | coffee table | 57.35 | 74.89 | | toilet | 84.69 | 91.86 | | flower | 30.06 | 50.3 | | book | 41.59 | 57.54 | | hill | 10.31 | 18.81 | | bench | 55.59 | 68.8 | | countertop | 48.18 | 60.64 | | stove | 69.95 | 84.5 | | palm | 42.08 | 52.97 | | kitchen island | 21.78 | 94.08 | | computer | 69.44 | 86.73 | | swivel chair | 41.56 | 52.78 | | boat | 54.6 | 57.07 | | bar | 23.59 | 33.26 | | arcade machine | 69.2 | 89.51 | | hovel | 23.1 | 24.67 | | bus | 92.15 | 94.47 | | towel | 56.26 | 75.2 | | light | 46.23 | 53.51 | | truck | 33.63 | 67.52 | | tower | 3.29 | 4.98 | | chandelier | 60.26 | 73.3 | | awning | 30.2 | 40.11 | | streetlight | 19.46 | 25.18 | | booth | 28.39 | 50.14 | | television receiver | 67.9 | 86.46 | | airplane | 68.61 | 77.99 | | dirt track | 3.97 | 15.62 | | apparel | 24.96 | 34.39 | | pole | 15.59 | 17.48 | | land | 0.38 | 0.45 | | bannister | 4.7 | 5.46 | | escalator | 30.73 | 35.17 | | ottoman | 31.8 | 38.63 | | bottle | 25.37 | 35.18 | | buffet | 44.48 | 50.86 | | poster | 16.72 | 17.04 | | stage | 14.06 | 21.45 | | van | 30.56 | 34.66 | | ship | 7.21 | 9.43 | | fountain | 32.93 | 33.33 | | conveyer belt | 79.96 | 91.83 | | canopy | 32.16 | 42.46 | | washer | 36.27 | 83.84 | | plaything | 32.36 | 36.31 | | swimming pool | 45.83 | 59.79 | | stool | 31.42 | 69.45 | | barrel | 22.27 | 83.56 | | basket | 26.88 | 32.94 | | waterfall | 72.72 | 91.5 | | tent | 89.51 | 97.85 | | bag | 17.86 | 19.7 | | minibike | 70.36 | 86.19 | | cradle | 82.95 | 96.17 | | oven | 27.71 | 31.59 | | ball | 41.55 | 57.36 | | food | 49.63 | 57.25 | | step | 10.04 | 11.09 | | tank | 31.15 | 32.25 | | trade name | 23.72 | 26.75 | | microwave | 68.34 | 79.52 | | pot | 43.79 | 55.8 | | animal | 66.84 | 73.99 | | bicycle | 54.45 | 62.25 | | lake | 0.0 | 0.0 | | dishwasher | 61.62 | 68.79 | | screen | 51.65 | 85.15 | | blanket | 19.84 | 22.52 | | sculpture | 50.78 | 65.3 | | hood | 53.25 | 55.71 | | sconce | 35.73 | 41.67 | | vase | 38.4 | 53.03 | | traffic light | 31.42 | 47.9 | | tray | 3.45 | 6.54 | | ashcan | 29.74 | 48.28 | | fan | 55.08 | 64.91 | | pier | 36.38 | 48.92 | | crt screen | 7.02 | 18.09 | | plate | 46.05 | 78.3 | | monitor | 13.28 | 20.21 | | bulletin board | 29.27 | 60.66 | | shower | 0.73 | 17.75 | | radiator | 50.49 | 58.39 | | glass | 18.12 | 21.54 | | clock | 21.07 | 33.2 | | flag | 45.41 | 50.35 | +---------------------+-------+-------+ 2024/01/03 12:02:48 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.4400 coco/bbox_mAP_50: 0.6200 coco/bbox_mAP_75: 0.4850 coco/bbox_mAP_s: 0.2890 coco/bbox_mAP_m: 0.4930 coco/bbox_mAP_l: 0.5640 coco/segm_mAP: 0.2760 coco/segm_mAP_50: 0.5150 coco/segm_mAP_75: 0.2640 coco/segm_mAP_s: 0.1370 coco/segm_mAP_m: 0.3230 coco/segm_mAP_l: 0.4270 Bleu_1: 0.7117 Bleu_2: 0.5433 Bleu_3: 0.4037 Bleu_4: 0.2967 METEOR: 0.2459 ROUGE_L: 0.5262 CIDEr: 0.9131 SPICE: 0.1818 aAcc: 81.8500 mIoU: 45.1200 mAcc: 58.2000 visual-grounding/miou: 0.7050 visual-grounding/acc: 0.7745 data_time: 0.0126 time: 1.9395 2024/01/03 12:14:01 - mmengine - INFO - Iter(train) [ 80500/640000] base_lr: 1.9229e-04 lr: 1.9229e-05 eta: 9 days, 1:07:56 time: 1.3693 data_time: 0.0184 memory: 34722 grad_norm: 2.7791 loss: 1.5426 caption_loss_cls: 2.5099 detection_loss_cls: 0.0431 detection_loss_reg: 0.3680 semantic_segmentation_loss_cls: 0.0119 grounding_loss_reg: 3.2810 instance_segmentation_loss_cls: 0.0436 instance_segmentation_loss_reg: 0.3763 instance_segmentation_loss_poly: 1.0466 2024/01/03 12:25:44 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 12:25:44 - mmengine - INFO - Iter(train) [ 81000/640000] base_lr: 1.9220e-04 lr: 1.9220e-05 eta: 9 days, 0:57:45 time: 1.3786 data_time: 0.0185 memory: 25629 grad_norm: 2.7822 loss: 1.5416 caption_loss_cls: 2.5079 detection_loss_cls: 0.0431 detection_loss_reg: 0.3710 semantic_segmentation_loss_cls: 0.0119 grounding_loss_reg: 3.2775 instance_segmentation_loss_cls: 0.0436 instance_segmentation_loss_reg: 0.3769 instance_segmentation_loss_poly: 1.0469 2024/01/03 12:36:56 - mmengine - INFO - Iter(train) [ 81500/640000] base_lr: 1.9210e-04 lr: 1.9210e-05 eta: 9 days, 0:37:01 time: 1.3761 data_time: 0.0186 memory: 25629 grad_norm: 2.8136 loss: 1.5541 caption_loss_cls: 2.4966 detection_loss_cls: 0.0430 detection_loss_reg: 0.3694 semantic_segmentation_loss_cls: 0.0119 grounding_loss_reg: 3.2676 instance_segmentation_loss_cls: 0.0436 instance_segmentation_loss_reg: 0.3778 instance_segmentation_loss_poly: 1.0502 2024/01/03 12:48:39 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 12:48:39 - mmengine - INFO - Iter(train) [ 82000/640000] base_lr: 1.9201e-04 lr: 1.9201e-05 eta: 9 days, 0:26:56 time: 1.3807 data_time: 0.0186 memory: 25629 grad_norm: 2.8067 loss: 1.5287 caption_loss_cls: 2.4910 detection_loss_cls: 0.0430 detection_loss_reg: 0.3697 semantic_segmentation_loss_cls: 0.0119 grounding_loss_reg: 3.2600 instance_segmentation_loss_cls: 0.0435 instance_segmentation_loss_reg: 0.3763 instance_segmentation_loss_poly: 1.0472 2024/01/03 12:48:39 - mmengine - INFO - Saving checkpoint at 82000 iterations 2024/01/03 13:03:56 - mmengine - INFO - Iter(train) [ 82500/640000] base_lr: 1.9191e-04 lr: 1.9191e-05 eta: 9 days, 1:26:50 time: 1.4294 data_time: 0.0187 memory: 25629 grad_norm: 2.7995 loss: 1.5295 caption_loss_cls: 2.4871 detection_loss_cls: 0.0431 detection_loss_reg: 0.3700 semantic_segmentation_loss_cls: 0.0119 grounding_loss_reg: 3.2581 instance_segmentation_loss_cls: 0.0434 instance_segmentation_loss_reg: 0.3756 instance_segmentation_loss_poly: 1.0453 2024/01/03 13:15:23 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 13:15:23 - mmengine - INFO - Iter(train) [ 83000/640000] base_lr: 1.9181e-04 lr: 1.9181e-05 eta: 9 days, 1:10:19 time: 1.4193 data_time: 0.0187 memory: 25629 grad_norm: 2.8260 loss: 1.5359 caption_loss_cls: 2.4895 detection_loss_cls: 0.0430 detection_loss_reg: 0.3707 semantic_segmentation_loss_cls: 0.0119 grounding_loss_reg: 3.2535 instance_segmentation_loss_cls: 0.0434 instance_segmentation_loss_reg: 0.3754 instance_segmentation_loss_poly: 1.0464 2024/01/03 13:27:04 - mmengine - INFO - Iter(train) [ 83500/640000] base_lr: 1.9172e-04 lr: 1.9172e-05 eta: 9 days, 0:58:21 time: 1.4278 data_time: 0.0188 memory: 25629 grad_norm: 2.8106 loss: 1.5141 caption_loss_cls: 2.4828 detection_loss_cls: 0.0429 detection_loss_reg: 0.3688 semantic_segmentation_loss_cls: 0.0119 grounding_loss_reg: 3.2506 instance_segmentation_loss_cls: 0.0434 instance_segmentation_loss_reg: 0.3746 instance_segmentation_loss_poly: 1.0444 2024/01/03 13:39:31 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 13:39:31 - mmengine - INFO - Iter(train) [ 84000/640000] base_lr: 1.9162e-04 lr: 1.9162e-05 eta: 9 days, 1:00:21 time: 1.4511 data_time: 0.0191 memory: 25629 grad_norm: 2.7445 loss: 1.4969 caption_loss_cls: 2.4793 detection_loss_cls: 0.0430 detection_loss_reg: 0.3706 semantic_segmentation_loss_cls: 0.0118 grounding_loss_reg: 3.2555 instance_segmentation_loss_cls: 0.0430 instance_segmentation_loss_reg: 0.3729 instance_segmentation_loss_poly: 1.0411 2024/01/03 13:39:31 - mmengine - INFO - Saving checkpoint at 84000 iterations 2024/01/03 13:51:15 - mmengine - INFO - Iter(train) [ 84500/640000] base_lr: 1.9152e-04 lr: 1.9152e-05 eta: 9 days, 0:49:05 time: 1.4583 data_time: 0.0256 memory: 25629 grad_norm: 2.7098 loss: 1.4938 caption_loss_cls: 2.4810 detection_loss_cls: 0.0428 detection_loss_reg: 0.3685 semantic_segmentation_loss_cls: 0.0118 grounding_loss_reg: 3.2516 instance_segmentation_loss_cls: 0.0431 instance_segmentation_loss_reg: 0.3729 instance_segmentation_loss_poly: 1.0407 2024/01/03 14:03:04 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 14:03:04 - mmengine - INFO - Iter(train) [ 85000/640000] base_lr: 1.9142e-04 lr: 1.9142e-05 eta: 9 days, 0:39:18 time: 1.4598 data_time: 0.0257 memory: 25629 grad_norm: 2.6897 loss: 1.4997 caption_loss_cls: 2.4805 detection_loss_cls: 0.0426 detection_loss_reg: 0.3672 semantic_segmentation_loss_cls: 0.0118 grounding_loss_reg: 3.2520 instance_segmentation_loss_cls: 0.0429 instance_segmentation_loss_reg: 0.3719 instance_segmentation_loss_poly: 1.0401 2024/01/03 14:14:07 - mmengine - INFO - Iter(train) [ 85500/640000] base_lr: 1.9132e-04 lr: 1.9132e-05 eta: 9 days, 0:15:59 time: 1.4577 data_time: 0.0257 memory: 25629 grad_norm: 2.6598 loss: 1.4974 caption_loss_cls: 2.4774 detection_loss_cls: 0.0424 detection_loss_reg: 0.3664 semantic_segmentation_loss_cls: 0.0118 grounding_loss_reg: 3.2550 instance_segmentation_loss_cls: 0.0430 instance_segmentation_loss_reg: 0.3717 instance_segmentation_loss_poly: 1.0396 2024/01/03 14:26:16 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 14:26:16 - mmengine - INFO - Iter(train) [ 86000/640000] base_lr: 1.9122e-04 lr: 1.9122e-05 eta: 9 days, 0:11:57 time: 1.4641 data_time: 0.0259 memory: 25629 grad_norm: 2.6388 loss: 1.5039 caption_loss_cls: 2.4859 detection_loss_cls: 0.0423 detection_loss_reg: 0.3669 semantic_segmentation_loss_cls: 0.0118 grounding_loss_reg: 3.2522 instance_segmentation_loss_cls: 0.0431 instance_segmentation_loss_reg: 0.3728 instance_segmentation_loss_poly: 1.0408 2024/01/03 14:26:16 - mmengine - INFO - Saving checkpoint at 86000 iterations 2024/01/03 14:38:10 - mmengine - INFO - Iter(train) [ 86500/640000] base_lr: 1.9112e-04 lr: 1.9112e-05 eta: 9 days, 0:03:31 time: 1.4132 data_time: 0.0259 memory: 25629 grad_norm: 2.6642 loss: 1.5151 caption_loss_cls: 2.4858 detection_loss_cls: 0.0419 detection_loss_reg: 0.3655 semantic_segmentation_loss_cls: 0.0118 grounding_loss_reg: 3.2449 instance_segmentation_loss_cls: 0.0430 instance_segmentation_loss_reg: 0.3725 instance_segmentation_loss_poly: 1.0404 2024/01/03 14:49:38 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 14:49:38 - mmengine - INFO - Iter(train) [ 87000/640000] base_lr: 1.9102e-04 lr: 1.9102e-05 eta: 8 days, 23:47:41 time: 1.4134 data_time: 0.0260 memory: 25629 grad_norm: 2.6655 loss: 1.5313 caption_loss_cls: 2.4848 detection_loss_cls: 0.0419 detection_loss_reg: 0.3661 semantic_segmentation_loss_cls: 0.0118 grounding_loss_reg: 3.2455 instance_segmentation_loss_cls: 0.0430 instance_segmentation_loss_reg: 0.3732 instance_segmentation_loss_poly: 1.0407 2024/01/03 15:01:19 - mmengine - INFO - Iter(train) [ 87500/640000] base_lr: 1.9092e-04 lr: 1.9092e-05 eta: 8 days, 23:35:33 time: 1.4134 data_time: 0.0260 memory: 25629 grad_norm: 2.6447 loss: 1.5386 caption_loss_cls: 2.4861 detection_loss_cls: 0.0417 detection_loss_reg: 0.3646 semantic_segmentation_loss_cls: 0.0118 grounding_loss_reg: 3.2407 instance_segmentation_loss_cls: 0.0429 instance_segmentation_loss_reg: 0.3724 instance_segmentation_loss_poly: 1.0391 2024/01/03 15:12:40 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 15:12:40 - mmengine - INFO - Iter(train) [ 88000/640000] base_lr: 1.9081e-04 lr: 1.9081e-05 eta: 8 days, 23:17:58 time: 1.3970 data_time: 0.0258 memory: 25629 grad_norm: 2.6763 loss: 1.5401 caption_loss_cls: 2.4877 detection_loss_cls: 0.0416 detection_loss_reg: 0.3646 semantic_segmentation_loss_cls: 0.0118 grounding_loss_reg: 3.2392 instance_segmentation_loss_cls: 0.0429 instance_segmentation_loss_reg: 0.3720 instance_segmentation_loss_poly: 1.0386 2024/01/03 15:12:40 - mmengine - INFO - Saving checkpoint at 88000 iterations 2024/01/03 15:24:30 - mmengine - INFO - Iter(train) [ 88500/640000] base_lr: 1.9071e-04 lr: 1.9071e-05 eta: 8 days, 23:08:21 time: 1.3985 data_time: 0.0259 memory: 25629 grad_norm: 2.6559 loss: 1.5440 caption_loss_cls: 2.4893 detection_loss_cls: 0.0417 detection_loss_reg: 0.3657 semantic_segmentation_loss_cls: 0.0118 grounding_loss_reg: 3.2348 instance_segmentation_loss_cls: 0.0426 instance_segmentation_loss_reg: 0.3711 instance_segmentation_loss_poly: 1.0366 2024/01/03 15:35:59 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 15:35:59 - mmengine - INFO - Iter(train) [ 89000/640000] base_lr: 1.9061e-04 lr: 1.9061e-05 eta: 8 days, 22:53:11 time: 1.3935 data_time: 0.0259 memory: 25629 grad_norm: 2.6500 loss: 1.5303 caption_loss_cls: 2.4810 detection_loss_cls: 0.0417 detection_loss_reg: 0.3656 semantic_segmentation_loss_cls: 0.0118 grounding_loss_reg: 3.2296 instance_segmentation_loss_cls: 0.0427 instance_segmentation_loss_reg: 0.3713 instance_segmentation_loss_poly: 1.0370 2024/01/03 15:47:33 - mmengine - INFO - Iter(train) [ 89500/640000] base_lr: 1.9050e-04 lr: 1.9050e-05 eta: 8 days, 22:39:35 time: 1.4014 data_time: 0.0259 memory: 25629 grad_norm: 2.6216 loss: 1.5198 caption_loss_cls: 2.4782 detection_loss_cls: 0.0418 detection_loss_reg: 0.3673 semantic_segmentation_loss_cls: 0.0118 grounding_loss_reg: 3.2235 instance_segmentation_loss_cls: 0.0425 instance_segmentation_loss_reg: 0.3698 instance_segmentation_loss_poly: 1.0344 2024/01/03 15:59:09 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 15:59:09 - mmengine - INFO - Iter(train) [ 90000/640000] base_lr: 1.9040e-04 lr: 1.9040e-05 eta: 8 days, 22:26:19 time: 1.3931 data_time: 0.0258 memory: 25629 grad_norm: 2.6596 loss: 1.5197 caption_loss_cls: 2.4714 detection_loss_cls: 0.0414 detection_loss_reg: 0.3652 semantic_segmentation_loss_cls: 0.0117 grounding_loss_reg: 3.2226 instance_segmentation_loss_cls: 0.0425 instance_segmentation_loss_reg: 0.3692 instance_segmentation_loss_poly: 1.0345 2024/01/03 15:59:09 - mmengine - INFO - Saving checkpoint at 90000 iterations 2024/01/03 16:11:18 - mmengine - INFO - Iter(train) [ 90500/640000] base_lr: 1.9029e-04 lr: 1.9029e-05 eta: 8 days, 22:21:27 time: 1.3969 data_time: 0.0259 memory: 25629 grad_norm: 2.6627 loss: 1.5291 caption_loss_cls: 2.4726 detection_loss_cls: 0.0415 detection_loss_reg: 0.3665 semantic_segmentation_loss_cls: 0.0116 grounding_loss_reg: 3.2176 instance_segmentation_loss_cls: 0.0426 instance_segmentation_loss_reg: 0.3700 instance_segmentation_loss_poly: 1.0349 2024/01/03 16:23:14 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 16:23:14 - mmengine - INFO - Iter(train) [ 91000/640000] base_lr: 1.9019e-04 lr: 1.9019e-05 eta: 8 days, 22:12:59 time: 1.4038 data_time: 0.0259 memory: 25629 grad_norm: 2.6360 loss: 1.5171 caption_loss_cls: 2.4758 detection_loss_cls: 0.0417 detection_loss_reg: 0.3693 semantic_segmentation_loss_cls: 0.0116 grounding_loss_reg: 3.2160 instance_segmentation_loss_cls: 0.0427 instance_segmentation_loss_reg: 0.3710 instance_segmentation_loss_poly: 1.0372 2024/01/03 16:34:44 - mmengine - INFO - Iter(train) [ 91500/640000] base_lr: 1.9008e-04 lr: 1.9008e-05 eta: 8 days, 21:58:19 time: 1.4011 data_time: 0.0258 memory: 25629 grad_norm: 2.6487 loss: 1.5183 caption_loss_cls: 2.4835 detection_loss_cls: 0.0418 detection_loss_reg: 0.3702 semantic_segmentation_loss_cls: 0.0116 grounding_loss_reg: 3.2096 instance_segmentation_loss_cls: 0.0428 instance_segmentation_loss_reg: 0.3696 instance_segmentation_loss_poly: 1.0339 2024/01/03 16:46:04 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 16:46:04 - mmengine - INFO - Iter(train) [ 92000/640000] base_lr: 1.8998e-04 lr: 1.8998e-05 eta: 8 days, 21:41:20 time: 1.4010 data_time: 0.0257 memory: 25629 grad_norm: 2.6522 loss: 1.5082 caption_loss_cls: 2.4786 detection_loss_cls: 0.0414 detection_loss_reg: 0.3676 semantic_segmentation_loss_cls: 0.0116 grounding_loss_reg: 3.2121 instance_segmentation_loss_cls: 0.0425 instance_segmentation_loss_reg: 0.3692 instance_segmentation_loss_poly: 1.0324 2024/01/03 16:46:04 - mmengine - INFO - Saving checkpoint at 92000 iterations 2024/01/03 16:58:23 - mmengine - INFO - Iter(train) [ 92500/640000] base_lr: 1.8987e-04 lr: 1.8987e-05 eta: 8 days, 21:38:14 time: 1.4081 data_time: 0.0258 memory: 25629 grad_norm: 2.6519 loss: 1.4985 caption_loss_cls: 2.4842 detection_loss_cls: 0.0413 detection_loss_reg: 0.3662 semantic_segmentation_loss_cls: 0.0115 grounding_loss_reg: 3.2055 instance_segmentation_loss_cls: 0.0424 instance_segmentation_loss_reg: 0.3689 instance_segmentation_loss_poly: 1.0324 2024/01/03 17:09:32 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 17:09:32 - mmengine - INFO - Iter(train) [ 93000/640000] base_lr: 1.8976e-04 lr: 1.8976e-05 eta: 8 days, 21:18:47 time: 1.4031 data_time: 0.0256 memory: 25629 grad_norm: 2.7079 loss: 1.5107 caption_loss_cls: 2.4821 detection_loss_cls: 0.0412 detection_loss_reg: 0.3669 semantic_segmentation_loss_cls: 0.0115 grounding_loss_reg: 3.2055 instance_segmentation_loss_cls: 0.0424 instance_segmentation_loss_reg: 0.3688 instance_segmentation_loss_poly: 1.0322 2024/01/03 17:21:21 - mmengine - INFO - Iter(train) [ 93500/640000] base_lr: 1.8965e-04 lr: 1.8965e-05 eta: 8 days, 21:08:36 time: 1.4066 data_time: 0.0257 memory: 25629 grad_norm: 2.7029 loss: 1.4993 caption_loss_cls: 2.4826 detection_loss_cls: 0.0412 detection_loss_reg: 0.3663 semantic_segmentation_loss_cls: 0.0115 grounding_loss_reg: 3.2000 instance_segmentation_loss_cls: 0.0426 instance_segmentation_loss_reg: 0.3700 instance_segmentation_loss_poly: 1.0342 2024/01/03 17:33:05 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 17:33:05 - mmengine - INFO - Iter(train) [ 94000/640000] base_lr: 1.8954e-04 lr: 1.8954e-05 eta: 8 days, 20:57:28 time: 1.4088 data_time: 0.0257 memory: 25629 grad_norm: 2.6964 loss: 1.4971 caption_loss_cls: 2.4794 detection_loss_cls: 0.0412 detection_loss_reg: 0.3659 semantic_segmentation_loss_cls: 0.0114 grounding_loss_reg: 3.1961 instance_segmentation_loss_cls: 0.0427 instance_segmentation_loss_reg: 0.3713 instance_segmentation_loss_poly: 1.0358 2024/01/03 17:33:05 - mmengine - INFO - Saving checkpoint at 94000 iterations 2024/01/03 17:44:55 - mmengine - INFO - Iter(train) [ 94500/640000] base_lr: 1.8943e-04 lr: 1.8943e-05 eta: 8 days, 20:47:28 time: 1.4039 data_time: 0.0255 memory: 25629 grad_norm: 2.6933 loss: 1.4836 caption_loss_cls: 2.4784 detection_loss_cls: 0.0413 detection_loss_reg: 0.3670 semantic_segmentation_loss_cls: 0.0114 grounding_loss_reg: 3.2003 instance_segmentation_loss_cls: 0.0424 instance_segmentation_loss_reg: 0.3685 instance_segmentation_loss_poly: 1.0306 2024/01/03 17:56:15 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 17:56:15 - mmengine - INFO - Iter(train) [ 95000/640000] base_lr: 1.8932e-04 lr: 1.8932e-05 eta: 8 days, 20:30:45 time: 1.3950 data_time: 0.0253 memory: 25629 grad_norm: 2.7215 loss: 1.4875 caption_loss_cls: 2.4799 detection_loss_cls: 0.0414 detection_loss_reg: 0.3683 semantic_segmentation_loss_cls: 0.0114 grounding_loss_reg: 3.1983 instance_segmentation_loss_cls: 0.0422 instance_segmentation_loss_reg: 0.3688 instance_segmentation_loss_poly: 1.0298 2024/01/03 18:07:46 - mmengine - INFO - Iter(train) [ 95500/640000] base_lr: 1.8921e-04 lr: 1.8921e-05 eta: 8 days, 20:16:43 time: 1.3953 data_time: 0.0254 memory: 25629 grad_norm: 2.7038 loss: 1.4862 caption_loss_cls: 2.4787 detection_loss_cls: 0.0413 detection_loss_reg: 0.3667 semantic_segmentation_loss_cls: 0.0114 grounding_loss_reg: 3.1926 instance_segmentation_loss_cls: 0.0424 instance_segmentation_loss_reg: 0.3698 instance_segmentation_loss_poly: 1.0321 2024/01/03 18:19:11 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 18:19:11 - mmengine - INFO - Iter(train) [ 96000/640000] base_lr: 1.8910e-04 lr: 1.8910e-05 eta: 8 days, 20:01:24 time: 1.3964 data_time: 0.0255 memory: 25629 grad_norm: 2.6831 loss: 1.4945 caption_loss_cls: 2.4802 detection_loss_cls: 0.0411 detection_loss_reg: 0.3644 semantic_segmentation_loss_cls: 0.0114 grounding_loss_reg: 3.1910 instance_segmentation_loss_cls: 0.0422 instance_segmentation_loss_reg: 0.3693 instance_segmentation_loss_poly: 1.0303 2024/01/03 18:19:11 - mmengine - INFO - Saving checkpoint at 96000 iterations 2024/01/03 18:31:16 - mmengine - INFO - Iter(train) [ 96500/640000] base_lr: 1.8899e-04 lr: 1.8899e-05 eta: 8 days, 19:54:45 time: 1.3932 data_time: 0.0255 memory: 25629 grad_norm: 2.6667 loss: 1.4965 caption_loss_cls: 2.4777 detection_loss_cls: 0.0412 detection_loss_reg: 0.3657 semantic_segmentation_loss_cls: 0.0113 grounding_loss_reg: 3.1870 instance_segmentation_loss_cls: 0.0421 instance_segmentation_loss_reg: 0.3704 instance_segmentation_loss_poly: 1.0323 2024/01/03 18:43:05 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 18:43:05 - mmengine - INFO - Iter(train) [ 97000/640000] base_lr: 1.8888e-04 lr: 1.8888e-05 eta: 8 days, 19:44:25 time: 1.4030 data_time: 0.0256 memory: 25629 grad_norm: 2.6440 loss: 1.4987 caption_loss_cls: 2.4828 detection_loss_cls: 0.0410 detection_loss_reg: 0.3637 semantic_segmentation_loss_cls: 0.0113 grounding_loss_reg: 3.1887 instance_segmentation_loss_cls: 0.0420 instance_segmentation_loss_reg: 0.3686 instance_segmentation_loss_poly: 1.0289 2024/01/03 18:54:43 - mmengine - INFO - Iter(train) [ 97500/640000] base_lr: 1.8876e-04 lr: 1.8876e-05 eta: 8 days, 19:32:02 time: 1.4005 data_time: 0.0255 memory: 25629 grad_norm: 2.6360 loss: 1.5065 caption_loss_cls: 2.4796 detection_loss_cls: 0.0410 detection_loss_reg: 0.3642 semantic_segmentation_loss_cls: 0.0114 grounding_loss_reg: 3.1923 instance_segmentation_loss_cls: 0.0421 instance_segmentation_loss_reg: 0.3705 instance_segmentation_loss_poly: 1.0328 2024/01/03 19:06:31 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 19:06:31 - mmengine - INFO - Iter(train) [ 98000/640000] base_lr: 1.8865e-04 lr: 1.8865e-05 eta: 8 days, 19:21:29 time: 1.4012 data_time: 0.0255 memory: 25629 grad_norm: 2.6090 loss: 1.5016 caption_loss_cls: 2.4771 detection_loss_cls: 0.0412 detection_loss_reg: 0.3661 semantic_segmentation_loss_cls: 0.0115 grounding_loss_reg: 3.1881 instance_segmentation_loss_cls: 0.0421 instance_segmentation_loss_reg: 0.3700 instance_segmentation_loss_poly: 1.0316 2024/01/03 19:06:31 - mmengine - INFO - Saving checkpoint at 98000 iterations 2024/01/03 19:18:49 - mmengine - INFO - Iter(train) [ 98500/640000] base_lr: 1.8854e-04 lr: 1.8854e-05 eta: 8 days, 19:17:08 time: 1.4083 data_time: 0.0258 memory: 25629 grad_norm: 2.5738 loss: 1.4938 caption_loss_cls: 2.4776 detection_loss_cls: 0.0412 detection_loss_reg: 0.3674 semantic_segmentation_loss_cls: 0.0114 grounding_loss_reg: 3.1866 instance_segmentation_loss_cls: 0.0422 instance_segmentation_loss_reg: 0.3718 instance_segmentation_loss_poly: 1.0356 2024/01/03 19:30:14 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 19:30:14 - mmengine - INFO - Iter(train) [ 99000/640000] base_lr: 1.8842e-04 lr: 1.8842e-05 eta: 8 days, 19:01:57 time: 1.4097 data_time: 0.0257 memory: 25629 grad_norm: 2.5795 loss: 1.4907 caption_loss_cls: 2.4796 detection_loss_cls: 0.0410 detection_loss_reg: 0.3659 semantic_segmentation_loss_cls: 0.0114 grounding_loss_reg: 3.1836 instance_segmentation_loss_cls: 0.0420 instance_segmentation_loss_reg: 0.3709 instance_segmentation_loss_poly: 1.0328 2024/01/03 19:42:14 - mmengine - INFO - Iter(train) [ 99500/640000] base_lr: 1.8831e-04 lr: 1.8831e-05 eta: 8 days, 18:53:47 time: 1.4169 data_time: 0.0259 memory: 25629 grad_norm: 2.5717 loss: 1.4892 caption_loss_cls: 2.4799 detection_loss_cls: 0.0408 detection_loss_reg: 0.3648 semantic_segmentation_loss_cls: 0.0114 grounding_loss_reg: 3.1778 instance_segmentation_loss_cls: 0.0419 instance_segmentation_loss_reg: 0.3717 instance_segmentation_loss_poly: 1.0336 2024/01/03 19:53:44 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 19:53:44 - mmengine - INFO - Iter(train) [100000/640000] base_lr: 1.8819e-04 lr: 1.8819e-05 eta: 8 days, 18:39:38 time: 1.4181 data_time: 0.0259 memory: 25629 grad_norm: 2.5958 loss: 1.4885 caption_loss_cls: 2.4746 detection_loss_cls: 0.0410 detection_loss_reg: 0.3675 semantic_segmentation_loss_cls: 0.0114 grounding_loss_reg: 3.1736 instance_segmentation_loss_cls: 0.0419 instance_segmentation_loss_reg: 0.3712 instance_segmentation_loss_poly: 1.0333 2024/01/03 19:53:44 - mmengine - INFO - Saving checkpoint at 100000 iterations 2024/01/03 20:05:36 - mmengine - INFO - Evaluating bbox... 2024/01/03 20:06:33 - mmengine - INFO - bbox_mAP_copypaste: 0.447 0.632 0.496 0.303 0.507 0.572 2024/01/03 20:06:33 - mmengine - INFO - Evaluating segm... 2024/01/03 20:07:45 - mmengine - INFO - segm_mAP_copypaste: 0.283 0.531 0.270 0.152 0.333 0.442 2024/01/03 20:09:53 - mmengine - INFO - Evaluating bbox... 2024/01/03 20:10:51 - mmengine - INFO - bbox_mAP_copypaste: 0.447 0.633 0.495 0.303 0.507 0.573 2024/01/03 20:16:42 - mmengine - INFO - per class results: 2024/01/03 20:16:42 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 78.33 | 89.4 | | building | 82.24 | 92.28 | | sky | 92.92 | 97.55 | | floor | 82.07 | 91.3 | | tree | 72.27 | 88.74 | | ceiling | 85.13 | 95.09 | | road | 84.5 | 88.86 | | bed | 88.82 | 94.92 | | windowpane | 64.03 | 80.23 | | grass | 69.67 | 82.93 | | cabinet | 60.35 | 74.68 | | sidewalk | 64.63 | 85.44 | | person | 78.66 | 86.97 | | earth | 37.08 | 50.75 | | door | 53.66 | 70.38 | | table | 61.46 | 76.3 | | mountain | 56.51 | 73.86 | | plant | 51.14 | 61.96 | | curtain | 73.01 | 84.41 | | chair | 57.48 | 68.74 | | car | 83.3 | 90.98 | | water | 52.22 | 64.48 | | painting | 73.41 | 82.46 | | sofa | 66.88 | 80.88 | | shelf | 36.27 | 50.03 | | house | 42.43 | 54.27 | | sea | 54.11 | 67.61 | | mirror | 65.55 | 71.42 | | rug | 66.36 | 75.68 | | field | 28.48 | 33.17 | | armchair | 46.82 | 67.16 | | seat | 62.91 | 79.49 | | fence | 42.94 | 51.07 | | desk | 48.64 | 67.59 | | rock | 50.73 | 64.52 | | wardrobe | 46.62 | 65.12 | | lamp | 58.45 | 65.56 | | bathtub | 79.7 | 85.69 | | railing | 36.98 | 50.58 | | cushion | 56.32 | 67.57 | | base | 19.82 | 22.39 | | box | 26.42 | 34.11 | | column | 54.72 | 64.55 | | signboard | 33.65 | 61.62 | | chest of drawers | 39.35 | 49.51 | | counter | 21.51 | 27.63 | | sand | 40.9 | 75.04 | | sink | 72.46 | 80.67 | | skyscraper | 46.21 | 57.53 | | fireplace | 71.82 | 87.03 | | refrigerator | 73.09 | 88.29 | | grandstand | 45.84 | 82.17 | | path | 24.56 | 38.05 | | stairs | 39.81 | 48.46 | | runway | 70.89 | 92.0 | | case | 52.99 | 73.92 | | pool table | 92.72 | 95.41 | | pillow | 58.97 | 72.38 | | screen door | 4.39 | 4.39 | | stairway | 38.29 | 42.07 | | river | 16.62 | 30.34 | | bridge | 53.25 | 61.95 | | bookcase | 31.83 | 56.52 | | blind | 40.6 | 44.8 | | coffee table | 60.19 | 82.19 | | toilet | 83.66 | 87.68 | | flower | 31.98 | 44.97 | | book | 44.26 | 61.05 | | hill | 6.0 | 6.75 | | bench | 56.46 | 63.73 | | countertop | 60.5 | 76.95 | | stove | 72.76 | 78.83 | | palm | 37.43 | 42.29 | | kitchen island | 35.43 | 92.05 | | computer | 73.32 | 87.39 | | swivel chair | 44.8 | 58.95 | | boat | 62.98 | 66.16 | | bar | 42.22 | 54.05 | | arcade machine | 67.59 | 79.21 | | hovel | 35.33 | 40.41 | | bus | 92.01 | 94.83 | | towel | 62.03 | 77.14 | | light | 39.32 | 42.74 | | truck | 36.91 | 52.01 | | tower | 20.42 | 33.26 | | chandelier | 65.37 | 78.3 | | awning | 21.62 | 29.64 | | streetlight | 26.13 | 39.57 | | booth | 38.34 | 67.56 | | television receiver | 66.46 | 80.96 | | airplane | 62.17 | 69.11 | | dirt track | 9.49 | 23.01 | | apparel | 26.32 | 36.98 | | pole | 8.36 | 9.44 | | land | 3.2 | 6.17 | | bannister | 18.07 | 26.92 | | escalator | 30.65 | 36.91 | | ottoman | 46.31 | 63.47 | | bottle | 20.03 | 23.05 | | buffet | 53.49 | 66.46 | | poster | 32.93 | 63.85 | | stage | 11.11 | 16.67 | | van | 34.98 | 43.84 | | ship | 15.17 | 15.44 | | fountain | 26.22 | 26.57 | | conveyer belt | 48.98 | 93.32 | | canopy | 42.86 | 47.84 | | washer | 49.24 | 73.27 | | plaything | 33.82 | 36.03 | | swimming pool | 36.14 | 40.73 | | stool | 39.25 | 50.41 | | barrel | 21.35 | 63.36 | | basket | 28.02 | 32.48 | | waterfall | 73.22 | 78.35 | | tent | 94.34 | 96.51 | | bag | 24.15 | 38.0 | | minibike | 68.24 | 73.81 | | cradle | 81.46 | 94.85 | | oven | 24.74 | 27.38 | | ball | 39.58 | 46.36 | | food | 50.37 | 61.39 | | step | 19.04 | 22.43 | | tank | 38.27 | 44.38 | | trade name | 33.08 | 50.56 | | microwave | 54.99 | 60.11 | | pot | 44.92 | 51.62 | | animal | 64.67 | 68.99 | | bicycle | 57.25 | 73.69 | | lake | 31.7 | 83.41 | | dishwasher | 62.78 | 71.16 | | screen | 60.23 | 88.97 | | blanket | 22.12 | 25.98 | | sculpture | 57.06 | 73.41 | | hood | 57.33 | 60.21 | | sconce | 41.2 | 51.29 | | vase | 37.97 | 43.59 | | traffic light | 29.18 | 33.32 | | tray | 7.62 | 10.18 | | ashcan | 37.47 | 62.96 | | fan | 53.13 | 68.49 | | pier | 31.71 | 41.95 | | crt screen | 3.04 | 4.29 | | plate | 53.88 | 75.15 | | monitor | 56.98 | 67.0 | | bulletin board | 21.46 | 26.14 | | shower | 0.49 | 13.28 | | radiator | 60.07 | 66.63 | | glass | 15.37 | 16.73 | | clock | 21.27 | 28.65 | | flag | 29.42 | 31.72 | +---------------------+-------+-------+ 2024/01/03 20:16:54 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.4470 coco/bbox_mAP_50: 0.6330 coco/bbox_mAP_75: 0.4950 coco/bbox_mAP_s: 0.3030 coco/bbox_mAP_m: 0.5070 coco/bbox_mAP_l: 0.5730 coco/segm_mAP: 0.2830 coco/segm_mAP_50: 0.5310 coco/segm_mAP_75: 0.2700 coco/segm_mAP_s: 0.1520 coco/segm_mAP_m: 0.3330 coco/segm_mAP_l: 0.4420 Bleu_1: 0.7270 Bleu_2: 0.5554 Bleu_3: 0.4098 Bleu_4: 0.2973 METEOR: 0.2515 ROUGE_L: 0.5355 CIDEr: 0.9523 SPICE: 0.1866 aAcc: 83.0000 mIoU: 47.3700 mAcc: 59.1900 visual-grounding/miou: 0.7259 visual-grounding/acc: 0.7978 data_time: 0.0124 time: 1.9219 2024/01/03 20:28:42 - mmengine - INFO - Iter(train) [100500/640000] base_lr: 1.8808e-04 lr: 1.8808e-05 eta: 8 days, 18:29:32 time: 1.4145 data_time: 0.0193 memory: 34722 grad_norm: 2.5855 loss: 1.4712 caption_loss_cls: 2.4755 detection_loss_cls: 0.0409 detection_loss_reg: 0.3659 semantic_segmentation_loss_cls: 0.0114 grounding_loss_reg: 3.1715 instance_segmentation_loss_cls: 0.0417 instance_segmentation_loss_reg: 0.3699 instance_segmentation_loss_poly: 1.0304 2024/01/03 20:40:23 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 20:40:23 - mmengine - INFO - Iter(train) [101000/640000] base_lr: 1.8796e-04 lr: 1.8796e-05 eta: 8 days, 18:17:31 time: 1.4126 data_time: 0.0192 memory: 25629 grad_norm: 2.5653 loss: 1.4590 caption_loss_cls: 2.4846 detection_loss_cls: 0.0406 detection_loss_reg: 0.3643 semantic_segmentation_loss_cls: 0.0113 grounding_loss_reg: 3.1700 instance_segmentation_loss_cls: 0.0416 instance_segmentation_loss_reg: 0.3687 instance_segmentation_loss_poly: 1.0277 2024/01/03 20:51:53 - mmengine - INFO - Iter(train) [101500/640000] base_lr: 1.8784e-04 lr: 1.8784e-05 eta: 8 days, 18:03:28 time: 1.4104 data_time: 0.0192 memory: 25629 grad_norm: 2.5995 loss: 1.4592 caption_loss_cls: 2.4841 detection_loss_cls: 0.0406 detection_loss_reg: 0.3631 semantic_segmentation_loss_cls: 0.0112 grounding_loss_reg: 3.1648 instance_segmentation_loss_cls: 0.0415 instance_segmentation_loss_reg: 0.3673 instance_segmentation_loss_poly: 1.0230 2024/01/03 21:03:52 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 21:03:52 - mmengine - INFO - Iter(train) [102000/640000] base_lr: 1.8773e-04 lr: 1.8773e-05 eta: 8 days, 17:54:53 time: 1.4132 data_time: 0.0193 memory: 25629 grad_norm: 2.6106 loss: 1.4714 caption_loss_cls: 2.4870 detection_loss_cls: 0.0405 detection_loss_reg: 0.3644 semantic_segmentation_loss_cls: 0.0113 grounding_loss_reg: 3.1658 instance_segmentation_loss_cls: 0.0416 instance_segmentation_loss_reg: 0.3681 instance_segmentation_loss_poly: 1.0229 2024/01/03 21:03:52 - mmengine - INFO - Saving checkpoint at 102000 iterations 2024/01/03 21:15:58 - mmengine - INFO - Iter(train) [102500/640000] base_lr: 1.8761e-04 lr: 1.8761e-05 eta: 8 days, 17:47:31 time: 1.4101 data_time: 0.0192 memory: 25629 grad_norm: 2.6581 loss: 1.4855 caption_loss_cls: 2.4901 detection_loss_cls: 0.0404 detection_loss_reg: 0.3638 semantic_segmentation_loss_cls: 0.0113 grounding_loss_reg: 3.1642 instance_segmentation_loss_cls: 0.0416 instance_segmentation_loss_reg: 0.3686 instance_segmentation_loss_poly: 1.0237 2024/01/03 21:27:52 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 21:27:52 - mmengine - INFO - Iter(train) [103000/640000] base_lr: 1.8749e-04 lr: 1.8749e-05 eta: 8 days, 17:37:55 time: 1.4174 data_time: 0.0193 memory: 25629 grad_norm: 2.6419 loss: 1.4748 caption_loss_cls: 2.4942 detection_loss_cls: 0.0402 detection_loss_reg: 0.3622 semantic_segmentation_loss_cls: 0.0113 grounding_loss_reg: 3.1652 instance_segmentation_loss_cls: 0.0415 instance_segmentation_loss_reg: 0.3681 instance_segmentation_loss_poly: 1.0207 2024/01/03 21:39:44 - mmengine - INFO - Iter(train) [103500/640000] base_lr: 1.8737e-04 lr: 1.8737e-05 eta: 8 days, 17:27:50 time: 1.4153 data_time: 0.0191 memory: 25629 grad_norm: 2.6488 loss: 1.4549 caption_loss_cls: 2.5003 detection_loss_cls: 0.0404 detection_loss_reg: 0.3638 semantic_segmentation_loss_cls: 0.0113 grounding_loss_reg: 3.1613 instance_segmentation_loss_cls: 0.0414 instance_segmentation_loss_reg: 0.3674 instance_segmentation_loss_poly: 1.0177 2024/01/03 21:51:40 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 21:51:40 - mmengine - INFO - Iter(train) [104000/640000] base_lr: 1.8725e-04 lr: 1.8725e-05 eta: 8 days, 17:18:31 time: 1.4219 data_time: 0.0191 memory: 25629 grad_norm: 2.6153 loss: 1.4548 caption_loss_cls: 2.5096 detection_loss_cls: 0.0405 detection_loss_reg: 0.3651 semantic_segmentation_loss_cls: 0.0114 grounding_loss_reg: 3.1558 instance_segmentation_loss_cls: 0.0414 instance_segmentation_loss_reg: 0.3678 instance_segmentation_loss_poly: 1.0194 2024/01/03 21:51:40 - mmengine - INFO - Saving checkpoint at 104000 iterations 2024/01/03 22:03:57 - mmengine - INFO - Iter(train) [104500/640000] base_lr: 1.8713e-04 lr: 1.8713e-05 eta: 8 days, 17:12:48 time: 1.4285 data_time: 0.0258 memory: 25629 grad_norm: 2.6476 loss: 1.4704 caption_loss_cls: 2.5111 detection_loss_cls: 0.0404 detection_loss_reg: 0.3643 semantic_segmentation_loss_cls: 0.0113 grounding_loss_reg: 3.1553 instance_segmentation_loss_cls: 0.0413 instance_segmentation_loss_reg: 0.3676 instance_segmentation_loss_poly: 1.0192 2024/01/03 22:15:11 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 22:15:11 - mmengine - INFO - Iter(train) [105000/640000] base_lr: 1.8701e-04 lr: 1.8701e-05 eta: 8 days, 16:55:54 time: 1.4217 data_time: 0.0258 memory: 25629 grad_norm: 2.6673 loss: 1.4830 caption_loss_cls: 2.5099 detection_loss_cls: 0.0405 detection_loss_reg: 0.3657 semantic_segmentation_loss_cls: 0.0113 grounding_loss_reg: 3.1483 instance_segmentation_loss_cls: 0.0413 instance_segmentation_loss_reg: 0.3676 instance_segmentation_loss_poly: 1.0199 2024/01/03 22:26:43 - mmengine - INFO - Iter(train) [105500/640000] base_lr: 1.8689e-04 lr: 1.8689e-05 eta: 8 days, 16:42:20 time: 1.4223 data_time: 0.0258 memory: 25629 grad_norm: 2.6525 loss: 1.4885 caption_loss_cls: 2.5096 detection_loss_cls: 0.0407 detection_loss_reg: 0.3673 semantic_segmentation_loss_cls: 0.0113 grounding_loss_reg: 3.1442 instance_segmentation_loss_cls: 0.0411 instance_segmentation_loss_reg: 0.3670 instance_segmentation_loss_poly: 1.0174 2024/01/03 22:38:25 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 22:38:25 - mmengine - INFO - Iter(train) [106000/640000] base_lr: 1.8677e-04 lr: 1.8677e-05 eta: 8 days, 16:30:23 time: 1.4180 data_time: 0.0257 memory: 25629 grad_norm: 2.6577 loss: 1.4716 caption_loss_cls: 2.5006 detection_loss_cls: 0.0405 detection_loss_reg: 0.3659 semantic_segmentation_loss_cls: 0.0113 grounding_loss_reg: 3.1400 instance_segmentation_loss_cls: 0.0410 instance_segmentation_loss_reg: 0.3661 instance_segmentation_loss_poly: 1.0157 2024/01/03 22:38:25 - mmengine - INFO - Saving checkpoint at 106000 iterations 2024/01/03 22:50:15 - mmengine - INFO - Iter(train) [106500/640000] base_lr: 1.8664e-04 lr: 1.8664e-05 eta: 8 days, 16:19:50 time: 1.4139 data_time: 0.0256 memory: 25629 grad_norm: 2.6654 loss: 1.4668 caption_loss_cls: 2.5039 detection_loss_cls: 0.0407 detection_loss_reg: 0.3674 semantic_segmentation_loss_cls: 0.0113 grounding_loss_reg: 3.1310 instance_segmentation_loss_cls: 0.0411 instance_segmentation_loss_reg: 0.3672 instance_segmentation_loss_poly: 1.0162 2024/01/03 23:02:08 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 23:02:08 - mmengine - INFO - Iter(train) [107000/640000] base_lr: 1.8652e-04 lr: 1.8652e-05 eta: 8 days, 16:09:48 time: 1.4137 data_time: 0.0256 memory: 25629 grad_norm: 2.6622 loss: 1.4689 caption_loss_cls: 2.4928 detection_loss_cls: 0.0411 detection_loss_reg: 0.3698 semantic_segmentation_loss_cls: 0.0113 grounding_loss_reg: 3.1318 instance_segmentation_loss_cls: 0.0411 instance_segmentation_loss_reg: 0.3666 instance_segmentation_loss_poly: 1.0157 2024/01/03 23:13:41 - mmengine - INFO - Iter(train) [107500/640000] base_lr: 1.8640e-04 lr: 1.8640e-05 eta: 8 days, 15:56:33 time: 1.4092 data_time: 0.0257 memory: 25629 grad_norm: 2.6720 loss: 1.4752 caption_loss_cls: 2.4901 detection_loss_cls: 0.0410 detection_loss_reg: 0.3694 semantic_segmentation_loss_cls: 0.0112 grounding_loss_reg: 3.1263 instance_segmentation_loss_cls: 0.0409 instance_segmentation_loss_reg: 0.3648 instance_segmentation_loss_poly: 1.0113 2024/01/03 23:25:25 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 23:25:25 - mmengine - INFO - Iter(train) [108000/640000] base_lr: 1.8627e-04 lr: 1.8627e-05 eta: 8 days, 15:44:53 time: 1.4059 data_time: 0.0257 memory: 25629 grad_norm: 2.6695 loss: 1.4588 caption_loss_cls: 2.4809 detection_loss_cls: 0.0410 detection_loss_reg: 0.3697 semantic_segmentation_loss_cls: 0.0112 grounding_loss_reg: 3.1230 instance_segmentation_loss_cls: 0.0410 instance_segmentation_loss_reg: 0.3661 instance_segmentation_loss_poly: 1.0140 2024/01/03 23:25:25 - mmengine - INFO - Saving checkpoint at 108000 iterations 2024/01/03 23:37:44 - mmengine - INFO - Iter(train) [108500/640000] base_lr: 1.8615e-04 lr: 1.8615e-05 eta: 8 days, 15:39:03 time: 1.4065 data_time: 0.0256 memory: 25629 grad_norm: 2.6349 loss: 1.4388 caption_loss_cls: 2.4724 detection_loss_cls: 0.0408 detection_loss_reg: 0.3673 semantic_segmentation_loss_cls: 0.0112 grounding_loss_reg: 3.1205 instance_segmentation_loss_cls: 0.0411 instance_segmentation_loss_reg: 0.3663 instance_segmentation_loss_poly: 1.0150 2024/01/03 23:49:12 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/03 23:49:12 - mmengine - INFO - Iter(train) [109000/640000] base_lr: 1.8602e-04 lr: 1.8602e-05 eta: 8 days, 15:24:47 time: 1.4100 data_time: 0.0256 memory: 25629 grad_norm: 2.6383 loss: 1.4297 caption_loss_cls: 2.4739 detection_loss_cls: 0.0407 detection_loss_reg: 0.3672 semantic_segmentation_loss_cls: 0.0112 grounding_loss_reg: 3.1163 instance_segmentation_loss_cls: 0.0410 instance_segmentation_loss_reg: 0.3649 instance_segmentation_loss_poly: 1.0109 2024/01/04 00:00:45 - mmengine - INFO - Iter(train) [109500/640000] base_lr: 1.8590e-04 lr: 1.8590e-05 eta: 8 days, 15:11:25 time: 1.4101 data_time: 0.0256 memory: 25629 grad_norm: 2.6544 loss: 1.4272 caption_loss_cls: 2.4775 detection_loss_cls: 0.0407 detection_loss_reg: 0.3665 semantic_segmentation_loss_cls: 0.0112 grounding_loss_reg: 3.1174 instance_segmentation_loss_cls: 0.0408 instance_segmentation_loss_reg: 0.3634 instance_segmentation_loss_poly: 1.0075 2024/01/04 00:12:32 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/04 00:12:32 - mmengine - INFO - Iter(train) [110000/640000] base_lr: 1.8577e-04 lr: 1.8577e-05 eta: 8 days, 15:00:22 time: 1.4115 data_time: 0.0257 memory: 25629 grad_norm: 2.7077 loss: 1.4437 caption_loss_cls: 2.4758 detection_loss_cls: 0.0407 detection_loss_reg: 0.3665 semantic_segmentation_loss_cls: 0.0112 grounding_loss_reg: 3.1183 instance_segmentation_loss_cls: 0.0407 instance_segmentation_loss_reg: 0.3636 instance_segmentation_loss_poly: 1.0069 2024/01/04 00:12:32 - mmengine - INFO - Saving checkpoint at 110000 iterations 2024/01/04 00:24:22 - mmengine - INFO - Iter(train) [110500/640000] base_lr: 1.8565e-04 lr: 1.8565e-05 eta: 8 days, 14:49:44 time: 1.4116 data_time: 0.0259 memory: 25629 grad_norm: 2.7242 loss: 1.4613 caption_loss_cls: 2.4755 detection_loss_cls: 0.0406 detection_loss_reg: 0.3657 semantic_segmentation_loss_cls: 0.0112 grounding_loss_reg: 3.1151 instance_segmentation_loss_cls: 0.0404 instance_segmentation_loss_reg: 0.3628 instance_segmentation_loss_poly: 1.0041 2024/01/04 00:35:59 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/04 00:35:59 - mmengine - INFO - Iter(train) [111000/640000] base_lr: 1.8552e-04 lr: 1.8552e-05 eta: 8 days, 14:37:05 time: 1.4076 data_time: 0.0257 memory: 25629 grad_norm: 2.7109 loss: 1.4560 caption_loss_cls: 2.4772 detection_loss_cls: 0.0408 detection_loss_reg: 0.3657 semantic_segmentation_loss_cls: 0.0112 grounding_loss_reg: 3.1146 instance_segmentation_loss_cls: 0.0403 instance_segmentation_loss_reg: 0.3613 instance_segmentation_loss_poly: 1.0003 2024/01/04 00:47:31 - mmengine - INFO - Iter(train) [111500/640000] base_lr: 1.8539e-04 lr: 1.8539e-05 eta: 8 days, 14:23:37 time: 1.4071 data_time: 0.0257 memory: 25629 grad_norm: 2.7279 loss: 1.4705 caption_loss_cls: 2.4799 detection_loss_cls: 0.0407 detection_loss_reg: 0.3651 semantic_segmentation_loss_cls: 0.0112 grounding_loss_reg: 3.1140 instance_segmentation_loss_cls: 0.0402 instance_segmentation_loss_reg: 0.3621 instance_segmentation_loss_poly: 1.0021 2024/01/04 00:59:12 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240103_010759 2024/01/04 00:59:12 - mmengine - INFO - Iter(train) [112000/640000] base_lr: 1.8526e-04 lr: 1.8526e-05 eta: 8 days, 14:11:37 time: 1.4066 data_time: 0.0258 memory: 25629 grad_norm: 2.7890 loss: 1.4840 caption_loss_cls: 2.4807 detection_loss_cls: 0.0407 detection_loss_reg: 0.3650 semantic_segmentation_loss_cls: 0.0111 grounding_loss_reg: 3.1046 instance_segmentation_loss_cls: 0.0403 instance_segmentation_loss_reg: 0.3629 instance_segmentation_loss_poly: 1.0055 2024/01/04 00:59:12 - mmengine - INFO - Saving checkpoint at 112000 iterations 2024/01/04 01:26:11 - mmengine - INFO - Iter(train) [112500/640000] base_lr: 1.8514e-04 lr: 1.8514e-05 eta: 8 days, 10:17:55 time: 1.3944 data_time: 0.0185 memory: 25565 grad_norm: 2.7903 loss: 1.4896 caption_loss_cls: 2.4775 detection_loss_cls: 0.0407 detection_loss_reg: 0.3665 semantic_segmentation_loss_cls: 0.0111 grounding_loss_reg: 3.1033 instance_segmentation_loss_cls: 0.0401 instance_segmentation_loss_reg: 0.3624 instance_segmentation_loss_poly: 1.0043 2024/01/04 01:37:25 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 01:37:25 - mmengine - INFO - Iter(train) [113000/640000] base_lr: 1.8501e-04 lr: 1.8501e-05 eta: 8 days, 7:49:51 time: 1.3912 data_time: 0.0182 memory: 25565 grad_norm: 2.7908 loss: 1.4860 caption_loss_cls: 2.4685 detection_loss_cls: 0.0407 detection_loss_reg: 0.3671 semantic_segmentation_loss_cls: 0.0111 grounding_loss_reg: 3.1016 instance_segmentation_loss_cls: 0.0401 instance_segmentation_loss_reg: 0.3629 instance_segmentation_loss_poly: 1.0039 2024/01/04 01:48:47 - mmengine - INFO - Iter(train) [113500/640000] base_lr: 1.8488e-04 lr: 1.8488e-05 eta: 8 days, 7:33:32 time: 1.3884 data_time: 0.0179 memory: 25565 grad_norm: 2.7743 loss: 1.4824 caption_loss_cls: 2.4663 detection_loss_cls: 0.0406 detection_loss_reg: 0.3665 semantic_segmentation_loss_cls: 0.0111 grounding_loss_reg: 3.1024 instance_segmentation_loss_cls: 0.0397 instance_segmentation_loss_reg: 0.3612 instance_segmentation_loss_poly: 0.9991 2024/01/04 02:00:40 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 02:00:40 - mmengine - INFO - Iter(train) [114000/640000] base_lr: 1.8475e-04 lr: 1.8475e-05 eta: 8 days, 9:34:40 time: 1.3898 data_time: 0.0177 memory: 25565 grad_norm: 2.7589 loss: 1.4800 caption_loss_cls: 2.4689 detection_loss_cls: 0.0404 detection_loss_reg: 0.3652 semantic_segmentation_loss_cls: 0.0111 grounding_loss_reg: 3.1002 instance_segmentation_loss_cls: 0.0396 instance_segmentation_loss_reg: 0.3616 instance_segmentation_loss_poly: 0.9998 2024/01/04 02:00:40 - mmengine - INFO - Saving checkpoint at 114000 iterations 2024/01/04 02:12:41 - mmengine - INFO - Iter(train) [114500/640000] base_lr: 1.8462e-04 lr: 1.8462e-05 eta: 8 days, 11:13:45 time: 1.3926 data_time: 0.0167 memory: 25565 grad_norm: 2.7526 loss: 1.4644 caption_loss_cls: 2.4625 detection_loss_cls: 0.0402 detection_loss_reg: 0.3641 semantic_segmentation_loss_cls: 0.0111 grounding_loss_reg: 3.0929 instance_segmentation_loss_cls: 0.0399 instance_segmentation_loss_reg: 0.3635 instance_segmentation_loss_poly: 1.0033 2024/01/04 02:24:37 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 02:24:37 - mmengine - INFO - Iter(train) [115000/640000] base_lr: 1.8449e-04 lr: 1.8449e-05 eta: 8 days, 11:57:22 time: 1.3971 data_time: 0.0165 memory: 25565 grad_norm: 2.7794 loss: 1.4679 caption_loss_cls: 2.4605 detection_loss_cls: 0.0404 detection_loss_reg: 0.3647 semantic_segmentation_loss_cls: 0.0110 grounding_loss_reg: 3.0888 instance_segmentation_loss_cls: 0.0398 instance_segmentation_loss_reg: 0.3622 instance_segmentation_loss_poly: 1.0010 2024/01/04 02:36:22 - mmengine - INFO - Iter(train) [115500/640000] base_lr: 1.8435e-04 lr: 1.8435e-05 eta: 8 days, 12:01:09 time: 1.4006 data_time: 0.0164 memory: 25565 grad_norm: 2.7586 loss: 1.4594 caption_loss_cls: 2.4591 detection_loss_cls: 0.0402 detection_loss_reg: 0.3635 semantic_segmentation_loss_cls: 0.0110 grounding_loss_reg: 3.0876 instance_segmentation_loss_cls: 0.0395 instance_segmentation_loss_reg: 0.3605 instance_segmentation_loss_poly: 0.9962 2024/01/04 02:48:21 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 02:48:21 - mmengine - INFO - Iter(train) [116000/640000] base_lr: 1.8422e-04 lr: 1.8422e-05 eta: 8 days, 12:29:37 time: 1.4049 data_time: 0.0161 memory: 25565 grad_norm: 2.6878 loss: 1.4413 caption_loss_cls: 2.4590 detection_loss_cls: 0.0402 detection_loss_reg: 0.3637 semantic_segmentation_loss_cls: 0.0110 grounding_loss_reg: 3.0867 instance_segmentation_loss_cls: 0.0396 instance_segmentation_loss_reg: 0.3615 instance_segmentation_loss_poly: 0.9972 2024/01/04 02:48:21 - mmengine - INFO - Saving checkpoint at 116000 iterations 2024/01/04 03:00:13 - mmengine - INFO - Iter(train) [116500/640000] base_lr: 1.8409e-04 lr: 1.8409e-05 eta: 8 days, 12:35:47 time: 1.4103 data_time: 0.0224 memory: 25565 grad_norm: 2.7494 loss: 1.4553 caption_loss_cls: 2.4553 detection_loss_cls: 0.0398 detection_loss_reg: 0.3622 semantic_segmentation_loss_cls: 0.0109 grounding_loss_reg: 3.0872 instance_segmentation_loss_cls: 0.0395 instance_segmentation_loss_reg: 0.3614 instance_segmentation_loss_poly: 0.9977 2024/01/04 03:12:19 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 03:12:19 - mmengine - INFO - Iter(train) [117000/640000] base_lr: 1.8396e-04 lr: 1.8396e-05 eta: 8 days, 13:03:39 time: 1.4231 data_time: 0.0224 memory: 25565 grad_norm: 2.7230 loss: 1.4373 caption_loss_cls: 2.4533 detection_loss_cls: 0.0397 detection_loss_reg: 0.3626 semantic_segmentation_loss_cls: 0.0110 grounding_loss_reg: 3.0850 instance_segmentation_loss_cls: 0.0393 instance_segmentation_loss_reg: 0.3597 instance_segmentation_loss_poly: 0.9934 2024/01/04 03:25:08 - mmengine - INFO - Iter(train) [117500/640000] base_lr: 1.8382e-04 lr: 1.8382e-05 eta: 8 days, 14:32:09 time: 1.4450 data_time: 0.0226 memory: 25565 grad_norm: 2.7075 loss: 1.4420 caption_loss_cls: 2.4504 detection_loss_cls: 0.0399 detection_loss_reg: 0.3632 semantic_segmentation_loss_cls: 0.0108 grounding_loss_reg: 3.0784 instance_segmentation_loss_cls: 0.0393 instance_segmentation_loss_reg: 0.3607 instance_segmentation_loss_poly: 0.9949 2024/01/04 03:36:47 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 03:36:47 - mmengine - INFO - Iter(train) [118000/640000] base_lr: 1.8369e-04 lr: 1.8369e-05 eta: 8 days, 14:01:48 time: 1.4415 data_time: 0.0226 memory: 25565 grad_norm: 2.6986 loss: 1.4348 caption_loss_cls: 2.4451 detection_loss_cls: 0.0396 detection_loss_reg: 0.3612 semantic_segmentation_loss_cls: 0.0107 grounding_loss_reg: 3.0762 instance_segmentation_loss_cls: 0.0393 instance_segmentation_loss_reg: 0.3611 instance_segmentation_loss_poly: 0.9952 2024/01/04 03:36:47 - mmengine - INFO - Saving checkpoint at 118000 iterations 2024/01/04 03:48:54 - mmengine - INFO - Iter(train) [118500/640000] base_lr: 1.8355e-04 lr: 1.8355e-05 eta: 8 days, 14:12:24 time: 1.4430 data_time: 0.0223 memory: 25565 grad_norm: 2.6627 loss: 1.4226 caption_loss_cls: 2.4432 detection_loss_cls: 0.0394 detection_loss_reg: 0.3600 semantic_segmentation_loss_cls: 0.0107 grounding_loss_reg: 3.0747 instance_segmentation_loss_cls: 0.0389 instance_segmentation_loss_reg: 0.3587 instance_segmentation_loss_poly: 0.9898 2024/01/04 04:00:54 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 04:00:54 - mmengine - INFO - Iter(train) [119000/640000] base_lr: 1.8342e-04 lr: 1.8342e-05 eta: 8 days, 14:10:08 time: 1.4441 data_time: 0.0222 memory: 25565 grad_norm: 2.6526 loss: 1.4247 caption_loss_cls: 2.4418 detection_loss_cls: 0.0395 detection_loss_reg: 0.3610 semantic_segmentation_loss_cls: 0.0107 grounding_loss_reg: 3.0760 instance_segmentation_loss_cls: 0.0391 instance_segmentation_loss_reg: 0.3588 instance_segmentation_loss_poly: 0.9897 2024/01/04 04:12:19 - mmengine - INFO - Iter(train) [119500/640000] base_lr: 1.8328e-04 lr: 1.8328e-05 eta: 8 days, 13:26:31 time: 1.4390 data_time: 0.0222 memory: 25565 grad_norm: 2.6821 loss: 1.4288 caption_loss_cls: 2.4381 detection_loss_cls: 0.0394 detection_loss_reg: 0.3598 semantic_segmentation_loss_cls: 0.0107 grounding_loss_reg: 3.0728 instance_segmentation_loss_cls: 0.0391 instance_segmentation_loss_reg: 0.3586 instance_segmentation_loss_poly: 0.9881 2024/01/04 04:24:14 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 04:24:14 - mmengine - INFO - Iter(train) [120000/640000] base_lr: 1.8315e-04 lr: 1.8315e-05 eta: 8 days, 13:19:12 time: 1.4380 data_time: 0.0222 memory: 25565 grad_norm: 2.6743 loss: 1.4397 caption_loss_cls: 2.4413 detection_loss_cls: 0.0390 detection_loss_reg: 0.3570 semantic_segmentation_loss_cls: 0.0107 grounding_loss_reg: 3.0673 instance_segmentation_loss_cls: 0.0391 instance_segmentation_loss_reg: 0.3598 instance_segmentation_loss_poly: 0.9899 2024/01/04 04:24:14 - mmengine - INFO - Saving checkpoint at 120000 iterations 2024/01/04 04:35:40 - mmengine - INFO - Evaluating bbox... 2024/01/04 04:36:38 - mmengine - INFO - bbox_mAP_copypaste: 0.460 0.645 0.510 0.315 0.515 0.581 2024/01/04 04:36:38 - mmengine - INFO - Evaluating segm... 2024/01/04 04:37:51 - mmengine - INFO - segm_mAP_copypaste: 0.292 0.548 0.280 0.163 0.339 0.442 2024/01/04 04:40:00 - mmengine - INFO - Evaluating bbox... 2024/01/04 04:40:59 - mmengine - INFO - bbox_mAP_copypaste: 0.460 0.644 0.509 0.314 0.514 0.582 2024/01/04 04:46:21 - mmengine - INFO - per class results: 2024/01/04 04:46:21 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 78.33 | 90.37 | | building | 80.35 | 86.2 | | sky | 92.14 | 97.82 | | floor | 80.95 | 90.19 | | tree | 72.62 | 90.54 | | ceiling | 83.9 | 90.42 | | road | 84.7 | 90.7 | | bed | 87.94 | 95.48 | | windowpane | 63.35 | 77.28 | | grass | 66.89 | 92.17 | | cabinet | 59.24 | 72.37 | | sidewalk | 69.0 | 80.07 | | person | 80.73 | 90.38 | | earth | 43.4 | 62.75 | | door | 53.77 | 70.11 | | table | 59.38 | 80.86 | | mountain | 61.14 | 70.1 | | plant | 49.45 | 55.1 | | curtain | 73.46 | 86.09 | | chair | 57.83 | 69.57 | | car | 84.38 | 91.54 | | water | 61.6 | 85.96 | | painting | 72.54 | 81.28 | | sofa | 66.24 | 77.27 | | shelf | 39.07 | 52.87 | | house | 41.22 | 83.26 | | sea | 54.53 | 60.62 | | mirror | 68.77 | 77.5 | | rug | 65.28 | 76.49 | | field | 20.58 | 23.97 | | armchair | 45.27 | 71.63 | | seat | 63.33 | 70.16 | | fence | 40.99 | 54.11 | | desk | 47.75 | 68.51 | | rock | 54.44 | 70.13 | | wardrobe | 50.31 | 63.85 | | lamp | 59.3 | 72.71 | | bathtub | 72.23 | 81.07 | | railing | 34.57 | 45.33 | | cushion | 49.8 | 55.05 | | base | 27.52 | 50.08 | | box | 28.94 | 40.0 | | column | 50.39 | 69.08 | | signboard | 35.58 | 46.5 | | chest of drawers | 44.87 | 50.4 | | counter | 32.88 | 60.23 | | sand | 52.47 | 55.65 | | sink | 73.71 | 81.66 | | skyscraper | 44.63 | 55.23 | | fireplace | 72.17 | 85.26 | | refrigerator | 71.47 | 79.18 | | grandstand | 40.94 | 83.59 | | path | 17.61 | 28.8 | | stairs | 37.13 | 42.42 | | runway | 67.8 | 82.69 | | case | 42.94 | 56.87 | | pool table | 91.8 | 95.16 | | pillow | 50.89 | 59.76 | | screen door | 57.03 | 57.71 | | stairway | 37.93 | 54.93 | | river | 8.17 | 10.68 | | bridge | 58.59 | 74.04 | | bookcase | 34.1 | 53.04 | | blind | 38.51 | 41.82 | | coffee table | 61.06 | 75.08 | | toilet | 83.7 | 89.49 | | flower | 36.05 | 54.98 | | book | 47.14 | 65.53 | | hill | 9.22 | 22.42 | | bench | 59.79 | 74.0 | | countertop | 47.54 | 82.25 | | stove | 75.57 | 81.52 | | palm | 47.18 | 58.8 | | kitchen island | 34.58 | 69.97 | | computer | 73.45 | 85.6 | | swivel chair | 45.0 | 65.35 | | boat | 66.69 | 80.76 | | bar | 39.15 | 45.54 | | arcade machine | 60.99 | 63.21 | | hovel | 7.03 | 7.34 | | bus | 91.74 | 93.88 | | towel | 62.73 | 74.32 | | light | 50.99 | 68.64 | | truck | 41.77 | 53.02 | | tower | 27.88 | 43.74 | | chandelier | 63.59 | 81.06 | | awning | 29.01 | 37.95 | | streetlight | 21.36 | 26.58 | | booth | 31.93 | 35.16 | | television receiver | 68.09 | 79.79 | | airplane | 77.93 | 87.73 | | dirt track | 0.0 | 0.0 | | apparel | 29.03 | 42.41 | | pole | 14.01 | 18.14 | | land | 2.06 | 3.16 | | bannister | 16.25 | 25.9 | | escalator | 20.77 | 22.8 | | ottoman | 39.4 | 72.92 | | bottle | 20.31 | 24.12 | | buffet | 26.98 | 27.22 | | poster | 29.58 | 67.83 | | stage | 11.84 | 17.34 | | van | 39.31 | 48.99 | | ship | 3.15 | 3.22 | | fountain | 29.74 | 32.77 | | conveyer belt | 76.99 | 84.59 | | canopy | 30.64 | 44.84 | | washer | 67.43 | 72.24 | | plaything | 32.95 | 39.78 | | swimming pool | 40.59 | 40.77 | | stool | 44.32 | 56.7 | | barrel | 29.19 | 64.6 | | basket | 34.05 | 49.88 | | waterfall | 74.45 | 89.91 | | tent | 83.34 | 96.99 | | bag | 24.4 | 31.94 | | minibike | 72.08 | 81.61 | | cradle | 77.75 | 89.07 | | oven | 34.92 | 43.84 | | ball | 47.47 | 57.73 | | food | 37.77 | 38.79 | | step | 15.43 | 22.56 | | tank | 27.57 | 27.75 | | trade name | 25.16 | 29.66 | | microwave | 83.33 | 94.13 | | pot | 47.58 | 58.51 | | animal | 59.39 | 62.55 | | bicycle | 55.69 | 71.38 | | lake | 48.03 | 61.15 | | dishwasher | 59.19 | 64.45 | | screen | 75.93 | 89.26 | | blanket | 22.82 | 26.73 | | sculpture | 62.69 | 78.42 | | hood | 63.72 | 67.65 | | sconce | 38.48 | 52.08 | | vase | 41.23 | 57.1 | | traffic light | 34.1 | 53.84 | | tray | 8.14 | 10.01 | | ashcan | 37.87 | 51.76 | | fan | 52.6 | 77.58 | | pier | 30.56 | 42.37 | | crt screen | 6.54 | 9.61 | | plate | 55.35 | 69.83 | | monitor | 34.58 | 39.19 | | bulletin board | 38.75 | 55.09 | | shower | 0.72 | 0.8 | | radiator | 62.91 | 74.73 | | glass | 16.83 | 18.15 | | clock | 26.28 | 36.37 | | flag | 30.6 | 35.66 | +---------------------+-------+-------+ 2024/01/04 04:46:34 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.4600 coco/bbox_mAP_50: 0.6440 coco/bbox_mAP_75: 0.5090 coco/bbox_mAP_s: 0.3140 coco/bbox_mAP_m: 0.5140 coco/bbox_mAP_l: 0.5820 coco/segm_mAP: 0.2920 coco/segm_mAP_50: 0.5480 coco/segm_mAP_75: 0.2800 coco/segm_mAP_s: 0.1630 coco/segm_mAP_m: 0.3390 coco/segm_mAP_l: 0.4420 Bleu_1: 0.7286 Bleu_2: 0.5593 Bleu_3: 0.4165 Bleu_4: 0.3070 METEOR: 0.2569 ROUGE_L: 0.5400 CIDEr: 0.9828 SPICE: 0.1881 aAcc: 82.8800 mIoU: 48.1500 mAcc: 59.6600 visual-grounding/miou: 0.7437 visual-grounding/acc: 0.8163 data_time: 0.0269 time: 1.9186 2024/01/04 04:58:09 - mmengine - INFO - Iter(train) [120500/640000] base_lr: 1.8301e-04 lr: 1.8301e-05 eta: 8 days, 12:53:00 time: 1.4342 data_time: 0.0169 memory: 34658 grad_norm: 2.6538 loss: 1.4413 caption_loss_cls: 2.4394 detection_loss_cls: 0.0391 detection_loss_reg: 0.3584 semantic_segmentation_loss_cls: 0.0107 grounding_loss_reg: 3.0620 instance_segmentation_loss_cls: 0.0392 instance_segmentation_loss_reg: 0.3604 instance_segmentation_loss_poly: 0.9909 2024/01/04 05:09:43 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 05:09:43 - mmengine - INFO - Iter(train) [121000/640000] base_lr: 1.8287e-04 lr: 1.8287e-05 eta: 8 days, 12:26:05 time: 1.4262 data_time: 0.0172 memory: 25565 grad_norm: 2.6533 loss: 1.4566 caption_loss_cls: 2.4329 detection_loss_cls: 0.0390 detection_loss_reg: 0.3571 semantic_segmentation_loss_cls: 0.0107 grounding_loss_reg: 3.0610 instance_segmentation_loss_cls: 0.0393 instance_segmentation_loss_reg: 0.3617 instance_segmentation_loss_poly: 0.9929 2024/01/04 05:21:18 - mmengine - INFO - Iter(train) [121500/640000] base_lr: 1.8274e-04 lr: 1.8274e-05 eta: 8 days, 12:01:57 time: 1.4078 data_time: 0.0171 memory: 25565 grad_norm: 2.6808 loss: 1.4368 caption_loss_cls: 2.4285 detection_loss_cls: 0.0391 detection_loss_reg: 0.3582 semantic_segmentation_loss_cls: 0.0107 grounding_loss_reg: 3.0576 instance_segmentation_loss_cls: 0.0393 instance_segmentation_loss_reg: 0.3622 instance_segmentation_loss_poly: 0.9939 2024/01/04 05:32:53 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 05:32:53 - mmengine - INFO - Iter(train) [122000/640000] base_lr: 1.8260e-04 lr: 1.8260e-05 eta: 8 days, 11:38:25 time: 1.4068 data_time: 0.0174 memory: 25565 grad_norm: 2.6783 loss: 1.4553 caption_loss_cls: 2.4328 detection_loss_cls: 0.0394 detection_loss_reg: 0.3594 semantic_segmentation_loss_cls: 0.0107 grounding_loss_reg: 3.0550 instance_segmentation_loss_cls: 0.0393 instance_segmentation_loss_reg: 0.3624 instance_segmentation_loss_poly: 0.9947 2024/01/04 05:32:53 - mmengine - INFO - Saving checkpoint at 122000 iterations 2024/01/04 05:45:11 - mmengine - INFO - Iter(train) [122500/640000] base_lr: 1.8246e-04 lr: 1.8246e-05 eta: 8 days, 11:51:15 time: 1.4094 data_time: 0.0192 memory: 25565 grad_norm: 2.6525 loss: 1.4482 caption_loss_cls: 2.4328 detection_loss_cls: 0.0393 detection_loss_reg: 0.3600 semantic_segmentation_loss_cls: 0.0107 grounding_loss_reg: 3.0504 instance_segmentation_loss_cls: 0.0390 instance_segmentation_loss_reg: 0.3602 instance_segmentation_loss_poly: 0.9907 2024/01/04 05:56:44 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 05:56:44 - mmengine - INFO - Iter(train) [123000/640000] base_lr: 1.8232e-04 lr: 1.8232e-05 eta: 8 days, 11:26:23 time: 1.4026 data_time: 0.0194 memory: 25565 grad_norm: 2.6481 loss: 1.4494 caption_loss_cls: 2.4302 detection_loss_cls: 0.0395 detection_loss_reg: 0.3611 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 3.0498 instance_segmentation_loss_cls: 0.0390 instance_segmentation_loss_reg: 0.3606 instance_segmentation_loss_poly: 0.9922 2024/01/04 06:07:54 - mmengine - INFO - Iter(train) [123500/640000] base_lr: 1.8218e-04 lr: 1.8218e-05 eta: 8 days, 10:46:21 time: 1.3991 data_time: 0.0196 memory: 25565 grad_norm: 2.6592 loss: 1.4562 caption_loss_cls: 2.4281 detection_loss_cls: 0.0394 detection_loss_reg: 0.3609 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 3.0465 instance_segmentation_loss_cls: 0.0391 instance_segmentation_loss_reg: 0.3608 instance_segmentation_loss_poly: 0.9944 2024/01/04 06:19:43 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 06:19:43 - mmengine - INFO - Iter(train) [124000/640000] base_lr: 1.8204e-04 lr: 1.8204e-05 eta: 8 days, 10:35:52 time: 1.3975 data_time: 0.0198 memory: 25565 grad_norm: 2.7018 loss: 1.4622 caption_loss_cls: 2.4183 detection_loss_cls: 0.0395 detection_loss_reg: 0.3605 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 3.0444 instance_segmentation_loss_cls: 0.0392 instance_segmentation_loss_reg: 0.3621 instance_segmentation_loss_poly: 0.9959 2024/01/04 06:19:43 - mmengine - INFO - Saving checkpoint at 124000 iterations 2024/01/04 06:31:15 - mmengine - INFO - Iter(train) [124500/640000] base_lr: 1.8190e-04 lr: 1.8190e-05 eta: 8 days, 10:14:12 time: 1.3965 data_time: 0.0261 memory: 25565 grad_norm: 2.7421 loss: 1.4687 caption_loss_cls: 2.4144 detection_loss_cls: 0.0396 detection_loss_reg: 0.3620 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 3.0417 instance_segmentation_loss_cls: 0.0394 instance_segmentation_loss_reg: 0.3642 instance_segmentation_loss_poly: 1.0012 2024/01/04 06:43:01 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 06:43:01 - mmengine - INFO - Iter(train) [125000/640000] base_lr: 1.8176e-04 lr: 1.8176e-05 eta: 8 days, 10:02:06 time: 1.3993 data_time: 0.0261 memory: 25565 grad_norm: 2.7348 loss: 1.4522 caption_loss_cls: 2.4124 detection_loss_cls: 0.0394 detection_loss_reg: 0.3600 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 3.0381 instance_segmentation_loss_cls: 0.0395 instance_segmentation_loss_reg: 0.3643 instance_segmentation_loss_poly: 1.0001 2024/01/04 06:54:24 - mmengine - INFO - Iter(train) [125500/640000] base_lr: 1.8162e-04 lr: 1.8162e-05 eta: 8 days, 9:35:39 time: 1.3962 data_time: 0.0262 memory: 25565 grad_norm: 2.7503 loss: 1.4620 caption_loss_cls: 2.4093 detection_loss_cls: 0.0391 detection_loss_reg: 0.3572 semantic_segmentation_loss_cls: 0.0107 grounding_loss_reg: 3.0354 instance_segmentation_loss_cls: 0.0395 instance_segmentation_loss_reg: 0.3656 instance_segmentation_loss_poly: 1.0010 2024/01/04 07:05:49 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 07:05:49 - mmengine - INFO - Iter(train) [126000/640000] base_lr: 1.8148e-04 lr: 1.8148e-05 eta: 8 days, 9:10:53 time: 1.3935 data_time: 0.0262 memory: 25565 grad_norm: 2.8454 loss: 1.4662 caption_loss_cls: 2.4110 detection_loss_cls: 0.0393 detection_loss_reg: 0.3584 semantic_segmentation_loss_cls: 0.0107 grounding_loss_reg: 3.0393 instance_segmentation_loss_cls: 0.0398 instance_segmentation_loss_reg: 0.3667 instance_segmentation_loss_poly: 1.0046 2024/01/04 07:05:49 - mmengine - INFO - Saving checkpoint at 126000 iterations 2024/01/04 07:17:59 - mmengine - INFO - Iter(train) [126500/640000] base_lr: 1.8133e-04 lr: 1.8133e-05 eta: 8 days, 9:14:36 time: 1.3918 data_time: 0.0257 memory: 25565 grad_norm: 2.8569 loss: 1.4751 caption_loss_cls: 2.4095 detection_loss_cls: 0.0393 detection_loss_reg: 0.3584 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 3.0410 instance_segmentation_loss_cls: 0.0400 instance_segmentation_loss_reg: 0.3671 instance_segmentation_loss_poly: 1.0062 2024/01/04 07:29:52 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 07:29:52 - mmengine - INFO - Iter(train) [127000/640000] base_lr: 1.8119e-04 lr: 1.8119e-05 eta: 8 days, 9:07:03 time: 1.3969 data_time: 0.0258 memory: 25565 grad_norm: 2.8666 loss: 1.4695 caption_loss_cls: 2.4105 detection_loss_cls: 0.0389 detection_loss_reg: 0.3557 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 3.0390 instance_segmentation_loss_cls: 0.0398 instance_segmentation_loss_reg: 0.3661 instance_segmentation_loss_poly: 1.0043 2024/01/04 07:41:15 - mmengine - INFO - Iter(train) [127500/640000] base_lr: 1.8105e-04 lr: 1.8105e-05 eta: 8 days, 8:42:23 time: 1.3998 data_time: 0.0258 memory: 25565 grad_norm: 2.8342 loss: 1.4627 caption_loss_cls: 2.4136 detection_loss_cls: 0.0390 detection_loss_reg: 0.3557 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 3.0401 instance_segmentation_loss_cls: 0.0397 instance_segmentation_loss_reg: 0.3652 instance_segmentation_loss_poly: 1.0026 2024/01/04 07:53:02 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 07:53:02 - mmengine - INFO - Iter(train) [128000/640000] base_lr: 1.8090e-04 lr: 1.8090e-05 eta: 8 days, 8:31:31 time: 1.3994 data_time: 0.0257 memory: 25565 grad_norm: 2.7977 loss: 1.4406 caption_loss_cls: 2.4146 detection_loss_cls: 0.0388 detection_loss_reg: 0.3554 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 3.0381 instance_segmentation_loss_cls: 0.0398 instance_segmentation_loss_reg: 0.3650 instance_segmentation_loss_poly: 1.0018 2024/01/04 07:53:02 - mmengine - INFO - Saving checkpoint at 128000 iterations 2024/01/04 08:05:18 - mmengine - INFO - Iter(train) [128500/640000] base_lr: 1.8076e-04 lr: 1.8076e-05 eta: 8 days, 8:35:58 time: 1.4104 data_time: 0.0259 memory: 25565 grad_norm: 2.7473 loss: 1.4271 caption_loss_cls: 2.4133 detection_loss_cls: 0.0390 detection_loss_reg: 0.3555 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 3.0382 instance_segmentation_loss_cls: 0.0396 instance_segmentation_loss_reg: 0.3634 instance_segmentation_loss_poly: 0.9983 2024/01/04 08:16:39 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 08:16:39 - mmengine - INFO - Iter(train) [129000/640000] base_lr: 1.8061e-04 lr: 1.8061e-05 eta: 8 days, 8:11:33 time: 1.4041 data_time: 0.0258 memory: 25565 grad_norm: 2.7393 loss: 1.4307 caption_loss_cls: 2.4113 detection_loss_cls: 0.0391 detection_loss_reg: 0.3553 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 3.0386 instance_segmentation_loss_cls: 0.0395 instance_segmentation_loss_reg: 0.3624 instance_segmentation_loss_poly: 0.9963 2024/01/04 08:28:01 - mmengine - INFO - Iter(train) [129500/640000] base_lr: 1.8047e-04 lr: 1.8047e-05 eta: 8 days, 7:48:20 time: 1.4037 data_time: 0.0258 memory: 25565 grad_norm: 2.7150 loss: 1.4344 caption_loss_cls: 2.4076 detection_loss_cls: 0.0389 detection_loss_reg: 0.3540 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 3.0321 instance_segmentation_loss_cls: 0.0395 instance_segmentation_loss_reg: 0.3637 instance_segmentation_loss_poly: 0.9988 2024/01/04 08:39:39 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 08:39:39 - mmengine - INFO - Iter(train) [130000/640000] base_lr: 1.8032e-04 lr: 1.8032e-05 eta: 8 days, 7:33:26 time: 1.4072 data_time: 0.0257 memory: 25565 grad_norm: 2.6208 loss: 1.4028 caption_loss_cls: 2.4028 detection_loss_cls: 0.0390 detection_loss_reg: 0.3542 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 3.0242 instance_segmentation_loss_cls: 0.0394 instance_segmentation_loss_reg: 0.3640 instance_segmentation_loss_poly: 0.9983 2024/01/04 08:39:39 - mmengine - INFO - Saving checkpoint at 130000 iterations 2024/01/04 08:51:41 - mmengine - INFO - Iter(train) [130500/640000] base_lr: 1.8017e-04 lr: 1.8017e-05 eta: 8 days, 7:30:04 time: 1.4052 data_time: 0.0254 memory: 25565 grad_norm: 2.6675 loss: 1.3983 caption_loss_cls: 2.3970 detection_loss_cls: 0.0390 detection_loss_reg: 0.3553 semantic_segmentation_loss_cls: 0.0105 grounding_loss_reg: 3.0223 instance_segmentation_loss_cls: 0.0393 instance_segmentation_loss_reg: 0.3620 instance_segmentation_loss_poly: 0.9938 2024/01/04 09:02:52 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 09:02:52 - mmengine - INFO - Iter(train) [131000/640000] base_lr: 1.8003e-04 lr: 1.8003e-05 eta: 8 days, 7:02:50 time: 1.3945 data_time: 0.0252 memory: 25565 grad_norm: 2.6977 loss: 1.3982 caption_loss_cls: 2.3876 detection_loss_cls: 0.0386 detection_loss_reg: 0.3537 semantic_segmentation_loss_cls: 0.0105 grounding_loss_reg: 3.0189 instance_segmentation_loss_cls: 0.0390 instance_segmentation_loss_reg: 0.3588 instance_segmentation_loss_poly: 0.9891 2024/01/04 09:14:56 - mmengine - INFO - Iter(train) [131500/640000] base_lr: 1.7988e-04 lr: 1.7988e-05 eta: 8 days, 7:00:06 time: 1.4051 data_time: 0.0254 memory: 25565 grad_norm: 2.6738 loss: 1.3897 caption_loss_cls: 2.3846 detection_loss_cls: 0.0387 detection_loss_reg: 0.3545 semantic_segmentation_loss_cls: 0.0105 grounding_loss_reg: 3.0130 instance_segmentation_loss_cls: 0.0393 instance_segmentation_loss_reg: 0.3602 instance_segmentation_loss_poly: 0.9919 2024/01/04 09:26:17 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 09:26:17 - mmengine - INFO - Iter(train) [132000/640000] base_lr: 1.7973e-04 lr: 1.7973e-05 eta: 8 days, 6:38:19 time: 1.3986 data_time: 0.0254 memory: 25565 grad_norm: 2.7385 loss: 1.4115 caption_loss_cls: 2.3835 detection_loss_cls: 0.0388 detection_loss_reg: 0.3554 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 3.0169 instance_segmentation_loss_cls: 0.0391 instance_segmentation_loss_reg: 0.3586 instance_segmentation_loss_poly: 0.9877 2024/01/04 09:26:17 - mmengine - INFO - Saving checkpoint at 132000 iterations 2024/01/04 09:38:28 - mmengine - INFO - Iter(train) [132500/640000] base_lr: 1.7958e-04 lr: 1.7958e-05 eta: 8 days, 6:37:53 time: 1.3973 data_time: 0.0254 memory: 25565 grad_norm: 2.7082 loss: 1.4197 caption_loss_cls: 2.3884 detection_loss_cls: 0.0389 detection_loss_reg: 0.3558 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 3.0130 instance_segmentation_loss_cls: 0.0394 instance_segmentation_loss_reg: 0.3609 instance_segmentation_loss_poly: 0.9931 2024/01/04 09:49:39 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 09:49:39 - mmengine - INFO - Iter(train) [133000/640000] base_lr: 1.7944e-04 lr: 1.7944e-05 eta: 8 days, 6:12:32 time: 1.3948 data_time: 0.0254 memory: 25565 grad_norm: 2.7607 loss: 1.4211 caption_loss_cls: 2.3831 detection_loss_cls: 0.0387 detection_loss_reg: 0.3544 semantic_segmentation_loss_cls: 0.0105 grounding_loss_reg: 3.0114 instance_segmentation_loss_cls: 0.0393 instance_segmentation_loss_reg: 0.3604 instance_segmentation_loss_poly: 0.9917 2024/01/04 10:01:20 - mmengine - INFO - Iter(train) [133500/640000] base_lr: 1.7929e-04 lr: 1.7929e-05 eta: 8 days, 5:59:40 time: 1.3996 data_time: 0.0254 memory: 25565 grad_norm: 2.7766 loss: 1.4110 caption_loss_cls: 2.3803 detection_loss_cls: 0.0388 detection_loss_reg: 0.3533 semantic_segmentation_loss_cls: 0.0105 grounding_loss_reg: 3.0078 instance_segmentation_loss_cls: 0.0393 instance_segmentation_loss_reg: 0.3599 instance_segmentation_loss_poly: 0.9918 2024/01/04 10:13:19 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 10:13:19 - mmengine - INFO - Iter(train) [134000/640000] base_lr: 1.7914e-04 lr: 1.7914e-05 eta: 8 days, 5:53:38 time: 1.4048 data_time: 0.0254 memory: 25565 grad_norm: 2.8210 loss: 1.4163 caption_loss_cls: 2.3792 detection_loss_cls: 0.0388 detection_loss_reg: 0.3541 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 3.0096 instance_segmentation_loss_cls: 0.0394 instance_segmentation_loss_reg: 0.3598 instance_segmentation_loss_poly: 0.9927 2024/01/04 10:13:19 - mmengine - INFO - Saving checkpoint at 134000 iterations 2024/01/04 10:25:27 - mmengine - INFO - Iter(train) [134500/640000] base_lr: 1.7899e-04 lr: 1.7899e-05 eta: 8 days, 5:50:56 time: 1.4062 data_time: 0.0255 memory: 25565 grad_norm: 2.8047 loss: 1.4134 caption_loss_cls: 2.3818 detection_loss_cls: 0.0389 detection_loss_reg: 0.3550 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 3.0102 instance_segmentation_loss_cls: 0.0395 instance_segmentation_loss_reg: 0.3606 instance_segmentation_loss_poly: 0.9936 2024/01/04 10:36:38 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 10:36:38 - mmengine - INFO - Iter(train) [135000/640000] base_lr: 1.7883e-04 lr: 1.7883e-05 eta: 8 days, 5:26:59 time: 1.4064 data_time: 0.0256 memory: 25565 grad_norm: 2.8071 loss: 1.4216 caption_loss_cls: 2.3806 detection_loss_cls: 0.0392 detection_loss_reg: 0.3578 semantic_segmentation_loss_cls: 0.0107 grounding_loss_reg: 3.0108 instance_segmentation_loss_cls: 0.0391 instance_segmentation_loss_reg: 0.3584 instance_segmentation_loss_poly: 0.9878 2024/01/04 10:47:56 - mmengine - INFO - Iter(train) [135500/640000] base_lr: 1.7868e-04 lr: 1.7868e-05 eta: 8 days, 5:05:47 time: 1.3946 data_time: 0.0254 memory: 25565 grad_norm: 2.8589 loss: 1.4257 caption_loss_cls: 2.3713 detection_loss_cls: 0.0394 detection_loss_reg: 0.3586 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 3.0077 instance_segmentation_loss_cls: 0.0392 instance_segmentation_loss_reg: 0.3589 instance_segmentation_loss_poly: 0.9897 2024/01/04 10:59:22 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 10:59:22 - mmengine - INFO - Iter(train) [136000/640000] base_lr: 1.7853e-04 lr: 1.7853e-05 eta: 8 days, 4:48:01 time: 1.3959 data_time: 0.0254 memory: 25565 grad_norm: 2.8450 loss: 1.4284 caption_loss_cls: 2.3788 detection_loss_cls: 0.0393 detection_loss_reg: 0.3576 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 3.0000 instance_segmentation_loss_cls: 0.0392 instance_segmentation_loss_reg: 0.3585 instance_segmentation_loss_poly: 0.9895 2024/01/04 10:59:22 - mmengine - INFO - Saving checkpoint at 136000 iterations 2024/01/04 11:11:25 - mmengine - INFO - Iter(train) [136500/640000] base_lr: 1.7838e-04 lr: 1.7838e-05 eta: 8 days, 4:43:04 time: 1.3938 data_time: 0.0254 memory: 25565 grad_norm: 2.9215 loss: 1.4151 caption_loss_cls: 2.3814 detection_loss_cls: 0.0394 detection_loss_reg: 0.3582 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 3.0003 instance_segmentation_loss_cls: 0.0392 instance_segmentation_loss_reg: 0.3573 instance_segmentation_loss_poly: 0.9890 2024/01/04 11:23:27 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 11:23:27 - mmengine - INFO - Iter(train) [137000/640000] base_lr: 1.7823e-04 lr: 1.7823e-05 eta: 8 days, 4:37:45 time: 1.4067 data_time: 0.0256 memory: 25565 grad_norm: 2.9774 loss: 1.3989 caption_loss_cls: 2.3740 detection_loss_cls: 0.0393 detection_loss_reg: 0.3582 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 2.9951 instance_segmentation_loss_cls: 0.0389 instance_segmentation_loss_reg: 0.3542 instance_segmentation_loss_poly: 0.9809 2024/01/04 11:34:37 - mmengine - INFO - Iter(train) [137500/640000] base_lr: 1.7807e-04 lr: 1.7807e-05 eta: 8 days, 4:15:00 time: 1.3990 data_time: 0.0255 memory: 25565 grad_norm: 3.0452 loss: 1.4073 caption_loss_cls: 2.3766 detection_loss_cls: 0.0390 detection_loss_reg: 0.3555 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 2.9901 instance_segmentation_loss_cls: 0.0391 instance_segmentation_loss_reg: 0.3558 instance_segmentation_loss_poly: 0.9844 2024/01/04 11:45:31 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 11:45:31 - mmengine - INFO - Iter(train) [138000/640000] base_lr: 1.7792e-04 lr: 1.7792e-05 eta: 8 days, 3:47:18 time: 1.3827 data_time: 0.0253 memory: 25565 grad_norm: 3.0357 loss: 1.4278 caption_loss_cls: 2.3866 detection_loss_cls: 0.0391 detection_loss_reg: 0.3560 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 2.9893 instance_segmentation_loss_cls: 0.0388 instance_segmentation_loss_reg: 0.3549 instance_segmentation_loss_poly: 0.9817 2024/01/04 11:45:31 - mmengine - INFO - Saving checkpoint at 138000 iterations 2024/01/04 11:57:15 - mmengine - INFO - Iter(train) [138500/640000] base_lr: 1.7777e-04 lr: 1.7777e-05 eta: 8 days, 3:36:17 time: 1.3767 data_time: 0.0253 memory: 25565 grad_norm: 3.0749 loss: 1.4382 caption_loss_cls: 2.3870 detection_loss_cls: 0.0392 detection_loss_reg: 0.3549 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 2.9863 instance_segmentation_loss_cls: 0.0387 instance_segmentation_loss_reg: 0.3531 instance_segmentation_loss_poly: 0.9761 2024/01/04 12:09:02 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 12:09:02 - mmengine - INFO - Iter(train) [139000/640000] base_lr: 1.7761e-04 lr: 1.7761e-05 eta: 8 days, 3:26:00 time: 1.3856 data_time: 0.0256 memory: 25565 grad_norm: 3.0646 loss: 1.4398 caption_loss_cls: 2.3877 detection_loss_cls: 0.0392 detection_loss_reg: 0.3555 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 2.9792 instance_segmentation_loss_cls: 0.0390 instance_segmentation_loss_reg: 0.3568 instance_segmentation_loss_poly: 0.9849 2024/01/04 12:20:10 - mmengine - INFO - Iter(train) [139500/640000] base_lr: 1.7746e-04 lr: 1.7746e-05 eta: 8 days, 3:03:52 time: 1.3832 data_time: 0.0256 memory: 25565 grad_norm: 3.1196 loss: 1.4625 caption_loss_cls: 2.3984 detection_loss_cls: 0.0393 detection_loss_reg: 0.3572 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 2.9761 instance_segmentation_loss_cls: 0.0389 instance_segmentation_loss_reg: 0.3564 instance_segmentation_loss_poly: 0.9856 2024/01/04 12:31:54 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 12:31:54 - mmengine - INFO - Iter(train) [140000/640000] base_lr: 1.7730e-04 lr: 1.7730e-05 eta: 8 days, 2:52:51 time: 1.3877 data_time: 0.0257 memory: 25565 grad_norm: 3.1141 loss: 1.4445 caption_loss_cls: 2.3963 detection_loss_cls: 0.0394 detection_loss_reg: 0.3569 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 2.9743 instance_segmentation_loss_cls: 0.0388 instance_segmentation_loss_reg: 0.3569 instance_segmentation_loss_poly: 0.9860 2024/01/04 12:31:54 - mmengine - INFO - Saving checkpoint at 140000 iterations 2024/01/04 12:43:51 - mmengine - INFO - Evaluating bbox... 2024/01/04 12:44:50 - mmengine - INFO - bbox_mAP_copypaste: 0.462 0.648 0.509 0.307 0.521 0.601 2024/01/04 12:44:50 - mmengine - INFO - Evaluating segm... 2024/01/04 12:46:00 - mmengine - INFO - segm_mAP_copypaste: 0.295 0.549 0.283 0.155 0.353 0.470 2024/01/04 12:48:10 - mmengine - INFO - Evaluating bbox... 2024/01/04 12:49:08 - mmengine - INFO - bbox_mAP_copypaste: 0.462 0.649 0.510 0.307 0.521 0.600 2024/01/04 12:54:51 - mmengine - INFO - per class results: 2024/01/04 12:54:51 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 78.12 | 86.21 | | building | 81.82 | 93.09 | | sky | 93.46 | 97.06 | | floor | 81.92 | 89.3 | | tree | 74.09 | 85.79 | | ceiling | 83.78 | 94.04 | | road | 83.92 | 89.5 | | bed | 88.44 | 96.69 | | windowpane | 62.84 | 80.01 | | grass | 65.5 | 84.91 | | cabinet | 60.11 | 68.99 | | sidewalk | 65.17 | 84.71 | | person | 80.84 | 91.73 | | earth | 35.6 | 45.66 | | door | 53.76 | 75.45 | | table | 60.23 | 73.1 | | mountain | 58.73 | 77.34 | | plant | 50.79 | 64.48 | | curtain | 75.89 | 85.57 | | chair | 57.99 | 71.47 | | car | 83.48 | 91.87 | | water | 50.47 | 57.04 | | painting | 70.2 | 86.58 | | sofa | 67.61 | 83.65 | | shelf | 43.75 | 64.67 | | house | 31.05 | 34.48 | | sea | 62.94 | 84.76 | | mirror | 66.69 | 74.8 | | rug | 63.07 | 68.15 | | field | 30.73 | 45.96 | | armchair | 48.03 | 70.76 | | seat | 65.54 | 78.45 | | fence | 44.91 | 62.29 | | desk | 39.53 | 79.08 | | rock | 34.19 | 54.79 | | wardrobe | 49.18 | 71.85 | | lamp | 59.78 | 77.36 | | bathtub | 74.14 | 92.37 | | railing | 34.58 | 67.54 | | cushion | 55.41 | 68.5 | | base | 26.06 | 34.07 | | box | 28.98 | 43.35 | | column | 49.87 | 63.65 | | signboard | 35.39 | 49.94 | | chest of drawers | 40.84 | 44.22 | | counter | 22.02 | 33.7 | | sand | 40.94 | 53.81 | | sink | 72.49 | 81.41 | | skyscraper | 32.85 | 39.51 | | fireplace | 70.39 | 92.72 | | refrigerator | 74.01 | 80.92 | | grandstand | 34.33 | 77.15 | | path | 19.92 | 28.68 | | stairs | 12.1 | 12.63 | | runway | 71.0 | 92.44 | | case | 44.5 | 50.57 | | pool table | 87.86 | 96.22 | | pillow | 54.89 | 65.69 | | screen door | 78.14 | 84.49 | | stairway | 30.8 | 69.1 | | river | 21.32 | 72.83 | | bridge | 62.01 | 86.2 | | bookcase | 32.02 | 55.97 | | blind | 39.26 | 41.89 | | coffee table | 60.48 | 84.31 | | toilet | 85.92 | 93.49 | | flower | 32.82 | 56.26 | | book | 43.67 | 61.54 | | hill | 10.98 | 15.48 | | bench | 40.45 | 73.32 | | countertop | 55.58 | 82.24 | | stove | 72.35 | 77.26 | | palm | 47.91 | 66.36 | | kitchen island | 34.11 | 91.51 | | computer | 74.25 | 84.4 | | swivel chair | 38.5 | 47.38 | | boat | 70.99 | 74.93 | | bar | 33.48 | 43.59 | | arcade machine | 70.62 | 79.37 | | hovel | 19.13 | 20.41 | | bus | 91.87 | 95.46 | | towel | 62.22 | 72.87 | | light | 49.32 | 70.28 | | truck | 38.06 | 53.73 | | tower | 27.0 | 43.44 | | chandelier | 57.71 | 63.37 | | awning | 26.28 | 38.25 | | streetlight | 30.23 | 44.18 | | booth | 48.57 | 53.89 | | television receiver | 71.03 | 86.8 | | airplane | 59.58 | 63.16 | | dirt track | 8.0 | 15.97 | | apparel | 30.36 | 47.16 | | pole | 18.41 | 24.17 | | land | 1.16 | 2.43 | | bannister | 16.56 | 39.23 | | escalator | 9.91 | 9.93 | | ottoman | 44.25 | 74.86 | | bottle | 23.68 | 30.08 | | buffet | 55.69 | 69.29 | | poster | 29.82 | 67.62 | | stage | 10.97 | 33.97 | | van | 33.8 | 42.87 | | ship | 32.32 | 33.14 | | fountain | 7.93 | 7.99 | | conveyer belt | 70.15 | 89.51 | | canopy | 43.7 | 50.69 | | washer | 68.59 | 72.57 | | plaything | 32.55 | 52.66 | | swimming pool | 63.13 | 77.19 | | stool | 41.43 | 53.44 | | barrel | 12.25 | 87.69 | | basket | 27.17 | 41.01 | | waterfall | 56.1 | 59.0 | | tent | 73.51 | 98.41 | | bag | 25.43 | 34.38 | | minibike | 72.64 | 86.48 | | cradle | 78.14 | 96.73 | | oven | 45.73 | 61.95 | | ball | 36.9 | 42.04 | | food | 58.25 | 67.79 | | step | 4.41 | 4.88 | | tank | 53.93 | 61.84 | | trade name | 10.72 | 11.52 | | microwave | 82.22 | 90.98 | | pot | 39.48 | 45.19 | | animal | 63.99 | 69.22 | | bicycle | 54.98 | 66.02 | | lake | 47.76 | 48.62 | | dishwasher | 48.17 | 86.44 | | screen | 61.91 | 78.68 | | blanket | 15.5 | 17.54 | | sculpture | 62.02 | 78.56 | | hood | 58.07 | 75.01 | | sconce | 41.31 | 56.3 | | vase | 40.9 | 59.98 | | traffic light | 33.62 | 44.91 | | tray | 8.3 | 11.01 | | ashcan | 29.15 | 62.15 | | fan | 57.93 | 73.82 | | pier | 30.3 | 45.75 | | crt screen | 5.91 | 10.08 | | plate | 53.24 | 73.76 | | monitor | 29.02 | 34.49 | | bulletin board | 43.64 | 50.81 | | shower | 4.43 | 5.19 | | radiator | 50.94 | 74.12 | | glass | 18.6 | 21.43 | | clock | 26.11 | 30.63 | | flag | 33.58 | 39.26 | +---------------------+-------+-------+ 2024/01/04 12:55:03 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.4620 coco/bbox_mAP_50: 0.6490 coco/bbox_mAP_75: 0.5100 coco/bbox_mAP_s: 0.3070 coco/bbox_mAP_m: 0.5210 coco/bbox_mAP_l: 0.6000 coco/segm_mAP: 0.2950 coco/segm_mAP_50: 0.5490 coco/segm_mAP_75: 0.2830 coco/segm_mAP_s: 0.1550 coco/segm_mAP_m: 0.3530 coco/segm_mAP_l: 0.4700 Bleu_1: 0.7408 Bleu_2: 0.5754 Bleu_3: 0.4328 Bleu_4: 0.3216 METEOR: 0.2571 ROUGE_L: 0.5428 CIDEr: 1.0087 SPICE: 0.1911 aAcc: 82.6500 mIoU: 47.6100 mAcc: 61.5800 visual-grounding/miou: 0.7547 visual-grounding/acc: 0.8284 data_time: 0.0120 time: 1.9039 2024/01/04 13:07:03 - mmengine - INFO - Iter(train) [140500/640000] base_lr: 1.7715e-04 lr: 1.7715e-05 eta: 8 days, 2:47:20 time: 1.3877 data_time: 0.0190 memory: 34656 grad_norm: 3.0858 loss: 1.4363 caption_loss_cls: 2.3957 detection_loss_cls: 0.0393 detection_loss_reg: 0.3558 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 2.9769 instance_segmentation_loss_cls: 0.0384 instance_segmentation_loss_reg: 0.3540 instance_segmentation_loss_poly: 0.9791 2024/01/04 13:18:35 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 13:18:35 - mmengine - INFO - Iter(train) [141000/640000] base_lr: 1.7699e-04 lr: 1.7699e-05 eta: 8 days, 2:32:30 time: 1.3799 data_time: 0.0189 memory: 25564 grad_norm: 3.0246 loss: 1.4530 caption_loss_cls: 2.3920 detection_loss_cls: 0.0395 detection_loss_reg: 0.3575 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 2.9782 instance_segmentation_loss_cls: 0.0385 instance_segmentation_loss_reg: 0.3535 instance_segmentation_loss_poly: 0.9769 2024/01/04 13:30:14 - mmengine - INFO - Iter(train) [141500/640000] base_lr: 1.7683e-04 lr: 1.7683e-05 eta: 8 days, 2:20:00 time: 1.3871 data_time: 0.0190 memory: 25564 grad_norm: 3.0418 loss: 1.4481 caption_loss_cls: 2.3867 detection_loss_cls: 0.0395 detection_loss_reg: 0.3575 semantic_segmentation_loss_cls: 0.0106 grounding_loss_reg: 2.9779 instance_segmentation_loss_cls: 0.0384 instance_segmentation_loss_reg: 0.3527 instance_segmentation_loss_poly: 0.9748 2024/01/04 13:42:37 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 13:42:37 - mmengine - INFO - Iter(train) [142000/640000] base_lr: 1.7667e-04 lr: 1.7667e-05 eta: 8 days, 2:19:52 time: 1.4096 data_time: 0.0193 memory: 25564 grad_norm: 2.9912 loss: 1.4305 caption_loss_cls: 2.3829 detection_loss_cls: 0.0394 detection_loss_reg: 0.3562 semantic_segmentation_loss_cls: 0.0105 grounding_loss_reg: 2.9804 instance_segmentation_loss_cls: 0.0384 instance_segmentation_loss_reg: 0.3533 instance_segmentation_loss_poly: 0.9763 2024/01/04 13:42:37 - mmengine - INFO - Saving checkpoint at 142000 iterations 2024/01/04 13:54:21 - mmengine - INFO - Iter(train) [142500/640000] base_lr: 1.7652e-04 lr: 1.7652e-05 eta: 8 days, 2:08:30 time: 1.4095 data_time: 0.0194 memory: 25564 grad_norm: 3.0188 loss: 1.4350 caption_loss_cls: 2.3855 detection_loss_cls: 0.0393 detection_loss_reg: 0.3552 semantic_segmentation_loss_cls: 0.0105 grounding_loss_reg: 2.9775 instance_segmentation_loss_cls: 0.0383 instance_segmentation_loss_reg: 0.3527 instance_segmentation_loss_poly: 0.9748 2024/01/04 14:06:09 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 14:06:09 - mmengine - INFO - Iter(train) [143000/640000] base_lr: 1.7636e-04 lr: 1.7636e-05 eta: 8 days, 1:58:17 time: 1.4098 data_time: 0.0192 memory: 25564 grad_norm: 3.0890 loss: 1.4316 caption_loss_cls: 2.3852 detection_loss_cls: 0.0390 detection_loss_reg: 0.3537 semantic_segmentation_loss_cls: 0.0105 grounding_loss_reg: 2.9757 instance_segmentation_loss_cls: 0.0384 instance_segmentation_loss_reg: 0.3543 instance_segmentation_loss_poly: 0.9770 2024/01/04 14:17:10 - mmengine - INFO - Iter(train) [143500/640000] base_lr: 1.7620e-04 lr: 1.7620e-05 eta: 8 days, 1:35:39 time: 1.4081 data_time: 0.0192 memory: 25564 grad_norm: 3.0600 loss: 1.4241 caption_loss_cls: 2.3884 detection_loss_cls: 0.0389 detection_loss_reg: 0.3532 semantic_segmentation_loss_cls: 0.0105 grounding_loss_reg: 2.9767 instance_segmentation_loss_cls: 0.0384 instance_segmentation_loss_reg: 0.3537 instance_segmentation_loss_poly: 0.9747 2024/01/04 14:29:23 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 14:29:23 - mmengine - INFO - Iter(train) [144000/640000] base_lr: 1.7604e-04 lr: 1.7604e-05 eta: 8 days, 1:32:00 time: 1.4154 data_time: 0.0193 memory: 25564 grad_norm: 3.0626 loss: 1.4173 caption_loss_cls: 2.3903 detection_loss_cls: 0.0386 detection_loss_reg: 0.3511 semantic_segmentation_loss_cls: 0.0104 grounding_loss_reg: 2.9699 instance_segmentation_loss_cls: 0.0383 instance_segmentation_loss_reg: 0.3538 instance_segmentation_loss_poly: 0.9751 2024/01/04 14:29:23 - mmengine - INFO - Saving checkpoint at 144000 iterations 2024/01/04 14:41:17 - mmengine - INFO - Iter(train) [144500/640000] base_lr: 1.7588e-04 lr: 1.7588e-05 eta: 8 days, 1:23:10 time: 1.4132 data_time: 0.0259 memory: 25564 grad_norm: 3.0596 loss: 1.4249 caption_loss_cls: 2.3903 detection_loss_cls: 0.0383 detection_loss_reg: 0.3488 semantic_segmentation_loss_cls: 0.0104 grounding_loss_reg: 2.9651 instance_segmentation_loss_cls: 0.0381 instance_segmentation_loss_reg: 0.3532 instance_segmentation_loss_poly: 0.9737 2024/01/04 14:52:45 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 14:52:45 - mmengine - INFO - Iter(train) [145000/640000] base_lr: 1.7572e-04 lr: 1.7572e-05 eta: 8 days, 1:07:49 time: 1.4124 data_time: 0.0260 memory: 25564 grad_norm: 3.0506 loss: 1.4429 caption_loss_cls: 2.3925 detection_loss_cls: 0.0382 detection_loss_reg: 0.3486 semantic_segmentation_loss_cls: 0.0104 grounding_loss_reg: 2.9652 instance_segmentation_loss_cls: 0.0379 instance_segmentation_loss_reg: 0.3510 instance_segmentation_loss_poly: 0.9687 2024/01/04 15:04:32 - mmengine - INFO - Iter(train) [145500/640000] base_lr: 1.7556e-04 lr: 1.7556e-05 eta: 8 days, 0:57:14 time: 1.4144 data_time: 0.0259 memory: 25564 grad_norm: 2.9461 loss: 1.4393 caption_loss_cls: 2.3953 detection_loss_cls: 0.0381 detection_loss_reg: 0.3494 semantic_segmentation_loss_cls: 0.0103 grounding_loss_reg: 2.9614 instance_segmentation_loss_cls: 0.0375 instance_segmentation_loss_reg: 0.3489 instance_segmentation_loss_poly: 0.9638 2024/01/04 15:15:56 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 15:15:56 - mmengine - INFO - Iter(train) [146000/640000] base_lr: 1.7540e-04 lr: 1.7540e-05 eta: 8 days, 0:41:00 time: 1.3994 data_time: 0.0256 memory: 25564 grad_norm: 2.9824 loss: 1.4273 caption_loss_cls: 2.3871 detection_loss_cls: 0.0380 detection_loss_reg: 0.3499 semantic_segmentation_loss_cls: 0.0103 grounding_loss_reg: 2.9568 instance_segmentation_loss_cls: 0.0373 instance_segmentation_loss_reg: 0.3467 instance_segmentation_loss_poly: 0.9602 2024/01/04 15:15:56 - mmengine - INFO - Saving checkpoint at 146000 iterations 2024/01/04 15:27:44 - mmengine - INFO - Iter(train) [146500/640000] base_lr: 1.7524e-04 lr: 1.7524e-05 eta: 8 days, 0:30:38 time: 1.4004 data_time: 0.0256 memory: 25564 grad_norm: 2.9362 loss: 1.4375 caption_loss_cls: 2.3984 detection_loss_cls: 0.0381 detection_loss_reg: 0.3507 semantic_segmentation_loss_cls: 0.0104 grounding_loss_reg: 2.9558 instance_segmentation_loss_cls: 0.0372 instance_segmentation_loss_reg: 0.3466 instance_segmentation_loss_poly: 0.9592 2024/01/04 15:39:07 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 15:39:07 - mmengine - INFO - Iter(train) [147000/640000] base_lr: 1.7508e-04 lr: 1.7508e-05 eta: 8 days, 0:14:25 time: 1.3942 data_time: 0.0255 memory: 25564 grad_norm: 2.8715 loss: 1.4291 caption_loss_cls: 2.4008 detection_loss_cls: 0.0381 detection_loss_reg: 0.3507 semantic_segmentation_loss_cls: 0.0103 grounding_loss_reg: 2.9508 instance_segmentation_loss_cls: 0.0370 instance_segmentation_loss_reg: 0.3463 instance_segmentation_loss_poly: 0.9585 2024/01/04 15:50:42 - mmengine - INFO - Iter(train) [147500/640000] base_lr: 1.7491e-04 lr: 1.7491e-05 eta: 8 days, 0:01:00 time: 1.4025 data_time: 0.0255 memory: 25564 grad_norm: 2.8314 loss: 1.4133 caption_loss_cls: 2.4061 detection_loss_cls: 0.0381 detection_loss_reg: 0.3515 semantic_segmentation_loss_cls: 0.0103 grounding_loss_reg: 2.9525 instance_segmentation_loss_cls: 0.0369 instance_segmentation_loss_reg: 0.3459 instance_segmentation_loss_poly: 0.9569 2024/01/04 16:02:21 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 16:02:21 - mmengine - INFO - Iter(train) [148000/640000] base_lr: 1.7475e-04 lr: 1.7475e-05 eta: 7 days, 23:48:46 time: 1.3942 data_time: 0.0255 memory: 25564 grad_norm: 2.8112 loss: 1.4325 caption_loss_cls: 2.4042 detection_loss_cls: 0.0380 detection_loss_reg: 0.3510 semantic_segmentation_loss_cls: 0.0102 grounding_loss_reg: 2.9523 instance_segmentation_loss_cls: 0.0371 instance_segmentation_loss_reg: 0.3472 instance_segmentation_loss_poly: 0.9595 2024/01/04 16:02:21 - mmengine - INFO - Saving checkpoint at 148000 iterations 2024/01/04 16:14:35 - mmengine - INFO - Iter(train) [148500/640000] base_lr: 1.7459e-04 lr: 1.7459e-05 eta: 7 days, 23:44:09 time: 1.3991 data_time: 0.0256 memory: 25564 grad_norm: 2.8393 loss: 1.4450 caption_loss_cls: 2.4058 detection_loss_cls: 0.0380 detection_loss_reg: 0.3516 semantic_segmentation_loss_cls: 0.0102 grounding_loss_reg: 2.9535 instance_segmentation_loss_cls: 0.0372 instance_segmentation_loss_reg: 0.3484 instance_segmentation_loss_poly: 0.9617 2024/01/04 16:26:34 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 16:26:34 - mmengine - INFO - Iter(train) [149000/640000] base_lr: 1.7442e-04 lr: 1.7442e-05 eta: 7 days, 23:36:06 time: 1.4068 data_time: 0.0257 memory: 25564 grad_norm: 2.7960 loss: 1.4249 caption_loss_cls: 2.4072 detection_loss_cls: 0.0380 detection_loss_reg: 0.3522 semantic_segmentation_loss_cls: 0.0102 grounding_loss_reg: 2.9491 instance_segmentation_loss_cls: 0.0373 instance_segmentation_loss_reg: 0.3493 instance_segmentation_loss_poly: 0.9625 2024/01/04 16:38:06 - mmengine - INFO - Iter(train) [149500/640000] base_lr: 1.7426e-04 lr: 1.7426e-05 eta: 7 days, 23:22:09 time: 1.4031 data_time: 0.0257 memory: 25564 grad_norm: 2.8078 loss: 1.4230 caption_loss_cls: 2.4017 detection_loss_cls: 0.0381 detection_loss_reg: 0.3528 semantic_segmentation_loss_cls: 0.0102 grounding_loss_reg: 2.9492 instance_segmentation_loss_cls: 0.0371 instance_segmentation_loss_reg: 0.3484 instance_segmentation_loss_poly: 0.9600 2024/01/04 16:49:22 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 16:49:22 - mmengine - INFO - Iter(train) [150000/640000] base_lr: 1.7410e-04 lr: 1.7410e-05 eta: 7 days, 23:04:50 time: 1.4012 data_time: 0.0256 memory: 25564 grad_norm: 2.7718 loss: 1.4309 caption_loss_cls: 2.4060 detection_loss_cls: 0.0379 detection_loss_reg: 0.3527 semantic_segmentation_loss_cls: 0.0102 grounding_loss_reg: 2.9564 instance_segmentation_loss_cls: 0.0369 instance_segmentation_loss_reg: 0.3459 instance_segmentation_loss_poly: 0.9544 2024/01/04 16:49:22 - mmengine - INFO - Saving checkpoint at 150000 iterations 2024/01/04 17:01:43 - mmengine - INFO - Iter(train) [150500/640000] base_lr: 1.7393e-04 lr: 1.7393e-05 eta: 7 days, 23:01:18 time: 1.4094 data_time: 0.0257 memory: 25564 grad_norm: 2.7471 loss: 1.4137 caption_loss_cls: 2.4076 detection_loss_cls: 0.0379 detection_loss_reg: 0.3521 semantic_segmentation_loss_cls: 0.0102 grounding_loss_reg: 2.9602 instance_segmentation_loss_cls: 0.0368 instance_segmentation_loss_reg: 0.3459 instance_segmentation_loss_poly: 0.9542 2024/01/04 17:12:52 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 17:12:52 - mmengine - INFO - Iter(train) [151000/640000] base_lr: 1.7376e-04 lr: 1.7376e-05 eta: 7 days, 22:42:38 time: 1.4060 data_time: 0.0257 memory: 25564 grad_norm: 2.7636 loss: 1.4245 caption_loss_cls: 2.4057 detection_loss_cls: 0.0382 detection_loss_reg: 0.3541 semantic_segmentation_loss_cls: 0.0101 grounding_loss_reg: 2.9575 instance_segmentation_loss_cls: 0.0370 instance_segmentation_loss_reg: 0.3471 instance_segmentation_loss_poly: 0.9544 2024/01/04 17:24:42 - mmengine - INFO - Iter(train) [151500/640000] base_lr: 1.7360e-04 lr: 1.7360e-05 eta: 7 days, 22:32:29 time: 1.4097 data_time: 0.0258 memory: 25564 grad_norm: 2.7406 loss: 1.4087 caption_loss_cls: 2.4031 detection_loss_cls: 0.0382 detection_loss_reg: 0.3551 semantic_segmentation_loss_cls: 0.0101 grounding_loss_reg: 2.9550 instance_segmentation_loss_cls: 0.0374 instance_segmentation_loss_reg: 0.3490 instance_segmentation_loss_poly: 0.9590 2024/01/04 17:36:20 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 17:36:20 - mmengine - INFO - Iter(train) [152000/640000] base_lr: 1.7343e-04 lr: 1.7343e-05 eta: 7 days, 22:20:01 time: 1.4094 data_time: 0.0256 memory: 25564 grad_norm: 2.7799 loss: 1.4020 caption_loss_cls: 2.4047 detection_loss_cls: 0.0379 detection_loss_reg: 0.3524 semantic_segmentation_loss_cls: 0.0101 grounding_loss_reg: 2.9553 instance_segmentation_loss_cls: 0.0371 instance_segmentation_loss_reg: 0.3472 instance_segmentation_loss_poly: 0.9541 2024/01/04 17:36:20 - mmengine - INFO - Saving checkpoint at 152000 iterations 2024/01/04 17:48:14 - mmengine - INFO - Iter(train) [152500/640000] base_lr: 1.7327e-04 lr: 1.7327e-05 eta: 7 days, 22:10:40 time: 1.4046 data_time: 0.0254 memory: 25564 grad_norm: 2.7614 loss: 1.3893 caption_loss_cls: 2.3998 detection_loss_cls: 0.0381 detection_loss_reg: 0.3533 semantic_segmentation_loss_cls: 0.0101 grounding_loss_reg: 2.9537 instance_segmentation_loss_cls: 0.0369 instance_segmentation_loss_reg: 0.3459 instance_segmentation_loss_poly: 0.9512 2024/01/04 17:59:33 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 17:59:33 - mmengine - INFO - Iter(train) [153000/640000] base_lr: 1.7310e-04 lr: 1.7310e-05 eta: 7 days, 21:54:22 time: 1.3946 data_time: 0.0253 memory: 25564 grad_norm: 2.9605 loss: 1.3977 caption_loss_cls: 2.4089 detection_loss_cls: 0.0380 detection_loss_reg: 0.3533 semantic_segmentation_loss_cls: 0.0101 grounding_loss_reg: 2.9524 instance_segmentation_loss_cls: 0.0368 instance_segmentation_loss_reg: 0.3447 instance_segmentation_loss_poly: 0.9472 2024/01/04 18:11:02 - mmengine - INFO - Iter(train) [153500/640000] base_lr: 1.7293e-04 lr: 1.7293e-05 eta: 7 days, 21:40:07 time: 1.3938 data_time: 0.0254 memory: 25564 grad_norm: 2.9998 loss: 1.4096 caption_loss_cls: 2.4086 detection_loss_cls: 0.0379 detection_loss_reg: 0.3533 semantic_segmentation_loss_cls: 0.0100 grounding_loss_reg: 2.9522 instance_segmentation_loss_cls: 0.0371 instance_segmentation_loss_reg: 0.3479 instance_segmentation_loss_poly: 0.9532 2024/01/04 18:22:21 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 18:22:21 - mmengine - INFO - Iter(train) [154000/640000] base_lr: 1.7276e-04 lr: 1.7276e-05 eta: 7 days, 21:24:00 time: 1.3945 data_time: 0.0256 memory: 25564 grad_norm: 3.1098 loss: 1.4246 caption_loss_cls: 2.4161 detection_loss_cls: 0.0379 detection_loss_reg: 0.3534 semantic_segmentation_loss_cls: 0.0100 grounding_loss_reg: 2.9461 instance_segmentation_loss_cls: 0.0372 instance_segmentation_loss_reg: 0.3485 instance_segmentation_loss_poly: 0.9546 2024/01/04 18:22:21 - mmengine - INFO - Saving checkpoint at 154000 iterations 2024/01/04 18:34:29 - mmengine - INFO - Iter(train) [154500/640000] base_lr: 1.7259e-04 lr: 1.7259e-05 eta: 7 days, 21:17:21 time: 1.3913 data_time: 0.0256 memory: 25564 grad_norm: 3.1378 loss: 1.4239 caption_loss_cls: 2.4126 detection_loss_cls: 0.0380 detection_loss_reg: 0.3536 semantic_segmentation_loss_cls: 0.0099 grounding_loss_reg: 2.9474 instance_segmentation_loss_cls: 0.0371 instance_segmentation_loss_reg: 0.3480 instance_segmentation_loss_poly: 0.9529 2024/01/04 18:46:20 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 18:46:20 - mmengine - INFO - Iter(train) [155000/640000] base_lr: 1.7243e-04 lr: 1.7243e-05 eta: 7 days, 21:07:17 time: 1.4017 data_time: 0.0257 memory: 25564 grad_norm: 3.1281 loss: 1.4207 caption_loss_cls: 2.4153 detection_loss_cls: 0.0379 detection_loss_reg: 0.3534 semantic_segmentation_loss_cls: 0.0099 grounding_loss_reg: 2.9443 instance_segmentation_loss_cls: 0.0372 instance_segmentation_loss_reg: 0.3481 instance_segmentation_loss_poly: 0.9537 2024/01/04 18:58:05 - mmengine - INFO - Iter(train) [155500/640000] base_lr: 1.7226e-04 lr: 1.7226e-05 eta: 7 days, 20:56:12 time: 1.4006 data_time: 0.0256 memory: 25564 grad_norm: 3.1367 loss: 1.4275 caption_loss_cls: 2.4088 detection_loss_cls: 0.0375 detection_loss_reg: 0.3515 semantic_segmentation_loss_cls: 0.0099 grounding_loss_reg: 2.9486 instance_segmentation_loss_cls: 0.0373 instance_segmentation_loss_reg: 0.3480 instance_segmentation_loss_poly: 0.9529 2024/01/04 19:09:52 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 19:09:52 - mmengine - INFO - Iter(train) [156000/640000] base_lr: 1.7209e-04 lr: 1.7209e-05 eta: 7 days, 20:45:21 time: 1.4027 data_time: 0.0257 memory: 25564 grad_norm: 3.0863 loss: 1.4185 caption_loss_cls: 2.4003 detection_loss_cls: 0.0379 detection_loss_reg: 0.3544 semantic_segmentation_loss_cls: 0.0099 grounding_loss_reg: 2.9496 instance_segmentation_loss_cls: 0.0371 instance_segmentation_loss_reg: 0.3473 instance_segmentation_loss_poly: 0.9516 2024/01/04 19:09:52 - mmengine - INFO - Saving checkpoint at 156000 iterations 2024/01/04 19:22:10 - mmengine - INFO - Iter(train) [156500/640000] base_lr: 1.7192e-04 lr: 1.7192e-05 eta: 7 days, 20:40:10 time: 1.4087 data_time: 0.0257 memory: 25564 grad_norm: 3.0294 loss: 1.4052 caption_loss_cls: 2.3991 detection_loss_cls: 0.0380 detection_loss_reg: 0.3563 semantic_segmentation_loss_cls: 0.0099 grounding_loss_reg: 2.9451 instance_segmentation_loss_cls: 0.0371 instance_segmentation_loss_reg: 0.3474 instance_segmentation_loss_poly: 0.9514 2024/01/04 19:33:34 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 19:33:34 - mmengine - INFO - Iter(train) [157000/640000] base_lr: 1.7174e-04 lr: 1.7174e-05 eta: 7 days, 20:25:04 time: 1.4098 data_time: 0.0259 memory: 25564 grad_norm: 2.8707 loss: 1.4207 caption_loss_cls: 2.4044 detection_loss_cls: 0.0381 detection_loss_reg: 0.3560 semantic_segmentation_loss_cls: 0.0099 grounding_loss_reg: 2.9483 instance_segmentation_loss_cls: 0.0373 instance_segmentation_loss_reg: 0.3503 instance_segmentation_loss_poly: 0.9576 2024/01/04 19:45:27 - mmengine - INFO - Iter(train) [157500/640000] base_lr: 1.7157e-04 lr: 1.7157e-05 eta: 7 days, 20:15:18 time: 1.4159 data_time: 0.0258 memory: 25564 grad_norm: 2.8060 loss: 1.4082 caption_loss_cls: 2.4018 detection_loss_cls: 0.0381 detection_loss_reg: 0.3564 semantic_segmentation_loss_cls: 0.0099 grounding_loss_reg: 2.9433 instance_segmentation_loss_cls: 0.0371 instance_segmentation_loss_reg: 0.3508 instance_segmentation_loss_poly: 0.9589 2024/01/04 19:56:51 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 19:56:51 - mmengine - INFO - Iter(train) [158000/640000] base_lr: 1.7140e-04 lr: 1.7140e-05 eta: 7 days, 20:00:17 time: 1.4170 data_time: 0.0258 memory: 25564 grad_norm: 2.7148 loss: 1.4016 caption_loss_cls: 2.4020 detection_loss_cls: 0.0385 detection_loss_reg: 0.3596 semantic_segmentation_loss_cls: 0.0099 grounding_loss_reg: 2.9474 instance_segmentation_loss_cls: 0.0373 instance_segmentation_loss_reg: 0.3510 instance_segmentation_loss_poly: 0.9600 2024/01/04 19:56:51 - mmengine - INFO - Saving checkpoint at 158000 iterations 2024/01/04 20:09:05 - mmengine - INFO - Iter(train) [158500/640000] base_lr: 1.7123e-04 lr: 1.7123e-05 eta: 7 days, 19:54:10 time: 1.4186 data_time: 0.0258 memory: 25564 grad_norm: 2.6652 loss: 1.3927 caption_loss_cls: 2.4040 detection_loss_cls: 0.0382 detection_loss_reg: 0.3570 semantic_segmentation_loss_cls: 0.0098 grounding_loss_reg: 2.9445 instance_segmentation_loss_cls: 0.0373 instance_segmentation_loss_reg: 0.3505 instance_segmentation_loss_poly: 0.9600 2024/01/04 20:20:47 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 20:20:47 - mmengine - INFO - Iter(train) [159000/640000] base_lr: 1.7106e-04 lr: 1.7106e-05 eta: 7 days, 19:42:16 time: 1.4163 data_time: 0.0257 memory: 25564 grad_norm: 2.6380 loss: 1.3912 caption_loss_cls: 2.4096 detection_loss_cls: 0.0381 detection_loss_reg: 0.3570 semantic_segmentation_loss_cls: 0.0098 grounding_loss_reg: 2.9410 instance_segmentation_loss_cls: 0.0373 instance_segmentation_loss_reg: 0.3489 instance_segmentation_loss_poly: 0.9575 2024/01/04 20:32:23 - mmengine - INFO - Iter(train) [159500/640000] base_lr: 1.7088e-04 lr: 1.7088e-05 eta: 7 days, 19:29:32 time: 1.4141 data_time: 0.0258 memory: 25564 grad_norm: 2.6751 loss: 1.4044 caption_loss_cls: 2.4043 detection_loss_cls: 0.0382 detection_loss_reg: 0.3580 semantic_segmentation_loss_cls: 0.0098 grounding_loss_reg: 2.9384 instance_segmentation_loss_cls: 0.0372 instance_segmentation_loss_reg: 0.3493 instance_segmentation_loss_poly: 0.9563 2024/01/04 20:44:29 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 20:44:29 - mmengine - INFO - Iter(train) [160000/640000] base_lr: 1.7071e-04 lr: 1.7071e-05 eta: 7 days, 19:21:44 time: 1.4189 data_time: 0.0260 memory: 25564 grad_norm: 2.6843 loss: 1.4204 caption_loss_cls: 2.4038 detection_loss_cls: 0.0384 detection_loss_reg: 0.3588 semantic_segmentation_loss_cls: 0.0098 grounding_loss_reg: 2.9417 instance_segmentation_loss_cls: 0.0374 instance_segmentation_loss_reg: 0.3492 instance_segmentation_loss_poly: 0.9572 2024/01/04 20:44:29 - mmengine - INFO - Saving checkpoint at 160000 iterations 2024/01/04 20:56:03 - mmengine - INFO - Evaluating bbox... 2024/01/04 20:56:59 - mmengine - INFO - bbox_mAP_copypaste: 0.478 0.656 0.527 0.317 0.533 0.615 2024/01/04 20:56:59 - mmengine - INFO - Evaluating segm... 2024/01/04 20:58:12 - mmengine - INFO - segm_mAP_copypaste: 0.308 0.558 0.302 0.164 0.354 0.465 2024/01/04 21:00:21 - mmengine - INFO - Evaluating bbox... 2024/01/04 21:01:19 - mmengine - INFO - bbox_mAP_copypaste: 0.478 0.656 0.527 0.316 0.533 0.615 2024/01/04 21:06:59 - mmengine - INFO - per class results: 2024/01/04 21:06:59 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 76.93 | 86.45 | | building | 81.54 | 90.74 | | sky | 92.69 | 98.06 | | floor | 81.1 | 86.11 | | tree | 71.98 | 82.86 | | ceiling | 82.6 | 94.43 | | road | 84.23 | 91.3 | | bed | 88.59 | 93.67 | | windowpane | 62.5 | 75.81 | | grass | 65.87 | 80.2 | | cabinet | 58.28 | 73.37 | | sidewalk | 66.14 | 76.36 | | person | 80.91 | 89.85 | | earth | 30.58 | 39.78 | | door | 52.62 | 75.21 | | table | 60.82 | 78.29 | | mountain | 54.18 | 89.34 | | plant | 50.57 | 59.91 | | curtain | 75.42 | 89.16 | | chair | 59.94 | 76.6 | | car | 83.07 | 90.89 | | water | 57.24 | 69.59 | | painting | 69.99 | 90.11 | | sofa | 68.07 | 88.26 | | shelf | 41.09 | 51.72 | | house | 42.99 | 52.24 | | sea | 60.58 | 83.51 | | mirror | 63.25 | 67.49 | | rug | 67.56 | 86.82 | | field | 29.05 | 53.82 | | armchair | 42.07 | 48.34 | | seat | 64.8 | 81.86 | | fence | 40.48 | 63.37 | | desk | 43.37 | 65.84 | | rock | 46.97 | 70.06 | | wardrobe | 47.23 | 69.18 | | lamp | 60.47 | 69.19 | | bathtub | 80.74 | 85.31 | | railing | 34.58 | 46.64 | | cushion | 55.04 | 66.93 | | base | 30.94 | 45.42 | | box | 27.9 | 44.31 | | column | 51.42 | 67.4 | | signboard | 34.85 | 44.22 | | chest of drawers | 44.49 | 67.28 | | counter | 14.33 | 18.19 | | sand | 40.99 | 67.96 | | sink | 72.43 | 85.02 | | skyscraper | 47.02 | 58.63 | | fireplace | 70.13 | 92.67 | | refrigerator | 75.49 | 83.55 | | grandstand | 51.98 | 77.56 | | path | 20.38 | 32.87 | | stairs | 34.28 | 40.57 | | runway | 67.17 | 90.99 | | case | 49.25 | 68.39 | | pool table | 89.48 | 95.81 | | pillow | 56.57 | 85.86 | | screen door | 54.3 | 59.24 | | stairway | 41.29 | 69.24 | | river | 17.05 | 22.45 | | bridge | 31.96 | 53.95 | | bookcase | 34.15 | 44.02 | | blind | 50.19 | 61.23 | | coffee table | 59.64 | 78.8 | | toilet | 85.48 | 91.9 | | flower | 36.46 | 55.73 | | book | 44.49 | 77.75 | | hill | 8.54 | 9.53 | | bench | 45.91 | 59.07 | | countertop | 60.27 | 74.14 | | stove | 76.62 | 81.05 | | palm | 42.88 | 64.94 | | kitchen island | 38.51 | 70.47 | | computer | 69.22 | 90.98 | | swivel chair | 37.2 | 46.52 | | boat | 76.42 | 85.28 | | bar | 39.2 | 53.67 | | arcade machine | 65.8 | 80.3 | | hovel | 36.34 | 46.1 | | bus | 90.75 | 92.83 | | towel | 62.4 | 73.47 | | light | 44.94 | 51.92 | | truck | 42.51 | 64.69 | | tower | 36.45 | 63.88 | | chandelier | 63.3 | 79.53 | | awning | 31.88 | 44.15 | | streetlight | 27.93 | 34.57 | | booth | 39.45 | 57.82 | | television receiver | 58.98 | 61.59 | | airplane | 75.09 | 91.28 | | dirt track | 0.73 | 1.57 | | apparel | 31.3 | 40.09 | | pole | 26.4 | 38.54 | | land | 6.83 | 7.71 | | bannister | 17.27 | 25.75 | | escalator | 23.76 | 26.65 | | ottoman | 49.82 | 66.52 | | bottle | 22.2 | 29.69 | | buffet | 45.97 | 65.35 | | poster | 26.96 | 65.63 | | stage | 14.03 | 31.4 | | van | 31.96 | 36.88 | | ship | 10.55 | 11.08 | | fountain | 21.83 | 22.13 | | conveyer belt | 59.71 | 90.18 | | canopy | 22.89 | 88.16 | | washer | 48.91 | 73.45 | | plaything | 26.01 | 35.35 | | swimming pool | 75.84 | 82.75 | | stool | 36.89 | 44.06 | | barrel | 10.94 | 64.63 | | basket | 31.99 | 39.68 | | waterfall | 73.71 | 86.77 | | tent | 78.15 | 96.21 | | bag | 23.18 | 58.49 | | minibike | 70.6 | 80.77 | | cradle | 77.85 | 96.21 | | oven | 44.82 | 57.67 | | ball | 35.5 | 47.67 | | food | 44.95 | 47.38 | | step | 10.64 | 11.66 | | tank | 42.4 | 44.42 | | trade name | 30.03 | 38.3 | | microwave | 61.63 | 65.13 | | pot | 44.12 | 51.52 | | animal | 66.96 | 82.19 | | bicycle | 53.26 | 64.29 | | lake | 37.97 | 53.29 | | dishwasher | 68.02 | 78.87 | | screen | 59.12 | 74.56 | | blanket | 27.48 | 34.81 | | sculpture | 42.07 | 85.36 | | hood | 54.87 | 64.69 | | sconce | 37.48 | 51.65 | | vase | 36.32 | 59.74 | | traffic light | 28.45 | 34.29 | | tray | 8.63 | 14.95 | | ashcan | 28.44 | 49.08 | | fan | 55.05 | 75.4 | | pier | 34.3 | 43.71 | | crt screen | 2.13 | 7.15 | | plate | 53.28 | 71.19 | | monitor | 8.99 | 10.55 | | bulletin board | 36.55 | 62.07 | | shower | 0.29 | 0.35 | | radiator | 64.17 | 75.57 | | glass | 17.97 | 19.69 | | clock | 33.84 | 42.96 | | flag | 32.54 | 38.71 | +---------------------+-------+-------+ 2024/01/04 21:07:12 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.4780 coco/bbox_mAP_50: 0.6560 coco/bbox_mAP_75: 0.5270 coco/bbox_mAP_s: 0.3160 coco/bbox_mAP_m: 0.5330 coco/bbox_mAP_l: 0.6150 coco/segm_mAP: 0.3080 coco/segm_mAP_50: 0.5580 coco/segm_mAP_75: 0.3020 coco/segm_mAP_s: 0.1640 coco/segm_mAP_m: 0.3540 coco/segm_mAP_l: 0.4650 Bleu_1: 0.7219 Bleu_2: 0.5509 Bleu_3: 0.4080 Bleu_4: 0.2993 METEOR: 0.2573 ROUGE_L: 0.5333 CIDEr: 0.9671 SPICE: 0.1877 aAcc: 82.3400 mIoU: 47.6600 mAcc: 61.6800 visual-grounding/miou: 0.7666 visual-grounding/acc: 0.8385 data_time: 0.0135 time: 1.9012 2024/01/04 21:18:50 - mmengine - INFO - Iter(train) [160500/640000] base_lr: 1.7054e-04 lr: 1.7054e-05 eta: 7 days, 19:09:43 time: 1.4096 data_time: 0.0196 memory: 34656 grad_norm: 2.7168 loss: 1.4327 caption_loss_cls: 2.4037 detection_loss_cls: 0.0385 detection_loss_reg: 0.3603 semantic_segmentation_loss_cls: 0.0098 grounding_loss_reg: 2.9371 instance_segmentation_loss_cls: 0.0375 instance_segmentation_loss_reg: 0.3492 instance_segmentation_loss_poly: 0.9573 2024/01/04 21:30:49 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 21:30:49 - mmengine - INFO - Iter(train) [161000/640000] base_lr: 1.7036e-04 lr: 1.7036e-05 eta: 7 days, 19:00:42 time: 1.4186 data_time: 0.0196 memory: 25564 grad_norm: 2.6722 loss: 1.4039 caption_loss_cls: 2.4062 detection_loss_cls: 0.0383 detection_loss_reg: 0.3589 semantic_segmentation_loss_cls: 0.0098 grounding_loss_reg: 2.9334 instance_segmentation_loss_cls: 0.0374 instance_segmentation_loss_reg: 0.3483 instance_segmentation_loss_poly: 0.9567 2024/01/04 21:42:41 - mmengine - INFO - Iter(train) [161500/640000] base_lr: 1.7019e-04 lr: 1.7019e-05 eta: 7 days, 18:50:18 time: 1.4180 data_time: 0.0196 memory: 25564 grad_norm: 2.6656 loss: 1.3967 caption_loss_cls: 2.4087 detection_loss_cls: 0.0382 detection_loss_reg: 0.3576 semantic_segmentation_loss_cls: 0.0097 grounding_loss_reg: 2.9331 instance_segmentation_loss_cls: 0.0375 instance_segmentation_loss_reg: 0.3496 instance_segmentation_loss_poly: 0.9590 2024/01/04 21:54:39 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 21:54:39 - mmengine - INFO - Iter(train) [162000/640000] base_lr: 1.7001e-04 lr: 1.7001e-05 eta: 7 days, 18:41:00 time: 1.4267 data_time: 0.0197 memory: 25564 grad_norm: 2.6373 loss: 1.3808 caption_loss_cls: 2.4071 detection_loss_cls: 0.0381 detection_loss_reg: 0.3569 semantic_segmentation_loss_cls: 0.0097 grounding_loss_reg: 2.9311 instance_segmentation_loss_cls: 0.0373 instance_segmentation_loss_reg: 0.3492 instance_segmentation_loss_poly: 0.9573 2024/01/04 21:54:39 - mmengine - INFO - Saving checkpoint at 162000 iterations 2024/01/04 22:06:50 - mmengine - INFO - Iter(train) [162500/640000] base_lr: 1.6984e-04 lr: 1.6984e-05 eta: 7 days, 18:33:45 time: 1.4259 data_time: 0.0199 memory: 25564 grad_norm: 2.6506 loss: 1.3825 caption_loss_cls: 2.4008 detection_loss_cls: 0.0382 detection_loss_reg: 0.3564 semantic_segmentation_loss_cls: 0.0098 grounding_loss_reg: 2.9247 instance_segmentation_loss_cls: 0.0373 instance_segmentation_loss_reg: 0.3497 instance_segmentation_loss_poly: 0.9591 2024/01/04 22:17:59 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 22:17:59 - mmengine - INFO - Iter(train) [163000/640000] base_lr: 1.6966e-04 lr: 1.6966e-05 eta: 7 days, 18:16:42 time: 1.4178 data_time: 0.0199 memory: 25564 grad_norm: 2.6897 loss: 1.3834 caption_loss_cls: 2.3958 detection_loss_cls: 0.0379 detection_loss_reg: 0.3542 semantic_segmentation_loss_cls: 0.0097 grounding_loss_reg: 2.9201 instance_segmentation_loss_cls: 0.0373 instance_segmentation_loss_reg: 0.3504 instance_segmentation_loss_poly: 0.9604 2024/01/04 22:29:35 - mmengine - INFO - Iter(train) [163500/640000] base_lr: 1.6949e-04 lr: 1.6949e-05 eta: 7 days, 18:03:48 time: 1.4176 data_time: 0.0198 memory: 25564 grad_norm: 2.6609 loss: 1.3733 caption_loss_cls: 2.3930 detection_loss_cls: 0.0378 detection_loss_reg: 0.3536 semantic_segmentation_loss_cls: 0.0097 grounding_loss_reg: 2.9190 instance_segmentation_loss_cls: 0.0372 instance_segmentation_loss_reg: 0.3492 instance_segmentation_loss_poly: 0.9577 2024/01/04 22:41:15 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 22:41:15 - mmengine - INFO - Iter(train) [164000/640000] base_lr: 1.6931e-04 lr: 1.6931e-05 eta: 7 days, 17:51:40 time: 1.4112 data_time: 0.0197 memory: 25564 grad_norm: 2.6547 loss: 1.3664 caption_loss_cls: 2.3946 detection_loss_cls: 0.0380 detection_loss_reg: 0.3549 semantic_segmentation_loss_cls: 0.0097 grounding_loss_reg: 2.9127 instance_segmentation_loss_cls: 0.0371 instance_segmentation_loss_reg: 0.3487 instance_segmentation_loss_poly: 0.9570 2024/01/04 22:41:15 - mmengine - INFO - Saving checkpoint at 164000 iterations 2024/01/04 22:53:02 - mmengine - INFO - Iter(train) [164500/640000] base_lr: 1.6913e-04 lr: 1.6913e-05 eta: 7 days, 17:40:33 time: 1.4126 data_time: 0.0259 memory: 25564 grad_norm: 2.6320 loss: 1.3590 caption_loss_cls: 2.3922 detection_loss_cls: 0.0381 detection_loss_reg: 0.3563 semantic_segmentation_loss_cls: 0.0097 grounding_loss_reg: 2.9162 instance_segmentation_loss_cls: 0.0371 instance_segmentation_loss_reg: 0.3478 instance_segmentation_loss_poly: 0.9548 2024/01/04 23:04:56 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 23:04:56 - mmengine - INFO - Iter(train) [165000/640000] base_lr: 1.6895e-04 lr: 1.6895e-05 eta: 7 days, 17:30:28 time: 1.4113 data_time: 0.0259 memory: 25564 grad_norm: 2.6594 loss: 1.3684 caption_loss_cls: 2.3898 detection_loss_cls: 0.0380 detection_loss_reg: 0.3562 semantic_segmentation_loss_cls: 0.0097 grounding_loss_reg: 2.9134 instance_segmentation_loss_cls: 0.0371 instance_segmentation_loss_reg: 0.3473 instance_segmentation_loss_poly: 0.9535 2024/01/04 23:16:34 - mmengine - INFO - Iter(train) [165500/640000] base_lr: 1.6878e-04 lr: 1.6878e-05 eta: 7 days, 17:17:56 time: 1.4079 data_time: 0.0259 memory: 25564 grad_norm: 2.6832 loss: 1.3803 caption_loss_cls: 2.3903 detection_loss_cls: 0.0382 detection_loss_reg: 0.3576 semantic_segmentation_loss_cls: 0.0096 grounding_loss_reg: 2.9100 instance_segmentation_loss_cls: 0.0371 instance_segmentation_loss_reg: 0.3480 instance_segmentation_loss_poly: 0.9560 2024/01/04 23:28:19 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 23:28:19 - mmengine - INFO - Iter(train) [166000/640000] base_lr: 1.6860e-04 lr: 1.6860e-05 eta: 7 days, 17:06:31 time: 1.4047 data_time: 0.0258 memory: 25564 grad_norm: 2.6709 loss: 1.3743 caption_loss_cls: 2.3890 detection_loss_cls: 0.0383 detection_loss_reg: 0.3579 semantic_segmentation_loss_cls: 0.0096 grounding_loss_reg: 2.9081 instance_segmentation_loss_cls: 0.0372 instance_segmentation_loss_reg: 0.3491 instance_segmentation_loss_poly: 0.9573 2024/01/04 23:28:19 - mmengine - INFO - Saving checkpoint at 166000 iterations 2024/01/04 23:40:34 - mmengine - INFO - Iter(train) [166500/640000] base_lr: 1.6842e-04 lr: 1.6842e-05 eta: 7 days, 16:59:22 time: 1.4055 data_time: 0.0258 memory: 25564 grad_norm: 2.6444 loss: 1.3789 caption_loss_cls: 2.3836 detection_loss_cls: 0.0384 detection_loss_reg: 0.3584 semantic_segmentation_loss_cls: 0.0096 grounding_loss_reg: 2.9101 instance_segmentation_loss_cls: 0.0373 instance_segmentation_loss_reg: 0.3500 instance_segmentation_loss_poly: 0.9586 2024/01/04 23:52:02 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/04 23:52:02 - mmengine - INFO - Iter(train) [167000/640000] base_lr: 1.6824e-04 lr: 1.6824e-05 eta: 7 days, 16:45:31 time: 1.4103 data_time: 0.0258 memory: 25564 grad_norm: 2.6194 loss: 1.3810 caption_loss_cls: 2.3842 detection_loss_cls: 0.0384 detection_loss_reg: 0.3584 semantic_segmentation_loss_cls: 0.0096 grounding_loss_reg: 2.9146 instance_segmentation_loss_cls: 0.0372 instance_segmentation_loss_reg: 0.3488 instance_segmentation_loss_poly: 0.9565 2024/01/05 00:03:39 - mmengine - INFO - Iter(train) [167500/640000] base_lr: 1.6806e-04 lr: 1.6806e-05 eta: 7 days, 16:32:57 time: 1.4108 data_time: 0.0257 memory: 25564 grad_norm: 2.6004 loss: 1.3808 caption_loss_cls: 2.3787 detection_loss_cls: 0.0382 detection_loss_reg: 0.3566 semantic_segmentation_loss_cls: 0.0096 grounding_loss_reg: 2.9078 instance_segmentation_loss_cls: 0.0370 instance_segmentation_loss_reg: 0.3481 instance_segmentation_loss_poly: 0.9560 2024/01/05 00:15:37 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/05 00:15:37 - mmengine - INFO - Iter(train) [168000/640000] base_lr: 1.6788e-04 lr: 1.6788e-05 eta: 7 days, 16:23:13 time: 1.4150 data_time: 0.0257 memory: 25564 grad_norm: 2.6324 loss: 1.3781 caption_loss_cls: 2.3793 detection_loss_cls: 0.0383 detection_loss_reg: 0.3567 semantic_segmentation_loss_cls: 0.0096 grounding_loss_reg: 2.9011 instance_segmentation_loss_cls: 0.0367 instance_segmentation_loss_reg: 0.3468 instance_segmentation_loss_poly: 0.9533 2024/01/05 00:15:37 - mmengine - INFO - Saving checkpoint at 168000 iterations 2024/01/05 00:27:29 - mmengine - INFO - Iter(train) [168500/640000] base_lr: 1.6770e-04 lr: 1.6770e-05 eta: 7 days, 16:12:44 time: 1.4163 data_time: 0.0258 memory: 25564 grad_norm: 2.6835 loss: 1.4016 caption_loss_cls: 2.3814 detection_loss_cls: 0.0382 detection_loss_reg: 0.3561 semantic_segmentation_loss_cls: 0.0096 grounding_loss_reg: 2.8945 instance_segmentation_loss_cls: 0.0366 instance_segmentation_loss_reg: 0.3453 instance_segmentation_loss_poly: 0.9522 2024/01/05 00:39:08 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/05 00:39:08 - mmengine - INFO - Iter(train) [169000/640000] base_lr: 1.6752e-04 lr: 1.6752e-05 eta: 7 days, 16:00:21 time: 1.4125 data_time: 0.0256 memory: 25564 grad_norm: 2.6625 loss: 1.3824 caption_loss_cls: 2.3793 detection_loss_cls: 0.0380 detection_loss_reg: 0.3539 semantic_segmentation_loss_cls: 0.0096 grounding_loss_reg: 2.8926 instance_segmentation_loss_cls: 0.0364 instance_segmentation_loss_reg: 0.3441 instance_segmentation_loss_poly: 0.9503 2024/01/05 00:50:43 - mmengine - INFO - Iter(train) [169500/640000] base_lr: 1.6734e-04 lr: 1.6734e-05 eta: 7 days, 15:47:33 time: 1.4120 data_time: 0.0256 memory: 25564 grad_norm: 2.6325 loss: 1.3743 caption_loss_cls: 2.3741 detection_loss_cls: 0.0380 detection_loss_reg: 0.3532 semantic_segmentation_loss_cls: 0.0096 grounding_loss_reg: 2.8913 instance_segmentation_loss_cls: 0.0364 instance_segmentation_loss_reg: 0.3448 instance_segmentation_loss_poly: 0.9517 2024/01/05 01:02:23 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240104_011054 2024/01/05 01:02:23 - mmengine - INFO - Iter(train) [170000/640000] base_lr: 1.6716e-04 lr: 1.6716e-05 eta: 7 days, 15:35:23 time: 1.4107 data_time: 0.0256 memory: 25564 grad_norm: 2.6309 loss: 1.3820 caption_loss_cls: 2.3712 detection_loss_cls: 0.0381 detection_loss_reg: 0.3531 semantic_segmentation_loss_cls: 0.0096 grounding_loss_reg: 2.8860 instance_segmentation_loss_cls: 0.0366 instance_segmentation_loss_reg: 0.3463 instance_segmentation_loss_poly: 0.9560 2024/01/05 01:02:23 - mmengine - INFO - Saving checkpoint at 170000 iterations 2024/01/05 02:15:09 - mmengine - INFO - Iter(train) [170500/640000] base_lr: 1.6697e-04 lr: 1.6697e-05 eta: 8 days, 3:20:31 time: 1.4143 data_time: 0.0181 memory: 25568 grad_norm: 2.6313 loss: 1.3708 caption_loss_cls: 2.3656 detection_loss_cls: 0.0381 detection_loss_reg: 0.3540 semantic_segmentation_loss_cls: 0.0096 grounding_loss_reg: 2.8831 instance_segmentation_loss_cls: 0.0365 instance_segmentation_loss_reg: 0.3458 instance_segmentation_loss_poly: 0.9564 2024/01/05 02:27:28 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 02:27:28 - mmengine - INFO - Iter(train) [171000/640000] base_lr: 1.6679e-04 lr: 1.6679e-05 eta: 8 days, 1:46:00 time: 1.4268 data_time: 0.0179 memory: 25568 grad_norm: 2.6441 loss: 1.3680 caption_loss_cls: 2.3637 detection_loss_cls: 0.0380 detection_loss_reg: 0.3524 semantic_segmentation_loss_cls: 0.0096 grounding_loss_reg: 2.8786 instance_segmentation_loss_cls: 0.0366 instance_segmentation_loss_reg: 0.3472 instance_segmentation_loss_poly: 0.9599 2024/01/05 02:39:57 - mmengine - INFO - Iter(train) [171500/640000] base_lr: 1.6661e-04 lr: 1.6661e-05 eta: 8 days, 2:00:26 time: 1.4397 data_time: 0.0178 memory: 25568 grad_norm: 2.7037 loss: 1.3745 caption_loss_cls: 2.3657 detection_loss_cls: 0.0378 detection_loss_reg: 0.3508 semantic_segmentation_loss_cls: 0.0096 grounding_loss_reg: 2.8777 instance_segmentation_loss_cls: 0.0366 instance_segmentation_loss_reg: 0.3484 instance_segmentation_loss_poly: 0.9612 2024/01/05 02:52:46 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 02:52:46 - mmengine - INFO - Iter(train) [172000/640000] base_lr: 1.6643e-04 lr: 1.6643e-05 eta: 8 days, 3:20:17 time: 1.4527 data_time: 0.0176 memory: 25568 grad_norm: 2.7001 loss: 1.3826 caption_loss_cls: 2.3626 detection_loss_cls: 0.0377 detection_loss_reg: 0.3496 semantic_segmentation_loss_cls: 0.0096 grounding_loss_reg: 2.8713 instance_segmentation_loss_cls: 0.0366 instance_segmentation_loss_reg: 0.3492 instance_segmentation_loss_poly: 0.9619 2024/01/05 02:52:46 - mmengine - INFO - Saving checkpoint at 172000 iterations 2024/01/05 03:05:53 - mmengine - INFO - Iter(train) [172500/640000] base_lr: 1.6624e-04 lr: 1.6624e-05 eta: 8 days, 4:58:33 time: 1.4714 data_time: 0.0167 memory: 25568 grad_norm: 2.6947 loss: 1.3728 caption_loss_cls: 2.3597 detection_loss_cls: 0.0376 detection_loss_reg: 0.3502 semantic_segmentation_loss_cls: 0.0096 grounding_loss_reg: 2.8653 instance_segmentation_loss_cls: 0.0369 instance_segmentation_loss_reg: 0.3512 instance_segmentation_loss_poly: 0.9655 2024/01/05 03:18:44 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 03:18:44 - mmengine - INFO - Iter(train) [173000/640000] base_lr: 1.6606e-04 lr: 1.6606e-05 eta: 8 days, 5:18:32 time: 1.4895 data_time: 0.0167 memory: 25568 grad_norm: 2.7057 loss: 1.3883 caption_loss_cls: 2.3629 detection_loss_cls: 0.0377 detection_loss_reg: 0.3508 semantic_segmentation_loss_cls: 0.0096 grounding_loss_reg: 2.8632 instance_segmentation_loss_cls: 0.0368 instance_segmentation_loss_reg: 0.3523 instance_segmentation_loss_poly: 0.9675 2024/01/05 03:31:32 - mmengine - INFO - Iter(train) [173500/640000] base_lr: 1.6587e-04 lr: 1.6587e-05 eta: 8 days, 5:20:47 time: 1.5075 data_time: 0.0165 memory: 25568 grad_norm: 2.7542 loss: 1.3912 caption_loss_cls: 2.3534 detection_loss_cls: 0.0378 detection_loss_reg: 0.3519 semantic_segmentation_loss_cls: 0.0096 grounding_loss_reg: 2.8566 instance_segmentation_loss_cls: 0.0366 instance_segmentation_loss_reg: 0.3514 instance_segmentation_loss_poly: 0.9658 2024/01/05 03:44:31 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 03:44:31 - mmengine - INFO - Iter(train) [174000/640000] base_lr: 1.6569e-04 lr: 1.6569e-05 eta: 8 days, 5:43:03 time: 1.5274 data_time: 0.0163 memory: 25568 grad_norm: 2.7386 loss: 1.3846 caption_loss_cls: 2.3494 detection_loss_cls: 0.0380 detection_loss_reg: 0.3539 semantic_segmentation_loss_cls: 0.0096 grounding_loss_reg: 2.8536 instance_segmentation_loss_cls: 0.0366 instance_segmentation_loss_reg: 0.3523 instance_segmentation_loss_poly: 0.9680 2024/01/05 03:44:31 - mmengine - INFO - Saving checkpoint at 174000 iterations 2024/01/05 03:57:28 - mmengine - INFO - Iter(train) [174500/640000] base_lr: 1.6550e-04 lr: 1.6550e-05 eta: 8 days, 5:51:54 time: 1.5343 data_time: 0.0224 memory: 25568 grad_norm: 2.7699 loss: 1.3924 caption_loss_cls: 2.3412 detection_loss_cls: 0.0379 detection_loss_reg: 0.3536 semantic_segmentation_loss_cls: 0.0096 grounding_loss_reg: 2.8491 instance_segmentation_loss_cls: 0.0364 instance_segmentation_loss_reg: 0.3513 instance_segmentation_loss_poly: 0.9657 2024/01/05 04:10:32 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 04:10:32 - mmengine - INFO - Iter(train) [175000/640000] base_lr: 1.6532e-04 lr: 1.6532e-05 eta: 8 days, 6:09:22 time: 1.5458 data_time: 0.0225 memory: 25568 grad_norm: 2.7272 loss: 1.3778 caption_loss_cls: 2.3429 detection_loss_cls: 0.0379 detection_loss_reg: 0.3540 semantic_segmentation_loss_cls: 0.0095 grounding_loss_reg: 2.8471 instance_segmentation_loss_cls: 0.0363 instance_segmentation_loss_reg: 0.3518 instance_segmentation_loss_poly: 0.9667 2024/01/05 04:24:13 - mmengine - INFO - Iter(train) [175500/640000] base_lr: 1.6513e-04 lr: 1.6513e-05 eta: 8 days, 7:11:07 time: 1.5636 data_time: 0.0228 memory: 25568 grad_norm: 2.6724 loss: 1.3739 caption_loss_cls: 2.3481 detection_loss_cls: 0.0378 detection_loss_reg: 0.3533 semantic_segmentation_loss_cls: 0.0095 grounding_loss_reg: 2.8431 instance_segmentation_loss_cls: 0.0361 instance_segmentation_loss_reg: 0.3505 instance_segmentation_loss_poly: 0.9636 2024/01/05 04:36:52 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 04:36:52 - mmengine - INFO - Iter(train) [176000/640000] base_lr: 1.6495e-04 lr: 1.6495e-05 eta: 8 days, 6:41:31 time: 1.5611 data_time: 0.0227 memory: 25568 grad_norm: 2.6828 loss: 1.3719 caption_loss_cls: 2.3573 detection_loss_cls: 0.0378 detection_loss_reg: 0.3522 semantic_segmentation_loss_cls: 0.0095 grounding_loss_reg: 2.8399 instance_segmentation_loss_cls: 0.0360 instance_segmentation_loss_reg: 0.3504 instance_segmentation_loss_poly: 0.9628 2024/01/05 04:36:52 - mmengine - INFO - Saving checkpoint at 176000 iterations 2024/01/05 04:50:04 - mmengine - INFO - Iter(train) [176500/640000] base_lr: 1.6476e-04 lr: 1.6476e-05 eta: 8 days, 6:53:49 time: 1.5623 data_time: 0.0226 memory: 25568 grad_norm: 2.6872 loss: 1.3668 caption_loss_cls: 2.3571 detection_loss_cls: 0.0378 detection_loss_reg: 0.3521 semantic_segmentation_loss_cls: 0.0095 grounding_loss_reg: 2.8403 instance_segmentation_loss_cls: 0.0360 instance_segmentation_loss_reg: 0.3505 instance_segmentation_loss_poly: 0.9631 2024/01/05 05:03:11 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 05:03:11 - mmengine - INFO - Iter(train) [177000/640000] base_lr: 1.6457e-04 lr: 1.6457e-05 eta: 8 days, 6:56:41 time: 1.5663 data_time: 0.0225 memory: 25568 grad_norm: 2.7082 loss: 1.3645 caption_loss_cls: 2.3579 detection_loss_cls: 0.0380 detection_loss_reg: 0.3538 semantic_segmentation_loss_cls: 0.0095 grounding_loss_reg: 2.8382 instance_segmentation_loss_cls: 0.0358 instance_segmentation_loss_reg: 0.3474 instance_segmentation_loss_poly: 0.9564 2024/01/05 05:15:42 - mmengine - INFO - Iter(train) [177500/640000] base_lr: 1.6438e-04 lr: 1.6438e-05 eta: 8 days, 6:20:47 time: 1.5622 data_time: 0.0225 memory: 25568 grad_norm: 2.6940 loss: 1.3718 caption_loss_cls: 2.3583 detection_loss_cls: 0.0377 detection_loss_reg: 0.3518 semantic_segmentation_loss_cls: 0.0095 grounding_loss_reg: 2.8365 instance_segmentation_loss_cls: 0.0359 instance_segmentation_loss_reg: 0.3478 instance_segmentation_loss_poly: 0.9564 2024/01/05 05:28:48 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 05:28:48 - mmengine - INFO - Iter(train) [178000/640000] base_lr: 1.6420e-04 lr: 1.6420e-05 eta: 8 days, 6:21:06 time: 1.5638 data_time: 0.0224 memory: 25568 grad_norm: 2.7006 loss: 1.3724 caption_loss_cls: 2.3504 detection_loss_cls: 0.0376 detection_loss_reg: 0.3511 semantic_segmentation_loss_cls: 0.0095 grounding_loss_reg: 2.8328 instance_segmentation_loss_cls: 0.0359 instance_segmentation_loss_reg: 0.3476 instance_segmentation_loss_poly: 0.9567 2024/01/05 05:28:48 - mmengine - INFO - Saving checkpoint at 178000 iterations 2024/01/05 05:42:05 - mmengine - INFO - Iter(train) [178500/640000] base_lr: 1.6401e-04 lr: 1.6401e-05 eta: 8 days, 6:30:35 time: 1.5691 data_time: 0.0226 memory: 25568 grad_norm: 2.7109 loss: 1.3771 caption_loss_cls: 2.3411 detection_loss_cls: 0.0377 detection_loss_reg: 0.3521 semantic_segmentation_loss_cls: 0.0095 grounding_loss_reg: 2.8344 instance_segmentation_loss_cls: 0.0357 instance_segmentation_loss_reg: 0.3465 instance_segmentation_loss_poly: 0.9529 2024/01/05 05:55:00 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 05:55:00 - mmengine - INFO - Iter(train) [179000/640000] base_lr: 1.6382e-04 lr: 1.6382e-05 eta: 8 days, 6:18:00 time: 1.5666 data_time: 0.0227 memory: 25568 grad_norm: 2.7148 loss: 1.3864 caption_loss_cls: 2.3242 detection_loss_cls: 0.0375 detection_loss_reg: 0.3516 semantic_segmentation_loss_cls: 0.0094 grounding_loss_reg: 2.8351 instance_segmentation_loss_cls: 0.0357 instance_segmentation_loss_reg: 0.3476 instance_segmentation_loss_poly: 0.9558 2024/01/05 06:07:54 - mmengine - INFO - Iter(train) [179500/640000] base_lr: 1.6363e-04 lr: 1.6363e-05 eta: 8 days, 6:04:22 time: 1.5549 data_time: 0.0225 memory: 25568 grad_norm: 2.7070 loss: 1.3778 caption_loss_cls: 2.3195 detection_loss_cls: 0.0376 detection_loss_reg: 0.3523 semantic_segmentation_loss_cls: 0.0094 grounding_loss_reg: 2.8374 instance_segmentation_loss_cls: 0.0357 instance_segmentation_loss_reg: 0.3467 instance_segmentation_loss_poly: 0.9539 2024/01/05 06:20:51 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 06:20:51 - mmengine - INFO - Iter(train) [180000/640000] base_lr: 1.6344e-04 lr: 1.6344e-05 eta: 8 days, 5:54:01 time: 1.5596 data_time: 0.0226 memory: 25568 grad_norm: 2.7280 loss: 1.3735 caption_loss_cls: 2.3150 detection_loss_cls: 0.0373 detection_loss_reg: 0.3506 semantic_segmentation_loss_cls: 0.0094 grounding_loss_reg: 2.8314 instance_segmentation_loss_cls: 0.0356 instance_segmentation_loss_reg: 0.3472 instance_segmentation_loss_poly: 0.9546 2024/01/05 06:20:51 - mmengine - INFO - Saving checkpoint at 180000 iterations 2024/01/05 06:32:22 - mmengine - INFO - Evaluating bbox... 2024/01/05 06:33:18 - mmengine - INFO - bbox_mAP_copypaste: 0.475 0.657 0.524 0.319 0.528 0.616 2024/01/05 06:33:18 - mmengine - INFO - Evaluating segm... 2024/01/05 06:34:32 - mmengine - INFO - segm_mAP_copypaste: 0.304 0.562 0.293 0.165 0.359 0.479 2024/01/05 06:36:42 - mmengine - INFO - Evaluating bbox... 2024/01/05 06:37:41 - mmengine - INFO - bbox_mAP_copypaste: 0.474 0.656 0.522 0.319 0.526 0.614 2024/01/05 06:44:16 - mmengine - INFO - per class results: 2024/01/05 06:44:16 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 77.78 | 89.92 | | building | 81.65 | 91.95 | | sky | 93.36 | 97.23 | | floor | 81.9 | 89.13 | | tree | 74.5 | 86.03 | | ceiling | 84.77 | 92.13 | | road | 84.34 | 91.08 | | bed | 89.96 | 94.73 | | windowpane | 63.41 | 81.07 | | grass | 52.94 | 59.51 | | cabinet | 60.06 | 70.75 | | sidewalk | 65.57 | 82.09 | | person | 81.26 | 89.56 | | earth | 37.08 | 58.7 | | door | 49.05 | 54.08 | | table | 62.39 | 78.95 | | mountain | 53.52 | 58.17 | | plant | 50.93 | 58.08 | | curtain | 74.3 | 86.39 | | chair | 60.27 | 77.91 | | car | 83.58 | 89.07 | | water | 22.91 | 25.29 | | painting | 72.13 | 83.6 | | sofa | 72.12 | 84.98 | | shelf | 40.95 | 75.06 | | house | 39.32 | 53.04 | | sea | 43.12 | 81.71 | | mirror | 67.19 | 75.25 | | rug | 61.39 | 68.1 | | field | 26.25 | 77.89 | | armchair | 48.9 | 59.13 | | seat | 65.71 | 81.72 | | fence | 38.85 | 66.61 | | desk | 39.75 | 72.0 | | rock | 51.2 | 74.78 | | wardrobe | 20.7 | 21.08 | | lamp | 60.26 | 77.22 | | bathtub | 80.25 | 88.96 | | railing | 30.5 | 37.27 | | cushion | 56.4 | 77.78 | | base | 33.21 | 51.34 | | box | 26.02 | 34.05 | | column | 50.7 | 61.36 | | signboard | 38.46 | 56.36 | | chest of drawers | 29.66 | 51.2 | | counter | 34.42 | 50.25 | | sand | 28.13 | 75.68 | | sink | 72.48 | 78.66 | | skyscraper | 46.1 | 53.8 | | fireplace | 71.06 | 83.74 | | refrigerator | 70.7 | 72.86 | | grandstand | 48.18 | 71.67 | | path | 16.16 | 18.78 | | stairs | 38.59 | 46.89 | | runway | 77.43 | 93.66 | | case | 50.64 | 88.29 | | pool table | 91.14 | 94.49 | | pillow | 51.58 | 56.77 | | screen door | 76.67 | 82.18 | | stairway | 41.98 | 55.85 | | river | 11.17 | 38.71 | | bridge | 41.81 | 72.27 | | bookcase | 35.02 | 40.48 | | blind | 45.76 | 50.67 | | coffee table | 63.92 | 77.08 | | toilet | 85.17 | 91.14 | | flower | 39.53 | 57.84 | | book | 48.05 | 65.44 | | hill | 3.73 | 4.9 | | bench | 50.83 | 57.38 | | countertop | 59.19 | 76.33 | | stove | 73.12 | 76.49 | | palm | 47.48 | 75.7 | | kitchen island | 42.23 | 81.28 | | computer | 73.9 | 89.93 | | swivel chair | 28.74 | 31.95 | | boat | 56.95 | 84.78 | | bar | 41.94 | 54.6 | | arcade machine | 77.96 | 80.29 | | hovel | 18.24 | 22.96 | | bus | 91.17 | 95.68 | | towel | 61.26 | 72.53 | | light | 47.0 | 59.4 | | truck | 37.27 | 55.34 | | tower | 19.09 | 32.18 | | chandelier | 62.12 | 87.98 | | awning | 36.21 | 51.83 | | streetlight | 29.86 | 64.55 | | booth | 48.48 | 55.65 | | television receiver | 74.64 | 86.34 | | airplane | 71.87 | 82.25 | | dirt track | 4.41 | 16.49 | | apparel | 28.39 | 37.76 | | pole | 20.74 | 34.94 | | land | 2.68 | 4.7 | | bannister | 16.97 | 24.41 | | escalator | 35.37 | 44.98 | | ottoman | 51.76 | 73.89 | | bottle | 22.6 | 29.54 | | buffet | 47.2 | 52.08 | | poster | 37.05 | 61.92 | | stage | 17.94 | 27.64 | | van | 47.47 | 63.53 | | ship | 5.43 | 5.78 | | fountain | 26.17 | 27.52 | | conveyer belt | 78.26 | 88.54 | | canopy | 38.24 | 62.98 | | washer | 59.96 | 66.7 | | plaything | 34.16 | 39.08 | | swimming pool | 57.25 | 63.3 | | stool | 45.28 | 58.14 | | barrel | 13.02 | 51.79 | | basket | 32.45 | 42.09 | | waterfall | 73.5 | 90.65 | | tent | 87.51 | 97.77 | | bag | 26.76 | 36.93 | | minibike | 73.44 | 79.36 | | cradle | 77.42 | 96.35 | | oven | 41.65 | 62.55 | | ball | 35.1 | 54.33 | | food | 49.61 | 53.4 | | step | 8.08 | 9.19 | | tank | 49.12 | 51.64 | | trade name | 9.39 | 9.8 | | microwave | 75.67 | 82.13 | | pot | 48.33 | 55.6 | | animal | 59.09 | 61.28 | | bicycle | 47.71 | 53.14 | | lake | 4.29 | 5.92 | | dishwasher | 57.43 | 64.03 | | screen | 66.91 | 77.81 | | blanket | 23.51 | 27.46 | | sculpture | 52.75 | 87.11 | | hood | 32.95 | 95.07 | | sconce | 43.64 | 60.79 | | vase | 40.73 | 52.21 | | traffic light | 38.85 | 61.6 | | tray | 10.12 | 13.14 | | ashcan | 35.98 | 46.03 | | fan | 62.2 | 76.32 | | pier | 33.95 | 45.66 | | crt screen | 2.1 | 4.25 | | plate | 55.94 | 70.65 | | monitor | 15.87 | 16.93 | | bulletin board | 35.87 | 45.54 | | shower | 0.25 | 0.41 | | radiator | 63.79 | 72.71 | | glass | 17.33 | 18.96 | | clock | 32.51 | 46.55 | | flag | 32.58 | 35.4 | +---------------------+-------+-------+ 2024/01/05 06:44:30 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.4740 coco/bbox_mAP_50: 0.6560 coco/bbox_mAP_75: 0.5220 coco/bbox_mAP_s: 0.3190 coco/bbox_mAP_m: 0.5260 coco/bbox_mAP_l: 0.6140 coco/segm_mAP: 0.3040 coco/segm_mAP_50: 0.5620 coco/segm_mAP_75: 0.2930 coco/segm_mAP_s: 0.1650 coco/segm_mAP_m: 0.3590 coco/segm_mAP_l: 0.4790 Bleu_1: 0.7419 Bleu_2: 0.5691 Bleu_3: 0.4231 Bleu_4: 0.3094 METEOR: 0.2575 ROUGE_L: 0.5404 CIDEr: 1.0095 SPICE: 0.1917 aAcc: 82.1300 mIoU: 47.7500 mAcc: 60.7100 visual-grounding/miou: 0.7689 visual-grounding/acc: 0.8381 data_time: 0.0296 time: 1.9200 2024/01/05 06:57:14 - mmengine - INFO - Iter(train) [180500/640000] base_lr: 1.6325e-04 lr: 1.6325e-05 eta: 8 days, 5:35:18 time: 1.5532 data_time: 0.0174 memory: 34658 grad_norm: 2.6794 loss: 1.3579 caption_loss_cls: 2.3125 detection_loss_cls: 0.0371 detection_loss_reg: 0.3496 semantic_segmentation_loss_cls: 0.0094 grounding_loss_reg: 2.8321 instance_segmentation_loss_cls: 0.0355 instance_segmentation_loss_reg: 0.3477 instance_segmentation_loss_poly: 0.9552 2024/01/05 07:09:53 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 07:09:53 - mmengine - INFO - Iter(train) [181000/640000] base_lr: 1.6306e-04 lr: 1.6306e-05 eta: 8 days, 5:11:48 time: 1.5462 data_time: 0.0176 memory: 25568 grad_norm: 2.6600 loss: 1.3571 caption_loss_cls: 2.3093 detection_loss_cls: 0.0371 detection_loss_reg: 0.3495 semantic_segmentation_loss_cls: 0.0094 grounding_loss_reg: 2.8320 instance_segmentation_loss_cls: 0.0356 instance_segmentation_loss_reg: 0.3479 instance_segmentation_loss_poly: 0.9552 2024/01/05 07:22:07 - mmengine - INFO - Iter(train) [181500/640000] base_lr: 1.6287e-04 lr: 1.6287e-05 eta: 8 days, 4:32:31 time: 1.5419 data_time: 0.0179 memory: 25568 grad_norm: 2.6599 loss: 1.3580 caption_loss_cls: 2.3055 detection_loss_cls: 0.0369 detection_loss_reg: 0.3479 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.8279 instance_segmentation_loss_cls: 0.0356 instance_segmentation_loss_reg: 0.3482 instance_segmentation_loss_poly: 0.9565 2024/01/05 07:34:58 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 07:34:58 - mmengine - INFO - Iter(train) [182000/640000] base_lr: 1.6268e-04 lr: 1.6268e-05 eta: 8 days, 4:19:38 time: 1.5384 data_time: 0.0182 memory: 25568 grad_norm: 2.6626 loss: 1.3676 caption_loss_cls: 2.3056 detection_loss_cls: 0.0367 detection_loss_reg: 0.3467 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.8282 instance_segmentation_loss_cls: 0.0359 instance_segmentation_loss_reg: 0.3493 instance_segmentation_loss_poly: 0.9589 2024/01/05 07:34:58 - mmengine - INFO - Saving checkpoint at 182000 iterations 2024/01/05 07:47:35 - mmengine - INFO - Iter(train) [182500/640000] base_lr: 1.6249e-04 lr: 1.6249e-05 eta: 8 days, 3:57:25 time: 1.5280 data_time: 0.0197 memory: 25568 grad_norm: 2.6490 loss: 1.3745 caption_loss_cls: 2.3123 detection_loss_cls: 0.0366 detection_loss_reg: 0.3464 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.8281 instance_segmentation_loss_cls: 0.0361 instance_segmentation_loss_reg: 0.3500 instance_segmentation_loss_poly: 0.9612 2024/01/05 08:00:23 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 08:00:23 - mmengine - INFO - Iter(train) [183000/640000] base_lr: 1.6229e-04 lr: 1.6229e-05 eta: 8 days, 3:42:50 time: 1.5264 data_time: 0.0198 memory: 25568 grad_norm: 2.6232 loss: 1.3506 caption_loss_cls: 2.3063 detection_loss_cls: 0.0369 detection_loss_reg: 0.3500 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.8272 instance_segmentation_loss_cls: 0.0358 instance_segmentation_loss_reg: 0.3466 instance_segmentation_loss_poly: 0.9530 2024/01/05 08:12:49 - mmengine - INFO - Iter(train) [183500/640000] base_lr: 1.6210e-04 lr: 1.6210e-05 eta: 8 days, 3:15:48 time: 1.5195 data_time: 0.0200 memory: 25568 grad_norm: 2.6212 loss: 1.3548 caption_loss_cls: 2.3092 detection_loss_cls: 0.0370 detection_loss_reg: 0.3505 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.8221 instance_segmentation_loss_cls: 0.0359 instance_segmentation_loss_reg: 0.3485 instance_segmentation_loss_poly: 0.9575 2024/01/05 08:25:13 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 08:25:13 - mmengine - INFO - Iter(train) [184000/640000] base_lr: 1.6191e-04 lr: 1.6191e-05 eta: 8 days, 2:48:37 time: 1.5110 data_time: 0.0202 memory: 25568 grad_norm: 2.5924 loss: 1.3698 caption_loss_cls: 2.3090 detection_loss_cls: 0.0372 detection_loss_reg: 0.3512 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.8254 instance_segmentation_loss_cls: 0.0358 instance_segmentation_loss_reg: 0.3483 instance_segmentation_loss_poly: 0.9572 2024/01/05 08:25:13 - mmengine - INFO - Saving checkpoint at 184000 iterations 2024/01/05 08:38:22 - mmengine - INFO - Iter(train) [184500/640000] base_lr: 1.6172e-04 lr: 1.6172e-05 eta: 8 days, 2:46:28 time: 1.5167 data_time: 0.0265 memory: 25568 grad_norm: 2.5894 loss: 1.3753 caption_loss_cls: 2.3040 detection_loss_cls: 0.0372 detection_loss_reg: 0.3499 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.8229 instance_segmentation_loss_cls: 0.0358 instance_segmentation_loss_reg: 0.3492 instance_segmentation_loss_poly: 0.9597 2024/01/05 08:51:12 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 08:51:12 - mmengine - INFO - Iter(train) [185000/640000] base_lr: 1.6152e-04 lr: 1.6152e-05 eta: 8 days, 2:33:53 time: 1.5195 data_time: 0.0265 memory: 25568 grad_norm: 2.6100 loss: 1.3781 caption_loss_cls: 2.3012 detection_loss_cls: 0.0371 detection_loss_reg: 0.3500 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.8234 instance_segmentation_loss_cls: 0.0356 instance_segmentation_loss_reg: 0.3477 instance_segmentation_loss_poly: 0.9573 2024/01/05 09:03:27 - mmengine - INFO - Iter(train) [185500/640000] base_lr: 1.6133e-04 lr: 1.6133e-05 eta: 8 days, 2:03:50 time: 1.5198 data_time: 0.0265 memory: 25568 grad_norm: 2.6252 loss: 1.3699 caption_loss_cls: 2.2948 detection_loss_cls: 0.0372 detection_loss_reg: 0.3499 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.8223 instance_segmentation_loss_cls: 0.0358 instance_segmentation_loss_reg: 0.3489 instance_segmentation_loss_poly: 0.9587 2024/01/05 09:16:15 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 09:16:15 - mmengine - INFO - Iter(train) [186000/640000] base_lr: 1.6114e-04 lr: 1.6114e-05 eta: 8 days, 1:50:29 time: 1.5187 data_time: 0.0265 memory: 25568 grad_norm: 2.6161 loss: 1.3514 caption_loss_cls: 2.2952 detection_loss_cls: 0.0371 detection_loss_reg: 0.3483 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.8211 instance_segmentation_loss_cls: 0.0355 instance_segmentation_loss_reg: 0.3472 instance_segmentation_loss_poly: 0.9552 2024/01/05 09:16:15 - mmengine - INFO - Saving checkpoint at 186000 iterations 2024/01/05 09:29:30 - mmengine - INFO - Iter(train) [186500/640000] base_lr: 1.6094e-04 lr: 1.6094e-05 eta: 8 days, 1:49:48 time: 1.5284 data_time: 0.0259 memory: 25568 grad_norm: 2.6296 loss: 1.3339 caption_loss_cls: 2.2845 detection_loss_cls: 0.0369 detection_loss_reg: 0.3470 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.8158 instance_segmentation_loss_cls: 0.0354 instance_segmentation_loss_reg: 0.3454 instance_segmentation_loss_poly: 0.9507 2024/01/05 09:41:55 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 09:41:55 - mmengine - INFO - Iter(train) [187000/640000] base_lr: 1.6075e-04 lr: 1.6075e-05 eta: 8 days, 1:26:13 time: 1.5227 data_time: 0.0259 memory: 25568 grad_norm: 2.6615 loss: 1.3459 caption_loss_cls: 2.2846 detection_loss_cls: 0.0369 detection_loss_reg: 0.3476 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.8086 instance_segmentation_loss_cls: 0.0354 instance_segmentation_loss_reg: 0.3454 instance_segmentation_loss_poly: 0.9502 2024/01/05 09:54:21 - mmengine - INFO - Iter(train) [187500/640000] base_lr: 1.6055e-04 lr: 1.6055e-05 eta: 8 days, 1:03:19 time: 1.5226 data_time: 0.0258 memory: 25568 grad_norm: 2.6758 loss: 1.3449 caption_loss_cls: 2.2800 detection_loss_cls: 0.0368 detection_loss_reg: 0.3470 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.8092 instance_segmentation_loss_cls: 0.0356 instance_segmentation_loss_reg: 0.3472 instance_segmentation_loss_poly: 0.9534 2024/01/05 10:06:59 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 10:06:59 - mmengine - INFO - Iter(train) [188000/640000] base_lr: 1.6036e-04 lr: 1.6036e-05 eta: 8 days, 0:46:30 time: 1.5263 data_time: 0.0258 memory: 25568 grad_norm: 2.6432 loss: 1.3223 caption_loss_cls: 2.2744 detection_loss_cls: 0.0365 detection_loss_reg: 0.3451 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.8086 instance_segmentation_loss_cls: 0.0357 instance_segmentation_loss_reg: 0.3475 instance_segmentation_loss_poly: 0.9526 2024/01/05 10:06:59 - mmengine - INFO - Saving checkpoint at 188000 iterations 2024/01/05 10:20:04 - mmengine - INFO - Iter(train) [188500/640000] base_lr: 1.6016e-04 lr: 1.6016e-05 eta: 8 days, 0:40:27 time: 1.5250 data_time: 0.0259 memory: 25568 grad_norm: 2.6732 loss: 1.3263 caption_loss_cls: 2.2752 detection_loss_cls: 0.0366 detection_loss_reg: 0.3458 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.8088 instance_segmentation_loss_cls: 0.0359 instance_segmentation_loss_reg: 0.3483 instance_segmentation_loss_poly: 0.9534 2024/01/05 10:32:16 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 10:32:16 - mmengine - INFO - Iter(train) [189000/640000] base_lr: 1.5996e-04 lr: 1.5996e-05 eta: 8 days, 0:13:29 time: 1.5155 data_time: 0.0259 memory: 25568 grad_norm: 2.6848 loss: 1.3374 caption_loss_cls: 2.2749 detection_loss_cls: 0.0366 detection_loss_reg: 0.3472 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.8097 instance_segmentation_loss_cls: 0.0360 instance_segmentation_loss_reg: 0.3500 instance_segmentation_loss_poly: 0.9556 2024/01/05 10:45:21 - mmengine - INFO - Iter(train) [189500/640000] base_lr: 1.5977e-04 lr: 1.5977e-05 eta: 8 days, 0:07:25 time: 1.5281 data_time: 0.0261 memory: 25568 grad_norm: 2.6346 loss: 1.3275 caption_loss_cls: 2.2749 detection_loss_cls: 0.0365 detection_loss_reg: 0.3468 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.8074 instance_segmentation_loss_cls: 0.0363 instance_segmentation_loss_reg: 0.3512 instance_segmentation_loss_poly: 0.9578 2024/01/05 10:57:46 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 10:57:46 - mmengine - INFO - Iter(train) [190000/640000] base_lr: 1.5957e-04 lr: 1.5957e-05 eta: 7 days, 23:46:04 time: 1.5224 data_time: 0.0260 memory: 25568 grad_norm: 2.6867 loss: 1.3505 caption_loss_cls: 2.2803 detection_loss_cls: 0.0365 detection_loss_reg: 0.3483 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.8036 instance_segmentation_loss_cls: 0.0363 instance_segmentation_loss_reg: 0.3516 instance_segmentation_loss_poly: 0.9581 2024/01/05 10:57:46 - mmengine - INFO - Saving checkpoint at 190000 iterations 2024/01/05 11:11:02 - mmengine - INFO - Iter(train) [190500/640000] base_lr: 1.5937e-04 lr: 1.5937e-05 eta: 7 days, 23:43:37 time: 1.5225 data_time: 0.0261 memory: 25568 grad_norm: 2.6848 loss: 1.3517 caption_loss_cls: 2.2727 detection_loss_cls: 0.0365 detection_loss_reg: 0.3467 semantic_segmentation_loss_cls: 0.0091 grounding_loss_reg: 2.8025 instance_segmentation_loss_cls: 0.0365 instance_segmentation_loss_reg: 0.3513 instance_segmentation_loss_poly: 0.9575 2024/01/05 11:23:18 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 11:23:18 - mmengine - INFO - Iter(train) [191000/640000] base_lr: 1.5918e-04 lr: 1.5918e-05 eta: 7 days, 23:19:23 time: 1.5202 data_time: 0.0261 memory: 25568 grad_norm: 2.7005 loss: 1.3613 caption_loss_cls: 2.2732 detection_loss_cls: 0.0364 detection_loss_reg: 0.3462 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.8003 instance_segmentation_loss_cls: 0.0364 instance_segmentation_loss_reg: 0.3521 instance_segmentation_loss_poly: 0.9580 2024/01/05 11:35:57 - mmengine - INFO - Iter(train) [191500/640000] base_lr: 1.5898e-04 lr: 1.5898e-05 eta: 7 days, 23:03:46 time: 1.5236 data_time: 0.0261 memory: 25568 grad_norm: 2.7118 loss: 1.3577 caption_loss_cls: 2.2703 detection_loss_cls: 0.0365 detection_loss_reg: 0.3461 semantic_segmentation_loss_cls: 0.0091 grounding_loss_reg: 2.7964 instance_segmentation_loss_cls: 0.0364 instance_segmentation_loss_reg: 0.3525 instance_segmentation_loss_poly: 0.9593 2024/01/05 11:48:52 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 11:48:52 - mmengine - INFO - Iter(train) [192000/640000] base_lr: 1.5878e-04 lr: 1.5878e-05 eta: 7 days, 22:54:01 time: 1.5279 data_time: 0.0261 memory: 25568 grad_norm: 2.6968 loss: 1.3550 caption_loss_cls: 2.2746 detection_loss_cls: 0.0363 detection_loss_reg: 0.3456 semantic_segmentation_loss_cls: 0.0091 grounding_loss_reg: 2.7990 instance_segmentation_loss_cls: 0.0362 instance_segmentation_loss_reg: 0.3505 instance_segmentation_loss_poly: 0.9560 2024/01/05 11:48:52 - mmengine - INFO - Saving checkpoint at 192000 iterations 2024/01/05 12:02:00 - mmengine - INFO - Iter(train) [192500/640000] base_lr: 1.5858e-04 lr: 1.5858e-05 eta: 7 days, 22:47:52 time: 1.5286 data_time: 0.0259 memory: 25568 grad_norm: 2.6545 loss: 1.3581 caption_loss_cls: 2.2744 detection_loss_cls: 0.0362 detection_loss_reg: 0.3432 semantic_segmentation_loss_cls: 0.0091 grounding_loss_reg: 2.8018 instance_segmentation_loss_cls: 0.0364 instance_segmentation_loss_reg: 0.3515 instance_segmentation_loss_poly: 0.9591 2024/01/05 12:14:10 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 12:14:10 - mmengine - INFO - Iter(train) [193000/640000] base_lr: 1.5838e-04 lr: 1.5838e-05 eta: 7 days, 22:22:57 time: 1.5280 data_time: 0.0258 memory: 25568 grad_norm: 2.6628 loss: 1.3490 caption_loss_cls: 2.2750 detection_loss_cls: 0.0361 detection_loss_reg: 0.3421 semantic_segmentation_loss_cls: 0.0091 grounding_loss_reg: 2.8028 instance_segmentation_loss_cls: 0.0362 instance_segmentation_loss_reg: 0.3495 instance_segmentation_loss_poly: 0.9558 2024/01/05 12:26:30 - mmengine - INFO - Iter(train) [193500/640000] base_lr: 1.5818e-04 lr: 1.5818e-05 eta: 7 days, 22:01:47 time: 1.5169 data_time: 0.0256 memory: 25568 grad_norm: 2.7123 loss: 1.3614 caption_loss_cls: 2.2760 detection_loss_cls: 0.0361 detection_loss_reg: 0.3417 semantic_segmentation_loss_cls: 0.0091 grounding_loss_reg: 2.8028 instance_segmentation_loss_cls: 0.0361 instance_segmentation_loss_reg: 0.3476 instance_segmentation_loss_poly: 0.9522 2024/01/05 12:38:57 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 12:38:57 - mmengine - INFO - Iter(train) [194000/640000] base_lr: 1.5798e-04 lr: 1.5798e-05 eta: 7 days, 21:42:57 time: 1.5173 data_time: 0.0257 memory: 25568 grad_norm: 2.7185 loss: 1.3577 caption_loss_cls: 2.2788 detection_loss_cls: 0.0359 detection_loss_reg: 0.3389 semantic_segmentation_loss_cls: 0.0091 grounding_loss_reg: 2.8021 instance_segmentation_loss_cls: 0.0361 instance_segmentation_loss_reg: 0.3462 instance_segmentation_loss_poly: 0.9488 2024/01/05 12:38:57 - mmengine - INFO - Saving checkpoint at 194000 iterations 2024/01/05 12:52:02 - mmengine - INFO - Iter(train) [194500/640000] base_lr: 1.5778e-04 lr: 1.5778e-05 eta: 7 days, 21:36:05 time: 1.5148 data_time: 0.0256 memory: 25568 grad_norm: 2.7226 loss: 1.3536 caption_loss_cls: 2.2740 detection_loss_cls: 0.0360 detection_loss_reg: 0.3395 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.8003 instance_segmentation_loss_cls: 0.0360 instance_segmentation_loss_reg: 0.3450 instance_segmentation_loss_poly: 0.9459 2024/01/05 13:05:12 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 13:05:12 - mmengine - INFO - Iter(train) [195000/640000] base_lr: 1.5758e-04 lr: 1.5758e-05 eta: 7 days, 21:30:24 time: 1.5283 data_time: 0.0257 memory: 25568 grad_norm: 2.7088 loss: 1.3435 caption_loss_cls: 2.2737 detection_loss_cls: 0.0359 detection_loss_reg: 0.3391 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.7995 instance_segmentation_loss_cls: 0.0360 instance_segmentation_loss_reg: 0.3448 instance_segmentation_loss_poly: 0.9454 2024/01/05 13:17:24 - mmengine - INFO - Iter(train) [195500/640000] base_lr: 1.5738e-04 lr: 1.5738e-05 eta: 7 days, 21:07:21 time: 1.5214 data_time: 0.0256 memory: 25568 grad_norm: 2.7557 loss: 1.3565 caption_loss_cls: 2.2710 detection_loss_cls: 0.0359 detection_loss_reg: 0.3391 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.7951 instance_segmentation_loss_cls: 0.0360 instance_segmentation_loss_reg: 0.3444 instance_segmentation_loss_poly: 0.9438 2024/01/05 13:29:22 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 13:29:22 - mmengine - INFO - Iter(train) [196000/640000] base_lr: 1.5718e-04 lr: 1.5718e-05 eta: 7 days, 20:40:56 time: 1.5070 data_time: 0.0255 memory: 25568 grad_norm: 2.8255 loss: 1.3755 caption_loss_cls: 2.2700 detection_loss_cls: 0.0358 detection_loss_reg: 0.3391 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.7940 instance_segmentation_loss_cls: 0.0360 instance_segmentation_loss_reg: 0.3444 instance_segmentation_loss_poly: 0.9440 2024/01/05 13:29:22 - mmengine - INFO - Saving checkpoint at 196000 iterations 2024/01/05 13:42:12 - mmengine - INFO - Iter(train) [196500/640000] base_lr: 1.5698e-04 lr: 1.5698e-05 eta: 7 days, 20:29:28 time: 1.5026 data_time: 0.0256 memory: 25568 grad_norm: 2.8786 loss: 1.3724 caption_loss_cls: 2.2664 detection_loss_cls: 0.0356 detection_loss_reg: 0.3370 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.7940 instance_segmentation_loss_cls: 0.0361 instance_segmentation_loss_reg: 0.3450 instance_segmentation_loss_poly: 0.9440 2024/01/05 13:54:58 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 13:54:58 - mmengine - INFO - Iter(train) [197000/640000] base_lr: 1.5678e-04 lr: 1.5678e-05 eta: 7 days, 20:17:09 time: 1.5117 data_time: 0.0256 memory: 25568 grad_norm: 2.8962 loss: 1.3740 caption_loss_cls: 2.2672 detection_loss_cls: 0.0356 detection_loss_reg: 0.3372 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.7951 instance_segmentation_loss_cls: 0.0360 instance_segmentation_loss_reg: 0.3436 instance_segmentation_loss_poly: 0.9403 2024/01/05 14:07:13 - mmengine - INFO - Iter(train) [197500/640000] base_lr: 1.5657e-04 lr: 1.5657e-05 eta: 7 days, 19:56:17 time: 1.5104 data_time: 0.0257 memory: 25568 grad_norm: 2.9121 loss: 1.3819 caption_loss_cls: 2.2670 detection_loss_cls: 0.0355 detection_loss_reg: 0.3375 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.7955 instance_segmentation_loss_cls: 0.0361 instance_segmentation_loss_reg: 0.3452 instance_segmentation_loss_poly: 0.9437 2024/01/05 14:19:56 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 14:19:56 - mmengine - INFO - Iter(train) [198000/640000] base_lr: 1.5637e-04 lr: 1.5637e-05 eta: 7 days, 19:43:03 time: 1.5144 data_time: 0.0257 memory: 25568 grad_norm: 2.8886 loss: 1.3688 caption_loss_cls: 2.2723 detection_loss_cls: 0.0356 detection_loss_reg: 0.3384 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.7910 instance_segmentation_loss_cls: 0.0359 instance_segmentation_loss_reg: 0.3440 instance_segmentation_loss_poly: 0.9410 2024/01/05 14:19:56 - mmengine - INFO - Saving checkpoint at 198000 iterations 2024/01/05 14:33:23 - mmengine - INFO - Iter(train) [198500/640000] base_lr: 1.5617e-04 lr: 1.5617e-05 eta: 7 days, 19:41:04 time: 1.5197 data_time: 0.0260 memory: 25568 grad_norm: 2.8560 loss: 1.3823 caption_loss_cls: 2.2782 detection_loss_cls: 0.0358 detection_loss_reg: 0.3396 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.7919 instance_segmentation_loss_cls: 0.0361 instance_segmentation_loss_reg: 0.3463 instance_segmentation_loss_poly: 0.9445 2024/01/05 14:45:50 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 14:45:50 - mmengine - INFO - Iter(train) [199000/640000] base_lr: 1.5596e-04 lr: 1.5596e-05 eta: 7 days, 19:23:47 time: 1.5090 data_time: 0.0260 memory: 25568 grad_norm: 2.8880 loss: 1.3921 caption_loss_cls: 2.2879 detection_loss_cls: 0.0357 detection_loss_reg: 0.3392 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.7874 instance_segmentation_loss_cls: 0.0360 instance_segmentation_loss_reg: 0.3469 instance_segmentation_loss_poly: 0.9453 2024/01/05 14:58:35 - mmengine - INFO - Iter(train) [199500/640000] base_lr: 1.5576e-04 lr: 1.5576e-05 eta: 7 days, 19:10:57 time: 1.5174 data_time: 0.0261 memory: 25568 grad_norm: 2.8501 loss: 1.3854 caption_loss_cls: 2.2956 detection_loss_cls: 0.0360 detection_loss_reg: 0.3416 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.7850 instance_segmentation_loss_cls: 0.0357 instance_segmentation_loss_reg: 0.3450 instance_segmentation_loss_poly: 0.9396 2024/01/05 15:11:59 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 15:11:59 - mmengine - INFO - Iter(train) [200000/640000] base_lr: 1.5556e-04 lr: 1.5556e-05 eta: 7 days, 19:07:51 time: 1.5390 data_time: 0.0264 memory: 25568 grad_norm: 2.7846 loss: 1.3734 caption_loss_cls: 2.2980 detection_loss_cls: 0.0360 detection_loss_reg: 0.3402 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.7866 instance_segmentation_loss_cls: 0.0356 instance_segmentation_loss_reg: 0.3451 instance_segmentation_loss_poly: 0.9394 2024/01/05 15:11:59 - mmengine - INFO - Saving checkpoint at 200000 iterations 2024/01/05 15:23:56 - mmengine - INFO - Evaluating bbox... 2024/01/05 15:24:53 - mmengine - INFO - bbox_mAP_copypaste: 0.481 0.665 0.528 0.333 0.527 0.615 2024/01/05 15:24:53 - mmengine - INFO - Evaluating segm... 2024/01/05 15:26:05 - mmengine - INFO - segm_mAP_copypaste: 0.314 0.572 0.306 0.178 0.361 0.488 2024/01/05 15:28:15 - mmengine - INFO - Evaluating bbox... 2024/01/05 15:29:13 - mmengine - INFO - bbox_mAP_copypaste: 0.481 0.665 0.528 0.333 0.526 0.616 2024/01/05 15:35:52 - mmengine - INFO - per class results: 2024/01/05 15:35:52 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 77.53 | 88.69 | | building | 81.27 | 92.88 | | sky | 93.5 | 97.08 | | floor | 81.35 | 90.43 | | tree | 74.17 | 87.8 | | ceiling | 84.9 | 92.9 | | road | 84.56 | 89.62 | | bed | 88.91 | 94.46 | | windowpane | 62.98 | 79.36 | | grass | 65.95 | 83.81 | | cabinet | 61.88 | 73.43 | | sidewalk | 66.41 | 77.38 | | person | 80.25 | 91.32 | | earth | 30.43 | 37.13 | | door | 52.75 | 63.8 | | table | 61.97 | 77.84 | | mountain | 54.8 | 72.04 | | plant | 52.61 | 67.65 | | curtain | 75.06 | 88.98 | | chair | 58.03 | 72.4 | | car | 83.42 | 90.18 | | water | 54.94 | 65.44 | | painting | 71.39 | 89.2 | | sofa | 71.16 | 82.34 | | shelf | 41.68 | 61.3 | | house | 35.98 | 48.01 | | sea | 64.09 | 89.61 | | mirror | 68.11 | 77.25 | | rug | 56.51 | 62.91 | | field | 28.93 | 53.57 | | armchair | 49.11 | 73.18 | | seat | 62.58 | 84.26 | | fence | 44.2 | 61.2 | | desk | 47.98 | 68.15 | | rock | 41.37 | 60.47 | | wardrobe | 45.27 | 70.74 | | lamp | 61.11 | 75.49 | | bathtub | 77.29 | 84.16 | | railing | 38.62 | 52.1 | | cushion | 58.88 | 78.17 | | base | 23.64 | 35.58 | | box | 28.14 | 38.8 | | column | 51.1 | 58.33 | | signboard | 38.28 | 51.19 | | chest of drawers | 37.11 | 59.2 | | counter | 26.38 | 41.19 | | sand | 54.53 | 67.67 | | sink | 74.75 | 82.97 | | skyscraper | 43.38 | 50.83 | | fireplace | 71.09 | 87.26 | | refrigerator | 76.9 | 84.44 | | grandstand | 49.69 | 76.04 | | path | 25.58 | 42.12 | | stairs | 26.0 | 27.74 | | runway | 69.68 | 88.08 | | case | 43.78 | 57.56 | | pool table | 91.39 | 95.55 | | pillow | 56.09 | 65.08 | | screen door | 82.6 | 91.51 | | stairway | 32.66 | 42.85 | | river | 14.41 | 19.35 | | bridge | 35.66 | 45.4 | | bookcase | 33.65 | 50.89 | | blind | 38.51 | 42.16 | | coffee table | 62.9 | 80.56 | | toilet | 84.14 | 91.21 | | flower | 37.23 | 61.31 | | book | 47.22 | 65.7 | | hill | 11.84 | 19.55 | | bench | 52.27 | 60.14 | | countertop | 55.77 | 65.13 | | stove | 75.77 | 79.47 | | palm | 45.78 | 69.86 | | kitchen island | 44.92 | 88.03 | | computer | 76.94 | 86.07 | | swivel chair | 22.05 | 25.01 | | boat | 72.44 | 83.34 | | bar | 51.07 | 65.25 | | arcade machine | 72.63 | 75.22 | | hovel | 18.04 | 19.21 | | bus | 90.45 | 96.27 | | towel | 58.58 | 74.23 | | light | 49.22 | 56.55 | | truck | 38.21 | 56.99 | | tower | 21.59 | 34.96 | | chandelier | 65.95 | 86.73 | | awning | 23.91 | 35.09 | | streetlight | 29.78 | 37.09 | | booth | 48.2 | 61.21 | | television receiver | 61.42 | 85.89 | | airplane | 72.52 | 82.99 | | dirt track | 2.07 | 4.56 | | apparel | 30.76 | 42.38 | | pole | 26.42 | 34.62 | | land | 4.18 | 6.12 | | bannister | 11.34 | 15.59 | | escalator | 45.93 | 57.44 | | ottoman | 50.76 | 71.99 | | bottle | 22.65 | 28.24 | | buffet | 55.42 | 64.77 | | poster | 34.85 | 47.4 | | stage | 14.77 | 25.7 | | van | 40.71 | 64.6 | | ship | 7.9 | 8.12 | | fountain | 36.89 | 38.82 | | conveyer belt | 75.5 | 91.56 | | canopy | 37.96 | 59.57 | | washer | 65.5 | 70.39 | | plaything | 25.74 | 31.26 | | swimming pool | 57.35 | 61.14 | | stool | 44.45 | 63.87 | | barrel | 39.42 | 64.24 | | basket | 30.68 | 37.09 | | waterfall | 68.27 | 87.81 | | tent | 72.51 | 97.09 | | bag | 20.72 | 31.0 | | minibike | 70.37 | 81.96 | | cradle | 73.85 | 97.82 | | oven | 45.21 | 58.66 | | ball | 32.21 | 43.33 | | food | 41.11 | 42.77 | | step | 5.43 | 6.93 | | tank | 45.29 | 51.06 | | trade name | 30.43 | 35.53 | | microwave | 85.06 | 93.66 | | pot | 47.16 | 54.2 | | animal | 61.51 | 64.97 | | bicycle | 53.38 | 69.92 | | lake | 41.18 | 78.86 | | dishwasher | 63.13 | 77.12 | | screen | 74.13 | 89.97 | | blanket | 23.82 | 28.95 | | sculpture | 59.77 | 78.1 | | hood | 62.17 | 72.96 | | sconce | 43.48 | 54.46 | | vase | 37.79 | 58.14 | | traffic light | 37.7 | 57.21 | | tray | 14.53 | 24.78 | | ashcan | 40.58 | 54.48 | | fan | 59.55 | 72.35 | | pier | 38.57 | 50.96 | | crt screen | 6.33 | 9.71 | | plate | 54.94 | 75.89 | | monitor | 29.74 | 32.25 | | bulletin board | 41.88 | 51.96 | | shower | 3.78 | 4.02 | | radiator | 58.85 | 69.89 | | glass | 19.35 | 22.56 | | clock | 29.45 | 34.22 | | flag | 30.47 | 38.48 | +---------------------+-------+-------+ 2024/01/05 15:36:05 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.4810 coco/bbox_mAP_50: 0.6650 coco/bbox_mAP_75: 0.5280 coco/bbox_mAP_s: 0.3330 coco/bbox_mAP_m: 0.5260 coco/bbox_mAP_l: 0.6160 coco/segm_mAP: 0.3140 coco/segm_mAP_50: 0.5720 coco/segm_mAP_75: 0.3060 coco/segm_mAP_s: 0.1780 coco/segm_mAP_m: 0.3610 coco/segm_mAP_l: 0.4880 Bleu_1: 0.7415 Bleu_2: 0.5747 Bleu_3: 0.4299 Bleu_4: 0.3183 METEOR: 0.2594 ROUGE_L: 0.5450 CIDEr: 1.0138 SPICE: 0.1915 aAcc: 82.9900 mIoU: 49.2800 mAcc: 61.5700 visual-grounding/miou: 0.7751 visual-grounding/acc: 0.8450 data_time: 0.0116 time: 1.9264 2024/01/05 15:48:25 - mmengine - INFO - Iter(train) [200500/640000] base_lr: 1.5535e-04 lr: 1.5535e-05 eta: 7 days, 18:49:36 time: 1.5322 data_time: 0.0200 memory: 34658 grad_norm: 2.7733 loss: 1.3808 caption_loss_cls: 2.3004 detection_loss_cls: 0.0361 detection_loss_reg: 0.3408 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.7812 instance_segmentation_loss_cls: 0.0358 instance_segmentation_loss_reg: 0.3464 instance_segmentation_loss_poly: 0.9426 2024/01/05 16:01:12 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 16:01:12 - mmengine - INFO - Iter(train) [201000/640000] base_lr: 1.5515e-04 lr: 1.5515e-05 eta: 7 days, 18:37:20 time: 1.5324 data_time: 0.0202 memory: 25567 grad_norm: 2.7293 loss: 1.3863 caption_loss_cls: 2.2971 detection_loss_cls: 0.0364 detection_loss_reg: 0.3431 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.7832 instance_segmentation_loss_cls: 0.0360 instance_segmentation_loss_reg: 0.3476 instance_segmentation_loss_poly: 0.9459 2024/01/05 16:13:19 - mmengine - INFO - Iter(train) [201500/640000] base_lr: 1.5494e-04 lr: 1.5494e-05 eta: 7 days, 18:15:37 time: 1.5304 data_time: 0.0202 memory: 25567 grad_norm: 2.7434 loss: 1.3861 caption_loss_cls: 2.2936 detection_loss_cls: 0.0363 detection_loss_reg: 0.3439 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.7859 instance_segmentation_loss_cls: 0.0358 instance_segmentation_loss_reg: 0.3471 instance_segmentation_loss_poly: 0.9455 2024/01/05 16:26:34 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 16:26:34 - mmengine - INFO - Iter(train) [202000/640000] base_lr: 1.5474e-04 lr: 1.5474e-05 eta: 7 days, 18:09:42 time: 1.5383 data_time: 0.0203 memory: 25567 grad_norm: 2.7478 loss: 1.3782 caption_loss_cls: 2.2891 detection_loss_cls: 0.0364 detection_loss_reg: 0.3443 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.7841 instance_segmentation_loss_cls: 0.0356 instance_segmentation_loss_reg: 0.3463 instance_segmentation_loss_poly: 0.9428 2024/01/05 16:26:34 - mmengine - INFO - Saving checkpoint at 202000 iterations 2024/01/05 16:39:29 - mmengine - INFO - Iter(train) [202500/640000] base_lr: 1.5453e-04 lr: 1.5453e-05 eta: 7 days, 17:59:13 time: 1.5306 data_time: 0.0202 memory: 25567 grad_norm: 2.8103 loss: 1.3772 caption_loss_cls: 2.2891 detection_loss_cls: 0.0364 detection_loss_reg: 0.3444 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.7844 instance_segmentation_loss_cls: 0.0356 instance_segmentation_loss_reg: 0.3470 instance_segmentation_loss_poly: 0.9434 2024/01/05 16:52:00 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 16:52:00 - mmengine - INFO - Iter(train) [203000/640000] base_lr: 1.5433e-04 lr: 1.5433e-05 eta: 7 days, 17:43:15 time: 1.5314 data_time: 0.0203 memory: 25567 grad_norm: 2.7943 loss: 1.3776 caption_loss_cls: 2.2830 detection_loss_cls: 0.0363 detection_loss_reg: 0.3431 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.7857 instance_segmentation_loss_cls: 0.0355 instance_segmentation_loss_reg: 0.3476 instance_segmentation_loss_poly: 0.9434 2024/01/05 17:04:44 - mmengine - INFO - Iter(train) [203500/640000] base_lr: 1.5412e-04 lr: 1.5412e-05 eta: 7 days, 17:30:10 time: 1.5311 data_time: 0.0202 memory: 25567 grad_norm: 2.7863 loss: 1.3613 caption_loss_cls: 2.2773 detection_loss_cls: 0.0361 detection_loss_reg: 0.3416 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.7880 instance_segmentation_loss_cls: 0.0352 instance_segmentation_loss_reg: 0.3452 instance_segmentation_loss_poly: 0.9391 2024/01/05 17:17:06 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 17:17:06 - mmengine - INFO - Iter(train) [204000/640000] base_lr: 1.5391e-04 lr: 1.5391e-05 eta: 7 days, 17:12:37 time: 1.5157 data_time: 0.0200 memory: 25567 grad_norm: 2.8382 loss: 1.3579 caption_loss_cls: 2.2743 detection_loss_cls: 0.0358 detection_loss_reg: 0.3408 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.7863 instance_segmentation_loss_cls: 0.0352 instance_segmentation_loss_reg: 0.3445 instance_segmentation_loss_poly: 0.9381 2024/01/05 17:17:06 - mmengine - INFO - Saving checkpoint at 204000 iterations 2024/01/05 17:29:59 - mmengine - INFO - Iter(train) [204500/640000] base_lr: 1.5371e-04 lr: 1.5371e-05 eta: 7 days, 17:01:35 time: 1.5232 data_time: 0.0264 memory: 25567 grad_norm: 2.8504 loss: 1.3560 caption_loss_cls: 2.2746 detection_loss_cls: 0.0358 detection_loss_reg: 0.3416 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.7837 instance_segmentation_loss_cls: 0.0352 instance_segmentation_loss_reg: 0.3440 instance_segmentation_loss_poly: 0.9370 2024/01/05 17:42:30 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 17:42:30 - mmengine - INFO - Iter(train) [205000/640000] base_lr: 1.5350e-04 lr: 1.5350e-05 eta: 7 days, 16:45:53 time: 1.5190 data_time: 0.0263 memory: 25567 grad_norm: 2.8709 loss: 1.3462 caption_loss_cls: 2.2770 detection_loss_cls: 0.0357 detection_loss_reg: 0.3411 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.7851 instance_segmentation_loss_cls: 0.0354 instance_segmentation_loss_reg: 0.3466 instance_segmentation_loss_poly: 0.9408 2024/01/05 17:55:11 - mmengine - INFO - Iter(train) [205500/640000] base_lr: 1.5329e-04 lr: 1.5329e-05 eta: 7 days, 16:32:23 time: 1.5276 data_time: 0.0264 memory: 25567 grad_norm: 2.8311 loss: 1.3311 caption_loss_cls: 2.2778 detection_loss_cls: 0.0355 detection_loss_reg: 0.3397 semantic_segmentation_loss_cls: 0.0094 grounding_loss_reg: 2.7840 instance_segmentation_loss_cls: 0.0355 instance_segmentation_loss_reg: 0.3479 instance_segmentation_loss_poly: 0.9434 2024/01/05 18:07:48 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 18:07:48 - mmengine - INFO - Iter(train) [206000/640000] base_lr: 1.5308e-04 lr: 1.5308e-05 eta: 7 days, 16:18:02 time: 1.5181 data_time: 0.0263 memory: 25567 grad_norm: 2.8213 loss: 1.3455 caption_loss_cls: 2.2788 detection_loss_cls: 0.0354 detection_loss_reg: 0.3410 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.7841 instance_segmentation_loss_cls: 0.0353 instance_segmentation_loss_reg: 0.3465 instance_segmentation_loss_poly: 0.9396 2024/01/05 18:07:48 - mmengine - INFO - Saving checkpoint at 206000 iterations 2024/01/05 18:21:05 - mmengine - INFO - Iter(train) [206500/640000] base_lr: 1.5288e-04 lr: 1.5288e-05 eta: 7 days, 16:11:52 time: 1.5237 data_time: 0.0263 memory: 25567 grad_norm: 2.8089 loss: 1.3547 caption_loss_cls: 2.2822 detection_loss_cls: 0.0355 detection_loss_reg: 0.3410 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.7875 instance_segmentation_loss_cls: 0.0355 instance_segmentation_loss_reg: 0.3481 instance_segmentation_loss_poly: 0.9441 2024/01/05 18:34:03 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 18:34:03 - mmengine - INFO - Iter(train) [207000/640000] base_lr: 1.5267e-04 lr: 1.5267e-05 eta: 7 days, 16:01:32 time: 1.5303 data_time: 0.0265 memory: 25567 grad_norm: 2.7801 loss: 1.3501 caption_loss_cls: 2.2859 detection_loss_cls: 0.0356 detection_loss_reg: 0.3422 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.7888 instance_segmentation_loss_cls: 0.0356 instance_segmentation_loss_reg: 0.3501 instance_segmentation_loss_poly: 0.9469 2024/01/05 18:46:41 - mmengine - INFO - Iter(train) [207500/640000] base_lr: 1.5246e-04 lr: 1.5246e-05 eta: 7 days, 15:47:21 time: 1.5288 data_time: 0.0266 memory: 25567 grad_norm: 2.7816 loss: 1.3699 caption_loss_cls: 2.2875 detection_loss_cls: 0.0358 detection_loss_reg: 0.3445 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.7900 instance_segmentation_loss_cls: 0.0357 instance_segmentation_loss_reg: 0.3508 instance_segmentation_loss_poly: 0.9491 2024/01/05 18:58:58 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 18:58:58 - mmengine - INFO - Iter(train) [208000/640000] base_lr: 1.5225e-04 lr: 1.5225e-05 eta: 7 days, 15:29:22 time: 1.5276 data_time: 0.0266 memory: 25567 grad_norm: 2.7735 loss: 1.3751 caption_loss_cls: 2.2954 detection_loss_cls: 0.0358 detection_loss_reg: 0.3450 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.7842 instance_segmentation_loss_cls: 0.0356 instance_segmentation_loss_reg: 0.3506 instance_segmentation_loss_poly: 0.9492 2024/01/05 18:58:58 - mmengine - INFO - Saving checkpoint at 208000 iterations 2024/01/05 19:12:21 - mmengine - INFO - Iter(train) [208500/640000] base_lr: 1.5204e-04 lr: 1.5204e-05 eta: 7 days, 15:23:39 time: 1.5349 data_time: 0.0264 memory: 25567 grad_norm: 2.7752 loss: 1.3571 caption_loss_cls: 2.2926 detection_loss_cls: 0.0357 detection_loss_reg: 0.3445 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.7836 instance_segmentation_loss_cls: 0.0353 instance_segmentation_loss_reg: 0.3486 instance_segmentation_loss_poly: 0.9453 2024/01/05 19:24:34 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 19:24:34 - mmengine - INFO - Iter(train) [209000/640000] base_lr: 1.5183e-04 lr: 1.5183e-05 eta: 7 days, 15:05:01 time: 1.5306 data_time: 0.0263 memory: 25567 grad_norm: 2.8020 loss: 1.3576 caption_loss_cls: 2.2897 detection_loss_cls: 0.0358 detection_loss_reg: 0.3439 semantic_segmentation_loss_cls: 0.0093 grounding_loss_reg: 2.7811 instance_segmentation_loss_cls: 0.0353 instance_segmentation_loss_reg: 0.3486 instance_segmentation_loss_poly: 0.9448 2024/01/05 19:37:21 - mmengine - INFO - Iter(train) [209500/640000] base_lr: 1.5162e-04 lr: 1.5162e-05 eta: 7 days, 14:52:42 time: 1.5321 data_time: 0.0263 memory: 25567 grad_norm: 2.7845 loss: 1.3527 caption_loss_cls: 2.2900 detection_loss_cls: 0.0358 detection_loss_reg: 0.3449 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.7808 instance_segmentation_loss_cls: 0.0349 instance_segmentation_loss_reg: 0.3467 instance_segmentation_loss_poly: 0.9408 2024/01/05 19:50:00 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 19:50:00 - mmengine - INFO - Iter(train) [210000/640000] base_lr: 1.5141e-04 lr: 1.5141e-05 eta: 7 days, 14:38:54 time: 1.5327 data_time: 0.0264 memory: 25567 grad_norm: 2.8293 loss: 1.3573 caption_loss_cls: 2.2935 detection_loss_cls: 0.0357 detection_loss_reg: 0.3450 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.7823 instance_segmentation_loss_cls: 0.0348 instance_segmentation_loss_reg: 0.3464 instance_segmentation_loss_poly: 0.9399 2024/01/05 19:50:00 - mmengine - INFO - Saving checkpoint at 210000 iterations 2024/01/05 20:02:53 - mmengine - INFO - Iter(train) [210500/640000] base_lr: 1.5120e-04 lr: 1.5120e-05 eta: 7 days, 14:27:34 time: 1.5264 data_time: 0.0263 memory: 25567 grad_norm: 2.8430 loss: 1.3355 caption_loss_cls: 2.2916 detection_loss_cls: 0.0358 detection_loss_reg: 0.3463 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.7787 instance_segmentation_loss_cls: 0.0347 instance_segmentation_loss_reg: 0.3461 instance_segmentation_loss_poly: 0.9403 2024/01/05 20:15:12 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 20:15:12 - mmengine - INFO - Iter(train) [211000/640000] base_lr: 1.5099e-04 lr: 1.5099e-05 eta: 7 days, 14:10:26 time: 1.5170 data_time: 0.0261 memory: 25567 grad_norm: 2.9239 loss: 1.3429 caption_loss_cls: 2.2930 detection_loss_cls: 0.0357 detection_loss_reg: 0.3461 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.7792 instance_segmentation_loss_cls: 0.0346 instance_segmentation_loss_reg: 0.3463 instance_segmentation_loss_poly: 0.9395 2024/01/05 20:27:40 - mmengine - INFO - Iter(train) [211500/640000] base_lr: 1.5078e-04 lr: 1.5078e-05 eta: 7 days, 13:54:44 time: 1.5144 data_time: 0.0260 memory: 25567 grad_norm: 2.9930 loss: 1.3351 caption_loss_cls: 2.2938 detection_loss_cls: 0.0359 detection_loss_reg: 0.3479 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.7833 instance_segmentation_loss_cls: 0.0344 instance_segmentation_loss_reg: 0.3442 instance_segmentation_loss_poly: 0.9346 2024/01/05 20:40:02 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 20:40:02 - mmengine - INFO - Iter(train) [212000/640000] base_lr: 1.5057e-04 lr: 1.5057e-05 eta: 7 days, 13:38:18 time: 1.5156 data_time: 0.0261 memory: 25567 grad_norm: 3.0220 loss: 1.3391 caption_loss_cls: 2.2947 detection_loss_cls: 0.0357 detection_loss_reg: 0.3464 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.7764 instance_segmentation_loss_cls: 0.0343 instance_segmentation_loss_reg: 0.3440 instance_segmentation_loss_poly: 0.9355 2024/01/05 20:40:02 - mmengine - INFO - Saving checkpoint at 212000 iterations 2024/01/05 20:53:12 - mmengine - INFO - Iter(train) [212500/640000] base_lr: 1.5035e-04 lr: 1.5035e-05 eta: 7 days, 13:29:53 time: 1.5125 data_time: 0.0262 memory: 25567 grad_norm: 3.0052 loss: 1.3447 caption_loss_cls: 2.2922 detection_loss_cls: 0.0358 detection_loss_reg: 0.3483 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.7711 instance_segmentation_loss_cls: 0.0342 instance_segmentation_loss_reg: 0.3439 instance_segmentation_loss_poly: 0.9341 2024/01/05 21:06:08 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 21:06:08 - mmengine - INFO - Iter(train) [213000/640000] base_lr: 1.5014e-04 lr: 1.5014e-05 eta: 7 days, 13:19:00 time: 1.5231 data_time: 0.0265 memory: 25567 grad_norm: 3.0671 loss: 1.3567 caption_loss_cls: 2.2942 detection_loss_cls: 0.0358 detection_loss_reg: 0.3499 semantic_segmentation_loss_cls: 0.0092 grounding_loss_reg: 2.7694 instance_segmentation_loss_cls: 0.0343 instance_segmentation_loss_reg: 0.3448 instance_segmentation_loss_poly: 0.9348 2024/01/05 21:18:56 - mmengine - INFO - Iter(train) [213500/640000] base_lr: 1.4993e-04 lr: 1.4993e-05 eta: 7 days, 13:06:46 time: 1.5232 data_time: 0.0266 memory: 25567 grad_norm: 3.1541 loss: 1.3596 caption_loss_cls: 2.2968 detection_loss_cls: 0.0357 detection_loss_reg: 0.3491 semantic_segmentation_loss_cls: 0.0091 grounding_loss_reg: 2.7693 instance_segmentation_loss_cls: 0.0343 instance_segmentation_loss_reg: 0.3455 instance_segmentation_loss_poly: 0.9349 2024/01/05 21:31:48 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 21:31:48 - mmengine - INFO - Iter(train) [214000/640000] base_lr: 1.4972e-04 lr: 1.4972e-05 eta: 7 days, 12:55:15 time: 1.5265 data_time: 0.0266 memory: 25567 grad_norm: 3.1572 loss: 1.3563 caption_loss_cls: 2.2980 detection_loss_cls: 0.0357 detection_loss_reg: 0.3495 semantic_segmentation_loss_cls: 0.0091 grounding_loss_reg: 2.7734 instance_segmentation_loss_cls: 0.0343 instance_segmentation_loss_reg: 0.3467 instance_segmentation_loss_poly: 0.9379 2024/01/05 21:31:48 - mmengine - INFO - Saving checkpoint at 214000 iterations 2024/01/05 21:45:05 - mmengine - INFO - Iter(train) [214500/640000] base_lr: 1.4950e-04 lr: 1.4950e-05 eta: 7 days, 12:47:46 time: 1.5327 data_time: 0.0266 memory: 25567 grad_norm: 3.1075 loss: 1.3505 caption_loss_cls: 2.2942 detection_loss_cls: 0.0356 detection_loss_reg: 0.3495 semantic_segmentation_loss_cls: 0.0090 grounding_loss_reg: 2.7765 instance_segmentation_loss_cls: 0.0342 instance_segmentation_loss_reg: 0.3467 instance_segmentation_loss_poly: 0.9378 2024/01/05 21:57:37 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 21:57:37 - mmengine - INFO - Iter(train) [215000/640000] base_lr: 1.4929e-04 lr: 1.4929e-05 eta: 7 days, 12:32:55 time: 1.5357 data_time: 0.0265 memory: 25567 grad_norm: 3.0749 loss: 1.3457 caption_loss_cls: 2.2848 detection_loss_cls: 0.0357 detection_loss_reg: 0.3509 semantic_segmentation_loss_cls: 0.0090 grounding_loss_reg: 2.7766 instance_segmentation_loss_cls: 0.0341 instance_segmentation_loss_reg: 0.3454 instance_segmentation_loss_poly: 0.9349 2024/01/05 22:10:38 - mmengine - INFO - Iter(train) [215500/640000] base_lr: 1.4908e-04 lr: 1.4908e-05 eta: 7 days, 12:22:42 time: 1.5442 data_time: 0.0267 memory: 25567 grad_norm: 2.9936 loss: 1.3419 caption_loss_cls: 2.2811 detection_loss_cls: 0.0357 detection_loss_reg: 0.3508 semantic_segmentation_loss_cls: 0.0090 grounding_loss_reg: 2.7777 instance_segmentation_loss_cls: 0.0340 instance_segmentation_loss_reg: 0.3460 instance_segmentation_loss_poly: 0.9358 2024/01/05 22:23:14 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 22:23:14 - mmengine - INFO - Iter(train) [216000/640000] base_lr: 1.4886e-04 lr: 1.4886e-05 eta: 7 days, 12:08:31 time: 1.5475 data_time: 0.0267 memory: 25567 grad_norm: 2.9568 loss: 1.3345 caption_loss_cls: 2.2772 detection_loss_cls: 0.0355 detection_loss_reg: 0.3493 semantic_segmentation_loss_cls: 0.0090 grounding_loss_reg: 2.7763 instance_segmentation_loss_cls: 0.0341 instance_segmentation_loss_reg: 0.3476 instance_segmentation_loss_poly: 0.9399 2024/01/05 22:23:14 - mmengine - INFO - Saving checkpoint at 216000 iterations 2024/01/05 22:36:35 - mmengine - INFO - Iter(train) [216500/640000] base_lr: 1.4865e-04 lr: 1.4865e-05 eta: 7 days, 12:01:15 time: 1.5503 data_time: 0.0267 memory: 25567 grad_norm: 2.9590 loss: 1.3342 caption_loss_cls: 2.2776 detection_loss_cls: 0.0354 detection_loss_reg: 0.3491 semantic_segmentation_loss_cls: 0.0090 grounding_loss_reg: 2.7811 instance_segmentation_loss_cls: 0.0340 instance_segmentation_loss_reg: 0.3477 instance_segmentation_loss_poly: 0.9394 2024/01/05 22:49:17 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 22:49:17 - mmengine - INFO - Iter(train) [217000/640000] base_lr: 1.4843e-04 lr: 1.4843e-05 eta: 7 days, 11:48:02 time: 1.5469 data_time: 0.0265 memory: 25567 grad_norm: 2.8645 loss: 1.3156 caption_loss_cls: 2.2781 detection_loss_cls: 0.0356 detection_loss_reg: 0.3494 semantic_segmentation_loss_cls: 0.0089 grounding_loss_reg: 2.7744 instance_segmentation_loss_cls: 0.0341 instance_segmentation_loss_reg: 0.3491 instance_segmentation_loss_poly: 0.9423 2024/01/05 23:01:55 - mmengine - INFO - Iter(train) [217500/640000] base_lr: 1.4822e-04 lr: 1.4822e-05 eta: 7 days, 11:34:07 time: 1.5444 data_time: 0.0265 memory: 25567 grad_norm: 2.8249 loss: 1.3303 caption_loss_cls: 2.2789 detection_loss_cls: 0.0355 detection_loss_reg: 0.3481 semantic_segmentation_loss_cls: 0.0089 grounding_loss_reg: 2.7670 instance_segmentation_loss_cls: 0.0343 instance_segmentation_loss_reg: 0.3497 instance_segmentation_loss_poly: 0.9436 2024/01/05 23:15:06 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 23:15:06 - mmengine - INFO - Iter(train) [218000/640000] base_lr: 1.4800e-04 lr: 1.4800e-05 eta: 7 days, 11:25:11 time: 1.5492 data_time: 0.0267 memory: 25567 grad_norm: 2.8066 loss: 1.3350 caption_loss_cls: 2.2794 detection_loss_cls: 0.0355 detection_loss_reg: 0.3484 semantic_segmentation_loss_cls: 0.0089 grounding_loss_reg: 2.7701 instance_segmentation_loss_cls: 0.0344 instance_segmentation_loss_reg: 0.3508 instance_segmentation_loss_poly: 0.9461 2024/01/05 23:15:06 - mmengine - INFO - Saving checkpoint at 218000 iterations 2024/01/05 23:28:21 - mmengine - INFO - Iter(train) [218500/640000] base_lr: 1.4779e-04 lr: 1.4779e-05 eta: 7 days, 11:16:37 time: 1.5483 data_time: 0.0267 memory: 25567 grad_norm: 2.8251 loss: 1.3473 caption_loss_cls: 2.2761 detection_loss_cls: 0.0355 detection_loss_reg: 0.3475 semantic_segmentation_loss_cls: 0.0089 grounding_loss_reg: 2.7658 instance_segmentation_loss_cls: 0.0342 instance_segmentation_loss_reg: 0.3488 instance_segmentation_loss_poly: 0.9435 2024/01/05 23:41:22 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/05 23:41:22 - mmengine - INFO - Iter(train) [219000/640000] base_lr: 1.4757e-04 lr: 1.4757e-05 eta: 7 days, 11:06:05 time: 1.5558 data_time: 0.0269 memory: 25567 grad_norm: 2.7934 loss: 1.3412 caption_loss_cls: 2.2708 detection_loss_cls: 0.0352 detection_loss_reg: 0.3461 semantic_segmentation_loss_cls: 0.0089 grounding_loss_reg: 2.7675 instance_segmentation_loss_cls: 0.0344 instance_segmentation_loss_reg: 0.3496 instance_segmentation_loss_poly: 0.9452 2024/01/05 23:54:15 - mmengine - INFO - Iter(train) [219500/640000] base_lr: 1.4736e-04 lr: 1.4736e-05 eta: 7 days, 10:54:17 time: 1.5537 data_time: 0.0267 memory: 25567 grad_norm: 2.7869 loss: 1.3322 caption_loss_cls: 2.2702 detection_loss_cls: 0.0352 detection_loss_reg: 0.3459 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7698 instance_segmentation_loss_cls: 0.0343 instance_segmentation_loss_reg: 0.3494 instance_segmentation_loss_poly: 0.9451 2024/01/06 00:07:13 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/06 00:07:13 - mmengine - INFO - Iter(train) [220000/640000] base_lr: 1.4714e-04 lr: 1.4714e-05 eta: 7 days, 10:43:11 time: 1.5592 data_time: 0.0268 memory: 25567 grad_norm: 2.7834 loss: 1.3286 caption_loss_cls: 2.2690 detection_loss_cls: 0.0349 detection_loss_reg: 0.3448 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7678 instance_segmentation_loss_cls: 0.0344 instance_segmentation_loss_reg: 0.3491 instance_segmentation_loss_poly: 0.9448 2024/01/06 00:07:13 - mmengine - INFO - Saving checkpoint at 220000 iterations 2024/01/06 00:19:15 - mmengine - INFO - Evaluating bbox... 2024/01/06 00:20:12 - mmengine - INFO - bbox_mAP_copypaste: 0.485 0.668 0.535 0.326 0.543 0.626 2024/01/06 00:20:12 - mmengine - INFO - Evaluating segm... 2024/01/06 00:21:24 - mmengine - INFO - segm_mAP_copypaste: 0.319 0.579 0.311 0.175 0.369 0.488 2024/01/06 00:23:34 - mmengine - INFO - Evaluating bbox... 2024/01/06 00:24:32 - mmengine - INFO - bbox_mAP_copypaste: 0.485 0.668 0.534 0.325 0.543 0.624 2024/01/06 00:30:13 - mmengine - INFO - per class results: 2024/01/06 00:30:13 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 78.71 | 88.46 | | building | 81.67 | 89.2 | | sky | 93.46 | 97.14 | | floor | 82.43 | 88.17 | | tree | 74.73 | 89.15 | | ceiling | 84.76 | 92.45 | | road | 76.98 | 94.85 | | bed | 88.52 | 93.42 | | windowpane | 61.94 | 76.3 | | grass | 67.83 | 85.02 | | cabinet | 60.42 | 71.13 | | sidewalk | 54.61 | 62.2 | | person | 80.86 | 92.07 | | earth | 37.45 | 46.23 | | door | 53.27 | 75.55 | | table | 64.06 | 79.01 | | mountain | 61.93 | 77.6 | | plant | 55.61 | 71.09 | | curtain | 76.01 | 87.99 | | chair | 60.89 | 74.98 | | car | 85.27 | 91.79 | | water | 55.02 | 64.43 | | painting | 74.06 | 87.8 | | sofa | 69.32 | 78.55 | | shelf | 42.07 | 59.08 | | house | 45.01 | 70.11 | | sea | 54.68 | 76.41 | | mirror | 67.56 | 72.73 | | rug | 68.93 | 82.32 | | field | 28.07 | 43.68 | | armchair | 48.72 | 74.17 | | seat | 67.09 | 78.22 | | fence | 44.96 | 63.02 | | desk | 46.14 | 74.0 | | rock | 46.4 | 71.54 | | wardrobe | 51.4 | 76.74 | | lamp | 63.65 | 77.49 | | bathtub | 70.79 | 86.63 | | railing | 38.88 | 55.87 | | cushion | 57.76 | 78.46 | | base | 26.86 | 42.43 | | box | 27.76 | 34.75 | | column | 50.02 | 61.14 | | signboard | 36.24 | 61.3 | | chest of drawers | 35.14 | 43.06 | | counter | 25.17 | 39.94 | | sand | 50.01 | 64.34 | | sink | 70.78 | 77.55 | | skyscraper | 41.49 | 49.03 | | fireplace | 67.93 | 91.23 | | refrigerator | 75.78 | 86.9 | | grandstand | 47.35 | 78.09 | | path | 16.92 | 22.15 | | stairs | 41.75 | 55.18 | | runway | 63.36 | 64.69 | | case | 50.97 | 58.42 | | pool table | 91.45 | 95.52 | | pillow | 56.79 | 67.88 | | screen door | 64.8 | 73.86 | | stairway | 35.36 | 40.94 | | river | 17.86 | 37.69 | | bridge | 65.52 | 80.04 | | bookcase | 32.14 | 47.35 | | blind | 39.9 | 45.89 | | coffee table | 62.56 | 85.72 | | toilet | 86.19 | 94.06 | | flower | 38.64 | 51.47 | | book | 45.16 | 78.02 | | hill | 11.47 | 24.38 | | bench | 55.24 | 74.41 | | countertop | 58.69 | 73.55 | | stove | 78.53 | 84.32 | | palm | 49.62 | 76.52 | | kitchen island | 43.66 | 60.53 | | computer | 70.77 | 90.0 | | swivel chair | 40.29 | 48.05 | | boat | 79.15 | 87.06 | | bar | 37.31 | 51.02 | | arcade machine | 78.32 | 87.34 | | hovel | 36.5 | 75.53 | | bus | 91.83 | 95.19 | | towel | 63.56 | 74.51 | | light | 53.75 | 67.94 | | truck | 40.2 | 59.47 | | tower | 25.06 | 40.93 | | chandelier | 65.37 | 79.33 | | awning | 27.0 | 42.51 | | streetlight | 27.74 | 34.92 | | booth | 47.93 | 75.81 | | television receiver | 60.87 | 79.86 | | airplane | 58.39 | 64.85 | | dirt track | 0.0 | 0.0 | | apparel | 32.94 | 56.32 | | pole | 26.84 | 42.98 | | land | 2.37 | 3.79 | | bannister | 13.43 | 16.83 | | escalator | 20.23 | 22.58 | | ottoman | 54.66 | 68.64 | | bottle | 24.72 | 37.09 | | buffet | 50.27 | 56.34 | | poster | 35.53 | 66.38 | | stage | 14.52 | 22.71 | | van | 48.72 | 63.28 | | ship | 9.04 | 9.49 | | fountain | 28.4 | 29.93 | | conveyer belt | 55.1 | 92.0 | | canopy | 34.02 | 43.99 | | washer | 56.25 | 75.9 | | plaything | 34.79 | 46.12 | | swimming pool | 66.51 | 88.74 | | stool | 45.92 | 62.66 | | barrel | 56.69 | 64.5 | | basket | 38.57 | 49.96 | | waterfall | 59.69 | 84.33 | | tent | 74.67 | 96.55 | | bag | 18.09 | 23.91 | | minibike | 70.53 | 85.54 | | cradle | 83.02 | 95.95 | | oven | 50.48 | 63.66 | | ball | 50.38 | 71.2 | | food | 59.31 | 79.1 | | step | 13.21 | 14.87 | | tank | 50.06 | 57.06 | | trade name | 5.89 | 6.15 | | microwave | 83.19 | 93.83 | | pot | 47.71 | 61.55 | | animal | 62.75 | 65.88 | | bicycle | 57.2 | 78.38 | | lake | 38.31 | 44.65 | | dishwasher | 69.88 | 80.43 | | screen | 54.77 | 69.14 | | blanket | 26.27 | 32.91 | | sculpture | 59.16 | 79.36 | | hood | 58.11 | 71.28 | | sconce | 39.63 | 52.35 | | vase | 40.97 | 62.17 | | traffic light | 35.25 | 68.87 | | tray | 11.67 | 19.51 | | ashcan | 40.35 | 65.14 | | fan | 59.86 | 70.89 | | pier | 28.39 | 33.0 | | crt screen | 8.36 | 26.1 | | plate | 55.28 | 75.61 | | monitor | 18.05 | 20.74 | | bulletin board | 50.01 | 61.13 | | shower | 2.08 | 3.54 | | radiator | 57.9 | 70.08 | | glass | 18.29 | 20.07 | | clock | 32.84 | 39.47 | | flag | 37.97 | 49.12 | +---------------------+-------+-------+ 2024/01/06 00:30:25 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.4850 coco/bbox_mAP_50: 0.6680 coco/bbox_mAP_75: 0.5340 coco/bbox_mAP_s: 0.3250 coco/bbox_mAP_m: 0.5430 coco/bbox_mAP_l: 0.6240 coco/segm_mAP: 0.3190 coco/segm_mAP_50: 0.5790 coco/segm_mAP_75: 0.3110 coco/segm_mAP_s: 0.1750 coco/segm_mAP_m: 0.3690 coco/segm_mAP_l: 0.4880 Bleu_1: 0.7403 Bleu_2: 0.5705 Bleu_3: 0.4249 Bleu_4: 0.3147 METEOR: 0.2631 ROUGE_L: 0.5435 CIDEr: 1.0195 SPICE: 0.1943 aAcc: 83.0800 mIoU: 49.7700 mAcc: 63.2400 visual-grounding/miou: 0.7824 visual-grounding/acc: 0.8543 data_time: 0.0129 time: 1.9234 2024/01/06 00:43:13 - mmengine - INFO - Iter(train) [220500/640000] base_lr: 1.4692e-04 lr: 1.4692e-05 eta: 7 days, 10:31:01 time: 1.5516 data_time: 0.0203 memory: 34658 grad_norm: 2.8031 loss: 1.3375 caption_loss_cls: 2.2706 detection_loss_cls: 0.0349 detection_loss_reg: 0.3448 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7677 instance_segmentation_loss_cls: 0.0342 instance_segmentation_loss_reg: 0.3465 instance_segmentation_loss_poly: 0.9397 2024/01/06 00:55:29 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/06 00:55:29 - mmengine - INFO - Iter(train) [221000/640000] base_lr: 1.4671e-04 lr: 1.4671e-05 eta: 7 days, 10:14:06 time: 1.5450 data_time: 0.0202 memory: 25567 grad_norm: 2.9162 loss: 1.3500 caption_loss_cls: 2.2739 detection_loss_cls: 0.0348 detection_loss_reg: 0.3428 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7663 instance_segmentation_loss_cls: 0.0341 instance_segmentation_loss_reg: 0.3462 instance_segmentation_loss_poly: 0.9376 2024/01/06 01:08:07 - mmengine - INFO - Iter(train) [221500/640000] base_lr: 1.4649e-04 lr: 1.4649e-05 eta: 7 days, 10:00:11 time: 1.5449 data_time: 0.0200 memory: 25567 grad_norm: 2.9384 loss: 1.3425 caption_loss_cls: 2.2744 detection_loss_cls: 0.0349 detection_loss_reg: 0.3438 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7686 instance_segmentation_loss_cls: 0.0340 instance_segmentation_loss_reg: 0.3449 instance_segmentation_loss_poly: 0.9345 2024/01/06 01:20:55 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/06 01:20:55 - mmengine - INFO - Iter(train) [222000/640000] base_lr: 1.4627e-04 lr: 1.4627e-05 eta: 7 days, 9:47:46 time: 1.5392 data_time: 0.0200 memory: 25567 grad_norm: 2.9758 loss: 1.3426 caption_loss_cls: 2.2733 detection_loss_cls: 0.0348 detection_loss_reg: 0.3440 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7684 instance_segmentation_loss_cls: 0.0342 instance_segmentation_loss_reg: 0.3465 instance_segmentation_loss_poly: 0.9399 2024/01/06 01:20:55 - mmengine - INFO - Saving checkpoint at 222000 iterations 2024/01/06 01:33:45 - mmengine - INFO - Iter(train) [222500/640000] base_lr: 1.4605e-04 lr: 1.4605e-05 eta: 7 days, 9:35:32 time: 1.5331 data_time: 0.0200 memory: 25567 grad_norm: 2.9746 loss: 1.3383 caption_loss_cls: 2.2735 detection_loss_cls: 0.0349 detection_loss_reg: 0.3450 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7621 instance_segmentation_loss_cls: 0.0341 instance_segmentation_loss_reg: 0.3466 instance_segmentation_loss_poly: 0.9408 2024/01/06 01:46:41 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240105_015845 2024/01/06 01:46:41 - mmengine - INFO - Iter(train) [223000/640000] base_lr: 1.4584e-04 lr: 1.4584e-05 eta: 7 days, 9:24:03 time: 1.5316 data_time: 0.0200 memory: 25567 grad_norm: 3.0055 loss: 1.3411 caption_loss_cls: 2.2757 detection_loss_cls: 0.0351 detection_loss_reg: 0.3468 semantic_segmentation_loss_cls: 0.0089 grounding_loss_reg: 2.7644 instance_segmentation_loss_cls: 0.0340 instance_segmentation_loss_reg: 0.3455 instance_segmentation_loss_poly: 0.9392 2024/01/06 03:08:41 - mmengine - INFO - Iter(train) [223500/640000] base_lr: 1.4562e-04 lr: 1.4562e-05 eta: 7 days, 3:42:13 time: 1.5088 data_time: 0.0120 memory: 25573 grad_norm: 3.0921 loss: 1.3646 caption_loss_cls: 2.2804 detection_loss_cls: 0.0350 detection_loss_reg: 0.3454 semantic_segmentation_loss_cls: 0.0089 grounding_loss_reg: 2.7648 instance_segmentation_loss_cls: 0.0341 instance_segmentation_loss_reg: 0.3472 instance_segmentation_loss_poly: 0.9420 2024/01/06 03:21:35 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 03:21:35 - mmengine - INFO - Iter(train) [224000/640000] base_lr: 1.4540e-04 lr: 1.4540e-05 eta: 7 days, 5:19:47 time: 1.5078 data_time: 0.0118 memory: 25573 grad_norm: 3.1604 loss: 1.3699 caption_loss_cls: 2.2788 detection_loss_cls: 0.0351 detection_loss_reg: 0.3458 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7617 instance_segmentation_loss_cls: 0.0340 instance_segmentation_loss_reg: 0.3475 instance_segmentation_loss_poly: 0.9419 2024/01/06 03:21:35 - mmengine - INFO - Saving checkpoint at 224000 iterations 2024/01/06 03:34:42 - mmengine - INFO - Iter(train) [224500/640000] base_lr: 1.4518e-04 lr: 1.4518e-05 eta: 7 days, 6:49:55 time: 1.5120 data_time: 0.0174 memory: 25573 grad_norm: 3.2338 loss: 1.3663 caption_loss_cls: 2.2742 detection_loss_cls: 0.0353 detection_loss_reg: 0.3464 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7586 instance_segmentation_loss_cls: 0.0341 instance_segmentation_loss_reg: 0.3472 instance_segmentation_loss_poly: 0.9417 2024/01/06 03:47:37 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 03:47:37 - mmengine - INFO - Iter(train) [225000/640000] base_lr: 1.4496e-04 lr: 1.4496e-05 eta: 7 days, 7:17:19 time: 1.5218 data_time: 0.0172 memory: 25573 grad_norm: 3.1379 loss: 1.3585 caption_loss_cls: 2.2765 detection_loss_cls: 0.0351 detection_loss_reg: 0.3439 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7560 instance_segmentation_loss_cls: 0.0343 instance_segmentation_loss_reg: 0.3485 instance_segmentation_loss_poly: 0.9427 2024/01/06 04:00:26 - mmengine - INFO - Iter(train) [225500/640000] base_lr: 1.4474e-04 lr: 1.4474e-05 eta: 7 days, 7:22:11 time: 1.5248 data_time: 0.0170 memory: 25573 grad_norm: 3.1011 loss: 1.3517 caption_loss_cls: 2.2755 detection_loss_cls: 0.0352 detection_loss_reg: 0.3442 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7531 instance_segmentation_loss_cls: 0.0343 instance_segmentation_loss_reg: 0.3483 instance_segmentation_loss_poly: 0.9413 2024/01/06 04:13:28 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 04:13:28 - mmengine - INFO - Iter(train) [226000/640000] base_lr: 1.4452e-04 lr: 1.4452e-05 eta: 7 days, 7:43:26 time: 1.5280 data_time: 0.0167 memory: 25573 grad_norm: 3.0763 loss: 1.3251 caption_loss_cls: 2.2700 detection_loss_cls: 0.0351 detection_loss_reg: 0.3433 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7498 instance_segmentation_loss_cls: 0.0343 instance_segmentation_loss_reg: 0.3490 instance_segmentation_loss_poly: 0.9427 2024/01/06 04:13:28 - mmengine - INFO - Saving checkpoint at 226000 iterations 2024/01/06 04:26:22 - mmengine - INFO - Iter(train) [226500/640000] base_lr: 1.4430e-04 lr: 1.4430e-05 eta: 7 days, 7:46:58 time: 1.5350 data_time: 0.0228 memory: 25573 grad_norm: 3.1108 loss: 1.3260 caption_loss_cls: 2.2646 detection_loss_cls: 0.0352 detection_loss_reg: 0.3428 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7466 instance_segmentation_loss_cls: 0.0342 instance_segmentation_loss_reg: 0.3471 instance_segmentation_loss_poly: 0.9381 2024/01/06 04:39:29 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 04:39:29 - mmengine - INFO - Iter(train) [227000/640000] base_lr: 1.4408e-04 lr: 1.4408e-05 eta: 7 days, 8:02:49 time: 1.5487 data_time: 0.0228 memory: 25573 grad_norm: 3.0840 loss: 1.3151 caption_loss_cls: 2.2651 detection_loss_cls: 0.0353 detection_loss_reg: 0.3440 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7472 instance_segmentation_loss_cls: 0.0341 instance_segmentation_loss_reg: 0.3449 instance_segmentation_loss_poly: 0.9346 2024/01/06 04:53:18 - mmengine - INFO - Iter(train) [227500/640000] base_lr: 1.4386e-04 lr: 1.4386e-05 eta: 7 days, 9:07:45 time: 1.5690 data_time: 0.0230 memory: 25573 grad_norm: 3.0282 loss: 1.3044 caption_loss_cls: 2.2628 detection_loss_cls: 0.0351 detection_loss_reg: 0.3427 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7420 instance_segmentation_loss_cls: 0.0338 instance_segmentation_loss_reg: 0.3427 instance_segmentation_loss_poly: 0.9290 2024/01/06 05:05:57 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 05:05:57 - mmengine - INFO - Iter(train) [228000/640000] base_lr: 1.4364e-04 lr: 1.4364e-05 eta: 7 days, 8:38:01 time: 1.5651 data_time: 0.0228 memory: 25573 grad_norm: 3.0123 loss: 1.2971 caption_loss_cls: 2.2485 detection_loss_cls: 0.0348 detection_loss_reg: 0.3401 semantic_segmentation_loss_cls: 0.0087 grounding_loss_reg: 2.7451 instance_segmentation_loss_cls: 0.0339 instance_segmentation_loss_reg: 0.3435 instance_segmentation_loss_poly: 0.9311 2024/01/06 05:05:57 - mmengine - INFO - Saving checkpoint at 228000 iterations 2024/01/06 05:19:11 - mmengine - INFO - Iter(train) [228500/640000] base_lr: 1.4342e-04 lr: 1.4342e-05 eta: 7 days, 8:48:45 time: 1.5669 data_time: 0.0225 memory: 25573 grad_norm: 2.9554 loss: 1.2903 caption_loss_cls: 2.2476 detection_loss_cls: 0.0347 detection_loss_reg: 0.3391 semantic_segmentation_loss_cls: 0.0087 grounding_loss_reg: 2.7405 instance_segmentation_loss_cls: 0.0339 instance_segmentation_loss_reg: 0.3444 instance_segmentation_loss_poly: 0.9323 2024/01/06 05:32:16 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 05:32:16 - mmengine - INFO - Iter(train) [229000/640000] base_lr: 1.4320e-04 lr: 1.4320e-05 eta: 7 days, 8:47:15 time: 1.5694 data_time: 0.0224 memory: 25573 grad_norm: 2.9328 loss: 1.2822 caption_loss_cls: 2.2483 detection_loss_cls: 0.0348 detection_loss_reg: 0.3398 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7364 instance_segmentation_loss_cls: 0.0340 instance_segmentation_loss_reg: 0.3449 instance_segmentation_loss_poly: 0.9326 2024/01/06 05:44:41 - mmengine - INFO - Iter(train) [229500/640000] base_lr: 1.4298e-04 lr: 1.4298e-05 eta: 7 days, 8:06:53 time: 1.5632 data_time: 0.0224 memory: 25573 grad_norm: 2.9530 loss: 1.2953 caption_loss_cls: 2.2469 detection_loss_cls: 0.0347 detection_loss_reg: 0.3392 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7338 instance_segmentation_loss_cls: 0.0342 instance_segmentation_loss_reg: 0.3465 instance_segmentation_loss_poly: 0.9374 2024/01/06 05:57:42 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 05:57:42 - mmengine - INFO - Iter(train) [230000/640000] base_lr: 1.4276e-04 lr: 1.4276e-05 eta: 7 days, 8:01:58 time: 1.5633 data_time: 0.0224 memory: 25573 grad_norm: 2.9298 loss: 1.3060 caption_loss_cls: 2.2385 detection_loss_cls: 0.0346 detection_loss_reg: 0.3378 semantic_segmentation_loss_cls: 0.0087 grounding_loss_reg: 2.7314 instance_segmentation_loss_cls: 0.0342 instance_segmentation_loss_reg: 0.3470 instance_segmentation_loss_poly: 0.9388 2024/01/06 05:57:42 - mmengine - INFO - Saving checkpoint at 230000 iterations 2024/01/06 06:10:57 - mmengine - INFO - Iter(train) [230500/640000] base_lr: 1.4253e-04 lr: 1.4253e-05 eta: 7 days, 8:06:39 time: 1.5683 data_time: 0.0224 memory: 25573 grad_norm: 2.9208 loss: 1.3125 caption_loss_cls: 2.2387 detection_loss_cls: 0.0347 detection_loss_reg: 0.3374 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7335 instance_segmentation_loss_cls: 0.0342 instance_segmentation_loss_reg: 0.3479 instance_segmentation_loss_poly: 0.9401 2024/01/06 06:23:50 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 06:23:50 - mmengine - INFO - Iter(train) [231000/640000] base_lr: 1.4231e-04 lr: 1.4231e-05 eta: 7 days, 7:52:51 time: 1.5650 data_time: 0.0224 memory: 25573 grad_norm: 2.9280 loss: 1.3099 caption_loss_cls: 2.2356 detection_loss_cls: 0.0346 detection_loss_reg: 0.3368 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7303 instance_segmentation_loss_cls: 0.0341 instance_segmentation_loss_reg: 0.3466 instance_segmentation_loss_poly: 0.9369 2024/01/06 06:36:46 - mmengine - INFO - Iter(train) [231500/640000] base_lr: 1.4209e-04 lr: 1.4209e-05 eta: 7 days, 7:40:39 time: 1.5514 data_time: 0.0223 memory: 25573 grad_norm: 2.9401 loss: 1.3139 caption_loss_cls: 2.2300 detection_loss_cls: 0.0346 detection_loss_reg: 0.3376 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7255 instance_segmentation_loss_cls: 0.0343 instance_segmentation_loss_reg: 0.3481 instance_segmentation_loss_poly: 0.9404 2024/01/06 06:49:43 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 06:49:43 - mmengine - INFO - Iter(train) [232000/640000] base_lr: 1.4187e-04 lr: 1.4187e-05 eta: 7 days, 7:29:51 time: 1.5562 data_time: 0.0223 memory: 25573 grad_norm: 2.9351 loss: 1.3183 caption_loss_cls: 2.2291 detection_loss_cls: 0.0346 detection_loss_reg: 0.3375 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7225 instance_segmentation_loss_cls: 0.0343 instance_segmentation_loss_reg: 0.3469 instance_segmentation_loss_poly: 0.9373 2024/01/06 06:49:43 - mmengine - INFO - Saving checkpoint at 232000 iterations 2024/01/06 07:03:02 - mmengine - INFO - Iter(train) [232500/640000] base_lr: 1.4164e-04 lr: 1.4164e-05 eta: 7 days, 7:33:04 time: 1.5574 data_time: 0.0223 memory: 25573 grad_norm: 2.8693 loss: 1.3098 caption_loss_cls: 2.2277 detection_loss_cls: 0.0348 detection_loss_reg: 0.3390 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7220 instance_segmentation_loss_cls: 0.0341 instance_segmentation_loss_reg: 0.3448 instance_segmentation_loss_poly: 0.9327 2024/01/06 07:15:43 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 07:15:43 - mmengine - INFO - Iter(train) [233000/640000] base_lr: 1.4142e-04 lr: 1.4142e-05 eta: 7 days, 7:11:09 time: 1.5514 data_time: 0.0222 memory: 25573 grad_norm: 2.8356 loss: 1.3081 caption_loss_cls: 2.2250 detection_loss_cls: 0.0347 detection_loss_reg: 0.3371 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7222 instance_segmentation_loss_cls: 0.0338 instance_segmentation_loss_reg: 0.3426 instance_segmentation_loss_poly: 0.9284 2024/01/06 07:28:11 - mmengine - INFO - Iter(train) [233500/640000] base_lr: 1.4120e-04 lr: 1.4120e-05 eta: 7 days, 6:42:20 time: 1.5523 data_time: 0.0222 memory: 25573 grad_norm: 2.8500 loss: 1.3070 caption_loss_cls: 2.2239 detection_loss_cls: 0.0347 detection_loss_reg: 0.3379 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7249 instance_segmentation_loss_cls: 0.0337 instance_segmentation_loss_reg: 0.3419 instance_segmentation_loss_poly: 0.9263 2024/01/06 07:41:15 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 07:41:15 - mmengine - INFO - Iter(train) [234000/640000] base_lr: 1.4097e-04 lr: 1.4097e-05 eta: 7 days, 6:35:08 time: 1.5528 data_time: 0.0222 memory: 25573 grad_norm: 2.8333 loss: 1.3137 caption_loss_cls: 2.2199 detection_loss_cls: 0.0349 detection_loss_reg: 0.3393 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7196 instance_segmentation_loss_cls: 0.0337 instance_segmentation_loss_reg: 0.3426 instance_segmentation_loss_poly: 0.9257 2024/01/06 07:41:15 - mmengine - INFO - Saving checkpoint at 234000 iterations 2024/01/06 07:53:59 - mmengine - INFO - Iter(train) [234500/640000] base_lr: 1.4075e-04 lr: 1.4075e-05 eta: 7 days, 6:16:38 time: 1.5451 data_time: 0.0221 memory: 25573 grad_norm: 2.8244 loss: 1.3171 caption_loss_cls: 2.2278 detection_loss_cls: 0.0350 detection_loss_reg: 0.3401 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7156 instance_segmentation_loss_cls: 0.0337 instance_segmentation_loss_reg: 0.3432 instance_segmentation_loss_poly: 0.9268 2024/01/06 08:07:00 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 08:07:00 - mmengine - INFO - Iter(train) [235000/640000] base_lr: 1.4052e-04 lr: 1.4052e-05 eta: 7 days, 6:07:10 time: 1.5469 data_time: 0.0221 memory: 25573 grad_norm: 2.7889 loss: 1.2991 caption_loss_cls: 2.2277 detection_loss_cls: 0.0348 detection_loss_reg: 0.3390 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7096 instance_segmentation_loss_cls: 0.0337 instance_segmentation_loss_reg: 0.3434 instance_segmentation_loss_poly: 0.9270 2024/01/06 08:19:40 - mmengine - INFO - Iter(train) [235500/640000] base_lr: 1.4030e-04 lr: 1.4030e-05 eta: 7 days, 5:47:19 time: 1.5431 data_time: 0.0220 memory: 25573 grad_norm: 2.8150 loss: 1.3036 caption_loss_cls: 2.2294 detection_loss_cls: 0.0347 detection_loss_reg: 0.3379 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7097 instance_segmentation_loss_cls: 0.0337 instance_segmentation_loss_reg: 0.3434 instance_segmentation_loss_poly: 0.9262 2024/01/06 08:32:18 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 08:32:18 - mmengine - INFO - Iter(train) [236000/640000] base_lr: 1.4008e-04 lr: 1.4008e-05 eta: 7 days, 5:27:09 time: 1.5384 data_time: 0.0221 memory: 25573 grad_norm: 2.8473 loss: 1.3162 caption_loss_cls: 2.2332 detection_loss_cls: 0.0347 detection_loss_reg: 0.3379 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7086 instance_segmentation_loss_cls: 0.0338 instance_segmentation_loss_reg: 0.3435 instance_segmentation_loss_poly: 0.9260 2024/01/06 08:32:18 - mmengine - INFO - Saving checkpoint at 236000 iterations 2024/01/06 08:45:36 - mmengine - INFO - Iter(train) [236500/640000] base_lr: 1.3985e-04 lr: 1.3985e-05 eta: 7 days, 5:25:57 time: 1.5381 data_time: 0.0221 memory: 25573 grad_norm: 2.9589 loss: 1.3241 caption_loss_cls: 2.2335 detection_loss_cls: 0.0348 detection_loss_reg: 0.3390 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7032 instance_segmentation_loss_cls: 0.0338 instance_segmentation_loss_reg: 0.3426 instance_segmentation_loss_poly: 0.9239 2024/01/06 08:58:45 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 08:58:45 - mmengine - INFO - Iter(train) [237000/640000] base_lr: 1.3963e-04 lr: 1.3963e-05 eta: 7 days, 5:19:43 time: 1.5450 data_time: 0.0222 memory: 25573 grad_norm: 3.0561 loss: 1.3340 caption_loss_cls: 2.2323 detection_loss_cls: 0.0345 detection_loss_reg: 0.3382 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.6966 instance_segmentation_loss_cls: 0.0337 instance_segmentation_loss_reg: 0.3410 instance_segmentation_loss_poly: 0.9215 2024/01/06 09:11:26 - mmengine - INFO - Iter(train) [237500/640000] base_lr: 1.3940e-04 lr: 1.3940e-05 eta: 7 days, 5:01:07 time: 1.5483 data_time: 0.0222 memory: 25573 grad_norm: 3.0361 loss: 1.3278 caption_loss_cls: 2.2363 detection_loss_cls: 0.0347 detection_loss_reg: 0.3402 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.6974 instance_segmentation_loss_cls: 0.0336 instance_segmentation_loss_reg: 0.3416 instance_segmentation_loss_poly: 0.9224 2024/01/06 09:24:32 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 09:24:32 - mmengine - INFO - Iter(train) [238000/640000] base_lr: 1.3917e-04 lr: 1.3917e-05 eta: 7 days, 4:52:54 time: 1.5486 data_time: 0.0222 memory: 25573 grad_norm: 3.0502 loss: 1.3198 caption_loss_cls: 2.2387 detection_loss_cls: 0.0347 detection_loss_reg: 0.3403 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.6992 instance_segmentation_loss_cls: 0.0336 instance_segmentation_loss_reg: 0.3412 instance_segmentation_loss_poly: 0.9216 2024/01/06 09:24:32 - mmengine - INFO - Saving checkpoint at 238000 iterations 2024/01/06 09:37:58 - mmengine - INFO - Iter(train) [238500/640000] base_lr: 1.3895e-04 lr: 1.3895e-05 eta: 7 days, 4:52:56 time: 1.5592 data_time: 0.0223 memory: 25573 grad_norm: 3.0872 loss: 1.3084 caption_loss_cls: 2.2344 detection_loss_cls: 0.0346 detection_loss_reg: 0.3401 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.7001 instance_segmentation_loss_cls: 0.0334 instance_segmentation_loss_reg: 0.3395 instance_segmentation_loss_poly: 0.9164 2024/01/06 09:50:36 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 09:50:36 - mmengine - INFO - Iter(train) [239000/640000] base_lr: 1.3872e-04 lr: 1.3872e-05 eta: 7 days, 4:33:09 time: 1.5536 data_time: 0.0223 memory: 25573 grad_norm: 3.1208 loss: 1.3200 caption_loss_cls: 2.2312 detection_loss_cls: 0.0347 detection_loss_reg: 0.3409 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.6999 instance_segmentation_loss_cls: 0.0334 instance_segmentation_loss_reg: 0.3400 instance_segmentation_loss_poly: 0.9170 2024/01/06 10:03:13 - mmengine - INFO - Iter(train) [239500/640000] base_lr: 1.3850e-04 lr: 1.3850e-05 eta: 7 days, 4:13:40 time: 1.5529 data_time: 0.0223 memory: 25573 grad_norm: 3.2124 loss: 1.3225 caption_loss_cls: 2.2303 detection_loss_cls: 0.0344 detection_loss_reg: 0.3383 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.6953 instance_segmentation_loss_cls: 0.0335 instance_segmentation_loss_reg: 0.3403 instance_segmentation_loss_poly: 0.9174 2024/01/06 10:16:03 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 10:16:03 - mmengine - INFO - Iter(train) [240000/640000] base_lr: 1.3827e-04 lr: 1.3827e-05 eta: 7 days, 3:59:04 time: 1.5557 data_time: 0.0222 memory: 25573 grad_norm: 3.2343 loss: 1.3041 caption_loss_cls: 2.2244 detection_loss_cls: 0.0343 detection_loss_reg: 0.3379 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.6925 instance_segmentation_loss_cls: 0.0336 instance_segmentation_loss_reg: 0.3405 instance_segmentation_loss_poly: 0.9170 2024/01/06 10:16:03 - mmengine - INFO - Saving checkpoint at 240000 iterations 2024/01/06 10:27:30 - mmengine - INFO - Evaluating bbox... 2024/01/06 10:28:27 - mmengine - INFO - bbox_mAP_copypaste: 0.493 0.677 0.540 0.336 0.538 0.643 2024/01/06 10:28:27 - mmengine - INFO - Evaluating segm... 2024/01/06 10:29:43 - mmengine - INFO - segm_mAP_copypaste: 0.325 0.589 0.315 0.184 0.373 0.499 2024/01/06 10:31:53 - mmengine - INFO - Evaluating bbox... 2024/01/06 10:32:51 - mmengine - INFO - bbox_mAP_copypaste: 0.492 0.676 0.539 0.333 0.537 0.641 2024/01/06 10:39:28 - mmengine - INFO - per class results: 2024/01/06 10:39:28 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 79.03 | 89.61 | | building | 79.69 | 85.82 | | sky | 93.06 | 97.57 | | floor | 82.43 | 89.36 | | tree | 74.18 | 89.29 | | ceiling | 85.12 | 94.04 | | road | 83.77 | 91.35 | | bed | 88.49 | 95.11 | | windowpane | 61.52 | 72.68 | | grass | 66.91 | 84.28 | | cabinet | 61.66 | 69.23 | | sidewalk | 66.31 | 78.36 | | person | 80.09 | 91.83 | | earth | 36.34 | 43.8 | | door | 56.06 | 71.09 | | table | 61.74 | 83.76 | | mountain | 58.68 | 75.04 | | plant | 52.8 | 62.32 | | curtain | 75.41 | 89.03 | | chair | 60.6 | 73.67 | | car | 83.1 | 89.22 | | water | 58.94 | 78.5 | | painting | 72.75 | 85.2 | | sofa | 72.97 | 86.01 | | shelf | 44.55 | 69.88 | | house | 37.61 | 86.42 | | sea | 64.12 | 83.28 | | mirror | 64.78 | 70.17 | | rug | 66.95 | 80.86 | | field | 35.41 | 56.09 | | armchair | 51.66 | 68.01 | | seat | 63.25 | 78.87 | | fence | 43.81 | 57.19 | | desk | 50.29 | 70.99 | | rock | 53.67 | 72.96 | | wardrobe | 44.93 | 65.8 | | lamp | 62.3 | 74.69 | | bathtub | 83.47 | 87.65 | | railing | 39.37 | 62.22 | | cushion | 58.47 | 78.09 | | base | 25.35 | 29.69 | | box | 29.0 | 39.61 | | column | 51.56 | 64.09 | | signboard | 37.71 | 52.16 | | chest of drawers | 42.53 | 63.62 | | counter | 28.11 | 44.41 | | sand | 50.27 | 67.61 | | sink | 74.07 | 86.68 | | skyscraper | 60.74 | 93.96 | | fireplace | 75.29 | 93.25 | | refrigerator | 76.89 | 84.31 | | grandstand | 50.91 | 79.26 | | path | 18.6 | 32.58 | | stairs | 34.7 | 42.19 | | runway | 69.64 | 94.17 | | case | 49.29 | 61.47 | | pool table | 91.74 | 94.72 | | pillow | 58.37 | 67.29 | | screen door | 62.09 | 63.21 | | stairway | 31.89 | 45.33 | | river | 14.36 | 19.92 | | bridge | 59.07 | 75.05 | | bookcase | 29.32 | 45.18 | | blind | 41.75 | 52.06 | | coffee table | 63.69 | 76.61 | | toilet | 85.89 | 92.08 | | flower | 35.6 | 51.19 | | book | 45.7 | 70.18 | | hill | 13.71 | 18.46 | | bench | 56.57 | 68.09 | | countertop | 58.08 | 73.05 | | stove | 76.47 | 81.96 | | palm | 47.09 | 74.23 | | kitchen island | 43.3 | 53.5 | | computer | 72.65 | 93.21 | | swivel chair | 49.99 | 65.3 | | boat | 70.78 | 87.45 | | bar | 41.46 | 56.22 | | arcade machine | 35.25 | 35.91 | | hovel | 33.73 | 41.9 | | bus | 91.8 | 95.08 | | towel | 62.48 | 78.07 | | light | 52.25 | 65.76 | | truck | 36.8 | 52.6 | | tower | 25.43 | 43.06 | | chandelier | 66.66 | 79.5 | | awning | 31.96 | 38.42 | | streetlight | 34.96 | 50.69 | | booth | 48.37 | 57.74 | | television receiver | 67.15 | 89.54 | | airplane | 53.71 | 69.05 | | dirt track | 2.71 | 37.46 | | apparel | 31.4 | 48.73 | | pole | 25.0 | 34.93 | | land | 0.22 | 0.25 | | bannister | 15.22 | 20.52 | | escalator | 22.25 | 23.4 | | ottoman | 46.97 | 72.78 | | bottle | 25.41 | 32.76 | | buffet | 58.97 | 72.65 | | poster | 34.04 | 47.51 | | stage | 13.17 | 15.07 | | van | 41.49 | 76.38 | | ship | 30.73 | 33.84 | | fountain | 5.71 | 5.77 | | conveyer belt | 73.83 | 87.7 | | canopy | 26.56 | 56.05 | | washer | 76.04 | 78.39 | | plaything | 34.18 | 54.81 | | swimming pool | 57.76 | 67.7 | | stool | 46.24 | 61.34 | | barrel | 16.9 | 64.66 | | basket | 38.12 | 60.43 | | waterfall | 55.4 | 60.96 | | tent | 92.03 | 97.25 | | bag | 24.63 | 36.05 | | minibike | 71.01 | 82.75 | | cradle | 78.71 | 97.0 | | oven | 50.73 | 60.34 | | ball | 46.12 | 59.49 | | food | 53.54 | 57.04 | | step | 14.12 | 17.19 | | tank | 49.02 | 66.3 | | trade name | 15.9 | 17.56 | | microwave | 84.22 | 92.14 | | pot | 50.54 | 64.9 | | animal | 57.62 | 60.49 | | bicycle | 58.54 | 75.52 | | lake | 46.0 | 62.23 | | dishwasher | 71.83 | 85.89 | | screen | 77.62 | 87.95 | | blanket | 17.75 | 21.51 | | sculpture | 55.1 | 86.38 | | hood | 63.92 | 69.72 | | sconce | 42.7 | 55.67 | | vase | 44.01 | 58.14 | | traffic light | 40.56 | 61.12 | | tray | 11.85 | 13.59 | | ashcan | 32.19 | 37.89 | | fan | 60.44 | 75.68 | | pier | 40.55 | 48.27 | | crt screen | 5.53 | 6.25 | | plate | 56.88 | 77.12 | | monitor | 42.74 | 51.37 | | bulletin board | 31.75 | 34.18 | | shower | 8.5 | 20.1 | | radiator | 56.02 | 65.92 | | glass | 15.59 | 16.51 | | clock | 27.04 | 40.3 | | flag | 37.85 | 44.49 | +---------------------+-------+-------+ 2024/01/06 10:39:42 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.4920 coco/bbox_mAP_50: 0.6760 coco/bbox_mAP_75: 0.5390 coco/bbox_mAP_s: 0.3330 coco/bbox_mAP_m: 0.5370 coco/bbox_mAP_l: 0.6410 coco/segm_mAP: 0.3250 coco/segm_mAP_50: 0.5890 coco/segm_mAP_75: 0.3150 coco/segm_mAP_s: 0.1840 coco/segm_mAP_m: 0.3730 coco/segm_mAP_l: 0.4990 Bleu_1: 0.7422 Bleu_2: 0.5720 Bleu_3: 0.4266 Bleu_4: 0.3136 METEOR: 0.2592 ROUGE_L: 0.5425 CIDEr: 1.0090 SPICE: 0.1941 aAcc: 83.3300 mIoU: 50.2200 mAcc: 63.3200 visual-grounding/miou: 0.7897 visual-grounding/acc: 0.8581 data_time: 0.0278 time: 1.9402 2024/01/06 10:52:20 - mmengine - INFO - Iter(train) [240500/640000] base_lr: 1.3804e-04 lr: 1.3804e-05 eta: 7 days, 3:41:20 time: 1.5463 data_time: 0.0171 memory: 34667 grad_norm: 3.2046 loss: 1.3148 caption_loss_cls: 2.2221 detection_loss_cls: 0.0341 detection_loss_reg: 0.3360 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.6937 instance_segmentation_loss_cls: 0.0336 instance_segmentation_loss_reg: 0.3405 instance_segmentation_loss_poly: 0.9181 2024/01/06 11:04:36 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 11:04:36 - mmengine - INFO - Iter(train) [241000/640000] base_lr: 1.3781e-04 lr: 1.3781e-05 eta: 7 days, 3:15:09 time: 1.5331 data_time: 0.0172 memory: 25572 grad_norm: 3.3357 loss: 1.3141 caption_loss_cls: 2.2161 detection_loss_cls: 0.0341 detection_loss_reg: 0.3350 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.6974 instance_segmentation_loss_cls: 0.0337 instance_segmentation_loss_reg: 0.3418 instance_segmentation_loss_poly: 0.9216 2024/01/06 11:17:36 - mmengine - INFO - Iter(train) [241500/640000] base_lr: 1.3759e-04 lr: 1.3759e-05 eta: 7 days, 3:04:38 time: 1.5377 data_time: 0.0175 memory: 25572 grad_norm: 3.3715 loss: 1.3025 caption_loss_cls: 2.2236 detection_loss_cls: 0.0340 detection_loss_reg: 0.3345 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.6939 instance_segmentation_loss_cls: 0.0334 instance_segmentation_loss_reg: 0.3385 instance_segmentation_loss_poly: 0.9143 2024/01/06 11:30:01 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 11:30:01 - mmengine - INFO - Iter(train) [242000/640000] base_lr: 1.3736e-04 lr: 1.3736e-05 eta: 7 days, 2:42:28 time: 1.5277 data_time: 0.0177 memory: 25572 grad_norm: 3.3937 loss: 1.3097 caption_loss_cls: 2.2168 detection_loss_cls: 0.0340 detection_loss_reg: 0.3358 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.6871 instance_segmentation_loss_cls: 0.0334 instance_segmentation_loss_reg: 0.3392 instance_segmentation_loss_poly: 0.9151 2024/01/06 11:30:01 - mmengine - INFO - Saving checkpoint at 242000 iterations 2024/01/06 11:43:14 - mmengine - INFO - Iter(train) [242500/640000] base_lr: 1.3713e-04 lr: 1.3713e-05 eta: 7 days, 2:36:22 time: 1.5244 data_time: 0.0197 memory: 25572 grad_norm: 3.3442 loss: 1.3084 caption_loss_cls: 2.2216 detection_loss_cls: 0.0341 detection_loss_reg: 0.3364 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.6813 instance_segmentation_loss_cls: 0.0330 instance_segmentation_loss_reg: 0.3358 instance_segmentation_loss_poly: 0.9063 2024/01/06 11:55:35 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 11:55:35 - mmengine - INFO - Iter(train) [243000/640000] base_lr: 1.3690e-04 lr: 1.3690e-05 eta: 7 days, 2:13:22 time: 1.5201 data_time: 0.0199 memory: 25572 grad_norm: 3.3526 loss: 1.3136 caption_loss_cls: 2.2211 detection_loss_cls: 0.0339 detection_loss_reg: 0.3349 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.6753 instance_segmentation_loss_cls: 0.0332 instance_segmentation_loss_reg: 0.3362 instance_segmentation_loss_poly: 0.9062 2024/01/06 12:08:18 - mmengine - INFO - Iter(train) [243500/640000] base_lr: 1.3668e-04 lr: 1.3668e-05 eta: 7 days, 1:57:52 time: 1.5216 data_time: 0.0201 memory: 25572 grad_norm: 3.2545 loss: 1.3085 caption_loss_cls: 2.2282 detection_loss_cls: 0.0341 detection_loss_reg: 0.3364 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.6746 instance_segmentation_loss_cls: 0.0331 instance_segmentation_loss_reg: 0.3351 instance_segmentation_loss_poly: 0.9037 2024/01/06 12:21:17 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 12:21:17 - mmengine - INFO - Iter(train) [244000/640000] base_lr: 1.3645e-04 lr: 1.3645e-05 eta: 7 days, 1:47:07 time: 1.5239 data_time: 0.0205 memory: 25572 grad_norm: 3.1663 loss: 1.3040 caption_loss_cls: 2.2309 detection_loss_cls: 0.0339 detection_loss_reg: 0.3352 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.6745 instance_segmentation_loss_cls: 0.0330 instance_segmentation_loss_reg: 0.3344 instance_segmentation_loss_poly: 0.9014 2024/01/06 12:21:17 - mmengine - INFO - Saving checkpoint at 244000 iterations 2024/01/06 12:34:30 - mmengine - INFO - Iter(train) [244500/640000] base_lr: 1.3622e-04 lr: 1.3622e-05 eta: 7 days, 1:40:18 time: 1.5319 data_time: 0.0270 memory: 25572 grad_norm: 3.1173 loss: 1.2960 caption_loss_cls: 2.2352 detection_loss_cls: 0.0339 detection_loss_reg: 0.3366 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.6772 instance_segmentation_loss_cls: 0.0331 instance_segmentation_loss_reg: 0.3340 instance_segmentation_loss_poly: 0.9010 2024/01/06 12:46:45 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 12:46:45 - mmengine - INFO - Iter(train) [245000/640000] base_lr: 1.3599e-04 lr: 1.3599e-05 eta: 7 days, 1:16:41 time: 1.5316 data_time: 0.0269 memory: 25572 grad_norm: 2.9885 loss: 1.2992 caption_loss_cls: 2.2345 detection_loss_cls: 0.0340 detection_loss_reg: 0.3372 semantic_segmentation_loss_cls: 0.0087 grounding_loss_reg: 2.6785 instance_segmentation_loss_cls: 0.0330 instance_segmentation_loss_reg: 0.3334 instance_segmentation_loss_poly: 0.9006 2024/01/06 12:59:04 - mmengine - INFO - Iter(train) [245500/640000] base_lr: 1.3576e-04 lr: 1.3576e-05 eta: 7 days, 0:54:39 time: 1.5214 data_time: 0.0268 memory: 25572 grad_norm: 2.9732 loss: 1.3088 caption_loss_cls: 2.2347 detection_loss_cls: 0.0341 detection_loss_reg: 0.3384 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.6794 instance_segmentation_loss_cls: 0.0327 instance_segmentation_loss_reg: 0.3305 instance_segmentation_loss_poly: 0.8956 2024/01/06 13:11:37 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 13:11:37 - mmengine - INFO - Iter(train) [246000/640000] base_lr: 1.3553e-04 lr: 1.3553e-05 eta: 7 days, 0:37:02 time: 1.5235 data_time: 0.0269 memory: 25572 grad_norm: 2.9624 loss: 1.3133 caption_loss_cls: 2.2366 detection_loss_cls: 0.0341 detection_loss_reg: 0.3385 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.6742 instance_segmentation_loss_cls: 0.0326 instance_segmentation_loss_reg: 0.3315 instance_segmentation_loss_poly: 0.8973 2024/01/06 13:11:37 - mmengine - INFO - Saving checkpoint at 246000 iterations 2024/01/06 13:24:44 - mmengine - INFO - Iter(train) [246500/640000] base_lr: 1.3530e-04 lr: 1.3530e-05 eta: 7 days, 0:28:41 time: 1.5221 data_time: 0.0263 memory: 25572 grad_norm: 2.9590 loss: 1.3126 caption_loss_cls: 2.2374 detection_loss_cls: 0.0339 detection_loss_reg: 0.3370 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.6747 instance_segmentation_loss_cls: 0.0326 instance_segmentation_loss_reg: 0.3315 instance_segmentation_loss_poly: 0.8981 2024/01/06 13:37:55 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 13:37:55 - mmengine - INFO - Iter(train) [247000/640000] base_lr: 1.3507e-04 lr: 1.3507e-05 eta: 7 days, 0:20:52 time: 1.5344 data_time: 0.0264 memory: 25572 grad_norm: 2.9102 loss: 1.3089 caption_loss_cls: 2.2391 detection_loss_cls: 0.0337 detection_loss_reg: 0.3371 semantic_segmentation_loss_cls: 0.0087 grounding_loss_reg: 2.6719 instance_segmentation_loss_cls: 0.0327 instance_segmentation_loss_reg: 0.3321 instance_segmentation_loss_poly: 0.8988 2024/01/06 13:50:11 - mmengine - INFO - Iter(train) [247500/640000] base_lr: 1.3484e-04 lr: 1.3484e-05 eta: 6 days, 23:59:02 time: 1.5276 data_time: 0.0264 memory: 25572 grad_norm: 2.9324 loss: 1.3151 caption_loss_cls: 2.2445 detection_loss_cls: 0.0337 detection_loss_reg: 0.3367 semantic_segmentation_loss_cls: 0.0088 grounding_loss_reg: 2.6720 instance_segmentation_loss_cls: 0.0325 instance_segmentation_loss_reg: 0.3308 instance_segmentation_loss_poly: 0.8950 2024/01/06 14:02:11 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 14:02:11 - mmengine - INFO - Iter(train) [248000/640000] base_lr: 1.3461e-04 lr: 1.3461e-05 eta: 6 days, 23:33:29 time: 1.5129 data_time: 0.0261 memory: 25572 grad_norm: 2.9967 loss: 1.3312 caption_loss_cls: 2.2481 detection_loss_cls: 0.0337 detection_loss_reg: 0.3371 semantic_segmentation_loss_cls: 0.0087 grounding_loss_reg: 2.6699 instance_segmentation_loss_cls: 0.0326 instance_segmentation_loss_reg: 0.3310 instance_segmentation_loss_poly: 0.8956 2024/01/06 14:02:11 - mmengine - INFO - Saving checkpoint at 248000 iterations 2024/01/06 14:15:01 - mmengine - INFO - Iter(train) [248500/640000] base_lr: 1.3438e-04 lr: 1.3438e-05 eta: 6 days, 23:20:45 time: 1.5073 data_time: 0.0261 memory: 25572 grad_norm: 3.0368 loss: 1.3265 caption_loss_cls: 2.2430 detection_loss_cls: 0.0339 detection_loss_reg: 0.3389 semantic_segmentation_loss_cls: 0.0087 grounding_loss_reg: 2.6697 instance_segmentation_loss_cls: 0.0326 instance_segmentation_loss_reg: 0.3318 instance_segmentation_loss_poly: 0.8953 2024/01/06 14:27:46 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 14:27:46 - mmengine - INFO - Iter(train) [249000/640000] base_lr: 1.3415e-04 lr: 1.3415e-05 eta: 6 days, 23:06:52 time: 1.5149 data_time: 0.0262 memory: 25572 grad_norm: 3.0429 loss: 1.3264 caption_loss_cls: 2.2477 detection_loss_cls: 0.0338 detection_loss_reg: 0.3383 semantic_segmentation_loss_cls: 0.0087 grounding_loss_reg: 2.6740 instance_segmentation_loss_cls: 0.0326 instance_segmentation_loss_reg: 0.3321 instance_segmentation_loss_poly: 0.8959 2024/01/06 14:40:01 - mmengine - INFO - Iter(train) [249500/640000] base_lr: 1.3392e-04 lr: 1.3392e-05 eta: 6 days, 22:45:48 time: 1.5138 data_time: 0.0263 memory: 25572 grad_norm: 3.0888 loss: 1.3468 caption_loss_cls: 2.2613 detection_loss_cls: 0.0336 detection_loss_reg: 0.3368 semantic_segmentation_loss_cls: 0.0087 grounding_loss_reg: 2.6714 instance_segmentation_loss_cls: 0.0326 instance_segmentation_loss_reg: 0.3325 instance_segmentation_loss_poly: 0.8966 2024/01/06 14:52:48 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 14:52:48 - mmengine - INFO - Iter(train) [250000/640000] base_lr: 1.3369e-04 lr: 1.3369e-05 eta: 6 days, 22:32:35 time: 1.5173 data_time: 0.0264 memory: 25572 grad_norm: 3.1032 loss: 1.3413 caption_loss_cls: 2.2649 detection_loss_cls: 0.0338 detection_loss_reg: 0.3378 semantic_segmentation_loss_cls: 0.0087 grounding_loss_reg: 2.6709 instance_segmentation_loss_cls: 0.0325 instance_segmentation_loss_reg: 0.3315 instance_segmentation_loss_poly: 0.8925 2024/01/06 14:52:48 - mmengine - INFO - Saving checkpoint at 250000 iterations 2024/01/06 15:06:17 - mmengine - INFO - Iter(train) [250500/640000] base_lr: 1.3346e-04 lr: 1.3346e-05 eta: 6 days, 22:28:49 time: 1.5226 data_time: 0.0264 memory: 25572 grad_norm: 3.1476 loss: 1.3284 caption_loss_cls: 2.2617 detection_loss_cls: 0.0338 detection_loss_reg: 0.3380 semantic_segmentation_loss_cls: 0.0087 grounding_loss_reg: 2.6688 instance_segmentation_loss_cls: 0.0323 instance_segmentation_loss_reg: 0.3297 instance_segmentation_loss_poly: 0.8872 2024/01/06 15:18:48 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 15:18:48 - mmengine - INFO - Iter(train) [251000/640000] base_lr: 1.3323e-04 lr: 1.3323e-05 eta: 6 days, 22:11:48 time: 1.5128 data_time: 0.0263 memory: 25572 grad_norm: 3.2121 loss: 1.3313 caption_loss_cls: 2.2620 detection_loss_cls: 0.0340 detection_loss_reg: 0.3401 semantic_segmentation_loss_cls: 0.0087 grounding_loss_reg: 2.6688 instance_segmentation_loss_cls: 0.0323 instance_segmentation_loss_reg: 0.3297 instance_segmentation_loss_poly: 0.8872 2024/01/06 15:31:28 - mmengine - INFO - Iter(train) [251500/640000] base_lr: 1.3300e-04 lr: 1.3300e-05 eta: 6 days, 21:57:04 time: 1.5188 data_time: 0.0264 memory: 25572 grad_norm: 3.2316 loss: 1.3171 caption_loss_cls: 2.2585 detection_loss_cls: 0.0339 detection_loss_reg: 0.3388 semantic_segmentation_loss_cls: 0.0086 grounding_loss_reg: 2.6668 instance_segmentation_loss_cls: 0.0322 instance_segmentation_loss_reg: 0.3292 instance_segmentation_loss_poly: 0.8855 2024/01/06 15:44:57 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 15:44:57 - mmengine - INFO - Iter(train) [252000/640000] base_lr: 1.3276e-04 lr: 1.3276e-05 eta: 6 days, 21:52:49 time: 1.5410 data_time: 0.0268 memory: 25572 grad_norm: 3.1665 loss: 1.3053 caption_loss_cls: 2.2521 detection_loss_cls: 0.0338 detection_loss_reg: 0.3394 semantic_segmentation_loss_cls: 0.0086 grounding_loss_reg: 2.6675 instance_segmentation_loss_cls: 0.0322 instance_segmentation_loss_reg: 0.3296 instance_segmentation_loss_poly: 0.8860 2024/01/06 15:44:57 - mmengine - INFO - Saving checkpoint at 252000 iterations 2024/01/06 15:57:47 - mmengine - INFO - Iter(train) [252500/640000] base_lr: 1.3253e-04 lr: 1.3253e-05 eta: 6 days, 21:40:02 time: 1.5410 data_time: 0.0268 memory: 25572 grad_norm: 3.1852 loss: 1.3099 caption_loss_cls: 2.2503 detection_loss_cls: 0.0338 detection_loss_reg: 0.3387 semantic_segmentation_loss_cls: 0.0086 grounding_loss_reg: 2.6673 instance_segmentation_loss_cls: 0.0322 instance_segmentation_loss_reg: 0.3301 instance_segmentation_loss_poly: 0.8866 2024/01/06 16:10:36 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 16:10:36 - mmengine - INFO - Iter(train) [253000/640000] base_lr: 1.3230e-04 lr: 1.3230e-05 eta: 6 days, 21:27:07 time: 1.5420 data_time: 0.0268 memory: 25572 grad_norm: 3.1358 loss: 1.3122 caption_loss_cls: 2.2560 detection_loss_cls: 0.0335 detection_loss_reg: 0.3369 semantic_segmentation_loss_cls: 0.0086 grounding_loss_reg: 2.6658 instance_segmentation_loss_cls: 0.0324 instance_segmentation_loss_reg: 0.3315 instance_segmentation_loss_poly: 0.8904 2024/01/06 16:22:41 - mmengine - INFO - Iter(train) [253500/640000] base_lr: 1.3207e-04 lr: 1.3207e-05 eta: 6 days, 21:04:59 time: 1.5394 data_time: 0.0267 memory: 25572 grad_norm: 3.0955 loss: 1.3039 caption_loss_cls: 2.2610 detection_loss_cls: 0.0333 detection_loss_reg: 0.3353 semantic_segmentation_loss_cls: 0.0086 grounding_loss_reg: 2.6670 instance_segmentation_loss_cls: 0.0326 instance_segmentation_loss_reg: 0.3333 instance_segmentation_loss_poly: 0.8932 2024/01/06 16:35:49 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 16:35:49 - mmengine - INFO - Iter(train) [254000/640000] base_lr: 1.3183e-04 lr: 1.3183e-05 eta: 6 days, 20:56:02 time: 1.5447 data_time: 0.0268 memory: 25572 grad_norm: 3.1221 loss: 1.2995 caption_loss_cls: 2.2573 detection_loss_cls: 0.0332 detection_loss_reg: 0.3352 semantic_segmentation_loss_cls: 0.0085 grounding_loss_reg: 2.6685 instance_segmentation_loss_cls: 0.0327 instance_segmentation_loss_reg: 0.3327 instance_segmentation_loss_poly: 0.8935 2024/01/06 16:35:49 - mmengine - INFO - Saving checkpoint at 254000 iterations 2024/01/06 16:48:47 - mmengine - INFO - Iter(train) [254500/640000] base_lr: 1.3160e-04 lr: 1.3160e-05 eta: 6 days, 20:44:50 time: 1.5369 data_time: 0.0266 memory: 25572 grad_norm: 3.1417 loss: 1.3114 caption_loss_cls: 2.2543 detection_loss_cls: 0.0329 detection_loss_reg: 0.3341 semantic_segmentation_loss_cls: 0.0085 grounding_loss_reg: 2.6673 instance_segmentation_loss_cls: 0.0328 instance_segmentation_loss_reg: 0.3323 instance_segmentation_loss_poly: 0.8935 2024/01/06 17:01:15 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 17:01:15 - mmengine - INFO - Iter(train) [255000/640000] base_lr: 1.3137e-04 lr: 1.3137e-05 eta: 6 days, 20:27:52 time: 1.5363 data_time: 0.0266 memory: 25572 grad_norm: 3.1540 loss: 1.3233 caption_loss_cls: 2.2573 detection_loss_cls: 0.0332 detection_loss_reg: 0.3357 semantic_segmentation_loss_cls: 0.0085 grounding_loss_reg: 2.6704 instance_segmentation_loss_cls: 0.0326 instance_segmentation_loss_reg: 0.3310 instance_segmentation_loss_poly: 0.8922 2024/01/06 17:14:03 - mmengine - INFO - Iter(train) [255500/640000] base_lr: 1.3114e-04 lr: 1.3114e-05 eta: 6 days, 20:14:57 time: 1.5383 data_time: 0.0267 memory: 25572 grad_norm: 3.1157 loss: 1.3308 caption_loss_cls: 2.2528 detection_loss_cls: 0.0332 detection_loss_reg: 0.3361 semantic_segmentation_loss_cls: 0.0085 grounding_loss_reg: 2.6685 instance_segmentation_loss_cls: 0.0328 instance_segmentation_loss_reg: 0.3332 instance_segmentation_loss_poly: 0.8973 2024/01/06 17:26:29 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 17:26:29 - mmengine - INFO - Iter(train) [256000/640000] base_lr: 1.3090e-04 lr: 1.3090e-05 eta: 6 days, 19:57:43 time: 1.5225 data_time: 0.0265 memory: 25572 grad_norm: 3.1719 loss: 1.3351 caption_loss_cls: 2.2561 detection_loss_cls: 0.0330 detection_loss_reg: 0.3353 semantic_segmentation_loss_cls: 0.0085 grounding_loss_reg: 2.6677 instance_segmentation_loss_cls: 0.0326 instance_segmentation_loss_reg: 0.3322 instance_segmentation_loss_poly: 0.8974 2024/01/06 17:26:29 - mmengine - INFO - Saving checkpoint at 256000 iterations 2024/01/06 17:39:23 - mmengine - INFO - Iter(train) [256500/640000] base_lr: 1.3067e-04 lr: 1.3067e-05 eta: 6 days, 19:45:53 time: 1.5235 data_time: 0.0265 memory: 25572 grad_norm: 3.1871 loss: 1.3430 caption_loss_cls: 2.2524 detection_loss_cls: 0.0332 detection_loss_reg: 0.3370 semantic_segmentation_loss_cls: 0.0085 grounding_loss_reg: 2.6689 instance_segmentation_loss_cls: 0.0325 instance_segmentation_loss_reg: 0.3315 instance_segmentation_loss_poly: 0.8964 2024/01/06 17:51:55 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 17:51:55 - mmengine - INFO - Iter(train) [257000/640000] base_lr: 1.3043e-04 lr: 1.3043e-05 eta: 6 days, 19:29:58 time: 1.5192 data_time: 0.0265 memory: 25572 grad_norm: 3.2617 loss: 1.3380 caption_loss_cls: 2.2476 detection_loss_cls: 0.0332 detection_loss_reg: 0.3365 semantic_segmentation_loss_cls: 0.0084 grounding_loss_reg: 2.6731 instance_segmentation_loss_cls: 0.0327 instance_segmentation_loss_reg: 0.3335 instance_segmentation_loss_poly: 0.9008 2024/01/06 18:04:31 - mmengine - INFO - Iter(train) [257500/640000] base_lr: 1.3020e-04 lr: 1.3020e-05 eta: 6 days, 19:14:52 time: 1.5271 data_time: 0.0266 memory: 25572 grad_norm: 3.2214 loss: 1.3235 caption_loss_cls: 2.2439 detection_loss_cls: 0.0330 detection_loss_reg: 0.3353 semantic_segmentation_loss_cls: 0.0084 grounding_loss_reg: 2.6746 instance_segmentation_loss_cls: 0.0329 instance_segmentation_loss_reg: 0.3351 instance_segmentation_loss_poly: 0.9038 2024/01/06 18:17:10 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 18:17:10 - mmengine - INFO - Iter(train) [258000/640000] base_lr: 1.2997e-04 lr: 1.2997e-05 eta: 6 days, 19:00:24 time: 1.5198 data_time: 0.0265 memory: 25572 grad_norm: 3.1893 loss: 1.3277 caption_loss_cls: 2.2427 detection_loss_cls: 0.0328 detection_loss_reg: 0.3328 semantic_segmentation_loss_cls: 0.0084 grounding_loss_reg: 2.6737 instance_segmentation_loss_cls: 0.0328 instance_segmentation_loss_reg: 0.3351 instance_segmentation_loss_poly: 0.9038 2024/01/06 18:17:10 - mmengine - INFO - Saving checkpoint at 258000 iterations 2024/01/06 18:30:27 - mmengine - INFO - Iter(train) [258500/640000] base_lr: 1.2973e-04 lr: 1.2973e-05 eta: 6 days, 18:52:37 time: 1.5246 data_time: 0.0267 memory: 25572 grad_norm: 3.1633 loss: 1.3360 caption_loss_cls: 2.2383 detection_loss_cls: 0.0329 detection_loss_reg: 0.3343 semantic_segmentation_loss_cls: 0.0084 grounding_loss_reg: 2.6723 instance_segmentation_loss_cls: 0.0329 instance_segmentation_loss_reg: 0.3359 instance_segmentation_loss_poly: 0.9047 2024/01/06 18:43:28 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 18:43:28 - mmengine - INFO - Iter(train) [259000/640000] base_lr: 1.2950e-04 lr: 1.2950e-05 eta: 6 days, 18:41:55 time: 1.5329 data_time: 0.0268 memory: 25572 grad_norm: 3.1978 loss: 1.3142 caption_loss_cls: 2.2349 detection_loss_cls: 0.0329 detection_loss_reg: 0.3339 semantic_segmentation_loss_cls: 0.0084 grounding_loss_reg: 2.6743 instance_segmentation_loss_cls: 0.0329 instance_segmentation_loss_reg: 0.3356 instance_segmentation_loss_poly: 0.9041 2024/01/06 18:56:02 - mmengine - INFO - Iter(train) [259500/640000] base_lr: 1.2926e-04 lr: 1.2926e-05 eta: 6 days, 18:26:33 time: 1.5291 data_time: 0.0267 memory: 25572 grad_norm: 3.2533 loss: 1.3178 caption_loss_cls: 2.2334 detection_loss_cls: 0.0331 detection_loss_reg: 0.3362 semantic_segmentation_loss_cls: 0.0084 grounding_loss_reg: 2.6767 instance_segmentation_loss_cls: 0.0328 instance_segmentation_loss_reg: 0.3350 instance_segmentation_loss_poly: 0.9040 2024/01/06 19:08:20 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 19:08:20 - mmengine - INFO - Iter(train) [260000/640000] base_lr: 1.2903e-04 lr: 1.2903e-05 eta: 6 days, 18:08:39 time: 1.5273 data_time: 0.0267 memory: 25572 grad_norm: 3.2537 loss: 1.3224 caption_loss_cls: 2.2369 detection_loss_cls: 0.0333 detection_loss_reg: 0.3396 semantic_segmentation_loss_cls: 0.0084 grounding_loss_reg: 2.6734 instance_segmentation_loss_cls: 0.0327 instance_segmentation_loss_reg: 0.3346 instance_segmentation_loss_poly: 0.9030 2024/01/06 19:08:20 - mmengine - INFO - Saving checkpoint at 260000 iterations 2024/01/06 19:20:00 - mmengine - INFO - Evaluating bbox... 2024/01/06 19:21:00 - mmengine - INFO - bbox_mAP_copypaste: 0.491 0.676 0.538 0.332 0.543 0.635 2024/01/06 19:21:00 - mmengine - INFO - Evaluating segm... 2024/01/06 19:22:13 - mmengine - INFO - segm_mAP_copypaste: 0.327 0.590 0.321 0.182 0.369 0.502 2024/01/06 19:24:21 - mmengine - INFO - Evaluating bbox... 2024/01/06 19:25:20 - mmengine - INFO - bbox_mAP_copypaste: 0.490 0.674 0.537 0.331 0.543 0.633 2024/01/06 19:30:12 - mmengine - INFO - per class results: 2024/01/06 19:30:12 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 78.49 | 89.28 | | building | 82.17 | 91.24 | | sky | 93.19 | 97.59 | | floor | 81.67 | 89.94 | | tree | 74.4 | 85.45 | | ceiling | 85.2 | 92.59 | | road | 82.6 | 90.2 | | bed | 88.99 | 96.74 | | windowpane | 63.96 | 80.45 | | grass | 64.57 | 76.68 | | cabinet | 64.65 | 76.63 | | sidewalk | 67.7 | 82.83 | | person | 81.13 | 90.3 | | earth | 35.05 | 47.89 | | door | 54.82 | 66.81 | | table | 62.17 | 81.0 | | mountain | 58.81 | 74.16 | | plant | 51.73 | 63.4 | | curtain | 73.97 | 90.78 | | chair | 61.0 | 75.37 | | car | 84.36 | 91.5 | | water | 60.31 | 72.05 | | painting | 71.24 | 89.58 | | sofa | 70.96 | 83.41 | | shelf | 44.4 | 65.21 | | house | 44.17 | 73.2 | | sea | 69.13 | 83.26 | | mirror | 68.61 | 75.21 | | rug | 68.13 | 83.08 | | field | 29.52 | 54.07 | | armchair | 50.97 | 69.02 | | seat | 64.66 | 80.88 | | fence | 47.35 | 70.29 | | desk | 50.81 | 65.07 | | rock | 52.76 | 80.38 | | wardrobe | 50.52 | 68.94 | | lamp | 63.12 | 78.37 | | bathtub | 79.53 | 88.25 | | railing | 35.7 | 51.04 | | cushion | 58.84 | 72.66 | | base | 27.08 | 34.56 | | box | 26.68 | 34.74 | | column | 51.69 | 61.57 | | signboard | 37.67 | 54.92 | | chest of drawers | 34.98 | 51.19 | | counter | 34.62 | 47.67 | | sand | 39.45 | 54.95 | | sink | 77.18 | 85.74 | | skyscraper | 58.88 | 72.11 | | fireplace | 71.12 | 92.08 | | refrigerator | 76.2 | 83.52 | | grandstand | 47.94 | 79.65 | | path | 15.75 | 22.18 | | stairs | 31.61 | 37.42 | | runway | 52.88 | 64.1 | | case | 51.45 | 61.6 | | pool table | 90.97 | 95.81 | | pillow | 56.61 | 66.73 | | screen door | 57.74 | 59.5 | | stairway | 31.69 | 55.68 | | river | 14.55 | 24.75 | | bridge | 53.34 | 61.19 | | bookcase | 37.96 | 61.59 | | blind | 37.42 | 39.6 | | coffee table | 66.06 | 76.68 | | toilet | 85.25 | 90.22 | | flower | 40.17 | 55.49 | | book | 46.2 | 67.7 | | hill | 7.66 | 13.42 | | bench | 53.66 | 63.12 | | countertop | 59.0 | 64.79 | | stove | 73.68 | 89.81 | | palm | 46.84 | 65.86 | | kitchen island | 34.76 | 43.6 | | computer | 76.54 | 84.15 | | swivel chair | 44.78 | 55.34 | | boat | 73.5 | 84.39 | | bar | 20.72 | 24.37 | | arcade machine | 74.63 | 79.93 | | hovel | 7.1 | 7.45 | | bus | 90.11 | 94.66 | | towel | 64.01 | 76.11 | | light | 52.47 | 65.79 | | truck | 40.82 | 56.97 | | tower | 27.28 | 44.95 | | chandelier | 68.24 | 84.68 | | awning | 33.45 | 42.91 | | streetlight | 35.01 | 47.01 | | booth | 42.82 | 57.33 | | television receiver | 68.41 | 89.15 | | airplane | 67.41 | 78.17 | | dirt track | 6.23 | 11.0 | | apparel | 30.79 | 47.11 | | pole | 26.57 | 36.03 | | land | 1.89 | 2.41 | | bannister | 11.35 | 14.89 | | escalator | 20.78 | 21.65 | | ottoman | 49.46 | 71.02 | | bottle | 22.57 | 29.59 | | buffet | 56.84 | 62.11 | | poster | 38.46 | 53.64 | | stage | 10.59 | 19.33 | | van | 42.13 | 55.83 | | ship | 8.66 | 9.05 | | fountain | 24.13 | 24.43 | | conveyer belt | 67.18 | 88.7 | | canopy | 29.33 | 44.99 | | washer | 68.07 | 70.91 | | plaything | 31.96 | 39.52 | | swimming pool | 56.43 | 85.69 | | stool | 41.53 | 53.75 | | barrel | 17.08 | 52.24 | | basket | 33.82 | 53.49 | | waterfall | 46.71 | 53.33 | | tent | 64.43 | 96.94 | | bag | 18.44 | 24.52 | | minibike | 72.3 | 85.88 | | cradle | 85.13 | 95.77 | | oven | 18.22 | 20.09 | | ball | 49.57 | 73.24 | | food | 45.21 | 47.41 | | step | 17.25 | 19.82 | | tank | 51.43 | 57.32 | | trade name | 23.45 | 26.35 | | microwave | 85.47 | 93.39 | | pot | 50.68 | 62.88 | | animal | 62.78 | 65.59 | | bicycle | 58.24 | 79.83 | | lake | 32.4 | 65.86 | | dishwasher | 61.53 | 63.49 | | screen | 75.74 | 89.75 | | blanket | 20.48 | 25.23 | | sculpture | 42.85 | 85.79 | | hood | 60.51 | 73.21 | | sconce | 38.59 | 50.27 | | vase | 43.75 | 58.63 | | traffic light | 40.57 | 59.99 | | tray | 14.93 | 27.49 | | ashcan | 35.46 | 49.17 | | fan | 59.41 | 74.36 | | pier | 35.45 | 45.25 | | crt screen | 7.8 | 11.59 | | plate | 54.75 | 75.99 | | monitor | 22.73 | 25.42 | | bulletin board | 44.42 | 49.98 | | shower | 3.84 | 11.71 | | radiator | 58.87 | 64.77 | | glass | 18.7 | 21.96 | | clock | 31.87 | 40.09 | | flag | 41.62 | 51.57 | +---------------------+-------+-------+ 2024/01/06 19:30:24 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.4900 coco/bbox_mAP_50: 0.6740 coco/bbox_mAP_75: 0.5370 coco/bbox_mAP_s: 0.3310 coco/bbox_mAP_m: 0.5430 coco/bbox_mAP_l: 0.6330 coco/segm_mAP: 0.3270 coco/segm_mAP_50: 0.5900 coco/segm_mAP_75: 0.3210 coco/segm_mAP_s: 0.1820 coco/segm_mAP_m: 0.3690 coco/segm_mAP_l: 0.5020 Bleu_1: 0.7483 Bleu_2: 0.5797 Bleu_3: 0.4340 Bleu_4: 0.3218 METEOR: 0.2611 ROUGE_L: 0.5458 CIDEr: 1.0354 SPICE: 0.1931 aAcc: 83.4500 mIoU: 49.1900 mAcc: 61.4200 visual-grounding/miou: 0.7794 visual-grounding/acc: 0.8456 data_time: 0.0121 time: 1.8940 2024/01/06 19:43:17 - mmengine - INFO - Iter(train) [260500/640000] base_lr: 1.2879e-04 lr: 1.2879e-05 eta: 6 days, 17:56:56 time: 1.5275 data_time: 0.0201 memory: 34661 grad_norm: 3.1994 loss: 1.3089 caption_loss_cls: 2.2394 detection_loss_cls: 0.0336 detection_loss_reg: 0.3413 semantic_segmentation_loss_cls: 0.0084 grounding_loss_reg: 2.6703 instance_segmentation_loss_cls: 0.0325 instance_segmentation_loss_reg: 0.3330 instance_segmentation_loss_poly: 0.9006 2024/01/06 19:55:32 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 19:55:32 - mmengine - INFO - Iter(train) [261000/640000] base_lr: 1.2856e-04 lr: 1.2856e-05 eta: 6 days, 17:38:46 time: 1.5233 data_time: 0.0200 memory: 25570 grad_norm: 3.1629 loss: 1.2986 caption_loss_cls: 2.2322 detection_loss_cls: 0.0336 detection_loss_reg: 0.3416 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6612 instance_segmentation_loss_cls: 0.0324 instance_segmentation_loss_reg: 0.3325 instance_segmentation_loss_poly: 0.8986 2024/01/06 20:08:23 - mmengine - INFO - Iter(train) [261500/640000] base_lr: 1.2832e-04 lr: 1.2832e-05 eta: 6 days, 17:26:22 time: 1.5270 data_time: 0.0200 memory: 25570 grad_norm: 3.1568 loss: 1.2859 caption_loss_cls: 2.2362 detection_loss_cls: 0.0336 detection_loss_reg: 0.3419 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6582 instance_segmentation_loss_cls: 0.0323 instance_segmentation_loss_reg: 0.3324 instance_segmentation_loss_poly: 0.8970 2024/01/06 20:21:06 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 20:21:06 - mmengine - INFO - Iter(train) [262000/640000] base_lr: 1.2809e-04 lr: 1.2809e-05 eta: 6 days, 17:12:50 time: 1.5280 data_time: 0.0199 memory: 25570 grad_norm: 3.1582 loss: 1.2862 caption_loss_cls: 2.2337 detection_loss_cls: 0.0335 detection_loss_reg: 0.3413 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6591 instance_segmentation_loss_cls: 0.0324 instance_segmentation_loss_reg: 0.3336 instance_segmentation_loss_poly: 0.8997 2024/01/06 20:21:06 - mmengine - INFO - Saving checkpoint at 262000 iterations 2024/01/06 20:34:03 - mmengine - INFO - Iter(train) [262500/640000] base_lr: 1.2785e-04 lr: 1.2785e-05 eta: 6 days, 17:01:24 time: 1.5229 data_time: 0.0199 memory: 25570 grad_norm: 3.1591 loss: 1.2876 caption_loss_cls: 2.2381 detection_loss_cls: 0.0334 detection_loss_reg: 0.3415 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6622 instance_segmentation_loss_cls: 0.0326 instance_segmentation_loss_reg: 0.3344 instance_segmentation_loss_poly: 0.9020 2024/01/06 20:46:31 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 20:46:31 - mmengine - INFO - Iter(train) [263000/640000] base_lr: 1.2762e-04 lr: 1.2762e-05 eta: 6 days, 16:45:34 time: 1.5147 data_time: 0.0198 memory: 25570 grad_norm: 3.1181 loss: 1.2956 caption_loss_cls: 2.2321 detection_loss_cls: 0.0336 detection_loss_reg: 0.3421 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6625 instance_segmentation_loss_cls: 0.0326 instance_segmentation_loss_reg: 0.3346 instance_segmentation_loss_poly: 0.9027 2024/01/06 20:59:06 - mmengine - INFO - Iter(train) [263500/640000] base_lr: 1.2738e-04 lr: 1.2738e-05 eta: 6 days, 16:30:56 time: 1.5151 data_time: 0.0199 memory: 25570 grad_norm: 3.0708 loss: 1.2982 caption_loss_cls: 2.2268 detection_loss_cls: 0.0334 detection_loss_reg: 0.3422 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6633 instance_segmentation_loss_cls: 0.0327 instance_segmentation_loss_reg: 0.3363 instance_segmentation_loss_poly: 0.9079 2024/01/06 21:11:37 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 21:11:37 - mmengine - INFO - Iter(train) [264000/640000] base_lr: 1.2714e-04 lr: 1.2714e-05 eta: 6 days, 16:15:32 time: 1.5182 data_time: 0.0199 memory: 25570 grad_norm: 3.0605 loss: 1.2967 caption_loss_cls: 2.2263 detection_loss_cls: 0.0335 detection_loss_reg: 0.3430 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6620 instance_segmentation_loss_cls: 0.0327 instance_segmentation_loss_reg: 0.3374 instance_segmentation_loss_poly: 0.9096 2024/01/06 21:11:37 - mmengine - INFO - Saving checkpoint at 264000 iterations 2024/01/06 21:24:49 - mmengine - INFO - Iter(train) [264500/640000] base_lr: 1.2691e-04 lr: 1.2691e-05 eta: 6 days, 16:06:22 time: 1.5224 data_time: 0.0266 memory: 25570 grad_norm: 3.0634 loss: 1.3102 caption_loss_cls: 2.2239 detection_loss_cls: 0.0337 detection_loss_reg: 0.3443 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6590 instance_segmentation_loss_cls: 0.0329 instance_segmentation_loss_reg: 0.3394 instance_segmentation_loss_poly: 0.9117 2024/01/06 21:37:38 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 21:37:38 - mmengine - INFO - Iter(train) [265000/640000] base_lr: 1.2667e-04 lr: 1.2667e-05 eta: 6 days, 15:53:49 time: 1.5310 data_time: 0.0267 memory: 25570 grad_norm: 3.0678 loss: 1.3132 caption_loss_cls: 2.2189 detection_loss_cls: 0.0336 detection_loss_reg: 0.3443 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6574 instance_segmentation_loss_cls: 0.0328 instance_segmentation_loss_reg: 0.3389 instance_segmentation_loss_poly: 0.9109 2024/01/06 21:50:24 - mmengine - INFO - Iter(train) [265500/640000] base_lr: 1.2644e-04 lr: 1.2644e-05 eta: 6 days, 15:40:44 time: 1.5298 data_time: 0.0268 memory: 25570 grad_norm: 3.0708 loss: 1.3164 caption_loss_cls: 2.2172 detection_loss_cls: 0.0335 detection_loss_reg: 0.3439 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6561 instance_segmentation_loss_cls: 0.0328 instance_segmentation_loss_reg: 0.3393 instance_segmentation_loss_poly: 0.9111 2024/01/06 22:03:15 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 22:03:15 - mmengine - INFO - Iter(train) [266000/640000] base_lr: 1.2620e-04 lr: 1.2620e-05 eta: 6 days, 15:28:22 time: 1.5316 data_time: 0.0269 memory: 25570 grad_norm: 3.0550 loss: 1.3131 caption_loss_cls: 2.2130 detection_loss_cls: 0.0335 detection_loss_reg: 0.3434 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6561 instance_segmentation_loss_cls: 0.0329 instance_segmentation_loss_reg: 0.3415 instance_segmentation_loss_poly: 0.9162 2024/01/06 22:03:15 - mmengine - INFO - Saving checkpoint at 266000 iterations 2024/01/06 22:16:38 - mmengine - INFO - Iter(train) [266500/640000] base_lr: 1.2596e-04 lr: 1.2596e-05 eta: 6 days, 15:20:36 time: 1.5384 data_time: 0.0268 memory: 25570 grad_norm: 3.0498 loss: 1.3006 caption_loss_cls: 2.2173 detection_loss_cls: 0.0337 detection_loss_reg: 0.3451 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6549 instance_segmentation_loss_cls: 0.0329 instance_segmentation_loss_reg: 0.3408 instance_segmentation_loss_poly: 0.9139 2024/01/06 22:29:15 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 22:29:15 - mmengine - INFO - Iter(train) [267000/640000] base_lr: 1.2572e-04 lr: 1.2572e-05 eta: 6 days, 15:06:16 time: 1.5406 data_time: 0.0269 memory: 25570 grad_norm: 3.0373 loss: 1.3054 caption_loss_cls: 2.2124 detection_loss_cls: 0.0337 detection_loss_reg: 0.3449 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6530 instance_segmentation_loss_cls: 0.0327 instance_segmentation_loss_reg: 0.3400 instance_segmentation_loss_poly: 0.9124 2024/01/06 22:42:16 - mmengine - INFO - Iter(train) [267500/640000] base_lr: 1.2549e-04 lr: 1.2549e-05 eta: 6 days, 14:55:09 time: 1.5468 data_time: 0.0270 memory: 25570 grad_norm: 3.0181 loss: 1.2992 caption_loss_cls: 2.2076 detection_loss_cls: 0.0336 detection_loss_reg: 0.3445 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6544 instance_segmentation_loss_cls: 0.0329 instance_segmentation_loss_reg: 0.3415 instance_segmentation_loss_poly: 0.9164 2024/01/06 22:54:50 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 22:54:50 - mmengine - INFO - Iter(train) [268000/640000] base_lr: 1.2525e-04 lr: 1.2525e-05 eta: 6 days, 14:40:27 time: 1.5478 data_time: 0.0269 memory: 25570 grad_norm: 3.0027 loss: 1.2936 caption_loss_cls: 2.2132 detection_loss_cls: 0.0337 detection_loss_reg: 0.3448 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6535 instance_segmentation_loss_cls: 0.0328 instance_segmentation_loss_reg: 0.3418 instance_segmentation_loss_poly: 0.9173 2024/01/06 22:54:50 - mmengine - INFO - Saving checkpoint at 268000 iterations 2024/01/06 23:08:13 - mmengine - INFO - Iter(train) [268500/640000] base_lr: 1.2501e-04 lr: 1.2501e-05 eta: 6 days, 14:32:19 time: 1.5505 data_time: 0.0269 memory: 25570 grad_norm: 2.9632 loss: 1.2812 caption_loss_cls: 2.2185 detection_loss_cls: 0.0338 detection_loss_reg: 0.3463 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6548 instance_segmentation_loss_cls: 0.0328 instance_segmentation_loss_reg: 0.3413 instance_segmentation_loss_poly: 0.9182 2024/01/06 23:20:59 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 23:20:59 - mmengine - INFO - Iter(train) [269000/640000] base_lr: 1.2477e-04 lr: 1.2477e-05 eta: 6 days, 14:19:09 time: 1.5496 data_time: 0.0269 memory: 25570 grad_norm: 2.9110 loss: 1.2874 caption_loss_cls: 2.2221 detection_loss_cls: 0.0337 detection_loss_reg: 0.3453 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6511 instance_segmentation_loss_cls: 0.0328 instance_segmentation_loss_reg: 0.3411 instance_segmentation_loss_poly: 0.9176 2024/01/06 23:33:39 - mmengine - INFO - Iter(train) [269500/640000] base_lr: 1.2454e-04 lr: 1.2454e-05 eta: 6 days, 14:05:20 time: 1.5483 data_time: 0.0269 memory: 25570 grad_norm: 2.9333 loss: 1.3003 caption_loss_cls: 2.2141 detection_loss_cls: 0.0338 detection_loss_reg: 0.3472 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6536 instance_segmentation_loss_cls: 0.0326 instance_segmentation_loss_reg: 0.3389 instance_segmentation_loss_poly: 0.9130 2024/01/06 23:46:52 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/06 23:46:52 - mmengine - INFO - Iter(train) [270000/640000] base_lr: 1.2430e-04 lr: 1.2430e-05 eta: 6 days, 13:55:43 time: 1.5539 data_time: 0.0269 memory: 25570 grad_norm: 2.9250 loss: 1.3023 caption_loss_cls: 2.2178 detection_loss_cls: 0.0336 detection_loss_reg: 0.3457 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6500 instance_segmentation_loss_cls: 0.0326 instance_segmentation_loss_reg: 0.3385 instance_segmentation_loss_poly: 0.9114 2024/01/06 23:46:52 - mmengine - INFO - Saving checkpoint at 270000 iterations 2024/01/07 00:00:05 - mmengine - INFO - Iter(train) [270500/640000] base_lr: 1.2406e-04 lr: 1.2406e-05 eta: 6 days, 13:46:00 time: 1.5512 data_time: 0.0268 memory: 25570 grad_norm: 2.9285 loss: 1.3045 caption_loss_cls: 2.2147 detection_loss_cls: 0.0336 detection_loss_reg: 0.3453 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.6471 instance_segmentation_loss_cls: 0.0326 instance_segmentation_loss_reg: 0.3392 instance_segmentation_loss_poly: 0.9142 2024/01/07 00:13:12 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/07 00:13:12 - mmengine - INFO - Iter(train) [271000/640000] base_lr: 1.2382e-04 lr: 1.2382e-05 eta: 6 days, 13:35:30 time: 1.5587 data_time: 0.0269 memory: 25570 grad_norm: 2.8720 loss: 1.2949 caption_loss_cls: 2.2143 detection_loss_cls: 0.0336 detection_loss_reg: 0.3447 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.6484 instance_segmentation_loss_cls: 0.0327 instance_segmentation_loss_reg: 0.3396 instance_segmentation_loss_poly: 0.9159 2024/01/07 00:26:07 - mmengine - INFO - Iter(train) [271500/640000] base_lr: 1.2358e-04 lr: 1.2358e-05 eta: 6 days, 13:23:23 time: 1.5573 data_time: 0.0268 memory: 25570 grad_norm: 2.8665 loss: 1.2762 caption_loss_cls: 2.2134 detection_loss_cls: 0.0335 detection_loss_reg: 0.3464 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.6485 instance_segmentation_loss_cls: 0.0326 instance_segmentation_loss_reg: 0.3391 instance_segmentation_loss_poly: 0.9141 2024/01/07 00:39:05 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/07 00:39:05 - mmengine - INFO - Iter(train) [272000/640000] base_lr: 1.2335e-04 lr: 1.2335e-05 eta: 6 days, 13:11:39 time: 1.5632 data_time: 0.0269 memory: 25570 grad_norm: 2.8927 loss: 1.2746 caption_loss_cls: 2.2198 detection_loss_cls: 0.0335 detection_loss_reg: 0.3466 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.6480 instance_segmentation_loss_cls: 0.0326 instance_segmentation_loss_reg: 0.3390 instance_segmentation_loss_poly: 0.9140 2024/01/07 00:39:05 - mmengine - INFO - Saving checkpoint at 272000 iterations 2024/01/07 00:52:25 - mmengine - INFO - Iter(train) [272500/640000] base_lr: 1.2311e-04 lr: 1.2311e-05 eta: 6 days, 13:02:32 time: 1.5624 data_time: 0.0269 memory: 25570 grad_norm: 2.9424 loss: 1.2820 caption_loss_cls: 2.2227 detection_loss_cls: 0.0335 detection_loss_reg: 0.3471 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.6491 instance_segmentation_loss_cls: 0.0327 instance_segmentation_loss_reg: 0.3384 instance_segmentation_loss_poly: 0.9139 2024/01/07 01:04:44 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/07 01:04:44 - mmengine - INFO - Iter(train) [273000/640000] base_lr: 1.2287e-04 lr: 1.2287e-05 eta: 6 days, 12:46:06 time: 1.5558 data_time: 0.0269 memory: 25570 grad_norm: 2.9867 loss: 1.2906 caption_loss_cls: 2.2200 detection_loss_cls: 0.0337 detection_loss_reg: 0.3492 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.6452 instance_segmentation_loss_cls: 0.0326 instance_segmentation_loss_reg: 0.3391 instance_segmentation_loss_poly: 0.9140 2024/01/07 01:17:22 - mmengine - INFO - Iter(train) [273500/640000] base_lr: 1.2263e-04 lr: 1.2263e-05 eta: 6 days, 12:31:56 time: 1.5551 data_time: 0.0269 memory: 25570 grad_norm: 3.0746 loss: 1.2899 caption_loss_cls: 2.2198 detection_loss_cls: 0.0340 detection_loss_reg: 0.3510 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.6434 instance_segmentation_loss_cls: 0.0325 instance_segmentation_loss_reg: 0.3390 instance_segmentation_loss_poly: 0.9135 2024/01/07 01:30:03 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/07 01:30:03 - mmengine - INFO - Iter(train) [274000/640000] base_lr: 1.2239e-04 lr: 1.2239e-05 eta: 6 days, 12:18:14 time: 1.5472 data_time: 0.0268 memory: 25570 grad_norm: 3.1198 loss: 1.2890 caption_loss_cls: 2.2176 detection_loss_cls: 0.0340 detection_loss_reg: 0.3510 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6380 instance_segmentation_loss_cls: 0.0325 instance_segmentation_loss_reg: 0.3383 instance_segmentation_loss_poly: 0.9127 2024/01/07 01:30:03 - mmengine - INFO - Saving checkpoint at 274000 iterations 2024/01/07 01:42:59 - mmengine - INFO - Iter(train) [274500/640000] base_lr: 1.2215e-04 lr: 1.2215e-05 eta: 6 days, 12:06:12 time: 1.5429 data_time: 0.0268 memory: 25570 grad_norm: 3.1160 loss: 1.2845 caption_loss_cls: 2.2201 detection_loss_cls: 0.0342 detection_loss_reg: 0.3522 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6346 instance_segmentation_loss_cls: 0.0324 instance_segmentation_loss_reg: 0.3379 instance_segmentation_loss_poly: 0.9116 2024/01/07 01:55:56 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/07 01:55:56 - mmengine - INFO - Iter(train) [275000/640000] base_lr: 1.2191e-04 lr: 1.2191e-05 eta: 6 days, 11:54:22 time: 1.5404 data_time: 0.0268 memory: 25570 grad_norm: 3.1328 loss: 1.2854 caption_loss_cls: 2.2181 detection_loss_cls: 0.0338 detection_loss_reg: 0.3506 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6282 instance_segmentation_loss_cls: 0.0324 instance_segmentation_loss_reg: 0.3378 instance_segmentation_loss_poly: 0.9103 2024/01/07 02:08:38 - mmengine - INFO - Iter(train) [275500/640000] base_lr: 1.2167e-04 lr: 1.2167e-05 eta: 6 days, 11:40:43 time: 1.5372 data_time: 0.0268 memory: 25570 grad_norm: 3.2070 loss: 1.2985 caption_loss_cls: 2.2196 detection_loss_cls: 0.0337 detection_loss_reg: 0.3495 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6252 instance_segmentation_loss_cls: 0.0325 instance_segmentation_loss_reg: 0.3372 instance_segmentation_loss_poly: 0.9084 2024/01/07 02:21:24 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240106_022733 2024/01/07 02:21:24 - mmengine - INFO - Iter(train) [276000/640000] base_lr: 1.2143e-04 lr: 1.2143e-05 eta: 6 days, 11:27:34 time: 1.5343 data_time: 0.0267 memory: 25570 grad_norm: 3.1874 loss: 1.2915 caption_loss_cls: 2.2214 detection_loss_cls: 0.0337 detection_loss_reg: 0.3503 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6229 instance_segmentation_loss_cls: 0.0325 instance_segmentation_loss_reg: 0.3366 instance_segmentation_loss_poly: 0.9058 2024/01/07 02:21:24 - mmengine - INFO - Saving checkpoint at 276000 iterations 2024/01/07 02:45:17 - mmengine - INFO - Iter(train) [276500/640000] base_lr: 1.2119e-04 lr: 1.2119e-05 eta: 6 days, 6:55:58 time: 1.5213 data_time: 0.0192 memory: 25565 grad_norm: 3.2024 loss: 1.2812 caption_loss_cls: 2.2178 detection_loss_cls: 0.0336 detection_loss_reg: 0.3496 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6199 instance_segmentation_loss_cls: 0.0326 instance_segmentation_loss_reg: 0.3370 instance_segmentation_loss_poly: 0.9078 2024/01/07 02:57:35 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 02:57:35 - mmengine - INFO - Iter(train) [277000/640000] base_lr: 1.2095e-04 lr: 1.2095e-05 eta: 6 days, 5:40:38 time: 1.5208 data_time: 0.0190 memory: 25565 grad_norm: 3.2282 loss: 1.2806 caption_loss_cls: 2.2145 detection_loss_cls: 0.0337 detection_loss_reg: 0.3512 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6144 instance_segmentation_loss_cls: 0.0326 instance_segmentation_loss_reg: 0.3376 instance_segmentation_loss_poly: 0.9086 2024/01/07 03:10:08 - mmengine - INFO - Iter(train) [277500/640000] base_lr: 1.2071e-04 lr: 1.2071e-05 eta: 6 days, 6:12:11 time: 1.5197 data_time: 0.0188 memory: 25565 grad_norm: 3.1800 loss: 1.2780 caption_loss_cls: 2.2189 detection_loss_cls: 0.0336 detection_loss_reg: 0.3508 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6134 instance_segmentation_loss_cls: 0.0325 instance_segmentation_loss_reg: 0.3369 instance_segmentation_loss_poly: 0.9058 2024/01/07 03:23:00 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 03:23:00 - mmengine - INFO - Iter(train) [278000/640000] base_lr: 1.2047e-04 lr: 1.2047e-05 eta: 6 days, 7:18:52 time: 1.5224 data_time: 0.0186 memory: 25565 grad_norm: 3.1687 loss: 1.2773 caption_loss_cls: 2.2158 detection_loss_cls: 0.0336 detection_loss_reg: 0.3509 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6077 instance_segmentation_loss_cls: 0.0323 instance_segmentation_loss_reg: 0.3357 instance_segmentation_loss_poly: 0.9021 2024/01/07 03:23:00 - mmengine - INFO - Saving checkpoint at 278000 iterations 2024/01/07 03:36:06 - mmengine - INFO - Iter(train) [278500/640000] base_lr: 1.2023e-04 lr: 1.2023e-05 eta: 6 days, 8:25:39 time: 1.5249 data_time: 0.0178 memory: 25565 grad_norm: 3.1766 loss: 1.2933 caption_loss_cls: 2.2193 detection_loss_cls: 0.0337 detection_loss_reg: 0.3521 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6029 instance_segmentation_loss_cls: 0.0324 instance_segmentation_loss_reg: 0.3367 instance_segmentation_loss_poly: 0.9050 2024/01/07 03:48:56 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 03:48:56 - mmengine - INFO - Iter(train) [279000/640000] base_lr: 1.1999e-04 lr: 1.1999e-05 eta: 6 days, 8:34:20 time: 1.5229 data_time: 0.0175 memory: 25565 grad_norm: 3.2463 loss: 1.3055 caption_loss_cls: 2.2263 detection_loss_cls: 0.0337 detection_loss_reg: 0.3510 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.6002 instance_segmentation_loss_cls: 0.0325 instance_segmentation_loss_reg: 0.3385 instance_segmentation_loss_poly: 0.9104 2024/01/07 04:01:42 - mmengine - INFO - Iter(train) [279500/640000] base_lr: 1.1975e-04 lr: 1.1975e-05 eta: 6 days, 8:30:01 time: 1.5240 data_time: 0.0172 memory: 25565 grad_norm: 3.2503 loss: 1.3046 caption_loss_cls: 2.2326 detection_loss_cls: 0.0336 detection_loss_reg: 0.3494 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.5931 instance_segmentation_loss_cls: 0.0324 instance_segmentation_loss_reg: 0.3375 instance_segmentation_loss_poly: 0.9068 2024/01/07 04:14:44 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 04:14:44 - mmengine - INFO - Iter(train) [280000/640000] base_lr: 1.1951e-04 lr: 1.1951e-05 eta: 6 days, 8:47:28 time: 1.5279 data_time: 0.0170 memory: 25565 grad_norm: 3.2101 loss: 1.3064 caption_loss_cls: 2.2304 detection_loss_cls: 0.0334 detection_loss_reg: 0.3481 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.5939 instance_segmentation_loss_cls: 0.0324 instance_segmentation_loss_reg: 0.3384 instance_segmentation_loss_poly: 0.9088 2024/01/07 04:14:44 - mmengine - INFO - Saving checkpoint at 280000 iterations 2024/01/07 04:26:15 - mmengine - INFO - Evaluating bbox... 2024/01/07 04:27:13 - mmengine - INFO - bbox_mAP_copypaste: 0.499 0.682 0.548 0.350 0.550 0.640 2024/01/07 04:27:13 - mmengine - INFO - Evaluating segm... 2024/01/07 04:28:25 - mmengine - INFO - segm_mAP_copypaste: 0.331 0.592 0.325 0.196 0.374 0.506 2024/01/07 04:30:36 - mmengine - INFO - Evaluating bbox... 2024/01/07 04:31:34 - mmengine - INFO - bbox_mAP_copypaste: 0.499 0.680 0.548 0.347 0.550 0.640 2024/01/07 04:37:44 - mmengine - INFO - per class results: 2024/01/07 04:37:44 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 78.73 | 87.84 | | building | 80.56 | 88.16 | | sky | 93.08 | 97.96 | | floor | 82.77 | 90.18 | | tree | 74.22 | 86.75 | | ceiling | 84.54 | 93.55 | | road | 83.61 | 91.9 | | bed | 89.7 | 94.73 | | windowpane | 64.17 | 77.78 | | grass | 67.31 | 89.48 | | cabinet | 60.92 | 71.66 | | sidewalk | 67.09 | 75.53 | | person | 81.12 | 90.98 | | earth | 37.14 | 49.04 | | door | 51.16 | 74.82 | | table | 64.37 | 77.79 | | mountain | 55.14 | 68.69 | | plant | 52.32 | 63.99 | | curtain | 75.32 | 88.17 | | chair | 61.32 | 75.91 | | car | 84.77 | 90.94 | | water | 57.74 | 66.0 | | painting | 72.35 | 87.29 | | sofa | 69.17 | 78.5 | | shelf | 47.26 | 68.89 | | house | 43.14 | 78.13 | | sea | 69.82 | 81.48 | | mirror | 67.73 | 75.73 | | rug | 69.41 | 84.82 | | field | 26.45 | 35.04 | | armchair | 50.48 | 73.34 | | seat | 65.31 | 79.2 | | fence | 45.76 | 63.38 | | desk | 53.49 | 69.29 | | rock | 46.73 | 80.66 | | wardrobe | 46.69 | 75.07 | | lamp | 63.2 | 76.27 | | bathtub | 76.88 | 90.37 | | railing | 38.83 | 55.54 | | cushion | 55.94 | 68.92 | | base | 28.24 | 39.56 | | box | 27.02 | 34.05 | | column | 48.61 | 59.89 | | signboard | 36.91 | 49.69 | | chest of drawers | 38.62 | 69.63 | | counter | 32.09 | 50.78 | | sand | 45.89 | 60.33 | | sink | 74.84 | 86.97 | | skyscraper | 50.89 | 69.06 | | fireplace | 72.99 | 92.6 | | refrigerator | 74.14 | 88.23 | | grandstand | 45.06 | 65.99 | | path | 22.72 | 35.97 | | stairs | 34.0 | 41.04 | | runway | 57.82 | 69.04 | | case | 48.63 | 59.25 | | pool table | 90.96 | 96.11 | | pillow | 61.04 | 77.53 | | screen door | 46.08 | 46.65 | | stairway | 34.73 | 59.46 | | river | 14.86 | 43.57 | | bridge | 56.92 | 69.38 | | bookcase | 37.36 | 54.54 | | blind | 41.15 | 45.69 | | coffee table | 60.76 | 86.07 | | toilet | 85.16 | 90.6 | | flower | 36.41 | 49.56 | | book | 46.89 | 63.09 | | hill | 10.79 | 19.59 | | bench | 62.64 | 74.58 | | countertop | 58.63 | 72.48 | | stove | 80.91 | 86.17 | | palm | 50.91 | 73.13 | | kitchen island | 46.82 | 76.31 | | computer | 77.21 | 90.54 | | swivel chair | 42.18 | 52.57 | | boat | 69.07 | 86.33 | | bar | 38.54 | 47.83 | | arcade machine | 53.61 | 58.66 | | hovel | 22.4 | 84.56 | | bus | 90.21 | 95.78 | | towel | 64.99 | 77.29 | | light | 45.01 | 51.98 | | truck | 46.25 | 59.73 | | tower | 33.48 | 67.79 | | chandelier | 66.39 | 82.91 | | awning | 34.97 | 51.46 | | streetlight | 33.49 | 40.56 | | booth | 51.98 | 67.22 | | television receiver | 69.08 | 83.48 | | airplane | 66.0 | 84.37 | | dirt track | 6.16 | 14.13 | | apparel | 23.47 | 37.24 | | pole | 29.14 | 42.6 | | land | 3.73 | 5.86 | | bannister | 17.16 | 22.3 | | escalator | 26.42 | 27.46 | | ottoman | 44.54 | 68.91 | | bottle | 28.27 | 35.64 | | buffet | 57.47 | 68.68 | | poster | 31.7 | 39.83 | | stage | 9.27 | 17.17 | | van | 48.04 | 61.57 | | ship | 22.65 | 24.36 | | fountain | 26.65 | 27.12 | | conveyer belt | 75.85 | 92.15 | | canopy | 28.93 | 41.98 | | washer | 65.42 | 68.29 | | plaything | 34.18 | 51.3 | | swimming pool | 59.74 | 65.85 | | stool | 46.51 | 60.44 | | barrel | 27.33 | 65.04 | | basket | 38.12 | 50.12 | | waterfall | 63.09 | 66.01 | | tent | 70.77 | 96.42 | | bag | 20.42 | 27.66 | | minibike | 70.86 | 88.24 | | cradle | 77.99 | 96.34 | | oven | 51.15 | 61.87 | | ball | 46.37 | 67.57 | | food | 54.88 | 61.69 | | step | 8.55 | 10.83 | | tank | 52.7 | 65.15 | | trade name | 29.76 | 40.93 | | microwave | 86.25 | 93.63 | | pot | 48.88 | 57.71 | | animal | 64.68 | 67.64 | | bicycle | 60.71 | 75.4 | | lake | 57.76 | 63.06 | | dishwasher | 62.91 | 65.44 | | screen | 73.46 | 90.41 | | blanket | 18.49 | 21.81 | | sculpture | 55.37 | 83.39 | | hood | 55.07 | 68.22 | | sconce | 43.64 | 58.58 | | vase | 41.82 | 52.1 | | traffic light | 38.9 | 66.56 | | tray | 13.72 | 24.65 | | ashcan | 37.53 | 58.78 | | fan | 61.48 | 77.25 | | pier | 44.34 | 61.22 | | crt screen | 1.52 | 2.76 | | plate | 57.68 | 78.1 | | monitor | 29.06 | 36.91 | | bulletin board | 43.73 | 57.43 | | shower | 0.17 | 0.27 | | radiator | 59.39 | 69.18 | | glass | 17.4 | 19.21 | | clock | 33.45 | 47.63 | | flag | 51.01 | 59.36 | +---------------------+-------+-------+ 2024/01/07 04:37:58 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.4990 coco/bbox_mAP_50: 0.6800 coco/bbox_mAP_75: 0.5480 coco/bbox_mAP_s: 0.3470 coco/bbox_mAP_m: 0.5500 coco/bbox_mAP_l: 0.6400 coco/segm_mAP: 0.3310 coco/segm_mAP_50: 0.5920 coco/segm_mAP_75: 0.3250 coco/segm_mAP_s: 0.1960 coco/segm_mAP_m: 0.3740 coco/segm_mAP_l: 0.5060 Bleu_1: 0.7423 Bleu_2: 0.5724 Bleu_3: 0.4283 Bleu_4: 0.3168 METEOR: 0.2637 ROUGE_L: 0.5452 CIDEr: 1.0350 SPICE: 0.1979 aAcc: 83.2900 mIoU: 50.5200 mAcc: 63.8700 visual-grounding/miou: 0.8000 visual-grounding/acc: 0.8672 data_time: 0.0285 time: 1.9196 2024/01/07 04:50:19 - mmengine - INFO - Iter(train) [280500/640000] base_lr: 1.1927e-04 lr: 1.1927e-05 eta: 6 days, 8:07:42 time: 1.5270 data_time: 0.0178 memory: 34658 grad_norm: 3.1785 loss: 1.3197 caption_loss_cls: 2.2315 detection_loss_cls: 0.0334 detection_loss_reg: 0.3481 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.5929 instance_segmentation_loss_cls: 0.0325 instance_segmentation_loss_reg: 0.3386 instance_segmentation_loss_poly: 0.9087 2024/01/07 05:03:11 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 05:03:11 - mmengine - INFO - Iter(train) [281000/640000] base_lr: 1.1903e-04 lr: 1.1903e-05 eta: 6 days, 8:06:51 time: 1.5356 data_time: 0.0179 memory: 25565 grad_norm: 3.0985 loss: 1.2900 caption_loss_cls: 2.2339 detection_loss_cls: 0.0333 detection_loss_reg: 0.3491 semantic_segmentation_loss_cls: 0.0083 grounding_loss_reg: 2.5922 instance_segmentation_loss_cls: 0.0327 instance_segmentation_loss_reg: 0.3398 instance_segmentation_loss_poly: 0.9102 2024/01/07 05:16:40 - mmengine - INFO - Iter(train) [281500/640000] base_lr: 1.1879e-04 lr: 1.1879e-05 eta: 6 days, 8:43:47 time: 1.5494 data_time: 0.0183 memory: 25565 grad_norm: 3.0539 loss: 1.2846 caption_loss_cls: 2.2339 detection_loss_cls: 0.0331 detection_loss_reg: 0.3475 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.5904 instance_segmentation_loss_cls: 0.0326 instance_segmentation_loss_reg: 0.3393 instance_segmentation_loss_poly: 0.9107 2024/01/07 05:29:07 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 05:29:07 - mmengine - INFO - Iter(train) [282000/640000] base_lr: 1.1855e-04 lr: 1.1855e-05 eta: 6 days, 8:10:35 time: 1.5430 data_time: 0.0183 memory: 25565 grad_norm: 3.0269 loss: 1.2837 caption_loss_cls: 2.2288 detection_loss_cls: 0.0331 detection_loss_reg: 0.3474 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.5893 instance_segmentation_loss_cls: 0.0325 instance_segmentation_loss_reg: 0.3384 instance_segmentation_loss_poly: 0.9083 2024/01/07 05:29:07 - mmengine - INFO - Saving checkpoint at 282000 iterations 2024/01/07 05:42:06 - mmengine - INFO - Iter(train) [282500/640000] base_lr: 1.1830e-04 lr: 1.1830e-05 eta: 6 days, 8:10:38 time: 1.5415 data_time: 0.0197 memory: 25565 grad_norm: 2.9665 loss: 1.2652 caption_loss_cls: 2.2210 detection_loss_cls: 0.0332 detection_loss_reg: 0.3462 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.5844 instance_segmentation_loss_cls: 0.0325 instance_segmentation_loss_reg: 0.3386 instance_segmentation_loss_poly: 0.9083 2024/01/07 05:54:58 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 05:54:58 - mmengine - INFO - Iter(train) [283000/640000] base_lr: 1.1806e-04 lr: 1.1806e-05 eta: 6 days, 8:02:39 time: 1.5420 data_time: 0.0198 memory: 25565 grad_norm: 2.8717 loss: 1.2538 caption_loss_cls: 2.2157 detection_loss_cls: 0.0333 detection_loss_reg: 0.3475 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.5846 instance_segmentation_loss_cls: 0.0324 instance_segmentation_loss_reg: 0.3372 instance_segmentation_loss_poly: 0.9061 2024/01/07 06:07:17 - mmengine - INFO - Iter(train) [283500/640000] base_lr: 1.1782e-04 lr: 1.1782e-05 eta: 6 days, 7:28:05 time: 1.5354 data_time: 0.0200 memory: 25565 grad_norm: 2.7766 loss: 1.2665 caption_loss_cls: 2.2155 detection_loss_cls: 0.0332 detection_loss_reg: 0.3478 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.5826 instance_segmentation_loss_cls: 0.0323 instance_segmentation_loss_reg: 0.3370 instance_segmentation_loss_poly: 0.9064 2024/01/07 06:20:08 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 06:20:08 - mmengine - INFO - Iter(train) [284000/640000] base_lr: 1.1758e-04 lr: 1.1758e-05 eta: 6 days, 7:19:36 time: 1.5326 data_time: 0.0201 memory: 25565 grad_norm: 2.7356 loss: 1.2586 caption_loss_cls: 2.2118 detection_loss_cls: 0.0332 detection_loss_reg: 0.3464 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.5789 instance_segmentation_loss_cls: 0.0324 instance_segmentation_loss_reg: 0.3378 instance_segmentation_loss_poly: 0.9091 2024/01/07 06:20:08 - mmengine - INFO - Saving checkpoint at 284000 iterations 2024/01/07 06:33:10 - mmengine - INFO - Iter(train) [284500/640000] base_lr: 1.1734e-04 lr: 1.1734e-05 eta: 6 days, 7:18:06 time: 1.5420 data_time: 0.0266 memory: 25565 grad_norm: 2.7200 loss: 1.2555 caption_loss_cls: 2.2044 detection_loss_cls: 0.0327 detection_loss_reg: 0.3431 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.5795 instance_segmentation_loss_cls: 0.0323 instance_segmentation_loss_reg: 0.3355 instance_segmentation_loss_poly: 0.9054 2024/01/07 06:45:45 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 06:45:45 - mmengine - INFO - Iter(train) [285000/640000] base_lr: 1.1710e-04 lr: 1.1710e-05 eta: 6 days, 6:58:00 time: 1.5379 data_time: 0.0266 memory: 25565 grad_norm: 2.7076 loss: 1.2699 caption_loss_cls: 2.2092 detection_loss_cls: 0.0325 detection_loss_reg: 0.3406 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.5810 instance_segmentation_loss_cls: 0.0324 instance_segmentation_loss_reg: 0.3353 instance_segmentation_loss_poly: 0.9046 2024/01/07 06:58:27 - mmengine - INFO - Iter(train) [285500/640000] base_lr: 1.1685e-04 lr: 1.1685e-05 eta: 6 days, 6:43:02 time: 1.5263 data_time: 0.0264 memory: 25565 grad_norm: 2.6601 loss: 1.2590 caption_loss_cls: 2.2117 detection_loss_cls: 0.0325 detection_loss_reg: 0.3407 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.5748 instance_segmentation_loss_cls: 0.0322 instance_segmentation_loss_reg: 0.3339 instance_segmentation_loss_poly: 0.8999 2024/01/07 07:11:09 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 07:11:09 - mmengine - INFO - Iter(train) [286000/640000] base_lr: 1.1661e-04 lr: 1.1661e-05 eta: 6 days, 6:27:59 time: 1.5300 data_time: 0.0265 memory: 25565 grad_norm: 2.6642 loss: 1.2621 caption_loss_cls: 2.2094 detection_loss_cls: 0.0324 detection_loss_reg: 0.3402 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.5716 instance_segmentation_loss_cls: 0.0321 instance_segmentation_loss_reg: 0.3324 instance_segmentation_loss_poly: 0.8972 2024/01/07 07:11:09 - mmengine - INFO - Saving checkpoint at 286000 iterations 2024/01/07 07:24:27 - mmengine - INFO - Iter(train) [286500/640000] base_lr: 1.1637e-04 lr: 1.1637e-05 eta: 6 days, 6:33:55 time: 1.5348 data_time: 0.0260 memory: 25565 grad_norm: 2.6256 loss: 1.2554 caption_loss_cls: 2.2011 detection_loss_cls: 0.0323 detection_loss_reg: 0.3410 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.5691 instance_segmentation_loss_cls: 0.0321 instance_segmentation_loss_reg: 0.3327 instance_segmentation_loss_poly: 0.8990 2024/01/07 07:36:59 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 07:36:59 - mmengine - INFO - Iter(train) [287000/640000] base_lr: 1.1613e-04 lr: 1.1613e-05 eta: 6 days, 6:13:10 time: 1.5298 data_time: 0.0259 memory: 25565 grad_norm: 2.6097 loss: 1.2567 caption_loss_cls: 2.2037 detection_loss_cls: 0.0324 detection_loss_reg: 0.3430 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.5654 instance_segmentation_loss_cls: 0.0323 instance_segmentation_loss_reg: 0.3328 instance_segmentation_loss_poly: 0.8986 2024/01/07 07:49:13 - mmengine - INFO - Iter(train) [287500/640000] base_lr: 1.1589e-04 lr: 1.1589e-05 eta: 6 days, 5:43:41 time: 1.5284 data_time: 0.0259 memory: 25565 grad_norm: 2.6146 loss: 1.2509 caption_loss_cls: 2.2041 detection_loss_cls: 0.0322 detection_loss_reg: 0.3414 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.5601 instance_segmentation_loss_cls: 0.0323 instance_segmentation_loss_reg: 0.3326 instance_segmentation_loss_poly: 0.8967 2024/01/07 08:02:05 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 08:02:05 - mmengine - INFO - Iter(train) [288000/640000] base_lr: 1.1564e-04 lr: 1.1564e-05 eta: 6 days, 5:34:33 time: 1.5287 data_time: 0.0260 memory: 25565 grad_norm: 2.6369 loss: 1.2629 caption_loss_cls: 2.2027 detection_loss_cls: 0.0322 detection_loss_reg: 0.3413 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.5587 instance_segmentation_loss_cls: 0.0324 instance_segmentation_loss_reg: 0.3336 instance_segmentation_loss_poly: 0.8990 2024/01/07 08:02:05 - mmengine - INFO - Saving checkpoint at 288000 iterations 2024/01/07 08:14:40 - mmengine - INFO - Iter(train) [288500/640000] base_lr: 1.1540e-04 lr: 1.1540e-05 eta: 6 days, 5:16:55 time: 1.5220 data_time: 0.0260 memory: 25565 grad_norm: 2.6448 loss: 1.2589 caption_loss_cls: 2.1927 detection_loss_cls: 0.0319 detection_loss_reg: 0.3389 semantic_segmentation_loss_cls: 0.0081 grounding_loss_reg: 2.5559 instance_segmentation_loss_cls: 0.0324 instance_segmentation_loss_reg: 0.3330 instance_segmentation_loss_poly: 0.8970 2024/01/07 08:27:30 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 08:27:30 - mmengine - INFO - Iter(train) [289000/640000] base_lr: 1.1516e-04 lr: 1.1516e-05 eta: 6 days, 5:06:31 time: 1.5257 data_time: 0.0260 memory: 25565 grad_norm: 2.6057 loss: 1.2487 caption_loss_cls: 2.1890 detection_loss_cls: 0.0320 detection_loss_reg: 0.3409 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.5552 instance_segmentation_loss_cls: 0.0324 instance_segmentation_loss_reg: 0.3332 instance_segmentation_loss_poly: 0.8986 2024/01/07 08:39:55 - mmengine - INFO - Iter(train) [289500/640000] base_lr: 1.1492e-04 lr: 1.1492e-05 eta: 6 days, 4:45:08 time: 1.5214 data_time: 0.0259 memory: 25565 grad_norm: 2.6085 loss: 1.2574 caption_loss_cls: 2.1918 detection_loss_cls: 0.0320 detection_loss_reg: 0.3410 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.5485 instance_segmentation_loss_cls: 0.0325 instance_segmentation_loss_reg: 0.3347 instance_segmentation_loss_poly: 0.9009 2024/01/07 08:52:23 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 08:52:23 - mmengine - INFO - Iter(train) [290000/640000] base_lr: 1.1467e-04 lr: 1.1467e-05 eta: 6 days, 4:25:34 time: 1.5179 data_time: 0.0259 memory: 25565 grad_norm: 2.5986 loss: 1.2628 caption_loss_cls: 2.1844 detection_loss_cls: 0.0318 detection_loss_reg: 0.3392 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.5471 instance_segmentation_loss_cls: 0.0325 instance_segmentation_loss_reg: 0.3356 instance_segmentation_loss_poly: 0.9040 2024/01/07 08:52:23 - mmengine - INFO - Saving checkpoint at 290000 iterations 2024/01/07 09:05:36 - mmengine - INFO - Iter(train) [290500/640000] base_lr: 1.1443e-04 lr: 1.1443e-05 eta: 6 days, 4:24:45 time: 1.5166 data_time: 0.0259 memory: 25565 grad_norm: 2.6158 loss: 1.2693 caption_loss_cls: 2.1866 detection_loss_cls: 0.0316 detection_loss_reg: 0.3392 semantic_segmentation_loss_cls: 0.0082 grounding_loss_reg: 2.5489 instance_segmentation_loss_cls: 0.0324 instance_segmentation_loss_reg: 0.3346 instance_segmentation_loss_poly: 0.9028 2024/01/07 09:18:26 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 09:18:26 - mmengine - INFO - Iter(train) [291000/640000] base_lr: 1.1419e-04 lr: 1.1419e-05 eta: 6 days, 4:14:13 time: 1.5211 data_time: 0.0261 memory: 25565 grad_norm: 2.6352 loss: 1.2747 caption_loss_cls: 2.1831 detection_loss_cls: 0.0318 detection_loss_reg: 0.3400 semantic_segmentation_loss_cls: 0.0081 grounding_loss_reg: 2.5471 instance_segmentation_loss_cls: 0.0323 instance_segmentation_loss_reg: 0.3348 instance_segmentation_loss_poly: 0.9016 2024/01/07 09:30:52 - mmengine - INFO - Iter(train) [291500/640000] base_lr: 1.1394e-04 lr: 1.1394e-05 eta: 6 days, 3:54:25 time: 1.5243 data_time: 0.0261 memory: 25565 grad_norm: 2.6225 loss: 1.2668 caption_loss_cls: 2.1820 detection_loss_cls: 0.0317 detection_loss_reg: 0.3396 semantic_segmentation_loss_cls: 0.0081 grounding_loss_reg: 2.5436 instance_segmentation_loss_cls: 0.0325 instance_segmentation_loss_reg: 0.3363 instance_segmentation_loss_poly: 0.9056 2024/01/07 09:43:42 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 09:43:42 - mmengine - INFO - Iter(train) [292000/640000] base_lr: 1.1370e-04 lr: 1.1370e-05 eta: 6 days, 3:43:37 time: 1.5236 data_time: 0.0260 memory: 25565 grad_norm: 2.5866 loss: 1.2593 caption_loss_cls: 2.1815 detection_loss_cls: 0.0315 detection_loss_reg: 0.3367 semantic_segmentation_loss_cls: 0.0081 grounding_loss_reg: 2.5442 instance_segmentation_loss_cls: 0.0325 instance_segmentation_loss_reg: 0.3355 instance_segmentation_loss_poly: 0.9032 2024/01/07 09:43:42 - mmengine - INFO - Saving checkpoint at 292000 iterations 2024/01/07 09:56:57 - mmengine - INFO - Iter(train) [292500/640000] base_lr: 1.1346e-04 lr: 1.1346e-05 eta: 6 days, 3:41:39 time: 1.5337 data_time: 0.0260 memory: 25565 grad_norm: 2.5915 loss: 1.2527 caption_loss_cls: 2.1747 detection_loss_cls: 0.0315 detection_loss_reg: 0.3358 semantic_segmentation_loss_cls: 0.0081 grounding_loss_reg: 2.5432 instance_segmentation_loss_cls: 0.0322 instance_segmentation_loss_reg: 0.3342 instance_segmentation_loss_poly: 0.8988 2024/01/07 10:09:22 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 10:09:22 - mmengine - INFO - Iter(train) [293000/640000] base_lr: 1.1322e-04 lr: 1.1322e-05 eta: 6 days, 3:22:07 time: 1.5275 data_time: 0.0260 memory: 25565 grad_norm: 2.6084 loss: 1.2527 caption_loss_cls: 2.1700 detection_loss_cls: 0.0313 detection_loss_reg: 0.3338 semantic_segmentation_loss_cls: 0.0081 grounding_loss_reg: 2.5407 instance_segmentation_loss_cls: 0.0323 instance_segmentation_loss_reg: 0.3348 instance_segmentation_loss_poly: 0.9003 2024/01/07 10:21:54 - mmengine - INFO - Iter(train) [293500/640000] base_lr: 1.1297e-04 lr: 1.1297e-05 eta: 6 days, 3:05:00 time: 1.5291 data_time: 0.0260 memory: 25565 grad_norm: 2.6242 loss: 1.2601 caption_loss_cls: 2.1661 detection_loss_cls: 0.0312 detection_loss_reg: 0.3330 semantic_segmentation_loss_cls: 0.0081 grounding_loss_reg: 2.5397 instance_segmentation_loss_cls: 0.0323 instance_segmentation_loss_reg: 0.3345 instance_segmentation_loss_poly: 0.9012 2024/01/07 10:34:32 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 10:34:32 - mmengine - INFO - Iter(train) [294000/640000] base_lr: 1.1273e-04 lr: 1.1273e-05 eta: 6 days, 2:50:28 time: 1.5319 data_time: 0.0260 memory: 25565 grad_norm: 2.6298 loss: 1.2501 caption_loss_cls: 2.1673 detection_loss_cls: 0.0312 detection_loss_reg: 0.3322 semantic_segmentation_loss_cls: 0.0080 grounding_loss_reg: 2.5379 instance_segmentation_loss_cls: 0.0324 instance_segmentation_loss_reg: 0.3355 instance_segmentation_loss_poly: 0.9033 2024/01/07 10:34:32 - mmengine - INFO - Saving checkpoint at 294000 iterations 2024/01/07 10:47:38 - mmengine - INFO - Iter(train) [294500/640000] base_lr: 1.1249e-04 lr: 1.1249e-05 eta: 6 days, 2:44:30 time: 1.5300 data_time: 0.0259 memory: 25565 grad_norm: 2.6729 loss: 1.2508 caption_loss_cls: 2.1656 detection_loss_cls: 0.0310 detection_loss_reg: 0.3306 semantic_segmentation_loss_cls: 0.0080 grounding_loss_reg: 2.5343 instance_segmentation_loss_cls: 0.0323 instance_segmentation_loss_reg: 0.3362 instance_segmentation_loss_poly: 0.9041 2024/01/07 10:59:57 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 10:59:57 - mmengine - INFO - Iter(train) [295000/640000] base_lr: 1.1224e-04 lr: 1.1224e-05 eta: 6 days, 2:23:46 time: 1.5221 data_time: 0.0257 memory: 25565 grad_norm: 2.7031 loss: 1.2498 caption_loss_cls: 2.1603 detection_loss_cls: 0.0310 detection_loss_reg: 0.3298 semantic_segmentation_loss_cls: 0.0080 grounding_loss_reg: 2.5329 instance_segmentation_loss_cls: 0.0323 instance_segmentation_loss_reg: 0.3379 instance_segmentation_loss_poly: 0.9074 2024/01/07 11:13:07 - mmengine - INFO - Iter(train) [295500/640000] base_lr: 1.1200e-04 lr: 1.1200e-05 eta: 6 days, 2:18:44 time: 1.5331 data_time: 0.0259 memory: 25565 grad_norm: 2.6827 loss: 1.2451 caption_loss_cls: 2.1596 detection_loss_cls: 0.0311 detection_loss_reg: 0.3299 semantic_segmentation_loss_cls: 0.0080 grounding_loss_reg: 2.5332 instance_segmentation_loss_cls: 0.0321 instance_segmentation_loss_reg: 0.3379 instance_segmentation_loss_poly: 0.9070 2024/01/07 11:25:36 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 11:25:36 - mmengine - INFO - Iter(train) [296000/640000] base_lr: 1.1175e-04 lr: 1.1175e-05 eta: 6 days, 2:01:28 time: 1.5280 data_time: 0.0259 memory: 25565 grad_norm: 2.7350 loss: 1.2545 caption_loss_cls: 2.1528 detection_loss_cls: 0.0312 detection_loss_reg: 0.3295 semantic_segmentation_loss_cls: 0.0080 grounding_loss_reg: 2.5345 instance_segmentation_loss_cls: 0.0320 instance_segmentation_loss_reg: 0.3377 instance_segmentation_loss_poly: 0.9069 2024/01/07 11:25:36 - mmengine - INFO - Saving checkpoint at 296000 iterations 2024/01/07 11:38:52 - mmengine - INFO - Iter(train) [296500/640000] base_lr: 1.1151e-04 lr: 1.1151e-05 eta: 6 days, 1:57:43 time: 1.5283 data_time: 0.0258 memory: 25565 grad_norm: 2.7316 loss: 1.2565 caption_loss_cls: 2.1501 detection_loss_cls: 0.0312 detection_loss_reg: 0.3301 semantic_segmentation_loss_cls: 0.0080 grounding_loss_reg: 2.5331 instance_segmentation_loss_cls: 0.0319 instance_segmentation_loss_reg: 0.3374 instance_segmentation_loss_poly: 0.9062 2024/01/07 11:51:11 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 11:51:11 - mmengine - INFO - Iter(train) [297000/640000] base_lr: 1.1127e-04 lr: 1.1127e-05 eta: 6 days, 1:37:44 time: 1.5266 data_time: 0.0258 memory: 25565 grad_norm: 2.7774 loss: 1.2647 caption_loss_cls: 2.1503 detection_loss_cls: 0.0312 detection_loss_reg: 0.3295 semantic_segmentation_loss_cls: 0.0080 grounding_loss_reg: 2.5305 instance_segmentation_loss_cls: 0.0317 instance_segmentation_loss_reg: 0.3358 instance_segmentation_loss_poly: 0.9025 2024/01/07 12:03:54 - mmengine - INFO - Iter(train) [297500/640000] base_lr: 1.1102e-04 lr: 1.1102e-05 eta: 6 days, 1:24:45 time: 1.5297 data_time: 0.0258 memory: 25565 grad_norm: 2.7687 loss: 1.2627 caption_loss_cls: 2.1537 detection_loss_cls: 0.0310 detection_loss_reg: 0.3271 semantic_segmentation_loss_cls: 0.0079 grounding_loss_reg: 2.5293 instance_segmentation_loss_cls: 0.0316 instance_segmentation_loss_reg: 0.3352 instance_segmentation_loss_poly: 0.9014 2024/01/07 12:16:56 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 12:16:56 - mmengine - INFO - Iter(train) [298000/640000] base_lr: 1.1078e-04 lr: 1.1078e-05 eta: 6 days, 1:16:30 time: 1.5355 data_time: 0.0258 memory: 25565 grad_norm: 2.7350 loss: 1.2535 caption_loss_cls: 2.1532 detection_loss_cls: 0.0310 detection_loss_reg: 0.3285 semantic_segmentation_loss_cls: 0.0079 grounding_loss_reg: 2.5280 instance_segmentation_loss_cls: 0.0316 instance_segmentation_loss_reg: 0.3348 instance_segmentation_loss_poly: 0.9016 2024/01/07 12:16:56 - mmengine - INFO - Saving checkpoint at 298000 iterations 2024/01/07 12:30:08 - mmengine - INFO - Iter(train) [298500/640000] base_lr: 1.1053e-04 lr: 1.1053e-05 eta: 6 days, 1:10:39 time: 1.5370 data_time: 0.0255 memory: 25565 grad_norm: 2.7228 loss: 1.2508 caption_loss_cls: 2.1532 detection_loss_cls: 0.0308 detection_loss_reg: 0.3270 semantic_segmentation_loss_cls: 0.0079 grounding_loss_reg: 2.5235 instance_segmentation_loss_cls: 0.0316 instance_segmentation_loss_reg: 0.3354 instance_segmentation_loss_poly: 0.9025 2024/01/07 12:42:23 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 12:42:23 - mmengine - INFO - Iter(train) [299000/640000] base_lr: 1.1029e-04 lr: 1.1029e-05 eta: 6 days, 0:50:09 time: 1.5359 data_time: 0.0255 memory: 25565 grad_norm: 2.7260 loss: 1.2636 caption_loss_cls: 2.1547 detection_loss_cls: 0.0307 detection_loss_reg: 0.3264 semantic_segmentation_loss_cls: 0.0079 grounding_loss_reg: 2.5281 instance_segmentation_loss_cls: 0.0315 instance_segmentation_loss_reg: 0.3347 instance_segmentation_loss_poly: 0.9012 2024/01/07 12:54:46 - mmengine - INFO - Iter(train) [299500/640000] base_lr: 1.1005e-04 lr: 1.1005e-05 eta: 6 days, 0:32:17 time: 1.5243 data_time: 0.0253 memory: 25565 grad_norm: 2.7855 loss: 1.2734 caption_loss_cls: 2.1456 detection_loss_cls: 0.0309 detection_loss_reg: 0.3278 semantic_segmentation_loss_cls: 0.0079 grounding_loss_reg: 2.5272 instance_segmentation_loss_cls: 0.0315 instance_segmentation_loss_reg: 0.3346 instance_segmentation_loss_poly: 0.9006 2024/01/07 13:07:16 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 13:07:16 - mmengine - INFO - Iter(train) [300000/640000] base_lr: 1.0980e-04 lr: 1.0980e-05 eta: 6 days, 0:16:13 time: 1.5246 data_time: 0.0253 memory: 25565 grad_norm: 2.8166 loss: 1.2753 caption_loss_cls: 2.1425 detection_loss_cls: 0.0309 detection_loss_reg: 0.3269 semantic_segmentation_loss_cls: 0.0079 grounding_loss_reg: 2.5250 instance_segmentation_loss_cls: 0.0317 instance_segmentation_loss_reg: 0.3353 instance_segmentation_loss_poly: 0.9034 2024/01/07 13:07:16 - mmengine - INFO - Saving checkpoint at 300000 iterations 2024/01/07 13:19:14 - mmengine - INFO - Evaluating bbox... 2024/01/07 13:20:11 - mmengine - INFO - bbox_mAP_copypaste: 0.499 0.681 0.548 0.340 0.552 0.639 2024/01/07 13:20:11 - mmengine - INFO - Evaluating segm... 2024/01/07 13:21:22 - mmengine - INFO - segm_mAP_copypaste: 0.324 0.588 0.315 0.179 0.375 0.504 2024/01/07 13:23:36 - mmengine - INFO - Evaluating bbox... 2024/01/07 13:24:33 - mmengine - INFO - bbox_mAP_copypaste: 0.498 0.680 0.547 0.338 0.551 0.639 2024/01/07 13:30:22 - mmengine - INFO - per class results: 2024/01/07 13:30:22 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 77.81 | 89.03 | | building | 81.7 | 90.35 | | sky | 93.54 | 97.68 | | floor | 82.84 | 89.41 | | tree | 74.27 | 88.92 | | ceiling | 85.22 | 93.42 | | road | 84.36 | 91.19 | | bed | 89.61 | 95.1 | | windowpane | 63.89 | 83.77 | | grass | 67.42 | 83.24 | | cabinet | 60.21 | 71.14 | | sidewalk | 67.75 | 78.83 | | person | 81.17 | 91.4 | | earth | 33.02 | 41.99 | | door | 55.4 | 70.21 | | table | 61.93 | 81.97 | | mountain | 53.23 | 66.51 | | plant | 50.72 | 60.49 | | curtain | 75.41 | 84.7 | | chair | 61.11 | 74.65 | | car | 84.07 | 91.58 | | water | 52.77 | 66.78 | | painting | 71.64 | 88.1 | | sofa | 70.25 | 79.1 | | shelf | 43.85 | 60.17 | | house | 41.68 | 64.27 | | sea | 59.97 | 78.96 | | mirror | 67.65 | 72.52 | | rug | 69.57 | 80.68 | | field | 23.59 | 37.41 | | armchair | 50.13 | 74.25 | | seat | 68.02 | 80.56 | | fence | 46.23 | 66.02 | | desk | 51.15 | 75.45 | | rock | 51.8 | 78.23 | | wardrobe | 47.65 | 68.08 | | lamp | 62.96 | 77.15 | | bathtub | 78.57 | 89.04 | | railing | 38.21 | 51.36 | | cushion | 58.15 | 69.79 | | base | 25.92 | 38.38 | | box | 28.14 | 38.04 | | column | 54.06 | 65.42 | | signboard | 38.34 | 57.72 | | chest of drawers | 38.22 | 61.48 | | counter | 30.81 | 49.69 | | sand | 49.11 | 68.4 | | sink | 76.22 | 84.53 | | skyscraper | 45.02 | 56.45 | | fireplace | 75.71 | 84.29 | | refrigerator | 75.21 | 86.61 | | grandstand | 52.53 | 76.51 | | path | 21.21 | 40.66 | | stairs | 38.16 | 46.5 | | runway | 73.34 | 92.98 | | case | 48.91 | 61.16 | | pool table | 91.26 | 94.91 | | pillow | 60.67 | 75.1 | | screen door | 73.79 | 75.58 | | stairway | 41.19 | 53.03 | | river | 11.16 | 30.97 | | bridge | 69.25 | 78.42 | | bookcase | 40.16 | 53.89 | | blind | 37.96 | 42.26 | | coffee table | 64.21 | 78.45 | | toilet | 83.88 | 94.14 | | flower | 36.61 | 46.42 | | book | 44.16 | 67.97 | | hill | 11.5 | 21.61 | | bench | 59.62 | 68.18 | | countertop | 61.22 | 69.02 | | stove | 78.59 | 82.96 | | palm | 47.63 | 72.33 | | kitchen island | 47.68 | 76.75 | | computer | 75.28 | 89.79 | | swivel chair | 44.02 | 53.92 | | boat | 59.13 | 75.59 | | bar | 40.43 | 47.62 | | arcade machine | 72.54 | 75.04 | | hovel | 28.0 | 38.79 | | bus | 87.11 | 93.96 | | towel | 65.15 | 76.06 | | light | 52.76 | 66.92 | | truck | 42.96 | 59.32 | | tower | 27.47 | 44.96 | | chandelier | 64.22 | 75.85 | | awning | 29.8 | 39.65 | | streetlight | 31.39 | 45.48 | | booth | 46.08 | 51.83 | | television receiver | 69.42 | 90.58 | | airplane | 64.7 | 73.38 | | dirt track | 10.3 | 18.97 | | apparel | 31.4 | 41.27 | | pole | 21.09 | 28.85 | | land | 4.36 | 6.05 | | bannister | 17.79 | 28.14 | | escalator | 20.33 | 21.08 | | ottoman | 49.67 | 70.24 | | bottle | 23.18 | 27.9 | | buffet | 54.95 | 63.44 | | poster | 25.47 | 29.99 | | stage | 13.8 | 25.93 | | van | 49.47 | 66.9 | | ship | 8.91 | 9.79 | | fountain | 19.55 | 19.8 | | conveyer belt | 55.28 | 93.23 | | canopy | 38.34 | 56.75 | | washer | 70.18 | 72.64 | | plaything | 34.85 | 45.15 | | swimming pool | 38.95 | 54.45 | | stool | 41.93 | 61.86 | | barrel | 36.38 | 64.77 | | basket | 35.2 | 45.58 | | waterfall | 68.27 | 84.07 | | tent | 74.23 | 97.9 | | bag | 19.79 | 27.27 | | minibike | 71.28 | 76.3 | | cradle | 83.23 | 95.68 | | oven | 48.86 | 60.26 | | ball | 53.52 | 71.2 | | food | 57.37 | 65.15 | | step | 21.29 | 23.69 | | tank | 54.01 | 64.32 | | trade name | 24.69 | 28.87 | | microwave | 84.05 | 94.09 | | pot | 51.14 | 66.13 | | animal | 60.61 | 63.86 | | bicycle | 57.51 | 72.73 | | lake | 53.93 | 63.72 | | dishwasher | 65.73 | 78.83 | | screen | 65.1 | 82.03 | | blanket | 29.27 | 37.98 | | sculpture | 52.52 | 84.4 | | hood | 58.61 | 73.78 | | sconce | 45.24 | 68.76 | | vase | 42.63 | 58.88 | | traffic light | 39.71 | 51.81 | | tray | 14.16 | 17.28 | | ashcan | 38.53 | 46.1 | | fan | 62.53 | 75.82 | | pier | 14.96 | 62.89 | | crt screen | 4.74 | 7.56 | | plate | 57.9 | 72.18 | | monitor | 40.22 | 49.27 | | bulletin board | 39.54 | 47.54 | | shower | 2.48 | 2.99 | | radiator | 59.04 | 69.79 | | glass | 17.55 | 19.21 | | clock | 40.8 | 58.62 | | flag | 40.04 | 48.15 | +---------------------+-------+-------+ 2024/01/07 13:30:36 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.4980 coco/bbox_mAP_50: 0.6800 coco/bbox_mAP_75: 0.5470 coco/bbox_mAP_s: 0.3380 coco/bbox_mAP_m: 0.5510 coco/bbox_mAP_l: 0.6390 coco/segm_mAP: 0.3240 coco/segm_mAP_50: 0.5880 coco/segm_mAP_75: 0.3150 coco/segm_mAP_s: 0.1790 coco/segm_mAP_m: 0.3750 coco/segm_mAP_l: 0.5040 Bleu_1: 0.7545 Bleu_2: 0.5872 Bleu_3: 0.4429 Bleu_4: 0.3300 METEOR: 0.2674 ROUGE_L: 0.5533 CIDEr: 1.0689 SPICE: 0.1998 aAcc: 83.3700 mIoU: 50.5800 mAcc: 63.2300 visual-grounding/miou: 0.8059 visual-grounding/acc: 0.8737 data_time: 0.0145 time: 1.9431 2024/01/07 13:43:16 - mmengine - INFO - Iter(train) [300500/640000] base_lr: 1.0956e-04 lr: 1.0956e-05 eta: 6 days, 0:03:28 time: 1.5165 data_time: 0.0191 memory: 34656 grad_norm: 2.8104 loss: 1.2810 caption_loss_cls: 2.1454 detection_loss_cls: 0.0308 detection_loss_reg: 0.3248 semantic_segmentation_loss_cls: 0.0079 grounding_loss_reg: 2.5277 instance_segmentation_loss_cls: 0.0315 instance_segmentation_loss_reg: 0.3348 instance_segmentation_loss_poly: 0.9025 2024/01/07 13:56:24 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 13:56:24 - mmengine - INFO - Iter(train) [301000/640000] base_lr: 1.0931e-04 lr: 1.0931e-05 eta: 5 days, 23:56:00 time: 1.5286 data_time: 0.0192 memory: 25564 grad_norm: 2.7946 loss: 1.2756 caption_loss_cls: 2.1452 detection_loss_cls: 0.0307 detection_loss_reg: 0.3236 semantic_segmentation_loss_cls: 0.0079 grounding_loss_reg: 2.5285 instance_segmentation_loss_cls: 0.0314 instance_segmentation_loss_reg: 0.3349 instance_segmentation_loss_poly: 0.9023 2024/01/07 14:09:24 - mmengine - INFO - Iter(train) [301500/640000] base_lr: 1.0907e-04 lr: 1.0907e-05 eta: 5 days, 23:46:45 time: 1.5328 data_time: 0.0192 memory: 25564 grad_norm: 2.8551 loss: 1.2858 caption_loss_cls: 2.1453 detection_loss_cls: 0.0309 detection_loss_reg: 0.3250 semantic_segmentation_loss_cls: 0.0078 grounding_loss_reg: 2.5302 instance_segmentation_loss_cls: 0.0316 instance_segmentation_loss_reg: 0.3370 instance_segmentation_loss_poly: 0.9071 2024/01/07 14:21:26 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 14:21:26 - mmengine - INFO - Iter(train) [302000/640000] base_lr: 1.0882e-04 lr: 1.0882e-05 eta: 5 days, 23:24:49 time: 1.5180 data_time: 0.0191 memory: 25564 grad_norm: 2.9307 loss: 1.3059 caption_loss_cls: 2.1511 detection_loss_cls: 0.0308 detection_loss_reg: 0.3256 semantic_segmentation_loss_cls: 0.0078 grounding_loss_reg: 2.5308 instance_segmentation_loss_cls: 0.0316 instance_segmentation_loss_reg: 0.3359 instance_segmentation_loss_poly: 0.9055 2024/01/07 14:21:26 - mmengine - INFO - Saving checkpoint at 302000 iterations 2024/01/07 14:34:17 - mmengine - INFO - Iter(train) [302500/640000] base_lr: 1.0858e-04 lr: 1.0858e-05 eta: 5 days, 23:13:37 time: 1.5127 data_time: 0.0193 memory: 25564 grad_norm: 3.0147 loss: 1.3120 caption_loss_cls: 2.1570 detection_loss_cls: 0.0310 detection_loss_reg: 0.3283 semantic_segmentation_loss_cls: 0.0078 grounding_loss_reg: 2.5279 instance_segmentation_loss_cls: 0.0316 instance_segmentation_loss_reg: 0.3361 instance_segmentation_loss_poly: 0.9045 2024/01/07 14:47:07 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 14:47:07 - mmengine - INFO - Iter(train) [303000/640000] base_lr: 1.0834e-04 lr: 1.0834e-05 eta: 5 days, 23:02:06 time: 1.5217 data_time: 0.0195 memory: 25564 grad_norm: 3.0218 loss: 1.3036 caption_loss_cls: 2.1590 detection_loss_cls: 0.0308 detection_loss_reg: 0.3283 semantic_segmentation_loss_cls: 0.0078 grounding_loss_reg: 2.5286 instance_segmentation_loss_cls: 0.0314 instance_segmentation_loss_reg: 0.3364 instance_segmentation_loss_poly: 0.9056 2024/01/07 14:59:33 - mmengine - INFO - Iter(train) [303500/640000] base_lr: 1.0809e-04 lr: 1.0809e-05 eta: 5 days, 22:45:37 time: 1.5223 data_time: 0.0196 memory: 25564 grad_norm: 3.0506 loss: 1.3137 caption_loss_cls: 2.1620 detection_loss_cls: 0.0307 detection_loss_reg: 0.3275 semantic_segmentation_loss_cls: 0.0078 grounding_loss_reg: 2.5235 instance_segmentation_loss_cls: 0.0315 instance_segmentation_loss_reg: 0.3380 instance_segmentation_loss_poly: 0.9074 2024/01/07 15:12:23 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 15:12:23 - mmengine - INFO - Iter(train) [304000/640000] base_lr: 1.0785e-04 lr: 1.0785e-05 eta: 5 days, 22:34:01 time: 1.5271 data_time: 0.0197 memory: 25564 grad_norm: 3.0963 loss: 1.3047 caption_loss_cls: 2.1615 detection_loss_cls: 0.0308 detection_loss_reg: 0.3284 semantic_segmentation_loss_cls: 0.0078 grounding_loss_reg: 2.5244 instance_segmentation_loss_cls: 0.0314 instance_segmentation_loss_reg: 0.3376 instance_segmentation_loss_poly: 0.9052 2024/01/07 15:12:23 - mmengine - INFO - Saving checkpoint at 304000 iterations 2024/01/07 15:25:55 - mmengine - INFO - Iter(train) [304500/640000] base_lr: 1.0760e-04 lr: 1.0760e-05 eta: 5 days, 22:30:41 time: 1.5391 data_time: 0.0261 memory: 25564 grad_norm: 3.0632 loss: 1.2936 caption_loss_cls: 2.1634 detection_loss_cls: 0.0309 detection_loss_reg: 0.3292 semantic_segmentation_loss_cls: 0.0078 grounding_loss_reg: 2.5195 instance_segmentation_loss_cls: 0.0314 instance_segmentation_loss_reg: 0.3376 instance_segmentation_loss_poly: 0.9059 2024/01/07 15:38:25 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 15:38:25 - mmengine - INFO - Iter(train) [305000/640000] base_lr: 1.0736e-04 lr: 1.0736e-05 eta: 5 days, 22:15:06 time: 1.5298 data_time: 0.0260 memory: 25564 grad_norm: 3.0728 loss: 1.2901 caption_loss_cls: 2.1608 detection_loss_cls: 0.0312 detection_loss_reg: 0.3330 semantic_segmentation_loss_cls: 0.0078 grounding_loss_reg: 2.5143 instance_segmentation_loss_cls: 0.0312 instance_segmentation_loss_reg: 0.3368 instance_segmentation_loss_poly: 0.9028 2024/01/07 15:51:10 - mmengine - INFO - Iter(train) [305500/640000] base_lr: 1.0711e-04 lr: 1.0711e-05 eta: 5 days, 22:02:30 time: 1.5261 data_time: 0.0261 memory: 25564 grad_norm: 3.0182 loss: 1.2732 caption_loss_cls: 2.1642 detection_loss_cls: 0.0312 detection_loss_reg: 0.3315 semantic_segmentation_loss_cls: 0.0078 grounding_loss_reg: 2.5160 instance_segmentation_loss_cls: 0.0312 instance_segmentation_loss_reg: 0.3374 instance_segmentation_loss_poly: 0.9049 2024/01/07 16:04:39 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 16:04:39 - mmengine - INFO - Iter(train) [306000/640000] base_lr: 1.0687e-04 lr: 1.0687e-05 eta: 5 days, 21:57:52 time: 1.5475 data_time: 0.0264 memory: 25564 grad_norm: 2.9542 loss: 1.2600 caption_loss_cls: 2.1648 detection_loss_cls: 0.0313 detection_loss_reg: 0.3330 semantic_segmentation_loss_cls: 0.0078 grounding_loss_reg: 2.5166 instance_segmentation_loss_cls: 0.0313 instance_segmentation_loss_reg: 0.3375 instance_segmentation_loss_poly: 0.9061 2024/01/07 16:04:39 - mmengine - INFO - Saving checkpoint at 306000 iterations 2024/01/07 16:17:30 - mmengine - INFO - Iter(train) [306500/640000] base_lr: 1.0662e-04 lr: 1.0662e-05 eta: 5 days, 21:46:16 time: 1.5477 data_time: 0.0265 memory: 25564 grad_norm: 2.9071 loss: 1.2514 caption_loss_cls: 2.1588 detection_loss_cls: 0.0312 detection_loss_reg: 0.3323 semantic_segmentation_loss_cls: 0.0078 grounding_loss_reg: 2.5147 instance_segmentation_loss_cls: 0.0311 instance_segmentation_loss_reg: 0.3367 instance_segmentation_loss_poly: 0.9035 2024/01/07 16:30:16 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 16:30:16 - mmengine - INFO - Iter(train) [307000/640000] base_lr: 1.0638e-04 lr: 1.0638e-05 eta: 5 days, 21:33:37 time: 1.5466 data_time: 0.0264 memory: 25564 grad_norm: 2.8991 loss: 1.2377 caption_loss_cls: 2.1582 detection_loss_cls: 0.0310 detection_loss_reg: 0.3299 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5159 instance_segmentation_loss_cls: 0.0309 instance_segmentation_loss_reg: 0.3351 instance_segmentation_loss_poly: 0.8983 2024/01/07 16:42:23 - mmengine - INFO - Iter(train) [307500/640000] base_lr: 1.0613e-04 lr: 1.0613e-05 eta: 5 days, 21:14:00 time: 1.5418 data_time: 0.0263 memory: 25564 grad_norm: 2.8885 loss: 1.2304 caption_loss_cls: 2.1571 detection_loss_cls: 0.0310 detection_loss_reg: 0.3289 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5164 instance_segmentation_loss_cls: 0.0306 instance_segmentation_loss_reg: 0.3337 instance_segmentation_loss_poly: 0.8973 2024/01/07 16:55:38 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 16:55:38 - mmengine - INFO - Iter(train) [308000/640000] base_lr: 1.0589e-04 lr: 1.0589e-05 eta: 5 days, 21:06:28 time: 1.5481 data_time: 0.0263 memory: 25564 grad_norm: 2.7779 loss: 1.2234 caption_loss_cls: 2.1545 detection_loss_cls: 0.0312 detection_loss_reg: 0.3302 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5144 instance_segmentation_loss_cls: 0.0307 instance_segmentation_loss_reg: 0.3338 instance_segmentation_loss_poly: 0.8990 2024/01/07 16:55:38 - mmengine - INFO - Saving checkpoint at 308000 iterations 2024/01/07 17:08:38 - mmengine - INFO - Iter(train) [308500/640000] base_lr: 1.0564e-04 lr: 1.0564e-05 eta: 5 days, 20:56:17 time: 1.5403 data_time: 0.0261 memory: 25564 grad_norm: 2.8286 loss: 1.2257 caption_loss_cls: 2.1537 detection_loss_cls: 0.0313 detection_loss_reg: 0.3311 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5127 instance_segmentation_loss_cls: 0.0305 instance_segmentation_loss_reg: 0.3321 instance_segmentation_loss_poly: 0.8952 2024/01/07 17:21:16 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 17:21:16 - mmengine - INFO - Iter(train) [309000/640000] base_lr: 1.0540e-04 lr: 1.0540e-05 eta: 5 days, 20:42:10 time: 1.5421 data_time: 0.0263 memory: 25564 grad_norm: 2.8379 loss: 1.2414 caption_loss_cls: 2.1521 detection_loss_cls: 0.0314 detection_loss_reg: 0.3310 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5107 instance_segmentation_loss_cls: 0.0307 instance_segmentation_loss_reg: 0.3341 instance_segmentation_loss_poly: 0.8985 2024/01/07 17:34:10 - mmengine - INFO - Iter(train) [309500/640000] base_lr: 1.0515e-04 lr: 1.0515e-05 eta: 5 days, 20:30:51 time: 1.5443 data_time: 0.0263 memory: 25564 grad_norm: 2.8207 loss: 1.2390 caption_loss_cls: 2.1521 detection_loss_cls: 0.0315 detection_loss_reg: 0.3307 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5092 instance_segmentation_loss_cls: 0.0305 instance_segmentation_loss_reg: 0.3333 instance_segmentation_loss_poly: 0.8983 2024/01/07 17:46:39 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 17:46:39 - mmengine - INFO - Iter(train) [310000/640000] base_lr: 1.0491e-04 lr: 1.0491e-05 eta: 5 days, 20:15:26 time: 1.5295 data_time: 0.0260 memory: 25564 grad_norm: 2.8272 loss: 1.2529 caption_loss_cls: 2.1603 detection_loss_cls: 0.0314 detection_loss_reg: 0.3285 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5113 instance_segmentation_loss_cls: 0.0305 instance_segmentation_loss_reg: 0.3338 instance_segmentation_loss_poly: 0.8996 2024/01/07 17:46:39 - mmengine - INFO - Saving checkpoint at 310000 iterations 2024/01/07 17:59:35 - mmengine - INFO - Iter(train) [310500/640000] base_lr: 1.0466e-04 lr: 1.0466e-05 eta: 5 days, 20:04:24 time: 1.5306 data_time: 0.0259 memory: 25564 grad_norm: 2.7923 loss: 1.2648 caption_loss_cls: 2.1627 detection_loss_cls: 0.0315 detection_loss_reg: 0.3285 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5071 instance_segmentation_loss_cls: 0.0305 instance_segmentation_loss_reg: 0.3342 instance_segmentation_loss_poly: 0.9007 2024/01/07 18:12:08 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 18:12:08 - mmengine - INFO - Iter(train) [311000/640000] base_lr: 1.0442e-04 lr: 1.0442e-05 eta: 5 days, 19:49:44 time: 1.5274 data_time: 0.0258 memory: 25564 grad_norm: 2.7595 loss: 1.2662 caption_loss_cls: 2.1652 detection_loss_cls: 0.0316 detection_loss_reg: 0.3298 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5068 instance_segmentation_loss_cls: 0.0305 instance_segmentation_loss_reg: 0.3338 instance_segmentation_loss_poly: 0.8997 2024/01/07 18:24:45 - mmengine - INFO - Iter(train) [311500/640000] base_lr: 1.0417e-04 lr: 1.0417e-05 eta: 5 days, 19:35:41 time: 1.5350 data_time: 0.0259 memory: 25564 grad_norm: 2.7216 loss: 1.2641 caption_loss_cls: 2.1709 detection_loss_cls: 0.0316 detection_loss_reg: 0.3300 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.5082 instance_segmentation_loss_cls: 0.0303 instance_segmentation_loss_reg: 0.3327 instance_segmentation_loss_poly: 0.8967 2024/01/07 18:37:22 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 18:37:22 - mmengine - INFO - Iter(train) [312000/640000] base_lr: 1.0393e-04 lr: 1.0393e-05 eta: 5 days, 19:21:39 time: 1.5255 data_time: 0.0258 memory: 25564 grad_norm: 2.7382 loss: 1.2698 caption_loss_cls: 2.1662 detection_loss_cls: 0.0317 detection_loss_reg: 0.3304 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5053 instance_segmentation_loss_cls: 0.0303 instance_segmentation_loss_reg: 0.3322 instance_segmentation_loss_poly: 0.8957 2024/01/07 18:37:22 - mmengine - INFO - Saving checkpoint at 312000 iterations 2024/01/07 18:50:42 - mmengine - INFO - Iter(train) [312500/640000] base_lr: 1.0368e-04 lr: 1.0368e-05 eta: 5 days, 19:14:03 time: 1.5302 data_time: 0.0266 memory: 25564 grad_norm: 2.7121 loss: 1.2846 caption_loss_cls: 2.1680 detection_loss_cls: 0.0316 detection_loss_reg: 0.3302 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5066 instance_segmentation_loss_cls: 0.0303 instance_segmentation_loss_reg: 0.3339 instance_segmentation_loss_poly: 0.8995 2024/01/07 19:03:44 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 19:03:44 - mmengine - INFO - Iter(train) [313000/640000] base_lr: 1.0344e-04 lr: 1.0344e-05 eta: 5 days, 19:03:49 time: 1.5365 data_time: 0.0265 memory: 25564 grad_norm: 2.6550 loss: 1.2693 caption_loss_cls: 2.1680 detection_loss_cls: 0.0317 detection_loss_reg: 0.3316 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5079 instance_segmentation_loss_cls: 0.0303 instance_segmentation_loss_reg: 0.3336 instance_segmentation_loss_poly: 0.8982 2024/01/07 19:16:19 - mmengine - INFO - Iter(train) [313500/640000] base_lr: 1.0319e-04 lr: 1.0319e-05 eta: 5 days, 18:49:27 time: 1.5317 data_time: 0.0265 memory: 25564 grad_norm: 2.6704 loss: 1.2716 caption_loss_cls: 2.1698 detection_loss_cls: 0.0319 detection_loss_reg: 0.3334 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5051 instance_segmentation_loss_cls: 0.0301 instance_segmentation_loss_reg: 0.3320 instance_segmentation_loss_poly: 0.8953 2024/01/07 19:28:42 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 19:28:42 - mmengine - INFO - Iter(train) [314000/640000] base_lr: 1.0295e-04 lr: 1.0295e-05 eta: 5 days, 18:33:26 time: 1.5301 data_time: 0.0264 memory: 25564 grad_norm: 2.6901 loss: 1.2613 caption_loss_cls: 2.1729 detection_loss_cls: 0.0315 detection_loss_reg: 0.3312 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5082 instance_segmentation_loss_cls: 0.0301 instance_segmentation_loss_reg: 0.3318 instance_segmentation_loss_poly: 0.8947 2024/01/07 19:28:42 - mmengine - INFO - Saving checkpoint at 314000 iterations 2024/01/07 19:42:05 - mmengine - INFO - Iter(train) [314500/640000] base_lr: 1.0270e-04 lr: 1.0270e-05 eta: 5 days, 18:26:02 time: 1.5369 data_time: 0.0265 memory: 25564 grad_norm: 2.6896 loss: 1.2593 caption_loss_cls: 2.1735 detection_loss_cls: 0.0314 detection_loss_reg: 0.3319 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5095 instance_segmentation_loss_cls: 0.0301 instance_segmentation_loss_reg: 0.3324 instance_segmentation_loss_poly: 0.8954 2024/01/07 19:54:17 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 19:54:17 - mmengine - INFO - Iter(train) [315000/640000] base_lr: 1.0245e-04 lr: 1.0245e-05 eta: 5 days, 18:08:35 time: 1.5316 data_time: 0.0265 memory: 25564 grad_norm: 2.7279 loss: 1.2805 caption_loss_cls: 2.1813 detection_loss_cls: 0.0314 detection_loss_reg: 0.3322 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5112 instance_segmentation_loss_cls: 0.0302 instance_segmentation_loss_reg: 0.3322 instance_segmentation_loss_poly: 0.8954 2024/01/07 20:07:10 - mmengine - INFO - Iter(train) [315500/640000] base_lr: 1.0221e-04 lr: 1.0221e-05 eta: 5 days, 17:56:52 time: 1.5356 data_time: 0.0265 memory: 25564 grad_norm: 2.7241 loss: 1.2640 caption_loss_cls: 2.1816 detection_loss_cls: 0.0314 detection_loss_reg: 0.3326 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5092 instance_segmentation_loss_cls: 0.0302 instance_segmentation_loss_reg: 0.3305 instance_segmentation_loss_poly: 0.8930 2024/01/07 20:19:53 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 20:19:53 - mmengine - INFO - Iter(train) [316000/640000] base_lr: 1.0196e-04 lr: 1.0196e-05 eta: 5 days, 17:43:48 time: 1.5372 data_time: 0.0266 memory: 25564 grad_norm: 2.7404 loss: 1.2695 caption_loss_cls: 2.1847 detection_loss_cls: 0.0314 detection_loss_reg: 0.3341 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5037 instance_segmentation_loss_cls: 0.0305 instance_segmentation_loss_reg: 0.3320 instance_segmentation_loss_poly: 0.8966 2024/01/07 20:19:53 - mmengine - INFO - Saving checkpoint at 316000 iterations 2024/01/07 20:32:49 - mmengine - INFO - Iter(train) [316500/640000] base_lr: 1.0172e-04 lr: 1.0172e-05 eta: 5 days, 17:32:29 time: 1.5314 data_time: 0.0258 memory: 25564 grad_norm: 2.7392 loss: 1.2674 caption_loss_cls: 2.1891 detection_loss_cls: 0.0313 detection_loss_reg: 0.3333 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5073 instance_segmentation_loss_cls: 0.0304 instance_segmentation_loss_reg: 0.3326 instance_segmentation_loss_poly: 0.8976 2024/01/07 20:45:15 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 20:45:15 - mmengine - INFO - Iter(train) [317000/640000] base_lr: 1.0147e-04 lr: 1.0147e-05 eta: 5 days, 17:17:04 time: 1.5220 data_time: 0.0257 memory: 25564 grad_norm: 2.7808 loss: 1.2815 caption_loss_cls: 2.1897 detection_loss_cls: 0.0313 detection_loss_reg: 0.3336 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5079 instance_segmentation_loss_cls: 0.0304 instance_segmentation_loss_reg: 0.3327 instance_segmentation_loss_poly: 0.8972 2024/01/07 20:57:50 - mmengine - INFO - Iter(train) [317500/640000] base_lr: 1.0123e-04 lr: 1.0123e-05 eta: 5 days, 17:02:58 time: 1.5221 data_time: 0.0258 memory: 25564 grad_norm: 2.7926 loss: 1.2890 caption_loss_cls: 2.1959 detection_loss_cls: 0.0315 detection_loss_reg: 0.3359 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5072 instance_segmentation_loss_cls: 0.0306 instance_segmentation_loss_reg: 0.3334 instance_segmentation_loss_poly: 0.8989 2024/01/07 21:10:20 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 21:10:20 - mmengine - INFO - Iter(train) [318000/640000] base_lr: 1.0098e-04 lr: 1.0098e-05 eta: 5 days, 16:48:17 time: 1.5239 data_time: 0.0258 memory: 25564 grad_norm: 2.7859 loss: 1.2812 caption_loss_cls: 2.1820 detection_loss_cls: 0.0313 detection_loss_reg: 0.3337 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5068 instance_segmentation_loss_cls: 0.0306 instance_segmentation_loss_reg: 0.3332 instance_segmentation_loss_poly: 0.8984 2024/01/07 21:10:20 - mmengine - INFO - Saving checkpoint at 318000 iterations 2024/01/07 21:23:34 - mmengine - INFO - Iter(train) [318500/640000] base_lr: 1.0074e-04 lr: 1.0074e-05 eta: 5 days, 16:39:14 time: 1.5217 data_time: 0.0256 memory: 25564 grad_norm: 2.7838 loss: 1.2778 caption_loss_cls: 2.1732 detection_loss_cls: 0.0313 detection_loss_reg: 0.3330 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5091 instance_segmentation_loss_cls: 0.0305 instance_segmentation_loss_reg: 0.3322 instance_segmentation_loss_poly: 0.8973 2024/01/07 21:36:30 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 21:36:30 - mmengine - INFO - Iter(train) [319000/640000] base_lr: 1.0049e-04 lr: 1.0049e-05 eta: 5 days, 16:27:48 time: 1.5327 data_time: 0.0257 memory: 25564 grad_norm: 2.7851 loss: 1.2616 caption_loss_cls: 2.1719 detection_loss_cls: 0.0313 detection_loss_reg: 0.3334 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5085 instance_segmentation_loss_cls: 0.0304 instance_segmentation_loss_reg: 0.3315 instance_segmentation_loss_poly: 0.8960 2024/01/07 21:49:14 - mmengine - INFO - Iter(train) [319500/640000] base_lr: 1.0025e-04 lr: 1.0025e-05 eta: 5 days, 16:14:52 time: 1.5304 data_time: 0.0258 memory: 25564 grad_norm: 2.7592 loss: 1.2723 caption_loss_cls: 2.1757 detection_loss_cls: 0.0313 detection_loss_reg: 0.3331 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5078 instance_segmentation_loss_cls: 0.0303 instance_segmentation_loss_reg: 0.3308 instance_segmentation_loss_poly: 0.8944 2024/01/07 22:02:10 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 22:02:10 - mmengine - INFO - Iter(train) [320000/640000] base_lr: 1.0000e-04 lr: 1.0000e-05 eta: 5 days, 16:03:25 time: 1.5337 data_time: 0.0257 memory: 25564 grad_norm: 2.7271 loss: 1.2638 caption_loss_cls: 2.1759 detection_loss_cls: 0.0311 detection_loss_reg: 0.3321 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5046 instance_segmentation_loss_cls: 0.0302 instance_segmentation_loss_reg: 0.3305 instance_segmentation_loss_poly: 0.8927 2024/01/07 22:02:10 - mmengine - INFO - Saving checkpoint at 320000 iterations 2024/01/07 22:14:01 - mmengine - INFO - Evaluating bbox... 2024/01/07 22:14:58 - mmengine - INFO - bbox_mAP_copypaste: 0.502 0.685 0.551 0.347 0.557 0.645 2024/01/07 22:14:58 - mmengine - INFO - Evaluating segm... 2024/01/07 22:16:11 - mmengine - INFO - segm_mAP_copypaste: 0.335 0.600 0.327 0.192 0.385 0.511 2024/01/07 22:18:20 - mmengine - INFO - Evaluating bbox... 2024/01/07 22:19:18 - mmengine - INFO - bbox_mAP_copypaste: 0.502 0.685 0.550 0.346 0.557 0.644 2024/01/07 22:24:49 - mmengine - INFO - per class results: 2024/01/07 22:24:49 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 78.61 | 89.5 | | building | 82.7 | 91.03 | | sky | 93.65 | 97.77 | | floor | 82.39 | 90.43 | | tree | 74.08 | 87.65 | | ceiling | 84.93 | 94.31 | | road | 84.7 | 89.9 | | bed | 88.82 | 96.71 | | windowpane | 63.1 | 78.97 | | grass | 68.06 | 79.78 | | cabinet | 60.51 | 70.71 | | sidewalk | 67.63 | 80.9 | | person | 81.38 | 92.28 | | earth | 39.96 | 57.12 | | door | 51.84 | 63.72 | | table | 65.06 | 78.98 | | mountain | 56.59 | 73.85 | | plant | 53.58 | 63.33 | | curtain | 74.27 | 87.36 | | chair | 60.72 | 72.46 | | car | 84.73 | 92.31 | | water | 57.5 | 70.75 | | painting | 72.93 | 86.55 | | sofa | 70.55 | 77.42 | | shelf | 47.49 | 74.55 | | house | 47.93 | 64.27 | | sea | 59.43 | 69.41 | | mirror | 68.98 | 74.55 | | rug | 66.47 | 76.24 | | field | 32.15 | 48.48 | | armchair | 50.2 | 78.46 | | seat | 68.43 | 79.52 | | fence | 46.67 | 64.72 | | desk | 53.15 | 72.87 | | rock | 51.02 | 78.83 | | wardrobe | 43.8 | 58.65 | | lamp | 65.18 | 74.61 | | bathtub | 77.87 | 91.26 | | railing | 38.35 | 52.81 | | cushion | 61.3 | 72.14 | | base | 28.21 | 42.23 | | box | 27.19 | 34.91 | | column | 53.95 | 61.29 | | signboard | 39.91 | 55.98 | | chest of drawers | 36.96 | 76.92 | | counter | 26.49 | 37.35 | | sand | 41.19 | 53.88 | | sink | 75.57 | 84.77 | | skyscraper | 44.12 | 52.84 | | fireplace | 76.08 | 91.23 | | refrigerator | 80.71 | 83.91 | | grandstand | 45.8 | 85.09 | | path | 24.51 | 35.69 | | stairs | 31.72 | 36.82 | | runway | 74.54 | 96.72 | | case | 46.41 | 54.91 | | pool table | 91.94 | 95.6 | | pillow | 60.01 | 70.79 | | screen door | 80.98 | 83.87 | | stairway | 26.62 | 38.56 | | river | 13.91 | 39.14 | | bridge | 68.71 | 80.27 | | bookcase | 34.18 | 46.5 | | blind | 39.92 | 49.07 | | coffee table | 63.66 | 76.01 | | toilet | 85.54 | 92.2 | | flower | 40.72 | 55.99 | | book | 45.75 | 70.78 | | hill | 11.83 | 19.1 | | bench | 57.7 | 68.72 | | countertop | 56.31 | 80.46 | | stove | 77.81 | 80.81 | | palm | 48.95 | 72.63 | | kitchen island | 37.75 | 63.61 | | computer | 75.2 | 88.74 | | swivel chair | 42.64 | 54.71 | | boat | 59.56 | 78.22 | | bar | 38.57 | 47.31 | | arcade machine | 48.83 | 53.0 | | hovel | 33.51 | 38.46 | | bus | 85.81 | 95.2 | | towel | 62.61 | 77.77 | | light | 49.15 | 56.65 | | truck | 41.84 | 66.77 | | tower | 34.92 | 59.78 | | chandelier | 65.92 | 83.26 | | awning | 29.51 | 47.5 | | streetlight | 32.81 | 42.78 | | booth | 44.41 | 56.16 | | television receiver | 74.68 | 87.33 | | airplane | 72.65 | 82.43 | | dirt track | 4.39 | 23.65 | | apparel | 31.38 | 40.53 | | pole | 27.12 | 40.33 | | land | 0.3 | 0.38 | | bannister | 17.4 | 28.36 | | escalator | 23.22 | 24.53 | | ottoman | 52.44 | 71.52 | | bottle | 22.02 | 27.24 | | buffet | 56.87 | 64.5 | | poster | 40.72 | 54.41 | | stage | 11.43 | 21.93 | | van | 41.9 | 54.43 | | ship | 8.21 | 10.06 | | fountain | 21.08 | 21.65 | | conveyer belt | 68.98 | 91.91 | | canopy | 32.82 | 45.77 | | washer | 68.53 | 72.36 | | plaything | 31.0 | 41.54 | | swimming pool | 78.55 | 80.23 | | stool | 45.03 | 63.95 | | barrel | 29.06 | 64.17 | | basket | 32.88 | 40.22 | | waterfall | 77.19 | 89.09 | | tent | 74.05 | 97.45 | | bag | 19.05 | 23.61 | | minibike | 71.45 | 81.84 | | cradle | 85.3 | 96.68 | | oven | 45.35 | 56.53 | | ball | 49.16 | 61.24 | | food | 57.45 | 66.43 | | step | 5.37 | 6.71 | | tank | 49.32 | 54.48 | | trade name | 29.01 | 33.97 | | microwave | 85.96 | 93.61 | | pot | 45.68 | 52.21 | | animal | 60.12 | 63.06 | | bicycle | 59.8 | 81.64 | | lake | 45.9 | 49.26 | | dishwasher | 70.18 | 83.59 | | screen | 68.95 | 92.02 | | blanket | 21.25 | 26.11 | | sculpture | 62.67 | 83.57 | | hood | 62.66 | 73.56 | | sconce | 45.52 | 57.22 | | vase | 43.61 | 67.47 | | traffic light | 38.89 | 60.23 | | tray | 18.0 | 26.81 | | ashcan | 44.35 | 56.78 | | fan | 65.5 | 78.2 | | pier | 36.51 | 47.37 | | crt screen | 4.93 | 13.61 | | plate | 59.48 | 76.34 | | monitor | 9.79 | 10.63 | | bulletin board | 57.38 | 62.92 | | shower | 2.64 | 3.1 | | radiator | 61.48 | 69.25 | | glass | 18.15 | 20.41 | | clock | 31.28 | 44.35 | | flag | 51.75 | 65.71 | +---------------------+-------+-------+ 2024/01/07 22:25:01 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.5020 coco/bbox_mAP_50: 0.6850 coco/bbox_mAP_75: 0.5500 coco/bbox_mAP_s: 0.3460 coco/bbox_mAP_m: 0.5570 coco/bbox_mAP_l: 0.6440 coco/segm_mAP: 0.3350 coco/segm_mAP_50: 0.6000 coco/segm_mAP_75: 0.3270 coco/segm_mAP_s: 0.1920 coco/segm_mAP_m: 0.3850 coco/segm_mAP_l: 0.5110 Bleu_1: 0.7523 Bleu_2: 0.5870 Bleu_3: 0.4439 Bleu_4: 0.3325 METEOR: 0.2674 ROUGE_L: 0.5530 CIDEr: 1.0738 SPICE: 0.2000 aAcc: 83.7500 mIoU: 50.9700 mAcc: 63.2700 visual-grounding/miou: 0.8096 visual-grounding/acc: 0.8742 data_time: 0.0128 time: 1.9188 2024/01/07 22:37:56 - mmengine - INFO - Iter(train) [320500/640000] base_lr: 9.9755e-05 lr: 9.9755e-06 eta: 5 days, 15:52:02 time: 1.5339 data_time: 0.0193 memory: 34656 grad_norm: 2.6890 loss: 1.2413 caption_loss_cls: 2.1731 detection_loss_cls: 0.0311 detection_loss_reg: 0.3322 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5002 instance_segmentation_loss_cls: 0.0301 instance_segmentation_loss_reg: 0.3297 instance_segmentation_loss_poly: 0.8898 2024/01/07 22:50:33 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 22:50:33 - mmengine - INFO - Iter(train) [321000/640000] base_lr: 9.9510e-05 lr: 9.9510e-06 eta: 5 days, 15:38:17 time: 1.5369 data_time: 0.0193 memory: 25564 grad_norm: 2.6907 loss: 1.2405 caption_loss_cls: 2.1724 detection_loss_cls: 0.0314 detection_loss_reg: 0.3349 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.5015 instance_segmentation_loss_cls: 0.0300 instance_segmentation_loss_reg: 0.3285 instance_segmentation_loss_poly: 0.8872 2024/01/07 23:03:35 - mmengine - INFO - Iter(train) [321500/640000] base_lr: 9.9264e-05 lr: 9.9264e-06 eta: 5 days, 15:27:24 time: 1.5436 data_time: 0.0195 memory: 25564 grad_norm: 2.6769 loss: 1.2322 caption_loss_cls: 2.1698 detection_loss_cls: 0.0313 detection_loss_reg: 0.3344 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.4959 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3276 instance_segmentation_loss_poly: 0.8856 2024/01/07 23:16:08 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 23:16:08 - mmengine - INFO - Iter(train) [322000/640000] base_lr: 9.9019e-05 lr: 9.9019e-06 eta: 5 days, 15:13:12 time: 1.5444 data_time: 0.0194 memory: 25564 grad_norm: 2.6621 loss: 1.2484 caption_loss_cls: 2.1740 detection_loss_cls: 0.0312 detection_loss_reg: 0.3346 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.4981 instance_segmentation_loss_cls: 0.0299 instance_segmentation_loss_reg: 0.3288 instance_segmentation_loss_poly: 0.8880 2024/01/07 23:16:08 - mmengine - INFO - Saving checkpoint at 322000 iterations 2024/01/07 23:29:30 - mmengine - INFO - Iter(train) [322500/640000] base_lr: 9.8773e-05 lr: 9.8773e-06 eta: 5 days, 15:04:30 time: 1.5462 data_time: 0.0195 memory: 25564 grad_norm: 2.6331 loss: 1.2280 caption_loss_cls: 2.1735 detection_loss_cls: 0.0312 detection_loss_reg: 0.3333 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.4959 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3277 instance_segmentation_loss_poly: 0.8857 2024/01/07 23:42:11 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/07 23:42:11 - mmengine - INFO - Iter(train) [323000/640000] base_lr: 9.8528e-05 lr: 9.8528e-06 eta: 5 days, 14:51:15 time: 1.5426 data_time: 0.0195 memory: 25564 grad_norm: 2.5923 loss: 1.2228 caption_loss_cls: 2.1687 detection_loss_cls: 0.0312 detection_loss_reg: 0.3322 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.4945 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3274 instance_segmentation_loss_poly: 0.8843 2024/01/07 23:54:52 - mmengine - INFO - Iter(train) [323500/640000] base_lr: 9.8283e-05 lr: 9.8283e-06 eta: 5 days, 14:37:50 time: 1.5416 data_time: 0.0196 memory: 25564 grad_norm: 2.6252 loss: 1.2303 caption_loss_cls: 2.1694 detection_loss_cls: 0.0311 detection_loss_reg: 0.3302 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.4915 instance_segmentation_loss_cls: 0.0297 instance_segmentation_loss_reg: 0.3261 instance_segmentation_loss_poly: 0.8818 2024/01/08 00:08:02 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/08 00:08:02 - mmengine - INFO - Iter(train) [324000/640000] base_lr: 9.8037e-05 lr: 9.8037e-06 eta: 5 days, 14:27:43 time: 1.5451 data_time: 0.0198 memory: 25564 grad_norm: 2.6322 loss: 1.2394 caption_loss_cls: 2.1712 detection_loss_cls: 0.0311 detection_loss_reg: 0.3293 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4888 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3266 instance_segmentation_loss_poly: 0.8830 2024/01/08 00:08:02 - mmengine - INFO - Saving checkpoint at 324000 iterations 2024/01/08 00:21:21 - mmengine - INFO - Iter(train) [324500/640000] base_lr: 9.7792e-05 lr: 9.7792e-06 eta: 5 days, 14:18:30 time: 1.5506 data_time: 0.0262 memory: 25564 grad_norm: 2.6823 loss: 1.2573 caption_loss_cls: 2.1725 detection_loss_cls: 0.0311 detection_loss_reg: 0.3305 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.4935 instance_segmentation_loss_cls: 0.0296 instance_segmentation_loss_reg: 0.3261 instance_segmentation_loss_poly: 0.8814 2024/01/08 00:34:28 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/08 00:34:28 - mmengine - INFO - Iter(train) [325000/640000] base_lr: 9.7546e-05 lr: 9.7546e-06 eta: 5 days, 14:07:56 time: 1.5580 data_time: 0.0263 memory: 25564 grad_norm: 2.6483 loss: 1.2463 caption_loss_cls: 2.1732 detection_loss_cls: 0.0309 detection_loss_reg: 0.3290 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4930 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3290 instance_segmentation_loss_poly: 0.8865 2024/01/08 00:47:27 - mmengine - INFO - Iter(train) [325500/640000] base_lr: 9.7301e-05 lr: 9.7301e-06 eta: 5 days, 13:56:32 time: 1.5575 data_time: 0.0262 memory: 25564 grad_norm: 2.6227 loss: 1.2397 caption_loss_cls: 2.1773 detection_loss_cls: 0.0309 detection_loss_reg: 0.3299 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.4916 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3294 instance_segmentation_loss_poly: 0.8884 2024/01/08 01:00:27 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/08 01:00:27 - mmengine - INFO - Iter(train) [326000/640000] base_lr: 9.7056e-05 lr: 9.7056e-06 eta: 5 days, 13:45:04 time: 1.5640 data_time: 0.0264 memory: 25564 grad_norm: 2.6175 loss: 1.2331 caption_loss_cls: 2.1839 detection_loss_cls: 0.0308 detection_loss_reg: 0.3293 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.4929 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3305 instance_segmentation_loss_poly: 0.8913 2024/01/08 01:00:27 - mmengine - INFO - Saving checkpoint at 326000 iterations 2024/01/08 01:13:47 - mmengine - INFO - Iter(train) [326500/640000] base_lr: 9.6810e-05 lr: 9.6810e-06 eta: 5 days, 13:35:42 time: 1.5637 data_time: 0.0263 memory: 25564 grad_norm: 2.6274 loss: 1.2360 caption_loss_cls: 2.1843 detection_loss_cls: 0.0307 detection_loss_reg: 0.3277 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4854 instance_segmentation_loss_cls: 0.0297 instance_segmentation_loss_reg: 0.3297 instance_segmentation_loss_poly: 0.8906 2024/01/08 01:26:08 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/08 01:26:08 - mmengine - INFO - Iter(train) [327000/640000] base_lr: 9.6565e-05 lr: 9.6565e-06 eta: 5 days, 13:20:14 time: 1.5586 data_time: 0.0263 memory: 25564 grad_norm: 2.6713 loss: 1.2435 caption_loss_cls: 2.1843 detection_loss_cls: 0.0307 detection_loss_reg: 0.3272 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4822 instance_segmentation_loss_cls: 0.0299 instance_segmentation_loss_reg: 0.3310 instance_segmentation_loss_poly: 0.8940 2024/01/08 01:38:50 - mmengine - INFO - Iter(train) [327500/640000] base_lr: 9.6320e-05 lr: 9.6320e-06 eta: 5 days, 13:06:56 time: 1.5590 data_time: 0.0262 memory: 25564 grad_norm: 2.6726 loss: 1.2350 caption_loss_cls: 2.1873 detection_loss_cls: 0.0307 detection_loss_reg: 0.3282 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4833 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3304 instance_segmentation_loss_poly: 0.8910 2024/01/08 01:51:39 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/08 01:51:39 - mmengine - INFO - Iter(train) [328000/640000] base_lr: 9.6075e-05 lr: 9.6075e-06 eta: 5 days, 12:54:24 time: 1.5539 data_time: 0.0260 memory: 25564 grad_norm: 2.7000 loss: 1.2273 caption_loss_cls: 2.1884 detection_loss_cls: 0.0306 detection_loss_reg: 0.3281 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4794 instance_segmentation_loss_cls: 0.0296 instance_segmentation_loss_reg: 0.3298 instance_segmentation_loss_poly: 0.8875 2024/01/08 01:51:39 - mmengine - INFO - Saving checkpoint at 328000 iterations 2024/01/08 02:04:30 - mmengine - INFO - Iter(train) [328500/640000] base_lr: 9.5829e-05 lr: 9.5829e-06 eta: 5 days, 12:41:54 time: 1.5466 data_time: 0.0260 memory: 25564 grad_norm: 2.7119 loss: 1.2174 caption_loss_cls: 2.1886 detection_loss_cls: 0.0306 detection_loss_reg: 0.3296 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4750 instance_segmentation_loss_cls: 0.0297 instance_segmentation_loss_reg: 0.3309 instance_segmentation_loss_poly: 0.8910 2024/01/08 02:17:28 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240107_022829 2024/01/08 02:17:28 - mmengine - INFO - Iter(train) [329000/640000] base_lr: 9.5584e-05 lr: 9.5584e-06 eta: 5 days, 12:30:12 time: 1.5444 data_time: 0.0259 memory: 25564 grad_norm: 2.7459 loss: 1.2198 caption_loss_cls: 2.1895 detection_loss_cls: 0.0306 detection_loss_reg: 0.3297 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4727 instance_segmentation_loss_cls: 0.0296 instance_segmentation_loss_reg: 0.3297 instance_segmentation_loss_poly: 0.8894 2024/01/08 03:10:48 - mmengine - INFO - Iter(train) [329500/640000] base_lr: 9.5339e-05 lr: 9.5339e-06 eta: 5 days, 9:02:56 time: 1.5237 data_time: 0.0183 memory: 25568 grad_norm: 2.7611 loss: 1.2267 caption_loss_cls: 2.1877 detection_loss_cls: 0.0304 detection_loss_reg: 0.3269 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.4709 instance_segmentation_loss_cls: 0.0296 instance_segmentation_loss_reg: 0.3295 instance_segmentation_loss_poly: 0.8863 2024/01/08 03:23:45 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 03:23:45 - mmengine - INFO - Iter(train) [330000/640000] base_lr: 9.5094e-05 lr: 9.5094e-06 eta: 5 days, 10:03:24 time: 1.5230 data_time: 0.0180 memory: 25568 grad_norm: 2.8079 loss: 1.2272 caption_loss_cls: 2.1799 detection_loss_cls: 0.0301 detection_loss_reg: 0.3269 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.4675 instance_segmentation_loss_cls: 0.0295 instance_segmentation_loss_reg: 0.3289 instance_segmentation_loss_poly: 0.8844 2024/01/08 03:23:45 - mmengine - INFO - Saving checkpoint at 330000 iterations 2024/01/08 03:36:52 - mmengine - INFO - Iter(train) [330500/640000] base_lr: 9.4849e-05 lr: 9.4849e-06 eta: 5 days, 10:55:50 time: 1.5197 data_time: 0.0173 memory: 25568 grad_norm: 2.8484 loss: 1.2473 caption_loss_cls: 2.1777 detection_loss_cls: 0.0301 detection_loss_reg: 0.3272 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.4631 instance_segmentation_loss_cls: 0.0296 instance_segmentation_loss_reg: 0.3294 instance_segmentation_loss_poly: 0.8846 2024/01/08 03:49:53 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 03:49:53 - mmengine - INFO - Iter(train) [331000/640000] base_lr: 9.4604e-05 lr: 9.4604e-06 eta: 5 days, 11:16:47 time: 1.5298 data_time: 0.0174 memory: 25568 grad_norm: 2.8086 loss: 1.2297 caption_loss_cls: 2.1689 detection_loss_cls: 0.0301 detection_loss_reg: 0.3287 semantic_segmentation_loss_cls: 0.0077 grounding_loss_reg: 2.4593 instance_segmentation_loss_cls: 0.0296 instance_segmentation_loss_reg: 0.3300 instance_segmentation_loss_poly: 0.8859 2024/01/08 04:02:48 - mmengine - INFO - Iter(train) [331500/640000] base_lr: 9.4358e-05 lr: 9.4358e-06 eta: 5 days, 11:18:08 time: 1.5330 data_time: 0.0173 memory: 25568 grad_norm: 2.8026 loss: 1.2319 caption_loss_cls: 2.1683 detection_loss_cls: 0.0299 detection_loss_reg: 0.3261 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4567 instance_segmentation_loss_cls: 0.0297 instance_segmentation_loss_reg: 0.3301 instance_segmentation_loss_poly: 0.8864 2024/01/08 04:15:51 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 04:15:51 - mmengine - INFO - Iter(train) [332000/640000] base_lr: 9.4113e-05 lr: 9.4113e-06 eta: 5 days, 11:26:26 time: 1.5363 data_time: 0.0171 memory: 25568 grad_norm: 2.7867 loss: 1.2266 caption_loss_cls: 2.1726 detection_loss_cls: 0.0298 detection_loss_reg: 0.3264 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4565 instance_segmentation_loss_cls: 0.0297 instance_segmentation_loss_reg: 0.3302 instance_segmentation_loss_poly: 0.8865 2024/01/08 04:15:51 - mmengine - INFO - Saving checkpoint at 332000 iterations 2024/01/08 04:28:48 - mmengine - INFO - Iter(train) [332500/640000] base_lr: 9.3868e-05 lr: 9.3868e-06 eta: 5 days, 11:23:54 time: 1.5434 data_time: 0.0233 memory: 25568 grad_norm: 2.8301 loss: 1.2469 caption_loss_cls: 2.1680 detection_loss_cls: 0.0300 detection_loss_reg: 0.3289 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4539 instance_segmentation_loss_cls: 0.0297 instance_segmentation_loss_reg: 0.3297 instance_segmentation_loss_poly: 0.8858 2024/01/08 04:42:03 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 04:42:03 - mmengine - INFO - Iter(train) [333000/640000] base_lr: 9.3623e-05 lr: 9.3623e-06 eta: 5 days, 11:36:56 time: 1.5574 data_time: 0.0234 memory: 25568 grad_norm: 2.8150 loss: 1.2324 caption_loss_cls: 2.1653 detection_loss_cls: 0.0300 detection_loss_reg: 0.3280 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4519 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3303 instance_segmentation_loss_poly: 0.8874 2024/01/08 04:55:55 - mmengine - INFO - Iter(train) [333500/640000] base_lr: 9.3379e-05 lr: 9.3379e-06 eta: 5 days, 12:19:31 time: 1.5760 data_time: 0.0237 memory: 25568 grad_norm: 2.7917 loss: 1.2227 caption_loss_cls: 2.1639 detection_loss_cls: 0.0298 detection_loss_reg: 0.3265 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4502 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3306 instance_segmentation_loss_poly: 0.8864 2024/01/08 05:08:35 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 05:08:35 - mmengine - INFO - Iter(train) [334000/640000] base_lr: 9.3134e-05 lr: 9.3134e-06 eta: 5 days, 11:52:11 time: 1.5719 data_time: 0.0237 memory: 25568 grad_norm: 2.7985 loss: 1.2181 caption_loss_cls: 2.1594 detection_loss_cls: 0.0297 detection_loss_reg: 0.3245 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4464 instance_segmentation_loss_cls: 0.0299 instance_segmentation_loss_reg: 0.3319 instance_segmentation_loss_poly: 0.8888 2024/01/08 05:08:35 - mmengine - INFO - Saving checkpoint at 334000 iterations 2024/01/08 05:21:50 - mmengine - INFO - Iter(train) [334500/640000] base_lr: 9.2889e-05 lr: 9.2889e-06 eta: 5 days, 11:54:20 time: 1.5740 data_time: 0.0233 memory: 25568 grad_norm: 2.7951 loss: 1.2057 caption_loss_cls: 2.1521 detection_loss_cls: 0.0297 detection_loss_reg: 0.3249 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4399 instance_segmentation_loss_cls: 0.0299 instance_segmentation_loss_reg: 0.3315 instance_segmentation_loss_poly: 0.8879 2024/01/08 05:34:56 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 05:34:56 - mmengine - INFO - Iter(train) [335000/640000] base_lr: 9.2644e-05 lr: 9.2644e-06 eta: 5 days, 11:47:11 time: 1.5750 data_time: 0.0231 memory: 25568 grad_norm: 2.8372 loss: 1.2140 caption_loss_cls: 2.1465 detection_loss_cls: 0.0297 detection_loss_reg: 0.3247 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4363 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3313 instance_segmentation_loss_poly: 0.8872 2024/01/08 05:47:27 - mmengine - INFO - Iter(train) [335500/640000] base_lr: 9.2399e-05 lr: 9.2399e-06 eta: 5 days, 11:16:10 time: 1.5692 data_time: 0.0231 memory: 25568 grad_norm: 2.8898 loss: 1.2187 caption_loss_cls: 2.1474 detection_loss_cls: 0.0296 detection_loss_reg: 0.3249 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4321 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3315 instance_segmentation_loss_poly: 0.8866 2024/01/08 06:00:31 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 06:00:31 - mmengine - INFO - Iter(train) [336000/640000] base_lr: 9.2155e-05 lr: 9.2155e-06 eta: 5 days, 11:07:59 time: 1.5695 data_time: 0.0232 memory: 25568 grad_norm: 2.8999 loss: 1.2176 caption_loss_cls: 2.1386 detection_loss_cls: 0.0296 detection_loss_reg: 0.3240 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4294 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3322 instance_segmentation_loss_poly: 0.8880 2024/01/08 06:00:31 - mmengine - INFO - Saving checkpoint at 336000 iterations 2024/01/08 06:13:57 - mmengine - INFO - Iter(train) [336500/640000] base_lr: 9.1910e-05 lr: 9.1910e-06 eta: 5 days, 11:12:28 time: 1.5766 data_time: 0.0233 memory: 25568 grad_norm: 2.9379 loss: 1.2232 caption_loss_cls: 2.1373 detection_loss_cls: 0.0296 detection_loss_reg: 0.3235 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4264 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3313 instance_segmentation_loss_poly: 0.8867 2024/01/08 06:26:50 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 06:26:50 - mmengine - INFO - Iter(train) [337000/640000] base_lr: 9.1665e-05 lr: 9.1665e-06 eta: 5 days, 10:56:21 time: 1.5711 data_time: 0.0233 memory: 25568 grad_norm: 2.9603 loss: 1.2302 caption_loss_cls: 2.1326 detection_loss_cls: 0.0294 detection_loss_reg: 0.3220 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4217 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3321 instance_segmentation_loss_poly: 0.8869 2024/01/08 06:39:48 - mmengine - INFO - Iter(train) [337500/640000] base_lr: 9.1421e-05 lr: 9.1421e-06 eta: 5 days, 10:43:20 time: 1.5577 data_time: 0.0231 memory: 25568 grad_norm: 2.9720 loss: 1.2269 caption_loss_cls: 2.1284 detection_loss_cls: 0.0293 detection_loss_reg: 0.3211 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4192 instance_segmentation_loss_cls: 0.0297 instance_segmentation_loss_reg: 0.3316 instance_segmentation_loss_poly: 0.8859 2024/01/08 06:52:50 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 06:52:50 - mmengine - INFO - Iter(train) [338000/640000] base_lr: 9.1176e-05 lr: 9.1176e-06 eta: 5 days, 10:32:28 time: 1.5632 data_time: 0.0232 memory: 25568 grad_norm: 2.9672 loss: 1.2327 caption_loss_cls: 2.1325 detection_loss_cls: 0.0294 detection_loss_reg: 0.3218 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4180 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3320 instance_segmentation_loss_poly: 0.8872 2024/01/08 06:52:50 - mmengine - INFO - Saving checkpoint at 338000 iterations 2024/01/08 07:06:25 - mmengine - INFO - Iter(train) [338500/640000] base_lr: 9.0932e-05 lr: 9.0932e-06 eta: 5 days, 10:36:39 time: 1.5679 data_time: 0.0233 memory: 25568 grad_norm: 2.9429 loss: 1.2351 caption_loss_cls: 2.1361 detection_loss_cls: 0.0295 detection_loss_reg: 0.3240 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4143 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3326 instance_segmentation_loss_poly: 0.8883 2024/01/08 07:19:20 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 07:19:20 - mmengine - INFO - Iter(train) [339000/640000] base_lr: 9.0687e-05 lr: 9.0687e-06 eta: 5 days, 10:21:30 time: 1.5654 data_time: 0.0233 memory: 25568 grad_norm: 2.9199 loss: 1.2318 caption_loss_cls: 2.1372 detection_loss_cls: 0.0295 detection_loss_reg: 0.3239 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.4105 instance_segmentation_loss_cls: 0.0300 instance_segmentation_loss_reg: 0.3335 instance_segmentation_loss_poly: 0.8897 2024/01/08 07:31:54 - mmengine - INFO - Iter(train) [339500/640000] base_lr: 9.0443e-05 lr: 9.0443e-06 eta: 5 days, 9:57:18 time: 1.5661 data_time: 0.0233 memory: 25568 grad_norm: 2.9162 loss: 1.2214 caption_loss_cls: 2.1282 detection_loss_cls: 0.0295 detection_loss_reg: 0.3243 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.4046 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3328 instance_segmentation_loss_poly: 0.8866 2024/01/08 07:45:03 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 07:45:03 - mmengine - INFO - Iter(train) [340000/640000] base_lr: 9.0199e-05 lr: 9.0199e-06 eta: 5 days, 9:48:24 time: 1.5673 data_time: 0.0232 memory: 25568 grad_norm: 2.9057 loss: 1.2307 caption_loss_cls: 2.1269 detection_loss_cls: 0.0296 detection_loss_reg: 0.3258 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4048 instance_segmentation_loss_cls: 0.0297 instance_segmentation_loss_reg: 0.3323 instance_segmentation_loss_poly: 0.8844 2024/01/08 07:45:03 - mmengine - INFO - Saving checkpoint at 340000 iterations 2024/01/08 07:56:30 - mmengine - INFO - Evaluating bbox... 2024/01/08 07:57:26 - mmengine - INFO - bbox_mAP_copypaste: 0.508 0.690 0.559 0.358 0.561 0.651 2024/01/08 07:57:26 - mmengine - INFO - Evaluating segm... 2024/01/08 07:58:39 - mmengine - INFO - segm_mAP_copypaste: 0.339 0.607 0.335 0.198 0.387 0.516 2024/01/08 08:00:49 - mmengine - INFO - Evaluating bbox... 2024/01/08 08:01:46 - mmengine - INFO - bbox_mAP_copypaste: 0.507 0.688 0.558 0.350 0.562 0.649 2024/01/08 08:07:17 - mmengine - INFO - per class results: 2024/01/08 08:07:17 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 77.76 | 86.23 | | building | 81.22 | 88.94 | | sky | 93.23 | 97.94 | | floor | 81.82 | 90.65 | | tree | 74.06 | 87.01 | | ceiling | 82.56 | 97.55 | | road | 85.2 | 91.24 | | bed | 89.9 | 95.83 | | windowpane | 63.09 | 83.01 | | grass | 69.14 | 81.97 | | cabinet | 61.69 | 71.6 | | sidewalk | 69.5 | 77.29 | | person | 80.84 | 90.72 | | earth | 39.93 | 53.48 | | door | 56.87 | 68.71 | | table | 64.44 | 79.25 | | mountain | 58.77 | 78.69 | | plant | 53.94 | 65.06 | | curtain | 75.46 | 89.39 | | chair | 60.29 | 73.69 | | car | 84.73 | 91.75 | | water | 63.63 | 77.22 | | painting | 72.34 | 87.21 | | sofa | 70.4 | 79.66 | | shelf | 48.6 | 74.17 | | house | 41.55 | 67.97 | | sea | 63.08 | 72.93 | | mirror | 68.69 | 76.52 | | rug | 67.55 | 79.33 | | field | 29.46 | 41.64 | | armchair | 49.25 | 74.48 | | seat | 65.99 | 85.05 | | fence | 46.14 | 65.14 | | desk | 50.79 | 65.43 | | rock | 44.24 | 68.61 | | wardrobe | 49.11 | 73.91 | | lamp | 65.34 | 78.33 | | bathtub | 76.75 | 87.95 | | railing | 39.83 | 56.88 | | cushion | 63.55 | 76.77 | | base | 26.22 | 40.13 | | box | 24.84 | 32.26 | | column | 46.26 | 74.95 | | signboard | 38.2 | 52.4 | | chest of drawers | 37.09 | 57.83 | | counter | 30.52 | 48.96 | | sand | 48.87 | 67.37 | | sink | 75.15 | 81.58 | | skyscraper | 44.17 | 58.47 | | fireplace | 75.02 | 92.98 | | refrigerator | 79.76 | 85.06 | | grandstand | 48.82 | 76.78 | | path | 22.24 | 34.51 | | stairs | 36.75 | 51.96 | | runway | 74.54 | 94.8 | | case | 49.23 | 60.96 | | pool table | 91.03 | 95.52 | | pillow | 60.07 | 71.24 | | screen door | 78.02 | 84.44 | | stairway | 38.44 | 60.25 | | river | 13.64 | 31.82 | | bridge | 63.08 | 81.9 | | bookcase | 39.69 | 47.41 | | blind | 40.55 | 43.92 | | coffee table | 61.69 | 84.11 | | toilet | 86.3 | 91.37 | | flower | 40.0 | 52.3 | | book | 45.85 | 67.18 | | hill | 14.12 | 26.63 | | bench | 57.84 | 74.15 | | countertop | 62.39 | 76.15 | | stove | 78.73 | 83.41 | | palm | 46.7 | 72.35 | | kitchen island | 45.92 | 89.35 | | computer | 77.55 | 86.85 | | swivel chair | 45.65 | 59.52 | | boat | 67.05 | 83.99 | | bar | 37.35 | 47.86 | | arcade machine | 75.19 | 86.04 | | hovel | 23.57 | 29.04 | | bus | 86.63 | 94.73 | | towel | 65.74 | 76.73 | | light | 50.57 | 56.74 | | truck | 44.7 | 63.44 | | tower | 39.71 | 68.85 | | chandelier | 63.52 | 71.44 | | awning | 32.0 | 45.39 | | streetlight | 33.07 | 44.96 | | booth | 45.61 | 73.68 | | television receiver | 70.52 | 88.47 | | airplane | 61.18 | 67.68 | | dirt track | 3.51 | 11.07 | | apparel | 28.11 | 47.59 | | pole | 27.79 | 40.51 | | land | 2.06 | 3.06 | | bannister | 11.54 | 15.39 | | escalator | 21.33 | 25.15 | | ottoman | 55.87 | 70.16 | | bottle | 22.24 | 28.56 | | buffet | 54.73 | 66.05 | | poster | 39.97 | 55.93 | | stage | 11.66 | 16.16 | | van | 44.86 | 62.27 | | ship | 8.46 | 9.26 | | fountain | 30.44 | 30.87 | | conveyer belt | 73.96 | 92.4 | | canopy | 36.73 | 44.84 | | washer | 70.36 | 72.89 | | plaything | 30.17 | 37.78 | | swimming pool | 68.14 | 86.22 | | stool | 43.86 | 61.6 | | barrel | 12.14 | 66.59 | | basket | 32.66 | 39.4 | | waterfall | 72.14 | 87.24 | | tent | 81.05 | 96.83 | | bag | 17.51 | 20.76 | | minibike | 72.8 | 84.81 | | cradle | 77.79 | 96.55 | | oven | 50.08 | 67.52 | | ball | 49.41 | 64.33 | | food | 54.95 | 62.33 | | step | 12.23 | 17.89 | | tank | 50.56 | 55.62 | | trade name | 27.09 | 33.09 | | microwave | 85.03 | 92.78 | | pot | 47.59 | 59.46 | | animal | 63.49 | 66.38 | | bicycle | 55.25 | 65.12 | | lake | 56.2 | 62.69 | | dishwasher | 73.03 | 86.53 | | screen | 62.75 | 79.5 | | blanket | 21.54 | 27.15 | | sculpture | 62.47 | 82.31 | | hood | 57.57 | 72.08 | | sconce | 45.49 | 55.26 | | vase | 41.7 | 61.55 | | traffic light | 38.42 | 52.67 | | tray | 15.42 | 21.71 | | ashcan | 42.04 | 50.87 | | fan | 64.17 | 72.04 | | pier | 46.02 | 63.95 | | crt screen | 9.96 | 26.26 | | plate | 58.51 | 77.8 | | monitor | 20.25 | 25.54 | | bulletin board | 46.57 | 49.34 | | shower | 1.93 | 6.94 | | radiator | 58.69 | 67.22 | | glass | 18.48 | 20.03 | | clock | 37.84 | 47.44 | | flag | 48.19 | 57.89 | +---------------------+-------+-------+ 2024/01/08 08:07:31 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.5070 coco/bbox_mAP_50: 0.6880 coco/bbox_mAP_75: 0.5580 coco/bbox_mAP_s: 0.3500 coco/bbox_mAP_m: 0.5620 coco/bbox_mAP_l: 0.6490 coco/segm_mAP: 0.3390 coco/segm_mAP_50: 0.6070 coco/segm_mAP_75: 0.3350 coco/segm_mAP_s: 0.1980 coco/segm_mAP_m: 0.3870 coco/segm_mAP_l: 0.5160 Bleu_1: 0.7555 Bleu_2: 0.5912 Bleu_3: 0.4501 Bleu_4: 0.3385 METEOR: 0.2718 ROUGE_L: 0.5575 CIDEr: 1.0927 SPICE: 0.1994 aAcc: 83.5900 mIoU: 51.3100 mAcc: 64.2700 visual-grounding/miou: 0.8086 visual-grounding/acc: 0.8716 data_time: 0.0308 time: 1.9221 2024/01/08 08:19:43 - mmengine - INFO - Iter(train) [340500/640000] base_lr: 8.9955e-05 lr: 8.9955e-06 eta: 5 days, 9:17:38 time: 1.5494 data_time: 0.0176 memory: 34658 grad_norm: 2.8856 loss: 1.2170 caption_loss_cls: 2.1258 detection_loss_cls: 0.0294 detection_loss_reg: 0.3255 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.4058 instance_segmentation_loss_cls: 0.0297 instance_segmentation_loss_reg: 0.3320 instance_segmentation_loss_poly: 0.8840 2024/01/08 08:32:36 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 08:32:36 - mmengine - INFO - Iter(train) [341000/640000] base_lr: 8.9710e-05 lr: 8.9710e-06 eta: 5 days, 9:03:07 time: 1.5495 data_time: 0.0178 memory: 25568 grad_norm: 2.8764 loss: 1.2054 caption_loss_cls: 2.1244 detection_loss_cls: 0.0290 detection_loss_reg: 0.3219 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.4035 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3337 instance_segmentation_loss_poly: 0.8871 2024/01/08 08:45:04 - mmengine - INFO - Iter(train) [341500/640000] base_lr: 8.9466e-05 lr: 8.9466e-06 eta: 5 days, 8:39:13 time: 1.5419 data_time: 0.0180 memory: 25568 grad_norm: 2.9114 loss: 1.2036 caption_loss_cls: 2.1167 detection_loss_cls: 0.0289 detection_loss_reg: 0.3201 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.3981 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3329 instance_segmentation_loss_poly: 0.8853 2024/01/08 08:57:31 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 08:57:31 - mmengine - INFO - Iter(train) [342000/640000] base_lr: 8.9222e-05 lr: 8.9222e-06 eta: 5 days, 8:15:56 time: 1.5330 data_time: 0.0181 memory: 25568 grad_norm: 2.9520 loss: 1.1960 caption_loss_cls: 2.1054 detection_loss_cls: 0.0289 detection_loss_reg: 0.3201 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.3946 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3327 instance_segmentation_loss_poly: 0.8849 2024/01/08 08:57:31 - mmengine - INFO - Saving checkpoint at 342000 iterations 2024/01/08 09:10:45 - mmengine - INFO - Iter(train) [342500/640000] base_lr: 8.8978e-05 lr: 8.8978e-06 eta: 5 days, 8:09:40 time: 1.5281 data_time: 0.0196 memory: 25568 grad_norm: 2.9763 loss: 1.1936 caption_loss_cls: 2.1051 detection_loss_cls: 0.0287 detection_loss_reg: 0.3196 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.3952 instance_segmentation_loss_cls: 0.0297 instance_segmentation_loss_reg: 0.3328 instance_segmentation_loss_poly: 0.8855 2024/01/08 09:23:44 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 09:23:44 - mmengine - INFO - Iter(train) [343000/640000] base_lr: 8.8734e-05 lr: 8.8734e-06 eta: 5 days, 7:57:35 time: 1.5288 data_time: 0.0199 memory: 25568 grad_norm: 3.0350 loss: 1.1985 caption_loss_cls: 2.1010 detection_loss_cls: 0.0289 detection_loss_reg: 0.3216 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.3937 instance_segmentation_loss_cls: 0.0296 instance_segmentation_loss_reg: 0.3322 instance_segmentation_loss_poly: 0.8841 2024/01/08 09:36:11 - mmengine - INFO - Iter(train) [343500/640000] base_lr: 8.8491e-05 lr: 8.8491e-06 eta: 5 days, 7:35:34 time: 1.5271 data_time: 0.0201 memory: 25568 grad_norm: 3.0098 loss: 1.1979 caption_loss_cls: 2.1000 detection_loss_cls: 0.0288 detection_loss_reg: 0.3218 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.3904 instance_segmentation_loss_cls: 0.0297 instance_segmentation_loss_reg: 0.3328 instance_segmentation_loss_poly: 0.8845 2024/01/08 09:49:04 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 09:49:04 - mmengine - INFO - Iter(train) [344000/640000] base_lr: 8.8247e-05 lr: 8.8247e-06 eta: 5 days, 7:22:01 time: 1.5232 data_time: 0.0203 memory: 25568 grad_norm: 3.0039 loss: 1.1829 caption_loss_cls: 2.0903 detection_loss_cls: 0.0288 detection_loss_reg: 0.3220 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.3952 instance_segmentation_loss_cls: 0.0297 instance_segmentation_loss_reg: 0.3322 instance_segmentation_loss_poly: 0.8827 2024/01/08 09:49:04 - mmengine - INFO - Saving checkpoint at 344000 iterations 2024/01/08 10:02:21 - mmengine - INFO - Iter(train) [344500/640000] base_lr: 8.8003e-05 lr: 8.8003e-06 eta: 5 days, 7:15:49 time: 1.5389 data_time: 0.0271 memory: 25568 grad_norm: 2.9819 loss: 1.1871 caption_loss_cls: 2.0895 detection_loss_cls: 0.0288 detection_loss_reg: 0.3209 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.3948 instance_segmentation_loss_cls: 0.0296 instance_segmentation_loss_reg: 0.3319 instance_segmentation_loss_poly: 0.8817 2024/01/08 10:14:49 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 10:14:49 - mmengine - INFO - Iter(train) [345000/640000] base_lr: 8.7759e-05 lr: 8.7759e-06 eta: 5 days, 6:54:44 time: 1.5324 data_time: 0.0271 memory: 25568 grad_norm: 2.9668 loss: 1.1949 caption_loss_cls: 2.0907 detection_loss_cls: 0.0288 detection_loss_reg: 0.3210 semantic_segmentation_loss_cls: 0.0076 grounding_loss_reg: 2.3938 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3320 instance_segmentation_loss_poly: 0.8833 2024/01/08 10:27:18 - mmengine - INFO - Iter(train) [345500/640000] base_lr: 8.7516e-05 lr: 8.7516e-06 eta: 5 days, 6:34:41 time: 1.5328 data_time: 0.0271 memory: 25568 grad_norm: 2.9261 loss: 1.2105 caption_loss_cls: 2.0865 detection_loss_cls: 0.0286 detection_loss_reg: 0.3202 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.3914 instance_segmentation_loss_cls: 0.0297 instance_segmentation_loss_reg: 0.3315 instance_segmentation_loss_poly: 0.8820 2024/01/08 10:40:00 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 10:40:00 - mmengine - INFO - Iter(train) [346000/640000] base_lr: 8.7272e-05 lr: 8.7272e-06 eta: 5 days, 6:18:37 time: 1.5366 data_time: 0.0271 memory: 25568 grad_norm: 2.8567 loss: 1.2110 caption_loss_cls: 2.0900 detection_loss_cls: 0.0286 detection_loss_reg: 0.3194 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.3938 instance_segmentation_loss_cls: 0.0297 instance_segmentation_loss_reg: 0.3313 instance_segmentation_loss_poly: 0.8813 2024/01/08 10:40:00 - mmengine - INFO - Saving checkpoint at 346000 iterations 2024/01/08 10:54:07 - mmengine - INFO - Iter(train) [346500/640000] base_lr: 8.7029e-05 lr: 8.7029e-06 eta: 5 days, 6:24:59 time: 1.5496 data_time: 0.0267 memory: 25568 grad_norm: 2.8825 loss: 1.2207 caption_loss_cls: 2.0887 detection_loss_cls: 0.0286 detection_loss_reg: 0.3194 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.3963 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3313 instance_segmentation_loss_poly: 0.8802 2024/01/08 11:06:19 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 11:06:19 - mmengine - INFO - Iter(train) [347000/640000] base_lr: 8.6786e-05 lr: 8.6786e-06 eta: 5 days, 6:00:52 time: 1.5381 data_time: 0.0266 memory: 25568 grad_norm: 2.8552 loss: 1.2217 caption_loss_cls: 2.0855 detection_loss_cls: 0.0287 detection_loss_reg: 0.3208 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.3968 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3320 instance_segmentation_loss_poly: 0.8805 2024/01/08 11:19:19 - mmengine - INFO - Iter(train) [347500/640000] base_lr: 8.6542e-05 lr: 8.6542e-06 eta: 5 days, 5:49:34 time: 1.5465 data_time: 0.0268 memory: 25568 grad_norm: 2.8092 loss: 1.2197 caption_loss_cls: 2.0850 detection_loss_cls: 0.0287 detection_loss_reg: 0.3206 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.3944 instance_segmentation_loss_cls: 0.0300 instance_segmentation_loss_reg: 0.3339 instance_segmentation_loss_poly: 0.8847 2024/01/08 11:31:43 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 11:31:43 - mmengine - INFO - Iter(train) [348000/640000] base_lr: 8.6299e-05 lr: 8.6299e-06 eta: 5 days, 5:28:59 time: 1.5390 data_time: 0.0266 memory: 25568 grad_norm: 2.8613 loss: 1.2307 caption_loss_cls: 2.0838 detection_loss_cls: 0.0287 detection_loss_reg: 0.3216 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.3938 instance_segmentation_loss_cls: 0.0299 instance_segmentation_loss_reg: 0.3335 instance_segmentation_loss_poly: 0.8845 2024/01/08 11:31:43 - mmengine - INFO - Saving checkpoint at 348000 iterations 2024/01/08 11:44:59 - mmengine - INFO - Iter(train) [348500/640000] base_lr: 8.6056e-05 lr: 8.6056e-06 eta: 5 days, 5:21:31 time: 1.5389 data_time: 0.0267 memory: 25568 grad_norm: 2.8382 loss: 1.2313 caption_loss_cls: 2.0849 detection_loss_cls: 0.0286 detection_loss_reg: 0.3226 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.3962 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3328 instance_segmentation_loss_poly: 0.8831 2024/01/08 11:57:19 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 11:57:19 - mmengine - INFO - Iter(train) [349000/640000] base_lr: 8.5813e-05 lr: 8.5813e-06 eta: 5 days, 5:00:38 time: 1.5370 data_time: 0.0266 memory: 25568 grad_norm: 2.8619 loss: 1.2312 caption_loss_cls: 2.0776 detection_loss_cls: 0.0286 detection_loss_reg: 0.3221 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.3972 instance_segmentation_loss_cls: 0.0300 instance_segmentation_loss_reg: 0.3340 instance_segmentation_loss_poly: 0.8865 2024/01/08 12:10:03 - mmengine - INFO - Iter(train) [349500/640000] base_lr: 8.5570e-05 lr: 8.5570e-06 eta: 5 days, 4:45:34 time: 1.5407 data_time: 0.0266 memory: 25568 grad_norm: 2.8899 loss: 1.2245 caption_loss_cls: 2.0807 detection_loss_cls: 0.0286 detection_loss_reg: 0.3225 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.3992 instance_segmentation_loss_cls: 0.0298 instance_segmentation_loss_reg: 0.3322 instance_segmentation_loss_poly: 0.8816 2024/01/08 12:23:03 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 12:23:03 - mmengine - INFO - Iter(train) [350000/640000] base_lr: 8.5327e-05 lr: 8.5327e-06 eta: 5 days, 4:34:03 time: 1.5450 data_time: 0.0265 memory: 25568 grad_norm: 2.9032 loss: 1.2139 caption_loss_cls: 2.0819 detection_loss_cls: 0.0286 detection_loss_reg: 0.3223 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.3985 instance_segmentation_loss_cls: 0.0296 instance_segmentation_loss_reg: 0.3316 instance_segmentation_loss_poly: 0.8801 2024/01/08 12:23:03 - mmengine - INFO - Saving checkpoint at 350000 iterations 2024/01/08 12:36:16 - mmengine - INFO - Iter(train) [350500/640000] base_lr: 8.5085e-05 lr: 8.5085e-06 eta: 5 days, 4:25:29 time: 1.5318 data_time: 0.0264 memory: 25568 grad_norm: 2.8821 loss: 1.1950 caption_loss_cls: 2.0806 detection_loss_cls: 0.0286 detection_loss_reg: 0.3216 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.3975 instance_segmentation_loss_cls: 0.0295 instance_segmentation_loss_reg: 0.3303 instance_segmentation_loss_poly: 0.8766 2024/01/08 12:48:29 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 12:48:29 - mmengine - INFO - Iter(train) [351000/640000] base_lr: 8.4842e-05 lr: 8.4842e-06 eta: 5 days, 4:03:57 time: 1.5320 data_time: 0.0264 memory: 25568 grad_norm: 2.8590 loss: 1.1982 caption_loss_cls: 2.0770 detection_loss_cls: 0.0287 detection_loss_reg: 0.3216 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.4012 instance_segmentation_loss_cls: 0.0295 instance_segmentation_loss_reg: 0.3311 instance_segmentation_loss_poly: 0.8783 2024/01/08 13:00:52 - mmengine - INFO - Iter(train) [351500/640000] base_lr: 8.4600e-05 lr: 8.4600e-06 eta: 5 days, 3:44:58 time: 1.5225 data_time: 0.0262 memory: 25568 grad_norm: 2.9213 loss: 1.2078 caption_loss_cls: 2.0794 detection_loss_cls: 0.0286 detection_loss_reg: 0.3206 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.4000 instance_segmentation_loss_cls: 0.0296 instance_segmentation_loss_reg: 0.3306 instance_segmentation_loss_poly: 0.8779 2024/01/08 13:13:24 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 13:13:24 - mmengine - INFO - Iter(train) [352000/640000] base_lr: 8.4357e-05 lr: 8.4357e-06 eta: 5 days, 3:27:54 time: 1.5246 data_time: 0.0264 memory: 25568 grad_norm: 2.9718 loss: 1.2185 caption_loss_cls: 2.0792 detection_loss_cls: 0.0287 detection_loss_reg: 0.3214 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.3982 instance_segmentation_loss_cls: 0.0296 instance_segmentation_loss_reg: 0.3313 instance_segmentation_loss_poly: 0.8798 2024/01/08 13:13:24 - mmengine - INFO - Saving checkpoint at 352000 iterations 2024/01/08 13:26:33 - mmengine - INFO - Iter(train) [352500/640000] base_lr: 8.4115e-05 lr: 8.4115e-06 eta: 5 days, 3:18:25 time: 1.5227 data_time: 0.0263 memory: 25568 grad_norm: 3.0093 loss: 1.2140 caption_loss_cls: 2.0755 detection_loss_cls: 0.0285 detection_loss_reg: 0.3210 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.3979 instance_segmentation_loss_cls: 0.0295 instance_segmentation_loss_reg: 0.3306 instance_segmentation_loss_poly: 0.8778 2024/01/08 13:39:49 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 13:39:49 - mmengine - INFO - Iter(train) [353000/640000] base_lr: 8.3872e-05 lr: 8.3872e-06 eta: 5 days, 3:10:07 time: 1.5368 data_time: 0.0265 memory: 25568 grad_norm: 2.9873 loss: 1.2096 caption_loss_cls: 2.0731 detection_loss_cls: 0.0284 detection_loss_reg: 0.3203 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.3987 instance_segmentation_loss_cls: 0.0296 instance_segmentation_loss_reg: 0.3322 instance_segmentation_loss_poly: 0.8812 2024/01/08 13:52:09 - mmengine - INFO - Iter(train) [353500/640000] base_lr: 8.3630e-05 lr: 8.3630e-06 eta: 5 days, 2:51:03 time: 1.5308 data_time: 0.0265 memory: 25568 grad_norm: 3.0711 loss: 1.2080 caption_loss_cls: 2.0705 detection_loss_cls: 0.0285 detection_loss_reg: 0.3214 semantic_segmentation_loss_cls: 0.0075 grounding_loss_reg: 2.4003 instance_segmentation_loss_cls: 0.0293 instance_segmentation_loss_reg: 0.3304 instance_segmentation_loss_poly: 0.8767 2024/01/08 14:04:07 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 14:04:07 - mmengine - INFO - Iter(train) [354000/640000] base_lr: 8.3388e-05 lr: 8.3388e-06 eta: 5 days, 2:28:13 time: 1.5153 data_time: 0.0264 memory: 25568 grad_norm: 3.1218 loss: 1.2295 caption_loss_cls: 2.0750 detection_loss_cls: 0.0285 detection_loss_reg: 0.3220 semantic_segmentation_loss_cls: 0.0074 grounding_loss_reg: 2.4007 instance_segmentation_loss_cls: 0.0294 instance_segmentation_loss_reg: 0.3311 instance_segmentation_loss_poly: 0.8786 2024/01/08 14:04:07 - mmengine - INFO - Saving checkpoint at 354000 iterations 2024/01/08 14:16:56 - mmengine - INFO - Iter(train) [354500/640000] base_lr: 8.3146e-05 lr: 8.3146e-06 eta: 5 days, 2:14:59 time: 1.5092 data_time: 0.0265 memory: 25568 grad_norm: 3.1296 loss: 1.2442 caption_loss_cls: 2.0749 detection_loss_cls: 0.0285 detection_loss_reg: 0.3212 semantic_segmentation_loss_cls: 0.0074 grounding_loss_reg: 2.4014 instance_segmentation_loss_cls: 0.0293 instance_segmentation_loss_reg: 0.3309 instance_segmentation_loss_poly: 0.8778 2024/01/08 14:29:44 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 14:29:44 - mmengine - INFO - Iter(train) [355000/640000] base_lr: 8.2904e-05 lr: 8.2904e-06 eta: 5 days, 2:01:42 time: 1.5181 data_time: 0.0265 memory: 25568 grad_norm: 3.1322 loss: 1.2358 caption_loss_cls: 2.0663 detection_loss_cls: 0.0285 detection_loss_reg: 0.3223 semantic_segmentation_loss_cls: 0.0074 grounding_loss_reg: 2.4026 instance_segmentation_loss_cls: 0.0290 instance_segmentation_loss_reg: 0.3289 instance_segmentation_loss_poly: 0.8738 2024/01/08 14:41:58 - mmengine - INFO - Iter(train) [355500/640000] base_lr: 8.2663e-05 lr: 8.2663e-06 eta: 5 days, 1:42:25 time: 1.5157 data_time: 0.0265 memory: 25568 grad_norm: 3.1414 loss: 1.2379 caption_loss_cls: 2.0740 detection_loss_cls: 0.0286 detection_loss_reg: 0.3220 semantic_segmentation_loss_cls: 0.0074 grounding_loss_reg: 2.4008 instance_segmentation_loss_cls: 0.0289 instance_segmentation_loss_reg: 0.3283 instance_segmentation_loss_poly: 0.8738 2024/01/08 14:54:49 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 14:54:49 - mmengine - INFO - Iter(train) [356000/640000] base_lr: 8.2421e-05 lr: 8.2421e-06 eta: 5 days, 1:29:39 time: 1.5206 data_time: 0.0264 memory: 25568 grad_norm: 3.0631 loss: 1.2293 caption_loss_cls: 2.0728 detection_loss_cls: 0.0286 detection_loss_reg: 0.3230 semantic_segmentation_loss_cls: 0.0074 grounding_loss_reg: 2.4013 instance_segmentation_loss_cls: 0.0289 instance_segmentation_loss_reg: 0.3279 instance_segmentation_loss_poly: 0.8731 2024/01/08 14:54:49 - mmengine - INFO - Saving checkpoint at 356000 iterations 2024/01/08 15:08:21 - mmengine - INFO - Iter(train) [356500/640000] base_lr: 8.2179e-05 lr: 8.2179e-06 eta: 5 days, 1:23:43 time: 1.5263 data_time: 0.0264 memory: 25568 grad_norm: 3.0417 loss: 1.2279 caption_loss_cls: 2.0757 detection_loss_cls: 0.0285 detection_loss_reg: 0.3237 semantic_segmentation_loss_cls: 0.0074 grounding_loss_reg: 2.4002 instance_segmentation_loss_cls: 0.0287 instance_segmentation_loss_reg: 0.3265 instance_segmentation_loss_poly: 0.8701 2024/01/08 15:20:50 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 15:20:50 - mmengine - INFO - Iter(train) [357000/640000] base_lr: 8.1938e-05 lr: 8.1938e-06 eta: 5 days, 1:07:21 time: 1.5146 data_time: 0.0263 memory: 25568 grad_norm: 3.0959 loss: 1.2290 caption_loss_cls: 2.0750 detection_loss_cls: 0.0285 detection_loss_reg: 0.3240 semantic_segmentation_loss_cls: 0.0074 grounding_loss_reg: 2.4005 instance_segmentation_loss_cls: 0.0285 instance_segmentation_loss_reg: 0.3255 instance_segmentation_loss_poly: 0.8671 2024/01/08 15:33:30 - mmengine - INFO - Iter(train) [357500/640000] base_lr: 8.1696e-05 lr: 8.1696e-06 eta: 5 days, 0:52:43 time: 1.5195 data_time: 0.0265 memory: 25568 grad_norm: 3.0360 loss: 1.2250 caption_loss_cls: 2.0740 detection_loss_cls: 0.0286 detection_loss_reg: 0.3242 semantic_segmentation_loss_cls: 0.0074 grounding_loss_reg: 2.4000 instance_segmentation_loss_cls: 0.0285 instance_segmentation_loss_reg: 0.3260 instance_segmentation_loss_poly: 0.8680 2024/01/08 15:46:59 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 15:46:59 - mmengine - INFO - Iter(train) [358000/640000] base_lr: 8.1455e-05 lr: 8.1455e-06 eta: 5 days, 0:45:55 time: 1.5423 data_time: 0.0268 memory: 25568 grad_norm: 3.0221 loss: 1.2116 caption_loss_cls: 2.0690 detection_loss_cls: 0.0287 detection_loss_reg: 0.3253 semantic_segmentation_loss_cls: 0.0074 grounding_loss_reg: 2.3981 instance_segmentation_loss_cls: 0.0285 instance_segmentation_loss_reg: 0.3265 instance_segmentation_loss_poly: 0.8691 2024/01/08 15:46:59 - mmengine - INFO - Saving checkpoint at 358000 iterations 2024/01/08 15:59:48 - mmengine - INFO - Iter(train) [358500/640000] base_lr: 8.1214e-05 lr: 8.1214e-06 eta: 5 days, 0:32:49 time: 1.5425 data_time: 0.0268 memory: 25568 grad_norm: 3.1305 loss: 1.2163 caption_loss_cls: 2.0725 detection_loss_cls: 0.0287 detection_loss_reg: 0.3250 semantic_segmentation_loss_cls: 0.0074 grounding_loss_reg: 2.4009 instance_segmentation_loss_cls: 0.0284 instance_segmentation_loss_reg: 0.3260 instance_segmentation_loss_poly: 0.8680 2024/01/08 16:12:43 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 16:12:43 - mmengine - INFO - Iter(train) [359000/640000] base_lr: 8.0973e-05 lr: 8.0973e-06 eta: 5 days, 0:20:28 time: 1.5440 data_time: 0.0269 memory: 25568 grad_norm: 3.1933 loss: 1.2254 caption_loss_cls: 2.0749 detection_loss_cls: 0.0288 detection_loss_reg: 0.3257 semantic_segmentation_loss_cls: 0.0074 grounding_loss_reg: 2.4014 instance_segmentation_loss_cls: 0.0284 instance_segmentation_loss_reg: 0.3263 instance_segmentation_loss_poly: 0.8685 2024/01/08 16:24:52 - mmengine - INFO - Iter(train) [359500/640000] base_lr: 8.0732e-05 lr: 8.0732e-06 eta: 5 days, 0:01:22 time: 1.5428 data_time: 0.0269 memory: 25568 grad_norm: 3.2100 loss: 1.2176 caption_loss_cls: 2.0729 detection_loss_cls: 0.0288 detection_loss_reg: 0.3257 semantic_segmentation_loss_cls: 0.0074 grounding_loss_reg: 2.3997 instance_segmentation_loss_cls: 0.0283 instance_segmentation_loss_reg: 0.3250 instance_segmentation_loss_poly: 0.8666 2024/01/08 16:38:08 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 16:38:08 - mmengine - INFO - Iter(train) [360000/640000] base_lr: 8.0491e-05 lr: 8.0491e-06 eta: 4 days, 23:52:18 time: 1.5492 data_time: 0.0269 memory: 25568 grad_norm: 3.2180 loss: 1.2066 caption_loss_cls: 2.0700 detection_loss_cls: 0.0285 detection_loss_reg: 0.3244 semantic_segmentation_loss_cls: 0.0073 grounding_loss_reg: 2.3956 instance_segmentation_loss_cls: 0.0285 instance_segmentation_loss_reg: 0.3262 instance_segmentation_loss_poly: 0.8693 2024/01/08 16:38:08 - mmengine - INFO - Saving checkpoint at 360000 iterations 2024/01/08 16:49:59 - mmengine - INFO - Evaluating bbox... 2024/01/08 16:50:57 - mmengine - INFO - bbox_mAP_copypaste: 0.511 0.691 0.561 0.358 0.561 0.656 2024/01/08 16:50:57 - mmengine - INFO - Evaluating segm... 2024/01/08 16:52:11 - mmengine - INFO - segm_mAP_copypaste: 0.341 0.608 0.335 0.195 0.387 0.522 2024/01/08 16:54:19 - mmengine - INFO - Evaluating bbox... 2024/01/08 16:55:18 - mmengine - INFO - bbox_mAP_copypaste: 0.509 0.691 0.561 0.357 0.560 0.656 2024/01/08 17:01:55 - mmengine - INFO - per class results: 2024/01/08 17:01:55 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 78.45 | 88.6 | | building | 82.28 | 92.01 | | sky | 93.36 | 97.72 | | floor | 82.74 | 90.23 | | tree | 74.43 | 87.81 | | ceiling | 86.15 | 92.98 | | road | 84.01 | 89.52 | | bed | 90.51 | 95.36 | | windowpane | 63.59 | 81.6 | | grass | 64.54 | 78.97 | | cabinet | 62.72 | 75.32 | | sidewalk | 67.7 | 82.36 | | person | 82.02 | 91.44 | | earth | 36.88 | 47.13 | | door | 55.12 | 74.74 | | table | 63.21 | 80.66 | | mountain | 53.94 | 64.8 | | plant | 51.92 | 63.51 | | curtain | 76.24 | 87.71 | | chair | 61.35 | 72.56 | | car | 84.71 | 91.49 | | water | 62.35 | 73.97 | | painting | 71.89 | 88.87 | | sofa | 70.81 | 80.18 | | shelf | 45.37 | 71.41 | | house | 46.53 | 63.4 | | sea | 62.26 | 78.13 | | mirror | 66.74 | 75.54 | | rug | 67.2 | 79.29 | | field | 31.95 | 60.69 | | armchair | 51.48 | 71.43 | | seat | 66.29 | 80.39 | | fence | 45.65 | 64.8 | | desk | 54.57 | 70.31 | | rock | 48.09 | 70.3 | | wardrobe | 47.87 | 72.52 | | lamp | 63.5 | 79.65 | | bathtub | 80.57 | 90.35 | | railing | 42.67 | 55.87 | | cushion | 63.51 | 79.54 | | base | 22.92 | 33.5 | | box | 28.76 | 39.08 | | column | 56.58 | 69.53 | | signboard | 37.87 | 55.24 | | chest of drawers | 34.77 | 46.11 | | counter | 27.62 | 38.69 | | sand | 43.58 | 63.7 | | sink | 77.6 | 84.31 | | skyscraper | 48.49 | 61.4 | | fireplace | 74.8 | 85.41 | | refrigerator | 70.76 | 81.1 | | grandstand | 52.21 | 78.38 | | path | 19.57 | 35.07 | | stairs | 41.54 | 52.53 | | runway | 74.93 | 96.31 | | case | 53.36 | 62.18 | | pool table | 91.88 | 95.09 | | pillow | 61.41 | 72.23 | | screen door | 67.14 | 71.77 | | stairway | 33.93 | 38.83 | | river | 15.07 | 32.62 | | bridge | 66.75 | 75.24 | | bookcase | 40.16 | 54.91 | | blind | 40.78 | 43.55 | | coffee table | 64.46 | 77.53 | | toilet | 85.77 | 91.12 | | flower | 38.85 | 52.26 | | book | 45.58 | 63.31 | | hill | 13.63 | 19.68 | | bench | 58.42 | 73.69 | | countertop | 58.67 | 81.16 | | stove | 79.61 | 85.74 | | palm | 48.43 | 73.78 | | kitchen island | 42.72 | 56.58 | | computer | 75.36 | 87.63 | | swivel chair | 49.53 | 68.77 | | boat | 57.07 | 73.31 | | bar | 33.04 | 43.7 | | arcade machine | 31.93 | 33.17 | | hovel | 17.65 | 19.27 | | bus | 91.55 | 93.48 | | towel | 68.6 | 80.98 | | light | 54.56 | 65.98 | | truck | 48.68 | 61.38 | | tower | 33.85 | 54.93 | | chandelier | 66.65 | 81.07 | | awning | 29.16 | 39.33 | | streetlight | 32.48 | 44.24 | | booth | 37.55 | 58.95 | | television receiver | 76.11 | 82.1 | | airplane | 72.48 | 78.63 | | dirt track | 9.27 | 30.45 | | apparel | 32.8 | 53.64 | | pole | 26.44 | 38.05 | | land | 0.75 | 1.0 | | bannister | 16.55 | 24.66 | | escalator | 27.7 | 31.21 | | ottoman | 51.48 | 68.11 | | bottle | 25.68 | 33.67 | | buffet | 51.39 | 62.38 | | poster | 35.65 | 52.38 | | stage | 10.61 | 16.63 | | van | 44.41 | 57.14 | | ship | 59.13 | 68.79 | | fountain | 25.9 | 26.29 | | conveyer belt | 65.95 | 91.92 | | canopy | 40.67 | 51.67 | | washer | 69.89 | 72.34 | | plaything | 28.41 | 36.38 | | swimming pool | 61.5 | 67.46 | | stool | 44.05 | 69.03 | | barrel | 15.59 | 66.63 | | basket | 36.63 | 48.24 | | waterfall | 63.93 | 92.59 | | tent | 77.2 | 97.06 | | bag | 21.52 | 27.41 | | minibike | 72.77 | 84.66 | | cradle | 80.65 | 97.33 | | oven | 45.69 | 54.52 | | ball | 47.77 | 57.62 | | food | 53.39 | 57.37 | | step | 8.71 | 11.43 | | tank | 54.12 | 58.01 | | trade name | 25.22 | 29.36 | | microwave | 85.69 | 94.01 | | pot | 50.1 | 58.9 | | animal | 61.37 | 64.96 | | bicycle | 57.36 | 69.4 | | lake | 52.84 | 62.96 | | dishwasher | 72.22 | 83.41 | | screen | 67.83 | 83.62 | | blanket | 35.01 | 47.15 | | sculpture | 66.71 | 79.39 | | hood | 61.59 | 73.35 | | sconce | 41.13 | 69.37 | | vase | 44.34 | 64.36 | | traffic light | 40.38 | 54.77 | | tray | 17.47 | 32.39 | | ashcan | 40.37 | 53.71 | | fan | 64.37 | 74.98 | | pier | 41.65 | 56.85 | | crt screen | 5.05 | 14.04 | | plate | 59.16 | 77.01 | | monitor | 15.41 | 17.32 | | bulletin board | 47.83 | 57.11 | | shower | 1.28 | 1.3 | | radiator | 62.29 | 78.27 | | glass | 15.86 | 17.1 | | clock | 35.15 | 41.1 | | flag | 42.71 | 49.89 | +---------------------+-------+-------+ 2024/01/08 17:02:07 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.5090 coco/bbox_mAP_50: 0.6910 coco/bbox_mAP_75: 0.5610 coco/bbox_mAP_s: 0.3570 coco/bbox_mAP_m: 0.5600 coco/bbox_mAP_l: 0.6560 coco/segm_mAP: 0.3410 coco/segm_mAP_50: 0.6080 coco/segm_mAP_75: 0.3350 coco/segm_mAP_s: 0.1950 coco/segm_mAP_m: 0.3870 coco/segm_mAP_l: 0.5220 Bleu_1: 0.7601 Bleu_2: 0.5952 Bleu_3: 0.4541 Bleu_4: 0.3430 METEOR: 0.2734 ROUGE_L: 0.5616 CIDEr: 1.1004 SPICE: 0.2023 aAcc: 83.7900 mIoU: 51.3100 mAcc: 63.6900 visual-grounding/miou: 0.8138 visual-grounding/acc: 0.8771 data_time: 0.0115 time: 1.9016 2024/01/08 17:14:37 - mmengine - INFO - Iter(train) [360500/640000] base_lr: 8.0251e-05 lr: 8.0251e-06 eta: 4 days, 23:36:44 time: 1.5342 data_time: 0.0203 memory: 34658 grad_norm: 3.2730 loss: 1.2024 caption_loss_cls: 2.0719 detection_loss_cls: 0.0285 detection_loss_reg: 0.3239 semantic_segmentation_loss_cls: 0.0074 grounding_loss_reg: 2.3934 instance_segmentation_loss_cls: 0.0284 instance_segmentation_loss_reg: 0.3259 instance_segmentation_loss_poly: 0.8693 2024/01/08 17:27:13 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 17:27:13 - mmengine - INFO - Iter(train) [361000/640000] base_lr: 8.0010e-05 lr: 8.0010e-06 eta: 4 days, 23:21:47 time: 1.5357 data_time: 0.0204 memory: 25567 grad_norm: 3.2648 loss: 1.2161 caption_loss_cls: 2.0778 detection_loss_cls: 0.0287 detection_loss_reg: 0.3254 semantic_segmentation_loss_cls: 0.0074 grounding_loss_reg: 2.3900 instance_segmentation_loss_cls: 0.0286 instance_segmentation_loss_reg: 0.3263 instance_segmentation_loss_poly: 0.8705 2024/01/08 17:40:03 - mmengine - INFO - Iter(train) [361500/640000] base_lr: 7.9770e-05 lr: 7.9770e-06 eta: 4 days, 23:08:59 time: 1.5385 data_time: 0.0204 memory: 25567 grad_norm: 3.2453 loss: 1.2233 caption_loss_cls: 2.0844 detection_loss_cls: 0.0289 detection_loss_reg: 0.3280 semantic_segmentation_loss_cls: 0.0074 grounding_loss_reg: 2.3898 instance_segmentation_loss_cls: 0.0286 instance_segmentation_loss_reg: 0.3276 instance_segmentation_loss_poly: 0.8736 2024/01/08 17:52:33 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 17:52:33 - mmengine - INFO - Iter(train) [362000/640000] base_lr: 7.9530e-05 lr: 7.9530e-06 eta: 4 days, 22:53:23 time: 1.5238 data_time: 0.0201 memory: 25567 grad_norm: 3.2667 loss: 1.2192 caption_loss_cls: 2.0891 detection_loss_cls: 0.0289 detection_loss_reg: 0.3297 semantic_segmentation_loss_cls: 0.0073 grounding_loss_reg: 2.3874 instance_segmentation_loss_cls: 0.0287 instance_segmentation_loss_reg: 0.3274 instance_segmentation_loss_poly: 0.8735 2024/01/08 17:52:33 - mmengine - INFO - Saving checkpoint at 362000 iterations 2024/01/08 18:05:31 - mmengine - INFO - Iter(train) [362500/640000] base_lr: 7.9289e-05 lr: 7.9289e-06 eta: 4 days, 22:41:32 time: 1.5258 data_time: 0.0204 memory: 25567 grad_norm: 3.1623 loss: 1.2157 caption_loss_cls: 2.0861 detection_loss_cls: 0.0290 detection_loss_reg: 0.3313 semantic_segmentation_loss_cls: 0.0073 grounding_loss_reg: 2.3844 instance_segmentation_loss_cls: 0.0288 instance_segmentation_loss_reg: 0.3278 instance_segmentation_loss_poly: 0.8721 2024/01/08 18:18:02 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 18:18:02 - mmengine - INFO - Iter(train) [363000/640000] base_lr: 7.9049e-05 lr: 7.9049e-06 eta: 4 days, 22:26:12 time: 1.5200 data_time: 0.0203 memory: 25567 grad_norm: 3.0834 loss: 1.2054 caption_loss_cls: 2.0907 detection_loss_cls: 0.0289 detection_loss_reg: 0.3305 semantic_segmentation_loss_cls: 0.0073 grounding_loss_reg: 2.3818 instance_segmentation_loss_cls: 0.0288 instance_segmentation_loss_reg: 0.3289 instance_segmentation_loss_poly: 0.8735 2024/01/08 18:30:38 - mmengine - INFO - Iter(train) [363500/640000] base_lr: 7.8809e-05 lr: 7.8809e-06 eta: 4 days, 22:11:35 time: 1.5268 data_time: 0.0204 memory: 25567 grad_norm: 3.0209 loss: 1.2057 caption_loss_cls: 2.0922 detection_loss_cls: 0.0291 detection_loss_reg: 0.3320 semantic_segmentation_loss_cls: 0.0073 grounding_loss_reg: 2.3797 instance_segmentation_loss_cls: 0.0289 instance_segmentation_loss_reg: 0.3295 instance_segmentation_loss_poly: 0.8745 2024/01/08 18:43:18 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 18:43:18 - mmengine - INFO - Iter(train) [364000/640000] base_lr: 7.8570e-05 lr: 7.8570e-06 eta: 4 days, 21:57:30 time: 1.5176 data_time: 0.0203 memory: 25567 grad_norm: 3.0137 loss: 1.2125 caption_loss_cls: 2.0935 detection_loss_cls: 0.0291 detection_loss_reg: 0.3322 semantic_segmentation_loss_cls: 0.0074 grounding_loss_reg: 2.3761 instance_segmentation_loss_cls: 0.0287 instance_segmentation_loss_reg: 0.3279 instance_segmentation_loss_poly: 0.8716 2024/01/08 18:43:18 - mmengine - INFO - Saving checkpoint at 364000 iterations 2024/01/08 18:56:36 - mmengine - INFO - Iter(train) [364500/640000] base_lr: 7.8330e-05 lr: 7.8330e-06 eta: 4 days, 21:48:11 time: 1.5290 data_time: 0.0271 memory: 25567 grad_norm: 2.9864 loss: 1.2246 caption_loss_cls: 2.0940 detection_loss_cls: 0.0292 detection_loss_reg: 0.3333 semantic_segmentation_loss_cls: 0.0073 grounding_loss_reg: 2.3743 instance_segmentation_loss_cls: 0.0288 instance_segmentation_loss_reg: 0.3295 instance_segmentation_loss_poly: 0.8741 2024/01/08 19:09:39 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 19:09:39 - mmengine - INFO - Iter(train) [365000/640000] base_lr: 7.8090e-05 lr: 7.8090e-06 eta: 4 days, 21:36:58 time: 1.5359 data_time: 0.0270 memory: 25567 grad_norm: 2.9384 loss: 1.1963 caption_loss_cls: 2.0887 detection_loss_cls: 0.0292 detection_loss_reg: 0.3337 semantic_segmentation_loss_cls: 0.0073 grounding_loss_reg: 2.3715 instance_segmentation_loss_cls: 0.0285 instance_segmentation_loss_reg: 0.3255 instance_segmentation_loss_poly: 0.8652 2024/01/08 19:22:17 - mmengine - INFO - Iter(train) [365500/640000] base_lr: 7.7851e-05 lr: 7.7851e-06 eta: 4 days, 21:22:40 time: 1.5327 data_time: 0.0269 memory: 25567 grad_norm: 2.9434 loss: 1.1944 caption_loss_cls: 2.0884 detection_loss_cls: 0.0292 detection_loss_reg: 0.3332 semantic_segmentation_loss_cls: 0.0073 grounding_loss_reg: 2.3662 instance_segmentation_loss_cls: 0.0285 instance_segmentation_loss_reg: 0.3265 instance_segmentation_loss_poly: 0.8668 2024/01/08 19:34:42 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 19:34:42 - mmengine - INFO - Iter(train) [366000/640000] base_lr: 7.7612e-05 lr: 7.7612e-06 eta: 4 days, 21:06:47 time: 1.5313 data_time: 0.0269 memory: 25567 grad_norm: 2.9052 loss: 1.1926 caption_loss_cls: 2.0874 detection_loss_cls: 0.0292 detection_loss_reg: 0.3337 semantic_segmentation_loss_cls: 0.0073 grounding_loss_reg: 2.3623 instance_segmentation_loss_cls: 0.0283 instance_segmentation_loss_reg: 0.3256 instance_segmentation_loss_poly: 0.8642 2024/01/08 19:34:42 - mmengine - INFO - Saving checkpoint at 366000 iterations 2024/01/08 19:48:08 - mmengine - INFO - Iter(train) [366500/640000] base_lr: 7.7373e-05 lr: 7.7373e-06 eta: 4 days, 20:58:15 time: 1.5385 data_time: 0.0270 memory: 25567 grad_norm: 2.9380 loss: 1.1908 caption_loss_cls: 2.0860 detection_loss_cls: 0.0293 detection_loss_reg: 0.3353 semantic_segmentation_loss_cls: 0.0073 grounding_loss_reg: 2.3616 instance_segmentation_loss_cls: 0.0282 instance_segmentation_loss_reg: 0.3262 instance_segmentation_loss_poly: 0.8638 2024/01/08 20:00:22 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 20:00:22 - mmengine - INFO - Iter(train) [367000/640000] base_lr: 7.7134e-05 lr: 7.7134e-06 eta: 4 days, 20:41:15 time: 1.5343 data_time: 0.0270 memory: 25567 grad_norm: 3.0094 loss: 1.1986 caption_loss_cls: 2.0790 detection_loss_cls: 0.0295 detection_loss_reg: 0.3363 semantic_segmentation_loss_cls: 0.0073 grounding_loss_reg: 2.3586 instance_segmentation_loss_cls: 0.0283 instance_segmentation_loss_reg: 0.3275 instance_segmentation_loss_poly: 0.8663 2024/01/08 20:13:17 - mmengine - INFO - Iter(train) [367500/640000] base_lr: 7.6895e-05 lr: 7.6895e-06 eta: 4 days, 20:29:01 time: 1.5389 data_time: 0.0271 memory: 25567 grad_norm: 2.9966 loss: 1.1880 caption_loss_cls: 2.0765 detection_loss_cls: 0.0294 detection_loss_reg: 0.3363 semantic_segmentation_loss_cls: 0.0073 grounding_loss_reg: 2.3592 instance_segmentation_loss_cls: 0.0283 instance_segmentation_loss_reg: 0.3269 instance_segmentation_loss_poly: 0.8660 2024/01/08 20:26:03 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 20:26:03 - mmengine - INFO - Iter(train) [368000/640000] base_lr: 7.6656e-05 lr: 7.6656e-06 eta: 4 days, 20:15:47 time: 1.5405 data_time: 0.0272 memory: 25567 grad_norm: 3.0062 loss: 1.1886 caption_loss_cls: 2.0752 detection_loss_cls: 0.0294 detection_loss_reg: 0.3354 semantic_segmentation_loss_cls: 0.0073 grounding_loss_reg: 2.3562 instance_segmentation_loss_cls: 0.0283 instance_segmentation_loss_reg: 0.3273 instance_segmentation_loss_poly: 0.8672 2024/01/08 20:26:03 - mmengine - INFO - Saving checkpoint at 368000 iterations 2024/01/08 20:38:59 - mmengine - INFO - Iter(train) [368500/640000] base_lr: 7.6417e-05 lr: 7.6417e-06 eta: 4 days, 20:03:41 time: 1.5352 data_time: 0.0270 memory: 25567 grad_norm: 2.9685 loss: 1.1777 caption_loss_cls: 2.0691 detection_loss_cls: 0.0294 detection_loss_reg: 0.3348 semantic_segmentation_loss_cls: 0.0073 grounding_loss_reg: 2.3600 instance_segmentation_loss_cls: 0.0283 instance_segmentation_loss_reg: 0.3284 instance_segmentation_loss_poly: 0.8694 2024/01/08 20:51:26 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 20:51:26 - mmengine - INFO - Iter(train) [369000/640000] base_lr: 7.6179e-05 lr: 7.6179e-06 eta: 4 days, 19:48:22 time: 1.5262 data_time: 0.0270 memory: 25567 grad_norm: 3.0037 loss: 1.1983 caption_loss_cls: 2.0688 detection_loss_cls: 0.0292 detection_loss_reg: 0.3344 semantic_segmentation_loss_cls: 0.0073 grounding_loss_reg: 2.3577 instance_segmentation_loss_cls: 0.0284 instance_segmentation_loss_reg: 0.3283 instance_segmentation_loss_poly: 0.8690 2024/01/08 21:04:04 - mmengine - INFO - Iter(train) [369500/640000] base_lr: 7.5941e-05 lr: 7.5941e-06 eta: 4 days, 19:34:12 time: 1.5259 data_time: 0.0271 memory: 25567 grad_norm: 3.0043 loss: 1.2023 caption_loss_cls: 2.0701 detection_loss_cls: 0.0293 detection_loss_reg: 0.3347 semantic_segmentation_loss_cls: 0.0074 grounding_loss_reg: 2.3550 instance_segmentation_loss_cls: 0.0285 instance_segmentation_loss_reg: 0.3299 instance_segmentation_loss_poly: 0.8718 2024/01/08 21:16:32 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 21:16:32 - mmengine - INFO - Iter(train) [370000/640000] base_lr: 7.5702e-05 lr: 7.5702e-06 eta: 4 days, 19:19:10 time: 1.5270 data_time: 0.0272 memory: 25567 grad_norm: 3.0633 loss: 1.2136 caption_loss_cls: 2.0747 detection_loss_cls: 0.0292 detection_loss_reg: 0.3342 semantic_segmentation_loss_cls: 0.0074 grounding_loss_reg: 2.3536 instance_segmentation_loss_cls: 0.0285 instance_segmentation_loss_reg: 0.3314 instance_segmentation_loss_poly: 0.8754 2024/01/08 21:16:32 - mmengine - INFO - Saving checkpoint at 370000 iterations 2024/01/08 21:29:43 - mmengine - INFO - Iter(train) [370500/640000] base_lr: 7.5464e-05 lr: 7.5464e-06 eta: 4 days, 19:08:41 time: 1.5233 data_time: 0.0270 memory: 25567 grad_norm: 3.0297 loss: 1.2080 caption_loss_cls: 2.0765 detection_loss_cls: 0.0292 detection_loss_reg: 0.3328 semantic_segmentation_loss_cls: 0.0073 grounding_loss_reg: 2.3518 instance_segmentation_loss_cls: 0.0287 instance_segmentation_loss_reg: 0.3313 instance_segmentation_loss_poly: 0.8755 2024/01/08 21:42:39 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 21:42:39 - mmengine - INFO - Iter(train) [371000/640000] base_lr: 7.5227e-05 lr: 7.5227e-06 eta: 4 days, 18:56:30 time: 1.5335 data_time: 0.0271 memory: 25567 grad_norm: 3.0130 loss: 1.2047 caption_loss_cls: 2.0738 detection_loss_cls: 0.0292 detection_loss_reg: 0.3330 semantic_segmentation_loss_cls: 0.0073 grounding_loss_reg: 2.3483 instance_segmentation_loss_cls: 0.0286 instance_segmentation_loss_reg: 0.3308 instance_segmentation_loss_poly: 0.8746 2024/01/08 21:55:27 - mmengine - INFO - Iter(train) [371500/640000] base_lr: 7.4989e-05 lr: 7.4989e-06 eta: 4 days, 18:43:32 time: 1.5318 data_time: 0.0270 memory: 25567 grad_norm: 3.0231 loss: 1.2050 caption_loss_cls: 2.0742 detection_loss_cls: 0.0289 detection_loss_reg: 0.3330 semantic_segmentation_loss_cls: 0.0073 grounding_loss_reg: 2.3472 instance_segmentation_loss_cls: 0.0285 instance_segmentation_loss_reg: 0.3314 instance_segmentation_loss_poly: 0.8758 2024/01/08 22:08:18 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 22:08:18 - mmengine - INFO - Iter(train) [372000/640000] base_lr: 7.4751e-05 lr: 7.4751e-06 eta: 4 days, 18:30:51 time: 1.5330 data_time: 0.0270 memory: 25567 grad_norm: 3.1733 loss: 1.2047 caption_loss_cls: 2.0795 detection_loss_cls: 0.0289 detection_loss_reg: 0.3337 semantic_segmentation_loss_cls: 0.0073 grounding_loss_reg: 2.3445 instance_segmentation_loss_cls: 0.0283 instance_segmentation_loss_reg: 0.3304 instance_segmentation_loss_poly: 0.8731 2024/01/08 22:08:18 - mmengine - INFO - Saving checkpoint at 372000 iterations 2024/01/08 22:21:42 - mmengine - INFO - Iter(train) [372500/640000] base_lr: 7.4514e-05 lr: 7.4514e-06 eta: 4 days, 18:21:29 time: 1.5400 data_time: 0.0270 memory: 25567 grad_norm: 3.1747 loss: 1.1974 caption_loss_cls: 2.0752 detection_loss_cls: 0.0288 detection_loss_reg: 0.3328 semantic_segmentation_loss_cls: 0.0073 grounding_loss_reg: 2.3409 instance_segmentation_loss_cls: 0.0284 instance_segmentation_loss_reg: 0.3296 instance_segmentation_loss_poly: 0.8720 2024/01/08 22:34:17 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 22:34:17 - mmengine - INFO - Iter(train) [373000/640000] base_lr: 7.4277e-05 lr: 7.4277e-06 eta: 4 days, 18:07:15 time: 1.5420 data_time: 0.0271 memory: 25567 grad_norm: 3.1743 loss: 1.2025 caption_loss_cls: 2.0806 detection_loss_cls: 0.0288 detection_loss_reg: 0.3333 semantic_segmentation_loss_cls: 0.0073 grounding_loss_reg: 2.3366 instance_segmentation_loss_cls: 0.0284 instance_segmentation_loss_reg: 0.3307 instance_segmentation_loss_poly: 0.8752 2024/01/08 22:47:22 - mmengine - INFO - Iter(train) [373500/640000] base_lr: 7.4040e-05 lr: 7.4040e-06 eta: 4 days, 17:55:55 time: 1.5490 data_time: 0.0272 memory: 25567 grad_norm: 3.1518 loss: 1.1928 caption_loss_cls: 2.0808 detection_loss_cls: 0.0288 detection_loss_reg: 0.3325 semantic_segmentation_loss_cls: 0.0073 grounding_loss_reg: 2.3324 instance_segmentation_loss_cls: 0.0284 instance_segmentation_loss_reg: 0.3301 instance_segmentation_loss_poly: 0.8749 2024/01/08 23:00:01 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 23:00:01 - mmengine - INFO - Iter(train) [374000/640000] base_lr: 7.3803e-05 lr: 7.3803e-06 eta: 4 days, 17:41:58 time: 1.5514 data_time: 0.0271 memory: 25567 grad_norm: 3.1197 loss: 1.1781 caption_loss_cls: 2.0773 detection_loss_cls: 0.0288 detection_loss_reg: 0.3331 semantic_segmentation_loss_cls: 0.0073 grounding_loss_reg: 2.3309 instance_segmentation_loss_cls: 0.0282 instance_segmentation_loss_reg: 0.3283 instance_segmentation_loss_poly: 0.8715 2024/01/08 23:00:01 - mmengine - INFO - Saving checkpoint at 374000 iterations 2024/01/08 23:13:22 - mmengine - INFO - Iter(train) [374500/640000] base_lr: 7.3566e-05 lr: 7.3566e-06 eta: 4 days, 17:32:10 time: 1.5539 data_time: 0.0271 memory: 25567 grad_norm: 3.0797 loss: 1.1737 caption_loss_cls: 2.0777 detection_loss_cls: 0.0289 detection_loss_reg: 0.3333 semantic_segmentation_loss_cls: 0.0072 grounding_loss_reg: 2.3326 instance_segmentation_loss_cls: 0.0281 instance_segmentation_loss_reg: 0.3273 instance_segmentation_loss_poly: 0.8700 2024/01/08 23:26:09 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 23:26:09 - mmengine - INFO - Iter(train) [375000/640000] base_lr: 7.3329e-05 lr: 7.3329e-06 eta: 4 days, 17:19:01 time: 1.5517 data_time: 0.0270 memory: 25567 grad_norm: 3.0649 loss: 1.1650 caption_loss_cls: 2.0852 detection_loss_cls: 0.0288 detection_loss_reg: 0.3328 semantic_segmentation_loss_cls: 0.0072 grounding_loss_reg: 2.3260 instance_segmentation_loss_cls: 0.0281 instance_segmentation_loss_reg: 0.3276 instance_segmentation_loss_poly: 0.8695 2024/01/08 23:38:48 - mmengine - INFO - Iter(train) [375500/640000] base_lr: 7.3093e-05 lr: 7.3093e-06 eta: 4 days, 17:05:11 time: 1.5495 data_time: 0.0271 memory: 25567 grad_norm: 3.1234 loss: 1.1851 caption_loss_cls: 2.0806 detection_loss_cls: 0.0289 detection_loss_reg: 0.3331 semantic_segmentation_loss_cls: 0.0072 grounding_loss_reg: 2.3313 instance_segmentation_loss_cls: 0.0281 instance_segmentation_loss_reg: 0.3286 instance_segmentation_loss_poly: 0.8705 2024/01/08 23:52:05 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/08 23:52:05 - mmengine - INFO - Iter(train) [376000/640000] base_lr: 7.2856e-05 lr: 7.2856e-06 eta: 4 days, 16:54:51 time: 1.5561 data_time: 0.0272 memory: 25567 grad_norm: 2.9708 loss: 1.1882 caption_loss_cls: 2.0778 detection_loss_cls: 0.0288 detection_loss_reg: 0.3325 semantic_segmentation_loss_cls: 0.0072 grounding_loss_reg: 2.3279 instance_segmentation_loss_cls: 0.0282 instance_segmentation_loss_reg: 0.3292 instance_segmentation_loss_poly: 0.8717 2024/01/08 23:52:05 - mmengine - INFO - Saving checkpoint at 376000 iterations 2024/01/09 00:05:26 - mmengine - INFO - Iter(train) [376500/640000] base_lr: 7.2620e-05 lr: 7.2620e-06 eta: 4 days, 16:44:49 time: 1.5554 data_time: 0.0274 memory: 25567 grad_norm: 3.0053 loss: 1.2034 caption_loss_cls: 2.0750 detection_loss_cls: 0.0288 detection_loss_reg: 0.3320 semantic_segmentation_loss_cls: 0.0072 grounding_loss_reg: 2.3292 instance_segmentation_loss_cls: 0.0284 instance_segmentation_loss_reg: 0.3300 instance_segmentation_loss_poly: 0.8733 2024/01/09 00:18:31 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/09 00:18:31 - mmengine - INFO - Iter(train) [377000/640000] base_lr: 7.2384e-05 lr: 7.2384e-06 eta: 4 days, 16:33:17 time: 1.5628 data_time: 0.0275 memory: 25567 grad_norm: 2.9531 loss: 1.1924 caption_loss_cls: 2.0736 detection_loss_cls: 0.0287 detection_loss_reg: 0.3311 semantic_segmentation_loss_cls: 0.0072 grounding_loss_reg: 2.3264 instance_segmentation_loss_cls: 0.0287 instance_segmentation_loss_reg: 0.3330 instance_segmentation_loss_poly: 0.8784 2024/01/09 00:31:26 - mmengine - INFO - Iter(train) [377500/640000] base_lr: 7.2149e-05 lr: 7.2149e-06 eta: 4 days, 16:20:45 time: 1.5601 data_time: 0.0274 memory: 25567 grad_norm: 2.9173 loss: 1.1868 caption_loss_cls: 2.0786 detection_loss_cls: 0.0287 detection_loss_reg: 0.3317 semantic_segmentation_loss_cls: 0.0072 grounding_loss_reg: 2.3289 instance_segmentation_loss_cls: 0.0286 instance_segmentation_loss_reg: 0.3321 instance_segmentation_loss_poly: 0.8776 2024/01/09 00:44:29 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/09 00:44:29 - mmengine - INFO - Iter(train) [378000/640000] base_lr: 7.1913e-05 lr: 7.1913e-06 eta: 4 days, 16:09:01 time: 1.5664 data_time: 0.0275 memory: 25567 grad_norm: 2.8536 loss: 1.1825 caption_loss_cls: 2.0768 detection_loss_cls: 0.0285 detection_loss_reg: 0.3302 semantic_segmentation_loss_cls: 0.0072 grounding_loss_reg: 2.3285 instance_segmentation_loss_cls: 0.0284 instance_segmentation_loss_reg: 0.3297 instance_segmentation_loss_poly: 0.8724 2024/01/09 00:44:29 - mmengine - INFO - Saving checkpoint at 378000 iterations 2024/01/09 00:57:51 - mmengine - INFO - Iter(train) [378500/640000] base_lr: 7.1677e-05 lr: 7.1677e-06 eta: 4 days, 15:58:53 time: 1.5666 data_time: 0.0276 memory: 25567 grad_norm: 2.8860 loss: 1.1866 caption_loss_cls: 2.0769 detection_loss_cls: 0.0285 detection_loss_reg: 0.3307 semantic_segmentation_loss_cls: 0.0072 grounding_loss_reg: 2.3274 instance_segmentation_loss_cls: 0.0283 instance_segmentation_loss_reg: 0.3287 instance_segmentation_loss_poly: 0.8697 2024/01/09 01:10:09 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/09 01:10:09 - mmengine - INFO - Iter(train) [379000/640000] base_lr: 7.1442e-05 lr: 7.1442e-06 eta: 4 days, 15:43:12 time: 1.5594 data_time: 0.0276 memory: 25567 grad_norm: 2.8895 loss: 1.1991 caption_loss_cls: 2.0725 detection_loss_cls: 0.0284 detection_loss_reg: 0.3291 semantic_segmentation_loss_cls: 0.0072 grounding_loss_reg: 2.3250 instance_segmentation_loss_cls: 0.0282 instance_segmentation_loss_reg: 0.3282 instance_segmentation_loss_poly: 0.8689 2024/01/09 01:22:49 - mmengine - INFO - Iter(train) [379500/640000] base_lr: 7.1207e-05 lr: 7.1207e-06 eta: 4 days, 15:29:24 time: 1.5594 data_time: 0.0277 memory: 25567 grad_norm: 2.8496 loss: 1.1965 caption_loss_cls: 2.0679 detection_loss_cls: 0.0285 detection_loss_reg: 0.3300 semantic_segmentation_loss_cls: 0.0072 grounding_loss_reg: 2.3272 instance_segmentation_loss_cls: 0.0285 instance_segmentation_loss_reg: 0.3315 instance_segmentation_loss_poly: 0.8757 2024/01/09 01:35:36 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/09 01:35:36 - mmengine - INFO - Iter(train) [380000/640000] base_lr: 7.0972e-05 lr: 7.0972e-06 eta: 4 days, 15:16:18 time: 1.5520 data_time: 0.0276 memory: 25567 grad_norm: 2.8505 loss: 1.1930 caption_loss_cls: 2.0724 detection_loss_cls: 0.0285 detection_loss_reg: 0.3300 semantic_segmentation_loss_cls: 0.0072 grounding_loss_reg: 2.3320 instance_segmentation_loss_cls: 0.0284 instance_segmentation_loss_reg: 0.3299 instance_segmentation_loss_poly: 0.8723 2024/01/09 01:35:36 - mmengine - INFO - Saving checkpoint at 380000 iterations 2024/01/09 01:47:25 - mmengine - INFO - Evaluating bbox... 2024/01/09 01:48:22 - mmengine - INFO - bbox_mAP_copypaste: 0.508 0.689 0.557 0.354 0.558 0.659 2024/01/09 01:48:22 - mmengine - INFO - Evaluating segm... 2024/01/09 01:49:36 - mmengine - INFO - segm_mAP_copypaste: 0.342 0.613 0.338 0.199 0.390 0.513 2024/01/09 01:51:44 - mmengine - INFO - Evaluating bbox... 2024/01/09 01:52:42 - mmengine - INFO - bbox_mAP_copypaste: 0.507 0.689 0.556 0.353 0.558 0.658 2024/01/09 01:58:12 - mmengine - INFO - per class results: 2024/01/09 01:58:12 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 78.4 | 90.72 | | building | 82.22 | 90.85 | | sky | 93.7 | 97.72 | | floor | 82.76 | 90.33 | | tree | 74.5 | 88.64 | | ceiling | 86.29 | 94.43 | | road | 85.58 | 91.34 | | bed | 90.49 | 95.88 | | windowpane | 63.62 | 78.59 | | grass | 68.46 | 83.12 | | cabinet | 62.31 | 73.49 | | sidewalk | 69.63 | 82.91 | | person | 81.66 | 91.18 | | earth | 34.96 | 44.98 | | door | 53.98 | 68.55 | | table | 63.86 | 79.92 | | mountain | 53.91 | 63.12 | | plant | 52.65 | 63.39 | | curtain | 75.93 | 86.7 | | chair | 62.82 | 79.07 | | car | 84.93 | 90.81 | | water | 62.14 | 74.0 | | painting | 71.4 | 85.45 | | sofa | 71.58 | 83.93 | | shelf | 47.51 | 68.8 | | house | 44.2 | 66.76 | | sea | 61.3 | 78.3 | | mirror | 67.56 | 73.47 | | rug | 66.89 | 77.93 | | field | 34.66 | 59.33 | | armchair | 50.99 | 64.9 | | seat | 64.36 | 80.59 | | fence | 46.04 | 68.46 | | desk | 52.48 | 72.07 | | rock | 58.25 | 83.09 | | wardrobe | 46.56 | 61.06 | | lamp | 65.54 | 75.88 | | bathtub | 85.1 | 89.11 | | railing | 41.28 | 58.54 | | cushion | 62.87 | 75.79 | | base | 16.44 | 20.24 | | box | 27.69 | 33.88 | | column | 57.74 | 65.85 | | signboard | 37.37 | 47.68 | | chest of drawers | 34.43 | 61.92 | | counter | 32.57 | 48.08 | | sand | 44.22 | 67.13 | | sink | 75.47 | 82.14 | | skyscraper | 46.24 | 58.57 | | fireplace | 75.87 | 86.39 | | refrigerator | 77.66 | 83.11 | | grandstand | 52.78 | 81.82 | | path | 19.06 | 28.48 | | stairs | 36.08 | 42.68 | | runway | 70.17 | 90.88 | | case | 47.6 | 61.57 | | pool table | 92.11 | 95.28 | | pillow | 61.89 | 72.75 | | screen door | 71.5 | 73.04 | | stairway | 32.19 | 40.63 | | river | 14.97 | 33.12 | | bridge | 61.29 | 80.16 | | bookcase | 38.86 | 52.92 | | blind | 40.73 | 45.24 | | coffee table | 65.13 | 78.93 | | toilet | 86.82 | 92.78 | | flower | 42.15 | 57.72 | | book | 51.25 | 72.63 | | hill | 16.75 | 23.26 | | bench | 61.52 | 70.6 | | countertop | 61.46 | 77.0 | | stove | 80.42 | 84.36 | | palm | 51.14 | 75.25 | | kitchen island | 42.66 | 70.77 | | computer | 77.17 | 87.94 | | swivel chair | 43.13 | 55.27 | | boat | 51.69 | 65.48 | | bar | 32.39 | 42.27 | | arcade machine | 53.09 | 54.71 | | hovel | 19.46 | 22.36 | | bus | 87.05 | 95.65 | | towel | 64.63 | 79.98 | | light | 52.55 | 62.12 | | truck | 43.2 | 62.73 | | tower | 34.94 | 59.18 | | chandelier | 66.56 | 76.98 | | awning | 30.4 | 43.72 | | streetlight | 31.86 | 41.17 | | booth | 34.23 | 40.94 | | television receiver | 71.81 | 83.69 | | airplane | 73.62 | 82.2 | | dirt track | 5.57 | 13.78 | | apparel | 30.63 | 44.15 | | pole | 27.02 | 39.24 | | land | 3.14 | 6.7 | | bannister | 19.05 | 27.0 | | escalator | 26.45 | 28.79 | | ottoman | 54.25 | 71.96 | | bottle | 29.63 | 39.67 | | buffet | 51.94 | 57.98 | | poster | 35.4 | 41.65 | | stage | 13.32 | 20.54 | | van | 50.74 | 69.31 | | ship | 12.39 | 15.12 | | fountain | 22.82 | 23.22 | | conveyer belt | 73.32 | 91.32 | | canopy | 36.33 | 40.04 | | washer | 70.05 | 72.54 | | plaything | 26.88 | 35.73 | | swimming pool | 70.83 | 72.65 | | stool | 43.93 | 57.92 | | barrel | 29.82 | 67.16 | | basket | 34.86 | 42.03 | | waterfall | 74.64 | 90.14 | | tent | 78.06 | 97.51 | | bag | 22.03 | 30.17 | | minibike | 73.3 | 84.76 | | cradle | 73.82 | 97.85 | | oven | 53.29 | 66.0 | | ball | 40.71 | 48.93 | | food | 49.84 | 54.47 | | step | 12.82 | 21.13 | | tank | 50.89 | 53.55 | | trade name | 32.04 | 45.35 | | microwave | 86.6 | 92.3 | | pot | 52.28 | 62.0 | | animal | 63.5 | 67.73 | | bicycle | 61.4 | 79.13 | | lake | 53.44 | 63.56 | | dishwasher | 68.62 | 85.43 | | screen | 72.32 | 88.73 | | blanket | 24.22 | 29.95 | | sculpture | 69.31 | 80.49 | | hood | 66.27 | 73.15 | | sconce | 46.74 | 62.35 | | vase | 43.0 | 58.17 | | traffic light | 37.79 | 48.82 | | tray | 16.66 | 20.19 | | ashcan | 43.52 | 60.88 | | fan | 62.6 | 72.84 | | pier | 36.13 | 48.15 | | crt screen | 9.76 | 20.1 | | plate | 58.63 | 75.91 | | monitor | 4.65 | 5.04 | | bulletin board | 43.31 | 46.28 | | shower | 4.11 | 4.73 | | radiator | 59.95 | 71.25 | | glass | 18.52 | 20.04 | | clock | 35.47 | 50.8 | | flag | 54.42 | 62.48 | +---------------------+-------+-------+ 2024/01/09 01:58:24 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.5070 coco/bbox_mAP_50: 0.6890 coco/bbox_mAP_75: 0.5560 coco/bbox_mAP_s: 0.3530 coco/bbox_mAP_m: 0.5580 coco/bbox_mAP_l: 0.6580 coco/segm_mAP: 0.3420 coco/segm_mAP_50: 0.6130 coco/segm_mAP_75: 0.3380 coco/segm_mAP_s: 0.1990 coco/segm_mAP_m: 0.3900 coco/segm_mAP_l: 0.5130 Bleu_1: 0.7601 Bleu_2: 0.5982 Bleu_3: 0.4576 Bleu_4: 0.3451 METEOR: 0.2733 ROUGE_L: 0.5621 CIDEr: 1.1176 SPICE: 0.2043 aAcc: 84.0100 mIoU: 51.4600 mAcc: 63.0300 visual-grounding/miou: 0.8217 visual-grounding/acc: 0.8835 data_time: 0.0129 time: 1.8937 2024/01/09 02:10:48 - mmengine - INFO - Iter(train) [380500/640000] base_lr: 7.0737e-05 lr: 7.0737e-06 eta: 4 days, 15:01:29 time: 1.5384 data_time: 0.0209 memory: 34658 grad_norm: 2.9921 loss: 1.1791 caption_loss_cls: 2.0726 detection_loss_cls: 0.0283 detection_loss_reg: 0.3283 semantic_segmentation_loss_cls: 0.0072 grounding_loss_reg: 2.3276 instance_segmentation_loss_cls: 0.0285 instance_segmentation_loss_reg: 0.3302 instance_segmentation_loss_poly: 0.8729 2024/01/09 02:23:47 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240108_022928 2024/01/09 02:23:47 - mmengine - INFO - Iter(train) [381000/640000] base_lr: 7.0503e-05 lr: 7.0503e-06 eta: 4 days, 14:49:20 time: 1.5369 data_time: 0.0207 memory: 25567 grad_norm: 3.0187 loss: 1.1812 caption_loss_cls: 2.0656 detection_loss_cls: 0.0285 detection_loss_reg: 0.3299 semantic_segmentation_loss_cls: 0.0072 grounding_loss_reg: 2.3281 instance_segmentation_loss_cls: 0.0284 instance_segmentation_loss_reg: 0.3304 instance_segmentation_loss_poly: 0.8738 2024/01/09 03:09:20 - mmengine - INFO - Iter(train) [381500/640000] base_lr: 7.0268e-05 lr: 7.0268e-06 eta: 4 days, 3:00:22 time: 1.4792 data_time: 0.0192 memory: 25557 grad_norm: 2.9693 loss: 1.1898 caption_loss_cls: 2.0564 detection_loss_cls: 0.0284 detection_loss_reg: 0.3298 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.3251 instance_segmentation_loss_cls: 0.0282 instance_segmentation_loss_reg: 0.3302 instance_segmentation_loss_poly: 0.8723 2024/01/09 03:21:22 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 03:21:22 - mmengine - INFO - Iter(train) [382000/640000] base_lr: 7.0034e-05 lr: 7.0034e-06 eta: 4 days, 3:58:02 time: 1.4639 data_time: 0.0190 memory: 25557 grad_norm: 3.0079 loss: 1.2043 caption_loss_cls: 2.0537 detection_loss_cls: 0.0284 detection_loss_reg: 0.3291 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.3270 instance_segmentation_loss_cls: 0.0281 instance_segmentation_loss_reg: 0.3313 instance_segmentation_loss_poly: 0.8746 2024/01/09 03:21:22 - mmengine - INFO - Saving checkpoint at 382000 iterations 2024/01/09 03:33:30 - mmengine - INFO - Iter(train) [382500/640000] base_lr: 6.9800e-05 lr: 6.9800e-06 eta: 4 days, 4:38:26 time: 1.4453 data_time: 0.0180 memory: 25557 grad_norm: 3.0488 loss: 1.2172 caption_loss_cls: 2.0524 detection_loss_cls: 0.0283 detection_loss_reg: 0.3267 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.3288 instance_segmentation_loss_cls: 0.0282 instance_segmentation_loss_reg: 0.3312 instance_segmentation_loss_poly: 0.8754 2024/01/09 03:45:26 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 03:45:26 - mmengine - INFO - Iter(train) [383000/640000] base_lr: 6.9566e-05 lr: 6.9566e-06 eta: 4 days, 4:43:25 time: 1.4397 data_time: 0.0178 memory: 25557 grad_norm: 3.0301 loss: 1.2077 caption_loss_cls: 2.0486 detection_loss_cls: 0.0283 detection_loss_reg: 0.3264 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.3297 instance_segmentation_loss_cls: 0.0283 instance_segmentation_loss_reg: 0.3318 instance_segmentation_loss_poly: 0.8778 2024/01/09 03:57:13 - mmengine - INFO - Iter(train) [383500/640000] base_lr: 6.9332e-05 lr: 6.9332e-06 eta: 4 days, 4:32:58 time: 1.4266 data_time: 0.0174 memory: 25557 grad_norm: 3.2188 loss: 1.1885 caption_loss_cls: 2.0442 detection_loss_cls: 0.0281 detection_loss_reg: 0.3251 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.3257 instance_segmentation_loss_cls: 0.0282 instance_segmentation_loss_reg: 0.3311 instance_segmentation_loss_poly: 0.8761 2024/01/09 04:09:20 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 04:09:20 - mmengine - INFO - Iter(train) [384000/640000] base_lr: 6.9099e-05 lr: 6.9099e-06 eta: 4 days, 4:44:26 time: 1.4167 data_time: 0.0171 memory: 25557 grad_norm: 3.2305 loss: 1.1771 caption_loss_cls: 2.0436 detection_loss_cls: 0.0280 detection_loss_reg: 0.3238 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.3224 instance_segmentation_loss_cls: 0.0282 instance_segmentation_loss_reg: 0.3321 instance_segmentation_loss_poly: 0.8782 2024/01/09 04:09:20 - mmengine - INFO - Saving checkpoint at 384000 iterations 2024/01/09 04:21:15 - mmengine - INFO - Iter(train) [384500/640000] base_lr: 6.8865e-05 lr: 6.8865e-06 eta: 4 days, 4:38:37 time: 1.4212 data_time: 0.0232 memory: 25557 grad_norm: 3.3663 loss: 1.1885 caption_loss_cls: 2.0456 detection_loss_cls: 0.0280 detection_loss_reg: 0.3237 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.3202 instance_segmentation_loss_cls: 0.0280 instance_segmentation_loss_reg: 0.3303 instance_segmentation_loss_poly: 0.8750 2024/01/09 04:33:25 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 04:33:25 - mmengine - INFO - Iter(train) [385000/640000] base_lr: 6.8632e-05 lr: 6.8632e-06 eta: 4 days, 4:44:17 time: 1.4345 data_time: 0.0234 memory: 25557 grad_norm: 3.3404 loss: 1.1786 caption_loss_cls: 2.0469 detection_loss_cls: 0.0280 detection_loss_reg: 0.3232 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.3199 instance_segmentation_loss_cls: 0.0283 instance_segmentation_loss_reg: 0.3334 instance_segmentation_loss_poly: 0.8805 2024/01/09 04:46:08 - mmengine - INFO - Iter(train) [385500/640000] base_lr: 6.8399e-05 lr: 6.8399e-06 eta: 4 days, 5:12:10 time: 1.4513 data_time: 0.0236 memory: 25557 grad_norm: 3.3702 loss: 1.1799 caption_loss_cls: 2.0434 detection_loss_cls: 0.0280 detection_loss_reg: 0.3225 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.3214 instance_segmentation_loss_cls: 0.0283 instance_segmentation_loss_reg: 0.3336 instance_segmentation_loss_poly: 0.8811 2024/01/09 04:57:45 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 04:57:45 - mmengine - INFO - Iter(train) [386000/640000] base_lr: 6.8167e-05 lr: 6.8167e-06 eta: 4 days, 4:46:19 time: 1.4449 data_time: 0.0235 memory: 25557 grad_norm: 3.4458 loss: 1.1722 caption_loss_cls: 2.0400 detection_loss_cls: 0.0279 detection_loss_reg: 0.3221 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.3208 instance_segmentation_loss_cls: 0.0282 instance_segmentation_loss_reg: 0.3326 instance_segmentation_loss_poly: 0.8797 2024/01/09 04:57:45 - mmengine - INFO - Saving checkpoint at 386000 iterations 2024/01/09 05:09:50 - mmengine - INFO - Iter(train) [386500/640000] base_lr: 6.7934e-05 lr: 6.7934e-06 eta: 4 days, 4:41:37 time: 1.4443 data_time: 0.0233 memory: 25557 grad_norm: 3.4259 loss: 1.1541 caption_loss_cls: 2.0382 detection_loss_cls: 0.0281 detection_loss_reg: 0.3223 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.3141 instance_segmentation_loss_cls: 0.0281 instance_segmentation_loss_reg: 0.3318 instance_segmentation_loss_poly: 0.8788 2024/01/09 05:21:51 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 05:21:51 - mmengine - INFO - Iter(train) [387000/640000] base_lr: 6.7702e-05 lr: 6.7702e-06 eta: 4 days, 4:33:11 time: 1.4457 data_time: 0.0233 memory: 25557 grad_norm: 3.4977 loss: 1.1519 caption_loss_cls: 2.0394 detection_loss_cls: 0.0279 detection_loss_reg: 0.3209 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.3168 instance_segmentation_loss_cls: 0.0279 instance_segmentation_loss_reg: 0.3294 instance_segmentation_loss_poly: 0.8733 2024/01/09 05:33:21 - mmengine - INFO - Iter(train) [387500/640000] base_lr: 6.7469e-05 lr: 6.7469e-06 eta: 4 days, 4:06:44 time: 1.4415 data_time: 0.0233 memory: 25557 grad_norm: 3.5207 loss: 1.1590 caption_loss_cls: 2.0334 detection_loss_cls: 0.0277 detection_loss_reg: 0.3191 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.3176 instance_segmentation_loss_cls: 0.0278 instance_segmentation_loss_reg: 0.3293 instance_segmentation_loss_poly: 0.8731 2024/01/09 05:45:20 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 05:45:20 - mmengine - INFO - Iter(train) [388000/640000] base_lr: 6.7237e-05 lr: 6.7237e-06 eta: 4 days, 3:57:26 time: 1.4393 data_time: 0.0233 memory: 25557 grad_norm: 3.5556 loss: 1.1686 caption_loss_cls: 2.0382 detection_loss_cls: 0.0276 detection_loss_reg: 0.3190 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.3163 instance_segmentation_loss_cls: 0.0277 instance_segmentation_loss_reg: 0.3281 instance_segmentation_loss_poly: 0.8697 2024/01/09 05:45:20 - mmengine - INFO - Saving checkpoint at 388000 iterations 2024/01/09 05:57:36 - mmengine - INFO - Iter(train) [388500/640000] base_lr: 6.7006e-05 lr: 6.7006e-06 eta: 4 days, 3:56:15 time: 1.4445 data_time: 0.0235 memory: 25557 grad_norm: 3.5313 loss: 1.1856 caption_loss_cls: 2.0462 detection_loss_cls: 0.0277 detection_loss_reg: 0.3199 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.3129 instance_segmentation_loss_cls: 0.0280 instance_segmentation_loss_reg: 0.3308 instance_segmentation_loss_poly: 0.8756 2024/01/09 06:09:30 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 06:09:30 - mmengine - INFO - Iter(train) [389000/640000] base_lr: 6.6774e-05 lr: 6.6774e-06 eta: 4 days, 3:43:34 time: 1.4405 data_time: 0.0235 memory: 25557 grad_norm: 3.6292 loss: 1.1889 caption_loss_cls: 2.0460 detection_loss_cls: 0.0278 detection_loss_reg: 0.3206 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.3098 instance_segmentation_loss_cls: 0.0276 instance_segmentation_loss_reg: 0.3278 instance_segmentation_loss_poly: 0.8698 2024/01/09 06:21:23 - mmengine - INFO - Iter(train) [389500/640000] base_lr: 6.6543e-05 lr: 6.6543e-06 eta: 4 days, 3:30:36 time: 1.4281 data_time: 0.0232 memory: 25557 grad_norm: 3.5733 loss: 1.1826 caption_loss_cls: 2.0431 detection_loss_cls: 0.0279 detection_loss_reg: 0.3210 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.3122 instance_segmentation_loss_cls: 0.0276 instance_segmentation_loss_reg: 0.3273 instance_segmentation_loss_poly: 0.8680 2024/01/09 06:33:11 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 06:33:11 - mmengine - INFO - Iter(train) [390000/640000] base_lr: 6.6311e-05 lr: 6.6311e-06 eta: 4 days, 3:15:21 time: 1.4308 data_time: 0.0233 memory: 25557 grad_norm: 3.5435 loss: 1.1908 caption_loss_cls: 2.0424 detection_loss_cls: 0.0280 detection_loss_reg: 0.3219 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.3120 instance_segmentation_loss_cls: 0.0274 instance_segmentation_loss_reg: 0.3257 instance_segmentation_loss_poly: 0.8645 2024/01/09 06:33:11 - mmengine - INFO - Saving checkpoint at 390000 iterations 2024/01/09 06:45:34 - mmengine - INFO - Iter(train) [390500/640000] base_lr: 6.6080e-05 lr: 6.6080e-06 eta: 4 days, 3:14:36 time: 1.4352 data_time: 0.0232 memory: 25557 grad_norm: 3.5023 loss: 1.1912 caption_loss_cls: 2.0407 detection_loss_cls: 0.0281 detection_loss_reg: 0.3227 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.3114 instance_segmentation_loss_cls: 0.0274 instance_segmentation_loss_reg: 0.3259 instance_segmentation_loss_poly: 0.8648 2024/01/09 06:57:18 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 06:57:18 - mmengine - INFO - Iter(train) [391000/640000] base_lr: 6.5850e-05 lr: 6.5850e-06 eta: 4 days, 2:57:59 time: 1.4309 data_time: 0.0233 memory: 25557 grad_norm: 3.4536 loss: 1.2007 caption_loss_cls: 2.0416 detection_loss_cls: 0.0282 detection_loss_reg: 0.3251 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.3141 instance_segmentation_loss_cls: 0.0277 instance_segmentation_loss_reg: 0.3285 instance_segmentation_loss_poly: 0.8703 2024/01/09 07:08:43 - mmengine - INFO - Iter(train) [391500/640000] base_lr: 6.5619e-05 lr: 6.5619e-06 eta: 4 days, 2:34:58 time: 1.4297 data_time: 0.0232 memory: 25557 grad_norm: 3.3425 loss: 1.2077 caption_loss_cls: 2.0441 detection_loss_cls: 0.0282 detection_loss_reg: 0.3248 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.3117 instance_segmentation_loss_cls: 0.0277 instance_segmentation_loss_reg: 0.3287 instance_segmentation_loss_poly: 0.8703 2024/01/09 07:20:48 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 07:20:48 - mmengine - INFO - Iter(train) [392000/640000] base_lr: 6.5389e-05 lr: 6.5389e-06 eta: 4 days, 2:26:55 time: 1.4314 data_time: 0.0232 memory: 25557 grad_norm: 3.3246 loss: 1.2081 caption_loss_cls: 2.0419 detection_loss_cls: 0.0282 detection_loss_reg: 0.3234 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.3054 instance_segmentation_loss_cls: 0.0278 instance_segmentation_loss_reg: 0.3297 instance_segmentation_loss_poly: 0.8737 2024/01/09 07:20:48 - mmengine - INFO - Saving checkpoint at 392000 iterations 2024/01/09 07:32:36 - mmengine - INFO - Iter(train) [392500/640000] base_lr: 6.5159e-05 lr: 6.5159e-06 eta: 4 days, 2:12:31 time: 1.4242 data_time: 0.0230 memory: 25557 grad_norm: 3.3062 loss: 1.1965 caption_loss_cls: 2.0454 detection_loss_cls: 0.0283 detection_loss_reg: 0.3234 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.3008 instance_segmentation_loss_cls: 0.0277 instance_segmentation_loss_reg: 0.3295 instance_segmentation_loss_poly: 0.8721 2024/01/09 07:44:36 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 07:44:36 - mmengine - INFO - Iter(train) [393000/640000] base_lr: 6.4929e-05 lr: 6.4929e-06 eta: 4 days, 2:02:23 time: 1.4257 data_time: 0.0230 memory: 25557 grad_norm: 3.2047 loss: 1.1811 caption_loss_cls: 2.0445 detection_loss_cls: 0.0283 detection_loss_reg: 0.3225 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.2991 instance_segmentation_loss_cls: 0.0277 instance_segmentation_loss_reg: 0.3292 instance_segmentation_loss_poly: 0.8706 2024/01/09 07:56:12 - mmengine - INFO - Iter(train) [393500/640000] base_lr: 6.4699e-05 lr: 6.4699e-06 eta: 4 days, 1:44:55 time: 1.4216 data_time: 0.0231 memory: 25557 grad_norm: 3.1979 loss: 1.1929 caption_loss_cls: 2.0448 detection_loss_cls: 0.0282 detection_loss_reg: 0.3220 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.3012 instance_segmentation_loss_cls: 0.0278 instance_segmentation_loss_reg: 0.3306 instance_segmentation_loss_poly: 0.8732 2024/01/09 08:07:44 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 08:07:44 - mmengine - INFO - Iter(train) [394000/640000] base_lr: 6.4469e-05 lr: 6.4469e-06 eta: 4 days, 1:26:32 time: 1.4177 data_time: 0.0230 memory: 25557 grad_norm: 3.1803 loss: 1.1901 caption_loss_cls: 2.0385 detection_loss_cls: 0.0284 detection_loss_reg: 0.3235 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.3011 instance_segmentation_loss_cls: 0.0279 instance_segmentation_loss_reg: 0.3318 instance_segmentation_loss_poly: 0.8750 2024/01/09 08:07:44 - mmengine - INFO - Saving checkpoint at 394000 iterations 2024/01/09 08:19:59 - mmengine - INFO - Iter(train) [394500/640000] base_lr: 6.4240e-05 lr: 6.4240e-06 eta: 4 days, 1:20:43 time: 1.4157 data_time: 0.0229 memory: 25557 grad_norm: 3.2582 loss: 1.1956 caption_loss_cls: 2.0339 detection_loss_cls: 0.0284 detection_loss_reg: 0.3239 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.2997 instance_segmentation_loss_cls: 0.0280 instance_segmentation_loss_reg: 0.3326 instance_segmentation_loss_poly: 0.8775 2024/01/09 08:31:55 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 08:31:55 - mmengine - INFO - Iter(train) [395000/640000] base_lr: 6.4011e-05 lr: 6.4011e-06 eta: 4 days, 1:09:26 time: 1.4188 data_time: 0.0229 memory: 25557 grad_norm: 3.2643 loss: 1.1922 caption_loss_cls: 2.0299 detection_loss_cls: 0.0285 detection_loss_reg: 0.3257 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.2996 instance_segmentation_loss_cls: 0.0281 instance_segmentation_loss_reg: 0.3336 instance_segmentation_loss_poly: 0.8802 2024/01/09 08:43:34 - mmengine - INFO - Iter(train) [395500/640000] base_lr: 6.3782e-05 lr: 6.3782e-06 eta: 4 days, 0:53:30 time: 1.4222 data_time: 0.0230 memory: 25557 grad_norm: 3.1750 loss: 1.1917 caption_loss_cls: 2.0325 detection_loss_cls: 0.0285 detection_loss_reg: 0.3244 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.3041 instance_segmentation_loss_cls: 0.0282 instance_segmentation_loss_reg: 0.3340 instance_segmentation_loss_poly: 0.8813 2024/01/09 08:55:33 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 08:55:33 - mmengine - INFO - Iter(train) [396000/640000] base_lr: 6.3553e-05 lr: 6.3553e-06 eta: 4 days, 0:42:51 time: 1.4204 data_time: 0.0229 memory: 25557 grad_norm: 3.1220 loss: 1.1782 caption_loss_cls: 2.0262 detection_loss_cls: 0.0284 detection_loss_reg: 0.3230 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.3029 instance_segmentation_loss_cls: 0.0281 instance_segmentation_loss_reg: 0.3337 instance_segmentation_loss_poly: 0.8800 2024/01/09 08:55:33 - mmengine - INFO - Saving checkpoint at 396000 iterations 2024/01/09 09:07:52 - mmengine - INFO - Iter(train) [396500/640000] base_lr: 6.3325e-05 lr: 6.3325e-06 eta: 4 days, 0:37:14 time: 1.4284 data_time: 0.0228 memory: 25557 grad_norm: 3.0696 loss: 1.1620 caption_loss_cls: 2.0208 detection_loss_cls: 0.0283 detection_loss_reg: 0.3221 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.2982 instance_segmentation_loss_cls: 0.0279 instance_segmentation_loss_reg: 0.3330 instance_segmentation_loss_poly: 0.8769 2024/01/09 09:19:30 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 09:19:30 - mmengine - INFO - Iter(train) [397000/640000] base_lr: 6.3097e-05 lr: 6.3097e-06 eta: 4 days, 0:21:16 time: 1.4228 data_time: 0.0228 memory: 25557 grad_norm: 3.0855 loss: 1.1725 caption_loss_cls: 2.0160 detection_loss_cls: 0.0285 detection_loss_reg: 0.3244 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.3007 instance_segmentation_loss_cls: 0.0278 instance_segmentation_loss_reg: 0.3320 instance_segmentation_loss_poly: 0.8746 2024/01/09 09:31:03 - mmengine - INFO - Iter(train) [397500/640000] base_lr: 6.2869e-05 lr: 6.2869e-06 eta: 4 days, 0:04:42 time: 1.4221 data_time: 0.0228 memory: 25557 grad_norm: 3.1132 loss: 1.1654 caption_loss_cls: 2.0163 detection_loss_cls: 0.0285 detection_loss_reg: 0.3241 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.2981 instance_segmentation_loss_cls: 0.0276 instance_segmentation_loss_reg: 0.3302 instance_segmentation_loss_poly: 0.8700 2024/01/09 09:42:59 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 09:42:59 - mmengine - INFO - Iter(train) [398000/640000] base_lr: 6.2641e-05 lr: 6.2641e-06 eta: 3 days, 23:53:14 time: 1.4279 data_time: 0.0228 memory: 25557 grad_norm: 3.0822 loss: 1.1561 caption_loss_cls: 2.0125 detection_loss_cls: 0.0286 detection_loss_reg: 0.3254 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.2951 instance_segmentation_loss_cls: 0.0277 instance_segmentation_loss_reg: 0.3316 instance_segmentation_loss_poly: 0.8736 2024/01/09 09:42:59 - mmengine - INFO - Saving checkpoint at 398000 iterations 2024/01/09 09:55:10 - mmengine - INFO - Iter(train) [398500/640000] base_lr: 6.2413e-05 lr: 6.2413e-06 eta: 3 days, 23:45:14 time: 1.4271 data_time: 0.0228 memory: 25557 grad_norm: 3.0423 loss: 1.1548 caption_loss_cls: 2.0119 detection_loss_cls: 0.0287 detection_loss_reg: 0.3250 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.2882 instance_segmentation_loss_cls: 0.0278 instance_segmentation_loss_reg: 0.3335 instance_segmentation_loss_poly: 0.8775 2024/01/09 10:06:39 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 10:06:39 - mmengine - INFO - Iter(train) [399000/640000] base_lr: 6.2186e-05 lr: 6.2186e-06 eta: 3 days, 23:28:04 time: 1.4203 data_time: 0.0227 memory: 25557 grad_norm: 3.0545 loss: 1.1573 caption_loss_cls: 2.0183 detection_loss_cls: 0.0286 detection_loss_reg: 0.3247 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.2833 instance_segmentation_loss_cls: 0.0280 instance_segmentation_loss_reg: 0.3337 instance_segmentation_loss_poly: 0.8794 2024/01/09 10:18:58 - mmengine - INFO - Iter(train) [399500/640000] base_lr: 6.1959e-05 lr: 6.1959e-06 eta: 3 days, 23:21:20 time: 1.4302 data_time: 0.0229 memory: 25557 grad_norm: 3.0280 loss: 1.1552 caption_loss_cls: 2.0196 detection_loss_cls: 0.0287 detection_loss_reg: 0.3259 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.2814 instance_segmentation_loss_cls: 0.0279 instance_segmentation_loss_reg: 0.3340 instance_segmentation_loss_poly: 0.8793 2024/01/09 10:30:35 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 10:30:35 - mmengine - INFO - Iter(train) [400000/640000] base_lr: 6.1732e-05 lr: 6.1732e-06 eta: 3 days, 23:06:05 time: 1.4249 data_time: 0.0229 memory: 25557 grad_norm: 3.0856 loss: 1.1765 caption_loss_cls: 2.0234 detection_loss_cls: 0.0287 detection_loss_reg: 0.3257 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.2751 instance_segmentation_loss_cls: 0.0278 instance_segmentation_loss_reg: 0.3344 instance_segmentation_loss_poly: 0.8795 2024/01/09 10:30:35 - mmengine - INFO - Saving checkpoint at 400000 iterations 2024/01/09 10:42:00 - mmengine - INFO - Evaluating bbox... 2024/01/09 10:42:58 - mmengine - INFO - bbox_mAP_copypaste: 0.514 0.696 0.564 0.360 0.561 0.666 2024/01/09 10:42:58 - mmengine - INFO - Evaluating segm... 2024/01/09 10:44:13 - mmengine - INFO - segm_mAP_copypaste: 0.341 0.609 0.334 0.194 0.387 0.531 2024/01/09 10:46:22 - mmengine - INFO - Evaluating bbox... 2024/01/09 10:47:21 - mmengine - INFO - bbox_mAP_copypaste: 0.513 0.696 0.563 0.360 0.561 0.667 2024/01/09 10:53:34 - mmengine - INFO - per class results: 2024/01/09 10:53:34 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 79.22 | 88.99 | | building | 82.52 | 91.71 | | sky | 93.13 | 98.29 | | floor | 82.6 | 90.75 | | tree | 73.77 | 86.39 | | ceiling | 85.35 | 94.72 | | road | 83.46 | 88.76 | | bed | 89.76 | 95.12 | | windowpane | 64.05 | 78.28 | | grass | 69.31 | 83.76 | | cabinet | 64.18 | 79.87 | | sidewalk | 66.48 | 80.94 | | person | 81.45 | 91.68 | | earth | 40.35 | 57.9 | | door | 55.91 | 71.93 | | table | 64.46 | 80.66 | | mountain | 59.84 | 82.52 | | plant | 52.14 | 61.61 | | curtain | 77.64 | 88.66 | | chair | 62.48 | 76.44 | | car | 85.41 | 92.16 | | water | 63.04 | 79.97 | | painting | 73.67 | 89.63 | | sofa | 72.43 | 84.68 | | shelf | 49.3 | 73.84 | | house | 43.64 | 60.44 | | sea | 62.21 | 74.61 | | mirror | 68.11 | 78.68 | | rug | 63.49 | 70.58 | | field | 34.7 | 43.51 | | armchair | 52.38 | 73.31 | | seat | 66.98 | 80.96 | | fence | 47.53 | 64.9 | | desk | 54.39 | 70.59 | | rock | 35.46 | 43.87 | | wardrobe | 44.82 | 65.77 | | lamp | 67.11 | 79.45 | | bathtub | 73.06 | 87.74 | | railing | 41.57 | 58.3 | | cushion | 62.02 | 74.11 | | base | 26.92 | 36.9 | | box | 27.37 | 35.87 | | column | 55.54 | 69.73 | | signboard | 38.91 | 53.1 | | chest of drawers | 37.2 | 43.34 | | counter | 31.39 | 45.65 | | sand | 46.05 | 62.03 | | sink | 77.98 | 87.25 | | skyscraper | 48.19 | 63.17 | | fireplace | 73.62 | 86.4 | | refrigerator | 75.77 | 84.5 | | grandstand | 52.41 | 80.86 | | path | 24.61 | 35.55 | | stairs | 38.08 | 46.07 | | runway | 72.69 | 93.17 | | case | 50.2 | 59.84 | | pool table | 91.83 | 94.96 | | pillow | 58.11 | 65.58 | | screen door | 80.98 | 85.08 | | stairway | 30.7 | 40.01 | | river | 16.8 | 32.03 | | bridge | 63.69 | 80.6 | | bookcase | 34.58 | 37.96 | | blind | 41.58 | 47.12 | | coffee table | 65.22 | 80.38 | | toilet | 85.69 | 93.49 | | flower | 40.21 | 54.36 | | book | 49.34 | 71.52 | | hill | 17.32 | 22.81 | | bench | 61.64 | 71.66 | | countertop | 63.48 | 75.69 | | stove | 80.12 | 84.29 | | palm | 48.47 | 69.68 | | kitchen island | 43.37 | 64.39 | | computer | 75.59 | 88.67 | | swivel chair | 45.99 | 59.83 | | boat | 55.5 | 70.58 | | bar | 38.33 | 48.0 | | arcade machine | 69.38 | 72.54 | | hovel | 20.32 | 23.29 | | bus | 86.67 | 95.9 | | towel | 66.04 | 79.54 | | light | 50.9 | 58.6 | | truck | 45.13 | 58.82 | | tower | 35.39 | 61.16 | | chandelier | 64.02 | 71.01 | | awning | 25.58 | 32.7 | | streetlight | 32.84 | 41.06 | | booth | 45.24 | 55.56 | | television receiver | 71.77 | 80.87 | | airplane | 68.32 | 73.77 | | dirt track | 7.5 | 15.86 | | apparel | 29.41 | 45.13 | | pole | 30.66 | 47.49 | | land | 0.09 | 0.13 | | bannister | 19.75 | 26.57 | | escalator | 29.79 | 33.31 | | ottoman | 54.28 | 71.69 | | bottle | 28.15 | 35.3 | | buffet | 52.99 | 55.73 | | poster | 37.07 | 48.87 | | stage | 9.49 | 15.82 | | van | 44.12 | 60.28 | | ship | 8.08 | 10.04 | | fountain | 23.18 | 23.6 | | conveyer belt | 83.05 | 89.53 | | canopy | 33.19 | 45.93 | | washer | 69.57 | 71.78 | | plaything | 32.21 | 40.92 | | swimming pool | 68.67 | 71.21 | | stool | 46.29 | 59.74 | | barrel | 31.87 | 69.22 | | basket | 33.05 | 41.63 | | waterfall | 75.29 | 86.42 | | tent | 70.08 | 97.68 | | bag | 20.27 | 26.88 | | minibike | 75.03 | 86.94 | | cradle | 82.1 | 96.34 | | oven | 53.34 | 62.16 | | ball | 31.53 | 35.72 | | food | 50.28 | 54.83 | | step | 6.64 | 7.64 | | tank | 37.5 | 40.77 | | trade name | 25.2 | 28.55 | | microwave | 85.76 | 95.4 | | pot | 50.59 | 58.1 | | animal | 64.71 | 69.1 | | bicycle | 60.37 | 78.81 | | lake | 60.66 | 60.97 | | dishwasher | 72.21 | 80.44 | | screen | 71.96 | 89.84 | | blanket | 27.91 | 34.42 | | sculpture | 60.25 | 79.67 | | hood | 60.44 | 71.86 | | sconce | 47.38 | 55.8 | | vase | 42.49 | 65.3 | | traffic light | 40.17 | 52.32 | | tray | 17.72 | 30.13 | | ashcan | 43.37 | 55.36 | | fan | 65.1 | 81.8 | | pier | 38.13 | 44.94 | | crt screen | 5.05 | 10.82 | | plate | 58.04 | 78.06 | | monitor | 13.07 | 15.19 | | bulletin board | 54.0 | 63.44 | | shower | 4.93 | 9.4 | | radiator | 63.28 | 72.26 | | glass | 18.16 | 20.39 | | clock | 38.4 | 44.54 | | flag | 44.22 | 49.23 | +---------------------+-------+-------+ 2024/01/09 10:53:48 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.5130 coco/bbox_mAP_50: 0.6960 coco/bbox_mAP_75: 0.5630 coco/bbox_mAP_s: 0.3600 coco/bbox_mAP_m: 0.5610 coco/bbox_mAP_l: 0.6670 coco/segm_mAP: 0.3410 coco/segm_mAP_50: 0.6090 coco/segm_mAP_75: 0.3340 coco/segm_mAP_s: 0.1940 coco/segm_mAP_m: 0.3870 coco/segm_mAP_l: 0.5310 Bleu_1: 0.7671 Bleu_2: 0.6017 Bleu_3: 0.4569 Bleu_4: 0.3422 METEOR: 0.2730 ROUGE_L: 0.5624 CIDEr: 1.1178 SPICE: 0.2070 aAcc: 84.1700 mIoU: 51.6300 mAcc: 62.7600 visual-grounding/miou: 0.8209 visual-grounding/acc: 0.8813 data_time: 0.0269 time: 1.9098 2024/01/09 11:05:30 - mmengine - INFO - Iter(train) [400500/640000] base_lr: 6.1505e-05 lr: 6.1505e-06 eta: 3 days, 22:52:29 time: 1.4163 data_time: 0.0176 memory: 34649 grad_norm: 3.0896 loss: 1.1828 caption_loss_cls: 2.0317 detection_loss_cls: 0.0288 detection_loss_reg: 0.3257 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.2724 instance_segmentation_loss_cls: 0.0277 instance_segmentation_loss_reg: 0.3335 instance_segmentation_loss_poly: 0.8780 2024/01/09 11:16:48 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 11:16:48 - mmengine - INFO - Iter(train) [401000/640000] base_lr: 6.1279e-05 lr: 6.1279e-06 eta: 3 days, 22:33:55 time: 1.4114 data_time: 0.0178 memory: 25557 grad_norm: 3.1307 loss: 1.1936 caption_loss_cls: 2.0331 detection_loss_cls: 0.0286 detection_loss_reg: 0.3245 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.2719 instance_segmentation_loss_cls: 0.0278 instance_segmentation_loss_reg: 0.3338 instance_segmentation_loss_poly: 0.8794 2024/01/09 11:28:28 - mmengine - INFO - Iter(train) [401500/640000] base_lr: 6.1053e-05 lr: 6.1053e-06 eta: 3 days, 22:19:39 time: 1.4129 data_time: 0.0180 memory: 25557 grad_norm: 3.0987 loss: 1.1839 caption_loss_cls: 2.0361 detection_loss_cls: 0.0284 detection_loss_reg: 0.3227 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.2656 instance_segmentation_loss_cls: 0.0278 instance_segmentation_loss_reg: 0.3332 instance_segmentation_loss_poly: 0.8781 2024/01/09 11:40:28 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 11:40:28 - mmengine - INFO - Iter(train) [402000/640000] base_lr: 6.0827e-05 lr: 6.0827e-06 eta: 3 days, 22:09:14 time: 1.4141 data_time: 0.0186 memory: 25557 grad_norm: 3.0682 loss: 1.1778 caption_loss_cls: 2.0360 detection_loss_cls: 0.0284 detection_loss_reg: 0.3241 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.2687 instance_segmentation_loss_cls: 0.0278 instance_segmentation_loss_reg: 0.3321 instance_segmentation_loss_poly: 0.8753 2024/01/09 11:40:28 - mmengine - INFO - Saving checkpoint at 402000 iterations 2024/01/09 11:52:44 - mmengine - INFO - Iter(train) [402500/640000] base_lr: 6.0601e-05 lr: 6.0601e-06 eta: 3 days, 22:01:26 time: 1.4151 data_time: 0.0203 memory: 25557 grad_norm: 3.0349 loss: 1.1667 caption_loss_cls: 2.0320 detection_loss_cls: 0.0283 detection_loss_reg: 0.3239 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.2661 instance_segmentation_loss_cls: 0.0278 instance_segmentation_loss_reg: 0.3311 instance_segmentation_loss_poly: 0.8729 2024/01/09 12:03:54 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 12:03:54 - mmengine - INFO - Iter(train) [403000/640000] base_lr: 6.0376e-05 lr: 6.0376e-06 eta: 3 days, 21:42:08 time: 1.4103 data_time: 0.0204 memory: 25557 grad_norm: 3.0130 loss: 1.1617 caption_loss_cls: 2.0319 detection_loss_cls: 0.0284 detection_loss_reg: 0.3239 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.2617 instance_segmentation_loss_cls: 0.0276 instance_segmentation_loss_reg: 0.3299 instance_segmentation_loss_poly: 0.8712 2024/01/09 12:15:14 - mmengine - INFO - Iter(train) [403500/640000] base_lr: 6.0151e-05 lr: 6.0151e-06 eta: 3 days, 21:24:57 time: 1.3957 data_time: 0.0203 memory: 25557 grad_norm: 3.0605 loss: 1.1650 caption_loss_cls: 2.0357 detection_loss_cls: 0.0282 detection_loss_reg: 0.3229 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.2631 instance_segmentation_loss_cls: 0.0275 instance_segmentation_loss_reg: 0.3295 instance_segmentation_loss_poly: 0.8707 2024/01/09 12:26:38 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 12:26:38 - mmengine - INFO - Iter(train) [404000/640000] base_lr: 5.9926e-05 lr: 5.9926e-06 eta: 3 days, 21:08:41 time: 1.3925 data_time: 0.0206 memory: 25557 grad_norm: 3.0675 loss: 1.1590 caption_loss_cls: 2.0244 detection_loss_cls: 0.0284 detection_loss_reg: 0.3238 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.2610 instance_segmentation_loss_cls: 0.0276 instance_segmentation_loss_reg: 0.3303 instance_segmentation_loss_poly: 0.8732 2024/01/09 12:26:38 - mmengine - INFO - Saving checkpoint at 404000 iterations 2024/01/09 12:38:43 - mmengine - INFO - Iter(train) [404500/640000] base_lr: 5.9701e-05 lr: 5.9701e-06 eta: 3 days, 20:59:08 time: 1.3976 data_time: 0.0272 memory: 25557 grad_norm: 3.0626 loss: 1.1540 caption_loss_cls: 2.0249 detection_loss_cls: 0.0281 detection_loss_reg: 0.3218 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.2613 instance_segmentation_loss_cls: 0.0275 instance_segmentation_loss_reg: 0.3300 instance_segmentation_loss_poly: 0.8717 2024/01/09 12:50:57 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 12:50:57 - mmengine - INFO - Iter(train) [405000/640000] base_lr: 5.9476e-05 lr: 5.9476e-06 eta: 3 days, 20:50:53 time: 1.4115 data_time: 0.0272 memory: 25557 grad_norm: 2.9998 loss: 1.1387 caption_loss_cls: 2.0228 detection_loss_cls: 0.0282 detection_loss_reg: 0.3228 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.2626 instance_segmentation_loss_cls: 0.0275 instance_segmentation_loss_reg: 0.3309 instance_segmentation_loss_poly: 0.8738 2024/01/09 13:02:07 - mmengine - INFO - Iter(train) [405500/640000] base_lr: 5.9252e-05 lr: 5.9252e-06 eta: 3 days, 20:32:33 time: 1.4039 data_time: 0.0272 memory: 25557 grad_norm: 3.0580 loss: 1.1560 caption_loss_cls: 2.0235 detection_loss_cls: 0.0280 detection_loss_reg: 0.3224 semantic_segmentation_loss_cls: 0.0071 grounding_loss_reg: 2.2656 instance_segmentation_loss_cls: 0.0275 instance_segmentation_loss_reg: 0.3303 instance_segmentation_loss_poly: 0.8732 2024/01/09 13:14:02 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 13:14:02 - mmengine - INFO - Iter(train) [406000/640000] base_lr: 5.9028e-05 lr: 5.9028e-06 eta: 3 days, 20:21:27 time: 1.4028 data_time: 0.0267 memory: 25557 grad_norm: 3.1564 loss: 1.1762 caption_loss_cls: 2.0252 detection_loss_cls: 0.0280 detection_loss_reg: 0.3213 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.2641 instance_segmentation_loss_cls: 0.0275 instance_segmentation_loss_reg: 0.3296 instance_segmentation_loss_poly: 0.8717 2024/01/09 13:14:02 - mmengine - INFO - Saving checkpoint at 406000 iterations 2024/01/09 13:25:48 - mmengine - INFO - Iter(train) [406500/640000] base_lr: 5.8804e-05 lr: 5.8804e-06 eta: 3 days, 20:08:54 time: 1.3954 data_time: 0.0261 memory: 25557 grad_norm: 3.2003 loss: 1.1874 caption_loss_cls: 2.0333 detection_loss_cls: 0.0280 detection_loss_reg: 0.3226 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.2659 instance_segmentation_loss_cls: 0.0274 instance_segmentation_loss_reg: 0.3278 instance_segmentation_loss_poly: 0.8680 2024/01/09 13:37:29 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 13:37:29 - mmengine - INFO - Iter(train) [407000/640000] base_lr: 5.8581e-05 lr: 5.8581e-06 eta: 3 days, 19:55:40 time: 1.4032 data_time: 0.0263 memory: 25557 grad_norm: 3.2182 loss: 1.1827 caption_loss_cls: 2.0319 detection_loss_cls: 0.0280 detection_loss_reg: 0.3227 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.2657 instance_segmentation_loss_cls: 0.0275 instance_segmentation_loss_reg: 0.3287 instance_segmentation_loss_poly: 0.8690 2024/01/09 13:48:36 - mmengine - INFO - Iter(train) [407500/640000] base_lr: 5.8357e-05 lr: 5.8357e-06 eta: 3 days, 19:37:36 time: 1.3997 data_time: 0.0264 memory: 25557 grad_norm: 3.2326 loss: 1.1926 caption_loss_cls: 2.0282 detection_loss_cls: 0.0280 detection_loss_reg: 0.3243 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.2643 instance_segmentation_loss_cls: 0.0276 instance_segmentation_loss_reg: 0.3299 instance_segmentation_loss_poly: 0.8725 2024/01/09 14:00:16 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 14:00:16 - mmengine - INFO - Iter(train) [408000/640000] base_lr: 5.8134e-05 lr: 5.8134e-06 eta: 3 days, 19:24:26 time: 1.4036 data_time: 0.0264 memory: 25557 grad_norm: 3.2247 loss: 1.1920 caption_loss_cls: 2.0293 detection_loss_cls: 0.0281 detection_loss_reg: 0.3252 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.2618 instance_segmentation_loss_cls: 0.0277 instance_segmentation_loss_reg: 0.3296 instance_segmentation_loss_poly: 0.8713 2024/01/09 14:00:16 - mmengine - INFO - Saving checkpoint at 408000 iterations 2024/01/09 14:12:44 - mmengine - INFO - Iter(train) [408500/640000] base_lr: 5.7912e-05 lr: 5.7912e-06 eta: 3 days, 19:17:50 time: 1.4094 data_time: 0.0264 memory: 25557 grad_norm: 3.1964 loss: 1.1952 caption_loss_cls: 2.0334 detection_loss_cls: 0.0280 detection_loss_reg: 0.3232 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.2575 instance_segmentation_loss_cls: 0.0276 instance_segmentation_loss_reg: 0.3296 instance_segmentation_loss_poly: 0.8701 2024/01/09 14:24:12 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 14:24:12 - mmengine - INFO - Iter(train) [409000/640000] base_lr: 5.7689e-05 lr: 5.7689e-06 eta: 3 days, 19:03:03 time: 1.3979 data_time: 0.0264 memory: 25557 grad_norm: 3.2551 loss: 1.2126 caption_loss_cls: 2.0359 detection_loss_cls: 0.0281 detection_loss_reg: 0.3246 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.2582 instance_segmentation_loss_cls: 0.0276 instance_segmentation_loss_reg: 0.3302 instance_segmentation_loss_poly: 0.8712 2024/01/09 14:35:51 - mmengine - INFO - Iter(train) [409500/640000] base_lr: 5.7467e-05 lr: 5.7467e-06 eta: 3 days, 18:49:49 time: 1.4053 data_time: 0.0265 memory: 25557 grad_norm: 3.2271 loss: 1.2035 caption_loss_cls: 2.0319 detection_loss_cls: 0.0280 detection_loss_reg: 0.3238 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.2584 instance_segmentation_loss_cls: 0.0277 instance_segmentation_loss_reg: 0.3313 instance_segmentation_loss_poly: 0.8736 2024/01/09 14:48:17 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 14:48:17 - mmengine - INFO - Iter(train) [410000/640000] base_lr: 5.7245e-05 lr: 5.7245e-06 eta: 3 days, 18:42:35 time: 1.4128 data_time: 0.0268 memory: 25557 grad_norm: 3.1446 loss: 1.1863 caption_loss_cls: 2.0287 detection_loss_cls: 0.0279 detection_loss_reg: 0.3233 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.2555 instance_segmentation_loss_cls: 0.0277 instance_segmentation_loss_reg: 0.3318 instance_segmentation_loss_poly: 0.8741 2024/01/09 14:48:17 - mmengine - INFO - Saving checkpoint at 410000 iterations 2024/01/09 15:00:02 - mmengine - INFO - Iter(train) [410500/640000] base_lr: 5.7023e-05 lr: 5.7023e-06 eta: 3 days, 18:30:09 time: 1.4126 data_time: 0.0268 memory: 25557 grad_norm: 3.1895 loss: 1.1854 caption_loss_cls: 2.0285 detection_loss_cls: 0.0276 detection_loss_reg: 0.3208 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.2536 instance_segmentation_loss_cls: 0.0276 instance_segmentation_loss_reg: 0.3319 instance_segmentation_loss_poly: 0.8742 2024/01/09 15:11:49 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 15:11:49 - mmengine - INFO - Iter(train) [411000/640000] base_lr: 5.6802e-05 lr: 5.6802e-06 eta: 3 days, 18:17:52 time: 1.4140 data_time: 0.0268 memory: 25557 grad_norm: 3.1643 loss: 1.1852 caption_loss_cls: 2.0202 detection_loss_cls: 0.0277 detection_loss_reg: 0.3207 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.2523 instance_segmentation_loss_cls: 0.0275 instance_segmentation_loss_reg: 0.3314 instance_segmentation_loss_poly: 0.8732 2024/01/09 15:22:53 - mmengine - INFO - Iter(train) [411500/640000] base_lr: 5.6580e-05 lr: 5.6580e-06 eta: 3 days, 18:00:34 time: 1.4136 data_time: 0.0266 memory: 25557 grad_norm: 3.1586 loss: 1.1687 caption_loss_cls: 2.0215 detection_loss_cls: 0.0275 detection_loss_reg: 0.3184 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.2513 instance_segmentation_loss_cls: 0.0273 instance_segmentation_loss_reg: 0.3298 instance_segmentation_loss_poly: 0.8696 2024/01/09 15:35:06 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 15:35:06 - mmengine - INFO - Iter(train) [412000/640000] base_lr: 5.6360e-05 lr: 5.6360e-06 eta: 3 days, 17:51:28 time: 1.4217 data_time: 0.0267 memory: 25557 grad_norm: 3.1085 loss: 1.1561 caption_loss_cls: 2.0179 detection_loss_cls: 0.0276 detection_loss_reg: 0.3199 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.2509 instance_segmentation_loss_cls: 0.0272 instance_segmentation_loss_reg: 0.3292 instance_segmentation_loss_poly: 0.8682 2024/01/09 15:35:06 - mmengine - INFO - Saving checkpoint at 412000 iterations 2024/01/09 15:47:02 - mmengine - INFO - Iter(train) [412500/640000] base_lr: 5.6139e-05 lr: 5.6139e-06 eta: 3 days, 17:40:27 time: 1.4139 data_time: 0.0266 memory: 25557 grad_norm: 3.1523 loss: 1.1551 caption_loss_cls: 2.0116 detection_loss_cls: 0.0275 detection_loss_reg: 0.3196 semantic_segmentation_loss_cls: 0.0070 grounding_loss_reg: 2.2508 instance_segmentation_loss_cls: 0.0271 instance_segmentation_loss_reg: 0.3278 instance_segmentation_loss_poly: 0.8657 2024/01/09 15:58:29 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 15:58:29 - mmengine - INFO - Iter(train) [413000/640000] base_lr: 5.5918e-05 lr: 5.5918e-06 eta: 3 days, 17:25:55 time: 1.4134 data_time: 0.0266 memory: 25557 grad_norm: 3.1127 loss: 1.1511 caption_loss_cls: 2.0099 detection_loss_cls: 0.0276 detection_loss_reg: 0.3217 semantic_segmentation_loss_cls: 0.0069 grounding_loss_reg: 2.2501 instance_segmentation_loss_cls: 0.0272 instance_segmentation_loss_reg: 0.3274 instance_segmentation_loss_poly: 0.8657 2024/01/09 16:10:21 - mmengine - INFO - Iter(train) [413500/640000] base_lr: 5.5698e-05 lr: 5.5698e-06 eta: 3 days, 17:14:26 time: 1.4168 data_time: 0.0265 memory: 25557 grad_norm: 3.0913 loss: 1.1467 caption_loss_cls: 2.0152 detection_loss_cls: 0.0277 detection_loss_reg: 0.3226 semantic_segmentation_loss_cls: 0.0069 grounding_loss_reg: 2.2485 instance_segmentation_loss_cls: 0.0269 instance_segmentation_loss_reg: 0.3257 instance_segmentation_loss_poly: 0.8632 2024/01/09 16:21:47 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 16:21:47 - mmengine - INFO - Iter(train) [414000/640000] base_lr: 5.5478e-05 lr: 5.5478e-06 eta: 3 days, 16:59:58 time: 1.4018 data_time: 0.0262 memory: 25557 grad_norm: 3.1245 loss: 1.1445 caption_loss_cls: 2.0134 detection_loss_cls: 0.0276 detection_loss_reg: 0.3209 semantic_segmentation_loss_cls: 0.0069 grounding_loss_reg: 2.2453 instance_segmentation_loss_cls: 0.0268 instance_segmentation_loss_reg: 0.3238 instance_segmentation_loss_poly: 0.8599 2024/01/09 16:21:47 - mmengine - INFO - Saving checkpoint at 414000 iterations 2024/01/09 16:33:39 - mmengine - INFO - Iter(train) [414500/640000] base_lr: 5.5259e-05 lr: 5.5259e-06 eta: 3 days, 16:48:28 time: 1.4035 data_time: 0.0264 memory: 25557 grad_norm: 3.0815 loss: 1.1555 caption_loss_cls: 2.0183 detection_loss_cls: 0.0275 detection_loss_reg: 0.3199 semantic_segmentation_loss_cls: 0.0069 grounding_loss_reg: 2.2427 instance_segmentation_loss_cls: 0.0270 instance_segmentation_loss_reg: 0.3248 instance_segmentation_loss_poly: 0.8617 2024/01/09 16:45:09 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 16:45:09 - mmengine - INFO - Iter(train) [415000/640000] base_lr: 5.5039e-05 lr: 5.5039e-06 eta: 3 days, 16:34:35 time: 1.3994 data_time: 0.0263 memory: 25557 grad_norm: 3.1136 loss: 1.1521 caption_loss_cls: 2.0192 detection_loss_cls: 0.0274 detection_loss_reg: 0.3201 semantic_segmentation_loss_cls: 0.0069 grounding_loss_reg: 2.2393 instance_segmentation_loss_cls: 0.0269 instance_segmentation_loss_reg: 0.3235 instance_segmentation_loss_poly: 0.8582 2024/01/09 16:56:39 - mmengine - INFO - Iter(train) [415500/640000] base_lr: 5.4820e-05 lr: 5.4820e-06 eta: 3 days, 16:20:47 time: 1.4057 data_time: 0.0263 memory: 25557 grad_norm: 3.0994 loss: 1.1424 caption_loss_cls: 2.0230 detection_loss_cls: 0.0273 detection_loss_reg: 0.3184 semantic_segmentation_loss_cls: 0.0069 grounding_loss_reg: 2.2310 instance_segmentation_loss_cls: 0.0267 instance_segmentation_loss_reg: 0.3223 instance_segmentation_loss_poly: 0.8549 2024/01/09 17:08:13 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 17:08:13 - mmengine - INFO - Iter(train) [416000/640000] base_lr: 5.4601e-05 lr: 5.4601e-06 eta: 3 days, 16:07:25 time: 1.3960 data_time: 0.0262 memory: 25557 grad_norm: 3.1423 loss: 1.1532 caption_loss_cls: 2.0228 detection_loss_cls: 0.0275 detection_loss_reg: 0.3216 semantic_segmentation_loss_cls: 0.0069 grounding_loss_reg: 2.2307 instance_segmentation_loss_cls: 0.0266 instance_segmentation_loss_reg: 0.3226 instance_segmentation_loss_poly: 0.8557 2024/01/09 17:08:13 - mmengine - INFO - Saving checkpoint at 416000 iterations 2024/01/09 17:20:22 - mmengine - INFO - Iter(train) [416500/640000] base_lr: 5.4383e-05 lr: 5.4383e-06 eta: 3 days, 15:57:44 time: 1.3992 data_time: 0.0263 memory: 25557 grad_norm: 3.1551 loss: 1.1653 caption_loss_cls: 2.0233 detection_loss_cls: 0.0275 detection_loss_reg: 0.3224 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2328 instance_segmentation_loss_cls: 0.0267 instance_segmentation_loss_reg: 0.3238 instance_segmentation_loss_poly: 0.8586 2024/01/09 17:32:20 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 17:32:20 - mmengine - INFO - Iter(train) [417000/640000] base_lr: 5.4165e-05 lr: 5.4165e-06 eta: 3 days, 15:46:49 time: 1.4070 data_time: 0.0263 memory: 25557 grad_norm: 3.1322 loss: 1.1531 caption_loss_cls: 2.0236 detection_loss_cls: 0.0275 detection_loss_reg: 0.3231 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2302 instance_segmentation_loss_cls: 0.0268 instance_segmentation_loss_reg: 0.3244 instance_segmentation_loss_poly: 0.8612 2024/01/09 17:43:48 - mmengine - INFO - Iter(train) [417500/640000] base_lr: 5.3947e-05 lr: 5.3947e-06 eta: 3 days, 15:32:58 time: 1.4009 data_time: 0.0263 memory: 25557 grad_norm: 3.2003 loss: 1.1570 caption_loss_cls: 2.0312 detection_loss_cls: 0.0273 detection_loss_reg: 0.3215 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2297 instance_segmentation_loss_cls: 0.0265 instance_segmentation_loss_reg: 0.3225 instance_segmentation_loss_poly: 0.8567 2024/01/09 17:55:09 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 17:55:09 - mmengine - INFO - Iter(train) [418000/640000] base_lr: 5.3729e-05 lr: 5.3729e-06 eta: 3 days, 15:18:27 time: 1.3997 data_time: 0.0264 memory: 25557 grad_norm: 3.1909 loss: 1.1693 caption_loss_cls: 2.0353 detection_loss_cls: 0.0274 detection_loss_reg: 0.3217 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2323 instance_segmentation_loss_cls: 0.0266 instance_segmentation_loss_reg: 0.3228 instance_segmentation_loss_poly: 0.8576 2024/01/09 17:55:09 - mmengine - INFO - Saving checkpoint at 418000 iterations 2024/01/09 18:07:31 - mmengine - INFO - Iter(train) [418500/640000] base_lr: 5.3511e-05 lr: 5.3511e-06 eta: 3 days, 15:09:52 time: 1.4071 data_time: 0.0264 memory: 25557 grad_norm: 3.1853 loss: 1.1594 caption_loss_cls: 2.0303 detection_loss_cls: 0.0273 detection_loss_reg: 0.3219 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2396 instance_segmentation_loss_cls: 0.0267 instance_segmentation_loss_reg: 0.3227 instance_segmentation_loss_poly: 0.8564 2024/01/09 18:18:42 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 18:18:42 - mmengine - INFO - Iter(train) [419000/640000] base_lr: 5.3294e-05 lr: 5.3294e-06 eta: 3 days, 14:54:29 time: 1.4023 data_time: 0.0264 memory: 25557 grad_norm: 3.2132 loss: 1.1647 caption_loss_cls: 2.0252 detection_loss_cls: 0.0273 detection_loss_reg: 0.3213 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2355 instance_segmentation_loss_cls: 0.0267 instance_segmentation_loss_reg: 0.3221 instance_segmentation_loss_poly: 0.8532 2024/01/09 18:30:31 - mmengine - INFO - Iter(train) [419500/640000] base_lr: 5.3077e-05 lr: 5.3077e-06 eta: 3 days, 14:42:51 time: 1.4073 data_time: 0.0266 memory: 25557 grad_norm: 3.1873 loss: 1.1574 caption_loss_cls: 2.0197 detection_loss_cls: 0.0271 detection_loss_reg: 0.3204 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2291 instance_segmentation_loss_cls: 0.0266 instance_segmentation_loss_reg: 0.3205 instance_segmentation_loss_poly: 0.8490 2024/01/09 18:42:16 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 18:42:16 - mmengine - INFO - Iter(train) [420000/640000] base_lr: 5.2861e-05 lr: 5.2861e-06 eta: 3 days, 14:30:43 time: 1.4100 data_time: 0.0266 memory: 25557 grad_norm: 3.1951 loss: 1.1604 caption_loss_cls: 2.0131 detection_loss_cls: 0.0271 detection_loss_reg: 0.3195 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2352 instance_segmentation_loss_cls: 0.0266 instance_segmentation_loss_reg: 0.3202 instance_segmentation_loss_poly: 0.8488 2024/01/09 18:42:16 - mmengine - INFO - Saving checkpoint at 420000 iterations 2024/01/09 18:54:03 - mmengine - INFO - Evaluating bbox... 2024/01/09 18:55:01 - mmengine - INFO - bbox_mAP_copypaste: 0.515 0.699 0.563 0.360 0.561 0.664 2024/01/09 18:55:01 - mmengine - INFO - Evaluating segm... 2024/01/09 18:56:14 - mmengine - INFO - segm_mAP_copypaste: 0.345 0.616 0.343 0.203 0.391 0.530 2024/01/09 18:58:23 - mmengine - INFO - Evaluating bbox... 2024/01/09 18:59:21 - mmengine - INFO - bbox_mAP_copypaste: 0.513 0.696 0.561 0.352 0.559 0.663 2024/01/09 19:04:09 - mmengine - INFO - per class results: 2024/01/09 19:04:09 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 79.13 | 89.57 | | building | 82.17 | 90.32 | | sky | 93.37 | 97.79 | | floor | 82.15 | 90.65 | | tree | 74.47 | 88.4 | | ceiling | 85.56 | 94.08 | | road | 84.46 | 91.94 | | bed | 90.47 | 96.55 | | windowpane | 64.27 | 80.26 | | grass | 66.87 | 79.45 | | cabinet | 63.26 | 75.39 | | sidewalk | 67.88 | 76.21 | | person | 81.25 | 91.17 | | earth | 36.63 | 50.25 | | door | 57.27 | 70.64 | | table | 64.83 | 81.52 | | mountain | 60.07 | 77.99 | | plant | 53.17 | 63.52 | | curtain | 77.23 | 88.29 | | chair | 62.04 | 75.15 | | car | 84.34 | 92.35 | | water | 62.63 | 76.43 | | painting | 74.95 | 87.9 | | sofa | 71.25 | 82.26 | | shelf | 46.75 | 66.07 | | house | 47.81 | 66.63 | | sea | 63.3 | 74.32 | | mirror | 68.19 | 77.21 | | rug | 63.35 | 73.33 | | field | 30.54 | 57.2 | | armchair | 51.52 | 73.62 | | seat | 67.49 | 81.72 | | fence | 48.26 | 62.88 | | desk | 54.81 | 70.21 | | rock | 50.9 | 72.06 | | wardrobe | 46.61 | 66.71 | | lamp | 67.46 | 79.69 | | bathtub | 80.93 | 88.92 | | railing | 40.96 | 57.95 | | cushion | 63.1 | 74.88 | | base | 17.26 | 23.46 | | box | 29.01 | 37.37 | | column | 55.28 | 64.16 | | signboard | 39.58 | 51.99 | | chest of drawers | 35.86 | 51.11 | | counter | 23.4 | 30.12 | | sand | 42.96 | 55.96 | | sink | 75.99 | 85.86 | | skyscraper | 47.69 | 60.37 | | fireplace | 75.86 | 87.86 | | refrigerator | 76.29 | 82.77 | | grandstand | 46.08 | 80.16 | | path | 19.06 | 29.24 | | stairs | 40.04 | 47.79 | | runway | 74.2 | 94.94 | | case | 50.78 | 71.08 | | pool table | 92.05 | 95.83 | | pillow | 59.12 | 68.67 | | screen door | 78.19 | 80.64 | | stairway | 32.92 | 41.73 | | river | 16.27 | 37.23 | | bridge | 67.31 | 79.35 | | bookcase | 40.22 | 61.62 | | blind | 39.44 | 45.59 | | coffee table | 64.16 | 81.92 | | toilet | 89.2 | 93.64 | | flower | 40.96 | 58.79 | | book | 48.54 | 64.86 | | hill | 17.42 | 25.89 | | bench | 63.79 | 73.05 | | countertop | 65.07 | 76.85 | | stove | 80.51 | 86.78 | | palm | 47.37 | 67.37 | | kitchen island | 39.86 | 64.92 | | computer | 78.27 | 88.33 | | swivel chair | 45.78 | 64.11 | | boat | 65.73 | 89.01 | | bar | 37.43 | 49.98 | | arcade machine | 68.28 | 71.0 | | hovel | 45.09 | 53.98 | | bus | 86.6 | 95.48 | | towel | 69.56 | 81.57 | | light | 54.09 | 63.93 | | truck | 45.19 | 63.43 | | tower | 36.72 | 61.1 | | chandelier | 67.49 | 77.58 | | awning | 30.79 | 42.01 | | streetlight | 34.52 | 51.16 | | booth | 43.87 | 60.88 | | television receiver | 71.5 | 87.82 | | airplane | 70.55 | 85.4 | | dirt track | 8.76 | 30.82 | | apparel | 31.52 | 47.94 | | pole | 24.26 | 34.2 | | land | 1.73 | 2.39 | | bannister | 15.39 | 20.04 | | escalator | 19.81 | 21.71 | | ottoman | 53.07 | 71.92 | | bottle | 26.81 | 36.32 | | buffet | 55.12 | 66.72 | | poster | 33.95 | 40.42 | | stage | 11.37 | 20.19 | | van | 37.91 | 48.34 | | ship | 30.66 | 32.92 | | fountain | 20.62 | 20.94 | | conveyer belt | 52.12 | 92.35 | | canopy | 34.45 | 46.99 | | washer | 70.47 | 72.87 | | plaything | 35.39 | 51.5 | | swimming pool | 67.15 | 68.82 | | stool | 45.13 | 62.2 | | barrel | 18.29 | 67.68 | | basket | 34.7 | 45.15 | | waterfall | 73.95 | 86.28 | | tent | 74.86 | 97.37 | | bag | 24.53 | 35.42 | | minibike | 73.72 | 85.26 | | cradle | 81.98 | 96.3 | | oven | 49.59 | 58.54 | | ball | 46.6 | 55.69 | | food | 58.48 | 67.09 | | step | 9.12 | 11.62 | | tank | 49.54 | 53.49 | | trade name | 25.23 | 30.14 | | microwave | 87.22 | 92.95 | | pot | 51.27 | 59.56 | | animal | 65.15 | 68.79 | | bicycle | 58.11 | 72.4 | | lake | 55.87 | 62.27 | | dishwasher | 70.37 | 86.41 | | screen | 58.7 | 70.01 | | blanket | 29.25 | 34.98 | | sculpture | 69.21 | 82.22 | | hood | 62.35 | 73.14 | | sconce | 49.57 | 59.58 | | vase | 43.92 | 63.16 | | traffic light | 40.84 | 57.23 | | tray | 19.11 | 27.56 | | ashcan | 41.99 | 55.79 | | fan | 64.5 | 76.1 | | pier | 36.31 | 39.32 | | crt screen | 10.05 | 34.12 | | plate | 58.02 | 72.84 | | monitor | 10.26 | 12.24 | | bulletin board | 54.72 | 68.11 | | shower | 4.45 | 5.94 | | radiator | 59.38 | 68.22 | | glass | 15.63 | 16.58 | | clock | 40.86 | 51.83 | | flag | 52.49 | 60.16 | +---------------------+-------+-------+ 2024/01/09 19:04:21 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.5130 coco/bbox_mAP_50: 0.6960 coco/bbox_mAP_75: 0.5610 coco/bbox_mAP_s: 0.3520 coco/bbox_mAP_m: 0.5590 coco/bbox_mAP_l: 0.6630 coco/segm_mAP: 0.3450 coco/segm_mAP_50: 0.6160 coco/segm_mAP_75: 0.3430 coco/segm_mAP_s: 0.2030 coco/segm_mAP_m: 0.3910 coco/segm_mAP_l: 0.5300 Bleu_1: 0.7638 Bleu_2: 0.6002 Bleu_3: 0.4573 Bleu_4: 0.3449 METEOR: 0.2760 ROUGE_L: 0.5620 CIDEr: 1.1311 SPICE: 0.2061 aAcc: 84.0900 mIoU: 52.0100 mAcc: 64.1900 visual-grounding/miou: 0.8211 visual-grounding/acc: 0.8801 data_time: 0.0120 time: 1.8949 2024/01/09 19:15:49 - mmengine - INFO - Iter(train) [420500/640000] base_lr: 5.2644e-05 lr: 5.2644e-06 eta: 3 days, 14:17:19 time: 1.4003 data_time: 0.0202 memory: 34647 grad_norm: 3.2060 loss: 1.1534 caption_loss_cls: 2.0123 detection_loss_cls: 0.0269 detection_loss_reg: 0.3182 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2386 instance_segmentation_loss_cls: 0.0265 instance_segmentation_loss_reg: 0.3210 instance_segmentation_loss_poly: 0.8497 2024/01/09 19:27:14 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 19:27:14 - mmengine - INFO - Iter(train) [421000/640000] base_lr: 5.2428e-05 lr: 5.2428e-06 eta: 3 days, 14:03:29 time: 1.3922 data_time: 0.0201 memory: 25554 grad_norm: 3.2652 loss: 1.1576 caption_loss_cls: 2.0094 detection_loss_cls: 0.0269 detection_loss_reg: 0.3186 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2347 instance_segmentation_loss_cls: 0.0264 instance_segmentation_loss_reg: 0.3210 instance_segmentation_loss_poly: 0.8499 2024/01/09 19:38:44 - mmengine - INFO - Iter(train) [421500/640000] base_lr: 5.2213e-05 lr: 5.2213e-06 eta: 3 days, 13:50:08 time: 1.3926 data_time: 0.0201 memory: 25554 grad_norm: 3.2280 loss: 1.1543 caption_loss_cls: 2.0104 detection_loss_cls: 0.0270 detection_loss_reg: 0.3206 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2358 instance_segmentation_loss_cls: 0.0265 instance_segmentation_loss_reg: 0.3214 instance_segmentation_loss_poly: 0.8504 2024/01/09 19:50:07 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 19:50:07 - mmengine - INFO - Iter(train) [422000/640000] base_lr: 5.1997e-05 lr: 5.1997e-06 eta: 3 days, 13:36:14 time: 1.3932 data_time: 0.0202 memory: 25554 grad_norm: 3.2435 loss: 1.1557 caption_loss_cls: 2.0106 detection_loss_cls: 0.0272 detection_loss_reg: 0.3212 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2369 instance_segmentation_loss_cls: 0.0264 instance_segmentation_loss_reg: 0.3217 instance_segmentation_loss_poly: 0.8510 2024/01/09 19:50:07 - mmengine - INFO - Saving checkpoint at 422000 iterations 2024/01/09 20:02:12 - mmengine - INFO - Iter(train) [422500/640000] base_lr: 5.1782e-05 lr: 5.1782e-06 eta: 3 days, 13:25:58 time: 1.3890 data_time: 0.0202 memory: 25554 grad_norm: 3.2380 loss: 1.1481 caption_loss_cls: 2.0098 detection_loss_cls: 0.0273 detection_loss_reg: 0.3210 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2347 instance_segmentation_loss_cls: 0.0264 instance_segmentation_loss_reg: 0.3209 instance_segmentation_loss_poly: 0.8505 2024/01/09 20:13:59 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 20:13:59 - mmengine - INFO - Iter(train) [423000/640000] base_lr: 5.1567e-05 lr: 5.1567e-06 eta: 3 days, 13:14:10 time: 1.3980 data_time: 0.0203 memory: 25554 grad_norm: 3.2294 loss: 1.1510 caption_loss_cls: 2.0033 detection_loss_cls: 0.0273 detection_loss_reg: 0.3219 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2333 instance_segmentation_loss_cls: 0.0264 instance_segmentation_loss_reg: 0.3209 instance_segmentation_loss_poly: 0.8505 2024/01/09 20:25:43 - mmengine - INFO - Iter(train) [423500/640000] base_lr: 5.1353e-05 lr: 5.1353e-06 eta: 3 days, 13:02:03 time: 1.3964 data_time: 0.0203 memory: 25554 grad_norm: 3.3065 loss: 1.1524 caption_loss_cls: 2.0014 detection_loss_cls: 0.0271 detection_loss_reg: 0.3207 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2323 instance_segmentation_loss_cls: 0.0264 instance_segmentation_loss_reg: 0.3212 instance_segmentation_loss_poly: 0.8504 2024/01/09 20:37:30 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 20:37:30 - mmengine - INFO - Iter(train) [424000/640000] base_lr: 5.1138e-05 lr: 5.1138e-06 eta: 3 days, 12:50:12 time: 1.3969 data_time: 0.0203 memory: 25554 grad_norm: 3.2971 loss: 1.1495 caption_loss_cls: 2.0078 detection_loss_cls: 0.0272 detection_loss_reg: 0.3227 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2343 instance_segmentation_loss_cls: 0.0263 instance_segmentation_loss_reg: 0.3205 instance_segmentation_loss_poly: 0.8489 2024/01/09 20:37:30 - mmengine - INFO - Saving checkpoint at 424000 iterations 2024/01/09 20:49:52 - mmengine - INFO - Iter(train) [424500/640000] base_lr: 5.0924e-05 lr: 5.0924e-06 eta: 3 days, 12:41:17 time: 1.4100 data_time: 0.0268 memory: 25554 grad_norm: 3.2456 loss: 1.1393 caption_loss_cls: 2.0109 detection_loss_cls: 0.0271 detection_loss_reg: 0.3219 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2312 instance_segmentation_loss_cls: 0.0264 instance_segmentation_loss_reg: 0.3200 instance_segmentation_loss_poly: 0.8491 2024/01/09 21:01:25 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 21:01:25 - mmengine - INFO - Iter(train) [425000/640000] base_lr: 5.0711e-05 lr: 5.0711e-06 eta: 3 days, 12:28:15 time: 1.4118 data_time: 0.0270 memory: 25554 grad_norm: 3.2583 loss: 1.1439 caption_loss_cls: 2.0098 detection_loss_cls: 0.0271 detection_loss_reg: 0.3217 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2271 instance_segmentation_loss_cls: 0.0263 instance_segmentation_loss_reg: 0.3200 instance_segmentation_loss_poly: 0.8504 2024/01/09 21:13:25 - mmengine - INFO - Iter(train) [425500/640000] base_lr: 5.0497e-05 lr: 5.0497e-06 eta: 3 days, 12:17:28 time: 1.4194 data_time: 0.0271 memory: 25554 grad_norm: 3.2345 loss: 1.1436 caption_loss_cls: 2.0110 detection_loss_cls: 0.0269 detection_loss_reg: 0.3210 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2225 instance_segmentation_loss_cls: 0.0264 instance_segmentation_loss_reg: 0.3202 instance_segmentation_loss_poly: 0.8513 2024/01/09 21:25:02 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 21:25:02 - mmengine - INFO - Iter(train) [426000/640000] base_lr: 5.0284e-05 lr: 5.0284e-06 eta: 3 days, 12:04:52 time: 1.4229 data_time: 0.0271 memory: 25554 grad_norm: 3.2213 loss: 1.1418 caption_loss_cls: 2.0076 detection_loss_cls: 0.0269 detection_loss_reg: 0.3213 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2223 instance_segmentation_loss_cls: 0.0263 instance_segmentation_loss_reg: 0.3200 instance_segmentation_loss_poly: 0.8505 2024/01/09 21:25:02 - mmengine - INFO - Saving checkpoint at 426000 iterations 2024/01/09 21:37:25 - mmengine - INFO - Iter(train) [426500/640000] base_lr: 5.0071e-05 lr: 5.0071e-06 eta: 3 days, 11:55:47 time: 1.4274 data_time: 0.0271 memory: 25554 grad_norm: 3.2186 loss: 1.1445 caption_loss_cls: 2.0143 detection_loss_cls: 0.0269 detection_loss_reg: 0.3225 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2236 instance_segmentation_loss_cls: 0.0264 instance_segmentation_loss_reg: 0.3223 instance_segmentation_loss_poly: 0.8561 2024/01/09 21:49:07 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 21:49:07 - mmengine - INFO - Iter(train) [427000/640000] base_lr: 4.9859e-05 lr: 4.9859e-06 eta: 3 days, 11:43:32 time: 1.4261 data_time: 0.0271 memory: 25554 grad_norm: 3.1697 loss: 1.1419 caption_loss_cls: 2.0196 detection_loss_cls: 0.0268 detection_loss_reg: 0.3206 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2215 instance_segmentation_loss_cls: 0.0264 instance_segmentation_loss_reg: 0.3222 instance_segmentation_loss_poly: 0.8555 2024/01/09 22:00:43 - mmengine - INFO - Iter(train) [427500/640000] base_lr: 4.9647e-05 lr: 4.9647e-06 eta: 3 days, 11:30:52 time: 1.4243 data_time: 0.0272 memory: 25554 grad_norm: 3.1409 loss: 1.1559 caption_loss_cls: 2.0119 detection_loss_cls: 0.0266 detection_loss_reg: 0.3188 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2212 instance_segmentation_loss_cls: 0.0263 instance_segmentation_loss_reg: 0.3215 instance_segmentation_loss_poly: 0.8518 2024/01/09 22:12:55 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 22:12:55 - mmengine - INFO - Iter(train) [428000/640000] base_lr: 4.9435e-05 lr: 4.9435e-06 eta: 3 days, 11:20:49 time: 1.4305 data_time: 0.0273 memory: 25554 grad_norm: 3.1508 loss: 1.1549 caption_loss_cls: 2.0057 detection_loss_cls: 0.0266 detection_loss_reg: 0.3185 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2161 instance_segmentation_loss_cls: 0.0264 instance_segmentation_loss_reg: 0.3228 instance_segmentation_loss_poly: 0.8558 2024/01/09 22:12:55 - mmengine - INFO - Saving checkpoint at 428000 iterations 2024/01/09 22:25:07 - mmengine - INFO - Iter(train) [428500/640000] base_lr: 4.9223e-05 lr: 4.9223e-06 eta: 3 days, 11:10:49 time: 1.4280 data_time: 0.0272 memory: 25554 grad_norm: 3.2060 loss: 1.1606 caption_loss_cls: 1.9964 detection_loss_cls: 0.0264 detection_loss_reg: 0.3167 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2176 instance_segmentation_loss_cls: 0.0264 instance_segmentation_loss_reg: 0.3218 instance_segmentation_loss_poly: 0.8547 2024/01/09 22:37:08 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 22:37:08 - mmengine - INFO - Iter(train) [429000/640000] base_lr: 4.9012e-05 lr: 4.9012e-06 eta: 3 days, 10:59:53 time: 1.4350 data_time: 0.0272 memory: 25554 grad_norm: 3.1438 loss: 1.1511 caption_loss_cls: 1.9912 detection_loss_cls: 0.0263 detection_loss_reg: 0.3166 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2177 instance_segmentation_loss_cls: 0.0263 instance_segmentation_loss_reg: 0.3210 instance_segmentation_loss_poly: 0.8536 2024/01/09 22:49:00 - mmengine - INFO - Iter(train) [429500/640000] base_lr: 4.8801e-05 lr: 4.8801e-06 eta: 3 days, 10:48:19 time: 1.4329 data_time: 0.0271 memory: 25554 grad_norm: 3.0994 loss: 1.1357 caption_loss_cls: 1.9902 detection_loss_cls: 0.0260 detection_loss_reg: 0.3143 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2134 instance_segmentation_loss_cls: 0.0264 instance_segmentation_loss_reg: 0.3211 instance_segmentation_loss_poly: 0.8545 2024/01/09 23:01:02 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 23:01:02 - mmengine - INFO - Iter(train) [430000/640000] base_lr: 4.8590e-05 lr: 4.8590e-06 eta: 3 days, 10:37:30 time: 1.4392 data_time: 0.0272 memory: 25554 grad_norm: 3.1073 loss: 1.1273 caption_loss_cls: 1.9943 detection_loss_cls: 0.0261 detection_loss_reg: 0.3147 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2105 instance_segmentation_loss_cls: 0.0264 instance_segmentation_loss_reg: 0.3212 instance_segmentation_loss_poly: 0.8555 2024/01/09 23:01:02 - mmengine - INFO - Saving checkpoint at 430000 iterations 2024/01/09 23:13:18 - mmengine - INFO - Iter(train) [430500/640000] base_lr: 4.8380e-05 lr: 4.8380e-06 eta: 3 days, 10:27:36 time: 1.4375 data_time: 0.0273 memory: 25554 grad_norm: 3.1249 loss: 1.1290 caption_loss_cls: 1.9932 detection_loss_cls: 0.0261 detection_loss_reg: 0.3136 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2087 instance_segmentation_loss_cls: 0.0265 instance_segmentation_loss_reg: 0.3212 instance_segmentation_loss_poly: 0.8558 2024/01/09 23:24:29 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 23:24:29 - mmengine - INFO - Iter(train) [431000/640000] base_lr: 4.8170e-05 lr: 4.8170e-06 eta: 3 days, 10:13:11 time: 1.4297 data_time: 0.0272 memory: 25554 grad_norm: 3.1871 loss: 1.1379 caption_loss_cls: 2.0005 detection_loss_cls: 0.0261 detection_loss_reg: 0.3149 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2067 instance_segmentation_loss_cls: 0.0266 instance_segmentation_loss_reg: 0.3219 instance_segmentation_loss_poly: 0.8571 2024/01/09 23:36:04 - mmengine - INFO - Iter(train) [431500/640000] base_lr: 4.7960e-05 lr: 4.7960e-06 eta: 3 days, 10:00:27 time: 1.4293 data_time: 0.0272 memory: 25554 grad_norm: 3.2098 loss: 1.1307 caption_loss_cls: 1.9946 detection_loss_cls: 0.0261 detection_loss_reg: 0.3148 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2051 instance_segmentation_loss_cls: 0.0266 instance_segmentation_loss_reg: 0.3231 instance_segmentation_loss_poly: 0.8595 2024/01/09 23:47:50 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/09 23:47:50 - mmengine - INFO - Iter(train) [432000/640000] base_lr: 4.7751e-05 lr: 4.7751e-06 eta: 3 days, 9:48:32 time: 1.4231 data_time: 0.0271 memory: 25554 grad_norm: 3.2366 loss: 1.1298 caption_loss_cls: 1.9996 detection_loss_cls: 0.0261 detection_loss_reg: 0.3149 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2057 instance_segmentation_loss_cls: 0.0267 instance_segmentation_loss_reg: 0.3233 instance_segmentation_loss_poly: 0.8592 2024/01/09 23:47:50 - mmengine - INFO - Saving checkpoint at 432000 iterations 2024/01/09 23:59:43 - mmengine - INFO - Iter(train) [432500/640000] base_lr: 4.7541e-05 lr: 4.7541e-06 eta: 3 days, 9:37:01 time: 1.4181 data_time: 0.0272 memory: 25554 grad_norm: 3.2250 loss: 1.1278 caption_loss_cls: 2.0019 detection_loss_cls: 0.0259 detection_loss_reg: 0.3140 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1989 instance_segmentation_loss_cls: 0.0267 instance_segmentation_loss_reg: 0.3245 instance_segmentation_loss_poly: 0.8618 2024/01/10 00:11:42 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/10 00:11:42 - mmengine - INFO - Iter(train) [433000/640000] base_lr: 4.7333e-05 lr: 4.7333e-06 eta: 3 days, 9:25:56 time: 1.4179 data_time: 0.0272 memory: 25554 grad_norm: 3.2560 loss: 1.1394 caption_loss_cls: 2.0082 detection_loss_cls: 0.0259 detection_loss_reg: 0.3140 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.2002 instance_segmentation_loss_cls: 0.0267 instance_segmentation_loss_reg: 0.3243 instance_segmentation_loss_poly: 0.8594 2024/01/10 00:23:22 - mmengine - INFO - Iter(train) [433500/640000] base_lr: 4.7124e-05 lr: 4.7124e-06 eta: 3 days, 9:13:34 time: 1.4148 data_time: 0.0272 memory: 25554 grad_norm: 3.3186 loss: 1.1493 caption_loss_cls: 2.0097 detection_loss_cls: 0.0262 detection_loss_reg: 0.3164 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1960 instance_segmentation_loss_cls: 0.0266 instance_segmentation_loss_reg: 0.3230 instance_segmentation_loss_poly: 0.8561 2024/01/10 00:35:09 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/10 00:35:09 - mmengine - INFO - Iter(train) [434000/640000] base_lr: 4.6916e-05 lr: 4.6916e-06 eta: 3 days, 9:01:41 time: 1.4110 data_time: 0.0271 memory: 25554 grad_norm: 3.3647 loss: 1.1431 caption_loss_cls: 2.0100 detection_loss_cls: 0.0261 detection_loss_reg: 0.3166 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1951 instance_segmentation_loss_cls: 0.0266 instance_segmentation_loss_reg: 0.3244 instance_segmentation_loss_poly: 0.8589 2024/01/10 00:35:09 - mmengine - INFO - Saving checkpoint at 434000 iterations 2024/01/10 00:47:34 - mmengine - INFO - Iter(train) [434500/640000] base_lr: 4.6708e-05 lr: 4.6708e-06 eta: 3 days, 8:52:09 time: 1.4130 data_time: 0.0270 memory: 25554 grad_norm: 3.3472 loss: 1.1317 caption_loss_cls: 2.0049 detection_loss_cls: 0.0260 detection_loss_reg: 0.3160 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1945 instance_segmentation_loss_cls: 0.0263 instance_segmentation_loss_reg: 0.3232 instance_segmentation_loss_poly: 0.8563 2024/01/10 00:59:03 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/10 00:59:03 - mmengine - INFO - Iter(train) [435000/640000] base_lr: 4.6501e-05 lr: 4.6501e-06 eta: 3 days, 8:39:08 time: 1.4177 data_time: 0.0270 memory: 25554 grad_norm: 3.3379 loss: 1.1209 caption_loss_cls: 2.0022 detection_loss_cls: 0.0259 detection_loss_reg: 0.3155 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1935 instance_segmentation_loss_cls: 0.0264 instance_segmentation_loss_reg: 0.3235 instance_segmentation_loss_poly: 0.8567 2024/01/10 01:10:37 - mmengine - INFO - Iter(train) [435500/640000] base_lr: 4.6293e-05 lr: 4.6293e-06 eta: 3 days, 8:26:28 time: 1.4176 data_time: 0.0269 memory: 25554 grad_norm: 3.3591 loss: 1.1249 caption_loss_cls: 2.0028 detection_loss_cls: 0.0259 detection_loss_reg: 0.3163 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1953 instance_segmentation_loss_cls: 0.0264 instance_segmentation_loss_reg: 0.3235 instance_segmentation_loss_poly: 0.8570 2024/01/10 01:22:32 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/10 01:22:32 - mmengine - INFO - Iter(train) [436000/640000] base_lr: 4.6087e-05 lr: 4.6087e-06 eta: 3 days, 8:15:02 time: 1.4196 data_time: 0.0269 memory: 25554 grad_norm: 3.3798 loss: 1.1268 caption_loss_cls: 2.0028 detection_loss_cls: 0.0258 detection_loss_reg: 0.3147 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1932 instance_segmentation_loss_cls: 0.0265 instance_segmentation_loss_reg: 0.3239 instance_segmentation_loss_poly: 0.8581 2024/01/10 01:22:32 - mmengine - INFO - Saving checkpoint at 436000 iterations 2024/01/10 01:34:24 - mmengine - INFO - Iter(train) [436500/640000] base_lr: 4.5880e-05 lr: 4.5880e-06 eta: 3 days, 8:03:29 time: 1.4196 data_time: 0.0271 memory: 25554 grad_norm: 3.4230 loss: 1.1403 caption_loss_cls: 2.0019 detection_loss_cls: 0.0258 detection_loss_reg: 0.3132 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1913 instance_segmentation_loss_cls: 0.0264 instance_segmentation_loss_reg: 0.3228 instance_segmentation_loss_poly: 0.8553 2024/01/10 01:46:10 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/10 01:46:10 - mmengine - INFO - Iter(train) [437000/640000] base_lr: 4.5674e-05 lr: 4.5674e-06 eta: 3 days, 7:51:31 time: 1.4162 data_time: 0.0271 memory: 25554 grad_norm: 3.4005 loss: 1.1210 caption_loss_cls: 2.0008 detection_loss_cls: 0.0257 detection_loss_reg: 0.3112 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1907 instance_segmentation_loss_cls: 0.0263 instance_segmentation_loss_reg: 0.3223 instance_segmentation_loss_poly: 0.8542 2024/01/10 01:57:48 - mmengine - INFO - Iter(train) [437500/640000] base_lr: 4.5468e-05 lr: 4.5468e-06 eta: 3 days, 7:39:06 time: 1.4158 data_time: 0.0271 memory: 25554 grad_norm: 3.3921 loss: 1.1252 caption_loss_cls: 1.9990 detection_loss_cls: 0.0255 detection_loss_reg: 0.3106 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1869 instance_segmentation_loss_cls: 0.0265 instance_segmentation_loss_reg: 0.3238 instance_segmentation_loss_poly: 0.8594 2024/01/10 02:09:29 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240109_023115 2024/01/10 02:09:29 - mmengine - INFO - Iter(train) [438000/640000] base_lr: 4.5262e-05 lr: 4.5262e-06 eta: 3 days, 7:26:50 time: 1.4141 data_time: 0.0271 memory: 25554 grad_norm: 3.3881 loss: 1.1367 caption_loss_cls: 2.0027 detection_loss_cls: 0.0253 detection_loss_reg: 0.3087 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1840 instance_segmentation_loss_cls: 0.0264 instance_segmentation_loss_reg: 0.3217 instance_segmentation_loss_poly: 0.8555 2024/01/10 02:09:29 - mmengine - INFO - Saving checkpoint at 438000 iterations 2024/01/10 02:21:21 - mmengine - INFO - Iter(train) [438500/640000] base_lr: 4.5057e-05 lr: 4.5057e-06 eta: 3 days, 7:15:15 time: 1.4060 data_time: 0.0271 memory: 25554 grad_norm: 3.4750 loss: 1.1451 caption_loss_cls: 1.9993 detection_loss_cls: 0.0253 detection_loss_reg: 0.3100 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1758 instance_segmentation_loss_cls: 0.0264 instance_segmentation_loss_reg: 0.3218 instance_segmentation_loss_poly: 0.8572 2024/01/10 02:58:33 - mmengine - INFO - Iter(train) [439000/640000] base_lr: 4.4852e-05 lr: 4.4852e-06 eta: 3 days, 3:50:41 time: 1.3955 data_time: 0.0195 memory: 25567 grad_norm: 3.3964 loss: 1.1441 caption_loss_cls: 2.0033 detection_loss_cls: 0.0253 detection_loss_reg: 0.3104 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1803 instance_segmentation_loss_cls: 0.0261 instance_segmentation_loss_reg: 0.3197 instance_segmentation_loss_poly: 0.8524 2024/01/10 03:10:08 - mmengine - INFO - Iter(train) [439500/640000] base_lr: 4.4648e-05 lr: 4.4648e-06 eta: 3 days, 4:14:03 time: 1.3957 data_time: 0.0193 memory: 25567 grad_norm: 3.3690 loss: 1.1381 caption_loss_cls: 2.0044 detection_loss_cls: 0.0254 detection_loss_reg: 0.3111 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1791 instance_segmentation_loss_cls: 0.0260 instance_segmentation_loss_reg: 0.3198 instance_segmentation_loss_poly: 0.8524 2024/01/10 03:22:01 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 03:22:01 - mmengine - INFO - Iter(train) [440000/640000] base_lr: 4.4443e-05 lr: 4.4443e-06 eta: 3 days, 4:48:35 time: 1.3951 data_time: 0.0190 memory: 25567 grad_norm: 3.3758 loss: 1.1316 caption_loss_cls: 1.9987 detection_loss_cls: 0.0254 detection_loss_reg: 0.3103 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1732 instance_segmentation_loss_cls: 0.0258 instance_segmentation_loss_reg: 0.3172 instance_segmentation_loss_poly: 0.8471 2024/01/10 03:22:01 - mmengine - INFO - Saving checkpoint at 440000 iterations 2024/01/10 03:33:33 - mmengine - INFO - Evaluating bbox... 2024/01/10 03:34:30 - mmengine - INFO - bbox_mAP_copypaste: 0.515 0.697 0.562 0.356 0.564 0.670 2024/01/10 03:34:30 - mmengine - INFO - Evaluating segm... 2024/01/10 03:35:46 - mmengine - INFO - segm_mAP_copypaste: 0.346 0.614 0.342 0.198 0.395 0.534 2024/01/10 03:37:55 - mmengine - INFO - Evaluating bbox... 2024/01/10 03:38:53 - mmengine - INFO - bbox_mAP_copypaste: 0.513 0.696 0.561 0.353 0.562 0.668 2024/01/10 03:45:23 - mmengine - INFO - per class results: 2024/01/10 03:45:23 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 79.18 | 89.61 | | building | 82.77 | 92.41 | | sky | 93.43 | 98.03 | | floor | 82.22 | 90.31 | | tree | 73.78 | 87.32 | | ceiling | 85.71 | 94.33 | | road | 84.45 | 92.07 | | bed | 90.61 | 95.85 | | windowpane | 63.88 | 79.17 | | grass | 67.79 | 87.24 | | cabinet | 62.45 | 73.83 | | sidewalk | 67.43 | 76.02 | | person | 81.86 | 91.6 | | earth | 36.09 | 44.24 | | door | 55.98 | 72.56 | | table | 63.65 | 79.36 | | mountain | 59.25 | 78.96 | | plant | 50.05 | 59.44 | | curtain | 76.76 | 89.36 | | chair | 63.0 | 78.33 | | car | 84.75 | 92.2 | | water | 62.84 | 77.45 | | painting | 73.76 | 87.86 | | sofa | 71.48 | 83.21 | | shelf | 47.74 | 69.0 | | house | 38.54 | 50.37 | | sea | 65.62 | 80.46 | | mirror | 68.93 | 80.0 | | rug | 65.93 | 77.27 | | field | 35.92 | 53.59 | | armchair | 52.44 | 68.26 | | seat | 67.23 | 81.32 | | fence | 46.92 | 62.63 | | desk | 52.06 | 66.56 | | rock | 40.02 | 54.38 | | wardrobe | 44.17 | 65.28 | | lamp | 65.69 | 79.15 | | bathtub | 78.39 | 87.63 | | railing | 41.68 | 58.63 | | cushion | 64.43 | 78.23 | | base | 16.9 | 24.89 | | box | 29.44 | 37.83 | | column | 57.13 | 69.14 | | signboard | 38.8 | 50.6 | | chest of drawers | 34.8 | 54.77 | | counter | 27.57 | 37.67 | | sand | 49.85 | 67.25 | | sink | 76.01 | 85.98 | | skyscraper | 46.65 | 56.78 | | fireplace | 75.34 | 87.34 | | refrigerator | 77.58 | 84.67 | | grandstand | 46.69 | 80.9 | | path | 18.05 | 27.23 | | stairs | 31.67 | 37.18 | | runway | 73.9 | 94.03 | | case | 46.31 | 61.86 | | pool table | 91.82 | 95.33 | | pillow | 62.23 | 74.25 | | screen door | 70.76 | 74.14 | | stairway | 31.91 | 46.8 | | river | 10.98 | 19.79 | | bridge | 66.1 | 77.65 | | bookcase | 39.27 | 56.01 | | blind | 38.62 | 43.53 | | coffee table | 63.66 | 79.72 | | toilet | 88.49 | 92.72 | | flower | 41.49 | 56.86 | | book | 50.04 | 69.31 | | hill | 17.16 | 25.43 | | bench | 62.45 | 73.41 | | countertop | 63.37 | 76.51 | | stove | 81.29 | 85.06 | | palm | 48.86 | 72.05 | | kitchen island | 45.64 | 63.35 | | computer | 77.77 | 88.7 | | swivel chair | 44.42 | 56.31 | | boat | 66.54 | 88.51 | | bar | 29.35 | 35.77 | | arcade machine | 72.8 | 75.9 | | hovel | 13.48 | 15.66 | | bus | 87.27 | 95.38 | | towel | 66.9 | 79.62 | | light | 51.82 | 61.52 | | truck | 46.92 | 62.83 | | tower | 35.26 | 59.87 | | chandelier | 65.47 | 81.17 | | awning | 31.45 | 40.37 | | streetlight | 35.01 | 50.27 | | booth | 44.62 | 62.74 | | television receiver | 70.86 | 86.13 | | airplane | 64.73 | 71.6 | | dirt track | 4.82 | 12.64 | | apparel | 31.58 | 52.65 | | pole | 27.13 | 38.99 | | land | 1.86 | 2.61 | | bannister | 15.78 | 20.43 | | escalator | 29.42 | 33.5 | | ottoman | 55.77 | 71.55 | | bottle | 26.25 | 36.28 | | buffet | 54.73 | 62.15 | | poster | 36.08 | 50.1 | | stage | 11.2 | 18.75 | | van | 42.63 | 55.28 | | ship | 8.39 | 8.76 | | fountain | 24.9 | 25.31 | | conveyer belt | 75.5 | 91.19 | | canopy | 32.06 | 45.63 | | washer | 69.08 | 72.01 | | plaything | 34.78 | 45.53 | | swimming pool | 64.5 | 73.22 | | stool | 46.32 | 65.94 | | barrel | 22.83 | 71.16 | | basket | 33.68 | 43.79 | | waterfall | 68.26 | 89.64 | | tent | 81.22 | 97.02 | | bag | 24.46 | 35.64 | | minibike | 73.87 | 84.62 | | cradle | 82.46 | 97.32 | | oven | 53.32 | 69.35 | | ball | 50.69 | 62.83 | | food | 50.48 | 53.78 | | step | 8.24 | 10.07 | | tank | 52.84 | 57.07 | | trade name | 27.97 | 32.68 | | microwave | 86.94 | 93.42 | | pot | 48.96 | 55.65 | | animal | 59.32 | 62.17 | | bicycle | 58.94 | 73.33 | | lake | 54.87 | 69.19 | | dishwasher | 75.85 | 86.14 | | screen | 68.32 | 89.45 | | blanket | 28.59 | 34.47 | | sculpture | 66.94 | 79.67 | | hood | 60.92 | 73.24 | | sconce | 47.06 | 61.68 | | vase | 41.72 | 65.27 | | traffic light | 40.99 | 64.48 | | tray | 18.89 | 30.22 | | ashcan | 42.35 | 53.06 | | fan | 61.01 | 69.59 | | pier | 41.97 | 46.27 | | crt screen | 5.57 | 15.61 | | plate | 57.32 | 76.71 | | monitor | 9.31 | 10.35 | | bulletin board | 56.32 | 65.35 | | shower | 6.1 | 11.89 | | radiator | 61.39 | 69.78 | | glass | 17.95 | 19.83 | | clock | 38.46 | 45.74 | | flag | 44.72 | 53.6 | +---------------------+-------+-------+ 2024/01/10 03:45:36 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.5130 coco/bbox_mAP_50: 0.6960 coco/bbox_mAP_75: 0.5610 coco/bbox_mAP_s: 0.3530 coco/bbox_mAP_m: 0.5620 coco/bbox_mAP_l: 0.6680 coco/segm_mAP: 0.3460 coco/segm_mAP_50: 0.6140 coco/segm_mAP_75: 0.3420 coco/segm_mAP_s: 0.1980 coco/segm_mAP_m: 0.3950 coco/segm_mAP_l: 0.5340 Bleu_1: 0.7617 Bleu_2: 0.5988 Bleu_3: 0.4567 Bleu_4: 0.3444 METEOR: 0.2763 ROUGE_L: 0.5619 CIDEr: 1.1222 SPICE: 0.2045 aAcc: 84.0900 mIoU: 51.5900 mAcc: 63.4600 visual-grounding/miou: 0.8281 visual-grounding/acc: 0.8876 data_time: 0.0279 time: 1.9153 2024/01/10 03:56:54 - mmengine - INFO - Iter(train) [440500/640000] base_lr: 4.4239e-05 lr: 4.4239e-06 eta: 3 days, 4:21:45 time: 1.3870 data_time: 0.0120 memory: 34661 grad_norm: 3.4088 loss: 1.1191 caption_loss_cls: 2.0001 detection_loss_cls: 0.0253 detection_loss_reg: 0.3098 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1674 instance_segmentation_loss_cls: 0.0257 instance_segmentation_loss_reg: 0.3159 instance_segmentation_loss_poly: 0.8430 2024/01/10 04:08:28 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 04:08:28 - mmengine - INFO - Iter(train) [441000/640000] base_lr: 4.4036e-05 lr: 4.4036e-06 eta: 3 days, 4:14:36 time: 1.3839 data_time: 0.0120 memory: 25566 grad_norm: 3.4321 loss: 1.1253 caption_loss_cls: 1.9980 detection_loss_cls: 0.0253 detection_loss_reg: 0.3105 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1664 instance_segmentation_loss_cls: 0.0257 instance_segmentation_loss_reg: 0.3156 instance_segmentation_loss_poly: 0.8420 2024/01/10 04:19:55 - mmengine - INFO - Iter(train) [441500/640000] base_lr: 4.3833e-05 lr: 4.3833e-06 eta: 3 days, 4:00:11 time: 1.3811 data_time: 0.0119 memory: 25566 grad_norm: 3.4420 loss: 1.1205 caption_loss_cls: 1.9945 detection_loss_cls: 0.0252 detection_loss_reg: 0.3091 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1649 instance_segmentation_loss_cls: 0.0257 instance_segmentation_loss_reg: 0.3157 instance_segmentation_loss_poly: 0.8426 2024/01/10 04:31:39 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 04:31:39 - mmengine - INFO - Iter(train) [442000/640000] base_lr: 4.3630e-05 lr: 4.3630e-06 eta: 3 days, 4:01:12 time: 1.3822 data_time: 0.0119 memory: 25566 grad_norm: 3.3929 loss: 1.1061 caption_loss_cls: 1.9967 detection_loss_cls: 0.0251 detection_loss_reg: 0.3088 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1629 instance_segmentation_loss_cls: 0.0257 instance_segmentation_loss_reg: 0.3160 instance_segmentation_loss_poly: 0.8425 2024/01/10 04:31:39 - mmengine - INFO - Saving checkpoint at 442000 iterations 2024/01/10 04:43:19 - mmengine - INFO - Iter(train) [442500/640000] base_lr: 4.3427e-05 lr: 4.3427e-06 eta: 3 days, 3:55:28 time: 1.3846 data_time: 0.0194 memory: 25566 grad_norm: 3.5108 loss: 1.1024 caption_loss_cls: 1.9960 detection_loss_cls: 0.0249 detection_loss_reg: 0.3082 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1593 instance_segmentation_loss_cls: 0.0256 instance_segmentation_loss_reg: 0.3157 instance_segmentation_loss_poly: 0.8404 2024/01/10 04:55:05 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 04:55:05 - mmengine - INFO - Iter(train) [443000/640000] base_lr: 4.3225e-05 lr: 4.3225e-06 eta: 3 days, 3:52:46 time: 1.3937 data_time: 0.0196 memory: 25566 grad_norm: 3.4627 loss: 1.0928 caption_loss_cls: 1.9963 detection_loss_cls: 0.0249 detection_loss_reg: 0.3094 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1585 instance_segmentation_loss_cls: 0.0254 instance_segmentation_loss_reg: 0.3142 instance_segmentation_loss_poly: 0.8371 2024/01/10 05:07:29 - mmengine - INFO - Iter(train) [443500/640000] base_lr: 4.3023e-05 lr: 4.3023e-06 eta: 3 days, 4:11:38 time: 1.4061 data_time: 0.0201 memory: 25566 grad_norm: 3.4545 loss: 1.0969 caption_loss_cls: 1.9971 detection_loss_cls: 0.0249 detection_loss_reg: 0.3088 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1573 instance_segmentation_loss_cls: 0.0255 instance_segmentation_loss_reg: 0.3157 instance_segmentation_loss_poly: 0.8380 2024/01/10 05:18:44 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 05:18:44 - mmengine - INFO - Iter(train) [444000/640000] base_lr: 4.2822e-05 lr: 4.2822e-06 eta: 3 days, 3:47:20 time: 1.3968 data_time: 0.0202 memory: 25566 grad_norm: 3.4344 loss: 1.0995 caption_loss_cls: 1.9946 detection_loss_cls: 0.0248 detection_loss_reg: 0.3078 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1559 instance_segmentation_loss_cls: 0.0255 instance_segmentation_loss_reg: 0.3155 instance_segmentation_loss_poly: 0.8387 2024/01/10 05:18:44 - mmengine - INFO - Saving checkpoint at 444000 iterations 2024/01/10 05:30:34 - mmengine - INFO - Iter(train) [444500/640000] base_lr: 4.2620e-05 lr: 4.2620e-06 eta: 3 days, 3:42:22 time: 1.4041 data_time: 0.0262 memory: 25566 grad_norm: 3.3982 loss: 1.0992 caption_loss_cls: 1.9889 detection_loss_cls: 0.0248 detection_loss_reg: 0.3080 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1548 instance_segmentation_loss_cls: 0.0255 instance_segmentation_loss_reg: 0.3158 instance_segmentation_loss_poly: 0.8394 2024/01/10 05:42:17 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 05:42:17 - mmengine - INFO - Iter(train) [445000/640000] base_lr: 4.2420e-05 lr: 4.2420e-06 eta: 3 days, 3:33:15 time: 1.4065 data_time: 0.0261 memory: 25566 grad_norm: 3.4257 loss: 1.1000 caption_loss_cls: 1.9770 detection_loss_cls: 0.0249 detection_loss_reg: 0.3095 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1585 instance_segmentation_loss_cls: 0.0254 instance_segmentation_loss_reg: 0.3159 instance_segmentation_loss_poly: 0.8384 2024/01/10 05:53:28 - mmengine - INFO - Iter(train) [445500/640000] base_lr: 4.2219e-05 lr: 4.2219e-06 eta: 3 days, 3:10:07 time: 1.4026 data_time: 0.0260 memory: 25566 grad_norm: 3.4585 loss: 1.1109 caption_loss_cls: 1.9770 detection_loss_cls: 0.0248 detection_loss_reg: 0.3097 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1586 instance_segmentation_loss_cls: 0.0254 instance_segmentation_loss_reg: 0.3152 instance_segmentation_loss_poly: 0.8363 2024/01/10 06:05:12 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 06:05:12 - mmengine - INFO - Iter(train) [446000/640000] base_lr: 4.2019e-05 lr: 4.2019e-06 eta: 3 days, 3:01:38 time: 1.4023 data_time: 0.0259 memory: 25566 grad_norm: 3.5017 loss: 1.1067 caption_loss_cls: 1.9741 detection_loss_cls: 0.0247 detection_loss_reg: 0.3085 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1541 instance_segmentation_loss_cls: 0.0253 instance_segmentation_loss_reg: 0.3145 instance_segmentation_loss_poly: 0.8346 2024/01/10 06:05:12 - mmengine - INFO - Saving checkpoint at 446000 iterations 2024/01/10 06:17:04 - mmengine - INFO - Iter(train) [446500/640000] base_lr: 4.1819e-05 lr: 4.1819e-06 eta: 3 days, 2:55:56 time: 1.4055 data_time: 0.0248 memory: 25566 grad_norm: 3.4831 loss: 1.1198 caption_loss_cls: 1.9709 detection_loss_cls: 0.0247 detection_loss_reg: 0.3075 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1458 instance_segmentation_loss_cls: 0.0255 instance_segmentation_loss_reg: 0.3151 instance_segmentation_loss_poly: 0.8368 2024/01/10 06:28:31 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 06:28:31 - mmengine - INFO - Iter(train) [447000/640000] base_lr: 4.1620e-05 lr: 4.1620e-06 eta: 3 days, 2:40:36 time: 1.4008 data_time: 0.0247 memory: 25566 grad_norm: 3.4921 loss: 1.1274 caption_loss_cls: 1.9701 detection_loss_cls: 0.0247 detection_loss_reg: 0.3088 semantic_segmentation_loss_cls: 0.0067 grounding_loss_reg: 2.1431 instance_segmentation_loss_cls: 0.0254 instance_segmentation_loss_reg: 0.3134 instance_segmentation_loss_poly: 0.8332 2024/01/10 06:40:10 - mmengine - INFO - Iter(train) [447500/640000] base_lr: 4.1421e-05 lr: 4.1421e-06 eta: 3 days, 2:29:39 time: 1.3893 data_time: 0.0243 memory: 25566 grad_norm: 3.5048 loss: 1.1125 caption_loss_cls: 1.9694 detection_loss_cls: 0.0247 detection_loss_reg: 0.3092 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1372 instance_segmentation_loss_cls: 0.0253 instance_segmentation_loss_reg: 0.3119 instance_segmentation_loss_poly: 0.8308 2024/01/10 06:51:50 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 06:51:50 - mmengine - INFO - Iter(train) [448000/640000] base_lr: 4.1222e-05 lr: 4.1222e-06 eta: 3 days, 2:19:15 time: 1.3957 data_time: 0.0243 memory: 25566 grad_norm: 3.5059 loss: 1.1127 caption_loss_cls: 1.9730 detection_loss_cls: 0.0247 detection_loss_reg: 0.3085 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1340 instance_segmentation_loss_cls: 0.0250 instance_segmentation_loss_reg: 0.3099 instance_segmentation_loss_poly: 0.8259 2024/01/10 06:51:50 - mmengine - INFO - Saving checkpoint at 448000 iterations 2024/01/10 07:03:57 - mmengine - INFO - Iter(train) [448500/640000] base_lr: 4.1023e-05 lr: 4.1023e-06 eta: 3 days, 2:16:28 time: 1.3999 data_time: 0.0239 memory: 25566 grad_norm: 3.4549 loss: 1.1015 caption_loss_cls: 1.9671 detection_loss_cls: 0.0247 detection_loss_reg: 0.3087 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1348 instance_segmentation_loss_cls: 0.0250 instance_segmentation_loss_reg: 0.3098 instance_segmentation_loss_poly: 0.8255 2024/01/10 07:15:26 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 07:15:26 - mmengine - INFO - Iter(train) [449000/640000] base_lr: 4.0825e-05 lr: 4.0825e-06 eta: 3 days, 2:02:11 time: 1.3965 data_time: 0.0239 memory: 25566 grad_norm: 3.4405 loss: 1.1022 caption_loss_cls: 1.9676 detection_loss_cls: 0.0248 detection_loss_reg: 0.3102 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1311 instance_segmentation_loss_cls: 0.0251 instance_segmentation_loss_reg: 0.3105 instance_segmentation_loss_poly: 0.8260 2024/01/10 07:26:38 - mmengine - INFO - Iter(train) [449500/640000] base_lr: 4.0628e-05 lr: 4.0628e-06 eta: 3 days, 1:43:27 time: 1.3968 data_time: 0.0238 memory: 25566 grad_norm: 3.4313 loss: 1.1050 caption_loss_cls: 1.9726 detection_loss_cls: 0.0250 detection_loss_reg: 0.3118 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1277 instance_segmentation_loss_cls: 0.0251 instance_segmentation_loss_reg: 0.3101 instance_segmentation_loss_poly: 0.8256 2024/01/10 07:38:26 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 07:38:26 - mmengine - INFO - Iter(train) [450000/640000] base_lr: 4.0430e-05 lr: 4.0430e-06 eta: 3 days, 1:34:44 time: 1.3979 data_time: 0.0239 memory: 25566 grad_norm: 3.3935 loss: 1.1251 caption_loss_cls: 1.9719 detection_loss_cls: 0.0251 detection_loss_reg: 0.3137 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1296 instance_segmentation_loss_cls: 0.0252 instance_segmentation_loss_reg: 0.3115 instance_segmentation_loss_poly: 0.8283 2024/01/10 07:38:26 - mmengine - INFO - Saving checkpoint at 450000 iterations 2024/01/10 07:49:55 - mmengine - INFO - Iter(train) [450500/640000] base_lr: 4.0234e-05 lr: 4.0234e-06 eta: 3 days, 1:20:54 time: 1.3920 data_time: 0.0236 memory: 25566 grad_norm: 3.3490 loss: 1.1195 caption_loss_cls: 1.9723 detection_loss_cls: 0.0251 detection_loss_reg: 0.3138 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1274 instance_segmentation_loss_cls: 0.0250 instance_segmentation_loss_reg: 0.3107 instance_segmentation_loss_poly: 0.8247 2024/01/10 08:01:45 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 08:01:45 - mmengine - INFO - Iter(train) [451000/640000] base_lr: 4.0037e-05 lr: 4.0037e-06 eta: 3 days, 1:12:25 time: 1.3978 data_time: 0.0235 memory: 25566 grad_norm: 3.3238 loss: 1.1142 caption_loss_cls: 1.9726 detection_loss_cls: 0.0250 detection_loss_reg: 0.3140 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1299 instance_segmentation_loss_cls: 0.0248 instance_segmentation_loss_reg: 0.3098 instance_segmentation_loss_poly: 0.8231 2024/01/10 08:13:10 - mmengine - INFO - Iter(train) [451500/640000] base_lr: 3.9841e-05 lr: 3.9841e-06 eta: 3 days, 0:57:55 time: 1.3944 data_time: 0.0235 memory: 25566 grad_norm: 3.3035 loss: 1.1255 caption_loss_cls: 1.9754 detection_loss_cls: 0.0251 detection_loss_reg: 0.3154 semantic_segmentation_loss_cls: 0.0069 grounding_loss_reg: 2.1317 instance_segmentation_loss_cls: 0.0248 instance_segmentation_loss_reg: 0.3083 instance_segmentation_loss_poly: 0.8193 2024/01/10 08:24:33 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 08:24:33 - mmengine - INFO - Iter(train) [452000/640000] base_lr: 3.9645e-05 lr: 3.9645e-06 eta: 3 days, 0:43:05 time: 1.3899 data_time: 0.0236 memory: 25566 grad_norm: 3.2972 loss: 1.1368 caption_loss_cls: 1.9722 detection_loss_cls: 0.0250 detection_loss_reg: 0.3152 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1305 instance_segmentation_loss_cls: 0.0249 instance_segmentation_loss_reg: 0.3094 instance_segmentation_loss_poly: 0.8219 2024/01/10 08:24:33 - mmengine - INFO - Saving checkpoint at 452000 iterations 2024/01/10 08:36:40 - mmengine - INFO - Iter(train) [452500/640000] base_lr: 3.9449e-05 lr: 3.9449e-06 eta: 3 days, 0:37:58 time: 1.3900 data_time: 0.0236 memory: 25566 grad_norm: 3.3238 loss: 1.1456 caption_loss_cls: 1.9705 detection_loss_cls: 0.0250 detection_loss_reg: 0.3154 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1325 instance_segmentation_loss_cls: 0.0247 instance_segmentation_loss_reg: 0.3085 instance_segmentation_loss_poly: 0.8205 2024/01/10 08:48:30 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 08:48:30 - mmengine - INFO - Iter(train) [453000/640000] base_lr: 3.9254e-05 lr: 3.9254e-06 eta: 3 days, 0:28:52 time: 1.3951 data_time: 0.0236 memory: 25566 grad_norm: 3.3415 loss: 1.1499 caption_loss_cls: 1.9630 detection_loss_cls: 0.0250 detection_loss_reg: 0.3149 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1304 instance_segmentation_loss_cls: 0.0246 instance_segmentation_loss_reg: 0.3082 instance_segmentation_loss_poly: 0.8206 2024/01/10 08:59:52 - mmengine - INFO - Iter(train) [453500/640000] base_lr: 3.9059e-05 lr: 3.9059e-06 eta: 3 days, 0:14:07 time: 1.3977 data_time: 0.0236 memory: 25566 grad_norm: 3.3049 loss: 1.1384 caption_loss_cls: 1.9636 detection_loss_cls: 0.0248 detection_loss_reg: 0.3139 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1293 instance_segmentation_loss_cls: 0.0247 instance_segmentation_loss_reg: 0.3079 instance_segmentation_loss_poly: 0.8190 2024/01/10 09:11:42 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 09:11:42 - mmengine - INFO - Iter(train) [454000/640000] base_lr: 3.8865e-05 lr: 3.8865e-06 eta: 3 days, 0:04:56 time: 1.3982 data_time: 0.0235 memory: 25566 grad_norm: 3.2850 loss: 1.1152 caption_loss_cls: 1.9635 detection_loss_cls: 0.0246 detection_loss_reg: 0.3122 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1290 instance_segmentation_loss_cls: 0.0247 instance_segmentation_loss_reg: 0.3068 instance_segmentation_loss_poly: 0.8165 2024/01/10 09:11:42 - mmengine - INFO - Saving checkpoint at 454000 iterations 2024/01/10 09:23:51 - mmengine - INFO - Iter(train) [454500/640000] base_lr: 3.8671e-05 lr: 3.8671e-06 eta: 2 days, 23:59:06 time: 1.4082 data_time: 0.0237 memory: 25566 grad_norm: 3.2668 loss: 1.1062 caption_loss_cls: 1.9634 detection_loss_cls: 0.0246 detection_loss_reg: 0.3118 semantic_segmentation_loss_cls: 0.0068 grounding_loss_reg: 2.1206 instance_segmentation_loss_cls: 0.0247 instance_segmentation_loss_reg: 0.3064 instance_segmentation_loss_poly: 0.8149 2024/01/10 09:35:14 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 09:35:14 - mmengine - INFO - Iter(train) [455000/640000] base_lr: 3.8477e-05 lr: 3.8477e-06 eta: 2 days, 23:44:28 time: 1.4013 data_time: 0.0237 memory: 25566 grad_norm: 3.2834 loss: 1.1069 caption_loss_cls: 1.9628 detection_loss_cls: 0.0247 detection_loss_reg: 0.3131 semantic_segmentation_loss_cls: 0.0067 grounding_loss_reg: 2.1212 instance_segmentation_loss_cls: 0.0249 instance_segmentation_loss_reg: 0.3079 instance_segmentation_loss_poly: 0.8191 2024/01/10 09:46:37 - mmengine - INFO - Iter(train) [455500/640000] base_lr: 3.8284e-05 lr: 3.8284e-06 eta: 2 days, 23:30:14 time: 1.4009 data_time: 0.0237 memory: 25566 grad_norm: 3.3083 loss: 1.1140 caption_loss_cls: 1.9611 detection_loss_cls: 0.0247 detection_loss_reg: 0.3124 semantic_segmentation_loss_cls: 0.0067 grounding_loss_reg: 2.1216 instance_segmentation_loss_cls: 0.0249 instance_segmentation_loss_reg: 0.3079 instance_segmentation_loss_poly: 0.8200 2024/01/10 09:58:13 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 09:58:13 - mmengine - INFO - Iter(train) [456000/640000] base_lr: 3.8091e-05 lr: 3.8091e-06 eta: 2 days, 23:18:17 time: 1.4043 data_time: 0.0236 memory: 25566 grad_norm: 3.2743 loss: 1.0922 caption_loss_cls: 1.9572 detection_loss_cls: 0.0248 detection_loss_reg: 0.3133 semantic_segmentation_loss_cls: 0.0067 grounding_loss_reg: 2.1178 instance_segmentation_loss_cls: 0.0247 instance_segmentation_loss_reg: 0.3074 instance_segmentation_loss_poly: 0.8178 2024/01/10 09:58:13 - mmengine - INFO - Saving checkpoint at 456000 iterations 2024/01/10 10:10:16 - mmengine - INFO - Iter(train) [456500/640000] base_lr: 3.7898e-05 lr: 3.7898e-06 eta: 2 days, 23:10:48 time: 1.4033 data_time: 0.0236 memory: 25566 grad_norm: 3.2882 loss: 1.0957 caption_loss_cls: 1.9619 detection_loss_cls: 0.0248 detection_loss_reg: 0.3138 semantic_segmentation_loss_cls: 0.0067 grounding_loss_reg: 2.1170 instance_segmentation_loss_cls: 0.0246 instance_segmentation_loss_reg: 0.3061 instance_segmentation_loss_poly: 0.8149 2024/01/10 10:21:30 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 10:21:30 - mmengine - INFO - Iter(train) [457000/640000] base_lr: 3.7706e-05 lr: 3.7706e-06 eta: 2 days, 22:55:14 time: 1.3944 data_time: 0.0234 memory: 25566 grad_norm: 3.3252 loss: 1.0905 caption_loss_cls: 1.9554 detection_loss_cls: 0.0249 detection_loss_reg: 0.3161 semantic_segmentation_loss_cls: 0.0067 grounding_loss_reg: 2.1146 instance_segmentation_loss_cls: 0.0245 instance_segmentation_loss_reg: 0.3045 instance_segmentation_loss_poly: 0.8114 2024/01/10 10:33:34 - mmengine - INFO - Iter(train) [457500/640000] base_lr: 3.7514e-05 lr: 3.7514e-06 eta: 2 days, 22:47:35 time: 1.4046 data_time: 0.0236 memory: 25566 grad_norm: 3.3420 loss: 1.0832 caption_loss_cls: 1.9478 detection_loss_cls: 0.0250 detection_loss_reg: 0.3166 semantic_segmentation_loss_cls: 0.0067 grounding_loss_reg: 2.1109 instance_segmentation_loss_cls: 0.0246 instance_segmentation_loss_reg: 0.3052 instance_segmentation_loss_poly: 0.8125 2024/01/10 10:44:56 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 10:44:56 - mmengine - INFO - Iter(train) [458000/640000] base_lr: 3.7323e-05 lr: 3.7323e-06 eta: 2 days, 22:33:25 time: 1.3976 data_time: 0.0236 memory: 25566 grad_norm: 3.4250 loss: 1.1031 caption_loss_cls: 1.9441 detection_loss_cls: 0.0251 detection_loss_reg: 0.3177 semantic_segmentation_loss_cls: 0.0067 grounding_loss_reg: 2.1110 instance_segmentation_loss_cls: 0.0247 instance_segmentation_loss_reg: 0.3061 instance_segmentation_loss_poly: 0.8121 2024/01/10 10:44:56 - mmengine - INFO - Saving checkpoint at 458000 iterations 2024/01/10 10:57:02 - mmengine - INFO - Iter(train) [458500/640000] base_lr: 3.7132e-05 lr: 3.7132e-06 eta: 2 days, 22:25:58 time: 1.3970 data_time: 0.0236 memory: 25566 grad_norm: 3.4231 loss: 1.1052 caption_loss_cls: 1.9369 detection_loss_cls: 0.0250 detection_loss_reg: 0.3165 semantic_segmentation_loss_cls: 0.0067 grounding_loss_reg: 2.1058 instance_segmentation_loss_cls: 0.0246 instance_segmentation_loss_reg: 0.3068 instance_segmentation_loss_poly: 0.8138 2024/01/10 11:08:17 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 11:08:17 - mmengine - INFO - Iter(train) [459000/640000] base_lr: 3.6941e-05 lr: 3.6941e-06 eta: 2 days, 22:10:46 time: 1.3950 data_time: 0.0235 memory: 25566 grad_norm: 3.4469 loss: 1.1087 caption_loss_cls: 1.9408 detection_loss_cls: 0.0249 detection_loss_reg: 0.3152 semantic_segmentation_loss_cls: 0.0067 grounding_loss_reg: 2.1059 instance_segmentation_loss_cls: 0.0245 instance_segmentation_loss_reg: 0.3071 instance_segmentation_loss_poly: 0.8140 2024/01/10 11:20:00 - mmengine - INFO - Iter(train) [459500/640000] base_lr: 3.6751e-05 lr: 3.6751e-06 eta: 2 days, 21:59:48 time: 1.3998 data_time: 0.0236 memory: 25566 grad_norm: 3.4267 loss: 1.1030 caption_loss_cls: 1.9447 detection_loss_cls: 0.0248 detection_loss_reg: 0.3140 semantic_segmentation_loss_cls: 0.0066 grounding_loss_reg: 2.1081 instance_segmentation_loss_cls: 0.0246 instance_segmentation_loss_reg: 0.3085 instance_segmentation_loss_poly: 0.8179 2024/01/10 11:31:54 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 11:31:54 - mmengine - INFO - Iter(train) [460000/640000] base_lr: 3.6561e-05 lr: 3.6561e-06 eta: 2 days, 21:50:24 time: 1.4045 data_time: 0.0237 memory: 25566 grad_norm: 3.4150 loss: 1.1046 caption_loss_cls: 1.9383 detection_loss_cls: 0.0250 detection_loss_reg: 0.3164 semantic_segmentation_loss_cls: 0.0066 grounding_loss_reg: 2.1071 instance_segmentation_loss_cls: 0.0246 instance_segmentation_loss_reg: 0.3092 instance_segmentation_loss_poly: 0.8196 2024/01/10 11:31:54 - mmengine - INFO - Saving checkpoint at 460000 iterations 2024/01/10 11:43:36 - mmengine - INFO - Evaluating bbox... 2024/01/10 11:44:32 - mmengine - INFO - bbox_mAP_copypaste: 0.524 0.705 0.575 0.370 0.570 0.681 2024/01/10 11:44:32 - mmengine - INFO - Evaluating segm... 2024/01/10 11:45:45 - mmengine - INFO - segm_mAP_copypaste: 0.355 0.622 0.353 0.207 0.398 0.537 2024/01/10 11:47:53 - mmengine - INFO - Evaluating bbox... 2024/01/10 11:48:51 - mmengine - INFO - bbox_mAP_copypaste: 0.523 0.704 0.573 0.366 0.569 0.680 2024/01/10 11:54:29 - mmengine - INFO - per class results: 2024/01/10 11:54:29 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 78.9 | 89.26 | | building | 82.04 | 92.47 | | sky | 93.52 | 97.81 | | floor | 82.66 | 91.3 | | tree | 74.93 | 87.72 | | ceiling | 86.36 | 94.64 | | road | 85.23 | 90.9 | | bed | 90.2 | 95.59 | | windowpane | 63.0 | 81.24 | | grass | 68.75 | 84.88 | | cabinet | 63.1 | 73.63 | | sidewalk | 68.82 | 82.46 | | person | 81.83 | 90.96 | | earth | 39.19 | 49.83 | | door | 55.91 | 72.79 | | table | 65.52 | 78.93 | | mountain | 60.99 | 80.3 | | plant | 53.64 | 62.86 | | curtain | 76.41 | 85.83 | | chair | 62.52 | 75.03 | | car | 85.19 | 92.21 | | water | 57.43 | 68.45 | | painting | 74.55 | 88.48 | | sofa | 72.95 | 85.21 | | shelf | 46.7 | 67.15 | | house | 35.78 | 40.77 | | sea | 60.41 | 77.43 | | mirror | 69.63 | 79.16 | | rug | 64.62 | 73.59 | | field | 38.88 | 60.66 | | armchair | 52.56 | 69.82 | | seat | 68.29 | 81.63 | | fence | 45.4 | 65.41 | | desk | 52.15 | 74.23 | | rock | 47.83 | 64.83 | | wardrobe | 47.4 | 70.0 | | lamp | 66.47 | 78.63 | | bathtub | 79.61 | 90.12 | | railing | 42.05 | 58.57 | | cushion | 63.02 | 75.31 | | base | 20.12 | 28.76 | | box | 29.78 | 38.12 | | column | 56.79 | 64.86 | | signboard | 37.82 | 51.22 | | chest of drawers | 41.01 | 64.15 | | counter | 30.07 | 41.16 | | sand | 46.44 | 67.11 | | sink | 77.26 | 85.66 | | skyscraper | 48.25 | 61.39 | | fireplace | 76.05 | 88.41 | | refrigerator | 81.1 | 86.42 | | grandstand | 44.65 | 84.87 | | path | 18.75 | 25.18 | | stairs | 36.54 | 44.84 | | runway | 74.22 | 95.58 | | case | 46.96 | 59.93 | | pool table | 92.12 | 95.79 | | pillow | 61.44 | 73.42 | | screen door | 74.44 | 77.25 | | stairway | 31.66 | 40.39 | | river | 13.68 | 36.25 | | bridge | 68.92 | 83.34 | | bookcase | 43.38 | 57.37 | | blind | 39.32 | 47.0 | | coffee table | 65.4 | 79.66 | | toilet | 87.19 | 91.87 | | flower | 40.12 | 55.29 | | book | 48.92 | 71.04 | | hill | 16.8 | 22.69 | | bench | 59.82 | 70.76 | | countertop | 63.69 | 78.64 | | stove | 81.78 | 85.67 | | palm | 47.77 | 72.26 | | kitchen island | 52.62 | 78.82 | | computer | 77.19 | 87.61 | | swivel chair | 43.27 | 54.45 | | boat | 64.59 | 85.56 | | bar | 32.33 | 41.84 | | arcade machine | 45.68 | 47.98 | | hovel | 26.66 | 28.68 | | bus | 87.74 | 95.21 | | towel | 65.83 | 80.06 | | light | 51.81 | 61.64 | | truck | 47.39 | 63.95 | | tower | 8.6 | 12.94 | | chandelier | 66.51 | 79.18 | | awning | 31.09 | 40.6 | | streetlight | 34.72 | 46.52 | | booth | 41.94 | 57.45 | | television receiver | 74.35 | 87.51 | | airplane | 63.45 | 70.84 | | dirt track | 8.88 | 20.21 | | apparel | 33.37 | 53.67 | | pole | 29.76 | 43.24 | | land | 4.24 | 5.61 | | bannister | 15.64 | 20.98 | | escalator | 23.34 | 24.81 | | ottoman | 52.97 | 72.58 | | bottle | 28.0 | 37.63 | | buffet | 54.95 | 62.94 | | poster | 33.5 | 43.67 | | stage | 13.6 | 19.29 | | van | 44.16 | 57.76 | | ship | 8.78 | 9.41 | | fountain | 20.49 | 20.96 | | conveyer belt | 76.23 | 90.64 | | canopy | 39.82 | 46.27 | | washer | 67.09 | 71.62 | | plaything | 28.95 | 35.55 | | swimming pool | 69.01 | 72.02 | | stool | 45.89 | 64.5 | | barrel | 16.95 | 68.75 | | basket | 33.6 | 45.49 | | waterfall | 66.03 | 86.57 | | tent | 73.13 | 97.51 | | bag | 24.33 | 34.87 | | minibike | 74.17 | 85.65 | | cradle | 83.81 | 96.35 | | oven | 54.29 | 68.66 | | ball | 52.2 | 67.58 | | food | 52.87 | 57.65 | | step | 6.51 | 7.68 | | tank | 47.54 | 51.02 | | trade name | 29.68 | 39.36 | | microwave | 86.18 | 93.63 | | pot | 49.46 | 56.34 | | animal | 63.86 | 66.87 | | bicycle | 59.82 | 75.36 | | lake | 56.99 | 62.78 | | dishwasher | 73.38 | 81.35 | | screen | 64.17 | 85.82 | | blanket | 30.98 | 36.87 | | sculpture | 68.81 | 81.61 | | hood | 58.98 | 73.16 | | sconce | 47.0 | 59.37 | | vase | 45.91 | 59.87 | | traffic light | 41.26 | 63.08 | | tray | 14.86 | 16.66 | | ashcan | 41.83 | 53.78 | | fan | 63.1 | 77.46 | | pier | 38.53 | 42.25 | | crt screen | 5.74 | 16.12 | | plate | 58.03 | 77.91 | | monitor | 9.64 | 11.36 | | bulletin board | 59.89 | 69.16 | | shower | 6.28 | 8.01 | | radiator | 61.12 | 73.36 | | glass | 17.45 | 18.99 | | clock | 40.68 | 48.77 | | flag | 45.07 | 53.15 | +---------------------+-------+-------+ 2024/01/10 11:54:41 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.5230 coco/bbox_mAP_50: 0.7040 coco/bbox_mAP_75: 0.5730 coco/bbox_mAP_s: 0.3660 coco/bbox_mAP_m: 0.5690 coco/bbox_mAP_l: 0.6800 coco/segm_mAP: 0.3550 coco/segm_mAP_50: 0.6220 coco/segm_mAP_75: 0.3530 coco/segm_mAP_s: 0.2070 coco/segm_mAP_m: 0.3980 coco/segm_mAP_l: 0.5370 Bleu_1: 0.7681 Bleu_2: 0.6060 Bleu_3: 0.4645 Bleu_4: 0.3529 METEOR: 0.2787 ROUGE_L: 0.5671 CIDEr: 1.1493 SPICE: 0.2070 aAcc: 84.2300 mIoU: 51.6400 mAcc: 63.4100 visual-grounding/miou: 0.8250 visual-grounding/acc: 0.8836 data_time: 0.0119 time: 1.9021 2024/01/10 12:06:25 - mmengine - INFO - Iter(train) [460500/640000] base_lr: 3.6372e-05 lr: 3.6372e-06 eta: 2 days, 21:39:41 time: 1.4002 data_time: 0.0181 memory: 34657 grad_norm: 3.4004 loss: 1.0955 caption_loss_cls: 1.9367 detection_loss_cls: 0.0250 detection_loss_reg: 0.3177 semantic_segmentation_loss_cls: 0.0066 grounding_loss_reg: 2.1093 instance_segmentation_loss_cls: 0.0247 instance_segmentation_loss_reg: 0.3106 instance_segmentation_loss_poly: 0.8221 2024/01/10 12:17:34 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 12:17:34 - mmengine - INFO - Iter(train) [461000/640000] base_lr: 3.6182e-05 lr: 3.6182e-06 eta: 2 days, 21:24:11 time: 1.3989 data_time: 0.0181 memory: 25565 grad_norm: 3.4010 loss: 1.0953 caption_loss_cls: 1.9347 detection_loss_cls: 0.0252 detection_loss_reg: 0.3182 semantic_segmentation_loss_cls: 0.0066 grounding_loss_reg: 2.1080 instance_segmentation_loss_cls: 0.0248 instance_segmentation_loss_reg: 0.3107 instance_segmentation_loss_poly: 0.8244 2024/01/10 12:28:49 - mmengine - INFO - Iter(train) [461500/640000] base_lr: 3.5994e-05 lr: 3.5994e-06 eta: 2 days, 21:09:42 time: 1.3870 data_time: 0.0180 memory: 25565 grad_norm: 3.6135 loss: 1.1072 caption_loss_cls: 1.9374 detection_loss_cls: 0.0251 detection_loss_reg: 0.3180 semantic_segmentation_loss_cls: 0.0066 grounding_loss_reg: 2.1081 instance_segmentation_loss_cls: 0.0247 instance_segmentation_loss_reg: 0.3102 instance_segmentation_loss_poly: 0.8229 2024/01/10 12:40:20 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 12:40:20 - mmengine - INFO - Iter(train) [462000/640000] base_lr: 3.5805e-05 lr: 3.5805e-06 eta: 2 days, 20:57:08 time: 1.3891 data_time: 0.0181 memory: 25565 grad_norm: 3.5771 loss: 1.1047 caption_loss_cls: 1.9394 detection_loss_cls: 0.0250 detection_loss_reg: 0.3163 semantic_segmentation_loss_cls: 0.0066 grounding_loss_reg: 2.1046 instance_segmentation_loss_cls: 0.0247 instance_segmentation_loss_reg: 0.3093 instance_segmentation_loss_poly: 0.8206 2024/01/10 12:40:20 - mmengine - INFO - Saving checkpoint at 462000 iterations 2024/01/10 12:52:21 - mmengine - INFO - Iter(train) [462500/640000] base_lr: 3.5617e-05 lr: 3.5617e-06 eta: 2 days, 20:48:25 time: 1.3879 data_time: 0.0179 memory: 25565 grad_norm: 3.5846 loss: 1.0970 caption_loss_cls: 1.9403 detection_loss_cls: 0.0249 detection_loss_reg: 0.3134 semantic_segmentation_loss_cls: 0.0066 grounding_loss_reg: 2.1051 instance_segmentation_loss_cls: 0.0246 instance_segmentation_loss_reg: 0.3077 instance_segmentation_loss_poly: 0.8173 2024/01/10 13:04:32 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 13:04:32 - mmengine - INFO - Iter(train) [463000/640000] base_lr: 3.5430e-05 lr: 3.5430e-06 eta: 2 days, 20:40:39 time: 1.4021 data_time: 0.0181 memory: 25565 grad_norm: 3.5309 loss: 1.0841 caption_loss_cls: 1.9413 detection_loss_cls: 0.0249 detection_loss_reg: 0.3122 semantic_segmentation_loss_cls: 0.0066 grounding_loss_reg: 2.1018 instance_segmentation_loss_cls: 0.0246 instance_segmentation_loss_reg: 0.3083 instance_segmentation_loss_poly: 0.8192 2024/01/10 13:15:44 - mmengine - INFO - Iter(train) [463500/640000] base_lr: 3.5242e-05 lr: 3.5242e-06 eta: 2 days, 20:25:50 time: 1.3941 data_time: 0.0181 memory: 25565 grad_norm: 3.5835 loss: 1.0861 caption_loss_cls: 1.9355 detection_loss_cls: 0.0248 detection_loss_reg: 0.3114 semantic_segmentation_loss_cls: 0.0066 grounding_loss_reg: 2.1016 instance_segmentation_loss_cls: 0.0246 instance_segmentation_loss_reg: 0.3078 instance_segmentation_loss_poly: 0.8191 2024/01/10 13:26:37 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 13:26:37 - mmengine - INFO - Iter(train) [464000/640000] base_lr: 3.5056e-05 lr: 3.5056e-06 eta: 2 days, 20:09:09 time: 1.3788 data_time: 0.0180 memory: 25565 grad_norm: 3.6528 loss: 1.0982 caption_loss_cls: 1.9404 detection_loss_cls: 0.0248 detection_loss_reg: 0.3116 semantic_segmentation_loss_cls: 0.0066 grounding_loss_reg: 2.1014 instance_segmentation_loss_cls: 0.0246 instance_segmentation_loss_reg: 0.3063 instance_segmentation_loss_poly: 0.8164 2024/01/10 13:26:37 - mmengine - INFO - Saving checkpoint at 464000 iterations 2024/01/10 13:38:20 - mmengine - INFO - Iter(train) [464500/640000] base_lr: 3.4869e-05 lr: 3.4869e-06 eta: 2 days, 19:58:08 time: 1.3781 data_time: 0.0236 memory: 25565 grad_norm: 3.6467 loss: 1.1018 caption_loss_cls: 1.9448 detection_loss_cls: 0.0247 detection_loss_reg: 0.3108 semantic_segmentation_loss_cls: 0.0066 grounding_loss_reg: 2.1015 instance_segmentation_loss_cls: 0.0244 instance_segmentation_loss_reg: 0.3048 instance_segmentation_loss_poly: 0.8124 2024/01/10 13:50:05 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 13:50:05 - mmengine - INFO - Iter(train) [465000/640000] base_lr: 3.4683e-05 lr: 3.4683e-06 eta: 2 days, 19:47:16 time: 1.3869 data_time: 0.0238 memory: 25565 grad_norm: 3.6036 loss: 1.1033 caption_loss_cls: 1.9436 detection_loss_cls: 0.0247 detection_loss_reg: 0.3103 semantic_segmentation_loss_cls: 0.0066 grounding_loss_reg: 2.0976 instance_segmentation_loss_cls: 0.0244 instance_segmentation_loss_reg: 0.3049 instance_segmentation_loss_poly: 0.8110 2024/01/10 14:01:15 - mmengine - INFO - Iter(train) [465500/640000] base_lr: 3.4497e-05 lr: 3.4497e-06 eta: 2 days, 19:32:48 time: 1.3856 data_time: 0.0239 memory: 25565 grad_norm: 3.4238 loss: 1.1096 caption_loss_cls: 1.9458 detection_loss_cls: 0.0247 detection_loss_reg: 0.3101 semantic_segmentation_loss_cls: 0.0066 grounding_loss_reg: 2.0941 instance_segmentation_loss_cls: 0.0245 instance_segmentation_loss_reg: 0.3059 instance_segmentation_loss_poly: 0.8136 2024/01/10 14:13:04 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 14:13:04 - mmengine - INFO - Iter(train) [466000/640000] base_lr: 3.4312e-05 lr: 3.4312e-06 eta: 2 days, 19:22:21 time: 1.3902 data_time: 0.0238 memory: 25565 grad_norm: 3.3765 loss: 1.1038 caption_loss_cls: 1.9470 detection_loss_cls: 0.0246 detection_loss_reg: 0.3089 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0955 instance_segmentation_loss_cls: 0.0243 instance_segmentation_loss_reg: 0.3049 instance_segmentation_loss_poly: 0.8107 2024/01/10 14:13:04 - mmengine - INFO - Saving checkpoint at 466000 iterations 2024/01/10 14:25:26 - mmengine - INFO - Iter(train) [466500/640000] base_lr: 3.4127e-05 lr: 3.4127e-06 eta: 2 days, 19:15:18 time: 1.3953 data_time: 0.0241 memory: 25565 grad_norm: 3.3366 loss: 1.1050 caption_loss_cls: 1.9438 detection_loss_cls: 0.0246 detection_loss_reg: 0.3088 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0976 instance_segmentation_loss_cls: 0.0243 instance_segmentation_loss_reg: 0.3046 instance_segmentation_loss_poly: 0.8098 2024/01/10 14:36:53 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 14:36:53 - mmengine - INFO - Iter(train) [467000/640000] base_lr: 3.3943e-05 lr: 3.3943e-06 eta: 2 days, 19:02:33 time: 1.3843 data_time: 0.0240 memory: 25565 grad_norm: 3.3628 loss: 1.1155 caption_loss_cls: 1.9447 detection_loss_cls: 0.0246 detection_loss_reg: 0.3089 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0968 instance_segmentation_loss_cls: 0.0242 instance_segmentation_loss_reg: 0.3044 instance_segmentation_loss_poly: 0.8088 2024/01/10 14:48:33 - mmengine - INFO - Iter(train) [467500/640000] base_lr: 3.3759e-05 lr: 3.3759e-06 eta: 2 days, 18:51:07 time: 1.3915 data_time: 0.0240 memory: 25565 grad_norm: 3.3217 loss: 1.1081 caption_loss_cls: 1.9430 detection_loss_cls: 0.0245 detection_loss_reg: 0.3079 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0954 instance_segmentation_loss_cls: 0.0242 instance_segmentation_loss_reg: 0.3047 instance_segmentation_loss_poly: 0.8094 2024/01/10 15:00:54 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 15:00:54 - mmengine - INFO - Iter(train) [468000/640000] base_lr: 3.3575e-05 lr: 3.3575e-06 eta: 2 days, 18:43:38 time: 1.4135 data_time: 0.0244 memory: 25565 grad_norm: 3.2696 loss: 1.0990 caption_loss_cls: 1.9437 detection_loss_cls: 0.0247 detection_loss_reg: 0.3087 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0980 instance_segmentation_loss_cls: 0.0243 instance_segmentation_loss_reg: 0.3067 instance_segmentation_loss_poly: 0.8137 2024/01/10 15:00:54 - mmengine - INFO - Saving checkpoint at 468000 iterations 2024/01/10 15:12:32 - mmengine - INFO - Iter(train) [468500/640000] base_lr: 3.3392e-05 lr: 3.3392e-06 eta: 2 days, 18:31:56 time: 1.4122 data_time: 0.0243 memory: 25565 grad_norm: 3.3173 loss: 1.0987 caption_loss_cls: 1.9423 detection_loss_cls: 0.0247 detection_loss_reg: 0.3097 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0947 instance_segmentation_loss_cls: 0.0243 instance_segmentation_loss_reg: 0.3066 instance_segmentation_loss_poly: 0.8130 2024/01/10 15:24:18 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 15:24:18 - mmengine - INFO - Iter(train) [469000/640000] base_lr: 3.3209e-05 lr: 3.3209e-06 eta: 2 days, 18:20:58 time: 1.4126 data_time: 0.0243 memory: 25565 grad_norm: 3.3422 loss: 1.0991 caption_loss_cls: 1.9415 detection_loss_cls: 0.0246 detection_loss_reg: 0.3084 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0933 instance_segmentation_loss_cls: 0.0241 instance_segmentation_loss_reg: 0.3064 instance_segmentation_loss_poly: 0.8125 2024/01/10 15:35:22 - mmengine - INFO - Iter(train) [469500/640000] base_lr: 3.3027e-05 lr: 3.3027e-06 eta: 2 days, 18:06:08 time: 1.4108 data_time: 0.0242 memory: 25565 grad_norm: 3.3699 loss: 1.0902 caption_loss_cls: 1.9403 detection_loss_cls: 0.0245 detection_loss_reg: 0.3077 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0899 instance_segmentation_loss_cls: 0.0241 instance_segmentation_loss_reg: 0.3060 instance_segmentation_loss_poly: 0.8122 2024/01/10 15:47:33 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 15:47:33 - mmengine - INFO - Iter(train) [470000/640000] base_lr: 3.2844e-05 lr: 3.2844e-06 eta: 2 days, 17:57:23 time: 1.4164 data_time: 0.0243 memory: 25565 grad_norm: 3.3676 loss: 1.0873 caption_loss_cls: 1.9407 detection_loss_cls: 0.0243 detection_loss_reg: 0.3050 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0890 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3044 instance_segmentation_loss_poly: 0.8071 2024/01/10 15:47:33 - mmengine - INFO - Saving checkpoint at 470000 iterations 2024/01/10 15:59:27 - mmengine - INFO - Iter(train) [470500/640000] base_lr: 3.2663e-05 lr: 3.2663e-06 eta: 2 days, 17:47:04 time: 1.4093 data_time: 0.0241 memory: 25565 grad_norm: 3.4081 loss: 1.0903 caption_loss_cls: 1.9393 detection_loss_cls: 0.0244 detection_loss_reg: 0.3056 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 2.0838 instance_segmentation_loss_cls: 0.0238 instance_segmentation_loss_reg: 0.3026 instance_segmentation_loss_poly: 0.8030 2024/01/10 16:10:57 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 16:10:57 - mmengine - INFO - Iter(train) [471000/640000] base_lr: 3.2482e-05 lr: 3.2482e-06 eta: 2 days, 17:34:41 time: 1.4102 data_time: 0.0241 memory: 25565 grad_norm: 3.4468 loss: 1.0894 caption_loss_cls: 1.9337 detection_loss_cls: 0.0244 detection_loss_reg: 0.3046 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 2.0783 instance_segmentation_loss_cls: 0.0238 instance_segmentation_loss_reg: 0.3018 instance_segmentation_loss_poly: 0.8025 2024/01/10 16:22:41 - mmengine - INFO - Iter(train) [471500/640000] base_lr: 3.2301e-05 lr: 3.2301e-06 eta: 2 days, 17:23:28 time: 1.4112 data_time: 0.0241 memory: 25565 grad_norm: 3.4531 loss: 1.0894 caption_loss_cls: 1.9352 detection_loss_cls: 0.0243 detection_loss_reg: 0.3031 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 2.0766 instance_segmentation_loss_cls: 0.0238 instance_segmentation_loss_reg: 0.3018 instance_segmentation_loss_poly: 0.8028 2024/01/10 16:34:06 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 16:34:06 - mmengine - INFO - Iter(train) [472000/640000] base_lr: 3.2120e-05 lr: 3.2120e-06 eta: 2 days, 17:10:43 time: 1.3972 data_time: 0.0239 memory: 25565 grad_norm: 3.4824 loss: 1.0892 caption_loss_cls: 1.9285 detection_loss_cls: 0.0243 detection_loss_reg: 0.3038 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 2.0779 instance_segmentation_loss_cls: 0.0238 instance_segmentation_loss_reg: 0.3024 instance_segmentation_loss_poly: 0.8042 2024/01/10 16:34:06 - mmengine - INFO - Saving checkpoint at 472000 iterations 2024/01/10 16:45:58 - mmengine - INFO - Iter(train) [472500/640000] base_lr: 3.1940e-05 lr: 3.1940e-06 eta: 2 days, 17:00:07 time: 1.4006 data_time: 0.0240 memory: 25565 grad_norm: 3.4408 loss: 1.0995 caption_loss_cls: 1.9259 detection_loss_cls: 0.0243 detection_loss_reg: 0.3036 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 2.0756 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3029 instance_segmentation_loss_poly: 0.8062 2024/01/10 16:57:25 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 16:57:25 - mmengine - INFO - Iter(train) [473000/640000] base_lr: 3.1761e-05 lr: 3.1761e-06 eta: 2 days, 16:47:31 time: 1.3958 data_time: 0.0240 memory: 25565 grad_norm: 3.3894 loss: 1.0946 caption_loss_cls: 1.9213 detection_loss_cls: 0.0244 detection_loss_reg: 0.3039 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 2.0733 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3037 instance_segmentation_loss_poly: 0.8067 2024/01/10 17:09:01 - mmengine - INFO - Iter(train) [473500/640000] base_lr: 3.1581e-05 lr: 3.1581e-06 eta: 2 days, 16:35:42 time: 1.4040 data_time: 0.0240 memory: 25565 grad_norm: 3.3430 loss: 1.0884 caption_loss_cls: 1.9242 detection_loss_cls: 0.0244 detection_loss_reg: 0.3036 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 2.0733 instance_segmentation_loss_cls: 0.0238 instance_segmentation_loss_reg: 0.3037 instance_segmentation_loss_poly: 0.8054 2024/01/10 17:20:35 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 17:20:35 - mmengine - INFO - Iter(train) [474000/640000] base_lr: 3.1403e-05 lr: 3.1403e-06 eta: 2 days, 16:23:40 time: 1.3948 data_time: 0.0240 memory: 25565 grad_norm: 3.3971 loss: 1.1012 caption_loss_cls: 1.9299 detection_loss_cls: 0.0243 detection_loss_reg: 0.3030 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 2.0753 instance_segmentation_loss_cls: 0.0238 instance_segmentation_loss_reg: 0.3048 instance_segmentation_loss_poly: 0.8088 2024/01/10 17:20:35 - mmengine - INFO - Saving checkpoint at 474000 iterations 2024/01/10 17:32:45 - mmengine - INFO - Iter(train) [474500/640000] base_lr: 3.1224e-05 lr: 3.1224e-06 eta: 2 days, 16:14:22 time: 1.3987 data_time: 0.0243 memory: 25565 grad_norm: 3.4192 loss: 1.1061 caption_loss_cls: 1.9234 detection_loss_cls: 0.0242 detection_loss_reg: 0.3016 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 2.0766 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3057 instance_segmentation_loss_poly: 0.8115 2024/01/10 17:44:41 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 17:44:41 - mmengine - INFO - Iter(train) [475000/640000] base_lr: 3.1046e-05 lr: 3.1046e-06 eta: 2 days, 16:04:00 time: 1.4051 data_time: 0.0243 memory: 25565 grad_norm: 3.3753 loss: 1.0994 caption_loss_cls: 1.9250 detection_loss_cls: 0.0242 detection_loss_reg: 0.3012 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0744 instance_segmentation_loss_cls: 0.0240 instance_segmentation_loss_reg: 0.3053 instance_segmentation_loss_poly: 0.8101 2024/01/10 17:56:14 - mmengine - INFO - Iter(train) [475500/640000] base_lr: 3.0869e-05 lr: 3.0869e-06 eta: 2 days, 15:51:53 time: 1.4024 data_time: 0.0243 memory: 25565 grad_norm: 3.3663 loss: 1.1073 caption_loss_cls: 1.9280 detection_loss_cls: 0.0241 detection_loss_reg: 0.3007 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0726 instance_segmentation_loss_cls: 0.0238 instance_segmentation_loss_reg: 0.3055 instance_segmentation_loss_poly: 0.8101 2024/01/10 18:07:27 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 18:07:27 - mmengine - INFO - Iter(train) [476000/640000] base_lr: 3.0692e-05 lr: 3.0692e-06 eta: 2 days, 15:38:21 time: 1.3993 data_time: 0.0242 memory: 25565 grad_norm: 3.3653 loss: 1.1160 caption_loss_cls: 1.9274 detection_loss_cls: 0.0240 detection_loss_reg: 0.3000 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0740 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3064 instance_segmentation_loss_poly: 0.8114 2024/01/10 18:07:27 - mmengine - INFO - Saving checkpoint at 476000 iterations 2024/01/10 18:19:41 - mmengine - INFO - Iter(train) [476500/640000] base_lr: 3.0515e-05 lr: 3.0515e-06 eta: 2 days, 15:29:14 time: 1.4050 data_time: 0.0242 memory: 25565 grad_norm: 3.3760 loss: 1.1060 caption_loss_cls: 1.9281 detection_loss_cls: 0.0242 detection_loss_reg: 0.3014 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0727 instance_segmentation_loss_cls: 0.0240 instance_segmentation_loss_reg: 0.3078 instance_segmentation_loss_poly: 0.8148 2024/01/10 18:30:52 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 18:30:52 - mmengine - INFO - Iter(train) [477000/640000] base_lr: 3.0339e-05 lr: 3.0339e-06 eta: 2 days, 15:15:35 time: 1.4010 data_time: 0.0242 memory: 25565 grad_norm: 3.4535 loss: 1.1179 caption_loss_cls: 1.9309 detection_loss_cls: 0.0240 detection_loss_reg: 0.3004 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0731 instance_segmentation_loss_cls: 0.0241 instance_segmentation_loss_reg: 0.3081 instance_segmentation_loss_poly: 0.8155 2024/01/10 18:42:48 - mmengine - INFO - Iter(train) [477500/640000] base_lr: 3.0163e-05 lr: 3.0163e-06 eta: 2 days, 15:05:05 time: 1.4058 data_time: 0.0242 memory: 25565 grad_norm: 3.4307 loss: 1.1097 caption_loss_cls: 1.9331 detection_loss_cls: 0.0239 detection_loss_reg: 0.2993 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0717 instance_segmentation_loss_cls: 0.0241 instance_segmentation_loss_reg: 0.3071 instance_segmentation_loss_poly: 0.8130 2024/01/10 18:54:28 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 18:54:28 - mmengine - INFO - Iter(train) [478000/640000] base_lr: 2.9987e-05 lr: 2.9987e-06 eta: 2 days, 14:53:30 time: 1.4074 data_time: 0.0242 memory: 25565 grad_norm: 3.4347 loss: 1.0911 caption_loss_cls: 1.9360 detection_loss_cls: 0.0239 detection_loss_reg: 0.2986 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0680 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3060 instance_segmentation_loss_poly: 0.8102 2024/01/10 18:54:28 - mmengine - INFO - Saving checkpoint at 478000 iterations 2024/01/10 19:06:15 - mmengine - INFO - Iter(train) [478500/640000] base_lr: 2.9812e-05 lr: 2.9812e-06 eta: 2 days, 14:42:21 time: 1.4016 data_time: 0.0240 memory: 25565 grad_norm: 3.4239 loss: 1.0919 caption_loss_cls: 1.9373 detection_loss_cls: 0.0240 detection_loss_reg: 0.2996 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0675 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3051 instance_segmentation_loss_poly: 0.8100 2024/01/10 19:17:38 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 19:17:38 - mmengine - INFO - Iter(train) [479000/640000] base_lr: 2.9638e-05 lr: 2.9638e-06 eta: 2 days, 14:29:40 time: 1.3935 data_time: 0.0239 memory: 25565 grad_norm: 3.4558 loss: 1.1039 caption_loss_cls: 1.9349 detection_loss_cls: 0.0242 detection_loss_reg: 0.3013 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0664 instance_segmentation_loss_cls: 0.0240 instance_segmentation_loss_reg: 0.3054 instance_segmentation_loss_poly: 0.8093 2024/01/10 19:29:10 - mmengine - INFO - Iter(train) [479500/640000] base_lr: 2.9463e-05 lr: 2.9463e-06 eta: 2 days, 14:17:32 time: 1.3931 data_time: 0.0240 memory: 25565 grad_norm: 3.4344 loss: 1.0940 caption_loss_cls: 1.9332 detection_loss_cls: 0.0241 detection_loss_reg: 0.3005 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0651 instance_segmentation_loss_cls: 0.0241 instance_segmentation_loss_reg: 0.3061 instance_segmentation_loss_poly: 0.8104 2024/01/10 19:40:30 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 19:40:30 - mmengine - INFO - Iter(train) [480000/640000] base_lr: 2.9290e-05 lr: 2.9290e-06 eta: 2 days, 14:04:44 time: 1.3951 data_time: 0.0240 memory: 25565 grad_norm: 3.4462 loss: 1.0834 caption_loss_cls: 1.9336 detection_loss_cls: 0.0241 detection_loss_reg: 0.3000 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0624 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3035 instance_segmentation_loss_poly: 0.8046 2024/01/10 19:40:30 - mmengine - INFO - Saving checkpoint at 480000 iterations 2024/01/10 19:52:13 - mmengine - INFO - Evaluating bbox... 2024/01/10 19:53:10 - mmengine - INFO - bbox_mAP_copypaste: 0.520 0.702 0.570 0.365 0.568 0.678 2024/01/10 19:53:10 - mmengine - INFO - Evaluating segm... 2024/01/10 19:54:22 - mmengine - INFO - segm_mAP_copypaste: 0.351 0.619 0.353 0.207 0.399 0.541 2024/01/10 19:56:30 - mmengine - INFO - Evaluating bbox... 2024/01/10 19:57:28 - mmengine - INFO - bbox_mAP_copypaste: 0.518 0.702 0.568 0.362 0.566 0.678 2024/01/10 20:02:15 - mmengine - INFO - per class results: 2024/01/10 20:02:15 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 79.23 | 89.04 | | building | 82.44 | 92.38 | | sky | 93.36 | 97.82 | | floor | 82.28 | 91.48 | | tree | 74.19 | 87.62 | | ceiling | 85.53 | 94.47 | | road | 84.73 | 90.54 | | bed | 90.87 | 95.96 | | windowpane | 63.72 | 80.47 | | grass | 68.23 | 85.16 | | cabinet | 64.31 | 76.55 | | sidewalk | 70.18 | 82.14 | | person | 81.93 | 92.43 | | earth | 35.06 | 42.47 | | door | 53.92 | 70.57 | | table | 65.08 | 80.78 | | mountain | 60.64 | 77.33 | | plant | 53.23 | 62.12 | | curtain | 76.71 | 88.0 | | chair | 62.7 | 75.0 | | car | 85.31 | 92.12 | | water | 59.92 | 73.67 | | painting | 74.9 | 87.69 | | sofa | 71.53 | 83.07 | | shelf | 45.93 | 67.41 | | house | 44.47 | 60.71 | | sea | 58.43 | 77.24 | | mirror | 68.82 | 79.03 | | rug | 61.36 | 69.69 | | field | 36.61 | 60.54 | | armchair | 51.84 | 71.33 | | seat | 65.93 | 81.63 | | fence | 47.76 | 61.75 | | desk | 54.07 | 70.74 | | rock | 47.97 | 68.69 | | wardrobe | 46.82 | 64.94 | | lamp | 67.41 | 81.47 | | bathtub | 80.15 | 89.55 | | railing | 41.17 | 58.45 | | cushion | 64.43 | 78.83 | | base | 20.04 | 27.91 | | box | 30.31 | 38.66 | | column | 54.9 | 65.77 | | signboard | 38.68 | 51.91 | | chest of drawers | 37.53 | 54.89 | | counter | 32.15 | 47.84 | | sand | 43.62 | 69.2 | | sink | 78.26 | 88.06 | | skyscraper | 46.11 | 55.76 | | fireplace | 76.82 | 87.4 | | refrigerator | 81.18 | 84.46 | | grandstand | 47.58 | 80.45 | | path | 19.36 | 30.73 | | stairs | 37.01 | 44.33 | | runway | 74.13 | 94.1 | | case | 46.79 | 59.28 | | pool table | 92.15 | 95.74 | | pillow | 61.67 | 73.37 | | screen door | 56.69 | 60.87 | | stairway | 33.64 | 40.66 | | river | 15.82 | 29.7 | | bridge | 69.72 | 77.28 | | bookcase | 42.01 | 56.22 | | blind | 39.8 | 44.99 | | coffee table | 63.51 | 76.94 | | toilet | 87.83 | 91.57 | | flower | 41.55 | 57.03 | | book | 48.24 | 68.11 | | hill | 14.95 | 23.88 | | bench | 60.66 | 70.8 | | countertop | 62.23 | 78.02 | | stove | 81.06 | 84.64 | | palm | 50.1 | 71.96 | | kitchen island | 41.45 | 58.12 | | computer | 77.83 | 88.71 | | swivel chair | 45.63 | 63.83 | | boat | 64.67 | 83.88 | | bar | 37.11 | 44.9 | | arcade machine | 83.26 | 89.75 | | hovel | 29.41 | 32.56 | | bus | 86.41 | 94.26 | | towel | 66.89 | 78.48 | | light | 54.27 | 65.61 | | truck | 47.67 | 61.19 | | tower | 27.55 | 46.1 | | chandelier | 65.52 | 75.09 | | awning | 32.89 | 43.23 | | streetlight | 33.77 | 46.09 | | booth | 45.53 | 56.34 | | television receiver | 74.19 | 86.66 | | airplane | 71.1 | 81.69 | | dirt track | 5.53 | 34.12 | | apparel | 32.34 | 48.27 | | pole | 28.23 | 40.62 | | land | 3.89 | 5.56 | | bannister | 14.67 | 19.81 | | escalator | 26.6 | 28.63 | | ottoman | 55.08 | 71.39 | | bottle | 29.49 | 38.12 | | buffet | 55.12 | 63.08 | | poster | 35.66 | 45.85 | | stage | 11.86 | 18.99 | | van | 44.58 | 59.25 | | ship | 8.84 | 9.73 | | fountain | 19.54 | 19.72 | | conveyer belt | 70.79 | 91.25 | | canopy | 35.17 | 46.05 | | washer | 67.92 | 72.45 | | plaything | 33.88 | 43.72 | | swimming pool | 69.62 | 71.65 | | stool | 44.89 | 63.62 | | barrel | 19.89 | 67.1 | | basket | 34.47 | 43.74 | | waterfall | 65.16 | 87.73 | | tent | 74.49 | 96.77 | | bag | 24.13 | 30.9 | | minibike | 73.21 | 83.32 | | cradle | 83.47 | 96.61 | | oven | 56.47 | 70.13 | | ball | 51.62 | 66.34 | | food | 54.02 | 58.91 | | step | 18.18 | 24.01 | | tank | 45.93 | 49.34 | | trade name | 23.49 | 27.92 | | microwave | 88.21 | 94.37 | | pot | 52.15 | 61.91 | | animal | 62.28 | 66.42 | | bicycle | 59.75 | 74.98 | | lake | 57.78 | 62.58 | | dishwasher | 74.75 | 86.91 | | screen | 69.55 | 86.73 | | blanket | 31.49 | 38.13 | | sculpture | 65.71 | 78.31 | | hood | 66.1 | 74.63 | | sconce | 45.09 | 54.66 | | vase | 44.15 | 62.82 | | traffic light | 42.74 | 65.15 | | tray | 19.74 | 31.84 | | ashcan | 39.65 | 54.03 | | fan | 63.97 | 76.88 | | pier | 47.45 | 52.77 | | crt screen | 8.28 | 22.22 | | plate | 58.79 | 77.11 | | monitor | 14.24 | 16.69 | | bulletin board | 54.67 | 62.68 | | shower | 3.2 | 3.4 | | radiator | 61.09 | 71.8 | | glass | 18.93 | 20.84 | | clock | 39.8 | 47.62 | | flag | 45.16 | 52.23 | +---------------------+-------+-------+ 2024/01/10 20:02:28 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.5180 coco/bbox_mAP_50: 0.7020 coco/bbox_mAP_75: 0.5680 coco/bbox_mAP_s: 0.3620 coco/bbox_mAP_m: 0.5660 coco/bbox_mAP_l: 0.6780 coco/segm_mAP: 0.3510 coco/segm_mAP_50: 0.6190 coco/segm_mAP_75: 0.3530 coco/segm_mAP_s: 0.2070 coco/segm_mAP_m: 0.3990 coco/segm_mAP_l: 0.5410 Bleu_1: 0.7705 Bleu_2: 0.6092 Bleu_3: 0.4666 Bleu_4: 0.3533 METEOR: 0.2801 ROUGE_L: 0.5701 CIDEr: 1.1583 SPICE: 0.2090 aAcc: 84.2000 mIoU: 52.1800 mAcc: 64.0200 visual-grounding/miou: 0.8292 visual-grounding/acc: 0.8860 data_time: 0.0114 time: 1.9011 2024/01/10 20:14:07 - mmengine - INFO - Iter(train) [480500/640000] base_lr: 2.9116e-05 lr: 2.9116e-06 eta: 2 days, 13:53:15 time: 1.3867 data_time: 0.0184 memory: 34657 grad_norm: 3.4317 loss: 1.0861 caption_loss_cls: 1.9363 detection_loss_cls: 0.0240 detection_loss_reg: 0.2999 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0618 instance_segmentation_loss_cls: 0.0238 instance_segmentation_loss_reg: 0.3029 instance_segmentation_loss_poly: 0.8028 2024/01/10 20:25:56 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 20:25:56 - mmengine - INFO - Iter(train) [481000/640000] base_lr: 2.8943e-05 lr: 2.8943e-06 eta: 2 days, 13:42:14 time: 1.3963 data_time: 0.0186 memory: 25565 grad_norm: 3.3957 loss: 1.0856 caption_loss_cls: 1.9375 detection_loss_cls: 0.0238 detection_loss_reg: 0.2986 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0622 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3042 instance_segmentation_loss_poly: 0.8048 2024/01/10 20:37:37 - mmengine - INFO - Iter(train) [481500/640000] base_lr: 2.8771e-05 lr: 2.8771e-06 eta: 2 days, 13:30:43 time: 1.3927 data_time: 0.0185 memory: 25565 grad_norm: 3.3998 loss: 1.0897 caption_loss_cls: 1.9372 detection_loss_cls: 0.0238 detection_loss_reg: 0.2983 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0660 instance_segmentation_loss_cls: 0.0238 instance_segmentation_loss_reg: 0.3035 instance_segmentation_loss_poly: 0.8026 2024/01/10 20:49:19 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 20:49:19 - mmengine - INFO - Iter(train) [482000/640000] base_lr: 2.8599e-05 lr: 2.8599e-06 eta: 2 days, 13:19:16 time: 1.3931 data_time: 0.0186 memory: 25565 grad_norm: 3.3753 loss: 1.1055 caption_loss_cls: 1.9410 detection_loss_cls: 0.0240 detection_loss_reg: 0.3001 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0652 instance_segmentation_loss_cls: 0.0238 instance_segmentation_loss_reg: 0.3042 instance_segmentation_loss_poly: 0.8041 2024/01/10 20:49:19 - mmengine - INFO - Saving checkpoint at 482000 iterations 2024/01/10 21:01:36 - mmengine - INFO - Iter(train) [482500/640000] base_lr: 2.8427e-05 lr: 2.8427e-06 eta: 2 days, 13:09:53 time: 1.4008 data_time: 0.0188 memory: 25565 grad_norm: 3.3983 loss: 1.1000 caption_loss_cls: 1.9422 detection_loss_cls: 0.0240 detection_loss_reg: 0.3005 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0637 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3067 instance_segmentation_loss_poly: 0.8098 2024/01/10 21:13:07 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 21:13:07 - mmengine - INFO - Iter(train) [483000/640000] base_lr: 2.8256e-05 lr: 2.8256e-06 eta: 2 days, 12:57:45 time: 1.4027 data_time: 0.0188 memory: 25565 grad_norm: 3.3939 loss: 1.0980 caption_loss_cls: 1.9430 detection_loss_cls: 0.0240 detection_loss_reg: 0.3016 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0634 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3071 instance_segmentation_loss_poly: 0.8106 2024/01/10 21:25:03 - mmengine - INFO - Iter(train) [483500/640000] base_lr: 2.8085e-05 lr: 2.8085e-06 eta: 2 days, 12:47:03 time: 1.4088 data_time: 0.0189 memory: 25565 grad_norm: 3.4204 loss: 1.1005 caption_loss_cls: 1.9395 detection_loss_cls: 0.0240 detection_loss_reg: 0.3009 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0619 instance_segmentation_loss_cls: 0.0241 instance_segmentation_loss_reg: 0.3084 instance_segmentation_loss_poly: 0.8138 2024/01/10 21:36:31 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 21:36:31 - mmengine - INFO - Iter(train) [484000/640000] base_lr: 2.7915e-05 lr: 2.7915e-06 eta: 2 days, 12:34:45 time: 1.4107 data_time: 0.0189 memory: 25565 grad_norm: 3.4028 loss: 1.0974 caption_loss_cls: 1.9400 detection_loss_cls: 0.0241 detection_loss_reg: 0.3013 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0583 instance_segmentation_loss_cls: 0.0241 instance_segmentation_loss_reg: 0.3082 instance_segmentation_loss_poly: 0.8132 2024/01/10 21:36:31 - mmengine - INFO - Saving checkpoint at 484000 iterations 2024/01/10 21:48:48 - mmengine - INFO - Iter(train) [484500/640000] base_lr: 2.7745e-05 lr: 2.7745e-06 eta: 2 days, 12:25:11 time: 1.4195 data_time: 0.0245 memory: 25565 grad_norm: 3.5325 loss: 1.0889 caption_loss_cls: 1.9380 detection_loss_cls: 0.0243 detection_loss_reg: 0.3027 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0575 instance_segmentation_loss_cls: 0.0241 instance_segmentation_loss_reg: 0.3096 instance_segmentation_loss_poly: 0.8162 2024/01/10 22:00:28 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 22:00:28 - mmengine - INFO - Iter(train) [485000/640000] base_lr: 2.7576e-05 lr: 2.7576e-06 eta: 2 days, 12:13:30 time: 1.4171 data_time: 0.0244 memory: 25565 grad_norm: 3.5252 loss: 1.0837 caption_loss_cls: 1.9424 detection_loss_cls: 0.0243 detection_loss_reg: 0.3036 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0600 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3084 instance_segmentation_loss_poly: 0.8133 2024/01/10 22:12:05 - mmengine - INFO - Iter(train) [485500/640000] base_lr: 2.7407e-05 lr: 2.7407e-06 eta: 2 days, 12:01:44 time: 1.4163 data_time: 0.0245 memory: 25565 grad_norm: 3.5399 loss: 1.0927 caption_loss_cls: 1.9387 detection_loss_cls: 0.0243 detection_loss_reg: 0.3027 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0612 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3073 instance_segmentation_loss_poly: 0.8103 2024/01/10 22:24:12 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 22:24:12 - mmengine - INFO - Iter(train) [486000/640000] base_lr: 2.7238e-05 lr: 2.7238e-06 eta: 2 days, 11:51:30 time: 1.4224 data_time: 0.0245 memory: 25565 grad_norm: 3.5260 loss: 1.0844 caption_loss_cls: 1.9328 detection_loss_cls: 0.0243 detection_loss_reg: 0.3034 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0584 instance_segmentation_loss_cls: 0.0241 instance_segmentation_loss_reg: 0.3082 instance_segmentation_loss_poly: 0.8141 2024/01/10 22:24:12 - mmengine - INFO - Saving checkpoint at 486000 iterations 2024/01/10 22:36:21 - mmengine - INFO - Iter(train) [486500/640000] base_lr: 2.7070e-05 lr: 2.7070e-06 eta: 2 days, 11:41:23 time: 1.4204 data_time: 0.0244 memory: 25565 grad_norm: 3.5091 loss: 1.0902 caption_loss_cls: 1.9333 detection_loss_cls: 0.0244 detection_loss_reg: 0.3040 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0586 instance_segmentation_loss_cls: 0.0240 instance_segmentation_loss_reg: 0.3083 instance_segmentation_loss_poly: 0.8144 2024/01/10 22:48:22 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 22:48:22 - mmengine - INFO - Iter(train) [487000/640000] base_lr: 2.6902e-05 lr: 2.6902e-06 eta: 2 days, 11:30:46 time: 1.4277 data_time: 0.0244 memory: 25565 grad_norm: 3.4713 loss: 1.0755 caption_loss_cls: 1.9311 detection_loss_cls: 0.0244 detection_loss_reg: 0.3043 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0582 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3066 instance_segmentation_loss_poly: 0.8115 2024/01/10 23:00:18 - mmengine - INFO - Iter(train) [487500/640000] base_lr: 2.6735e-05 lr: 2.6735e-06 eta: 2 days, 11:19:54 time: 1.4278 data_time: 0.0243 memory: 25565 grad_norm: 3.4362 loss: 1.0614 caption_loss_cls: 1.9273 detection_loss_cls: 0.0244 detection_loss_reg: 0.3036 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0597 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3060 instance_segmentation_loss_poly: 0.8104 2024/01/10 23:12:16 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 23:12:16 - mmengine - INFO - Iter(train) [488000/640000] base_lr: 2.6568e-05 lr: 2.6568e-06 eta: 2 days, 11:09:07 time: 1.4353 data_time: 0.0244 memory: 25565 grad_norm: 3.3998 loss: 1.0596 caption_loss_cls: 1.9293 detection_loss_cls: 0.0244 detection_loss_reg: 0.3027 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0597 instance_segmentation_loss_cls: 0.0238 instance_segmentation_loss_reg: 0.3046 instance_segmentation_loss_poly: 0.8083 2024/01/10 23:12:16 - mmengine - INFO - Saving checkpoint at 488000 iterations 2024/01/10 23:24:28 - mmengine - INFO - Iter(train) [488500/640000] base_lr: 2.6402e-05 lr: 2.6402e-06 eta: 2 days, 10:59:00 time: 1.4342 data_time: 0.0244 memory: 25565 grad_norm: 3.2611 loss: 1.0592 caption_loss_cls: 1.9282 detection_loss_cls: 0.0245 detection_loss_reg: 0.3027 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0577 instance_segmentation_loss_cls: 0.0238 instance_segmentation_loss_reg: 0.3056 instance_segmentation_loss_poly: 0.8094 2024/01/10 23:35:41 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 23:35:41 - mmengine - INFO - Iter(train) [489000/640000] base_lr: 2.6236e-05 lr: 2.6236e-06 eta: 2 days, 10:45:54 time: 1.4274 data_time: 0.0244 memory: 25565 grad_norm: 3.2770 loss: 1.0732 caption_loss_cls: 1.9296 detection_loss_cls: 0.0247 detection_loss_reg: 0.3046 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0600 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3064 instance_segmentation_loss_poly: 0.8105 2024/01/10 23:47:15 - mmengine - INFO - Iter(train) [489500/640000] base_lr: 2.6070e-05 lr: 2.6070e-06 eta: 2 days, 10:33:55 time: 1.4266 data_time: 0.0244 memory: 25565 grad_norm: 3.3038 loss: 1.0635 caption_loss_cls: 1.9251 detection_loss_cls: 0.0247 detection_loss_reg: 0.3035 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0592 instance_segmentation_loss_cls: 0.0238 instance_segmentation_loss_reg: 0.3061 instance_segmentation_loss_poly: 0.8091 2024/01/10 23:58:55 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/10 23:58:55 - mmengine - INFO - Iter(train) [490000/640000] base_lr: 2.5905e-05 lr: 2.5905e-06 eta: 2 days, 10:22:12 time: 1.4200 data_time: 0.0242 memory: 25565 grad_norm: 3.3118 loss: 1.0632 caption_loss_cls: 1.9240 detection_loss_cls: 0.0248 detection_loss_reg: 0.3053 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0595 instance_segmentation_loss_cls: 0.0238 instance_segmentation_loss_reg: 0.3052 instance_segmentation_loss_poly: 0.8074 2024/01/10 23:58:55 - mmengine - INFO - Saving checkpoint at 490000 iterations 2024/01/11 00:10:46 - mmengine - INFO - Iter(train) [490500/640000] base_lr: 2.5741e-05 lr: 2.5741e-06 eta: 2 days, 10:10:58 time: 1.4152 data_time: 0.0244 memory: 25565 grad_norm: 3.2996 loss: 1.0558 caption_loss_cls: 1.9235 detection_loss_cls: 0.0248 detection_loss_reg: 0.3048 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0585 instance_segmentation_loss_cls: 0.0237 instance_segmentation_loss_reg: 0.3057 instance_segmentation_loss_poly: 0.8079 2024/01/11 00:22:43 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/11 00:22:43 - mmengine - INFO - Iter(train) [491000/640000] base_lr: 2.5576e-05 lr: 2.5576e-06 eta: 2 days, 10:00:04 time: 1.4145 data_time: 0.0244 memory: 25565 grad_norm: 3.3047 loss: 1.0602 caption_loss_cls: 1.9247 detection_loss_cls: 0.0248 detection_loss_reg: 0.3051 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0591 instance_segmentation_loss_cls: 0.0237 instance_segmentation_loss_reg: 0.3048 instance_segmentation_loss_poly: 0.8051 2024/01/11 00:34:24 - mmengine - INFO - Iter(train) [491500/640000] base_lr: 2.5413e-05 lr: 2.5413e-06 eta: 2 days, 9:48:24 time: 1.4108 data_time: 0.0244 memory: 25565 grad_norm: 3.3437 loss: 1.0666 caption_loss_cls: 1.9212 detection_loss_cls: 0.0249 detection_loss_reg: 0.3069 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0616 instance_segmentation_loss_cls: 0.0237 instance_segmentation_loss_reg: 0.3038 instance_segmentation_loss_poly: 0.8024 2024/01/11 00:46:10 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/11 00:46:10 - mmengine - INFO - Iter(train) [492000/640000] base_lr: 2.5249e-05 lr: 2.5249e-06 eta: 2 days, 9:36:55 time: 1.4076 data_time: 0.0243 memory: 25565 grad_norm: 3.3356 loss: 1.0706 caption_loss_cls: 1.9299 detection_loss_cls: 0.0248 detection_loss_reg: 0.3063 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0574 instance_segmentation_loss_cls: 0.0237 instance_segmentation_loss_reg: 0.3036 instance_segmentation_loss_poly: 0.8026 2024/01/11 00:46:10 - mmengine - INFO - Saving checkpoint at 492000 iterations 2024/01/11 00:58:31 - mmengine - INFO - Iter(train) [492500/640000] base_lr: 2.5087e-05 lr: 2.5087e-06 eta: 2 days, 9:27:02 time: 1.4097 data_time: 0.0242 memory: 25565 grad_norm: 3.3092 loss: 1.0581 caption_loss_cls: 1.9249 detection_loss_cls: 0.0247 detection_loss_reg: 0.3056 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0552 instance_segmentation_loss_cls: 0.0235 instance_segmentation_loss_reg: 0.3016 instance_segmentation_loss_poly: 0.7973 2024/01/11 01:10:03 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/11 01:10:03 - mmengine - INFO - Iter(train) [493000/640000] base_lr: 2.4924e-05 lr: 2.4924e-06 eta: 2 days, 9:14:55 time: 1.4145 data_time: 0.0242 memory: 25565 grad_norm: 3.3286 loss: 1.0389 caption_loss_cls: 1.9242 detection_loss_cls: 0.0245 detection_loss_reg: 0.3051 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0527 instance_segmentation_loss_cls: 0.0235 instance_segmentation_loss_reg: 0.3013 instance_segmentation_loss_poly: 0.7966 2024/01/11 01:21:40 - mmengine - INFO - Iter(train) [493500/640000] base_lr: 2.4762e-05 lr: 2.4762e-06 eta: 2 days, 9:03:03 time: 1.4153 data_time: 0.0242 memory: 25565 grad_norm: 3.2915 loss: 1.0476 caption_loss_cls: 1.9245 detection_loss_cls: 0.0246 detection_loss_reg: 0.3061 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0521 instance_segmentation_loss_cls: 0.0235 instance_segmentation_loss_reg: 0.3000 instance_segmentation_loss_poly: 0.7937 2024/01/11 01:33:35 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/11 01:33:35 - mmengine - INFO - Iter(train) [494000/640000] base_lr: 2.4601e-05 lr: 2.4601e-06 eta: 2 days, 8:51:58 time: 1.4190 data_time: 0.0242 memory: 25565 grad_norm: 3.3229 loss: 1.0478 caption_loss_cls: 1.9261 detection_loss_cls: 0.0246 detection_loss_reg: 0.3076 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0517 instance_segmentation_loss_cls: 0.0233 instance_segmentation_loss_reg: 0.2983 instance_segmentation_loss_poly: 0.7889 2024/01/11 01:33:35 - mmengine - INFO - Saving checkpoint at 494000 iterations 2024/01/11 01:45:19 - mmengine - INFO - Iter(train) [494500/640000] base_lr: 2.4440e-05 lr: 2.4440e-06 eta: 2 days, 8:40:22 time: 1.4174 data_time: 0.0243 memory: 25565 grad_norm: 3.3675 loss: 1.0664 caption_loss_cls: 1.9209 detection_loss_cls: 0.0248 detection_loss_reg: 0.3077 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0544 instance_segmentation_loss_cls: 0.0232 instance_segmentation_loss_reg: 0.2984 instance_segmentation_loss_poly: 0.7886 2024/01/11 01:56:53 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/11 01:56:53 - mmengine - INFO - Iter(train) [495000/640000] base_lr: 2.4279e-05 lr: 2.4279e-06 eta: 2 days, 8:28:22 time: 1.4115 data_time: 0.0242 memory: 25565 grad_norm: 3.3681 loss: 1.0660 caption_loss_cls: 1.9206 detection_loss_cls: 0.0248 detection_loss_reg: 0.3087 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0578 instance_segmentation_loss_cls: 0.0232 instance_segmentation_loss_reg: 0.2980 instance_segmentation_loss_poly: 0.7890 2024/01/11 02:08:26 - mmengine - INFO - Iter(train) [495500/640000] base_lr: 2.4119e-05 lr: 2.4119e-06 eta: 2 days, 8:16:20 time: 1.4095 data_time: 0.0242 memory: 25565 grad_norm: 3.3441 loss: 1.0718 caption_loss_cls: 1.9170 detection_loss_cls: 0.0248 detection_loss_reg: 0.3092 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0536 instance_segmentation_loss_cls: 0.0231 instance_segmentation_loss_reg: 0.2977 instance_segmentation_loss_poly: 0.7884 2024/01/11 02:20:07 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240110_023224 2024/01/11 02:20:07 - mmengine - INFO - Iter(train) [496000/640000] base_lr: 2.3960e-05 lr: 2.3960e-06 eta: 2 days, 8:04:37 time: 1.4083 data_time: 0.0243 memory: 25565 grad_norm: 3.3715 loss: 1.0770 caption_loss_cls: 1.9188 detection_loss_cls: 0.0247 detection_loss_reg: 0.3087 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0486 instance_segmentation_loss_cls: 0.0232 instance_segmentation_loss_reg: 0.2974 instance_segmentation_loss_poly: 0.7887 2024/01/11 02:20:07 - mmengine - INFO - Saving checkpoint at 496000 iterations 2024/01/11 02:31:52 - mmengine - INFO - Iter(train) [496500/640000] base_lr: 2.3801e-05 lr: 2.3801e-06 eta: 2 days, 7:53:06 time: 1.3995 data_time: 0.0242 memory: 25565 grad_norm: 3.4598 loss: 1.0958 caption_loss_cls: 1.9153 detection_loss_cls: 0.0244 detection_loss_reg: 0.3063 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0502 instance_segmentation_loss_cls: 0.0232 instance_segmentation_loss_reg: 0.2971 instance_segmentation_loss_poly: 0.7883 2024/01/11 05:43:11 - mmengine - INFO - Iter(train) [497000/640000] base_lr: 2.3642e-05 lr: 2.3642e-06 eta: 2 days, 6:22:19 time: 1.3925 data_time: 0.0180 memory: 25630 grad_norm: 3.3466 loss: 1.1066 caption_loss_cls: 1.9202 detection_loss_cls: 0.0246 detection_loss_reg: 0.3079 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0468 instance_segmentation_loss_cls: 0.0232 instance_segmentation_loss_reg: 0.2987 instance_segmentation_loss_poly: 0.7924 2024/01/11 05:54:46 - mmengine - INFO - Iter(train) [497500/640000] base_lr: 2.3484e-05 lr: 2.3484e-06 eta: 2 days, 6:26:11 time: 1.3918 data_time: 0.0178 memory: 25630 grad_norm: 3.3529 loss: 1.0969 caption_loss_cls: 1.9102 detection_loss_cls: 0.0248 detection_loss_reg: 0.3084 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0452 instance_segmentation_loss_cls: 0.0233 instance_segmentation_loss_reg: 0.2991 instance_segmentation_loss_poly: 0.7938 2024/01/11 06:06:40 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 06:06:40 - mmengine - INFO - Iter(train) [498000/640000] base_lr: 2.3326e-05 lr: 2.3326e-06 eta: 2 days, 6:46:01 time: 1.3917 data_time: 0.0178 memory: 25630 grad_norm: 3.3684 loss: 1.1050 caption_loss_cls: 1.9123 detection_loss_cls: 0.0249 detection_loss_reg: 0.3092 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0452 instance_segmentation_loss_cls: 0.0233 instance_segmentation_loss_reg: 0.3001 instance_segmentation_loss_poly: 0.7959 2024/01/11 06:06:40 - mmengine - INFO - Saving checkpoint at 498000 iterations 2024/01/11 06:18:45 - mmengine - INFO - Iter(train) [498500/640000] base_lr: 2.3168e-05 lr: 2.3168e-06 eta: 2 days, 7:02:33 time: 1.3969 data_time: 0.0177 memory: 25630 grad_norm: 3.3620 loss: 1.0988 caption_loss_cls: 1.9132 detection_loss_cls: 0.0248 detection_loss_reg: 0.3081 semantic_segmentation_loss_cls: 0.0065 grounding_loss_reg: 2.0424 instance_segmentation_loss_cls: 0.0234 instance_segmentation_loss_reg: 0.3014 instance_segmentation_loss_poly: 0.7978 2024/01/11 06:30:35 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 06:30:35 - mmengine - INFO - Iter(train) [499000/640000] base_lr: 2.3012e-05 lr: 2.3012e-06 eta: 2 days, 6:58:43 time: 1.4010 data_time: 0.0177 memory: 25630 grad_norm: 3.3727 loss: 1.1084 caption_loss_cls: 1.9131 detection_loss_cls: 0.0249 detection_loss_reg: 0.3094 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 2.0445 instance_segmentation_loss_cls: 0.0234 instance_segmentation_loss_reg: 0.3017 instance_segmentation_loss_poly: 0.7988 2024/01/11 06:42:26 - mmengine - INFO - Iter(train) [499500/640000] base_lr: 2.2855e-05 lr: 2.2855e-06 eta: 2 days, 6:52:34 time: 1.4054 data_time: 0.0176 memory: 25630 grad_norm: 3.4559 loss: 1.1091 caption_loss_cls: 1.9133 detection_loss_cls: 0.0246 detection_loss_reg: 0.3069 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 2.0444 instance_segmentation_loss_cls: 0.0234 instance_segmentation_loss_reg: 0.3013 instance_segmentation_loss_poly: 0.7970 2024/01/11 06:54:28 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 06:54:28 - mmengine - INFO - Iter(train) [500000/640000] base_lr: 2.2699e-05 lr: 2.2699e-06 eta: 2 days, 6:51:24 time: 1.4106 data_time: 0.0174 memory: 25630 grad_norm: 3.4281 loss: 1.0923 caption_loss_cls: 1.9131 detection_loss_cls: 0.0246 detection_loss_reg: 0.3066 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 2.0432 instance_segmentation_loss_cls: 0.0235 instance_segmentation_loss_reg: 0.3025 instance_segmentation_loss_poly: 0.8003 2024/01/11 06:54:28 - mmengine - INFO - Saving checkpoint at 500000 iterations 2024/01/11 07:05:53 - mmengine - INFO - Evaluating bbox... 2024/01/11 07:06:50 - mmengine - INFO - bbox_mAP_copypaste: 0.524 0.704 0.573 0.379 0.571 0.677 2024/01/11 07:06:50 - mmengine - INFO - Evaluating segm... 2024/01/11 07:08:03 - mmengine - INFO - segm_mAP_copypaste: 0.354 0.623 0.354 0.216 0.399 0.541 2024/01/11 07:10:13 - mmengine - INFO - Evaluating bbox... 2024/01/11 07:11:12 - mmengine - INFO - bbox_mAP_copypaste: 0.524 0.705 0.572 0.374 0.570 0.677 2024/01/11 07:17:28 - mmengine - INFO - per class results: 2024/01/11 07:17:28 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 79.58 | 89.4 | | building | 82.91 | 92.66 | | sky | 93.66 | 97.93 | | floor | 82.53 | 91.14 | | tree | 74.39 | 87.09 | | ceiling | 85.83 | 94.85 | | road | 84.2 | 89.6 | | bed | 90.63 | 95.94 | | windowpane | 63.53 | 81.47 | | grass | 68.96 | 84.73 | | cabinet | 63.48 | 74.87 | | sidewalk | 68.95 | 81.23 | | person | 82.31 | 92.1 | | earth | 39.27 | 50.68 | | door | 55.04 | 71.08 | | table | 64.58 | 80.21 | | mountain | 62.98 | 78.36 | | plant | 54.31 | 64.92 | | curtain | 76.0 | 86.31 | | chair | 64.12 | 77.53 | | car | 85.29 | 92.69 | | water | 60.18 | 73.61 | | painting | 74.64 | 87.92 | | sofa | 72.9 | 84.04 | | shelf | 45.7 | 67.38 | | house | 46.16 | 58.18 | | sea | 61.59 | 78.41 | | mirror | 69.99 | 77.73 | | rug | 63.99 | 72.27 | | field | 35.95 | 53.54 | | armchair | 53.86 | 71.96 | | seat | 67.14 | 83.02 | | fence | 47.37 | 63.97 | | desk | 55.06 | 69.95 | | rock | 56.1 | 78.97 | | wardrobe | 44.12 | 66.2 | | lamp | 67.49 | 80.0 | | bathtub | 76.9 | 88.13 | | railing | 43.04 | 61.28 | | cushion | 63.24 | 77.37 | | base | 19.78 | 25.82 | | box | 30.2 | 38.06 | | column | 55.59 | 63.66 | | signboard | 39.52 | 53.53 | | chest of drawers | 39.04 | 57.95 | | counter | 30.99 | 47.61 | | sand | 45.64 | 69.78 | | sink | 78.27 | 86.11 | | skyscraper | 47.17 | 59.61 | | fireplace | 76.14 | 89.14 | | refrigerator | 81.88 | 86.47 | | grandstand | 47.67 | 83.54 | | path | 21.37 | 31.72 | | stairs | 34.76 | 41.74 | | runway | 72.99 | 94.42 | | case | 46.39 | 58.86 | | pool table | 92.01 | 96.0 | | pillow | 61.58 | 71.91 | | screen door | 58.02 | 59.22 | | stairway | 32.88 | 42.2 | | river | 12.82 | 27.44 | | bridge | 66.98 | 76.01 | | bookcase | 43.71 | 65.15 | | blind | 40.33 | 43.5 | | coffee table | 66.13 | 81.46 | | toilet | 87.71 | 91.94 | | flower | 42.98 | 57.35 | | book | 47.26 | 65.33 | | hill | 15.95 | 22.79 | | bench | 63.73 | 77.46 | | countertop | 62.8 | 78.29 | | stove | 80.87 | 84.62 | | palm | 48.23 | 73.15 | | kitchen island | 50.0 | 73.68 | | computer | 77.33 | 88.31 | | swivel chair | 45.01 | 57.61 | | boat | 65.04 | 85.89 | | bar | 35.68 | 44.96 | | arcade machine | 65.21 | 68.8 | | hovel | 33.68 | 36.71 | | bus | 86.64 | 95.38 | | towel | 65.66 | 78.72 | | light | 52.47 | 63.44 | | truck | 49.06 | 66.38 | | tower | 25.95 | 40.53 | | chandelier | 67.37 | 77.69 | | awning | 29.83 | 39.25 | | streetlight | 33.85 | 44.16 | | booth | 44.7 | 59.52 | | television receiver | 70.32 | 85.8 | | airplane | 67.72 | 75.03 | | dirt track | 5.37 | 30.43 | | apparel | 32.1 | 45.21 | | pole | 28.05 | 41.6 | | land | 2.57 | 3.84 | | bannister | 14.95 | 19.92 | | escalator | 24.49 | 26.09 | | ottoman | 55.37 | 72.07 | | bottle | 26.58 | 34.18 | | buffet | 56.99 | 66.38 | | poster | 32.71 | 48.41 | | stage | 11.22 | 14.72 | | van | 45.99 | 61.07 | | ship | 23.26 | 25.2 | | fountain | 21.85 | 22.15 | | conveyer belt | 78.04 | 91.63 | | canopy | 39.04 | 46.74 | | washer | 68.36 | 71.8 | | plaything | 31.27 | 39.77 | | swimming pool | 66.44 | 68.36 | | stool | 45.78 | 64.96 | | barrel | 20.94 | 78.62 | | basket | 34.64 | 44.41 | | waterfall | 71.68 | 88.7 | | tent | 77.01 | 97.37 | | bag | 24.19 | 32.13 | | minibike | 74.2 | 83.7 | | cradle | 86.4 | 95.48 | | oven | 51.09 | 62.4 | | ball | 51.61 | 64.66 | | food | 54.65 | 60.4 | | step | 9.33 | 12.29 | | tank | 46.59 | 50.09 | | trade name | 27.25 | 34.0 | | microwave | 85.92 | 94.59 | | pot | 51.72 | 62.62 | | animal | 60.68 | 64.01 | | bicycle | 59.47 | 75.15 | | lake | 56.31 | 67.41 | | dishwasher | 77.29 | 87.07 | | screen | 63.1 | 81.75 | | blanket | 28.59 | 34.71 | | sculpture | 62.73 | 78.29 | | hood | 65.08 | 71.36 | | sconce | 46.23 | 55.71 | | vase | 44.58 | 62.72 | | traffic light | 41.25 | 60.7 | | tray | 18.68 | 28.07 | | ashcan | 42.02 | 55.39 | | fan | 62.73 | 73.02 | | pier | 48.12 | 53.6 | | crt screen | 5.89 | 16.47 | | plate | 58.73 | 75.74 | | monitor | 7.52 | 8.5 | | bulletin board | 52.85 | 59.98 | | shower | 4.51 | 5.83 | | radiator | 61.71 | 71.68 | | glass | 18.87 | 21.37 | | clock | 38.84 | 47.68 | | flag | 44.86 | 51.54 | +---------------------+-------+-------+ 2024/01/11 07:17:41 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.5240 coco/bbox_mAP_50: 0.7050 coco/bbox_mAP_75: 0.5720 coco/bbox_mAP_s: 0.3740 coco/bbox_mAP_m: 0.5700 coco/bbox_mAP_l: 0.6770 coco/segm_mAP: 0.3540 coco/segm_mAP_50: 0.6230 coco/segm_mAP_75: 0.3540 coco/segm_mAP_s: 0.2160 coco/segm_mAP_m: 0.3990 coco/segm_mAP_l: 0.5410 Bleu_1: 0.7689 Bleu_2: 0.6077 Bleu_3: 0.4667 Bleu_4: 0.3539 METEOR: 0.2807 ROUGE_L: 0.5686 CIDEr: 1.1591 SPICE: 0.2095 aAcc: 84.4300 mIoU: 52.2200 mAcc: 63.9700 visual-grounding/miou: 0.8339 visual-grounding/acc: 0.8909 data_time: 0.0267 time: 1.9222 2024/01/11 07:28:50 - mmengine - INFO - Iter(train) [500500/640000] base_lr: 2.2544e-05 lr: 2.2544e-06 eta: 2 days, 6:22:01 time: 1.4058 data_time: 0.0181 memory: 34723 grad_norm: 3.4749 loss: 1.0907 caption_loss_cls: 1.9151 detection_loss_cls: 0.0246 detection_loss_reg: 0.3058 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 2.0396 instance_segmentation_loss_cls: 0.0236 instance_segmentation_loss_reg: 0.3028 instance_segmentation_loss_poly: 0.8017 2024/01/11 07:40:42 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 07:40:42 - mmengine - INFO - Iter(train) [501000/640000] base_lr: 2.2389e-05 lr: 2.2389e-06 eta: 2 days, 6:14:38 time: 1.4139 data_time: 0.0184 memory: 25632 grad_norm: 3.4361 loss: 1.0729 caption_loss_cls: 1.9100 detection_loss_cls: 0.0246 detection_loss_reg: 0.3056 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 2.0379 instance_segmentation_loss_cls: 0.0236 instance_segmentation_loss_reg: 0.3025 instance_segmentation_loss_poly: 0.8009 2024/01/11 07:53:08 - mmengine - INFO - Iter(train) [501500/640000] base_lr: 2.2234e-05 lr: 2.2234e-06 eta: 2 days, 6:21:15 time: 1.4269 data_time: 0.0188 memory: 25632 grad_norm: 3.4276 loss: 1.0789 caption_loss_cls: 1.9042 detection_loss_cls: 0.0247 detection_loss_reg: 0.3071 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 2.0363 instance_segmentation_loss_cls: 0.0235 instance_segmentation_loss_reg: 0.3021 instance_segmentation_loss_poly: 0.7995 2024/01/11 08:04:29 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 08:04:29 - mmengine - INFO - Iter(train) [502000/640000] base_lr: 2.2080e-05 lr: 2.2080e-06 eta: 2 days, 5:59:31 time: 1.4185 data_time: 0.0190 memory: 25632 grad_norm: 3.4078 loss: 1.0700 caption_loss_cls: 1.9005 detection_loss_cls: 0.0245 detection_loss_reg: 0.3060 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 2.0335 instance_segmentation_loss_cls: 0.0235 instance_segmentation_loss_reg: 0.3019 instance_segmentation_loss_poly: 0.7981 2024/01/11 08:04:29 - mmengine - INFO - Saving checkpoint at 502000 iterations 2024/01/11 08:16:30 - mmengine - INFO - Iter(train) [502500/640000] base_lr: 2.1927e-05 lr: 2.1927e-06 eta: 2 days, 5:53:23 time: 1.4175 data_time: 0.0204 memory: 25632 grad_norm: 3.3745 loss: 1.0622 caption_loss_cls: 1.8966 detection_loss_cls: 0.0244 detection_loss_reg: 0.3059 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 2.0323 instance_segmentation_loss_cls: 0.0236 instance_segmentation_loss_reg: 0.3016 instance_segmentation_loss_poly: 0.7972 2024/01/11 08:28:19 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 08:28:19 - mmengine - INFO - Iter(train) [503000/640000] base_lr: 2.1773e-05 lr: 2.1773e-06 eta: 2 days, 5:42:41 time: 1.4172 data_time: 0.0206 memory: 25632 grad_norm: 3.3869 loss: 1.0584 caption_loss_cls: 1.8945 detection_loss_cls: 0.0245 detection_loss_reg: 0.3066 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 2.0280 instance_segmentation_loss_cls: 0.0237 instance_segmentation_loss_reg: 0.3032 instance_segmentation_loss_poly: 0.8000 2024/01/11 08:39:33 - mmengine - INFO - Iter(train) [503500/640000] base_lr: 2.1621e-05 lr: 2.1621e-06 eta: 2 days, 5:21:16 time: 1.4081 data_time: 0.0208 memory: 25632 grad_norm: 3.3174 loss: 1.0634 caption_loss_cls: 1.8934 detection_loss_cls: 0.0246 detection_loss_reg: 0.3086 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 2.0273 instance_segmentation_loss_cls: 0.0237 instance_segmentation_loss_reg: 0.3028 instance_segmentation_loss_poly: 0.8000 2024/01/11 08:51:16 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 08:51:16 - mmengine - INFO - Iter(train) [504000/640000] base_lr: 2.1469e-05 lr: 2.1469e-06 eta: 2 days, 5:09:04 time: 1.4033 data_time: 0.0211 memory: 25632 grad_norm: 3.3405 loss: 1.0800 caption_loss_cls: 1.8996 detection_loss_cls: 0.0246 detection_loss_reg: 0.3092 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 2.0277 instance_segmentation_loss_cls: 0.0237 instance_segmentation_loss_reg: 0.3044 instance_segmentation_loss_poly: 0.8024 2024/01/11 08:51:16 - mmengine - INFO - Saving checkpoint at 504000 iterations 2024/01/11 09:03:16 - mmengine - INFO - Iter(train) [504500/640000] base_lr: 2.1317e-05 lr: 2.1317e-06 eta: 2 days, 5:01:41 time: 1.4154 data_time: 0.0277 memory: 25632 grad_norm: 3.3494 loss: 1.0855 caption_loss_cls: 1.8961 detection_loss_cls: 0.0245 detection_loss_reg: 0.3093 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 2.0212 instance_segmentation_loss_cls: 0.0238 instance_segmentation_loss_reg: 0.3057 instance_segmentation_loss_poly: 0.8052 2024/01/11 09:14:53 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 09:14:53 - mmengine - INFO - Iter(train) [505000/640000] base_lr: 2.1166e-05 lr: 2.1166e-06 eta: 2 days, 4:47:53 time: 1.4118 data_time: 0.0277 memory: 25632 grad_norm: 3.3817 loss: 1.0972 caption_loss_cls: 1.8973 detection_loss_cls: 0.0246 detection_loss_reg: 0.3088 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 2.0168 instance_segmentation_loss_cls: 0.0238 instance_segmentation_loss_reg: 0.3058 instance_segmentation_loss_poly: 0.8068 2024/01/11 09:26:32 - mmengine - INFO - Iter(train) [505500/640000] base_lr: 2.1015e-05 lr: 2.1015e-06 eta: 2 days, 4:35:00 time: 1.4000 data_time: 0.0274 memory: 25632 grad_norm: 3.3840 loss: 1.0980 caption_loss_cls: 1.8991 detection_loss_cls: 0.0247 detection_loss_reg: 0.3125 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 2.0147 instance_segmentation_loss_cls: 0.0238 instance_segmentation_loss_reg: 0.3059 instance_segmentation_loss_poly: 0.8068 2024/01/11 09:38:13 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 09:38:13 - mmengine - INFO - Iter(train) [506000/640000] base_lr: 2.0865e-05 lr: 2.0865e-06 eta: 2 days, 4:22:37 time: 1.4051 data_time: 0.0275 memory: 25632 grad_norm: 3.4155 loss: 1.1085 caption_loss_cls: 1.9043 detection_loss_cls: 0.0246 detection_loss_reg: 0.3110 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 2.0150 instance_segmentation_loss_cls: 0.0237 instance_segmentation_loss_reg: 0.3056 instance_segmentation_loss_poly: 0.8054 2024/01/11 09:38:13 - mmengine - INFO - Saving checkpoint at 506000 iterations 2024/01/11 09:50:29 - mmengine - INFO - Iter(train) [506500/640000] base_lr: 2.0715e-05 lr: 2.0715e-06 eta: 2 days, 4:17:32 time: 1.4088 data_time: 0.0271 memory: 25632 grad_norm: 3.3642 loss: 1.0973 caption_loss_cls: 1.9067 detection_loss_cls: 0.0247 detection_loss_reg: 0.3120 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 2.0122 instance_segmentation_loss_cls: 0.0238 instance_segmentation_loss_reg: 0.3064 instance_segmentation_loss_poly: 0.8059 2024/01/11 10:02:02 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 10:02:02 - mmengine - INFO - Iter(train) [507000/640000] base_lr: 2.0565e-05 lr: 2.0565e-06 eta: 2 days, 4:03:22 time: 1.4049 data_time: 0.0270 memory: 25632 grad_norm: 3.3341 loss: 1.0965 caption_loss_cls: 1.9124 detection_loss_cls: 0.0246 detection_loss_reg: 0.3123 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 2.0083 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3070 instance_segmentation_loss_poly: 0.8064 2024/01/11 10:13:16 - mmengine - INFO - Iter(train) [507500/640000] base_lr: 2.0417e-05 lr: 2.0417e-06 eta: 2 days, 3:45:32 time: 1.4047 data_time: 0.0269 memory: 25632 grad_norm: 3.3663 loss: 1.0934 caption_loss_cls: 1.9130 detection_loss_cls: 0.0246 detection_loss_reg: 0.3119 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 2.0076 instance_segmentation_loss_cls: 0.0238 instance_segmentation_loss_reg: 0.3064 instance_segmentation_loss_poly: 0.8036 2024/01/11 10:25:03 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 10:25:03 - mmengine - INFO - Iter(train) [508000/640000] base_lr: 2.0268e-05 lr: 2.0268e-06 eta: 2 days, 3:34:30 time: 1.4059 data_time: 0.0269 memory: 25632 grad_norm: 3.3742 loss: 1.0840 caption_loss_cls: 1.9144 detection_loss_cls: 0.0247 detection_loss_reg: 0.3127 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 2.0046 instance_segmentation_loss_cls: 0.0238 instance_segmentation_loss_reg: 0.3070 instance_segmentation_loss_poly: 0.8044 2024/01/11 10:25:03 - mmengine - INFO - Saving checkpoint at 508000 iterations 2024/01/11 10:36:34 - mmengine - INFO - Iter(train) [508500/640000] base_lr: 2.0120e-05 lr: 2.0120e-06 eta: 2 days, 3:20:34 time: 1.3986 data_time: 0.0267 memory: 25632 grad_norm: 3.3776 loss: 1.0841 caption_loss_cls: 1.9170 detection_loss_cls: 0.0245 detection_loss_reg: 0.3122 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 2.0047 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3070 instance_segmentation_loss_poly: 0.8054 2024/01/11 10:48:23 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 10:48:23 - mmengine - INFO - Iter(train) [509000/640000] base_lr: 1.9973e-05 lr: 1.9973e-06 eta: 2 days, 3:09:45 time: 1.4016 data_time: 0.0267 memory: 25632 grad_norm: 3.3293 loss: 1.0646 caption_loss_cls: 1.9131 detection_loss_cls: 0.0243 detection_loss_reg: 0.3098 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9984 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3057 instance_segmentation_loss_poly: 0.8033 2024/01/11 10:59:49 - mmengine - INFO - Iter(train) [509500/640000] base_lr: 1.9826e-05 lr: 1.9826e-06 eta: 2 days, 2:55:12 time: 1.3982 data_time: 0.0267 memory: 25632 grad_norm: 3.3488 loss: 1.0631 caption_loss_cls: 1.9148 detection_loss_cls: 0.0244 detection_loss_reg: 0.3106 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9994 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3067 instance_segmentation_loss_poly: 0.8052 2024/01/11 11:11:11 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 11:11:11 - mmengine - INFO - Iter(train) [510000/640000] base_lr: 1.9680e-05 lr: 1.9680e-06 eta: 2 days, 2:40:22 time: 1.3935 data_time: 0.0266 memory: 25632 grad_norm: 3.3370 loss: 1.0650 caption_loss_cls: 1.9166 detection_loss_cls: 0.0243 detection_loss_reg: 0.3090 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9950 instance_segmentation_loss_cls: 0.0241 instance_segmentation_loss_reg: 0.3080 instance_segmentation_loss_poly: 0.8088 2024/01/11 11:11:11 - mmengine - INFO - Saving checkpoint at 510000 iterations 2024/01/11 11:23:21 - mmengine - INFO - Iter(train) [510500/640000] base_lr: 1.9534e-05 lr: 1.9534e-06 eta: 2 days, 2:32:48 time: 1.3921 data_time: 0.0267 memory: 25632 grad_norm: 3.3663 loss: 1.0724 caption_loss_cls: 1.9123 detection_loss_cls: 0.0242 detection_loss_reg: 0.3088 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9945 instance_segmentation_loss_cls: 0.0242 instance_segmentation_loss_reg: 0.3087 instance_segmentation_loss_poly: 0.8106 2024/01/11 11:35:13 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 11:35:13 - mmengine - INFO - Iter(train) [511000/640000] base_lr: 1.9388e-05 lr: 1.9388e-06 eta: 2 days, 2:22:20 time: 1.3967 data_time: 0.0268 memory: 25632 grad_norm: 3.4039 loss: 1.0755 caption_loss_cls: 1.9135 detection_loss_cls: 0.0242 detection_loss_reg: 0.3091 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9930 instance_segmentation_loss_cls: 0.0244 instance_segmentation_loss_reg: 0.3119 instance_segmentation_loss_poly: 0.8183 2024/01/11 11:46:39 - mmengine - INFO - Iter(train) [511500/640000] base_lr: 1.9243e-05 lr: 1.9243e-06 eta: 2 days, 2:08:13 time: 1.3998 data_time: 0.0268 memory: 25632 grad_norm: 3.3593 loss: 1.0666 caption_loss_cls: 1.9096 detection_loss_cls: 0.0242 detection_loss_reg: 0.3075 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9910 instance_segmentation_loss_cls: 0.0244 instance_segmentation_loss_reg: 0.3130 instance_segmentation_loss_poly: 0.8207 2024/01/11 11:58:32 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 11:58:32 - mmengine - INFO - Iter(train) [512000/640000] base_lr: 1.9099e-05 lr: 1.9099e-06 eta: 2 days, 1:57:53 time: 1.4013 data_time: 0.0268 memory: 25632 grad_norm: 3.3288 loss: 1.0645 caption_loss_cls: 1.9077 detection_loss_cls: 0.0241 detection_loss_reg: 0.3066 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9910 instance_segmentation_loss_cls: 0.0243 instance_segmentation_loss_reg: 0.3137 instance_segmentation_loss_poly: 0.8228 2024/01/11 11:58:32 - mmengine - INFO - Saving checkpoint at 512000 iterations 2024/01/11 12:10:45 - mmengine - INFO - Iter(train) [512500/640000] base_lr: 1.8955e-05 lr: 1.8955e-06 eta: 2 days, 1:50:08 time: 1.4120 data_time: 0.0267 memory: 25632 grad_norm: 3.3165 loss: 1.0588 caption_loss_cls: 1.9066 detection_loss_cls: 0.0242 detection_loss_reg: 0.3069 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9880 instance_segmentation_loss_cls: 0.0243 instance_segmentation_loss_reg: 0.3148 instance_segmentation_loss_poly: 0.8241 2024/01/11 12:22:11 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 12:22:11 - mmengine - INFO - Iter(train) [513000/640000] base_lr: 1.8811e-05 lr: 1.8811e-06 eta: 2 days, 1:36:06 time: 1.4062 data_time: 0.0266 memory: 25632 grad_norm: 3.3346 loss: 1.0634 caption_loss_cls: 1.9086 detection_loss_cls: 0.0241 detection_loss_reg: 0.3052 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9875 instance_segmentation_loss_cls: 0.0243 instance_segmentation_loss_reg: 0.3147 instance_segmentation_loss_poly: 0.8241 2024/01/11 12:33:35 - mmengine - INFO - Iter(train) [513500/640000] base_lr: 1.8668e-05 lr: 1.8668e-06 eta: 2 days, 1:22:03 time: 1.4057 data_time: 0.0265 memory: 25632 grad_norm: 3.3246 loss: 1.0616 caption_loss_cls: 1.9059 detection_loss_cls: 0.0241 detection_loss_reg: 0.3047 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9872 instance_segmentation_loss_cls: 0.0242 instance_segmentation_loss_reg: 0.3141 instance_segmentation_loss_poly: 0.8238 2024/01/11 12:45:14 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 12:45:14 - mmengine - INFO - Iter(train) [514000/640000] base_lr: 1.8525e-05 lr: 1.8525e-06 eta: 2 days, 1:09:51 time: 1.4097 data_time: 0.0263 memory: 25632 grad_norm: 3.2851 loss: 1.0445 caption_loss_cls: 1.9046 detection_loss_cls: 0.0241 detection_loss_reg: 0.3039 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9848 instance_segmentation_loss_cls: 0.0242 instance_segmentation_loss_reg: 0.3139 instance_segmentation_loss_poly: 0.8231 2024/01/11 12:45:14 - mmengine - INFO - Saving checkpoint at 514000 iterations 2024/01/11 12:57:16 - mmengine - INFO - Iter(train) [514500/640000] base_lr: 1.8383e-05 lr: 1.8383e-06 eta: 2 days, 1:00:19 time: 1.4078 data_time: 0.0258 memory: 25632 grad_norm: 3.3442 loss: 1.0429 caption_loss_cls: 1.9010 detection_loss_cls: 0.0243 detection_loss_reg: 0.3049 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9818 instance_segmentation_loss_cls: 0.0242 instance_segmentation_loss_reg: 0.3141 instance_segmentation_loss_poly: 0.8242 2024/01/11 13:08:27 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 13:08:27 - mmengine - INFO - Iter(train) [515000/640000] base_lr: 1.8242e-05 lr: 1.8242e-06 eta: 2 days, 0:45:05 time: 1.3977 data_time: 0.0257 memory: 25632 grad_norm: 3.3537 loss: 1.0434 caption_loss_cls: 1.8990 detection_loss_cls: 0.0241 detection_loss_reg: 0.3044 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9742 instance_segmentation_loss_cls: 0.0243 instance_segmentation_loss_reg: 0.3155 instance_segmentation_loss_poly: 0.8275 2024/01/11 13:20:32 - mmengine - INFO - Iter(train) [515500/640000] base_lr: 1.8101e-05 lr: 1.8101e-06 eta: 2 days, 0:35:46 time: 1.4074 data_time: 0.0259 memory: 25632 grad_norm: 3.3313 loss: 1.0501 caption_loss_cls: 1.9009 detection_loss_cls: 0.0241 detection_loss_reg: 0.3040 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9742 instance_segmentation_loss_cls: 0.0243 instance_segmentation_loss_reg: 0.3157 instance_segmentation_loss_poly: 0.8264 2024/01/11 13:31:53 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 13:31:53 - mmengine - INFO - Iter(train) [516000/640000] base_lr: 1.7960e-05 lr: 1.7960e-06 eta: 2 days, 0:21:49 time: 1.3995 data_time: 0.0259 memory: 25632 grad_norm: 3.3985 loss: 1.0648 caption_loss_cls: 1.8967 detection_loss_cls: 0.0240 detection_loss_reg: 0.3032 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9750 instance_segmentation_loss_cls: 0.0243 instance_segmentation_loss_reg: 0.3174 instance_segmentation_loss_poly: 0.8299 2024/01/11 13:31:53 - mmengine - INFO - Saving checkpoint at 516000 iterations 2024/01/11 13:44:01 - mmengine - INFO - Iter(train) [516500/640000] base_lr: 1.7820e-05 lr: 1.7820e-06 eta: 2 days, 0:12:39 time: 1.3980 data_time: 0.0258 memory: 25632 grad_norm: 3.3666 loss: 1.0698 caption_loss_cls: 1.8962 detection_loss_cls: 0.0240 detection_loss_reg: 0.3029 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9719 instance_segmentation_loss_cls: 0.0244 instance_segmentation_loss_reg: 0.3181 instance_segmentation_loss_poly: 0.8312 2024/01/11 13:55:13 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 13:55:13 - mmengine - INFO - Iter(train) [517000/640000] base_lr: 1.7680e-05 lr: 1.7680e-06 eta: 1 day, 23:57:54 time: 1.3946 data_time: 0.0259 memory: 25632 grad_norm: 3.4318 loss: 1.0750 caption_loss_cls: 1.8968 detection_loss_cls: 0.0240 detection_loss_reg: 0.3014 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9691 instance_segmentation_loss_cls: 0.0245 instance_segmentation_loss_reg: 0.3186 instance_segmentation_loss_poly: 0.8320 2024/01/11 14:06:57 - mmengine - INFO - Iter(train) [517500/640000] base_lr: 1.7541e-05 lr: 1.7541e-06 eta: 1 day, 23:46:22 time: 1.3996 data_time: 0.0259 memory: 25632 grad_norm: 3.4512 loss: 1.0736 caption_loss_cls: 1.8980 detection_loss_cls: 0.0238 detection_loss_reg: 0.3018 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9690 instance_segmentation_loss_cls: 0.0243 instance_segmentation_loss_reg: 0.3177 instance_segmentation_loss_poly: 0.8293 2024/01/11 14:18:53 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 14:18:53 - mmengine - INFO - Iter(train) [518000/640000] base_lr: 1.7403e-05 lr: 1.7403e-06 eta: 1 day, 23:35:57 time: 1.4040 data_time: 0.0260 memory: 25632 grad_norm: 3.4165 loss: 1.0710 caption_loss_cls: 1.8959 detection_loss_cls: 0.0237 detection_loss_reg: 0.3019 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9712 instance_segmentation_loss_cls: 0.0243 instance_segmentation_loss_reg: 0.3170 instance_segmentation_loss_poly: 0.8284 2024/01/11 14:18:53 - mmengine - INFO - Saving checkpoint at 518000 iterations 2024/01/11 14:31:00 - mmengine - INFO - Iter(train) [518500/640000] base_lr: 1.7265e-05 lr: 1.7265e-06 eta: 1 day, 23:26:24 time: 1.4052 data_time: 0.0261 memory: 25632 grad_norm: 3.3512 loss: 1.0630 caption_loss_cls: 1.8891 detection_loss_cls: 0.0238 detection_loss_reg: 0.3031 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9671 instance_segmentation_loss_cls: 0.0243 instance_segmentation_loss_reg: 0.3167 instance_segmentation_loss_poly: 0.8276 2024/01/11 14:42:11 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 14:42:11 - mmengine - INFO - Iter(train) [519000/640000] base_lr: 1.7127e-05 lr: 1.7127e-06 eta: 1 day, 23:11:51 time: 1.4051 data_time: 0.0260 memory: 25632 grad_norm: 3.3858 loss: 1.0612 caption_loss_cls: 1.8858 detection_loss_cls: 0.0238 detection_loss_reg: 0.3023 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9688 instance_segmentation_loss_cls: 0.0240 instance_segmentation_loss_reg: 0.3142 instance_segmentation_loss_poly: 0.8229 2024/01/11 14:53:25 - mmengine - INFO - Iter(train) [519500/640000] base_lr: 1.6990e-05 lr: 1.6990e-06 eta: 1 day, 22:57:44 time: 1.3925 data_time: 0.0258 memory: 25632 grad_norm: 3.4635 loss: 1.0598 caption_loss_cls: 1.8804 detection_loss_cls: 0.0240 detection_loss_reg: 0.3047 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9674 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3141 instance_segmentation_loss_poly: 0.8221 2024/01/11 15:04:53 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 15:04:53 - mmengine - INFO - Iter(train) [520000/640000] base_lr: 1.6853e-05 lr: 1.6853e-06 eta: 1 day, 22:44:53 time: 1.3941 data_time: 0.0259 memory: 25632 grad_norm: 3.4681 loss: 1.0589 caption_loss_cls: 1.8828 detection_loss_cls: 0.0240 detection_loss_reg: 0.3057 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9618 instance_segmentation_loss_cls: 0.0242 instance_segmentation_loss_reg: 0.3164 instance_segmentation_loss_poly: 0.8262 2024/01/11 15:04:53 - mmengine - INFO - Saving checkpoint at 520000 iterations 2024/01/11 15:16:47 - mmengine - INFO - Evaluating bbox... 2024/01/11 15:17:45 - mmengine - INFO - bbox_mAP_copypaste: 0.525 0.704 0.575 0.372 0.575 0.678 2024/01/11 15:17:45 - mmengine - INFO - Evaluating segm... 2024/01/11 15:18:55 - mmengine - INFO - segm_mAP_copypaste: 0.352 0.619 0.348 0.207 0.399 0.540 2024/01/11 15:21:03 - mmengine - INFO - Evaluating bbox... 2024/01/11 15:22:01 - mmengine - INFO - bbox_mAP_copypaste: 0.523 0.704 0.574 0.369 0.574 0.678 2024/01/11 15:28:12 - mmengine - INFO - per class results: 2024/01/11 15:28:12 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 79.31 | 89.5 | | building | 82.8 | 92.04 | | sky | 93.55 | 98.0 | | floor | 82.69 | 90.25 | | tree | 74.48 | 86.79 | | ceiling | 86.22 | 94.71 | | road | 85.12 | 91.14 | | bed | 90.62 | 95.67 | | windowpane | 63.54 | 79.98 | | grass | 68.65 | 85.13 | | cabinet | 63.38 | 75.0 | | sidewalk | 68.61 | 80.41 | | person | 81.95 | 91.87 | | earth | 39.53 | 51.14 | | door | 56.1 | 70.81 | | table | 64.53 | 80.41 | | mountain | 60.86 | 77.28 | | plant | 54.55 | 64.68 | | curtain | 76.31 | 88.01 | | chair | 63.31 | 77.9 | | car | 85.41 | 91.97 | | water | 60.14 | 72.98 | | painting | 73.98 | 87.93 | | sofa | 72.48 | 83.43 | | shelf | 45.71 | 66.52 | | house | 46.07 | 63.79 | | sea | 59.79 | 74.67 | | mirror | 69.54 | 78.26 | | rug | 67.93 | 80.2 | | field | 36.31 | 52.24 | | armchair | 52.51 | 71.34 | | seat | 68.07 | 82.34 | | fence | 47.4 | 65.54 | | desk | 52.98 | 70.81 | | rock | 49.11 | 69.51 | | wardrobe | 45.42 | 65.58 | | lamp | 66.83 | 79.08 | | bathtub | 78.08 | 89.36 | | railing | 42.01 | 58.05 | | cushion | 65.24 | 80.73 | | base | 22.63 | 29.95 | | box | 30.24 | 39.72 | | column | 56.58 | 65.93 | | signboard | 39.36 | 53.73 | | chest of drawers | 36.51 | 54.39 | | counter | 32.1 | 46.07 | | sand | 44.71 | 68.71 | | sink | 78.03 | 86.38 | | skyscraper | 46.6 | 58.95 | | fireplace | 76.71 | 88.7 | | refrigerator | 78.9 | 84.38 | | grandstand | 48.82 | 82.15 | | path | 21.49 | 30.87 | | stairs | 33.56 | 41.48 | | runway | 74.03 | 93.73 | | case | 48.86 | 64.87 | | pool table | 92.23 | 95.68 | | pillow | 61.98 | 71.16 | | screen door | 75.87 | 79.8 | | stairway | 34.11 | 44.24 | | river | 14.68 | 31.92 | | bridge | 69.4 | 79.71 | | bookcase | 41.25 | 66.2 | | blind | 39.25 | 45.27 | | coffee table | 66.33 | 80.91 | | toilet | 87.98 | 92.18 | | flower | 42.56 | 55.05 | | book | 48.91 | 68.11 | | hill | 16.75 | 24.24 | | bench | 62.58 | 74.32 | | countertop | 62.62 | 75.72 | | stove | 80.91 | 84.5 | | palm | 49.46 | 71.97 | | kitchen island | 47.97 | 69.36 | | computer | 77.61 | 89.38 | | swivel chair | 45.64 | 60.29 | | boat | 66.66 | 88.12 | | bar | 35.28 | 43.05 | | arcade machine | 64.06 | 66.67 | | hovel | 24.06 | 26.09 | | bus | 87.05 | 94.71 | | towel | 66.22 | 79.77 | | light | 53.92 | 64.97 | | truck | 48.7 | 67.39 | | tower | 30.15 | 48.01 | | chandelier | 66.71 | 79.85 | | awning | 31.8 | 38.87 | | streetlight | 32.95 | 45.34 | | booth | 40.88 | 57.19 | | television receiver | 74.03 | 87.6 | | airplane | 67.0 | 74.04 | | dirt track | 8.42 | 25.61 | | apparel | 32.39 | 46.73 | | pole | 26.88 | 39.05 | | land | 3.34 | 4.86 | | bannister | 14.92 | 20.32 | | escalator | 26.19 | 28.64 | | ottoman | 57.37 | 72.37 | | bottle | 27.38 | 35.0 | | buffet | 56.88 | 65.68 | | poster | 31.32 | 47.4 | | stage | 11.68 | 16.97 | | van | 46.84 | 62.04 | | ship | 24.83 | 26.2 | | fountain | 21.51 | 21.78 | | conveyer belt | 66.17 | 91.61 | | canopy | 38.96 | 46.3 | | washer | 67.82 | 72.28 | | plaything | 32.17 | 40.99 | | swimming pool | 65.83 | 71.96 | | stool | 43.27 | 61.38 | | barrel | 31.67 | 68.42 | | basket | 34.93 | 47.16 | | waterfall | 70.24 | 90.08 | | tent | 72.08 | 97.11 | | bag | 24.57 | 33.36 | | minibike | 73.13 | 84.36 | | cradle | 84.5 | 96.46 | | oven | 53.67 | 65.5 | | ball | 51.32 | 64.75 | | food | 53.14 | 58.21 | | step | 6.32 | 7.77 | | tank | 48.52 | 51.99 | | trade name | 31.23 | 41.17 | | microwave | 86.63 | 94.61 | | pot | 49.58 | 58.77 | | animal | 63.99 | 68.25 | | bicycle | 59.08 | 74.53 | | lake | 59.89 | 67.96 | | dishwasher | 72.2 | 86.77 | | screen | 62.06 | 78.09 | | blanket | 32.27 | 39.35 | | sculpture | 66.41 | 80.47 | | hood | 65.44 | 72.29 | | sconce | 47.83 | 59.69 | | vase | 45.0 | 62.16 | | traffic light | 41.58 | 62.99 | | tray | 18.55 | 25.98 | | ashcan | 47.49 | 62.03 | | fan | 64.37 | 76.67 | | pier | 45.04 | 51.41 | | crt screen | 6.6 | 20.86 | | plate | 58.34 | 79.12 | | monitor | 4.59 | 5.13 | | bulletin board | 56.91 | 67.58 | | shower | 4.49 | 6.16 | | radiator | 62.1 | 71.81 | | glass | 18.25 | 20.07 | | clock | 40.09 | 48.73 | | flag | 45.3 | 52.48 | +---------------------+-------+-------+ 2024/01/11 15:28:24 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.5230 coco/bbox_mAP_50: 0.7040 coco/bbox_mAP_75: 0.5740 coco/bbox_mAP_s: 0.3690 coco/bbox_mAP_m: 0.5740 coco/bbox_mAP_l: 0.6780 coco/segm_mAP: 0.3520 coco/segm_mAP_50: 0.6190 coco/segm_mAP_75: 0.3480 coco/segm_mAP_s: 0.2070 coco/segm_mAP_m: 0.3990 coco/segm_mAP_l: 0.5400 Bleu_1: 0.7680 Bleu_2: 0.6064 Bleu_3: 0.4651 Bleu_4: 0.3533 METEOR: 0.2813 ROUGE_L: 0.5686 CIDEr: 1.1577 SPICE: 0.2091 aAcc: 84.4200 mIoU: 52.4100 mAcc: 64.3600 visual-grounding/miou: 0.8289 visual-grounding/acc: 0.8852 data_time: 0.0113 time: 1.8968 2024/01/11 15:39:59 - mmengine - INFO - Iter(train) [520500/640000] base_lr: 1.6717e-05 lr: 1.6717e-06 eta: 1 day, 22:32:50 time: 1.3865 data_time: 0.0196 memory: 34721 grad_norm: 3.4951 loss: 1.0486 caption_loss_cls: 1.8791 detection_loss_cls: 0.0240 detection_loss_reg: 0.3074 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9621 instance_segmentation_loss_cls: 0.0242 instance_segmentation_loss_reg: 0.3164 instance_segmentation_loss_poly: 0.8268 2024/01/11 15:52:11 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 15:52:11 - mmengine - INFO - Iter(train) [521000/640000] base_lr: 1.6582e-05 lr: 1.6582e-06 eta: 1 day, 22:23:32 time: 1.4014 data_time: 0.0198 memory: 25630 grad_norm: 3.6162 loss: 1.0458 caption_loss_cls: 1.8837 detection_loss_cls: 0.0240 detection_loss_reg: 0.3062 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9585 instance_segmentation_loss_cls: 0.0241 instance_segmentation_loss_reg: 0.3164 instance_segmentation_loss_poly: 0.8259 2024/01/11 16:03:21 - mmengine - INFO - Iter(train) [521500/640000] base_lr: 1.6446e-05 lr: 1.6446e-06 eta: 1 day, 22:09:20 time: 1.3928 data_time: 0.0198 memory: 25630 grad_norm: 3.6450 loss: 1.0509 caption_loss_cls: 1.8832 detection_loss_cls: 0.0238 detection_loss_reg: 0.3037 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 1.9504 instance_segmentation_loss_cls: 0.0243 instance_segmentation_loss_reg: 0.3181 instance_segmentation_loss_poly: 0.8307 2024/01/11 16:14:19 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 16:14:19 - mmengine - INFO - Iter(train) [522000/640000] base_lr: 1.6312e-05 lr: 1.6312e-06 eta: 1 day, 21:54:25 time: 1.3785 data_time: 0.0197 memory: 25630 grad_norm: 3.7599 loss: 1.0667 caption_loss_cls: 1.8822 detection_loss_cls: 0.0239 detection_loss_reg: 0.3034 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 1.9498 instance_segmentation_loss_cls: 0.0243 instance_segmentation_loss_reg: 0.3186 instance_segmentation_loss_poly: 0.8323 2024/01/11 16:14:19 - mmengine - INFO - Saving checkpoint at 522000 iterations 2024/01/11 16:26:03 - mmengine - INFO - Iter(train) [522500/640000] base_lr: 1.6178e-05 lr: 1.6178e-06 eta: 1 day, 21:42:59 time: 1.3728 data_time: 0.0196 memory: 25630 grad_norm: 3.8213 loss: 1.0722 caption_loss_cls: 1.8814 detection_loss_cls: 0.0239 detection_loss_reg: 0.3035 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9483 instance_segmentation_loss_cls: 0.0242 instance_segmentation_loss_reg: 0.3180 instance_segmentation_loss_poly: 0.8301 2024/01/11 16:37:48 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 16:37:48 - mmengine - INFO - Iter(train) [523000/640000] base_lr: 1.6044e-05 lr: 1.6044e-06 eta: 1 day, 21:31:36 time: 1.3812 data_time: 0.0198 memory: 25630 grad_norm: 3.7983 loss: 1.0730 caption_loss_cls: 1.8839 detection_loss_cls: 0.0239 detection_loss_reg: 0.3031 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9502 instance_segmentation_loss_cls: 0.0241 instance_segmentation_loss_reg: 0.3177 instance_segmentation_loss_poly: 0.8302 2024/01/11 16:48:58 - mmengine - INFO - Iter(train) [523500/640000] base_lr: 1.5911e-05 lr: 1.5911e-06 eta: 1 day, 21:17:43 time: 1.3801 data_time: 0.0199 memory: 25630 grad_norm: 3.8277 loss: 1.0908 caption_loss_cls: 1.8797 detection_loss_cls: 0.0239 detection_loss_reg: 0.3036 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9492 instance_segmentation_loss_cls: 0.0240 instance_segmentation_loss_reg: 0.3172 instance_segmentation_loss_poly: 0.8298 2024/01/11 17:00:45 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 17:00:45 - mmengine - INFO - Iter(train) [524000/640000] base_lr: 1.5779e-05 lr: 1.5779e-06 eta: 1 day, 21:06:32 time: 1.3849 data_time: 0.0199 memory: 25630 grad_norm: 3.8344 loss: 1.0847 caption_loss_cls: 1.8839 detection_loss_cls: 0.0238 detection_loss_reg: 0.3017 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9479 instance_segmentation_loss_cls: 0.0240 instance_segmentation_loss_reg: 0.3175 instance_segmentation_loss_poly: 0.8292 2024/01/11 17:00:45 - mmengine - INFO - Saving checkpoint at 524000 iterations 2024/01/11 17:13:14 - mmengine - INFO - Iter(train) [524500/640000] base_lr: 1.5646e-05 lr: 1.5646e-06 eta: 1 day, 20:58:10 time: 1.3979 data_time: 0.0263 memory: 25630 grad_norm: 3.8125 loss: 1.0865 caption_loss_cls: 1.8838 detection_loss_cls: 0.0239 detection_loss_reg: 0.3020 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9487 instance_segmentation_loss_cls: 0.0240 instance_segmentation_loss_reg: 0.3168 instance_segmentation_loss_poly: 0.8285 2024/01/11 17:24:47 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 17:24:47 - mmengine - INFO - Iter(train) [525000/640000] base_lr: 1.5515e-05 lr: 1.5515e-06 eta: 1 day, 20:45:53 time: 1.3880 data_time: 0.0262 memory: 25630 grad_norm: 3.6913 loss: 1.0908 caption_loss_cls: 1.8781 detection_loss_cls: 0.0239 detection_loss_reg: 0.3031 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 1.9470 instance_segmentation_loss_cls: 0.0241 instance_segmentation_loss_reg: 0.3161 instance_segmentation_loss_poly: 0.8271 2024/01/11 17:36:28 - mmengine - INFO - Iter(train) [525500/640000] base_lr: 1.5384e-05 lr: 1.5384e-06 eta: 1 day, 20:34:14 time: 1.3960 data_time: 0.0262 memory: 25630 grad_norm: 3.6634 loss: 1.0865 caption_loss_cls: 1.8752 detection_loss_cls: 0.0236 detection_loss_reg: 0.2996 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9439 instance_segmentation_loss_cls: 0.0241 instance_segmentation_loss_reg: 0.3166 instance_segmentation_loss_poly: 0.8285 2024/01/11 17:48:47 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 17:48:47 - mmengine - INFO - Iter(train) [526000/640000] base_lr: 1.5253e-05 lr: 1.5253e-06 eta: 1 day, 20:24:55 time: 1.4159 data_time: 0.0266 memory: 25630 grad_norm: 3.5988 loss: 1.0845 caption_loss_cls: 1.8815 detection_loss_cls: 0.0236 detection_loss_reg: 0.2999 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9439 instance_segmentation_loss_cls: 0.0241 instance_segmentation_loss_reg: 0.3163 instance_segmentation_loss_poly: 0.8286 2024/01/11 17:48:47 - mmengine - INFO - Saving checkpoint at 526000 iterations 2024/01/11 18:00:26 - mmengine - INFO - Iter(train) [526500/640000] base_lr: 1.5123e-05 lr: 1.5123e-06 eta: 1 day, 20:13:04 time: 1.4146 data_time: 0.0266 memory: 25630 grad_norm: 3.5944 loss: 1.0943 caption_loss_cls: 1.8772 detection_loss_cls: 0.0239 detection_loss_reg: 0.3031 semantic_segmentation_loss_cls: 0.0064 grounding_loss_reg: 1.9438 instance_segmentation_loss_cls: 0.0242 instance_segmentation_loss_reg: 0.3167 instance_segmentation_loss_poly: 0.8298 2024/01/11 18:12:12 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 18:12:12 - mmengine - INFO - Iter(train) [527000/640000] base_lr: 1.4994e-05 lr: 1.4994e-06 eta: 1 day, 20:01:39 time: 1.4150 data_time: 0.0266 memory: 25630 grad_norm: 3.5716 loss: 1.0954 caption_loss_cls: 1.8797 detection_loss_cls: 0.0237 detection_loss_reg: 0.3014 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9445 instance_segmentation_loss_cls: 0.0241 instance_segmentation_loss_reg: 0.3181 instance_segmentation_loss_poly: 0.8323 2024/01/11 18:23:15 - mmengine - INFO - Iter(train) [527500/640000] base_lr: 1.4865e-05 lr: 1.4865e-06 eta: 1 day, 19:47:39 time: 1.4133 data_time: 0.0265 memory: 25630 grad_norm: 3.5522 loss: 1.0834 caption_loss_cls: 1.8748 detection_loss_cls: 0.0236 detection_loss_reg: 0.3012 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9466 instance_segmentation_loss_cls: 0.0241 instance_segmentation_loss_reg: 0.3176 instance_segmentation_loss_poly: 0.8323 2024/01/11 18:35:27 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 18:35:27 - mmengine - INFO - Iter(train) [528000/640000] base_lr: 1.4736e-05 lr: 1.4736e-06 eta: 1 day, 19:37:48 time: 1.4196 data_time: 0.0265 memory: 25630 grad_norm: 3.5367 loss: 1.0705 caption_loss_cls: 1.8731 detection_loss_cls: 0.0234 detection_loss_reg: 0.3003 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9466 instance_segmentation_loss_cls: 0.0243 instance_segmentation_loss_reg: 0.3185 instance_segmentation_loss_poly: 0.8356 2024/01/11 18:35:27 - mmengine - INFO - Saving checkpoint at 528000 iterations 2024/01/11 18:47:24 - mmengine - INFO - Iter(train) [528500/640000] base_lr: 1.4608e-05 lr: 1.4608e-06 eta: 1 day, 19:26:58 time: 1.4115 data_time: 0.0264 memory: 25630 grad_norm: 3.5535 loss: 1.0742 caption_loss_cls: 1.8743 detection_loss_cls: 0.0234 detection_loss_reg: 0.3007 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9462 instance_segmentation_loss_cls: 0.0242 instance_segmentation_loss_reg: 0.3178 instance_segmentation_loss_poly: 0.8334 2024/01/11 18:58:56 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 18:58:56 - mmengine - INFO - Iter(train) [529000/640000] base_lr: 1.4481e-05 lr: 1.4481e-06 eta: 1 day, 19:14:44 time: 1.4115 data_time: 0.0265 memory: 25630 grad_norm: 3.5186 loss: 1.0803 caption_loss_cls: 1.8720 detection_loss_cls: 0.0234 detection_loss_reg: 0.3009 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9480 instance_segmentation_loss_cls: 0.0242 instance_segmentation_loss_reg: 0.3177 instance_segmentation_loss_poly: 0.8320 2024/01/11 19:10:43 - mmengine - INFO - Iter(train) [529500/640000] base_lr: 1.4354e-05 lr: 1.4354e-06 eta: 1 day, 19:03:18 time: 1.4126 data_time: 0.0264 memory: 25630 grad_norm: 3.4794 loss: 1.0763 caption_loss_cls: 1.8668 detection_loss_cls: 0.0235 detection_loss_reg: 0.3030 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9502 instance_segmentation_loss_cls: 0.0243 instance_segmentation_loss_reg: 0.3176 instance_segmentation_loss_poly: 0.8322 2024/01/11 19:22:04 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 19:22:04 - mmengine - INFO - Iter(train) [530000/640000] base_lr: 1.4227e-05 lr: 1.4227e-06 eta: 1 day, 18:50:31 time: 1.3984 data_time: 0.0261 memory: 25630 grad_norm: 3.4846 loss: 1.0691 caption_loss_cls: 1.8673 detection_loss_cls: 0.0234 detection_loss_reg: 0.3030 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9510 instance_segmentation_loss_cls: 0.0242 instance_segmentation_loss_reg: 0.3167 instance_segmentation_loss_poly: 0.8309 2024/01/11 19:22:04 - mmengine - INFO - Saving checkpoint at 530000 iterations 2024/01/11 19:33:54 - mmengine - INFO - Iter(train) [530500/640000] base_lr: 1.4101e-05 lr: 1.4101e-06 eta: 1 day, 18:39:16 time: 1.4011 data_time: 0.0261 memory: 25630 grad_norm: 3.4827 loss: 1.0694 caption_loss_cls: 1.8695 detection_loss_cls: 0.0235 detection_loss_reg: 0.3038 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9500 instance_segmentation_loss_cls: 0.0241 instance_segmentation_loss_reg: 0.3166 instance_segmentation_loss_poly: 0.8305 2024/01/11 19:45:24 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 19:45:24 - mmengine - INFO - Iter(train) [531000/640000] base_lr: 1.3976e-05 lr: 1.3976e-06 eta: 1 day, 18:26:59 time: 1.3971 data_time: 0.0260 memory: 25630 grad_norm: 3.4633 loss: 1.0600 caption_loss_cls: 1.8746 detection_loss_cls: 0.0236 detection_loss_reg: 0.3039 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9514 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3140 instance_segmentation_loss_poly: 0.8254 2024/01/11 19:57:03 - mmengine - INFO - Iter(train) [531500/640000] base_lr: 1.3851e-05 lr: 1.3851e-06 eta: 1 day, 18:15:09 time: 1.4060 data_time: 0.0261 memory: 25630 grad_norm: 3.4135 loss: 1.0542 caption_loss_cls: 1.8803 detection_loss_cls: 0.0236 detection_loss_reg: 0.3041 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9520 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3123 instance_segmentation_loss_poly: 0.8217 2024/01/11 20:08:40 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 20:08:40 - mmengine - INFO - Iter(train) [532000/640000] base_lr: 1.3727e-05 lr: 1.3727e-06 eta: 1 day, 18:03:15 time: 1.3971 data_time: 0.0260 memory: 25630 grad_norm: 3.4120 loss: 1.0714 caption_loss_cls: 1.8838 detection_loss_cls: 0.0236 detection_loss_reg: 0.3052 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9521 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3119 instance_segmentation_loss_poly: 0.8204 2024/01/11 20:08:40 - mmengine - INFO - Saving checkpoint at 532000 iterations 2024/01/11 20:20:52 - mmengine - INFO - Iter(train) [532500/640000] base_lr: 1.3603e-05 lr: 1.3603e-06 eta: 1 day, 17:53:05 time: 1.4011 data_time: 0.0261 memory: 25630 grad_norm: 3.4242 loss: 1.0747 caption_loss_cls: 1.8841 detection_loss_cls: 0.0235 detection_loss_reg: 0.3039 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9514 instance_segmentation_loss_cls: 0.0240 instance_segmentation_loss_reg: 0.3123 instance_segmentation_loss_poly: 0.8208 2024/01/11 20:32:56 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 20:32:56 - mmengine - INFO - Iter(train) [533000/640000] base_lr: 1.3480e-05 lr: 1.3480e-06 eta: 1 day, 17:42:26 time: 1.4088 data_time: 0.0262 memory: 25630 grad_norm: 3.3789 loss: 1.0645 caption_loss_cls: 1.8830 detection_loss_cls: 0.0235 detection_loss_reg: 0.3057 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9529 instance_segmentation_loss_cls: 0.0241 instance_segmentation_loss_reg: 0.3128 instance_segmentation_loss_poly: 0.8225 2024/01/11 20:44:24 - mmengine - INFO - Iter(train) [533500/640000] base_lr: 1.3357e-05 lr: 1.3357e-06 eta: 1 day, 17:30:06 time: 1.4044 data_time: 0.0261 memory: 25630 grad_norm: 3.4356 loss: 1.0660 caption_loss_cls: 1.8847 detection_loss_cls: 0.0235 detection_loss_reg: 0.3052 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9492 instance_segmentation_loss_cls: 0.0241 instance_segmentation_loss_reg: 0.3125 instance_segmentation_loss_poly: 0.8223 2024/01/11 20:55:36 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 20:55:36 - mmengine - INFO - Iter(train) [534000/640000] base_lr: 1.3235e-05 lr: 1.3235e-06 eta: 1 day, 17:17:02 time: 1.4021 data_time: 0.0261 memory: 25630 grad_norm: 3.4360 loss: 1.0683 caption_loss_cls: 1.8860 detection_loss_cls: 0.0235 detection_loss_reg: 0.3048 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9515 instance_segmentation_loss_cls: 0.0242 instance_segmentation_loss_reg: 0.3138 instance_segmentation_loss_poly: 0.8255 2024/01/11 20:55:36 - mmengine - INFO - Saving checkpoint at 534000 iterations 2024/01/11 21:07:56 - mmengine - INFO - Iter(train) [534500/640000] base_lr: 1.3113e-05 lr: 1.3113e-06 eta: 1 day, 17:07:05 time: 1.4095 data_time: 0.0262 memory: 25630 grad_norm: 3.4072 loss: 1.0649 caption_loss_cls: 1.8849 detection_loss_cls: 0.0234 detection_loss_reg: 0.3049 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9490 instance_segmentation_loss_cls: 0.0243 instance_segmentation_loss_reg: 0.3163 instance_segmentation_loss_poly: 0.8295 2024/01/11 21:19:07 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 21:19:07 - mmengine - INFO - Iter(train) [535000/640000] base_lr: 1.2992e-05 lr: 1.2992e-06 eta: 1 day, 16:53:59 time: 1.4046 data_time: 0.0262 memory: 25630 grad_norm: 3.4549 loss: 1.0730 caption_loss_cls: 1.8779 detection_loss_cls: 0.0234 detection_loss_reg: 0.3049 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9505 instance_segmentation_loss_cls: 0.0243 instance_segmentation_loss_reg: 0.3161 instance_segmentation_loss_poly: 0.8295 2024/01/11 21:30:57 - mmengine - INFO - Iter(train) [535500/640000] base_lr: 1.2871e-05 lr: 1.2871e-06 eta: 1 day, 16:42:40 time: 1.4075 data_time: 0.0263 memory: 25630 grad_norm: 3.4088 loss: 1.0634 caption_loss_cls: 1.8754 detection_loss_cls: 0.0235 detection_loss_reg: 0.3056 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9504 instance_segmentation_loss_cls: 0.0242 instance_segmentation_loss_reg: 0.3145 instance_segmentation_loss_poly: 0.8253 2024/01/11 21:42:34 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 21:42:34 - mmengine - INFO - Iter(train) [536000/640000] base_lr: 1.2751e-05 lr: 1.2751e-06 eta: 1 day, 16:30:47 time: 1.4075 data_time: 0.0262 memory: 25630 grad_norm: 3.4021 loss: 1.0518 caption_loss_cls: 1.8767 detection_loss_cls: 0.0235 detection_loss_reg: 0.3047 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9515 instance_segmentation_loss_cls: 0.0241 instance_segmentation_loss_reg: 0.3141 instance_segmentation_loss_poly: 0.8246 2024/01/11 21:42:34 - mmengine - INFO - Saving checkpoint at 536000 iterations 2024/01/11 21:54:25 - mmengine - INFO - Iter(train) [536500/640000] base_lr: 1.2631e-05 lr: 1.2631e-06 eta: 1 day, 16:19:29 time: 1.4021 data_time: 0.0262 memory: 25630 grad_norm: 3.3917 loss: 1.0467 caption_loss_cls: 1.8740 detection_loss_cls: 0.0234 detection_loss_reg: 0.3035 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.9532 instance_segmentation_loss_cls: 0.0241 instance_segmentation_loss_reg: 0.3119 instance_segmentation_loss_poly: 0.8208 2024/01/11 22:05:49 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 22:05:49 - mmengine - INFO - Iter(train) [537000/640000] base_lr: 1.2512e-05 lr: 1.2512e-06 eta: 1 day, 16:07:04 time: 1.3923 data_time: 0.0260 memory: 25630 grad_norm: 3.4503 loss: 1.0510 caption_loss_cls: 1.8735 detection_loss_cls: 0.0234 detection_loss_reg: 0.3043 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9534 instance_segmentation_loss_cls: 0.0240 instance_segmentation_loss_reg: 0.3113 instance_segmentation_loss_poly: 0.8195 2024/01/11 22:17:20 - mmengine - INFO - Iter(train) [537500/640000] base_lr: 1.2393e-05 lr: 1.2393e-06 eta: 1 day, 15:54:56 time: 1.3928 data_time: 0.0262 memory: 25630 grad_norm: 3.4279 loss: 1.0528 caption_loss_cls: 1.8720 detection_loss_cls: 0.0233 detection_loss_reg: 0.3036 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9537 instance_segmentation_loss_cls: 0.0240 instance_segmentation_loss_reg: 0.3119 instance_segmentation_loss_poly: 0.8200 2024/01/11 22:28:50 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 22:28:50 - mmengine - INFO - Iter(train) [538000/640000] base_lr: 1.2275e-05 lr: 1.2275e-06 eta: 1 day, 15:42:47 time: 1.3973 data_time: 0.0261 memory: 25630 grad_norm: 3.4562 loss: 1.0560 caption_loss_cls: 1.8793 detection_loss_cls: 0.0232 detection_loss_reg: 0.3028 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9524 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3110 instance_segmentation_loss_poly: 0.8181 2024/01/11 22:28:50 - mmengine - INFO - Saving checkpoint at 538000 iterations 2024/01/11 22:41:00 - mmengine - INFO - Iter(train) [538500/640000] base_lr: 1.2158e-05 lr: 1.2158e-06 eta: 1 day, 15:32:15 time: 1.3949 data_time: 0.0262 memory: 25630 grad_norm: 3.4704 loss: 1.0502 caption_loss_cls: 1.8799 detection_loss_cls: 0.0231 detection_loss_reg: 0.3018 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9507 instance_segmentation_loss_cls: 0.0238 instance_segmentation_loss_reg: 0.3110 instance_segmentation_loss_poly: 0.8178 2024/01/11 22:52:52 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 22:52:52 - mmengine - INFO - Iter(train) [539000/640000] base_lr: 1.2041e-05 lr: 1.2041e-06 eta: 1 day, 15:20:59 time: 1.4054 data_time: 0.0263 memory: 25630 grad_norm: 3.4820 loss: 1.0531 caption_loss_cls: 1.8839 detection_loss_cls: 0.0232 detection_loss_reg: 0.3034 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9453 instance_segmentation_loss_cls: 0.0240 instance_segmentation_loss_reg: 0.3130 instance_segmentation_loss_poly: 0.8219 2024/01/11 23:04:35 - mmengine - INFO - Iter(train) [539500/640000] base_lr: 1.1924e-05 lr: 1.1924e-06 eta: 1 day, 15:09:20 time: 1.4035 data_time: 0.0264 memory: 25630 grad_norm: 3.5560 loss: 1.0663 caption_loss_cls: 1.8840 detection_loss_cls: 0.0231 detection_loss_reg: 0.3036 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9511 instance_segmentation_loss_cls: 0.0241 instance_segmentation_loss_reg: 0.3134 instance_segmentation_loss_poly: 0.8237 2024/01/11 23:16:22 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/11 23:16:22 - mmengine - INFO - Iter(train) [540000/640000] base_lr: 1.1808e-05 lr: 1.1808e-06 eta: 1 day, 14:57:51 time: 1.4061 data_time: 0.0264 memory: 25630 grad_norm: 3.5414 loss: 1.0699 caption_loss_cls: 1.8897 detection_loss_cls: 0.0230 detection_loss_reg: 0.3022 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9567 instance_segmentation_loss_cls: 0.0240 instance_segmentation_loss_reg: 0.3127 instance_segmentation_loss_poly: 0.8238 2024/01/11 23:16:22 - mmengine - INFO - Saving checkpoint at 540000 iterations 2024/01/11 23:28:05 - mmengine - INFO - Evaluating bbox... 2024/01/11 23:29:05 - mmengine - INFO - bbox_mAP_copypaste: 0.529 0.710 0.580 0.371 0.575 0.682 2024/01/11 23:29:05 - mmengine - INFO - Evaluating segm... 2024/01/11 23:30:16 - mmengine - INFO - segm_mAP_copypaste: 0.356 0.625 0.354 0.204 0.402 0.543 2024/01/11 23:32:24 - mmengine - INFO - Evaluating bbox... 2024/01/11 23:33:23 - mmengine - INFO - bbox_mAP_copypaste: 0.524 0.707 0.573 0.355 0.571 0.682 2024/01/11 23:39:31 - mmengine - INFO - per class results: 2024/01/11 23:39:31 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 79.46 | 89.9 | | building | 83.04 | 92.21 | | sky | 93.67 | 97.92 | | floor | 82.95 | 90.78 | | tree | 74.61 | 87.32 | | ceiling | 86.51 | 94.39 | | road | 84.96 | 90.23 | | bed | 90.76 | 96.13 | | windowpane | 63.78 | 79.67 | | grass | 66.34 | 80.78 | | cabinet | 63.92 | 73.77 | | sidewalk | 69.64 | 82.14 | | person | 82.25 | 92.3 | | earth | 39.12 | 53.45 | | door | 55.63 | 70.56 | | table | 65.01 | 80.24 | | mountain | 61.05 | 78.11 | | plant | 54.19 | 64.74 | | curtain | 76.64 | 87.61 | | chair | 63.38 | 77.46 | | car | 85.33 | 92.27 | | water | 60.88 | 73.7 | | painting | 74.69 | 88.07 | | sofa | 73.29 | 82.89 | | shelf | 47.99 | 69.71 | | house | 44.79 | 60.46 | | sea | 61.31 | 74.15 | | mirror | 68.59 | 78.04 | | rug | 65.62 | 75.68 | | field | 37.94 | 59.13 | | armchair | 53.88 | 72.51 | | seat | 67.85 | 82.57 | | fence | 48.36 | 65.61 | | desk | 52.72 | 72.01 | | rock | 47.5 | 66.26 | | wardrobe | 44.8 | 64.62 | | lamp | 67.53 | 79.96 | | bathtub | 79.79 | 89.29 | | railing | 41.8 | 57.77 | | cushion | 64.77 | 80.35 | | base | 19.95 | 27.81 | | box | 29.96 | 39.21 | | column | 57.01 | 66.37 | | signboard | 39.6 | 54.7 | | chest of drawers | 40.49 | 62.22 | | counter | 29.21 | 40.96 | | sand | 44.72 | 68.9 | | sink | 75.62 | 83.45 | | skyscraper | 47.01 | 59.32 | | fireplace | 76.05 | 88.43 | | refrigerator | 81.44 | 85.97 | | grandstand | 48.8 | 82.6 | | path | 21.42 | 30.16 | | stairs | 32.96 | 40.19 | | runway | 74.12 | 94.77 | | case | 51.64 | 65.25 | | pool table | 92.12 | 95.67 | | pillow | 60.59 | 69.06 | | screen door | 70.3 | 74.04 | | stairway | 32.54 | 43.38 | | river | 15.09 | 35.43 | | bridge | 69.17 | 79.14 | | bookcase | 41.75 | 57.46 | | blind | 38.91 | 44.31 | | coffee table | 64.48 | 82.79 | | toilet | 87.69 | 92.39 | | flower | 44.03 | 55.28 | | book | 49.27 | 68.91 | | hill | 16.65 | 25.22 | | bench | 63.32 | 74.2 | | countertop | 62.83 | 80.74 | | stove | 81.24 | 84.82 | | palm | 48.39 | 71.63 | | kitchen island | 51.2 | 74.13 | | computer | 77.91 | 89.04 | | swivel chair | 46.9 | 62.43 | | boat | 67.23 | 87.69 | | bar | 38.26 | 47.66 | | arcade machine | 66.29 | 69.85 | | hovel | 15.5 | 16.51 | | bus | 86.81 | 94.73 | | towel | 67.07 | 80.08 | | light | 53.23 | 63.0 | | truck | 49.78 | 66.47 | | tower | 31.07 | 50.95 | | chandelier | 67.6 | 78.29 | | awning | 31.98 | 39.75 | | streetlight | 35.38 | 48.01 | | booth | 41.72 | 59.63 | | television receiver | 73.77 | 86.28 | | airplane | 67.37 | 76.39 | | dirt track | 8.09 | 22.55 | | apparel | 32.5 | 48.55 | | pole | 29.96 | 45.68 | | land | 3.48 | 4.86 | | bannister | 17.3 | 24.04 | | escalator | 27.38 | 29.62 | | ottoman | 59.53 | 72.27 | | bottle | 28.04 | 36.12 | | buffet | 60.03 | 65.46 | | poster | 33.06 | 49.0 | | stage | 11.98 | 17.65 | | van | 46.34 | 60.3 | | ship | 9.16 | 9.69 | | fountain | 19.77 | 20.08 | | conveyer belt | 65.54 | 91.86 | | canopy | 37.2 | 43.39 | | washer | 68.65 | 71.9 | | plaything | 30.37 | 38.09 | | swimming pool | 68.34 | 69.97 | | stool | 44.38 | 61.68 | | barrel | 21.67 | 74.95 | | basket | 35.74 | 45.85 | | waterfall | 65.01 | 87.92 | | tent | 75.67 | 97.7 | | bag | 25.88 | 35.06 | | minibike | 73.33 | 85.89 | | cradle | 84.86 | 96.81 | | oven | 51.57 | 63.47 | | ball | 50.05 | 63.79 | | food | 53.25 | 58.13 | | step | 7.64 | 9.59 | | tank | 51.25 | 55.16 | | trade name | 27.62 | 33.27 | | microwave | 86.83 | 94.07 | | pot | 52.6 | 61.44 | | animal | 63.45 | 67.55 | | bicycle | 58.78 | 74.33 | | lake | 59.51 | 65.11 | | dishwasher | 78.36 | 87.37 | | screen | 60.42 | 80.53 | | blanket | 31.06 | 37.7 | | sculpture | 68.43 | 80.68 | | hood | 62.85 | 72.51 | | sconce | 48.04 | 60.79 | | vase | 45.41 | 63.48 | | traffic light | 40.44 | 55.03 | | tray | 21.15 | 32.1 | | ashcan | 42.74 | 59.2 | | fan | 64.58 | 76.04 | | pier | 45.25 | 52.45 | | crt screen | 4.68 | 13.53 | | plate | 59.08 | 76.89 | | monitor | 5.3 | 6.07 | | bulletin board | 57.37 | 69.51 | | shower | 4.43 | 6.92 | | radiator | 61.87 | 72.04 | | glass | 19.73 | 21.67 | | clock | 39.03 | 46.84 | | flag | 45.14 | 52.63 | +---------------------+-------+-------+ 2024/01/11 23:39:43 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.5240 coco/bbox_mAP_50: 0.7070 coco/bbox_mAP_75: 0.5730 coco/bbox_mAP_s: 0.3550 coco/bbox_mAP_m: 0.5710 coco/bbox_mAP_l: 0.6820 coco/segm_mAP: 0.3560 coco/segm_mAP_50: 0.6250 coco/segm_mAP_75: 0.3540 coco/segm_mAP_s: 0.2040 coco/segm_mAP_m: 0.4020 coco/segm_mAP_l: 0.5430 Bleu_1: 0.7732 Bleu_2: 0.6124 Bleu_3: 0.4704 Bleu_4: 0.3577 METEOR: 0.2823 ROUGE_L: 0.5719 CIDEr: 1.1743 SPICE: 0.2109 aAcc: 84.4500 mIoU: 52.3700 mAcc: 64.2500 visual-grounding/miou: 0.8333 visual-grounding/acc: 0.8900 data_time: 0.0118 time: 1.9005 2024/01/11 23:51:38 - mmengine - INFO - Iter(train) [540500/640000] base_lr: 1.1693e-05 lr: 1.1693e-06 eta: 1 day, 14:46:42 time: 1.4074 data_time: 0.0200 memory: 34721 grad_norm: 3.5044 loss: 1.0642 caption_loss_cls: 1.8882 detection_loss_cls: 0.0230 detection_loss_reg: 0.3017 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9529 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3116 instance_segmentation_loss_poly: 0.8219 2024/01/12 00:03:11 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/12 00:03:11 - mmengine - INFO - Iter(train) [541000/640000] base_lr: 1.1578e-05 lr: 1.1578e-06 eta: 1 day, 14:34:41 time: 1.4096 data_time: 0.0201 memory: 25630 grad_norm: 3.5146 loss: 1.0626 caption_loss_cls: 1.8867 detection_loss_cls: 0.0230 detection_loss_reg: 0.3016 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9542 instance_segmentation_loss_cls: 0.0239 instance_segmentation_loss_reg: 0.3111 instance_segmentation_loss_poly: 0.8206 2024/01/12 00:15:08 - mmengine - INFO - Iter(train) [541500/640000] base_lr: 1.1463e-05 lr: 1.1463e-06 eta: 1 day, 14:23:32 time: 1.4162 data_time: 0.0201 memory: 25630 grad_norm: 3.5128 loss: 1.0613 caption_loss_cls: 1.8863 detection_loss_cls: 0.0231 detection_loss_reg: 0.3035 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9596 instance_segmentation_loss_cls: 0.0237 instance_segmentation_loss_reg: 0.3089 instance_segmentation_loss_poly: 0.8161 2024/01/12 00:26:40 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/12 00:26:40 - mmengine - INFO - Iter(train) [542000/640000] base_lr: 1.1350e-05 lr: 1.1350e-06 eta: 1 day, 14:11:29 time: 1.4168 data_time: 0.0201 memory: 25630 grad_norm: 3.4864 loss: 1.0607 caption_loss_cls: 1.8936 detection_loss_cls: 0.0232 detection_loss_reg: 0.3043 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9583 instance_segmentation_loss_cls: 0.0235 instance_segmentation_loss_reg: 0.3073 instance_segmentation_loss_poly: 0.8130 2024/01/12 00:26:40 - mmengine - INFO - Saving checkpoint at 542000 iterations 2024/01/12 00:39:00 - mmengine - INFO - Iter(train) [542500/640000] base_lr: 1.1236e-05 lr: 1.1236e-06 eta: 1 day, 14:01:08 time: 1.4193 data_time: 0.0200 memory: 25630 grad_norm: 3.4412 loss: 1.0553 caption_loss_cls: 1.8948 detection_loss_cls: 0.0231 detection_loss_reg: 0.3046 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9599 instance_segmentation_loss_cls: 0.0235 instance_segmentation_loss_reg: 0.3080 instance_segmentation_loss_poly: 0.8155 2024/01/12 00:50:40 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/12 00:50:40 - mmengine - INFO - Iter(train) [543000/640000] base_lr: 1.1123e-05 lr: 1.1123e-06 eta: 1 day, 13:49:21 time: 1.4161 data_time: 0.0199 memory: 25630 grad_norm: 3.4046 loss: 1.0496 caption_loss_cls: 1.8938 detection_loss_cls: 0.0229 detection_loss_reg: 0.3021 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9575 instance_segmentation_loss_cls: 0.0237 instance_segmentation_loss_reg: 0.3096 instance_segmentation_loss_poly: 0.8182 2024/01/12 01:02:15 - mmengine - INFO - Iter(train) [543500/640000] base_lr: 1.1011e-05 lr: 1.1011e-06 eta: 1 day, 13:37:25 time: 1.4143 data_time: 0.0199 memory: 25630 grad_norm: 3.3896 loss: 1.0535 caption_loss_cls: 1.8945 detection_loss_cls: 0.0230 detection_loss_reg: 0.3025 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9558 instance_segmentation_loss_cls: 0.0235 instance_segmentation_loss_reg: 0.3071 instance_segmentation_loss_poly: 0.8128 2024/01/12 01:14:27 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/12 01:14:27 - mmengine - INFO - Iter(train) [544000/640000] base_lr: 1.0900e-05 lr: 1.0900e-06 eta: 1 day, 13:26:41 time: 1.4203 data_time: 0.0201 memory: 25630 grad_norm: 3.4021 loss: 1.0620 caption_loss_cls: 1.8920 detection_loss_cls: 0.0230 detection_loss_reg: 0.3023 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9543 instance_segmentation_loss_cls: 0.0237 instance_segmentation_loss_reg: 0.3082 instance_segmentation_loss_poly: 0.8155 2024/01/12 01:14:27 - mmengine - INFO - Saving checkpoint at 544000 iterations 2024/01/12 01:26:34 - mmengine - INFO - Iter(train) [544500/640000] base_lr: 1.0788e-05 lr: 1.0788e-06 eta: 1 day, 13:15:48 time: 1.4231 data_time: 0.0264 memory: 25630 grad_norm: 3.4494 loss: 1.0681 caption_loss_cls: 1.8888 detection_loss_cls: 0.0230 detection_loss_reg: 0.3022 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9541 instance_segmentation_loss_cls: 0.0238 instance_segmentation_loss_reg: 0.3089 instance_segmentation_loss_poly: 0.8168 2024/01/12 01:38:36 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/12 01:38:36 - mmengine - INFO - Iter(train) [545000/640000] base_lr: 1.0678e-05 lr: 1.0678e-06 eta: 1 day, 13:04:43 time: 1.4304 data_time: 0.0265 memory: 25630 grad_norm: 3.4166 loss: 1.0671 caption_loss_cls: 1.8909 detection_loss_cls: 0.0228 detection_loss_reg: 0.3008 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9556 instance_segmentation_loss_cls: 0.0237 instance_segmentation_loss_reg: 0.3090 instance_segmentation_loss_poly: 0.8162 2024/01/12 01:50:34 - mmengine - INFO - Iter(train) [545500/640000] base_lr: 1.0568e-05 lr: 1.0568e-06 eta: 1 day, 12:53:28 time: 1.4304 data_time: 0.0264 memory: 25630 grad_norm: 3.3609 loss: 1.0581 caption_loss_cls: 1.8920 detection_loss_cls: 0.0230 detection_loss_reg: 0.3013 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9558 instance_segmentation_loss_cls: 0.0237 instance_segmentation_loss_reg: 0.3081 instance_segmentation_loss_poly: 0.8132 2024/01/12 02:02:32 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/12 02:02:32 - mmengine - INFO - Iter(train) [546000/640000] base_lr: 1.0458e-05 lr: 1.0458e-06 eta: 1 day, 12:42:13 time: 1.4370 data_time: 0.0265 memory: 25630 grad_norm: 3.3510 loss: 1.0479 caption_loss_cls: 1.8867 detection_loss_cls: 0.0229 detection_loss_reg: 0.3002 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9526 instance_segmentation_loss_cls: 0.0237 instance_segmentation_loss_reg: 0.3078 instance_segmentation_loss_poly: 0.8126 2024/01/12 02:02:32 - mmengine - INFO - Saving checkpoint at 546000 iterations 2024/01/12 02:14:46 - mmengine - INFO - Iter(train) [546500/640000] base_lr: 1.0349e-05 lr: 1.0349e-06 eta: 1 day, 12:31:27 time: 1.4354 data_time: 0.0266 memory: 25630 grad_norm: 3.3915 loss: 1.0560 caption_loss_cls: 1.8846 detection_loss_cls: 0.0227 detection_loss_reg: 0.2965 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9562 instance_segmentation_loss_cls: 0.0236 instance_segmentation_loss_reg: 0.3080 instance_segmentation_loss_poly: 0.8134 2024/01/12 02:25:58 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/12 02:25:58 - mmengine - INFO - Iter(train) [547000/640000] base_lr: 1.0241e-05 lr: 1.0241e-06 eta: 1 day, 12:18:46 time: 1.4284 data_time: 0.0265 memory: 25630 grad_norm: 3.4313 loss: 1.0647 caption_loss_cls: 1.8877 detection_loss_cls: 0.0228 detection_loss_reg: 0.2975 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9550 instance_segmentation_loss_cls: 0.0236 instance_segmentation_loss_reg: 0.3070 instance_segmentation_loss_poly: 0.8124 2024/01/12 02:37:31 - mmengine - INFO - Iter(train) [547500/640000] base_lr: 1.0133e-05 lr: 1.0133e-06 eta: 1 day, 12:06:46 time: 1.4280 data_time: 0.0265 memory: 25630 grad_norm: 3.4312 loss: 1.0678 caption_loss_cls: 1.8990 detection_loss_cls: 0.0230 detection_loss_reg: 0.2985 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9515 instance_segmentation_loss_cls: 0.0237 instance_segmentation_loss_reg: 0.3086 instance_segmentation_loss_poly: 0.8150 2024/01/12 02:49:21 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/12 02:49:21 - mmengine - INFO - Iter(train) [548000/640000] base_lr: 1.0025e-05 lr: 1.0025e-06 eta: 1 day, 11:55:15 time: 1.4227 data_time: 0.0263 memory: 25630 grad_norm: 3.4458 loss: 1.0596 caption_loss_cls: 1.8969 detection_loss_cls: 0.0231 detection_loss_reg: 0.3006 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9495 instance_segmentation_loss_cls: 0.0236 instance_segmentation_loss_reg: 0.3085 instance_segmentation_loss_poly: 0.8147 2024/01/12 02:49:21 - mmengine - INFO - Saving checkpoint at 548000 iterations 2024/01/12 03:01:15 - mmengine - INFO - Iter(train) [548500/640000] base_lr: 9.9185e-06 lr: 9.9185e-07 eta: 1 day, 11:43:50 time: 1.4191 data_time: 0.0261 memory: 25630 grad_norm: 3.4514 loss: 1.0567 caption_loss_cls: 1.8937 detection_loss_cls: 0.0232 detection_loss_reg: 0.3006 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9503 instance_segmentation_loss_cls: 0.0237 instance_segmentation_loss_reg: 0.3093 instance_segmentation_loss_poly: 0.8174 2024/01/12 03:13:07 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/12 03:13:07 - mmengine - INFO - Iter(train) [549000/640000] base_lr: 9.8122e-06 lr: 9.8122e-07 eta: 1 day, 11:32:22 time: 1.4167 data_time: 0.0260 memory: 25630 grad_norm: 3.4914 loss: 1.0538 caption_loss_cls: 1.8925 detection_loss_cls: 0.0231 detection_loss_reg: 0.3007 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9480 instance_segmentation_loss_cls: 0.0235 instance_segmentation_loss_reg: 0.3078 instance_segmentation_loss_poly: 0.8144 2024/01/12 03:24:48 - mmengine - INFO - Iter(train) [549500/640000] base_lr: 9.7065e-06 lr: 9.7065e-07 eta: 1 day, 11:20:34 time: 1.4125 data_time: 0.0261 memory: 25630 grad_norm: 3.5528 loss: 1.0586 caption_loss_cls: 1.8940 detection_loss_cls: 0.0231 detection_loss_reg: 0.2999 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9457 instance_segmentation_loss_cls: 0.0234 instance_segmentation_loss_reg: 0.3077 instance_segmentation_loss_poly: 0.8148 2024/01/12 03:36:39 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/12 03:36:39 - mmengine - INFO - Iter(train) [550000/640000] base_lr: 9.6013e-06 lr: 9.6013e-07 eta: 1 day, 11:09:05 time: 1.4108 data_time: 0.0259 memory: 25630 grad_norm: 3.5374 loss: 1.0516 caption_loss_cls: 1.8932 detection_loss_cls: 0.0232 detection_loss_reg: 0.3008 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9446 instance_segmentation_loss_cls: 0.0233 instance_segmentation_loss_reg: 0.3060 instance_segmentation_loss_poly: 0.8120 2024/01/12 03:36:39 - mmengine - INFO - Saving checkpoint at 550000 iterations 2024/01/12 03:48:59 - mmengine - INFO - Iter(train) [550500/640000] base_lr: 9.4966e-06 lr: 9.4966e-07 eta: 1 day, 10:58:22 time: 1.4124 data_time: 0.0262 memory: 25630 grad_norm: 3.5212 loss: 1.0510 caption_loss_cls: 1.8887 detection_loss_cls: 0.0229 detection_loss_reg: 0.2983 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9473 instance_segmentation_loss_cls: 0.0235 instance_segmentation_loss_reg: 0.3090 instance_segmentation_loss_poly: 0.8191 2024/01/12 04:00:31 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/12 04:00:31 - mmengine - INFO - Iter(train) [551000/640000] base_lr: 9.3925e-06 lr: 9.3925e-07 eta: 1 day, 10:46:19 time: 1.4173 data_time: 0.0263 memory: 25630 grad_norm: 3.5243 loss: 1.0413 caption_loss_cls: 1.8856 detection_loss_cls: 0.0229 detection_loss_reg: 0.2979 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9452 instance_segmentation_loss_cls: 0.0236 instance_segmentation_loss_reg: 0.3107 instance_segmentation_loss_poly: 0.8219 2024/01/12 04:12:04 - mmengine - INFO - Iter(train) [551500/640000] base_lr: 9.2889e-06 lr: 9.2889e-07 eta: 1 day, 10:34:19 time: 1.4172 data_time: 0.0263 memory: 25630 grad_norm: 3.5351 loss: 1.0341 caption_loss_cls: 1.8815 detection_loss_cls: 0.0229 detection_loss_reg: 0.3001 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9442 instance_segmentation_loss_cls: 0.0237 instance_segmentation_loss_reg: 0.3115 instance_segmentation_loss_poly: 0.8237 2024/01/12 04:23:57 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/12 04:23:57 - mmengine - INFO - Iter(train) [552000/640000] base_lr: 9.1859e-06 lr: 9.1859e-07 eta: 1 day, 10:22:50 time: 1.4179 data_time: 0.0263 memory: 25630 grad_norm: 3.5475 loss: 1.0296 caption_loss_cls: 1.8731 detection_loss_cls: 0.0228 detection_loss_reg: 0.2987 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9403 instance_segmentation_loss_cls: 0.0236 instance_segmentation_loss_reg: 0.3114 instance_segmentation_loss_poly: 0.8244 2024/01/12 04:23:57 - mmengine - INFO - Saving checkpoint at 552000 iterations 2024/01/12 04:35:49 - mmengine - INFO - Iter(train) [552500/640000] base_lr: 9.0834e-06 lr: 9.0834e-07 eta: 1 day, 10:11:19 time: 1.4175 data_time: 0.0265 memory: 25630 grad_norm: 3.5826 loss: 1.0423 caption_loss_cls: 1.8678 detection_loss_cls: 0.0229 detection_loss_reg: 0.3004 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9419 instance_segmentation_loss_cls: 0.0236 instance_segmentation_loss_reg: 0.3119 instance_segmentation_loss_poly: 0.8254 2024/01/12 04:47:32 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/12 04:47:32 - mmengine - INFO - Iter(train) [553000/640000] base_lr: 8.9815e-06 lr: 8.9815e-07 eta: 1 day, 9:59:34 time: 1.4152 data_time: 0.0264 memory: 25630 grad_norm: 3.5492 loss: 1.0360 caption_loss_cls: 1.8670 detection_loss_cls: 0.0229 detection_loss_reg: 0.3007 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9392 instance_segmentation_loss_cls: 0.0235 instance_segmentation_loss_reg: 0.3106 instance_segmentation_loss_poly: 0.8225 2024/01/12 04:59:08 - mmengine - INFO - Iter(train) [553500/640000] base_lr: 8.8801e-06 lr: 8.8801e-07 eta: 1 day, 9:47:40 time: 1.4141 data_time: 0.0264 memory: 25630 grad_norm: 3.5295 loss: 1.0417 caption_loss_cls: 1.8730 detection_loss_cls: 0.0229 detection_loss_reg: 0.3014 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9391 instance_segmentation_loss_cls: 0.0235 instance_segmentation_loss_reg: 0.3111 instance_segmentation_loss_poly: 0.8237 2024/01/12 05:10:46 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240111_051648 2024/01/12 05:10:46 - mmengine - INFO - Iter(train) [554000/640000] base_lr: 8.7792e-06 lr: 8.7792e-07 eta: 1 day, 9:35:48 time: 1.4108 data_time: 0.0266 memory: 25630 grad_norm: 3.5525 loss: 1.0511 caption_loss_cls: 1.8699 detection_loss_cls: 0.0229 detection_loss_reg: 0.3016 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9341 instance_segmentation_loss_cls: 0.0235 instance_segmentation_loss_reg: 0.3104 instance_segmentation_loss_poly: 0.8215 2024/01/12 05:10:46 - mmengine - INFO - Saving checkpoint at 554000 iterations 2024/01/12 05:32:26 - mmengine - INFO - Iter(train) [554500/640000] base_lr: 8.6790e-06 lr: 8.6790e-07 eta: 1 day, 8:32:35 time: 1.3971 data_time: 0.0194 memory: 25641 grad_norm: 3.5491 loss: 1.0471 caption_loss_cls: 1.8674 detection_loss_cls: 0.0229 detection_loss_reg: 0.3007 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9327 instance_segmentation_loss_cls: 0.0235 instance_segmentation_loss_reg: 0.3095 instance_segmentation_loss_poly: 0.8213 2024/01/12 05:43:45 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 05:43:45 - mmengine - INFO - Iter(train) [555000/640000] base_lr: 8.5792e-06 lr: 8.5792e-07 eta: 1 day, 8:12:15 time: 1.3940 data_time: 0.0191 memory: 25641 grad_norm: 3.5572 loss: 1.0407 caption_loss_cls: 1.8673 detection_loss_cls: 0.0229 detection_loss_reg: 0.3008 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9314 instance_segmentation_loss_cls: 0.0233 instance_segmentation_loss_reg: 0.3076 instance_segmentation_loss_poly: 0.8160 2024/01/12 05:55:20 - mmengine - INFO - Iter(train) [555500/640000] base_lr: 8.4800e-06 lr: 8.4800e-07 eta: 1 day, 8:12:16 time: 1.3943 data_time: 0.0189 memory: 25641 grad_norm: 3.5430 loss: 1.0383 caption_loss_cls: 1.8692 detection_loss_cls: 0.0229 detection_loss_reg: 0.3007 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9287 instance_segmentation_loss_cls: 0.0234 instance_segmentation_loss_reg: 0.3084 instance_segmentation_loss_poly: 0.8186 2024/01/12 06:07:17 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 06:07:17 - mmengine - INFO - Iter(train) [556000/640000] base_lr: 8.3814e-06 lr: 8.3814e-07 eta: 1 day, 8:22:44 time: 1.3955 data_time: 0.0187 memory: 25641 grad_norm: 3.5301 loss: 1.0477 caption_loss_cls: 1.8714 detection_loss_cls: 0.0227 detection_loss_reg: 0.3000 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9264 instance_segmentation_loss_cls: 0.0235 instance_segmentation_loss_reg: 0.3098 instance_segmentation_loss_poly: 0.8215 2024/01/12 06:07:17 - mmengine - INFO - Saving checkpoint at 556000 iterations 2024/01/12 06:19:21 - mmengine - INFO - Iter(train) [556500/640000] base_lr: 8.2833e-06 lr: 8.2833e-07 eta: 1 day, 8:27:38 time: 1.3987 data_time: 0.0181 memory: 25641 grad_norm: 3.5867 loss: 1.0443 caption_loss_cls: 1.8726 detection_loss_cls: 0.0228 detection_loss_reg: 0.3009 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9229 instance_segmentation_loss_cls: 0.0236 instance_segmentation_loss_reg: 0.3111 instance_segmentation_loss_poly: 0.8232 2024/01/12 06:31:07 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 06:31:07 - mmengine - INFO - Iter(train) [557000/640000] base_lr: 8.1858e-06 lr: 8.1858e-07 eta: 1 day, 8:18:23 time: 1.3993 data_time: 0.0181 memory: 25641 grad_norm: 3.5808 loss: 1.0496 caption_loss_cls: 1.8709 detection_loss_cls: 0.0228 detection_loss_reg: 0.3013 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9200 instance_segmentation_loss_cls: 0.0235 instance_segmentation_loss_reg: 0.3107 instance_segmentation_loss_poly: 0.8212 2024/01/12 06:44:30 - mmengine - INFO - Iter(train) [557500/640000] base_lr: 8.0888e-06 lr: 8.0888e-07 eta: 1 day, 8:46:37 time: 1.4259 data_time: 0.0180 memory: 25641 grad_norm: 3.5907 loss: 1.0544 caption_loss_cls: 1.8796 detection_loss_cls: 0.0230 detection_loss_reg: 0.3019 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9198 instance_segmentation_loss_cls: 0.0235 instance_segmentation_loss_reg: 0.3103 instance_segmentation_loss_poly: 0.8213 2024/01/12 06:57:35 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 06:57:35 - mmengine - INFO - Iter(train) [558000/640000] base_lr: 7.9924e-06 lr: 7.9924e-07 eta: 1 day, 8:58:26 time: 1.4476 data_time: 0.0177 memory: 25641 grad_norm: 3.5545 loss: 1.0376 caption_loss_cls: 1.8759 detection_loss_cls: 0.0231 detection_loss_reg: 0.3018 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9190 instance_segmentation_loss_cls: 0.0234 instance_segmentation_loss_reg: 0.3094 instance_segmentation_loss_poly: 0.8198 2024/01/12 06:57:35 - mmengine - INFO - Saving checkpoint at 558000 iterations 2024/01/12 07:09:27 - mmengine - INFO - Iter(train) [558500/640000] base_lr: 7.8965e-06 lr: 7.8965e-07 eta: 1 day, 8:42:40 time: 1.4542 data_time: 0.0239 memory: 25641 grad_norm: 3.6103 loss: 1.0497 caption_loss_cls: 1.8720 detection_loss_cls: 0.0231 detection_loss_reg: 0.3023 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9214 instance_segmentation_loss_cls: 0.0235 instance_segmentation_loss_reg: 0.3102 instance_segmentation_loss_poly: 0.8219 2024/01/12 07:21:33 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 07:21:33 - mmengine - INFO - Iter(train) [559000/640000] base_lr: 7.8012e-06 lr: 7.8012e-07 eta: 1 day, 8:31:21 time: 1.4658 data_time: 0.0240 memory: 25641 grad_norm: 3.5428 loss: 1.0494 caption_loss_cls: 1.8715 detection_loss_cls: 0.0233 detection_loss_reg: 0.3044 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9222 instance_segmentation_loss_cls: 0.0234 instance_segmentation_loss_reg: 0.3105 instance_segmentation_loss_poly: 0.8222 2024/01/12 07:34:20 - mmengine - INFO - Iter(train) [559500/640000] base_lr: 7.7064e-06 lr: 7.7064e-07 eta: 1 day, 8:29:59 time: 1.4839 data_time: 0.0243 memory: 25641 grad_norm: 3.5551 loss: 1.0442 caption_loss_cls: 1.8682 detection_loss_cls: 0.0230 detection_loss_reg: 0.3012 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9158 instance_segmentation_loss_cls: 0.0234 instance_segmentation_loss_reg: 0.3095 instance_segmentation_loss_poly: 0.8191 2024/01/12 07:45:54 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 07:45:54 - mmengine - INFO - Iter(train) [560000/640000] base_lr: 7.6122e-06 lr: 7.6122e-07 eta: 1 day, 8:10:29 time: 1.4780 data_time: 0.0242 memory: 25641 grad_norm: 3.5636 loss: 1.0287 caption_loss_cls: 1.8630 detection_loss_cls: 0.0229 detection_loss_reg: 0.3002 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9089 instance_segmentation_loss_cls: 0.0233 instance_segmentation_loss_reg: 0.3085 instance_segmentation_loss_poly: 0.8171 2024/01/12 07:45:54 - mmengine - INFO - Saving checkpoint at 560000 iterations 2024/01/12 07:57:21 - mmengine - INFO - Evaluating bbox... 2024/01/12 07:58:19 - mmengine - INFO - bbox_mAP_copypaste: 0.527 0.708 0.576 0.369 0.573 0.682 2024/01/12 07:58:19 - mmengine - INFO - Evaluating segm... 2024/01/12 07:59:33 - mmengine - INFO - segm_mAP_copypaste: 0.356 0.626 0.352 0.205 0.402 0.546 2024/01/12 08:01:42 - mmengine - INFO - Evaluating bbox... 2024/01/12 08:02:41 - mmengine - INFO - bbox_mAP_copypaste: 0.525 0.707 0.574 0.368 0.571 0.681 2024/01/12 08:08:40 - mmengine - INFO - per class results: 2024/01/12 08:08:40 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 79.51 | 89.85 | | building | 82.89 | 92.06 | | sky | 93.67 | 97.76 | | floor | 82.71 | 90.82 | | tree | 74.89 | 87.48 | | ceiling | 86.25 | 94.44 | | road | 84.54 | 89.72 | | bed | 90.72 | 96.05 | | windowpane | 63.68 | 79.54 | | grass | 66.84 | 82.63 | | cabinet | 63.81 | 73.95 | | sidewalk | 69.11 | 81.63 | | person | 81.84 | 92.13 | | earth | 38.31 | 49.27 | | door | 55.66 | 71.18 | | table | 64.37 | 80.46 | | mountain | 60.22 | 78.21 | | plant | 55.05 | 66.21 | | curtain | 76.49 | 87.57 | | chair | 62.95 | 76.65 | | car | 85.26 | 92.38 | | water | 60.25 | 73.02 | | painting | 74.56 | 87.53 | | sofa | 72.86 | 84.34 | | shelf | 46.64 | 67.51 | | house | 45.91 | 60.15 | | sea | 61.4 | 76.88 | | mirror | 69.3 | 78.13 | | rug | 65.52 | 76.08 | | field | 37.16 | 59.08 | | armchair | 52.66 | 71.73 | | seat | 67.89 | 82.01 | | fence | 48.05 | 66.36 | | desk | 53.66 | 69.07 | | rock | 47.3 | 68.11 | | wardrobe | 46.52 | 66.77 | | lamp | 67.75 | 80.54 | | bathtub | 80.38 | 89.87 | | railing | 41.68 | 57.88 | | cushion | 64.3 | 78.18 | | base | 20.8 | 28.49 | | box | 30.83 | 40.21 | | column | 56.95 | 67.33 | | signboard | 39.05 | 52.59 | | chest of drawers | 39.79 | 61.64 | | counter | 29.26 | 41.64 | | sand | 45.96 | 67.95 | | sink | 76.5 | 85.25 | | skyscraper | 47.13 | 58.54 | | fireplace | 76.26 | 88.62 | | refrigerator | 81.5 | 87.09 | | grandstand | 47.89 | 83.77 | | path | 24.08 | 35.4 | | stairs | 33.52 | 40.25 | | runway | 73.97 | 94.87 | | case | 52.8 | 68.44 | | pool table | 92.21 | 96.15 | | pillow | 61.67 | 71.7 | | screen door | 68.1 | 71.56 | | stairway | 33.29 | 43.92 | | river | 14.52 | 32.06 | | bridge | 68.6 | 79.24 | | bookcase | 41.0 | 61.03 | | blind | 38.8 | 43.98 | | coffee table | 64.43 | 82.05 | | toilet | 87.37 | 92.19 | | flower | 43.28 | 57.78 | | book | 48.87 | 71.09 | | hill | 16.58 | 24.39 | | bench | 61.67 | 71.69 | | countertop | 62.65 | 79.08 | | stove | 81.33 | 85.55 | | palm | 48.89 | 73.69 | | kitchen island | 49.78 | 73.92 | | computer | 78.22 | 89.63 | | swivel chair | 45.53 | 59.26 | | boat | 66.46 | 88.37 | | bar | 38.19 | 48.64 | | arcade machine | 69.71 | 73.36 | | hovel | 33.24 | 36.49 | | bus | 86.18 | 94.73 | | towel | 67.36 | 79.56 | | light | 53.05 | 62.73 | | truck | 50.16 | 66.21 | | tower | 35.39 | 58.51 | | chandelier | 67.61 | 78.14 | | awning | 32.56 | 41.2 | | streetlight | 35.16 | 47.66 | | booth | 41.13 | 54.13 | | television receiver | 74.88 | 89.8 | | airplane | 70.66 | 81.17 | | dirt track | 7.46 | 22.89 | | apparel | 33.57 | 47.93 | | pole | 29.66 | 42.65 | | land | 3.07 | 4.05 | | bannister | 16.76 | 23.58 | | escalator | 23.32 | 24.38 | | ottoman | 53.62 | 73.14 | | bottle | 28.35 | 36.63 | | buffet | 59.07 | 65.89 | | poster | 34.75 | 46.66 | | stage | 10.85 | 15.73 | | van | 47.77 | 62.52 | | ship | 16.3 | 17.15 | | fountain | 21.89 | 22.21 | | conveyer belt | 69.86 | 91.56 | | canopy | 37.2 | 46.3 | | washer | 68.53 | 72.21 | | plaything | 31.8 | 40.95 | | swimming pool | 67.19 | 69.83 | | stool | 44.26 | 62.99 | | barrel | 22.94 | 79.24 | | basket | 35.09 | 45.53 | | waterfall | 65.96 | 88.19 | | tent | 73.0 | 97.38 | | bag | 25.17 | 34.19 | | minibike | 73.66 | 85.31 | | cradle | 85.8 | 96.42 | | oven | 53.27 | 65.23 | | ball | 48.16 | 60.8 | | food | 54.26 | 59.44 | | step | 7.55 | 9.28 | | tank | 52.35 | 56.27 | | trade name | 26.6 | 32.24 | | microwave | 87.28 | 93.92 | | pot | 49.81 | 59.66 | | animal | 64.14 | 68.86 | | bicycle | 58.89 | 74.69 | | lake | 57.79 | 65.12 | | dishwasher | 72.84 | 87.9 | | screen | 64.85 | 87.94 | | blanket | 30.83 | 37.8 | | sculpture | 70.77 | 81.33 | | hood | 65.28 | 72.84 | | sconce | 46.64 | 57.94 | | vase | 44.25 | 63.14 | | traffic light | 40.88 | 59.32 | | tray | 20.92 | 30.55 | | ashcan | 44.87 | 56.81 | | fan | 64.1 | 75.84 | | pier | 40.24 | 44.41 | | crt screen | 3.06 | 7.86 | | plate | 58.5 | 77.06 | | monitor | 12.93 | 14.66 | | bulletin board | 55.7 | 65.38 | | shower | 2.88 | 4.99 | | radiator | 61.86 | 72.69 | | glass | 19.72 | 22.1 | | clock | 38.5 | 45.77 | | flag | 44.21 | 51.23 | +---------------------+-------+-------+ 2024/01/12 08:08:53 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.5250 coco/bbox_mAP_50: 0.7070 coco/bbox_mAP_75: 0.5740 coco/bbox_mAP_s: 0.3680 coco/bbox_mAP_m: 0.5710 coco/bbox_mAP_l: 0.6810 coco/segm_mAP: 0.3560 coco/segm_mAP_50: 0.6260 coco/segm_mAP_75: 0.3520 coco/segm_mAP_s: 0.2050 coco/segm_mAP_m: 0.4020 coco/segm_mAP_l: 0.5460 Bleu_1: 0.7731 Bleu_2: 0.6114 Bleu_3: 0.4711 Bleu_4: 0.3586 METEOR: 0.2824 ROUGE_L: 0.5714 CIDEr: 1.1774 SPICE: 0.2090 aAcc: 84.4400 mIoU: 52.5200 mAcc: 64.5600 visual-grounding/miou: 0.8366 visual-grounding/acc: 0.8923 data_time: 0.0278 time: 1.9142 2024/01/12 08:20:23 - mmengine - INFO - Iter(train) [560500/640000] base_lr: 7.5186e-06 lr: 7.5186e-07 eta: 1 day, 7:51:48 time: 1.4700 data_time: 0.0185 memory: 34733 grad_norm: 3.4860 loss: 1.0227 caption_loss_cls: 1.8629 detection_loss_cls: 0.0230 detection_loss_reg: 0.3010 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9107 instance_segmentation_loss_cls: 0.0234 instance_segmentation_loss_reg: 0.3097 instance_segmentation_loss_poly: 0.8178 2024/01/12 08:32:06 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 08:32:06 - mmengine - INFO - Iter(train) [561000/640000] base_lr: 7.4255e-06 lr: 7.4255e-07 eta: 1 day, 7:36:16 time: 1.4694 data_time: 0.0188 memory: 25641 grad_norm: 3.5082 loss: 1.0312 caption_loss_cls: 1.8606 detection_loss_cls: 0.0229 detection_loss_reg: 0.3015 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.9101 instance_segmentation_loss_cls: 0.0235 instance_segmentation_loss_reg: 0.3114 instance_segmentation_loss_poly: 0.8226 2024/01/12 08:43:20 - mmengine - INFO - Iter(train) [561500/640000] base_lr: 7.3330e-06 lr: 7.3330e-07 eta: 1 day, 7:15:59 time: 1.4370 data_time: 0.0189 memory: 25641 grad_norm: 3.5759 loss: 1.0314 caption_loss_cls: 1.8631 detection_loss_cls: 0.0229 detection_loss_reg: 0.3004 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9095 instance_segmentation_loss_cls: 0.0235 instance_segmentation_loss_reg: 0.3116 instance_segmentation_loss_poly: 0.8227 2024/01/12 08:55:07 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 08:55:07 - mmengine - INFO - Iter(train) [562000/640000] base_lr: 7.2410e-06 lr: 7.2410e-07 eta: 1 day, 7:02:29 time: 1.4177 data_time: 0.0191 memory: 25641 grad_norm: 3.6020 loss: 1.0498 caption_loss_cls: 1.8630 detection_loss_cls: 0.0229 detection_loss_reg: 0.3016 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.9110 instance_segmentation_loss_cls: 0.0235 instance_segmentation_loss_reg: 0.3115 instance_segmentation_loss_poly: 0.8231 2024/01/12 08:55:07 - mmengine - INFO - Saving checkpoint at 562000 iterations 2024/01/12 09:07:09 - mmengine - INFO - Iter(train) [562500/640000] base_lr: 7.1496e-06 lr: 7.1496e-07 eta: 1 day, 6:51:10 time: 1.4200 data_time: 0.0208 memory: 25641 grad_norm: 3.6061 loss: 1.0519 caption_loss_cls: 1.8600 detection_loss_cls: 0.0230 detection_loss_reg: 0.3024 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.9117 instance_segmentation_loss_cls: 0.0235 instance_segmentation_loss_reg: 0.3118 instance_segmentation_loss_poly: 0.8242 2024/01/12 09:18:47 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 09:18:47 - mmengine - INFO - Iter(train) [563000/640000] base_lr: 7.0587e-06 lr: 7.0587e-07 eta: 1 day, 6:36:31 time: 1.4131 data_time: 0.0209 memory: 25641 grad_norm: 3.6325 loss: 1.0582 caption_loss_cls: 1.8594 detection_loss_cls: 0.0230 detection_loss_reg: 0.3039 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.9086 instance_segmentation_loss_cls: 0.0233 instance_segmentation_loss_reg: 0.3100 instance_segmentation_loss_poly: 0.8208 2024/01/12 09:30:25 - mmengine - INFO - Iter(train) [563500/640000] base_lr: 6.9684e-06 lr: 6.9684e-07 eta: 1 day, 6:22:12 time: 1.3959 data_time: 0.0209 memory: 25641 grad_norm: 3.6064 loss: 1.0538 caption_loss_cls: 1.8555 detection_loss_cls: 0.0229 detection_loss_reg: 0.3033 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.9084 instance_segmentation_loss_cls: 0.0234 instance_segmentation_loss_reg: 0.3108 instance_segmentation_loss_poly: 0.8216 2024/01/12 09:42:03 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 09:42:03 - mmengine - INFO - Iter(train) [564000/640000] base_lr: 6.8787e-06 lr: 6.8787e-07 eta: 1 day, 6:08:10 time: 1.3969 data_time: 0.0213 memory: 25641 grad_norm: 3.6357 loss: 1.0683 caption_loss_cls: 1.8510 detection_loss_cls: 0.0229 detection_loss_reg: 0.3039 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.9073 instance_segmentation_loss_cls: 0.0231 instance_segmentation_loss_reg: 0.3097 instance_segmentation_loss_poly: 0.8186 2024/01/12 09:42:03 - mmengine - INFO - Saving checkpoint at 564000 iterations 2024/01/12 09:54:17 - mmengine - INFO - Iter(train) [564500/640000] base_lr: 6.7895e-06 lr: 6.7895e-07 eta: 1 day, 5:58:33 time: 1.4073 data_time: 0.0279 memory: 25641 grad_norm: 3.5859 loss: 1.0544 caption_loss_cls: 1.8483 detection_loss_cls: 0.0229 detection_loss_reg: 0.3042 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.9076 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3085 instance_segmentation_loss_poly: 0.8157 2024/01/12 10:05:47 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 10:05:47 - mmengine - INFO - Iter(train) [565000/640000] base_lr: 6.7009e-06 lr: 6.7009e-07 eta: 1 day, 5:43:51 time: 1.4041 data_time: 0.0278 memory: 25641 grad_norm: 3.6094 loss: 1.0498 caption_loss_cls: 1.8487 detection_loss_cls: 0.0228 detection_loss_reg: 0.3040 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.9041 instance_segmentation_loss_cls: 0.0229 instance_segmentation_loss_reg: 0.3081 instance_segmentation_loss_poly: 0.8155 2024/01/12 10:16:56 - mmengine - INFO - Iter(train) [565500/640000] base_lr: 6.6128e-06 lr: 6.6128e-07 eta: 1 day, 5:27:05 time: 1.4031 data_time: 0.0278 memory: 25641 grad_norm: 3.5916 loss: 1.0565 caption_loss_cls: 1.8506 detection_loss_cls: 0.0227 detection_loss_reg: 0.3049 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.9034 instance_segmentation_loss_cls: 0.0231 instance_segmentation_loss_reg: 0.3094 instance_segmentation_loss_poly: 0.8194 2024/01/12 10:28:45 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 10:28:45 - mmengine - INFO - Iter(train) [566000/640000] base_lr: 6.5254e-06 lr: 6.5254e-07 eta: 1 day, 5:14:53 time: 1.4033 data_time: 0.0278 memory: 25641 grad_norm: 3.5987 loss: 1.0562 caption_loss_cls: 1.8491 detection_loss_cls: 0.0227 detection_loss_reg: 0.3047 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.9052 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3096 instance_segmentation_loss_poly: 0.8195 2024/01/12 10:28:45 - mmengine - INFO - Saving checkpoint at 566000 iterations 2024/01/12 10:40:18 - mmengine - INFO - Iter(train) [566500/640000] base_lr: 6.4384e-06 lr: 6.4384e-07 eta: 1 day, 5:01:14 time: 1.3964 data_time: 0.0271 memory: 25641 grad_norm: 3.5901 loss: 1.0469 caption_loss_cls: 1.8489 detection_loss_cls: 0.0227 detection_loss_reg: 0.3059 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8989 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3088 instance_segmentation_loss_poly: 0.8174 2024/01/12 10:52:05 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 10:52:05 - mmengine - INFO - Iter(train) [567000/640000] base_lr: 6.3521e-06 lr: 6.3521e-07 eta: 1 day, 4:48:55 time: 1.3985 data_time: 0.0271 memory: 25641 grad_norm: 3.5602 loss: 1.0327 caption_loss_cls: 1.8471 detection_loss_cls: 0.0229 detection_loss_reg: 0.3073 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8977 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3087 instance_segmentation_loss_poly: 0.8177 2024/01/12 11:03:27 - mmengine - INFO - Iter(train) [567500/640000] base_lr: 6.2663e-06 lr: 6.2663e-07 eta: 1 day, 4:34:28 time: 1.3944 data_time: 0.0272 memory: 25641 grad_norm: 3.5809 loss: 1.0386 caption_loss_cls: 1.8317 detection_loss_cls: 0.0228 detection_loss_reg: 0.3061 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8984 instance_segmentation_loss_cls: 0.0231 instance_segmentation_loss_reg: 0.3102 instance_segmentation_loss_poly: 0.8219 2024/01/12 11:14:52 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 11:14:52 - mmengine - INFO - Iter(train) [568000/640000] base_lr: 6.1810e-06 lr: 6.1810e-07 eta: 1 day, 4:20:33 time: 1.3912 data_time: 0.0271 memory: 25641 grad_norm: 3.5768 loss: 1.0354 caption_loss_cls: 1.8336 detection_loss_cls: 0.0228 detection_loss_reg: 0.3053 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8953 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3082 instance_segmentation_loss_poly: 0.8175 2024/01/12 11:14:52 - mmengine - INFO - Saving checkpoint at 568000 iterations 2024/01/12 11:27:05 - mmengine - INFO - Iter(train) [568500/640000] base_lr: 6.0964e-06 lr: 6.0964e-07 eta: 1 day, 4:10:40 time: 1.3910 data_time: 0.0270 memory: 25641 grad_norm: 3.5785 loss: 1.0513 caption_loss_cls: 1.8358 detection_loss_cls: 0.0227 detection_loss_reg: 0.3035 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8961 instance_segmentation_loss_cls: 0.0231 instance_segmentation_loss_reg: 0.3088 instance_segmentation_loss_poly: 0.8183 2024/01/12 11:38:49 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 11:38:49 - mmengine - INFO - Iter(train) [569000/640000] base_lr: 6.0123e-06 lr: 6.0123e-07 eta: 1 day, 3:58:24 time: 1.3944 data_time: 0.0270 memory: 25641 grad_norm: 3.5629 loss: 1.0513 caption_loss_cls: 1.8343 detection_loss_cls: 0.0228 detection_loss_reg: 0.3046 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8940 instance_segmentation_loss_cls: 0.0231 instance_segmentation_loss_reg: 0.3084 instance_segmentation_loss_poly: 0.8173 2024/01/12 11:50:10 - mmengine - INFO - Iter(train) [569500/640000] base_lr: 5.9287e-06 lr: 5.9287e-07 eta: 1 day, 3:44:24 time: 1.3974 data_time: 0.0270 memory: 25641 grad_norm: 3.5213 loss: 1.0368 caption_loss_cls: 1.8364 detection_loss_cls: 0.0228 detection_loss_reg: 0.3049 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8917 instance_segmentation_loss_cls: 0.0232 instance_segmentation_loss_reg: 0.3098 instance_segmentation_loss_poly: 0.8199 2024/01/12 12:02:02 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 12:02:02 - mmengine - INFO - Iter(train) [570000/640000] base_lr: 5.8458e-06 lr: 5.8458e-07 eta: 1 day, 3:32:47 time: 1.3981 data_time: 0.0270 memory: 25641 grad_norm: 3.5010 loss: 1.0283 caption_loss_cls: 1.8383 detection_loss_cls: 0.0226 detection_loss_reg: 0.3036 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8868 instance_segmentation_loss_cls: 0.0232 instance_segmentation_loss_reg: 0.3110 instance_segmentation_loss_poly: 0.8215 2024/01/12 12:02:02 - mmengine - INFO - Saving checkpoint at 570000 iterations 2024/01/12 12:14:17 - mmengine - INFO - Iter(train) [570500/640000] base_lr: 5.7634e-06 lr: 5.7634e-07 eta: 1 day, 3:22:49 time: 1.4084 data_time: 0.0271 memory: 25641 grad_norm: 3.4925 loss: 1.0247 caption_loss_cls: 1.8380 detection_loss_cls: 0.0226 detection_loss_reg: 0.3047 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8819 instance_segmentation_loss_cls: 0.0232 instance_segmentation_loss_reg: 0.3100 instance_segmentation_loss_poly: 0.8191 2024/01/12 12:25:39 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 12:25:39 - mmengine - INFO - Iter(train) [571000/640000] base_lr: 5.6815e-06 lr: 5.6815e-07 eta: 1 day, 3:09:11 time: 1.4026 data_time: 0.0271 memory: 25641 grad_norm: 3.5065 loss: 1.0330 caption_loss_cls: 1.8380 detection_loss_cls: 0.0228 detection_loss_reg: 0.3063 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8831 instance_segmentation_loss_cls: 0.0232 instance_segmentation_loss_reg: 0.3102 instance_segmentation_loss_poly: 0.8195 2024/01/12 12:37:06 - mmengine - INFO - Iter(train) [571500/640000] base_lr: 5.6003e-06 lr: 5.6003e-07 eta: 1 day, 2:55:56 time: 1.4037 data_time: 0.0270 memory: 25641 grad_norm: 3.5089 loss: 1.0396 caption_loss_cls: 1.8399 detection_loss_cls: 0.0227 detection_loss_reg: 0.3056 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8826 instance_segmentation_loss_cls: 0.0231 instance_segmentation_loss_reg: 0.3098 instance_segmentation_loss_poly: 0.8183 2024/01/12 12:48:44 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 12:48:44 - mmengine - INFO - Iter(train) [572000/640000] base_lr: 5.5196e-06 lr: 5.5196e-07 eta: 1 day, 2:43:27 time: 1.4067 data_time: 0.0270 memory: 25641 grad_norm: 3.4568 loss: 1.0331 caption_loss_cls: 1.8405 detection_loss_cls: 0.0229 detection_loss_reg: 0.3068 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8835 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3087 instance_segmentation_loss_poly: 0.8164 2024/01/12 12:48:44 - mmengine - INFO - Saving checkpoint at 572000 iterations 2024/01/12 13:00:46 - mmengine - INFO - Iter(train) [572500/640000] base_lr: 5.4394e-06 lr: 5.4394e-07 eta: 1 day, 2:32:35 time: 1.4043 data_time: 0.0269 memory: 25641 grad_norm: 3.5140 loss: 1.0312 caption_loss_cls: 1.8433 detection_loss_cls: 0.0228 detection_loss_reg: 0.3061 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8794 instance_segmentation_loss_cls: 0.0232 instance_segmentation_loss_reg: 0.3103 instance_segmentation_loss_poly: 0.8193 2024/01/12 13:11:59 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 13:11:59 - mmengine - INFO - Iter(train) [573000/640000] base_lr: 5.3599e-06 lr: 5.3599e-07 eta: 1 day, 2:18:40 time: 1.3964 data_time: 0.0269 memory: 25641 grad_norm: 3.5299 loss: 1.0377 caption_loss_cls: 1.8437 detection_loss_cls: 0.0228 detection_loss_reg: 0.3058 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8782 instance_segmentation_loss_cls: 0.0232 instance_segmentation_loss_reg: 0.3106 instance_segmentation_loss_poly: 0.8202 2024/01/12 13:24:02 - mmengine - INFO - Iter(train) [573500/640000] base_lr: 5.2809e-06 lr: 5.2809e-07 eta: 1 day, 2:07:46 time: 1.4068 data_time: 0.0270 memory: 25641 grad_norm: 3.5435 loss: 1.0298 caption_loss_cls: 1.8387 detection_loss_cls: 0.0229 detection_loss_reg: 0.3060 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8786 instance_segmentation_loss_cls: 0.0232 instance_segmentation_loss_reg: 0.3104 instance_segmentation_loss_poly: 0.8197 2024/01/12 13:35:25 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 13:35:25 - mmengine - INFO - Iter(train) [574000/640000] base_lr: 5.2025e-06 lr: 5.2025e-07 eta: 1 day, 1:54:36 time: 1.3997 data_time: 0.0269 memory: 25641 grad_norm: 3.6213 loss: 1.0423 caption_loss_cls: 1.8434 detection_loss_cls: 0.0229 detection_loss_reg: 0.3063 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8783 instance_segmentation_loss_cls: 0.0231 instance_segmentation_loss_reg: 0.3098 instance_segmentation_loss_poly: 0.8183 2024/01/12 13:35:25 - mmengine - INFO - Saving checkpoint at 574000 iterations 2024/01/12 13:47:38 - mmengine - INFO - Iter(train) [574500/640000] base_lr: 5.1246e-06 lr: 5.1246e-07 eta: 1 day, 1:44:13 time: 1.3992 data_time: 0.0268 memory: 25641 grad_norm: 3.5989 loss: 1.0555 caption_loss_cls: 1.8493 detection_loss_cls: 0.0230 detection_loss_reg: 0.3070 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8809 instance_segmentation_loss_cls: 0.0231 instance_segmentation_loss_reg: 0.3106 instance_segmentation_loss_poly: 0.8197 2024/01/12 13:58:53 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 13:58:53 - mmengine - INFO - Iter(train) [575000/640000] base_lr: 5.0473e-06 lr: 5.0473e-07 eta: 1 day, 1:30:44 time: 1.3972 data_time: 0.0268 memory: 25641 grad_norm: 3.6411 loss: 1.0548 caption_loss_cls: 1.8463 detection_loss_cls: 0.0229 detection_loss_reg: 0.3064 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8837 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3094 instance_segmentation_loss_poly: 0.8167 2024/01/12 14:10:38 - mmengine - INFO - Iter(train) [575500/640000] base_lr: 4.9706e-06 lr: 4.9706e-07 eta: 1 day, 1:18:51 time: 1.4018 data_time: 0.0268 memory: 25641 grad_norm: 3.6443 loss: 1.0536 caption_loss_cls: 1.8512 detection_loss_cls: 0.0231 detection_loss_reg: 0.3083 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8829 instance_segmentation_loss_cls: 0.0231 instance_segmentation_loss_reg: 0.3103 instance_segmentation_loss_poly: 0.8184 2024/01/12 14:22:36 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 14:22:36 - mmengine - INFO - Iter(train) [576000/640000] base_lr: 4.8945e-06 lr: 4.8945e-07 eta: 1 day, 1:07:38 time: 1.4070 data_time: 0.0269 memory: 25641 grad_norm: 3.6249 loss: 1.0537 caption_loss_cls: 1.8500 detection_loss_cls: 0.0233 detection_loss_reg: 0.3098 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8850 instance_segmentation_loss_cls: 0.0232 instance_segmentation_loss_reg: 0.3114 instance_segmentation_loss_poly: 0.8198 2024/01/12 14:22:36 - mmengine - INFO - Saving checkpoint at 576000 iterations 2024/01/12 14:34:48 - mmengine - INFO - Iter(train) [576500/640000] base_lr: 4.8189e-06 lr: 4.8189e-07 eta: 1 day, 0:57:01 time: 1.4093 data_time: 0.0268 memory: 25641 grad_norm: 3.5818 loss: 1.0415 caption_loss_cls: 1.8529 detection_loss_cls: 0.0232 detection_loss_reg: 0.3091 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8828 instance_segmentation_loss_cls: 0.0231 instance_segmentation_loss_reg: 0.3102 instance_segmentation_loss_poly: 0.8170 2024/01/12 14:45:59 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 14:45:59 - mmengine - INFO - Iter(train) [577000/640000] base_lr: 4.7440e-06 lr: 4.7440e-07 eta: 1 day, 0:43:32 time: 1.4088 data_time: 0.0267 memory: 25641 grad_norm: 3.5754 loss: 1.0422 caption_loss_cls: 1.8455 detection_loss_cls: 0.0232 detection_loss_reg: 0.3102 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8862 instance_segmentation_loss_cls: 0.0229 instance_segmentation_loss_reg: 0.3085 instance_segmentation_loss_poly: 0.8139 2024/01/12 14:57:17 - mmengine - INFO - Iter(train) [577500/640000] base_lr: 4.6695e-06 lr: 4.6695e-07 eta: 1 day, 0:30:30 time: 1.3978 data_time: 0.0267 memory: 25641 grad_norm: 3.5764 loss: 1.0595 caption_loss_cls: 1.8477 detection_loss_cls: 0.0230 detection_loss_reg: 0.3102 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8881 instance_segmentation_loss_cls: 0.0231 instance_segmentation_loss_reg: 0.3091 instance_segmentation_loss_poly: 0.8169 2024/01/12 15:08:47 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 15:08:47 - mmengine - INFO - Iter(train) [578000/640000] base_lr: 4.5957e-06 lr: 4.5957e-07 eta: 1 day, 0:18:00 time: 1.3995 data_time: 0.0267 memory: 25641 grad_norm: 3.5525 loss: 1.0528 caption_loss_cls: 1.8504 detection_loss_cls: 0.0231 detection_loss_reg: 0.3103 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8831 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3073 instance_segmentation_loss_poly: 0.8128 2024/01/12 15:08:47 - mmengine - INFO - Saving checkpoint at 578000 iterations 2024/01/12 15:20:53 - mmengine - INFO - Iter(train) [578500/640000] base_lr: 4.5224e-06 lr: 4.5224e-07 eta: 1 day, 0:07:05 time: 1.3976 data_time: 0.0267 memory: 25641 grad_norm: 3.5440 loss: 1.0388 caption_loss_cls: 1.8518 detection_loss_cls: 0.0232 detection_loss_reg: 0.3109 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8828 instance_segmentation_loss_cls: 0.0231 instance_segmentation_loss_reg: 0.3084 instance_segmentation_loss_poly: 0.8138 2024/01/12 15:33:04 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 15:33:04 - mmengine - INFO - Iter(train) [579000/640000] base_lr: 4.4498e-06 lr: 4.4498e-07 eta: 23:56:20 time: 1.4118 data_time: 0.0268 memory: 25641 grad_norm: 3.4977 loss: 1.0339 caption_loss_cls: 1.8544 detection_loss_cls: 0.0232 detection_loss_reg: 0.3089 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8838 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3062 instance_segmentation_loss_poly: 0.8102 2024/01/12 15:44:15 - mmengine - INFO - Iter(train) [579500/640000] base_lr: 4.3776e-06 lr: 4.3776e-07 eta: 23:43:08 time: 1.4033 data_time: 0.0268 memory: 25641 grad_norm: 3.5525 loss: 1.0403 caption_loss_cls: 1.8591 detection_loss_cls: 0.0233 detection_loss_reg: 0.3096 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8807 instance_segmentation_loss_cls: 0.0231 instance_segmentation_loss_reg: 0.3077 instance_segmentation_loss_poly: 0.8131 2024/01/12 15:55:06 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 15:55:06 - mmengine - INFO - Iter(train) [580000/640000] base_lr: 4.3061e-06 lr: 4.3061e-07 eta: 23:29:16 time: 1.3865 data_time: 0.0266 memory: 25641 grad_norm: 3.6669 loss: 1.0465 caption_loss_cls: 1.8588 detection_loss_cls: 0.0233 detection_loss_reg: 0.3095 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8815 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3068 instance_segmentation_loss_poly: 0.8102 2024/01/12 15:55:06 - mmengine - INFO - Saving checkpoint at 580000 iterations 2024/01/12 16:06:47 - mmengine - INFO - Evaluating bbox... 2024/01/12 16:07:44 - mmengine - INFO - bbox_mAP_copypaste: 0.528 0.710 0.577 0.368 0.574 0.682 2024/01/12 16:07:44 - mmengine - INFO - Evaluating segm... 2024/01/12 16:08:57 - mmengine - INFO - segm_mAP_copypaste: 0.357 0.624 0.356 0.204 0.403 0.544 2024/01/12 16:11:05 - mmengine - INFO - Evaluating bbox... 2024/01/12 16:12:02 - mmengine - INFO - bbox_mAP_copypaste: 0.527 0.708 0.575 0.363 0.573 0.683 2024/01/12 16:18:13 - mmengine - INFO - per class results: 2024/01/12 16:18:13 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 79.55 | 89.97 | | building | 82.88 | 91.99 | | sky | 93.61 | 97.82 | | floor | 82.89 | 90.86 | | tree | 74.49 | 88.04 | | ceiling | 86.55 | 94.67 | | road | 84.63 | 90.09 | | bed | 90.89 | 96.15 | | windowpane | 64.05 | 79.99 | | grass | 66.74 | 81.5 | | cabinet | 63.89 | 74.08 | | sidewalk | 69.51 | 81.86 | | person | 82.0 | 91.85 | | earth | 38.48 | 51.77 | | door | 56.24 | 71.42 | | table | 64.98 | 80.38 | | mountain | 60.96 | 77.91 | | plant | 54.25 | 64.49 | | curtain | 76.65 | 87.4 | | chair | 62.55 | 75.92 | | car | 85.44 | 92.41 | | water | 60.64 | 72.75 | | painting | 74.3 | 87.3 | | sofa | 72.73 | 83.6 | | shelf | 47.37 | 67.21 | | house | 45.6 | 61.48 | | sea | 60.65 | 75.46 | | mirror | 69.66 | 78.53 | | rug | 66.16 | 77.06 | | field | 37.82 | 57.21 | | armchair | 52.72 | 72.44 | | seat | 67.96 | 82.45 | | fence | 48.04 | 65.72 | | desk | 53.96 | 69.11 | | rock | 49.53 | 71.5 | | wardrobe | 45.39 | 66.23 | | lamp | 67.56 | 80.12 | | bathtub | 80.36 | 89.85 | | railing | 41.84 | 58.22 | | cushion | 64.29 | 77.94 | | base | 21.53 | 29.34 | | box | 30.83 | 40.32 | | column | 57.0 | 66.44 | | signboard | 39.04 | 52.75 | | chest of drawers | 39.8 | 59.85 | | counter | 30.42 | 45.72 | | sand | 45.18 | 68.48 | | sink | 76.81 | 85.0 | | skyscraper | 47.12 | 58.17 | | fireplace | 76.15 | 87.95 | | refrigerator | 81.89 | 86.88 | | grandstand | 48.17 | 82.4 | | path | 20.91 | 29.53 | | stairs | 34.72 | 42.06 | | runway | 74.49 | 95.34 | | case | 48.85 | 62.74 | | pool table | 92.24 | 95.72 | | pillow | 61.55 | 71.77 | | screen door | 69.98 | 73.6 | | stairway | 33.9 | 43.42 | | river | 14.46 | 33.72 | | bridge | 68.77 | 80.03 | | bookcase | 41.87 | 62.07 | | blind | 38.53 | 42.99 | | coffee table | 65.24 | 81.91 | | toilet | 87.56 | 92.43 | | flower | 43.54 | 57.18 | | book | 50.2 | 68.55 | | hill | 16.31 | 23.39 | | bench | 61.39 | 71.72 | | countertop | 63.34 | 79.66 | | stove | 81.48 | 85.19 | | palm | 48.52 | 71.69 | | kitchen island | 48.53 | 72.44 | | computer | 78.18 | 88.65 | | swivel chair | 45.03 | 58.55 | | boat | 65.9 | 86.83 | | bar | 38.89 | 48.82 | | arcade machine | 70.18 | 73.69 | | hovel | 21.66 | 23.66 | | bus | 86.25 | 94.81 | | towel | 67.54 | 79.89 | | light | 53.98 | 64.84 | | truck | 50.32 | 65.94 | | tower | 34.64 | 56.95 | | chandelier | 68.23 | 79.66 | | awning | 32.24 | 38.67 | | streetlight | 34.44 | 46.78 | | booth | 45.49 | 57.5 | | television receiver | 73.94 | 85.23 | | airplane | 68.63 | 76.88 | | dirt track | 8.24 | 27.5 | | apparel | 32.24 | 46.98 | | pole | 29.36 | 44.41 | | land | 2.42 | 3.3 | | bannister | 15.4 | 21.3 | | escalator | 26.12 | 28.29 | | ottoman | 54.03 | 73.17 | | bottle | 27.51 | 35.87 | | buffet | 58.48 | 65.79 | | poster | 31.74 | 44.14 | | stage | 12.38 | 18.04 | | van | 47.61 | 62.83 | | ship | 8.88 | 9.36 | | fountain | 21.67 | 21.97 | | conveyer belt | 68.56 | 91.53 | | canopy | 38.22 | 45.01 | | washer | 68.23 | 72.06 | | plaything | 31.64 | 39.38 | | swimming pool | 67.46 | 69.22 | | stool | 44.54 | 61.57 | | barrel | 22.29 | 77.93 | | basket | 35.48 | 46.63 | | waterfall | 66.49 | 88.06 | | tent | 71.42 | 97.36 | | bag | 25.01 | 33.57 | | minibike | 73.44 | 84.33 | | cradle | 84.9 | 96.38 | | oven | 51.37 | 62.27 | | ball | 49.29 | 62.25 | | food | 54.02 | 58.51 | | step | 11.85 | 14.91 | | tank | 50.11 | 53.86 | | trade name | 28.45 | 35.31 | | microwave | 86.98 | 93.97 | | pot | 50.38 | 59.15 | | animal | 64.0 | 67.9 | | bicycle | 58.81 | 74.14 | | lake | 58.29 | 64.5 | | dishwasher | 74.97 | 86.31 | | screen | 64.69 | 85.56 | | blanket | 30.92 | 37.57 | | sculpture | 68.28 | 81.76 | | hood | 65.68 | 72.86 | | sconce | 47.91 | 59.1 | | vase | 45.59 | 62.21 | | traffic light | 40.88 | 56.92 | | tray | 20.35 | 29.76 | | ashcan | 45.27 | 56.03 | | fan | 64.25 | 76.2 | | pier | 39.47 | 43.98 | | crt screen | 5.28 | 14.39 | | plate | 58.48 | 77.15 | | monitor | 7.73 | 8.73 | | bulletin board | 57.51 | 67.37 | | shower | 3.56 | 5.68 | | radiator | 61.18 | 72.15 | | glass | 19.12 | 21.1 | | clock | 40.68 | 48.89 | | flag | 44.11 | 51.08 | +---------------------+-------+-------+ 2024/01/12 16:18:25 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.5270 coco/bbox_mAP_50: 0.7080 coco/bbox_mAP_75: 0.5750 coco/bbox_mAP_s: 0.3630 coco/bbox_mAP_m: 0.5730 coco/bbox_mAP_l: 0.6830 coco/segm_mAP: 0.3570 coco/segm_mAP_50: 0.6240 coco/segm_mAP_75: 0.3560 coco/segm_mAP_s: 0.2040 coco/segm_mAP_m: 0.4030 coco/segm_mAP_l: 0.5440 Bleu_1: 0.7744 Bleu_2: 0.6158 Bleu_3: 0.4756 Bleu_4: 0.3634 METEOR: 0.2838 ROUGE_L: 0.5750 CIDEr: 1.1833 SPICE: 0.2102 aAcc: 84.4700 mIoU: 52.4500 mAcc: 64.2700 visual-grounding/miou: 0.8371 visual-grounding/acc: 0.8915 data_time: 0.0116 time: 1.9035 2024/01/12 16:29:47 - mmengine - INFO - Iter(train) [580500/640000] base_lr: 4.2351e-06 lr: 4.2351e-07 eta: 23:16:43 time: 1.3745 data_time: 0.0201 memory: 34732 grad_norm: 3.7168 loss: 1.0508 caption_loss_cls: 1.8582 detection_loss_cls: 0.0233 detection_loss_reg: 0.3090 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8806 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3070 instance_segmentation_loss_poly: 0.8101 2024/01/12 16:41:31 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 16:41:31 - mmengine - INFO - Iter(train) [581000/640000] base_lr: 4.1648e-06 lr: 4.1648e-07 eta: 23:04:58 time: 1.3829 data_time: 0.0203 memory: 25638 grad_norm: 3.7094 loss: 1.0425 caption_loss_cls: 1.8558 detection_loss_cls: 0.0233 detection_loss_reg: 0.3083 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8754 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3056 instance_segmentation_loss_poly: 0.8091 2024/01/12 16:52:44 - mmengine - INFO - Iter(train) [581500/640000] base_lr: 4.0950e-06 lr: 4.0950e-07 eta: 22:52:06 time: 1.3815 data_time: 0.0203 memory: 25638 grad_norm: 3.7468 loss: 1.0458 caption_loss_cls: 1.8493 detection_loss_cls: 0.0236 detection_loss_reg: 0.3097 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8737 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3061 instance_segmentation_loss_poly: 0.8095 2024/01/12 17:04:33 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 17:04:33 - mmengine - INFO - Iter(train) [582000/640000] base_lr: 4.0257e-06 lr: 4.0257e-07 eta: 22:40:33 time: 1.3864 data_time: 0.0204 memory: 25638 grad_norm: 3.7013 loss: 1.0368 caption_loss_cls: 1.8467 detection_loss_cls: 0.0235 detection_loss_reg: 0.3087 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8714 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3056 instance_segmentation_loss_poly: 0.8088 2024/01/12 17:04:33 - mmengine - INFO - Saving checkpoint at 582000 iterations 2024/01/12 17:17:02 - mmengine - INFO - Iter(train) [582500/640000] base_lr: 3.9571e-06 lr: 3.9571e-07 eta: 22:30:20 time: 1.3923 data_time: 0.0205 memory: 25638 grad_norm: 3.6952 loss: 1.0342 caption_loss_cls: 1.8415 detection_loss_cls: 0.0235 detection_loss_reg: 0.3074 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8685 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3060 instance_segmentation_loss_poly: 0.8079 2024/01/12 17:28:36 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 17:28:36 - mmengine - INFO - Iter(train) [583000/640000] base_lr: 3.8890e-06 lr: 3.8890e-07 eta: 22:18:13 time: 1.3828 data_time: 0.0206 memory: 25638 grad_norm: 3.7365 loss: 1.0434 caption_loss_cls: 1.8438 detection_loss_cls: 0.0235 detection_loss_reg: 0.3077 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8696 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3058 instance_segmentation_loss_poly: 0.8074 2024/01/12 17:40:18 - mmengine - INFO - Iter(train) [583500/640000] base_lr: 3.8215e-06 lr: 3.8215e-07 eta: 22:06:23 time: 1.3906 data_time: 0.0206 memory: 25638 grad_norm: 3.6821 loss: 1.0327 caption_loss_cls: 1.8464 detection_loss_cls: 0.0237 detection_loss_reg: 0.3085 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8675 instance_segmentation_loss_cls: 0.0233 instance_segmentation_loss_reg: 0.3078 instance_segmentation_loss_poly: 0.8112 2024/01/12 17:52:46 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 17:52:46 - mmengine - INFO - Iter(train) [584000/640000] base_lr: 3.7546e-06 lr: 3.7546e-07 eta: 21:55:59 time: 1.4146 data_time: 0.0210 memory: 25638 grad_norm: 3.5861 loss: 1.0279 caption_loss_cls: 1.8469 detection_loss_cls: 0.0236 detection_loss_reg: 0.3067 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8708 instance_segmentation_loss_cls: 0.0233 instance_segmentation_loss_reg: 0.3077 instance_segmentation_loss_poly: 0.8116 2024/01/12 17:52:46 - mmengine - INFO - Saving checkpoint at 584000 iterations 2024/01/12 18:04:30 - mmengine - INFO - Iter(train) [584500/640000] base_lr: 3.6883e-06 lr: 3.6883e-07 eta: 21:44:11 time: 1.4198 data_time: 0.0275 memory: 25638 grad_norm: 3.5918 loss: 1.0363 caption_loss_cls: 1.8454 detection_loss_cls: 0.0237 detection_loss_reg: 0.3085 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8703 instance_segmentation_loss_cls: 0.0231 instance_segmentation_loss_reg: 0.3060 instance_segmentation_loss_poly: 0.8097 2024/01/12 18:16:22 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 18:16:22 - mmengine - INFO - Iter(train) [585000/640000] base_lr: 3.6225e-06 lr: 3.6225e-07 eta: 21:32:37 time: 1.4216 data_time: 0.0276 memory: 25638 grad_norm: 3.5794 loss: 1.0387 caption_loss_cls: 1.8434 detection_loss_cls: 0.0236 detection_loss_reg: 0.3075 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8707 instance_segmentation_loss_cls: 0.0232 instance_segmentation_loss_reg: 0.3067 instance_segmentation_loss_poly: 0.8115 2024/01/12 18:27:25 - mmengine - INFO - Iter(train) [585500/640000] base_lr: 3.5574e-06 lr: 3.5574e-07 eta: 21:19:40 time: 1.4193 data_time: 0.0274 memory: 25638 grad_norm: 3.5627 loss: 1.0290 caption_loss_cls: 1.8404 detection_loss_cls: 0.0236 detection_loss_reg: 0.3064 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8688 instance_segmentation_loss_cls: 0.0231 instance_segmentation_loss_reg: 0.3057 instance_segmentation_loss_poly: 0.8095 2024/01/12 18:39:45 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 18:39:45 - mmengine - INFO - Iter(train) [586000/640000] base_lr: 3.4928e-06 lr: 3.4928e-07 eta: 21:08:53 time: 1.4268 data_time: 0.0283 memory: 25638 grad_norm: 3.5506 loss: 1.0284 caption_loss_cls: 1.8408 detection_loss_cls: 0.0236 detection_loss_reg: 0.3050 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8654 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3048 instance_segmentation_loss_poly: 0.8073 2024/01/12 18:39:45 - mmengine - INFO - Saving checkpoint at 586000 iterations 2024/01/12 18:51:42 - mmengine - INFO - Iter(train) [586500/640000] base_lr: 3.4288e-06 lr: 3.4288e-07 eta: 20:57:29 time: 1.4189 data_time: 0.0281 memory: 25638 grad_norm: 3.5816 loss: 1.0323 caption_loss_cls: 1.8456 detection_loss_cls: 0.0236 detection_loss_reg: 0.3052 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8679 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3040 instance_segmentation_loss_poly: 0.8056 2024/01/12 19:03:18 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 19:03:18 - mmengine - INFO - Iter(train) [587000/640000] base_lr: 3.3653e-06 lr: 3.3653e-07 eta: 20:45:27 time: 1.4193 data_time: 0.0281 memory: 25638 grad_norm: 3.5682 loss: 1.0347 caption_loss_cls: 1.8460 detection_loss_cls: 0.0235 detection_loss_reg: 0.3043 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8685 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3035 instance_segmentation_loss_poly: 0.8034 2024/01/12 19:15:08 - mmengine - INFO - Iter(train) [587500/640000] base_lr: 3.3025e-06 lr: 3.3025e-07 eta: 20:33:49 time: 1.4214 data_time: 0.0280 memory: 25638 grad_norm: 3.5607 loss: 1.0343 caption_loss_cls: 1.8478 detection_loss_cls: 0.0234 detection_loss_reg: 0.3046 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8716 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3036 instance_segmentation_loss_poly: 0.8027 2024/01/12 19:26:33 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 19:26:33 - mmengine - INFO - Iter(train) [588000/640000] base_lr: 3.2402e-06 lr: 3.2402e-07 eta: 20:21:33 time: 1.4057 data_time: 0.0277 memory: 25638 grad_norm: 3.5857 loss: 1.0298 caption_loss_cls: 1.8445 detection_loss_cls: 0.0233 detection_loss_reg: 0.3048 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8707 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3031 instance_segmentation_loss_poly: 0.8025 2024/01/12 19:26:33 - mmengine - INFO - Saving checkpoint at 588000 iterations 2024/01/12 19:38:25 - mmengine - INFO - Iter(train) [588500/640000] base_lr: 3.1785e-06 lr: 3.1785e-07 eta: 20:09:59 time: 1.4078 data_time: 0.0278 memory: 25638 grad_norm: 3.5843 loss: 1.0311 caption_loss_cls: 1.8411 detection_loss_cls: 0.0234 detection_loss_reg: 0.3055 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8704 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3035 instance_segmentation_loss_poly: 0.8037 2024/01/12 19:49:57 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 19:49:57 - mmengine - INFO - Iter(train) [589000/640000] base_lr: 3.1174e-06 lr: 3.1174e-07 eta: 19:57:54 time: 1.4028 data_time: 0.0276 memory: 25638 grad_norm: 3.5762 loss: 1.0233 caption_loss_cls: 1.8401 detection_loss_cls: 0.0233 detection_loss_reg: 0.3044 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8667 instance_segmentation_loss_cls: 0.0229 instance_segmentation_loss_reg: 0.3026 instance_segmentation_loss_poly: 0.8014 2024/01/12 20:01:31 - mmengine - INFO - Iter(train) [589500/640000] base_lr: 3.0569e-06 lr: 3.0569e-07 eta: 19:45:53 time: 1.4103 data_time: 0.0278 memory: 25638 grad_norm: 3.5462 loss: 1.0261 caption_loss_cls: 1.8381 detection_loss_cls: 0.0233 detection_loss_reg: 0.3048 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8714 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3041 instance_segmentation_loss_poly: 0.8049 2024/01/12 20:13:10 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 20:13:10 - mmengine - INFO - Iter(train) [590000/640000] base_lr: 2.9970e-06 lr: 2.9970e-07 eta: 19:34:00 time: 1.4002 data_time: 0.0269 memory: 25638 grad_norm: 3.6497 loss: 1.0317 caption_loss_cls: 1.8374 detection_loss_cls: 0.0232 detection_loss_reg: 0.3046 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8703 instance_segmentation_loss_cls: 0.0230 instance_segmentation_loss_reg: 0.3044 instance_segmentation_loss_poly: 0.8058 2024/01/12 20:13:10 - mmengine - INFO - Saving checkpoint at 590000 iterations 2024/01/12 20:25:23 - mmengine - INFO - Iter(train) [590500/640000] base_lr: 2.9376e-06 lr: 2.9376e-07 eta: 19:22:54 time: 1.4041 data_time: 0.0272 memory: 25638 grad_norm: 3.6546 loss: 1.0307 caption_loss_cls: 1.8358 detection_loss_cls: 0.0232 detection_loss_reg: 0.3038 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8702 instance_segmentation_loss_cls: 0.0228 instance_segmentation_loss_reg: 0.3029 instance_segmentation_loss_poly: 0.8034 2024/01/12 20:37:26 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 20:37:26 - mmengine - INFO - Iter(train) [591000/640000] base_lr: 2.8789e-06 lr: 2.8789e-07 eta: 19:11:32 time: 1.4109 data_time: 0.0273 memory: 25638 grad_norm: 3.6176 loss: 1.0218 caption_loss_cls: 1.8368 detection_loss_cls: 0.0235 detection_loss_reg: 0.3049 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8664 instance_segmentation_loss_cls: 0.0228 instance_segmentation_loss_reg: 0.3026 instance_segmentation_loss_poly: 0.8027 2024/01/12 20:48:59 - mmengine - INFO - Iter(train) [591500/640000] base_lr: 2.8207e-06 lr: 2.8207e-07 eta: 18:59:32 time: 1.4068 data_time: 0.0273 memory: 25638 grad_norm: 3.6221 loss: 1.0264 caption_loss_cls: 1.8415 detection_loss_cls: 0.0235 detection_loss_reg: 0.3046 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8645 instance_segmentation_loss_cls: 0.0228 instance_segmentation_loss_reg: 0.3027 instance_segmentation_loss_poly: 0.8029 2024/01/12 21:00:18 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 21:00:18 - mmengine - INFO - Iter(train) [592000/640000] base_lr: 2.7631e-06 lr: 2.7631e-07 eta: 18:47:13 time: 1.4051 data_time: 0.0272 memory: 25638 grad_norm: 3.6372 loss: 1.0315 caption_loss_cls: 1.8461 detection_loss_cls: 0.0234 detection_loss_reg: 0.3026 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8671 instance_segmentation_loss_cls: 0.0228 instance_segmentation_loss_reg: 0.3013 instance_segmentation_loss_poly: 0.7998 2024/01/12 21:00:18 - mmengine - INFO - Saving checkpoint at 592000 iterations 2024/01/12 21:12:43 - mmengine - INFO - Iter(train) [592500/640000] base_lr: 2.7061e-06 lr: 2.7061e-07 eta: 18:36:17 time: 1.4132 data_time: 0.0273 memory: 25638 grad_norm: 3.6052 loss: 1.0236 caption_loss_cls: 1.8378 detection_loss_cls: 0.0234 detection_loss_reg: 0.3031 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8677 instance_segmentation_loss_cls: 0.0228 instance_segmentation_loss_reg: 0.3013 instance_segmentation_loss_poly: 0.8002 2024/01/12 21:23:52 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 21:23:52 - mmengine - INFO - Iter(train) [593000/640000] base_lr: 2.6497e-06 lr: 2.6497e-07 eta: 18:23:49 time: 1.4077 data_time: 0.0272 memory: 25638 grad_norm: 3.6650 loss: 1.0325 caption_loss_cls: 1.8397 detection_loss_cls: 0.0233 detection_loss_reg: 0.3018 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8720 instance_segmentation_loss_cls: 0.0225 instance_segmentation_loss_reg: 0.2986 instance_segmentation_loss_poly: 0.7942 2024/01/12 21:35:48 - mmengine - INFO - Iter(train) [593500/640000] base_lr: 2.5939e-06 lr: 2.5939e-07 eta: 18:12:17 time: 1.4132 data_time: 0.0271 memory: 25638 grad_norm: 3.6239 loss: 1.0156 caption_loss_cls: 1.8399 detection_loss_cls: 0.0231 detection_loss_reg: 0.3007 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8690 instance_segmentation_loss_cls: 0.0224 instance_segmentation_loss_reg: 0.2973 instance_segmentation_loss_poly: 0.7919 2024/01/12 21:47:33 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 21:47:33 - mmengine - INFO - Iter(train) [594000/640000] base_lr: 2.5386e-06 lr: 2.5386e-07 eta: 18:00:32 time: 1.4146 data_time: 0.0272 memory: 25638 grad_norm: 3.5676 loss: 1.0189 caption_loss_cls: 1.8348 detection_loss_cls: 0.0232 detection_loss_reg: 0.3007 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8689 instance_segmentation_loss_cls: 0.0225 instance_segmentation_loss_reg: 0.2987 instance_segmentation_loss_poly: 0.7931 2024/01/12 21:47:33 - mmengine - INFO - Saving checkpoint at 594000 iterations 2024/01/12 21:59:30 - mmengine - INFO - Iter(train) [594500/640000] base_lr: 2.4840e-06 lr: 2.4840e-07 eta: 17:49:00 time: 1.4105 data_time: 0.0272 memory: 25638 grad_norm: 3.5651 loss: 1.0257 caption_loss_cls: 1.8413 detection_loss_cls: 0.0233 detection_loss_reg: 0.3014 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8648 instance_segmentation_loss_cls: 0.0226 instance_segmentation_loss_reg: 0.2990 instance_segmentation_loss_poly: 0.7945 2024/01/12 22:10:53 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 22:10:53 - mmengine - INFO - Iter(train) [595000/640000] base_lr: 2.4299e-06 lr: 2.4299e-07 eta: 17:36:51 time: 1.4008 data_time: 0.0270 memory: 25638 grad_norm: 3.6148 loss: 1.0255 caption_loss_cls: 1.8384 detection_loss_cls: 0.0233 detection_loss_reg: 0.3007 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8598 instance_segmentation_loss_cls: 0.0224 instance_segmentation_loss_reg: 0.2969 instance_segmentation_loss_poly: 0.7902 2024/01/12 22:22:30 - mmengine - INFO - Iter(train) [595500/640000] base_lr: 2.3764e-06 lr: 2.3764e-07 eta: 17:24:57 time: 1.4015 data_time: 0.0269 memory: 25638 grad_norm: 3.6139 loss: 1.0136 caption_loss_cls: 1.8341 detection_loss_cls: 0.0233 detection_loss_reg: 0.3000 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8578 instance_segmentation_loss_cls: 0.0224 instance_segmentation_loss_reg: 0.2955 instance_segmentation_loss_poly: 0.7885 2024/01/12 22:33:51 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 22:33:51 - mmengine - INFO - Iter(train) [596000/640000] base_lr: 2.3235e-06 lr: 2.3235e-07 eta: 17:12:48 time: 1.4022 data_time: 0.0270 memory: 25638 grad_norm: 3.5989 loss: 1.0122 caption_loss_cls: 1.8306 detection_loss_cls: 0.0233 detection_loss_reg: 0.2992 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8579 instance_segmentation_loss_cls: 0.0224 instance_segmentation_loss_reg: 0.2947 instance_segmentation_loss_poly: 0.7869 2024/01/12 22:33:51 - mmengine - INFO - Saving checkpoint at 596000 iterations 2024/01/12 22:45:57 - mmengine - INFO - Iter(train) [596500/640000] base_lr: 2.2712e-06 lr: 2.2712e-07 eta: 17:01:26 time: 1.3975 data_time: 0.0271 memory: 25638 grad_norm: 3.5908 loss: 1.0182 caption_loss_cls: 1.8310 detection_loss_cls: 0.0232 detection_loss_reg: 0.2989 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8591 instance_segmentation_loss_cls: 0.0223 instance_segmentation_loss_reg: 0.2957 instance_segmentation_loss_poly: 0.7904 2024/01/12 22:57:54 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 22:57:54 - mmengine - INFO - Iter(train) [597000/640000] base_lr: 2.2195e-06 lr: 2.2195e-07 eta: 16:49:53 time: 1.4092 data_time: 0.0273 memory: 25638 grad_norm: 3.5718 loss: 1.0182 caption_loss_cls: 1.8292 detection_loss_cls: 0.0233 detection_loss_reg: 0.2995 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8534 instance_segmentation_loss_cls: 0.0223 instance_segmentation_loss_reg: 0.2953 instance_segmentation_loss_poly: 0.7896 2024/01/12 23:09:40 - mmengine - INFO - Iter(train) [597500/640000] base_lr: 2.1684e-06 lr: 2.1684e-07 eta: 16:38:09 time: 1.4068 data_time: 0.0273 memory: 25638 grad_norm: 3.5991 loss: 1.0204 caption_loss_cls: 1.8319 detection_loss_cls: 0.0232 detection_loss_reg: 0.2985 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8524 instance_segmentation_loss_cls: 0.0222 instance_segmentation_loss_reg: 0.2940 instance_segmentation_loss_poly: 0.7861 2024/01/12 23:21:27 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 23:21:27 - mmengine - INFO - Iter(train) [598000/640000] base_lr: 2.1178e-06 lr: 2.1178e-07 eta: 16:26:26 time: 1.4074 data_time: 0.0273 memory: 25638 grad_norm: 3.6281 loss: 1.0157 caption_loss_cls: 1.8296 detection_loss_cls: 0.0233 detection_loss_reg: 0.2986 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8526 instance_segmentation_loss_cls: 0.0222 instance_segmentation_loss_reg: 0.2947 instance_segmentation_loss_poly: 0.7879 2024/01/12 23:21:27 - mmengine - INFO - Saving checkpoint at 598000 iterations 2024/01/12 23:33:45 - mmengine - INFO - Iter(train) [598500/640000] base_lr: 2.0679e-06 lr: 2.0679e-07 eta: 16:15:13 time: 1.4128 data_time: 0.0271 memory: 25638 grad_norm: 3.5768 loss: 1.0100 caption_loss_cls: 1.8306 detection_loss_cls: 0.0232 detection_loss_reg: 0.2974 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8524 instance_segmentation_loss_cls: 0.0223 instance_segmentation_loss_reg: 0.2953 instance_segmentation_loss_poly: 0.7901 2024/01/12 23:45:21 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/12 23:45:21 - mmengine - INFO - Iter(train) [599000/640000] base_lr: 2.0185e-06 lr: 2.0185e-07 eta: 16:03:19 time: 1.4158 data_time: 0.0272 memory: 25638 grad_norm: 3.5872 loss: 1.0224 caption_loss_cls: 1.8287 detection_loss_cls: 0.0232 detection_loss_reg: 0.2966 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8549 instance_segmentation_loss_cls: 0.0223 instance_segmentation_loss_reg: 0.2966 instance_segmentation_loss_poly: 0.7923 2024/01/12 23:57:15 - mmengine - INFO - Iter(train) [599500/640000] base_lr: 1.9697e-06 lr: 1.9697e-07 eta: 15:51:42 time: 1.4202 data_time: 0.0274 memory: 25638 grad_norm: 3.5927 loss: 1.0217 caption_loss_cls: 1.8275 detection_loss_cls: 0.0232 detection_loss_reg: 0.2971 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8515 instance_segmentation_loss_cls: 0.0223 instance_segmentation_loss_reg: 0.2960 instance_segmentation_loss_poly: 0.7910 2024/01/13 00:08:44 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/13 00:08:44 - mmengine - INFO - Iter(train) [600000/640000] base_lr: 1.9216e-06 lr: 1.9216e-07 eta: 15:39:42 time: 1.4220 data_time: 0.0274 memory: 25638 grad_norm: 3.6199 loss: 1.0153 caption_loss_cls: 1.8238 detection_loss_cls: 0.0230 detection_loss_reg: 0.2957 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8514 instance_segmentation_loss_cls: 0.0222 instance_segmentation_loss_reg: 0.2951 instance_segmentation_loss_poly: 0.7897 2024/01/13 00:08:44 - mmengine - INFO - Saving checkpoint at 600000 iterations 2024/01/13 00:20:32 - mmengine - INFO - Evaluating bbox... 2024/01/13 00:21:29 - mmengine - INFO - bbox_mAP_copypaste: 0.529 0.709 0.579 0.370 0.576 0.684 2024/01/13 00:21:29 - mmengine - INFO - Evaluating segm... 2024/01/13 00:22:42 - mmengine - INFO - segm_mAP_copypaste: 0.359 0.626 0.356 0.207 0.404 0.547 2024/01/13 00:24:50 - mmengine - INFO - Evaluating bbox... 2024/01/13 00:25:48 - mmengine - INFO - bbox_mAP_copypaste: 0.527 0.708 0.576 0.366 0.573 0.683 2024/01/13 00:32:11 - mmengine - INFO - per class results: 2024/01/13 00:32:11 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 79.44 | 90.06 | | building | 82.96 | 91.88 | | sky | 93.56 | 97.91 | | floor | 82.64 | 90.97 | | tree | 74.61 | 87.57 | | ceiling | 86.43 | 94.69 | | road | 84.72 | 89.95 | | bed | 90.89 | 95.96 | | windowpane | 63.79 | 80.0 | | grass | 66.82 | 81.79 | | cabinet | 63.71 | 74.03 | | sidewalk | 69.59 | 82.46 | | person | 82.06 | 91.92 | | earth | 38.55 | 51.69 | | door | 55.74 | 71.29 | | table | 64.81 | 79.95 | | mountain | 60.87 | 77.27 | | plant | 54.11 | 64.07 | | curtain | 76.71 | 87.51 | | chair | 62.8 | 75.95 | | car | 85.43 | 92.24 | | water | 60.42 | 73.05 | | painting | 74.47 | 87.57 | | sofa | 72.95 | 83.48 | | shelf | 47.15 | 67.64 | | house | 46.39 | 62.08 | | sea | 61.61 | 76.82 | | mirror | 69.55 | 78.08 | | rug | 64.54 | 74.37 | | field | 38.94 | 59.43 | | armchair | 53.23 | 72.61 | | seat | 67.42 | 82.37 | | fence | 47.62 | 64.89 | | desk | 53.37 | 70.22 | | rock | 48.85 | 69.04 | | wardrobe | 45.94 | 66.13 | | lamp | 67.7 | 80.09 | | bathtub | 80.07 | 89.31 | | railing | 41.76 | 58.51 | | cushion | 65.27 | 80.03 | | base | 21.38 | 29.66 | | box | 30.56 | 39.44 | | column | 57.05 | 66.69 | | signboard | 39.21 | 53.01 | | chest of drawers | 38.42 | 58.39 | | counter | 29.83 | 41.97 | | sand | 46.49 | 68.25 | | sink | 76.6 | 84.9 | | skyscraper | 47.08 | 58.72 | | fireplace | 76.14 | 87.82 | | refrigerator | 81.64 | 86.31 | | grandstand | 48.24 | 82.36 | | path | 22.21 | 31.93 | | stairs | 33.44 | 40.45 | | runway | 74.37 | 95.37 | | case | 51.62 | 66.68 | | pool table | 92.18 | 95.7 | | pillow | 61.25 | 70.5 | | screen door | 69.45 | 72.6 | | stairway | 35.17 | 45.38 | | river | 14.9 | 34.04 | | bridge | 67.94 | 78.03 | | bookcase | 42.28 | 61.99 | | blind | 38.64 | 43.17 | | coffee table | 65.27 | 81.95 | | toilet | 87.48 | 92.19 | | flower | 43.62 | 56.93 | | book | 50.17 | 69.47 | | hill | 15.63 | 23.19 | | bench | 61.52 | 70.88 | | countertop | 63.59 | 80.17 | | stove | 81.2 | 85.06 | | palm | 49.4 | 73.17 | | kitchen island | 47.77 | 70.11 | | computer | 78.14 | 88.98 | | swivel chair | 45.08 | 59.04 | | boat | 66.25 | 87.3 | | bar | 37.89 | 48.01 | | arcade machine | 68.38 | 71.95 | | hovel | 26.74 | 29.2 | | bus | 86.19 | 94.74 | | towel | 67.16 | 79.15 | | light | 53.53 | 63.95 | | truck | 50.1 | 66.27 | | tower | 36.23 | 60.21 | | chandelier | 68.02 | 78.69 | | awning | 33.07 | 40.91 | | streetlight | 34.76 | 46.58 | | booth | 42.1 | 55.34 | | television receiver | 75.57 | 87.98 | | airplane | 68.24 | 75.37 | | dirt track | 9.2 | 27.46 | | apparel | 32.9 | 47.77 | | pole | 29.57 | 44.71 | | land | 2.45 | 3.32 | | bannister | 15.91 | 21.76 | | escalator | 25.82 | 28.01 | | ottoman | 55.07 | 72.68 | | bottle | 27.83 | 35.88 | | buffet | 56.48 | 63.92 | | poster | 32.53 | 45.47 | | stage | 12.26 | 17.63 | | van | 48.2 | 63.39 | | ship | 8.92 | 9.42 | | fountain | 21.07 | 21.33 | | conveyer belt | 68.5 | 91.6 | | canopy | 36.45 | 43.08 | | washer | 68.37 | 71.73 | | plaything | 33.96 | 43.29 | | swimming pool | 68.19 | 69.91 | | stool | 44.42 | 61.15 | | barrel | 22.89 | 70.52 | | basket | 35.09 | 46.02 | | waterfall | 65.67 | 86.83 | | tent | 72.07 | 97.46 | | bag | 25.03 | 33.76 | | minibike | 73.64 | 85.2 | | cradle | 84.39 | 96.39 | | oven | 52.12 | 63.61 | | ball | 48.73 | 61.59 | | food | 53.57 | 58.05 | | step | 10.44 | 12.92 | | tank | 50.06 | 53.68 | | trade name | 28.89 | 37.56 | | microwave | 87.06 | 93.9 | | pot | 49.58 | 58.32 | | animal | 63.33 | 67.17 | | bicycle | 58.9 | 74.25 | | lake | 58.41 | 64.76 | | dishwasher | 72.11 | 87.52 | | screen | 60.85 | 77.89 | | blanket | 30.8 | 37.34 | | sculpture | 69.0 | 81.61 | | hood | 64.54 | 73.01 | | sconce | 46.53 | 57.86 | | vase | 44.88 | 62.58 | | traffic light | 41.1 | 59.55 | | tray | 20.74 | 29.89 | | ashcan | 45.07 | 57.12 | | fan | 63.63 | 74.58 | | pier | 44.19 | 49.03 | | crt screen | 7.18 | 21.74 | | plate | 58.93 | 77.34 | | monitor | 9.27 | 10.46 | | bulletin board | 58.2 | 68.23 | | shower | 3.25 | 4.52 | | radiator | 61.45 | 71.61 | | glass | 19.14 | 21.22 | | clock | 40.39 | 48.75 | | flag | 44.43 | 51.91 | +---------------------+-------+-------+ 2024/01/13 00:32:24 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.5270 coco/bbox_mAP_50: 0.7080 coco/bbox_mAP_75: 0.5760 coco/bbox_mAP_s: 0.3660 coco/bbox_mAP_m: 0.5730 coco/bbox_mAP_l: 0.6830 coco/segm_mAP: 0.3590 coco/segm_mAP_50: 0.6260 coco/segm_mAP_75: 0.3560 coco/segm_mAP_s: 0.2070 coco/segm_mAP_m: 0.4040 coco/segm_mAP_l: 0.5470 Bleu_1: 0.7754 Bleu_2: 0.6149 Bleu_3: 0.4745 Bleu_4: 0.3624 METEOR: 0.2845 ROUGE_L: 0.5756 CIDEr: 1.1872 SPICE: 0.2115 aAcc: 84.4500 mIoU: 52.4700 mAcc: 64.3000 visual-grounding/miou: 0.8372 visual-grounding/acc: 0.8913 data_time: 0.0114 time: 1.9115 2024/01/13 00:44:17 - mmengine - INFO - Iter(train) [600500/640000] base_lr: 1.8740e-06 lr: 1.8740e-07 eta: 15:28:06 time: 1.4195 data_time: 0.0206 memory: 34732 grad_norm: 3.6132 loss: 0.9996 caption_loss_cls: 1.8197 detection_loss_cls: 0.0231 detection_loss_reg: 0.2967 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8515 instance_segmentation_loss_cls: 0.0223 instance_segmentation_loss_reg: 0.2966 instance_segmentation_loss_poly: 0.7925 2024/01/13 00:55:56 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/13 00:55:56 - mmengine - INFO - Iter(train) [601000/640000] base_lr: 1.8270e-06 lr: 1.8270e-07 eta: 15:16:15 time: 1.4150 data_time: 0.0205 memory: 25638 grad_norm: 3.5675 loss: 0.9902 caption_loss_cls: 1.8212 detection_loss_cls: 0.0230 detection_loss_reg: 0.2953 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8520 instance_segmentation_loss_cls: 0.0221 instance_segmentation_loss_reg: 0.2964 instance_segmentation_loss_poly: 0.7913 2024/01/13 01:07:31 - mmengine - INFO - Iter(train) [601500/640000] base_lr: 1.7806e-06 lr: 1.7806e-07 eta: 15:04:23 time: 1.4123 data_time: 0.0206 memory: 25638 grad_norm: 3.5956 loss: 0.9993 caption_loss_cls: 1.8204 detection_loss_cls: 0.0229 detection_loss_reg: 0.2943 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8520 instance_segmentation_loss_cls: 0.0221 instance_segmentation_loss_reg: 0.2944 instance_segmentation_loss_poly: 0.7857 2024/01/13 01:19:44 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/13 01:19:44 - mmengine - INFO - Iter(train) [602000/640000] base_lr: 1.7348e-06 lr: 1.7348e-07 eta: 14:52:59 time: 1.4186 data_time: 0.0207 memory: 25638 grad_norm: 3.5508 loss: 1.0034 caption_loss_cls: 1.8207 detection_loss_cls: 0.0229 detection_loss_reg: 0.2950 semantic_segmentation_loss_cls: 0.0061 grounding_loss_reg: 1.8542 instance_segmentation_loss_cls: 0.0221 instance_segmentation_loss_reg: 0.2950 instance_segmentation_loss_poly: 0.7859 2024/01/13 01:19:44 - mmengine - INFO - Saving checkpoint at 602000 iterations 2024/01/13 01:31:53 - mmengine - INFO - Iter(train) [602500/640000] base_lr: 1.6895e-06 lr: 1.6895e-07 eta: 14:41:32 time: 1.4163 data_time: 0.0208 memory: 25638 grad_norm: 3.6031 loss: 1.0091 caption_loss_cls: 1.8238 detection_loss_cls: 0.0229 detection_loss_reg: 0.2957 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8537 instance_segmentation_loss_cls: 0.0220 instance_segmentation_loss_reg: 0.2947 instance_segmentation_loss_poly: 0.7854 2024/01/13 01:43:57 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/13 01:43:57 - mmengine - INFO - Iter(train) [603000/640000] base_lr: 1.6449e-06 lr: 1.6449e-07 eta: 14:30:01 time: 1.4234 data_time: 0.0208 memory: 25638 grad_norm: 3.5534 loss: 0.9946 caption_loss_cls: 1.8238 detection_loss_cls: 0.0228 detection_loss_reg: 0.2943 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8512 instance_segmentation_loss_cls: 0.0220 instance_segmentation_loss_reg: 0.2941 instance_segmentation_loss_poly: 0.7837 2024/01/13 01:55:49 - mmengine - INFO - Iter(train) [603500/640000] base_lr: 1.6009e-06 lr: 1.6009e-07 eta: 14:18:20 time: 1.4230 data_time: 0.0206 memory: 25638 grad_norm: 3.5068 loss: 0.9875 caption_loss_cls: 1.8241 detection_loss_cls: 0.0227 detection_loss_reg: 0.2940 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8522 instance_segmentation_loss_cls: 0.0220 instance_segmentation_loss_reg: 0.2921 instance_segmentation_loss_poly: 0.7805 2024/01/13 02:07:49 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/13 02:07:49 - mmengine - INFO - Iter(train) [604000/640000] base_lr: 1.5574e-06 lr: 1.5574e-07 eta: 14:06:45 time: 1.4309 data_time: 0.0208 memory: 25638 grad_norm: 3.4543 loss: 0.9939 caption_loss_cls: 1.8289 detection_loss_cls: 0.0226 detection_loss_reg: 0.2944 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8499 instance_segmentation_loss_cls: 0.0220 instance_segmentation_loss_reg: 0.2922 instance_segmentation_loss_poly: 0.7797 2024/01/13 02:07:49 - mmengine - INFO - Saving checkpoint at 604000 iterations 2024/01/13 02:20:08 - mmengine - INFO - Iter(train) [604500/640000] base_lr: 1.5146e-06 lr: 1.5146e-07 eta: 13:55:22 time: 1.4365 data_time: 0.0273 memory: 25638 grad_norm: 3.4745 loss: 1.0005 caption_loss_cls: 1.8270 detection_loss_cls: 0.0225 detection_loss_reg: 0.2936 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8480 instance_segmentation_loss_cls: 0.0221 instance_segmentation_loss_reg: 0.2920 instance_segmentation_loss_poly: 0.7787 2024/01/13 02:31:22 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/13 02:31:22 - mmengine - INFO - Iter(train) [605000/640000] base_lr: 1.4723e-06 lr: 1.4723e-07 eta: 13:43:14 time: 1.4305 data_time: 0.0273 memory: 25638 grad_norm: 3.5148 loss: 1.0156 caption_loss_cls: 1.8270 detection_loss_cls: 0.0225 detection_loss_reg: 0.2932 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8486 instance_segmentation_loss_cls: 0.0220 instance_segmentation_loss_reg: 0.2913 instance_segmentation_loss_poly: 0.7765 2024/01/13 02:43:01 - mmengine - INFO - Iter(train) [605500/640000] base_lr: 1.4307e-06 lr: 1.4307e-07 eta: 13:31:24 time: 1.4315 data_time: 0.0272 memory: 25638 grad_norm: 3.4966 loss: 1.0110 caption_loss_cls: 1.8261 detection_loss_cls: 0.0224 detection_loss_reg: 0.2931 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8503 instance_segmentation_loss_cls: 0.0220 instance_segmentation_loss_reg: 0.2912 instance_segmentation_loss_poly: 0.7757 2024/01/13 02:54:46 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/13 02:54:46 - mmengine - INFO - Iter(train) [606000/640000] base_lr: 1.3896e-06 lr: 1.3896e-07 eta: 13:19:38 time: 1.4246 data_time: 0.0270 memory: 25638 grad_norm: 3.5157 loss: 1.0148 caption_loss_cls: 1.8282 detection_loss_cls: 0.0226 detection_loss_reg: 0.2944 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8529 instance_segmentation_loss_cls: 0.0221 instance_segmentation_loss_reg: 0.2911 instance_segmentation_loss_poly: 0.7747 2024/01/13 02:54:46 - mmengine - INFO - Saving checkpoint at 606000 iterations 2024/01/13 03:06:34 - mmengine - INFO - Iter(train) [606500/640000] base_lr: 1.3491e-06 lr: 1.3491e-07 eta: 13:07:53 time: 1.4192 data_time: 0.0271 memory: 25638 grad_norm: 3.5308 loss: 1.0066 caption_loss_cls: 1.8248 detection_loss_cls: 0.0226 detection_loss_reg: 0.2937 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8544 instance_segmentation_loss_cls: 0.0223 instance_segmentation_loss_reg: 0.2926 instance_segmentation_loss_poly: 0.7780 2024/01/13 03:18:29 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/13 03:18:29 - mmengine - INFO - Iter(train) [607000/640000] base_lr: 1.3092e-06 lr: 1.3092e-07 eta: 12:56:13 time: 1.4169 data_time: 0.0270 memory: 25638 grad_norm: 3.5405 loss: 1.0124 caption_loss_cls: 1.8273 detection_loss_cls: 0.0225 detection_loss_reg: 0.2929 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8553 instance_segmentation_loss_cls: 0.0222 instance_segmentation_loss_reg: 0.2913 instance_segmentation_loss_poly: 0.7752 2024/01/13 03:30:05 - mmengine - INFO - Iter(train) [607500/640000] base_lr: 1.2699e-06 lr: 1.2699e-07 eta: 12:44:21 time: 1.4128 data_time: 0.0272 memory: 25638 grad_norm: 3.5711 loss: 1.0234 caption_loss_cls: 1.8278 detection_loss_cls: 0.0225 detection_loss_reg: 0.2916 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8565 instance_segmentation_loss_cls: 0.0221 instance_segmentation_loss_reg: 0.2911 instance_segmentation_loss_poly: 0.7745 2024/01/13 03:41:54 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/13 03:41:54 - mmengine - INFO - Iter(train) [608000/640000] base_lr: 1.2312e-06 lr: 1.2312e-07 eta: 12:32:37 time: 1.4100 data_time: 0.0271 memory: 25638 grad_norm: 3.5541 loss: 1.0197 caption_loss_cls: 1.8295 detection_loss_cls: 0.0226 detection_loss_reg: 0.2915 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8583 instance_segmentation_loss_cls: 0.0221 instance_segmentation_loss_reg: 0.2917 instance_segmentation_loss_poly: 0.7752 2024/01/13 03:41:54 - mmengine - INFO - Saving checkpoint at 608000 iterations 2024/01/13 03:54:14 - mmengine - INFO - Iter(train) [608500/640000] base_lr: 1.1931e-06 lr: 1.1931e-07 eta: 12:21:11 time: 1.4106 data_time: 0.0271 memory: 25638 grad_norm: 3.5209 loss: 1.0155 caption_loss_cls: 1.8289 detection_loss_cls: 0.0225 detection_loss_reg: 0.2898 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8562 instance_segmentation_loss_cls: 0.0221 instance_segmentation_loss_reg: 0.2915 instance_segmentation_loss_poly: 0.7753 2024/01/13 04:05:44 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/13 04:05:44 - mmengine - INFO - Iter(train) [609000/640000] base_lr: 1.1556e-06 lr: 1.1556e-07 eta: 12:09:16 time: 1.4145 data_time: 0.0272 memory: 25638 grad_norm: 3.5295 loss: 1.0188 caption_loss_cls: 1.8334 detection_loss_cls: 0.0228 detection_loss_reg: 0.2926 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8579 instance_segmentation_loss_cls: 0.0222 instance_segmentation_loss_reg: 0.2921 instance_segmentation_loss_poly: 0.7782 2024/01/13 04:17:19 - mmengine - INFO - Iter(train) [609500/640000] base_lr: 1.1187e-06 lr: 1.1187e-07 eta: 11:57:24 time: 1.4133 data_time: 0.0274 memory: 25638 grad_norm: 3.5408 loss: 1.0278 caption_loss_cls: 1.8301 detection_loss_cls: 0.0228 detection_loss_reg: 0.2921 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8547 instance_segmentation_loss_cls: 0.0222 instance_segmentation_loss_reg: 0.2930 instance_segmentation_loss_poly: 0.7800 2024/01/13 04:29:12 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/13 04:29:12 - mmengine - INFO - Iter(train) [610000/640000] base_lr: 1.0824e-06 lr: 1.0824e-07 eta: 11:45:42 time: 1.4152 data_time: 0.0275 memory: 25638 grad_norm: 3.5488 loss: 1.0226 caption_loss_cls: 1.8308 detection_loss_cls: 0.0227 detection_loss_reg: 0.2910 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8569 instance_segmentation_loss_cls: 0.0222 instance_segmentation_loss_reg: 0.2933 instance_segmentation_loss_poly: 0.7797 2024/01/13 04:29:12 - mmengine - INFO - Saving checkpoint at 610000 iterations 2024/01/13 04:41:06 - mmengine - INFO - Iter(train) [610500/640000] base_lr: 1.0467e-06 lr: 1.0467e-07 eta: 11:34:00 time: 1.4170 data_time: 0.0274 memory: 25638 grad_norm: 3.5713 loss: 1.0376 caption_loss_cls: 1.8360 detection_loss_cls: 0.0226 detection_loss_reg: 0.2900 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8608 instance_segmentation_loss_cls: 0.0222 instance_segmentation_loss_reg: 0.2928 instance_segmentation_loss_poly: 0.7773 2024/01/13 04:52:49 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/13 04:52:49 - mmengine - INFO - Iter(train) [611000/640000] base_lr: 1.0116e-06 lr: 1.0116e-07 eta: 11:22:13 time: 1.4139 data_time: 0.0274 memory: 25638 grad_norm: 3.5965 loss: 1.0255 caption_loss_cls: 1.8322 detection_loss_cls: 0.0224 detection_loss_reg: 0.2894 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8593 instance_segmentation_loss_cls: 0.0222 instance_segmentation_loss_reg: 0.2927 instance_segmentation_loss_poly: 0.7771 2024/01/13 05:04:27 - mmengine - INFO - Iter(train) [611500/640000] base_lr: 9.7706e-07 lr: 9.7706e-08 eta: 11:10:23 time: 1.4143 data_time: 0.0273 memory: 25638 grad_norm: 3.6008 loss: 1.0208 caption_loss_cls: 1.8298 detection_loss_cls: 0.0224 detection_loss_reg: 0.2890 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8588 instance_segmentation_loss_cls: 0.0221 instance_segmentation_loss_reg: 0.2910 instance_segmentation_loss_poly: 0.7740 2024/01/13 05:16:10 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240112_051728 2024/01/13 05:16:10 - mmengine - INFO - Iter(train) [612000/640000] base_lr: 9.4313e-07 lr: 9.4313e-08 eta: 10:58:36 time: 1.4129 data_time: 0.0273 memory: 25638 grad_norm: 3.6666 loss: 1.0223 caption_loss_cls: 1.8295 detection_loss_cls: 0.0225 detection_loss_reg: 0.2904 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8552 instance_segmentation_loss_cls: 0.0220 instance_segmentation_loss_reg: 0.2902 instance_segmentation_loss_poly: 0.7727 2024/01/13 05:16:10 - mmengine - INFO - Saving checkpoint at 612000 iterations 2024/01/13 05:32:48 - mmengine - INFO - Iter(train) [612500/640000] base_lr: 9.0980e-07 lr: 9.0980e-08 eta: 10:33:32 time: 1.4007 data_time: 0.0199 memory: 25573 grad_norm: 3.6571 loss: 1.0269 caption_loss_cls: 1.8309 detection_loss_cls: 0.0224 detection_loss_reg: 0.2892 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8545 instance_segmentation_loss_cls: 0.0221 instance_segmentation_loss_reg: 0.2897 instance_segmentation_loss_poly: 0.7713 2024/01/13 05:44:12 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 05:44:12 - mmengine - INFO - Iter(train) [613000/640000] base_lr: 8.7707e-07 lr: 8.7707e-08 eta: 10:18:29 time: 1.3991 data_time: 0.0197 memory: 25573 grad_norm: 3.6635 loss: 1.0282 caption_loss_cls: 1.8330 detection_loss_cls: 0.0224 detection_loss_reg: 0.2898 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8552 instance_segmentation_loss_cls: 0.0224 instance_segmentation_loss_reg: 0.2925 instance_segmentation_loss_poly: 0.7770 2024/01/13 05:55:41 - mmengine - INFO - Iter(train) [613500/640000] base_lr: 8.4493e-07 lr: 8.4493e-08 eta: 10:07:21 time: 1.3976 data_time: 0.0194 memory: 25573 grad_norm: 3.6727 loss: 1.0275 caption_loss_cls: 1.8391 detection_loss_cls: 0.0224 detection_loss_reg: 0.2902 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8541 instance_segmentation_loss_cls: 0.0224 instance_segmentation_loss_reg: 0.2931 instance_segmentation_loss_poly: 0.7776 2024/01/13 06:07:39 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 06:07:39 - mmengine - INFO - Iter(train) [614000/640000] base_lr: 8.1339e-07 lr: 8.1339e-08 eta: 10:02:34 time: 1.3993 data_time: 0.0191 memory: 25573 grad_norm: 3.6741 loss: 1.0375 caption_loss_cls: 1.8379 detection_loss_cls: 0.0225 detection_loss_reg: 0.2907 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8562 instance_segmentation_loss_cls: 0.0225 instance_segmentation_loss_reg: 0.2935 instance_segmentation_loss_poly: 0.7792 2024/01/13 06:07:39 - mmengine - INFO - Saving checkpoint at 614000 iterations 2024/01/13 06:19:42 - mmengine - INFO - Iter(train) [614500/640000] base_lr: 7.8245e-07 lr: 7.8245e-08 eta: 9:55:32 time: 1.4012 data_time: 0.0183 memory: 25573 grad_norm: 3.6583 loss: 1.0333 caption_loss_cls: 1.8362 detection_loss_cls: 0.0224 detection_loss_reg: 0.2905 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8586 instance_segmentation_loss_cls: 0.0225 instance_segmentation_loss_reg: 0.2939 instance_segmentation_loss_poly: 0.7787 2024/01/13 06:31:28 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 06:31:28 - mmengine - INFO - Iter(train) [615000/640000] base_lr: 7.5211e-07 lr: 7.5211e-08 eta: 9:44:33 time: 1.4021 data_time: 0.0181 memory: 25573 grad_norm: 3.6344 loss: 1.0405 caption_loss_cls: 1.8338 detection_loss_cls: 0.0224 detection_loss_reg: 0.2906 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8611 instance_segmentation_loss_cls: 0.0225 instance_segmentation_loss_reg: 0.2943 instance_segmentation_loss_poly: 0.7796 2024/01/13 06:43:09 - mmengine - INFO - Iter(train) [615500/640000] base_lr: 7.2236e-07 lr: 7.2236e-08 eta: 9:32:39 time: 1.4028 data_time: 0.0181 memory: 25573 grad_norm: 3.6462 loss: 1.0531 caption_loss_cls: 1.8361 detection_loss_cls: 0.0222 detection_loss_reg: 0.2905 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8612 instance_segmentation_loss_cls: 0.0226 instance_segmentation_loss_reg: 0.2964 instance_segmentation_loss_poly: 0.7847 2024/01/13 06:55:10 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 06:55:10 - mmengine - INFO - Iter(train) [616000/640000] base_lr: 6.9321e-07 lr: 6.9321e-08 eta: 9:22:53 time: 1.4072 data_time: 0.0179 memory: 25573 grad_norm: 3.5851 loss: 1.0402 caption_loss_cls: 1.8349 detection_loss_cls: 0.0221 detection_loss_reg: 0.2911 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8609 instance_segmentation_loss_cls: 0.0226 instance_segmentation_loss_reg: 0.2958 instance_segmentation_loss_poly: 0.7820 2024/01/13 06:55:10 - mmengine - INFO - Saving checkpoint at 616000 iterations 2024/01/13 07:07:03 - mmengine - INFO - Iter(train) [616500/640000] base_lr: 6.6466e-07 lr: 6.6466e-08 eta: 9:11:59 time: 1.4127 data_time: 0.0240 memory: 25573 grad_norm: 3.6533 loss: 1.0474 caption_loss_cls: 1.8361 detection_loss_cls: 0.0220 detection_loss_reg: 0.2899 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8648 instance_segmentation_loss_cls: 0.0225 instance_segmentation_loss_reg: 0.2952 instance_segmentation_loss_poly: 0.7804 2024/01/13 07:19:11 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 07:19:11 - mmengine - INFO - Iter(train) [617000/640000] base_lr: 6.3671e-07 lr: 6.3671e-08 eta: 9:01:59 time: 1.4238 data_time: 0.0241 memory: 25573 grad_norm: 3.6093 loss: 1.0309 caption_loss_cls: 1.8363 detection_loss_cls: 0.0219 detection_loss_reg: 0.2887 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8633 instance_segmentation_loss_cls: 0.0225 instance_segmentation_loss_reg: 0.2957 instance_segmentation_loss_poly: 0.7819 2024/01/13 07:31:57 - mmengine - INFO - Iter(train) [617500/640000] base_lr: 6.0936e-07 lr: 6.0936e-08 eta: 8:54:11 time: 1.4430 data_time: 0.0243 memory: 25573 grad_norm: 3.5739 loss: 1.0229 caption_loss_cls: 1.8377 detection_loss_cls: 0.0216 detection_loss_reg: 0.2863 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8653 instance_segmentation_loss_cls: 0.0224 instance_segmentation_loss_reg: 0.2944 instance_segmentation_loss_poly: 0.7784 2024/01/13 07:43:33 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 07:43:33 - mmengine - INFO - Iter(train) [618000/640000] base_lr: 5.8260e-07 lr: 5.8260e-08 eta: 8:41:15 time: 1.4371 data_time: 0.0242 memory: 25573 grad_norm: 3.5680 loss: 1.0172 caption_loss_cls: 1.8374 detection_loss_cls: 0.0216 detection_loss_reg: 0.2869 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8644 instance_segmentation_loss_cls: 0.0225 instance_segmentation_loss_reg: 0.2954 instance_segmentation_loss_poly: 0.7804 2024/01/13 07:43:33 - mmengine - INFO - Saving checkpoint at 618000 iterations 2024/01/13 07:55:43 - mmengine - INFO - Iter(train) [618500/640000] base_lr: 5.5645e-07 lr: 5.5645e-08 eta: 8:30:28 time: 1.4392 data_time: 0.0238 memory: 25573 grad_norm: 3.5577 loss: 0.9986 caption_loss_cls: 1.8321 detection_loss_cls: 0.0217 detection_loss_reg: 0.2878 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8594 instance_segmentation_loss_cls: 0.0224 instance_segmentation_loss_reg: 0.2940 instance_segmentation_loss_poly: 0.7767 2024/01/13 08:07:49 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 08:07:49 - mmengine - INFO - Iter(train) [619000/640000] base_lr: 5.3089e-07 lr: 5.3089e-08 eta: 8:19:13 time: 1.4439 data_time: 0.0239 memory: 25573 grad_norm: 3.6275 loss: 1.0036 caption_loss_cls: 1.8337 detection_loss_cls: 0.0217 detection_loss_reg: 0.2881 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8560 instance_segmentation_loss_cls: 0.0224 instance_segmentation_loss_reg: 0.2935 instance_segmentation_loss_poly: 0.7761 2024/01/13 08:19:22 - mmengine - INFO - Iter(train) [619500/640000] base_lr: 5.0593e-07 lr: 5.0593e-08 eta: 8:06:25 time: 1.4423 data_time: 0.0238 memory: 25573 grad_norm: 3.6375 loss: 1.0087 caption_loss_cls: 1.8325 detection_loss_cls: 0.0217 detection_loss_reg: 0.2878 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8611 instance_segmentation_loss_cls: 0.0225 instance_segmentation_loss_reg: 0.2946 instance_segmentation_loss_poly: 0.7792 2024/01/13 08:31:24 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 08:31:24 - mmengine - INFO - Iter(train) [620000/640000] base_lr: 4.8158e-07 lr: 4.8158e-08 eta: 7:54:56 time: 1.4424 data_time: 0.0238 memory: 25573 grad_norm: 3.6550 loss: 1.0205 caption_loss_cls: 1.8378 detection_loss_cls: 0.0217 detection_loss_reg: 0.2878 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8624 instance_segmentation_loss_cls: 0.0225 instance_segmentation_loss_reg: 0.2950 instance_segmentation_loss_poly: 0.7804 2024/01/13 08:31:24 - mmengine - INFO - Saving checkpoint at 620000 iterations 2024/01/13 08:42:48 - mmengine - INFO - Evaluating bbox... 2024/01/13 08:43:45 - mmengine - INFO - bbox_mAP_copypaste: 0.529 0.710 0.578 0.369 0.574 0.685 2024/01/13 08:43:45 - mmengine - INFO - Evaluating segm... 2024/01/13 08:44:59 - mmengine - INFO - segm_mAP_copypaste: 0.358 0.625 0.357 0.206 0.405 0.546 2024/01/13 08:47:08 - mmengine - INFO - Evaluating bbox... 2024/01/13 08:48:06 - mmengine - INFO - bbox_mAP_copypaste: 0.528 0.709 0.578 0.367 0.573 0.685 2024/01/13 08:54:23 - mmengine - INFO - per class results: 2024/01/13 08:54:23 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 79.53 | 89.94 | | building | 82.95 | 91.96 | | sky | 93.59 | 97.88 | | floor | 82.6 | 90.94 | | tree | 74.47 | 87.6 | | ceiling | 86.45 | 94.66 | | road | 84.76 | 90.07 | | bed | 90.87 | 96.05 | | windowpane | 63.9 | 79.96 | | grass | 66.87 | 81.5 | | cabinet | 63.86 | 74.32 | | sidewalk | 69.67 | 82.1 | | person | 82.17 | 92.07 | | earth | 38.18 | 50.72 | | door | 56.02 | 71.47 | | table | 64.8 | 80.13 | | mountain | 61.06 | 77.58 | | plant | 54.11 | 64.52 | | curtain | 76.6 | 87.86 | | chair | 62.96 | 76.16 | | car | 85.5 | 92.42 | | water | 60.39 | 72.95 | | painting | 74.14 | 87.58 | | sofa | 72.97 | 84.1 | | shelf | 46.97 | 67.54 | | house | 46.3 | 62.78 | | sea | 61.73 | 76.79 | | mirror | 69.5 | 78.41 | | rug | 64.51 | 74.52 | | field | 37.83 | 58.28 | | armchair | 53.14 | 71.7 | | seat | 67.31 | 82.38 | | fence | 47.82 | 64.79 | | desk | 53.48 | 70.13 | | rock | 50.42 | 72.23 | | wardrobe | 46.04 | 66.54 | | lamp | 67.71 | 80.17 | | bathtub | 80.53 | 89.74 | | railing | 41.63 | 58.36 | | cushion | 65.1 | 79.79 | | base | 22.12 | 30.76 | | box | 30.62 | 39.87 | | column | 57.14 | 66.38 | | signboard | 39.1 | 52.58 | | chest of drawers | 37.65 | 56.73 | | counter | 30.04 | 43.5 | | sand | 45.95 | 68.23 | | sink | 77.04 | 85.47 | | skyscraper | 46.99 | 58.87 | | fireplace | 75.63 | 88.29 | | refrigerator | 81.45 | 86.4 | | grandstand | 48.31 | 81.52 | | path | 21.95 | 31.74 | | stairs | 33.77 | 40.94 | | runway | 74.2 | 95.12 | | case | 51.01 | 65.39 | | pool table | 92.22 | 95.68 | | pillow | 61.29 | 70.7 | | screen door | 70.55 | 73.8 | | stairway | 34.52 | 44.09 | | river | 14.68 | 33.64 | | bridge | 68.74 | 79.57 | | bookcase | 42.51 | 63.93 | | blind | 38.77 | 43.74 | | coffee table | 65.18 | 82.03 | | toilet | 87.57 | 92.4 | | flower | 43.93 | 57.3 | | book | 50.35 | 69.02 | | hill | 16.22 | 23.47 | | bench | 61.56 | 71.48 | | countertop | 63.85 | 79.18 | | stove | 81.27 | 85.18 | | palm | 48.93 | 72.13 | | kitchen island | 47.23 | 69.47 | | computer | 78.31 | 89.03 | | swivel chair | 45.14 | 59.2 | | boat | 66.34 | 87.91 | | bar | 38.85 | 48.69 | | arcade machine | 69.13 | 72.73 | | hovel | 23.96 | 26.2 | | bus | 86.35 | 94.92 | | towel | 67.23 | 79.17 | | light | 53.11 | 63.68 | | truck | 50.08 | 66.32 | | tower | 35.35 | 58.57 | | chandelier | 68.13 | 78.61 | | awning | 32.66 | 40.59 | | streetlight | 34.69 | 46.93 | | booth | 42.04 | 56.27 | | television receiver | 75.34 | 87.95 | | airplane | 67.71 | 75.7 | | dirt track | 9.22 | 27.95 | | apparel | 32.63 | 47.63 | | pole | 29.47 | 44.13 | | land | 3.13 | 4.2 | | bannister | 15.48 | 21.15 | | escalator | 25.15 | 27.07 | | ottoman | 55.97 | 72.54 | | bottle | 27.98 | 36.0 | | buffet | 56.94 | 64.26 | | poster | 32.24 | 45.24 | | stage | 12.3 | 18.15 | | van | 48.82 | 64.38 | | ship | 9.3 | 9.8 | | fountain | 21.2 | 21.46 | | conveyer belt | 67.99 | 91.6 | | canopy | 36.33 | 43.48 | | washer | 68.34 | 72.03 | | plaything | 33.51 | 42.4 | | swimming pool | 67.7 | 69.33 | | stool | 44.79 | 61.26 | | barrel | 22.14 | 70.55 | | basket | 35.33 | 46.53 | | waterfall | 65.31 | 88.07 | | tent | 72.08 | 97.47 | | bag | 25.17 | 33.72 | | minibike | 73.56 | 85.0 | | cradle | 84.64 | 96.49 | | oven | 51.87 | 63.1 | | ball | 49.35 | 62.24 | | food | 53.23 | 57.49 | | step | 9.52 | 12.02 | | tank | 49.34 | 52.86 | | trade name | 28.47 | 35.82 | | microwave | 86.72 | 94.02 | | pot | 49.69 | 58.14 | | animal | 64.35 | 68.51 | | bicycle | 59.18 | 74.72 | | lake | 58.03 | 64.58 | | dishwasher | 72.28 | 87.78 | | screen | 60.55 | 78.49 | | blanket | 31.04 | 37.79 | | sculpture | 69.8 | 81.08 | | hood | 64.17 | 72.7 | | sconce | 46.38 | 58.25 | | vase | 44.73 | 63.0 | | traffic light | 41.28 | 59.51 | | tray | 20.36 | 29.53 | | ashcan | 45.1 | 56.76 | | fan | 63.78 | 74.7 | | pier | 40.89 | 45.21 | | crt screen | 6.61 | 19.73 | | plate | 59.06 | 77.64 | | monitor | 5.48 | 6.11 | | bulletin board | 57.96 | 67.15 | | shower | 3.36 | 5.16 | | radiator | 61.43 | 71.65 | | glass | 19.26 | 21.32 | | clock | 40.8 | 49.13 | | flag | 44.3 | 51.55 | +---------------------+-------+-------+ 2024/01/13 08:54:36 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.5280 coco/bbox_mAP_50: 0.7090 coco/bbox_mAP_75: 0.5780 coco/bbox_mAP_s: 0.3670 coco/bbox_mAP_m: 0.5730 coco/bbox_mAP_l: 0.6850 coco/segm_mAP: 0.3580 coco/segm_mAP_50: 0.6250 coco/segm_mAP_75: 0.3570 coco/segm_mAP_s: 0.2060 coco/segm_mAP_m: 0.4050 coco/segm_mAP_l: 0.5460 Bleu_1: 0.7763 Bleu_2: 0.6156 Bleu_3: 0.4751 Bleu_4: 0.3624 METEOR: 0.2838 ROUGE_L: 0.5744 CIDEr: 1.1849 SPICE: 0.2112 aAcc: 84.4500 mIoU: 52.4000 mAcc: 64.2600 visual-grounding/miou: 0.8374 visual-grounding/acc: 0.8921 data_time: 0.0267 time: 1.9124 2024/01/13 09:06:08 - mmengine - INFO - Iter(train) [620500/640000] base_lr: 4.5782e-07 lr: 4.5782e-08 eta: 7:42:21 time: 1.4377 data_time: 0.0186 memory: 34667 grad_norm: 3.6987 loss: 1.0218 caption_loss_cls: 1.8403 detection_loss_cls: 0.0218 detection_loss_reg: 0.2881 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8651 instance_segmentation_loss_cls: 0.0224 instance_segmentation_loss_reg: 0.2941 instance_segmentation_loss_poly: 0.7784 2024/01/13 09:17:45 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 09:17:45 - mmengine - INFO - Iter(train) [621000/640000] base_lr: 4.3466e-07 lr: 4.3466e-08 eta: 7:29:57 time: 1.4297 data_time: 0.0187 memory: 25573 grad_norm: 3.7112 loss: 1.0186 caption_loss_cls: 1.8393 detection_loss_cls: 0.0218 detection_loss_reg: 0.2881 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8667 instance_segmentation_loss_cls: 0.0223 instance_segmentation_loss_reg: 0.2934 instance_segmentation_loss_poly: 0.7773 2024/01/13 09:29:22 - mmengine - INFO - Iter(train) [621500/640000] base_lr: 4.1210e-07 lr: 4.1210e-08 eta: 7:17:39 time: 1.4125 data_time: 0.0188 memory: 25573 grad_norm: 3.7587 loss: 1.0165 caption_loss_cls: 1.8392 detection_loss_cls: 0.0218 detection_loss_reg: 0.2898 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8664 instance_segmentation_loss_cls: 0.0223 instance_segmentation_loss_reg: 0.2939 instance_segmentation_loss_poly: 0.7782 2024/01/13 09:41:01 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 09:41:01 - mmengine - INFO - Iter(train) [622000/640000] base_lr: 3.9014e-07 lr: 3.9014e-08 eta: 7:05:30 time: 1.4135 data_time: 0.0191 memory: 25573 grad_norm: 3.7851 loss: 1.0244 caption_loss_cls: 1.8410 detection_loss_cls: 0.0219 detection_loss_reg: 0.2900 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.8651 instance_segmentation_loss_cls: 0.0223 instance_segmentation_loss_reg: 0.2947 instance_segmentation_loss_poly: 0.7809 2024/01/13 09:41:01 - mmengine - INFO - Saving checkpoint at 622000 iterations 2024/01/13 09:53:16 - mmengine - INFO - Iter(train) [622500/640000] base_lr: 3.6878e-07 lr: 3.6878e-08 eta: 6:54:24 time: 1.4147 data_time: 0.0208 memory: 25573 grad_norm: 3.7340 loss: 1.0288 caption_loss_cls: 1.8409 detection_loss_cls: 0.0220 detection_loss_reg: 0.2918 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8652 instance_segmentation_loss_cls: 0.0224 instance_segmentation_loss_reg: 0.2945 instance_segmentation_loss_poly: 0.7810 2024/01/13 10:04:47 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 10:04:47 - mmengine - INFO - Iter(train) [623000/640000] base_lr: 3.4802e-07 lr: 3.4802e-08 eta: 6:42:01 time: 1.4059 data_time: 0.0210 memory: 25573 grad_norm: 3.6557 loss: 1.0286 caption_loss_cls: 1.8423 detection_loss_cls: 0.0222 detection_loss_reg: 0.2924 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.8659 instance_segmentation_loss_cls: 0.0223 instance_segmentation_loss_reg: 0.2936 instance_segmentation_loss_poly: 0.7784 2024/01/13 10:15:58 - mmengine - INFO - Iter(train) [623500/640000] base_lr: 3.2786e-07 lr: 3.2786e-08 eta: 6:29:15 time: 1.4002 data_time: 0.0211 memory: 25573 grad_norm: 3.6683 loss: 1.0235 caption_loss_cls: 1.8380 detection_loss_cls: 0.0223 detection_loss_reg: 0.2946 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8675 instance_segmentation_loss_cls: 0.0222 instance_segmentation_loss_reg: 0.2940 instance_segmentation_loss_poly: 0.7803 2024/01/13 10:27:44 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 10:27:44 - mmengine - INFO - Iter(train) [624000/640000] base_lr: 3.0831e-07 lr: 3.0831e-08 eta: 6:17:25 time: 1.3964 data_time: 0.0214 memory: 25573 grad_norm: 3.6828 loss: 1.0334 caption_loss_cls: 1.8407 detection_loss_cls: 0.0224 detection_loss_reg: 0.2958 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8675 instance_segmentation_loss_cls: 0.0222 instance_segmentation_loss_reg: 0.2946 instance_segmentation_loss_poly: 0.7824 2024/01/13 10:27:44 - mmengine - INFO - Saving checkpoint at 624000 iterations 2024/01/13 10:39:19 - mmengine - INFO - Iter(train) [624500/640000] base_lr: 2.8935e-07 lr: 2.8935e-08 eta: 6:05:21 time: 1.3964 data_time: 0.0276 memory: 25573 grad_norm: 3.6575 loss: 1.0339 caption_loss_cls: 1.8375 detection_loss_cls: 0.0224 detection_loss_reg: 0.2956 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8710 instance_segmentation_loss_cls: 0.0222 instance_segmentation_loss_reg: 0.2958 instance_segmentation_loss_poly: 0.7849 2024/01/13 10:51:03 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 10:51:03 - mmengine - INFO - Iter(train) [625000/640000] base_lr: 2.7099e-07 lr: 2.7099e-08 eta: 5:53:29 time: 1.3984 data_time: 0.0277 memory: 25573 grad_norm: 3.6145 loss: 1.0326 caption_loss_cls: 1.8355 detection_loss_cls: 0.0225 detection_loss_reg: 0.2964 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8659 instance_segmentation_loss_cls: 0.0225 instance_segmentation_loss_reg: 0.2983 instance_segmentation_loss_poly: 0.7897 2024/01/13 11:02:22 - mmengine - INFO - Iter(train) [625500/640000] base_lr: 2.5323e-07 lr: 2.5323e-08 eta: 5:41:11 time: 1.3939 data_time: 0.0277 memory: 25573 grad_norm: 3.5976 loss: 1.0396 caption_loss_cls: 1.8361 detection_loss_cls: 0.0225 detection_loss_reg: 0.2953 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8696 instance_segmentation_loss_cls: 0.0225 instance_segmentation_loss_reg: 0.2990 instance_segmentation_loss_poly: 0.7914 2024/01/13 11:13:44 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 11:13:44 - mmengine - INFO - Iter(train) [626000/640000] base_lr: 2.3608e-07 lr: 2.3608e-08 eta: 5:29:02 time: 1.3897 data_time: 0.0276 memory: 25573 grad_norm: 3.5922 loss: 1.0366 caption_loss_cls: 1.8328 detection_loss_cls: 0.0224 detection_loss_reg: 0.2953 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8659 instance_segmentation_loss_cls: 0.0224 instance_segmentation_loss_reg: 0.2981 instance_segmentation_loss_poly: 0.7895 2024/01/13 11:13:44 - mmengine - INFO - Saving checkpoint at 626000 iterations 2024/01/13 11:25:55 - mmengine - INFO - Iter(train) [626500/640000] base_lr: 2.1952e-07 lr: 2.1952e-08 eta: 5:17:40 time: 1.3884 data_time: 0.0271 memory: 25573 grad_norm: 3.6816 loss: 1.0413 caption_loss_cls: 1.8330 detection_loss_cls: 0.0224 detection_loss_reg: 0.2959 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8661 instance_segmentation_loss_cls: 0.0223 instance_segmentation_loss_reg: 0.2981 instance_segmentation_loss_poly: 0.7892 2024/01/13 11:37:50 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 11:37:50 - mmengine - INFO - Iter(train) [627000/640000] base_lr: 2.0357e-07 lr: 2.0357e-08 eta: 5:06:01 time: 1.3946 data_time: 0.0273 memory: 25573 grad_norm: 3.7137 loss: 1.0437 caption_loss_cls: 1.8398 detection_loss_cls: 0.0222 detection_loss_reg: 0.2945 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8612 instance_segmentation_loss_cls: 0.0223 instance_segmentation_loss_reg: 0.2992 instance_segmentation_loss_poly: 0.7916 2024/01/13 11:49:21 - mmengine - INFO - Iter(train) [627500/640000] base_lr: 1.8822e-07 lr: 1.8822e-08 eta: 4:54:02 time: 1.3996 data_time: 0.0274 memory: 25573 grad_norm: 3.6823 loss: 1.0403 caption_loss_cls: 1.8380 detection_loss_cls: 0.0225 detection_loss_reg: 0.2976 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8607 instance_segmentation_loss_cls: 0.0225 instance_segmentation_loss_reg: 0.3001 instance_segmentation_loss_poly: 0.7930 2024/01/13 12:01:17 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 12:01:17 - mmengine - INFO - Iter(train) [628000/640000] base_lr: 1.7347e-07 lr: 1.7347e-08 eta: 4:42:24 time: 1.4021 data_time: 0.0273 memory: 25573 grad_norm: 3.6689 loss: 1.0245 caption_loss_cls: 1.8349 detection_loss_cls: 0.0227 detection_loss_reg: 0.3003 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8573 instance_segmentation_loss_cls: 0.0225 instance_segmentation_loss_reg: 0.3000 instance_segmentation_loss_poly: 0.7934 2024/01/13 12:01:17 - mmengine - INFO - Saving checkpoint at 628000 iterations 2024/01/13 12:13:41 - mmengine - INFO - Iter(train) [628500/640000] base_lr: 1.5932e-07 lr: 1.5932e-08 eta: 4:31:04 time: 1.4143 data_time: 0.0275 memory: 25573 grad_norm: 3.6431 loss: 1.0163 caption_loss_cls: 1.8305 detection_loss_cls: 0.0226 detection_loss_reg: 0.3006 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8561 instance_segmentation_loss_cls: 0.0225 instance_segmentation_loss_reg: 0.2995 instance_segmentation_loss_poly: 0.7913 2024/01/13 12:25:10 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 12:25:10 - mmengine - INFO - Iter(train) [629000/640000] base_lr: 1.4577e-07 lr: 1.4577e-08 eta: 4:19:05 time: 1.4107 data_time: 0.0273 memory: 25573 grad_norm: 3.6790 loss: 1.0163 caption_loss_cls: 1.8285 detection_loss_cls: 0.0225 detection_loss_reg: 0.2994 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8541 instance_segmentation_loss_cls: 0.0224 instance_segmentation_loss_reg: 0.2987 instance_segmentation_loss_poly: 0.7889 2024/01/13 12:36:40 - mmengine - INFO - Iter(train) [629500/640000] base_lr: 1.3282e-07 lr: 1.3282e-08 eta: 4:07:08 time: 1.4135 data_time: 0.0273 memory: 25573 grad_norm: 3.6848 loss: 1.0208 caption_loss_cls: 1.8342 detection_loss_cls: 0.0226 detection_loss_reg: 0.2997 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.8558 instance_segmentation_loss_cls: 0.0223 instance_segmentation_loss_reg: 0.2984 instance_segmentation_loss_poly: 0.7870 2024/01/13 12:48:25 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 12:48:25 - mmengine - INFO - Iter(train) [630000/640000] base_lr: 1.2048e-07 lr: 1.2048e-08 eta: 3:55:20 time: 1.4189 data_time: 0.0274 memory: 25573 grad_norm: 3.6391 loss: 1.0128 caption_loss_cls: 1.8307 detection_loss_cls: 0.0225 detection_loss_reg: 0.2995 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8552 instance_segmentation_loss_cls: 0.0222 instance_segmentation_loss_reg: 0.2985 instance_segmentation_loss_poly: 0.7875 2024/01/13 12:48:25 - mmengine - INFO - Saving checkpoint at 630000 iterations 2024/01/13 13:00:37 - mmengine - INFO - Iter(train) [630500/640000] base_lr: 1.0874e-07 lr: 1.0874e-08 eta: 3:43:48 time: 1.4195 data_time: 0.0275 memory: 25573 grad_norm: 3.5963 loss: 1.0158 caption_loss_cls: 1.8251 detection_loss_cls: 0.0227 detection_loss_reg: 0.3004 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.8534 instance_segmentation_loss_cls: 0.0224 instance_segmentation_loss_reg: 0.2999 instance_segmentation_loss_poly: 0.7915 2024/01/13 13:11:56 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 13:11:56 - mmengine - INFO - Iter(train) [631000/640000] base_lr: 9.7593e-08 lr: 9.7593e-09 eta: 3:31:48 time: 1.4104 data_time: 0.0274 memory: 25573 grad_norm: 3.6082 loss: 1.0193 caption_loss_cls: 1.8271 detection_loss_cls: 0.0227 detection_loss_reg: 0.3012 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8583 instance_segmentation_loss_cls: 0.0224 instance_segmentation_loss_reg: 0.3003 instance_segmentation_loss_poly: 0.7929 2024/01/13 13:24:08 - mmengine - INFO - Iter(train) [631500/640000] base_lr: 8.7054e-08 lr: 8.7054e-09 eta: 3:20:13 time: 1.4207 data_time: 0.0275 memory: 25573 grad_norm: 3.5796 loss: 1.0093 caption_loss_cls: 1.8216 detection_loss_cls: 0.0228 detection_loss_reg: 0.3012 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.8569 instance_segmentation_loss_cls: 0.0225 instance_segmentation_loss_reg: 0.3015 instance_segmentation_loss_poly: 0.7947 2024/01/13 13:35:48 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 13:35:48 - mmengine - INFO - Iter(train) [632000/640000] base_lr: 7.7116e-08 lr: 7.7116e-09 eta: 3:08:23 time: 1.4165 data_time: 0.0273 memory: 25573 grad_norm: 3.6146 loss: 1.0226 caption_loss_cls: 1.8206 detection_loss_cls: 0.0227 detection_loss_reg: 0.3000 semantic_segmentation_loss_cls: 0.0063 grounding_loss_reg: 1.8611 instance_segmentation_loss_cls: 0.0226 instance_segmentation_loss_reg: 0.3020 instance_segmentation_loss_poly: 0.7960 2024/01/13 13:35:48 - mmengine - INFO - Saving checkpoint at 632000 iterations 2024/01/13 13:47:58 - mmengine - INFO - Iter(train) [632500/640000] base_lr: 6.7780e-08 lr: 6.7780e-09 eta: 2:56:45 time: 1.4131 data_time: 0.0273 memory: 25573 grad_norm: 3.5834 loss: 1.0280 caption_loss_cls: 1.8173 detection_loss_cls: 0.0227 detection_loss_reg: 0.3016 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8632 instance_segmentation_loss_cls: 0.0226 instance_segmentation_loss_reg: 0.3022 instance_segmentation_loss_poly: 0.7969 2024/01/13 13:59:22 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 13:59:22 - mmengine - INFO - Iter(train) [633000/640000] base_lr: 5.9046e-08 lr: 5.9046e-09 eta: 2:44:50 time: 1.4118 data_time: 0.0273 memory: 25573 grad_norm: 3.6241 loss: 1.0397 caption_loss_cls: 1.8160 detection_loss_cls: 0.0228 detection_loss_reg: 0.3019 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8610 instance_segmentation_loss_cls: 0.0226 instance_segmentation_loss_reg: 0.3020 instance_segmentation_loss_poly: 0.7967 2024/01/13 14:11:14 - mmengine - INFO - Iter(train) [633500/640000] base_lr: 5.0914e-08 lr: 5.0914e-09 eta: 2:33:05 time: 1.4174 data_time: 0.0273 memory: 25573 grad_norm: 3.6264 loss: 1.0297 caption_loss_cls: 1.8169 detection_loss_cls: 0.0228 detection_loss_reg: 0.3026 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8589 instance_segmentation_loss_cls: 0.0224 instance_segmentation_loss_reg: 0.2996 instance_segmentation_loss_poly: 0.7907 2024/01/13 14:23:10 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 14:23:10 - mmengine - INFO - Iter(train) [634000/640000] base_lr: 4.3384e-08 lr: 4.3384e-09 eta: 2:21:21 time: 1.4203 data_time: 0.0273 memory: 25573 grad_norm: 3.5935 loss: 1.0292 caption_loss_cls: 1.8201 detection_loss_cls: 0.0230 detection_loss_reg: 0.3045 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8585 instance_segmentation_loss_cls: 0.0225 instance_segmentation_loss_reg: 0.3008 instance_segmentation_loss_poly: 0.7935 2024/01/13 14:23:10 - mmengine - INFO - Saving checkpoint at 634000 iterations 2024/01/13 14:35:26 - mmengine - INFO - Iter(train) [634500/640000] base_lr: 3.6456e-08 lr: 3.6456e-09 eta: 2:09:41 time: 1.4210 data_time: 0.0273 memory: 25573 grad_norm: 3.5684 loss: 1.0262 caption_loss_cls: 1.8198 detection_loss_cls: 0.0231 detection_loss_reg: 0.3058 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8541 instance_segmentation_loss_cls: 0.0224 instance_segmentation_loss_reg: 0.3010 instance_segmentation_loss_poly: 0.7941 2024/01/13 14:46:40 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 14:46:40 - mmengine - INFO - Iter(train) [635000/640000] base_lr: 3.0130e-08 lr: 3.0130e-09 eta: 1:57:46 time: 1.4198 data_time: 0.0273 memory: 25573 grad_norm: 3.5708 loss: 1.0249 caption_loss_cls: 1.8199 detection_loss_cls: 0.0230 detection_loss_reg: 0.3052 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8563 instance_segmentation_loss_cls: 0.0224 instance_segmentation_loss_reg: 0.3009 instance_segmentation_loss_poly: 0.7951 2024/01/13 14:58:05 - mmengine - INFO - Iter(train) [635500/640000] base_lr: 2.4407e-08 lr: 2.4407e-09 eta: 1:45:55 time: 1.4080 data_time: 0.0271 memory: 25573 grad_norm: 3.6472 loss: 1.0433 caption_loss_cls: 1.8262 detection_loss_cls: 0.0231 detection_loss_reg: 0.3055 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8564 instance_segmentation_loss_cls: 0.0226 instance_segmentation_loss_reg: 0.3020 instance_segmentation_loss_poly: 0.7966 2024/01/13 15:09:31 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 15:09:31 - mmengine - INFO - Iter(train) [636000/640000] base_lr: 1.9286e-08 lr: 1.9286e-09 eta: 1:34:06 time: 1.4047 data_time: 0.0272 memory: 25573 grad_norm: 3.6456 loss: 1.0437 caption_loss_cls: 1.8245 detection_loss_cls: 0.0232 detection_loss_reg: 0.3067 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8549 instance_segmentation_loss_cls: 0.0225 instance_segmentation_loss_reg: 0.3007 instance_segmentation_loss_poly: 0.7945 2024/01/13 15:09:31 - mmengine - INFO - Saving checkpoint at 636000 iterations 2024/01/13 15:21:39 - mmengine - INFO - Iter(train) [636500/640000] base_lr: 1.4767e-08 lr: 1.4767e-09 eta: 1:22:23 time: 1.4041 data_time: 0.0272 memory: 25573 grad_norm: 3.6497 loss: 1.0340 caption_loss_cls: 1.8198 detection_loss_cls: 0.0233 detection_loss_reg: 0.3077 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8527 instance_segmentation_loss_cls: 0.0224 instance_segmentation_loss_reg: 0.3002 instance_segmentation_loss_poly: 0.7938 2024/01/13 15:33:50 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 15:33:50 - mmengine - INFO - Iter(train) [637000/640000] base_lr: 1.0850e-08 lr: 1.0850e-09 eta: 1:10:40 time: 1.4157 data_time: 0.0273 memory: 25573 grad_norm: 3.5830 loss: 1.0268 caption_loss_cls: 1.8213 detection_loss_cls: 0.0233 detection_loss_reg: 0.3082 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8573 instance_segmentation_loss_cls: 0.0225 instance_segmentation_loss_reg: 0.2996 instance_segmentation_loss_poly: 0.7934 2024/01/13 15:45:00 - mmengine - INFO - Iter(train) [637500/640000] base_lr: 7.5358e-09 lr: 7.5358e-10 eta: 0:58:49 time: 1.4051 data_time: 0.0274 memory: 25573 grad_norm: 3.5973 loss: 1.0397 caption_loss_cls: 1.8195 detection_loss_cls: 0.0234 detection_loss_reg: 0.3088 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8568 instance_segmentation_loss_cls: 0.0226 instance_segmentation_loss_reg: 0.3005 instance_segmentation_loss_poly: 0.7950 2024/01/13 15:55:54 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 15:55:54 - mmengine - INFO - Iter(train) [638000/640000] base_lr: 4.8239e-09 lr: 4.8239e-10 eta: 0:46:59 time: 1.3898 data_time: 0.0272 memory: 25573 grad_norm: 3.7008 loss: 1.0467 caption_loss_cls: 1.8184 detection_loss_cls: 0.0235 detection_loss_reg: 0.3104 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8594 instance_segmentation_loss_cls: 0.0227 instance_segmentation_loss_reg: 0.3009 instance_segmentation_loss_poly: 0.7961 2024/01/13 15:55:54 - mmengine - INFO - Saving checkpoint at 638000 iterations 2024/01/13 16:07:43 - mmengine - INFO - Iter(train) [638500/640000] base_lr: 2.7144e-09 lr: 2.7144e-10 eta: 0:35:15 time: 1.3832 data_time: 0.0270 memory: 25573 grad_norm: 3.7575 loss: 1.0356 caption_loss_cls: 1.8174 detection_loss_cls: 0.0234 detection_loss_reg: 0.3097 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8582 instance_segmentation_loss_cls: 0.0225 instance_segmentation_loss_reg: 0.2989 instance_segmentation_loss_poly: 0.7907 2024/01/13 16:19:30 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 16:19:30 - mmengine - INFO - Iter(train) [639000/640000] base_lr: 1.2072e-09 lr: 1.2072e-10 eta: 0:23:30 time: 1.3914 data_time: 0.0271 memory: 25573 grad_norm: 3.7476 loss: 1.0304 caption_loss_cls: 1.8160 detection_loss_cls: 0.0233 detection_loss_reg: 0.3080 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8592 instance_segmentation_loss_cls: 0.0226 instance_segmentation_loss_reg: 0.3001 instance_segmentation_loss_poly: 0.7942 2024/01/13 16:30:44 - mmengine - INFO - Iter(train) [639500/640000] base_lr: 3.0240e-10 lr: 3.0240e-11 eta: 0:11:44 time: 1.3885 data_time: 0.0272 memory: 25573 grad_norm: 3.7481 loss: 1.0314 caption_loss_cls: 1.8176 detection_loss_cls: 0.0233 detection_loss_reg: 0.3080 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8553 instance_segmentation_loss_cls: 0.0225 instance_segmentation_loss_reg: 0.3003 instance_segmentation_loss_poly: 0.7951 2024/01/13 16:42:30 - mmengine - INFO - Exp name: univision_joint_5_cosine_filteranno_64witer_1120_text_fixbug_final_huge_dpr0p4_detnofilter_20240113_051803 2024/01/13 16:42:30 - mmengine - INFO - Iter(train) [640000/640000] base_lr: 1.2048e-15 lr: 1.2048e-16 eta: 0:00:00 time: 1.3936 data_time: 0.0272 memory: 25573 grad_norm: 3.7141 loss: 1.0219 caption_loss_cls: 1.8222 detection_loss_cls: 0.0233 detection_loss_reg: 0.3092 semantic_segmentation_loss_cls: 0.0062 grounding_loss_reg: 1.8540 instance_segmentation_loss_cls: 0.0224 instance_segmentation_loss_reg: 0.2978 instance_segmentation_loss_poly: 0.7888 2024/01/13 16:42:30 - mmengine - INFO - Saving checkpoint at 640000 iterations 2024/01/13 16:54:17 - mmengine - INFO - Evaluating bbox... 2024/01/13 16:55:14 - mmengine - INFO - bbox_mAP_copypaste: 0.529 0.710 0.578 0.369 0.574 0.685 2024/01/13 16:55:14 - mmengine - INFO - Evaluating segm... 2024/01/13 16:56:25 - mmengine - INFO - segm_mAP_copypaste: 0.358 0.626 0.356 0.205 0.404 0.547 2024/01/13 16:58:33 - mmengine - INFO - Evaluating bbox... 2024/01/13 16:59:31 - mmengine - INFO - bbox_mAP_copypaste: 0.528 0.709 0.577 0.367 0.572 0.686 2024/01/13 17:05:54 - mmengine - INFO - per class results: 2024/01/13 17:05:54 - mmengine - INFO - +---------------------+-------+-------+ | Class | IoU | Acc | +---------------------+-------+-------+ | wall | 79.51 | 89.97 | | building | 82.91 | 91.96 | | sky | 93.57 | 97.91 | | floor | 82.65 | 90.93 | | tree | 74.37 | 87.52 | | ceiling | 86.47 | 94.68 | | road | 84.79 | 90.25 | | bed | 90.88 | 96.01 | | windowpane | 63.89 | 79.92 | | grass | 66.94 | 81.64 | | cabinet | 63.75 | 74.11 | | sidewalk | 69.67 | 81.78 | | person | 82.11 | 92.04 | | earth | 38.1 | 50.49 | | door | 55.98 | 71.25 | | table | 64.72 | 80.35 | | mountain | 60.84 | 77.85 | | plant | 53.86 | 64.05 | | curtain | 76.6 | 87.88 | | chair | 62.92 | 76.13 | | car | 85.5 | 92.31 | | water | 60.37 | 72.97 | | painting | 74.25 | 87.71 | | sofa | 72.99 | 83.84 | | shelf | 46.98 | 67.81 | | house | 46.24 | 62.46 | | sea | 61.69 | 76.92 | | mirror | 69.5 | 78.31 | | rug | 64.82 | 74.95 | | field | 37.68 | 57.99 | | armchair | 53.27 | 72.01 | | seat | 67.41 | 82.48 | | fence | 47.86 | 64.74 | | desk | 53.21 | 69.91 | | rock | 49.81 | 71.42 | | wardrobe | 46.01 | 66.33 | | lamp | 67.71 | 80.0 | | bathtub | 80.52 | 89.71 | | railing | 41.65 | 58.36 | | cushion | 65.06 | 79.53 | | base | 21.8 | 30.26 | | box | 30.55 | 39.56 | | column | 57.06 | 66.29 | | signboard | 39.04 | 52.59 | | chest of drawers | 37.64 | 56.76 | | counter | 29.72 | 42.93 | | sand | 45.96 | 68.23 | | sink | 77.01 | 85.41 | | skyscraper | 46.95 | 58.93 | | fireplace | 75.39 | 88.34 | | refrigerator | 81.3 | 86.19 | | grandstand | 48.28 | 81.6 | | path | 21.77 | 31.57 | | stairs | 33.7 | 40.93 | | runway | 74.23 | 95.18 | | case | 50.98 | 65.43 | | pool table | 92.22 | 95.67 | | pillow | 61.62 | 71.45 | | screen door | 70.9 | 74.22 | | stairway | 34.8 | 44.48 | | river | 14.65 | 33.38 | | bridge | 68.66 | 79.59 | | bookcase | 42.39 | 63.54 | | blind | 38.78 | 43.72 | | coffee table | 65.34 | 81.89 | | toilet | 87.67 | 92.44 | | flower | 43.81 | 57.26 | | book | 50.36 | 69.1 | | hill | 16.42 | 23.52 | | bench | 61.78 | 71.58 | | countertop | 63.54 | 79.57 | | stove | 81.28 | 85.11 | | palm | 49.14 | 72.48 | | kitchen island | 47.04 | 69.43 | | computer | 78.22 | 89.01 | | swivel chair | 45.13 | 59.31 | | boat | 66.22 | 87.68 | | bar | 38.81 | 48.73 | | arcade machine | 68.97 | 72.44 | | hovel | 24.76 | 27.1 | | bus | 86.29 | 94.82 | | towel | 67.08 | 79.1 | | light | 53.15 | 63.72 | | truck | 50.16 | 66.21 | | tower | 35.09 | 58.11 | | chandelier | 68.14 | 78.78 | | awning | 32.49 | 40.22 | | streetlight | 34.36 | 46.06 | | booth | 41.88 | 56.14 | | television receiver | 75.32 | 87.8 | | airplane | 67.64 | 75.84 | | dirt track | 9.16 | 27.87 | | apparel | 32.54 | 47.41 | | pole | 29.42 | 44.22 | | land | 3.23 | 4.3 | | bannister | 15.5 | 21.27 | | escalator | 25.16 | 27.05 | | ottoman | 55.73 | 72.59 | | bottle | 27.85 | 35.89 | | buffet | 57.14 | 64.19 | | poster | 32.31 | 44.81 | | stage | 12.34 | 18.14 | | van | 48.68 | 64.11 | | ship | 9.76 | 10.28 | | fountain | 21.28 | 21.54 | | conveyer belt | 68.55 | 91.56 | | canopy | 36.05 | 43.12 | | washer | 68.33 | 71.99 | | plaything | 33.87 | 42.85 | | swimming pool | 67.74 | 69.38 | | stool | 44.68 | 61.48 | | barrel | 22.13 | 70.55 | | basket | 35.16 | 46.12 | | waterfall | 65.39 | 88.31 | | tent | 72.11 | 97.47 | | bag | 25.1 | 33.64 | | minibike | 73.6 | 84.89 | | cradle | 84.69 | 96.42 | | oven | 52.11 | 63.45 | | ball | 49.38 | 62.47 | | food | 53.02 | 57.25 | | step | 9.37 | 11.78 | | tank | 49.38 | 52.91 | | trade name | 28.17 | 35.44 | | microwave | 86.91 | 93.98 | | pot | 49.87 | 58.41 | | animal | 64.46 | 68.65 | | bicycle | 58.92 | 74.3 | | lake | 57.88 | 64.56 | | dishwasher | 72.16 | 87.63 | | screen | 60.46 | 78.47 | | blanket | 31.17 | 37.93 | | sculpture | 69.67 | 81.33 | | hood | 64.2 | 72.75 | | sconce | 46.53 | 58.35 | | vase | 44.68 | 62.79 | | traffic light | 41.19 | 59.45 | | tray | 20.35 | 29.7 | | ashcan | 45.0 | 56.56 | | fan | 63.73 | 74.77 | | pier | 40.81 | 45.15 | | crt screen | 6.6 | 19.73 | | plate | 58.96 | 77.74 | | monitor | 5.46 | 6.13 | | bulletin board | 58.03 | 67.42 | | shower | 3.5 | 5.34 | | radiator | 61.42 | 71.55 | | glass | 19.14 | 21.18 | | clock | 40.92 | 49.28 | | flag | 44.29 | 51.51 | +---------------------+-------+-------+ 2024/01/13 17:06:06 - mmengine - INFO - Iter(val) [209/209] coco/bbox_mAP: 0.5280 coco/bbox_mAP_50: 0.7090 coco/bbox_mAP_75: 0.5770 coco/bbox_mAP_s: 0.3670 coco/bbox_mAP_m: 0.5720 coco/bbox_mAP_l: 0.6860 coco/segm_mAP: 0.3580 coco/segm_mAP_50: 0.6260 coco/segm_mAP_75: 0.3560 coco/segm_mAP_s: 0.2050 coco/segm_mAP_m: 0.4040 coco/segm_mAP_l: 0.5470 Bleu_1: 0.7758 Bleu_2: 0.6153 Bleu_3: 0.4745 Bleu_4: 0.3618 METEOR: 0.2834 ROUGE_L: 0.5745 CIDEr: 1.1818 SPICE: 0.2106 aAcc: 84.4400 mIoU: 52.3800 mAcc: 64.2400 visual-grounding/miou: 0.8378 visual-grounding/acc: 0.8923 data_time: 0.0126 time: 1.8929