2021-10-07 16:25:08,199 : ***** Transfer task : STS12 ***** 2021-10-07 16:25:11,524 : MSRpar : pearson = 0.6385, spearman = 0.6277, align_loss = 0.1472, uniform_loss = -1.9531 2021-10-07 16:25:12,835 : MSRvid : pearson = 0.8927, spearman = 0.8893, align_loss = 0.1807, uniform_loss = -1.9198 2021-10-07 16:25:13,964 : SMTeuroparl : pearson = 0.5341, spearman = 0.5812, align_loss = 0.1966, uniform_loss = -1.3802 2021-10-07 16:25:15,951 : surprise.OnWN : pearson = 0.7567, spearman = 0.7188, align_loss = 0.2179, uniform_loss = -1.9324 2021-10-07 16:25:17,066 : surprise.SMTnews : pearson = 0.7105, spearman = 0.6142, align_loss = 0.1903, uniform_loss = -1.4507 2021-10-07 16:25:17,071 : ALL : Pearson = 0.8081, Spearman = 0.7174, align_loss = 0.1855, uniform_loss = -1.7752 2021-10-07 16:25:17,071 : ALL (weighted average) : Pearson = 0.7222, Spearman = 0.7042, align_loss = 0.1852, uniform_loss = -1.7910 2021-10-07 16:25:17,072 : ALL (average) : Pearson = 0.7065, Spearman = 0.6862, align_loss = 0.1865, uniform_loss = -1.7272 2021-10-07 16:25:17,077 : ***** Transfer task : STS13 (-SMT) ***** 2021-10-07 16:25:18,125 : FNWN : pearson = 0.6377, spearman = 0.6645, align_loss = 0.2690, uniform_loss = -1.7120 2021-10-07 16:25:19,856 : headlines : pearson = 0.7919, spearman = 0.7916, align_loss = 0.1893, uniform_loss = -1.9193 2021-10-07 16:25:20,985 : OnWN : pearson = 0.8636, spearman = 0.8378, align_loss = 0.2356, uniform_loss = -1.8226 2021-10-07 16:25:20,988 : ALL : Pearson = 0.8223, Spearman = 0.8260, align_loss = 0.2194, uniform_loss = -1.8502 2021-10-07 16:25:20,988 : ALL (weighted average) : Pearson = 0.7993, Spearman = 0.7928, align_loss = 0.2167, uniform_loss = -1.8570 2021-10-07 16:25:20,988 : ALL (average) : Pearson = 0.7644, Spearman = 0.7646, align_loss = 0.2313, uniform_loss = -1.8180 2021-10-07 16:25:20,989 : ***** Transfer task : STS14 ***** 2021-10-07 16:25:22,216 : deft-forum : pearson = 0.5599, spearman = 0.5425, align_loss = 0.2140, uniform_loss = -1.7958 2021-10-07 16:25:23,530 : deft-news : pearson = 0.7945, spearman = 0.7483, align_loss = 0.1556, uniform_loss = -1.8096 2021-10-07 16:25:25,363 : headlines : pearson = 0.7907, spearman = 0.7687, align_loss = 0.1759, uniform_loss = -1.9653 2021-10-07 16:25:26,984 : images : pearson = 0.8765, spearman = 0.8525, align_loss = 0.1784, uniform_loss = -2.0066 2021-10-07 16:25:28,642 : OnWN : pearson = 0.8735, spearman = 0.8602, align_loss = 0.2378, uniform_loss = -1.8534 2021-10-07 16:25:30,908 : tweet-news : pearson = 0.7726, spearman = 0.7054, align_loss = 0.2983, uniform_loss = -1.8457 2021-10-07 16:25:30,913 : ALL : Pearson = 0.7965, Spearman = 0.7567, align_loss = 0.2150, uniform_loss = -1.8916 2021-10-07 16:25:30,913 : ALL (weighted average) : Pearson = 0.7934, Spearman = 0.7623, align_loss = 0.2162, uniform_loss = -1.8945 2021-10-07 16:25:30,913 : ALL (average) : Pearson = 0.7779, Spearman = 0.7463, align_loss = 0.2100, uniform_loss = -1.8794 2021-10-07 16:25:30,918 : ***** Transfer task : STS15 ***** 2021-10-07 16:25:32,577 : answers-forums : pearson = 0.7341, spearman = 0.7387, align_loss = 0.3866, uniform_loss = -1.9430 2021-10-07 16:25:34,257 : answers-students : pearson = 0.7654, spearman = 0.7759, align_loss = 0.2215, uniform_loss = -1.2486 2021-10-07 16:25:36,065 : belief : pearson = 0.8094, spearman = 0.8136, align_loss = 0.2693, uniform_loss = -1.7862 2021-10-07 16:25:38,331 : headlines : pearson = 0.8290, spearman = 0.8334, align_loss = 0.1846, uniform_loss = -1.9589 2021-10-07 16:25:40,201 : images : pearson = 0.9071, spearman = 0.9132, align_loss = 0.1945, uniform_loss = -2.0353 2021-10-07 16:25:40,206 : ALL : Pearson = 0.8362, Spearman = 0.8449, align_loss = 0.2321, uniform_loss = -1.7768 2021-10-07 16:25:40,206 : ALL (weighted average) : Pearson = 0.8183, Spearman = 0.8247, align_loss = 0.2321, uniform_loss = -1.7768 2021-10-07 16:25:40,206 : ALL (average) : Pearson = 0.8090, Spearman = 0.8150, align_loss = 0.2513, uniform_loss = -1.7944 2021-10-07 16:25:40,211 : ***** Transfer task : STS16 ***** 2021-10-07 16:25:41,053 : answer-answer : pearson = 0.7412, spearman = 0.7434, align_loss = 0.2435, uniform_loss = -1.4824 2021-10-07 16:25:41,737 : headlines : pearson = 0.8196, spearman = 0.8381, align_loss = 0.1571, uniform_loss = -1.9802 2021-10-07 16:25:42,487 : plagiarism : pearson = 0.8495, spearman = 0.8620, align_loss = 0.1564, uniform_loss = -1.6272 2021-10-07 16:25:43,891 : postediting : pearson = 0.8548, spearman = 0.8739, align_loss = 0.1171, uniform_loss = -1.7985 2021-10-07 16:25:44,530 : question-question : pearson = 0.7249, spearman = 0.7206, align_loss = 0.1987, uniform_loss = -1.7836 2021-10-07 16:25:44,533 : ALL : Pearson = 0.7944, Spearman = 0.8074, align_loss = 0.1746, uniform_loss = -1.7344 2021-10-07 16:25:44,534 : ALL (weighted average) : Pearson = 0.7992, Spearman = 0.8091, align_loss = 0.1746, uniform_loss = -1.7331 2021-10-07 16:25:44,534 : ALL (average) : Pearson = 0.7980, Spearman = 0.8076, align_loss = 0.1746, uniform_loss = -1.7344 2021-10-07 16:25:44,536 : ***** Transfer task : STSBenchmark***** 2021-10-07 16:26:05,137 : train : pearson = 0.8198, spearman = 0.7975, align_loss = 0.1852, uniform_loss = -1.9620 2021-10-07 16:26:11,036 : dev : pearson = 0.8570, spearman = 0.8597, align_loss = 0.2060, uniform_loss = -1.9924 2021-10-07 16:26:16,108 : test : pearson = 0.8090, spearman = 0.8152, align_loss = 0.1776, uniform_loss = -1.9208 2021-10-07 16:26:16,137 : ALL : Pearson = 0.8258, Spearman = 0.8145, align_loss = 0.1876, uniform_loss = -1.9607 2021-10-07 16:26:16,137 : ALL (weighted average) : Pearson = 0.8246, Spearman = 0.8112, align_loss = 0.1876, uniform_loss = -1.9607 2021-10-07 16:26:16,137 : ALL (average) : Pearson = 0.8286, Spearman = 0.8241, align_loss = 0.1896, uniform_loss = -1.9584 2021-10-07 16:26:16,147 : ***** Transfer task : SICKRelatedness***** 2021-10-07 16:26:28,880 : train : pearson = 0.8053, spearman = 0.7307, align_loss = 0.1797, uniform_loss = -1.9860 2021-10-07 16:26:30,456 : dev : pearson = 0.8165, spearman = 0.7514, align_loss = 0.1858, uniform_loss = -2.1245 2021-10-07 16:26:44,329 : test : pearson = 0.7953, spearman = 0.7230, align_loss = 0.1798, uniform_loss = -1.9833 2021-10-07 16:26:44,336 : ALL : Pearson = 0.8009, Spearman = 0.7280, align_loss = 0.1800, uniform_loss = -1.9917 2021-10-07 16:26:44,336 : ALL (weighted average) : Pearson = 0.8009, Spearman = 0.7279, align_loss = 0.1800, uniform_loss = -1.9916 2021-10-07 16:26:44,336 : ALL (average) : Pearson = 0.8057, Spearman = 0.7350, align_loss = 0.1817, uniform_loss = -2.0313 2021-10-07 16:26:44,336 : ------ test ------ 2021-10-07 16:26:44,337 : +--------+--------+--------+--------+--------+--------------+-----------------+--------+ | STS12 | STS13 | STS14 | STS15 | STS16 | STSBenchmark | SICKRelatedness | Avg. | +--------+--------+--------+--------+--------+--------------+-----------------+--------+ | 71.74 | 82.60 | 75.67 | 84.49 | 80.74 | 81.52 | 72.30 | 78.44 | | 0.185 | 0.219 | 0.215 | 0.232 | 0.175 | 0.178 | 0.180 | 0.198 | | -1.775 | -1.850 | -1.892 | -1.777 | -1.734 | -1.921 | -1.983 | -1.847 | +--------+--------+--------+--------+--------+--------------+-----------------+--------+ 2021-10-07 16:26:44,338 : +------+------+------+------+------+------+------+------+ | MR | CR | SUBJ | MPQA | SST2 | TREC | MRPC | Avg. | +------+------+------+------+------+------+------+------+ | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | +------+------+------+------+------+------+------+------+