Single Image Deraining#Rain100H#PSNR
Question Answering#YahooCQA#P@1
Atari Games#Atari 2600 Private Eye#Score
Speech Recognition#MediaSpeech#WER for Turkish
3D Point Cloud Classification#ModelNet40#Mean Accuracy
Image Clustering#STL-10#Train Split
Time Series Classification#WalkvsRun#NLL
language_modeling#Text8#Number of params
Cross-Lingual Document Classification#MLDoc Zero-Shot English-to-Chinese#Accuracy
Weakly-supervised 3D Human Pose Estimation#Human3.6M#3D Annotations
Semi-Supervised Video Object Segmentation#DAVIS 2017 (test-dev)#Jaccard (Decay)
Image-to-Image Translation#Cityscapes Labels-to-Photo#FID
Neural Architecture Search#ImageNet#Accuracy
Human Pose Forecasting#Human3.6M#MAR, walking, 400ms
Face Detection#WIDER Face (Medium)#AP
Incremental Learning#CIFAR-100 - 50 classes + 10 steps of 5 classes#Average Incremental Accuracy
Heterogeneous Node Classification#DBLP (PACT) 14k#Macro-F1 (60% training data)
Text Simplification#PWKP / WikiSmall#SARI
Network Pruning#ImageNet#Accuracy
Line Segment Detection#York Urban Dataset#sAP10
Visual Dialog#VisDial v0.9 val#R@10
Link Prediction#WN18RR#MR
Stereo-LiDAR Fusion#KITTI Depth Completion Validation#RMSE
Question Answering#WikiHop#Test
Colorectal Gland Segmentation:#CRAG#Dice
Image Super-Resolution#Set14 - 4x upscaling#MOS
Semantic Segmentation#NYU Depth v2#Mean IoU
Fine-Grained Image Classification#DF20 - Mini#F1 - macro
Node Classification#Squirrel#Accuracy
Recommendation Systems#Netflix#Recall@50
6D Pose Estimation using RGB#LineMOD#Mean ADD
Unsupervised Machine Translation#WMT2016 German-English#BLEU
Video Retrieval#LSMDC#text-to-video R@5
Video Retrieval#LSMDC#text-to-video R@1
Semantic Segmentation#S3DIS#oAcc
Recommendation Systems#Netflix#Recall@20
Image Classification#ImageNet ReaL#Params
Natural Language Inference#SNLI#Parameters
Lesion Segmentation#Anatomical Tracings of Lesions After Stroke (ATLAS)#Precision
language_modeling#WikiText-2#Validation perplexity
Lipreading#LRS2#Word Error Rate (WER)
JPEG Artifact Correction#Live1 (Quality 10 Grayscale)#PSNR
Word Sense Disambiguation#WiC-TSV#Task 3 Accuracy: general purpose
Few-Shot Image Classification#Mini-ImageNet - 1-Shot Learning#Accuracy
Image Super-Resolution#Set14 - 3x upscaling#SSIM
Link Prediction#MovieLens 25M#Hits@10
Supervised Video Summarization#SumMe#F1-score (Canonical)
Fine-Grained Image Classification#Oxford 102 Flowers#Accuracy
Panoptic Segmentation#COCO panoptic#PQ
summarization#CNN / Daily Mail (Anonymized version)#METEOR
Link Prediction#Citeseer#AUC
Action Recognition#EPIC-KITCHENS-100#Action@1
Face Detection#Annotated Faces in the Wild#AP
Multimodal Machine Translation#Multi30K#Meteor (EN-DE)
Image-to-Image Translation#Cityscapes Labels-to-Photo#mIoU
Image Retrieval#Flickr30K 1K test#R@5
Image Retrieval#Flickr30K 1K test#R@1
Semi-Supervised Video Object Segmentation#DAVIS 2016#F-measure (Mean)
Pedestrian Detection#CityPersons#Heavy MR^-2
Data-to-Text Generation#E2E NLG Challenge#METEOR
Atari Games#Atari 2600 Skiing#Score
Deblurring#RealBlur-R (trained on GoPro)#PSNR (sRGB)
Semantic Retrieval#Contract Discovery#Soft-F1
Action Recognition#NTU RGB+D 120#Accuracy (Cross-Subject)
Language Modelling#WikiText-103#Number of params
Action Segmentation#50 Salads#F1@25%
Paraphrase Identification#Quora Question Pairs#Accuracy
Semi-Supervised Semantic Segmentation#Cityscapes 100 samples labeled#Validation mIoU
Image Generation#CelebA 64x64#FID
Time Series Classification#Libras#Accuracy
Weakly-supervised 3D Human Pose Estimation#Human3.6M#Number of Frames Per View
Robotic Grasping#Cornell Grasp Dataset#5 fold cross validation
Referring Expression Segmentation#RefCOCO testB#IoU
JPEG Artifact Correction#LIVE1 (Quality 20 Grayscale)#PSNR-B
Visual Navigation#Cooperative Vision-and-Dialogue Navigation#spl
Skeleton Based Action Recognition#Kinetics-Skeleton dataset#Accuracy
Semi-Supervised Video Object Segmentation#DAVIS 2017 (test-dev)#F-measure (Mean)
3D Human Pose Estimation#3DPW#MPVPE
Action Recognition#Something-Something V1#Top 5 Accuracy
language_modeling#Text8#Bit per Character (BPC)
Image Generation#LSUN Bedroom 256 x 256#FID
Deblurring#RealBlur-J (trained on GoPro)#SSIM (sRGB)
Skeleton Based Action Recognition#NTU RGB+D#Accuracy (CS)
relation_prediction#FB15K-237#H@1
Video Captioning#YouCook2#METEOR
Semantic Textual Similarity#STS Benchmark#Pearson Correlation
Speech Recognition#LibriSpeech test-clean#Word Error Rate (WER)
Video Retrieval#MSR-VTT#text-to-video R@10
Knowledge Graph Completion#FB15k-237#Hits@10
Graph Regression#ZINC 100k#MAE
Open-Domain Question Answering#SearchQA#Unigram Acc
Chinese Named Entity Recognition#OntoNotes 4#F1
Scene Text Detection#Total-Text#F-Measure
Atari Games#Atari 2600 James Bond#Score
Time Series Classification#CMUsubject16#NLL
Skeleton Based Action Recognition#Varying-view RGB-D Action-Skeleton#Accuracy (CV I)
Text-to-Image Generation#Multi-Modal-CelebA-HQ#LPIPS
Graph Classification#IMDb-M#Accuracy
Skeleton Based Action Recognition#NTU RGB+D#Accuracy (CV)
Neural Architecture Search#CIFAR-10 Image Classification#Params
Nested Mention Recognition#ACE 2004#F1
JPEG Artifact Correction#LIVE1 (Quality 20 Color)#SSIM
Entity Linking#WiC-TSV#Task 1 Accuracy: all
Semi-Supervised Video Object Segmentation#DAVIS 2017 (test-dev)#F-measure (Recall)
Few-Shot Image Classification#CIFAR-FS 5-way (1-shot)#Accuracy
Deblurring#RealBlur-R (trained on GoPro)#SSIM (sRGB)
Action Recognition#Something-Something V2#GFLOPs
Unsupervised Video Object Segmentation#DAVIS 2016#Jaccard (Recall)
Conversational Response Selection#Ubuntu Dialogue (v1, Ranking)#R2@1
Music Source Separation#MUSDB18#SDR (bass)
Language Modelling#Penn Treebank (Word Level)#Params
Object Detection#PASCAL VOC 2007#MAP
Common Sense Reasoning#CommonsenseQA#Accuracy
JPEG Artifact Correction#ICB (Quality 20 Color)#SSIM
Person Re-Identification#CUHK03 detected#Rank-1
Image Generation#ImageNet 128x128#FID
Image Retrieval with Multi-Modal Query#Fashion200k#Recall@1
Dependency Parsing#Penn Treebank#LAS
Time Series Classification#AUSLAN#NLL
Language Modelling#Hutter Prize#Number of params
Hand Pose Estimation#NYU Hands#Average 3D Error
Conversational Response Selection#Ubuntu Dialogue (v1, Ranking)#R10@5
dependency_parsing#Penn Treebank#UAS
Visual Dialog#VisDial v0.9 val#Mean Rank
Conversational Response Selection#Ubuntu Dialogue (v1, Ranking)#R10@1
Conversational Response Selection#Ubuntu Dialogue (v1, Ranking)#R10@2
Semi-Supervised Video Object Segmentation#DAVIS 2016#F-measure (Decay)
Person Re-Identification#CUHK03#MAP
Retinal Vessel Segmentation#CHASE_DB1#F1 score
Grayscale Image Denoising#Urban100 sigma25#PSNR
Image-to-Image Translation#Cityscapes Labels-to-Photo#Class IOU
Action Recognition#Something-Something V2#Parameters
Question Answering#Natural Questions (short)#F1
Multivariate Time Series Forecasting#MIMIC-III#NegLL
Brain Tumor Segmentation#BRATS-2015#Dice Score
Paraphrase Identification#Quora Question Pairs#F1
Image Super-Resolution#BSD100 - 3x upscaling#PSNR
RGB-D Salient Object Detection#STERE#max E-Measure
language_modeling#Penn Treebank#Validation perplexity
Click-Through Rate Prediction#Criteo#Log Loss
Action Recognition#ActivityNet#mAP
Domain Generalization#ImageNet-R#Top-1 Error Rate
Domain Adaptation#USPS-to-MNIST#Accuracy
Atari Games#Atari 2600 Crazy Climber#Score
Heterogeneous Node Classification#DBLP (PACT) 14k#Macro-F1 (80% training data)
Open-Domain Question Answering#Quasar#EM (Quasar-T)
Question Answering#bAbi#Mean Error Rate
Keypoint Detection#COCO test-challenge#AR
Continuous Control#PyBullet Ant#Return
Semi-Supervised Video Object Segmentation#DAVIS 2017 (val)#J&F
Keypoint Detection#COCO test-challenge#AP
Text Classification#TREC-6#Error
Text Classification#Yelp-5#Accuracy
Atari Games#Atari 2600 Ms. Pacman#Score
Text Classification#AG News#Error
Named Entity Recognition#SciERC#F1
Image Classification#Kuzushiji-MNIST#Accuracy
Action Recognition#HACS#Top 5 Accuracy
Few-Shot Image Classification#Stanford Cars 5-way (5-shot)#Accuracy
Time Series Classification#CharacterTrajectories#Accuracy
Coreference Resolution#CoNLL 2012#Avg F1
JPEG Artifact Correction#Classic5 (Quality 10 Grayscale)#PSNR
Sentiment Analysis#Multi-Domain Sentiment Dataset#DVD
Text based Person Retrieval#CUHK-PEDES#R@1
Multi-Person Pose Estimation#COCO#Validation AP
Text based Person Retrieval#CUHK-PEDES#R@5
Language Modelling#WikiText-103#Validation perplexity
Image-to-Image Translation#ADE20K Labels-to-Photos#Accuracy
Recommendation Systems#Million Song Dataset#nDCG@100
Semi-Supervised Video Object Segmentation#DAVIS 2017 (val)#F-measure (Recall)
Instance Segmentation#COCO test-dev#mask AP
Extractive Text Summarization#CNN / Daily Mail#ROUGE-1
Action Classification#Kinetics-600#Top-5 Accuracy
Text-to-Image Generation#Multi-Modal-CelebA-HQ#Real
Action Segmentation#GTEA#Acc
Self-Supervised Action Recognition#UCF101#3-fold Accuracy
Extractive Text Summarization#CNN / Daily Mail#ROUGE-2
3D Object Detection#KITTI Cyclists Easy#AP
Image Generation#STL-10#Inception score
Extractive Text Summarization#CNN / Daily Mail#ROUGE-L
Visual Dialog#VisDial v0.9 val#R@5
Visual Dialog#VisDial v0.9 val#R@1
JPEG Artifact Correction#LIVE1 (Quality 20 Grayscale)#SSIM
Text Summarization#DUC 2004 Task 1#ROUGE-1
Text Summarization#DUC 2004 Task 1#ROUGE-2
Grayscale Image Denoising#Urban100 sigma15#PSNR
Dense Pixel Correspondence Estimation#HPatches#Viewpoint III AEPE
3D Part Segmentation#ShapeNet-Part#Class Average IoU
Text Summarization#DUC 2004 Task 1#ROUGE-L
Gesture-to-Gesture Translation#NTU Hand Digit#AMT
RGB-D Salient Object Detection#SIP#Average MAE
Nested Named Entity Recognition#ACE 2005#F1
Grayscale Image Denoising#BSD68 sigma25#PSNR
Question Answering#FQuAD#F1
Question Answering#FQuAD#EM
Atari Games#Atari 2600 Pong#Score
Skeleton Based Action Recognition#Varying-view RGB-D Action-Skeleton#Accuracy (AV II)
Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#MS-SSIM
Semi-Supervised Video Object Segmentation#DAVIS 2017 (val)#Jaccard (Mean)
Photo geolocation estimation#Im2GPS#Region level (200 km)
Skeleton Based Action Recognition#Varying-view RGB-D Action-Skeleton#Accuracy (CS)
Single Image Deraining#Test1200#SSIM
Chinese Named Entity Recognition#MSRA#F1
Text-to-Image Generation#Multi-Modal-CelebA-HQ#FID
Neural Architecture Search#NAS-Bench-201, ImageNet-16-120#Accuracy (val)
Depth Completion#KITTI Depth Completion#MAE
Few-Shot Image Classification#Mini-Imagenet 20-way (5-shot)#Accuracy
Person Re-Identification#Market-1501#MAP
Recommendation Systems#MovieLens 10M#RMSE
Action Classification#Kinetics-400#Vid acc@1
Semantic Segmentation#S3DIS Area5#mIoU
Action Classification#Kinetics-400#Vid acc@5
Image Super-Resolution#Set14 - 8x upscaling#SSIM
Anomaly Detection#One-class CIFAR-10#AUROC
Image Retrieval#CUB-200-2011#R@1
Node Classification#Cora#Validation
Time Series Classification#DigitShapes#NLL
Image Generation#CelebA-HQ 128x128#FID
Atari Games#Atari 2600 Breakout#Score
Action Segmentation#50 Salads#Acc
Self-Supervised Action Recognition#HMDB51 (finetuned)#Top-1 Accuracy
Emotion Recognition in Conversation#EmoryNLP#Weighted Macro-F1
Language Modelling#enwik8#Number of params
Node Classification#Brazil Air-Traffic#Accuracy
Music Source Separation#MUSDB18#SDR (other)
Unsupervised Video Object Segmentation#DAVIS 2016#F-measure (Recall)
Person Search#PRW#mAP
Sentiment Analysis#Amazon Review Polarity#Accuracy
Deblurring#GoPro#PSNR
Named Entity Recognition#JNLPBA#F1
Object Detection#CrowdHuman (full body)#mMR
Question Answering#CoQA#In-domain
Action Segmentation#50 Salads#F1@50%
Panoptic Segmentation#Cityscapes val#AP
Image-to-Image Translation#SYNTHIA-to-Cityscapes#mIoU (13 classes)
Keypoint Detection#COCO#Test AP
Photo geolocation estimation#Im2GPS#City level (25 km)
Fine-Grained Image Classification#Stanford Cars#Accuracy
Trajectory Prediction#ETH/UCY#ADE-8/12
question_answering#SearchQA#N-gram F1
Single Image Deraining#Test2800#SSIM
Breast Tumour Classification#PCam#AUC
Real-Time Semantic Segmentation#Cityscapes test#Frame (fps)
Person Re-Identification#MSMT17#Rank-1
JPEG Artifact Correction#ICB (Quality 10 Color)#PSNR
Unsupervised MNIST#MNIST#Accuracy
Vision and Language Navigation#VLN Challenge#success
3D Object Detection#KITTI Cars Moderate#AP
Sentiment Analysis#TweetEval#Emoji
Object Detection#iSAID#Average Precision
language_modeling#WikiText-2#Test perplexity
Image Super-Resolution#Urban100 - 3x upscaling#PSNR
Panoptic Segmentation#COCO test-dev#PQ
3D Instance Segmentation#S3DIS#mPrec
Atari Games#Atari-57#Medium Human-Normalized Score
Image Classification#Tiered ImageNet 5-way (5-shot)#Accuracy
Multi-Person Pose Estimation#MPII Multi-Person#AP
Atari Games#Atari 2600 Asteroids#Score
Instance Segmentation#COCO test-dev#AP75
Action Classification#AViD#Accuracy
Face Alignment#WFLW#ME (%, all)
Monocular 3D Human Pose Estimation#Human3.6M#Need Ground Truth 2D Pose
Denoising#Darmstadt Noise Dataset#PSNR
Atari Games#Atari 2600 Assault#Score
Atari Games#Atari 2600 Time Pilot#Score
Hand Pose Estimation#ICVL Hands#Average 3D Error
Atari Games#Atari 2600 Robotank#Score
Pose Estimation#COCO test-dev#APL
Pose Estimation#COCO test-dev#APM
Temporal Action Localization#ActivityNet-1.3#mAP IOU@0.95
Node Classification#Reddit#Accuracy
Face Verification#IJB-A#TAR @ FAR=0.01
Pose Transfer#Deep-Fashion#IS
Atari Games#Atari 2600 Gopher#Score
Natural Language Inference#WNLI#Accuracy
Visual Question Answering#GQA Test2019#Binary
Hand Pose Estimation#MSRA Hands#Average 3D Error
Heterogeneous Node Classification#DBLP (PACT) 14k#Micro-F1 (80% training data)
Image Matting#Composition-1K#MSE
named_entity_recognition#CoNLL 2003 (English)#F1
Node Classification#Europe Air-Traffic#Accuracy
Temporal Action Localization#ActivityNet-1.3#mAP IOU@0.75
Atari Games#Atari 2600 Montezuma's Revenge#Score
Unsupervised Video Object Segmentation#DAVIS 2016#F-measure (Decay)
Real-Time Semantic Segmentation#CamVid#mIoU
Semantic Segmentation#CamVid#Mean IoU
Instance Segmentation#COCO test-dev#AP50
Question Answering#OpenBookQA#Accuracy
Speech Recognition#LibriSpeech test-other#Word Error Rate (WER)
Link Prediction#WN18RR#Hits@3
Panoptic Segmentation#Cityscapes val#PQ
Link Prediction#WN18RR#Hits@1
Click-Through Rate Prediction#Company*#Log Loss
Video Retrieval#MSR-VTT#text-to-video Median Rank
Nested Named Entity Recognition#ACE 2004#F1
Color Image Denoising#Darmstadt Noise Dataset#PSNR (sRGB)
Deblurring#HIDE (trained on GOPRO)#PSNR (sRGB)
Image Generation#FFHQ#FID
Video Captioning#YouCook2#CIDEr
Session-Based Recommendations#Diginetica#MRR@20
Optical Flow Estimation#Sintel-final#Average End-Point Error
Skeleton Based Action Recognition#J-HMDB#Accuracy (RGB+pose)
Action Classification#Kinetics-400#Clip acc@5
Action Classification#Kinetics-400#Clip acc@1
RGB-D Salient Object Detection#NLPR#max E-Measure
3D Object Detection#KITTI Cyclists Hard#AP
Multi-Frame Super-Resolution#PROBA-V#Normalized cPSNR
Recommendation Systems#Flixster Monti#RMSE
Semi-Supervised Video Object Segmentation#DAVIS 2016#Jaccard (Mean)
Image-to-Image Translation#COCO-Stuff Labels-to-Photos#Accuracy
Visual Question Answering#CLEVR#Accuracy
Egocentric Activity Recognition#EPIC-KITCHENS-55#Actions Top-1 (S2)
Self-Supervised Image Classification#ImageNet#Top 1 Accuracy
Click-Through Rate Prediction#Avazu#AUC
Few-Shot Image Classification#Meta-Dataset Rank#Mean Rank
Natural Language Inference#RTE#Accuracy
Time Series Classification#ECG#NLL
Image Relighting#VIDIT’20 validation set#Runtime(s)
Domain Adaptation#Office-Home#Accuracy
Click-Through Rate Prediction#Bing News#AUC
Domain Generalization#PACS#Average Accuracy
Image Super-Resolution#Set5 - 3x upscaling#PSNR
Multivariate Time Series Imputation#MuJoCo#MSE (10^2, 50% missing)
Color Image Denoising#Darmstadt Noise Dataset#SSIM (sRGB)
Scene Text Detection#ICDAR 2017 MLT#F-Measure
Image Clustering#STL-10#Accuracy
Few-Shot Image Classification#Tiered ImageNet 5-way (5-shot)#Accuracy
Emotion Recognition in Conversation#EC#Micro-F1
Video Alignment#UPenn Action#Kendall's Tau
Weakly Supervised Action Localization#ActivityNet-1.2#mAP@0.5
Keypoint Detection#MPII Multi-Person#mAP@0.5
Video Captioning#YouCook2#ROUGE-L
Link Prediction#WordNet#Accuracy
Image Classification#CIFAR-10#Percentage correct
Single Image Deraining#Test100#SSIM
Lesion Segmentation#Anatomical Tracings of Lesions After Stroke (ATLAS)#IoU
Reading Comprehension#RACE#Accuracy (High)
Object Detection#CrowdHuman (full body)#AP
Text-to-Image Generation#COCO#FID
Image Super-Resolution#FFHQ 1024 x 1024 - 4x upscaling#PSNR
Anomaly Detection#MVTec AD#Detection AUROC
Node Classification#Pubmed Full-supervised#Accuracy
Referring Expression Segmentation#RefCoCo val#IoU
Birds Eye View Object Detection#KITTI Cyclists Moderate#AP
Hand Pose Estimation#HANDS 2017#Average 3D Error
Grammatical Error Detection#CoNLL-2014 A2#F0.5
Image Super-Resolution#Set14 - 4x upscaling#SSIM
Continuous Control#PyBullet Hopper#Return
Aspect-Based Sentiment Analysis#SemEval 2014 Task 4 Subtask 1+2#F1
constituency_parsing#Penn Treebank#F1
Image Relighting#VIDIT’20 validation set#SSIM
Object Counting#CARPK#MAE
Atari Games#Atari 2600 Beam Rider#Score
Metric Learning#CUB-200-2011#R@1
Image Generation#LSUN Bedroom 256 x 256#FID-10k-training-steps
language_modeling#Hutter Prize#Bit per Character (BPC)
Fact-based Text Editing#WebEdit#Exact Match
Few-Shot Image Classification#CUB 200 5-way 5-shot#Accuracy
Video Retrieval#MSVD#text-to-video Median Rank
Visual Navigation#Cooperative Vision-and-Dialogue Navigation#dist_to_end_reduction
Domain Adaptation#ImageCLEF-DA#Accuracy
Fine-Grained Image Classification#DF20 - Mini#Top-1
Fine-Grained Image Classification#DF20 - Mini#Top-3
Part-Of-Speech Tagging#Penn Treebank#Accuracy
Action Spotting#SoccerNet#Average-mAP
Semi-Supervised Video Object Segmentation#YouTube-VOS#Jaccard (Unseen)
Semi-Supervised Video Object Segmentation#DAVIS 2017 (test-dev)#J&F
Face Detection#PASCAL Face#AP
Atari Games#Atari 2600 Pitfall!#Score
Image Super-Resolution#Set5 - 4x upscaling#MOS
Human Pose Forecasting#Human3.6M#MAR, walking, 1,000ms
Image Clustering#Extended Yale-B#NMI
Person Re-Identification#DukeMTMC-reID#Rank-10
Click-Through Rate Prediction#Company*#AUC
Link Prediction#YAGO3-10#MRR
Image-to-Image Translation#ADE20K Labels-to-Photos#mIoU
Text Simplification#ASSET#SARI (EASSE>=0.2.1)
word_segmentation#PKU#F1
Dense Pixel Correspondence Estimation#HPatches#Viewpoint IV AEPE
Human-Object Interaction Detection#HICO-DET#mAP
Constituency Grammar Induction#PTB#Mean F1 (WSJ)
Spoken language identification#LRE07#Average
word_sense_disambiguation#Senseval 2#F1
Node Classification#Cora Full-supervised#Accuracy
RGB Salient Object Detection#DUTS-TE#F-measure
Video Captioning#YouCook2#BLEU-4
Atari Games#Atari 2600 Zaxxon#Score
Image Classification#CINIC-10#Accuracy
Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#NIQE
Image Classification#WebVision-1000#Top-5 Accuracy
Time Series Classification#UWave#NLL
Data-to-Text Generation#E2E NLG Challenge#NIST
Semantic Segmentation#S3DIS Area5#oAcc
Monocular Depth Estimation#KITTI Eigen split unsupervised#absolute relative error
Reading Comprehension#ReClor#Test
Anomaly Detection#MVTec AD#Segmentation AUROC
Deblurring#HIDE (trained on GOPRO)#SSIM (sRGB)
Link Prediction#OpenBioLink#Hits@1
Text Classification#IMDb#Accuracy (10 classes)
Link Prediction#OpenBioLink#Hits@3
Pose Tracking#PoseTrack2017#mAP
Node Classification#Cora with Public Split: fixed 20 nodes per class#Accuracy
sentiment_analysis#SemEval-2014 Task 4 subtask 2 Aspect Term Polarity#Restaurant (acc)
Text-to-Image Generation#COCO#Inception score
Causal Inference#IDHP#Average Treatment Effect Error
3D Part Segmentation#ShapeNet-Part#Instance Average IoU
Heterogeneous Node Classification#DBLP (PACT) 14k#Macro-F1 (20% training data)
Face Detection#FDDB#AP
Fine-Grained Image Classification#Oxford 102 Flowers#PARAMS
Natural Language Inference#MultiNLI#Mismatched
Curved Text Detection#SCUT-CTW1500#F-Measure
Photo geolocation estimation#Im2GPS#Street level (1 km)
Keypoint Detection#COCO#Validation AP
Fake News Detection#FNC-1#Per-class Accuracy (Discuss)
Cross-Modal Retrieval#Flickr30k#Text-to-image R@5
Cross-Modal Retrieval#Flickr30k#Text-to-image R@1
Domain Adaptation#SYNTHIA-to-Cityscapes#mIoU
Image Generation#LSUN Churches 256 x 256#FID
Visual Object Tracking#TrackingNet#Normalized Precision
JPEG Artifact Correction#LIVE1 (Quality 10 Color)#PSNR-B
AMR Parsing#LDC2017T10#Smatch
Time Series Classification#Shapes#NLL
Machine Translation#WMT2016 Romanian-English#BLEU score
Ad-Hoc Information Retrieval#TREC Robust04#P@20
Named Entity Recognition#CoNLL 2003 (English)#F1
Time Series Classification#PenDigits#Accuracy
JPEG Artifact Correction#LIVE1 (Quality 20 Color)#PSNR-B
Real-Time Semantic Segmentation#Cityscapes test#mIoU
Monocular 3D Human Pose Estimation#Human3.6M#Frames Needed
Question Answering#DROP Test#F1
Few-Shot Image Classification#Mini-Imagenet 10-way (1-shot)#Accuracy
Action Recognition#HACS#Top 1 Accuracy
language_modeling#WikiText-103#Validation perplexity
Intent Detection#ATIS#Accuracy
Scene Text Detection#SCUT-CTW1500#Recall
Image Super-Resolution#Set14 - 2x upscaling#SSIM
Node Classification#CiteSeer (1%)#Accuracy
3D Human Pose Estimation#Total Capture#Average MPJPE (mm)
Automated Theorem Proving#HolStep (Conditional)#Classification Accuracy
Audio Classification#AudioSet#Test mAP
Fact-based Text Editing#WebEdit#SARI
Natural Language Inference#QNLI#Accuracy
Document Image Classification#RVL-CDIP#Accuracy
Natural Language Inference#ANLI test#A2
Natural Language Inference#ANLI test#A1
Natural Language Inference#ANLI test#A3
Question Answering#Quasart-T#EM
Image Super-Resolution#Manga109 - 3x upscaling#PSNR
Word Sense Disambiguation#SemEval 2013 Task 12#F1
Semantic Textual Similarity#MRPC#F1
Object Counting#CARPK#RMSE
Image Matting#Composition-1K#Conn
Self-Supervised Action Recognition#UCF101 (finetuned)#3-fold Accuracy
Multimodal Activity Recognition#Moments in Time Dataset#Top-1 (%)
3D Semantic Instance Segmentation#ScanNetV2#mAP@0.50
Video Super-Resolution#Vid4 - 4x upscaling#PSNR
relation_prediction#WN18RR#H@1
Cross-View Image-to-Image Translation#Dayton (256×256) - aerial-to-ground#SSIM
Language Modelling#enwik8#Bit per Character (BPC)
Hyperspectral Image Classification#Indian Pines#Overall Accuracy
Language Modelling#One Billion Word#PPL
Chinese Named Entity Recognition#Weibo NER#F1
RGB-D Salient Object Detection#SIP#max E-Measure
Question Answering#SQuAD1.1#F1
Question Answering#SQuAD1.1#EM
Question Answering#NarrativeQA#Rouge-L
Person Re-Identification#PRID2011#Rank-5
Person Re-Identification#PRID2011#Rank-1
Language Modelling#One Billion Word#Number of params
Image Classification#Clothing1M#Accuracy
JPEG Artifact Correction#ICB (Quality 20 Grayscale)#PSNR
Node Classification#BlogCatalog#Macro-F1
Image Classification#iNaturalist 2018#Top-1 Accuracy
RGB-D Salient Object Detection#DES#S-Measure
Fake News Detection#FNC-1#Per-class Accuracy (Unrelated)
Text Classification#DBpedia#Error
Word Sense Disambiguation#SensEval 2#F1
Link Prediction#Pubmed#AUC
Image Denoising#DND#SSIM (sRGB)
Video Retrieval#MSR-VTT-1kA#text-to-video Median Rank
Image Clustering#CIFAR-10#NMI
Scene Text Detection#ICDAR 2013#Precision
summarization#Gigaword#ROUGE-1
Atari Games#Atari 2600 Ice Hockey#Score
summarization#Gigaword#ROUGE-2
Entity Linking#WiC-TSV#Task 1 Accuracy: domain specific
summarization#Gigaword#ROUGE-L
Image Relighting#VIDIT’20 validation set#PSNR
Point Cloud Registration#3DMatch Benchmark#Recall
Machine Translation#IWSLT2015 English-Vietnamese#BLEU
Lesion Segmentation#ISIC 2018#Dice Score
Atari Games#Atari 2600 Freeway#Score
Action Recognition#AVA v2.1#mAP (Val)
Grayscale Image Denoising#Set12 sigma50#PSNR
3D Object Detection#nuScenes#NDS
Dialogue State Tracking#Wizard-of-Oz#Joint
Sentiment Analysis#Multi-Domain Sentiment Dataset#Books
Image Clustering#ImageNet-10#Accuracy
Semantic Segmentation#Semantic3D#mIoU
Image Clustering#Tiny-ImageNet#NMI
Image Relighting#VIDIT’20 validation set#MPS
Object Counting#Pascal VOC 2007 count-test#mRMSE
JPEG Artifact Correction#ICB (Quality 10 Grayscale)#SSIM
Crowd Counting#ShanghaiTech B#MAE
Human-Object Interaction Detection#V-COCO#Time Per Frame(ms)
Gesture-to-Gesture Translation#Senz3D#AMT
3D Human Pose Estimation#3D Poses in the Wild Challenge#MPJPE
Keypoint Detection#COCO test-dev#AR
Image Retrieval#Par6k#mAP
Action Recognition#Something-Something V2#Top-1 Accuracy
Graph Regression#PCQM4M-LSC#Test MAE
Graph Classification#PTC#Accuracy
Visual Question Answering#VQA v2 test-dev#Accuracy
Anomaly Detection#Numenta Anomaly Benchmark#NAB score
Semantic Segmentation#S3DIS#Mean IoU
Sentiment Analysis#CR#Accuracy
Image Classification#CIFAR-10#PARAMS
Open-Domain Question Answering#SearchQA#EM
Fine-Grained Image Classification#FGVC Aircraft#Accuracy
Visual Object Tracking#TrackingNet#Precision
Music Source Separation#MUSDB18#SDR (vocals)
Text Summarization#Pubmed#ROUGE-L
Link Prediction#Citeseer#AP
Drug Discovery#QM9#Error ratio
Text Summarization#Pubmed#ROUGE-1
Text Summarization#Pubmed#ROUGE-2
Visual Object Tracking#GOT-10k#Average Overlap
Semi-Supervised Video Object Segmentation#DAVIS 2017 (val)#F-measure (Mean)
Pedestrian Detection#CityPersons#Partial MR^-2
Visual Object Tracking#TrackingNet#Accuracy
Multi-Person Pose Estimation#COCO#AP
Atari Games#Atari 2600 Asterix#Score
Image Classification#CIFAR-100#PARAMS
Few-Shot Image Classification#Mini-Imagenet 20-way (1-shot)#Accuracy
Cross-Lingual NER#CoNLL German#F1
RGB-D Salient Object Detection#STERE#S-Measure
Image Super-Resolution#Manga109 - 3x upscaling#SSIM
Temporal Action Localization#ActivityNet-1.3#mAP
Link Prediction#FB15k-237#Hits@10
3D Human Pose Estimation#HumanEva-I#Mean Reconstruction Error (mm)
Atari Games#Atari 2600 Enduro#Score
Photo geolocation estimation#Im2GPS#Country level (750 km)
Scene Graph Generation#Visual Genome#Recall@50
Panoptic Segmentation#Mapillary val#PQ
3D Instance Segmentation#ScanNet(v2)#Mean AP @ 0.5
Skeleton Based Action Recognition#Varying-view RGB-D Action-Skeleton#Accuracy (CV II)
Text Simplification#ASSET#BLEU
Image Clustering#coil-100#NMI
Skeleton Based Action Recognition#SBU#Accuracy
Colorectal Gland Segmentation:#CRAG#Hausdorff Distance (mm)
Image Super-Resolution#BSD100 - 2x upscaling#PSNR
6D Pose Estimation using RGB#LineMOD#Accuracy
Speech Recognition#Switchboard + Hub500#Percentage error
Link Prediction#FB15k#MR
Text Simplification#Newsela#BLEU
Data-to-Text Generation#E2E NLG Challenge#ROUGE-L
Named Entity Recognition#GENIA#F1
Visual Question Answering#GQA Test2019#Distribution
Image Classification#iNaturalist 2019#Top-1 Accuracy
Image Classification#mini WebVision 1.0#ImageNet Top-5 Accuracy
Head Pose Estimation#BIWI#MAE (trained with other data)
Question Answering#TrecQA#MAP
Visual Question Answering#VQA v1 test-std#Accuracy
Sentiment Analysis#Yelp Fine-grained classification#Error
Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#FED
Image Super-Resolution#Manga109 - 8x upscaling#SSIM
part-of-speech_tagging#VLSP 2013 POS tagging shared task#Accuracy
Nested Named Entity Recognition#GENIA#F1
Hate Speech Detection#Ethos Binary#Classification Accuracy
Machine Translation#WMT2016 English-Romanian#BLEU score
Text based Person Retrieval#CUHK-PEDES#R@10
Visual Question Answering#GQA Test2019#Consistency
Image Classification#ImageNet ReaL#Accuracy
named_entity_recognition#VLSP 2016 NER shared task#F1
Atari Games#Atari 2600 Phoenix#Score
Natural Language Inference#SNLI#% Train Accuracy
Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#FID
Visual Question Answering#CLEVR-Humans#Accuracy
Image Clustering#STL-10#Backbone
Node Classification#PubMed (0.03%)#Accuracy
Sentiment Analysis#Yelp Binary classification#Error
Skeleton Based Action Recognition#NTU RGB+D 120#Accuracy (Cross-Subject)
Word Sense Disambiguation#SensEval 3 Task 1#F1
RGB-D Salient Object Detection#NLPR#Average MAE
Dependency Parsing#Penn Treebank#POS
Language Modelling#Penn Treebank (Character Level)#Bit per Character (BPC)
Few-Shot Image Classification#Mini-Imagenet 5-way (10-shot)#Accuracy
Graph Classification#NEURON-Average#Accuracy
Node Classification#Cora (3%)#Accuracy
sentiment_analysis#SUBJ#Accuracy
amr_parsing#LDC2015E86#Smatch
Part-Of-Speech Tagging#UD#Avg accuracy
Atari Games#Atari 2600 Wizard of Wor#Score
Pose Tracking#PoseTrack2017#MOTA
3D Object Reconstruction#Data3D−R2N2#3DIoU
Real-time Instance Segmentation#MSCOCO#AP75
Visual Question Answering#MSVD-QA#Accuracy
Few-Shot Image Classification#Meta-Dataset#Accuracy
Sentiment Analysis#SST-5 Fine-grained classification#Accuracy
Image Classification#WebVision-1000#ImageNet Top-5 Accuracy
Atari Games#Atari 2600 Atlantis#Score
Atari Games#Atari 2600 Road Runner#Score
Image Super-Resolution#Urban100 - 2x upscaling#PSNR
Semantic Segmentation#LIP val#mIoU
Real-time Instance Segmentation#MSCOCO#AP50
Speech Recognition#WSJ eval92#Word Error Rate (WER)
Domain Adaptation#Office-Caltech#Average Accuracy
Relation Extraction#DocRED#F1
Node Classification#Wiki-Vote#Accuracy
Semi-Supervised Video Object Segmentation#DAVIS 2016#J&F
Language Modelling#Penn Treebank (Word Level)#Validation perplexity
3D Point Cloud Classification#ModelNet40#Overall Accuracy
Retinal Vessel Segmentation#DRIVE#AUC
Face Alignment#300W#AUC0.08 private
Few-Shot Image Classification#CIFAR-FS 5-way (5-shot)#Accuracy
3D Object Detection#ScanNetV2#mAP@0.5
Multivariate Time Series Forecasting#MuJoCo#MSE (10^-2, 50% missing)
Link Prediction#YAGO3-10#Hits@10
Graph Classification#RE-M5K#Accuracy
Image Clustering#coil-100#Accuracy
Text-to-Image Generation#Multi-Modal-CelebA-HQ#Acc
Multiple Object Tracking#KITTI Tracking test#MOTA
Document Classification#Cora#Accuracy
Semantic Textual Similarity#SentEval#SICK-R
Fake News Detection#FNC-1#Weighted Accuracy
Semi-Supervised Video Object Segmentation#DAVIS 2017 (test-dev)#Jaccard (Mean)
Semantic Textual Similarity#SentEval#SICK-E
Self-Supervised Image Classification#ImageNet#Number of Params
Object Detection#Waymo 2D detection all_ns f0val#COCO-style AP
Few-Shot Image Classification#OMNIGLOT - 5-Shot, 20-way#Accuracy
Question Answering#TrecQA#MRR
Image Classification#mini WebVision 1.0#Top-1 Accuracy
Neural Architecture Search#NAS-Bench-201, CIFAR-100#Accuracy (Val)
Fine-Grained Image Classification#Stanford Cars#PARAMS
Continuous Control#PyBullet Walker2D#Return
Image-to-Image Translation#ADE20K Labels-to-Photos#FID
Machine Translation#IWSLT2015 German-English#BLEU score
Image Retrieval with Multi-Modal Query#Fashion200k#Recall@10
Time Series Classification#Wafer#NLL
Self-Supervised Image Classification#ImageNet#Top 5 Accuracy
Dialogue Act Classification#Switchboard corpus#Accuracy
Time Series Classification#CMUsubject16#Accuracy
Atari Games#Atari 2600 Bowling#Score
Sentiment Analysis#TweetEval#Hate
language_modeling#WikiText-2#Number of params
Image Super-Resolution#FFHQ 256 x 256 - 4x upscaling#MS-SSIM
3D Multi-Object Tracking#KITTI#MOTA
Graph Classification#COLLAB#Accuracy
Gesture-to-Gesture Translation#NTU Hand Digit#IS
3D Multi-Object Tracking#KITTI#MOTP
Link Prediction#Cora#AUC
Sentiment Analysis#Multi-Domain Sentiment Dataset#Kitchen
Image Retrieval#Oxf5k#MAP
Text Classification#Ohsumed#Accuracy
RGB-D Salient Object Detection#NJU2K#S-Measure
Retinal OCT Disease Classification#OCT2017#Sensitivity
Data-to-Text Generation#WebNLG#BLEU
Image Retrieval with Multi-Modal Query#Fashion200k#Recall@50
3D Object Detection#SUN-RGBD val#mAP@0.25
Machine Translation#WMT2014 English-German#SacreBLEU
Fact-based Text Editing#WebEdit#F1
Few-Shot Semantic Segmentation#PASCAL-5i (1-Shot)#Mean IoU
Time Series Classification#JapaneseVowels#NLL
Synthetic-to-Real Translation#Syn2Real-C#Accuracy
Few-Shot Image Classification#Stanford Cars 5-way (1-shot)#Accuracy
Image Classification#Stanford Cars#Accuracy
3D Instance Segmentation#ScanNet(v2)#mAP
Coreference Resolution#OntoNotes#F1
Image Generation#CelebA-HQ 1024x1024#FID
Node Classification#Pubmed#Validation
Multivariate Time Series Forecasting#USHCN-Daily#MSE
Human-Object Interaction Detection#HICO#mAP
Panoptic Segmentation#COCO test-dev#PQst
Image Classification#MNIST#Percentage error
Code Generation#WikiSQL#Execution Accuracy
Image Super-Resolution#Urban100 - 8x upscaling#SSIM
Relation Extraction#DocRED#Ign F1
Panoptic Segmentation#COCO test-dev#PQth
Object Detection#Manga109-s 15test#COCO-style AP
Instance Segmentation#Cityscapes test#Average Precision
Action Classification#Charades#MAP
Interactive Segmentation#GrabCut#NoC@85
Action Classification#Kinetics-400#Flops x views
Image Clustering#Imagenet-dog-15#Accuracy
Real-Time Object Detection#COCO#FPS
Recommendation Systems#MovieLens 1M#nDCG@10
Speech Enhancement#DEMAND#CBAK
word_sense_disambiguation#Senseval 3#F1
Semi-Supervised Image Classification#ImageNet - 1% labeled data#Top 1 Accuracy
Recommendation Systems#Million Song Dataset#Recall@50
Named Entity Recognition#NCBI-disease#F1
Trajectory Prediction#Stanford Drone#ADE-8/12 @K = 20
Image Clustering#Fashion-MNIST#NMI
Relation Extraction#TACRED#F1
Fine-Grained Image Classification#Stanford Dogs#Accuracy
Link Prediction#Yelp#HR@10
Color Image Denoising#CBSD68 sigma50#PSNR
Action Segmentation#50 Salads#F1@10%
Cross-Lingual NER#CoNLL Spanish#F1
Machine Translation#WMT2014 English-French#BLEU score
3D Multi-Person Pose Estimation (absolute)#MuPoTS-3D#3DPCK
Sentiment Analysis#TweetEval#Sentiment
RGB-D Salient Object Detection#NJU2K#max F-Measure
Atari Games#Atari 2600 Solaris#Score
Depth Completion#KITTI Depth Completion#RMSE
Entity Linking#WiC-TSV#Task 1 Accuracy: general purpose
Action Segmentation#50 Salads#Edit
Interactive Segmentation#GrabCut#NoC@90
Visual Dialog#Visual Dialog v1.0 test-std#R@5
Few-Shot Semantic Segmentation#PASCAL-5i (5-Shot)#Mean IoU
Visual Dialog#Visual Dialog v1.0 test-std#R@1
Keypoint Detection#COCO test-dev#ARM
Keypoint Detection#COCO test-dev#ARL
Link Prediction#MovieLens 25M#nDCG@10
Image Super-Resolution#Set5 - 2x upscaling#PSNR
Image Super-Resolution#Manga109 - 2x upscaling#PSNR
Keypoint Detection#COCO test-dev#APM
Question Answering#QASent#MAP
Keypoint Detection#COCO test-dev#APL
Unsupervised Domain Adaptation#Office-Home (RS-UT imbalance)#Average Per-Class Accuracy
Visual Question Answering#COCO Visual Question Answering (VQA) real images 1.0 open ended#Percentage correct
Hate Speech Detection#Ethos Binary#F1-score
Action Segmentation#Breakfast#F1@25%
relation_prediction#FB15K-237#H@10
Adversarial Defense#ImageNet (non-targeted PGD, max perturbation=4)#Accuracy
Action Segmentation#Breakfast#Edit
Domain Adaptation#MNIST-to-USPS#Accuracy
Language Modelling#WikiText-103#Test perplexity
Time Series Classification#Wafer#Accuracy
Link Prediction#WN18#Hits@3
Link Prediction#WN18#Hits@1
Spoken language identification#VoxForge European#Accuracy (%)
Birds Eye View Object Detection#KITTI Cars Hard#AP
Time Series Classification#ECG#Accuracy
Video Semantic Segmentation#CamVid#Mean IoU
Link Prediction#FB15k-237#MRR
Video Super-Resolution#Vid4 - 4x upscaling#MOVIE
Neural Architecture Search#CIFAR-10#Parameters
Face Verification#Labeled Faces in the Wild#Accuracy
Unsupervised Domain Adaptation#Duke to MSMT#mAP
Few-Shot Image Classification#CUB 200 5-way 1-shot#Accuracy
Scene Text Detection#MSRA-TD500#Recall
Machine Translation#IWSLT2015 English-German#BLEU score
Sentiment Analysis#TweetEval#Offensive
Cross-Lingual Document Classification#MLDoc Zero-Shot English-to-Spanish#Accuracy
Fact-based Text Editing#WebEdit#Recall
Semantic Textual Similarity#STS Benchmark#Spearman Correlation
Vision and Language Navigation#VLN Challenge#error
Image Clustering#Extended Yale-B#Accuracy
Object Detection#COCO test-dev#AP75
Cross-Modal Retrieval#Flickr30k#Text-to-image R@10
Interactive Segmentation#DAVIS#NoC@85
Person Re-Identification#CUHK03#Rank-1
Atari Games#Atari 2600 Gravitar#Score
Interactive Segmentation#DAVIS#NoC@90
Code Generation#WikiSQL#Exact Match Accuracy
Few-Shot Image Classification#Mini-Imagenet 5-way (5-shot)#Accuracy
Semi-Supervised Image Classification#cifar-100, 10000 Labels#Accuracy
Object Detection#COCO minival#oLRP
language_modeling#WikiText-103#Number of params
Chinese Named Entity Recognition#Resume NER#F1
Entity Disambiguation#AIDA-CoNLL#In-KB Accuracy
Speech Enhancement#DEMAND#CSIG
language_modeling#Penn Treebank#Number of params
Image Generation#CIFAR-10#FID
Object Detection#COCO test-dev#AP50
Grayscale Image Denoising#Set12 sigma15#PSNR
Semantic Role Labeling#CoNLL 2005#F1
JPEG Artifact Correction#Live1 (Quality 10 Grayscale)#SSIM
Unsupervised Machine Translation#WMT2014 English-French#BLEU
Semi-Supervised Video Object Segmentation#DAVIS 2017 (val)#Jaccard (Recall)
Question Generation#SQuAD1.1#BLEU-4
Scene Text Detection#ICDAR 2015#Precision
Cross-Lingual Document Classification#MLDoc Zero-Shot English-to-Russian#Accuracy
3D Object Detection#KITTI Cars Easy val#AP
3D Human Pose Estimation#3DPW#acceleration error
Text Simplification#TurkCorpus#BLEU
Semi-Supervised Image Classification#ImageNet - 10% labeled data#Top 5 Accuracy
Unsupervised Image Classification#MNIST#Accuracy
amr_parsing#LDC2014T12#F1 on Full
dependency_parsing#benchmark Vietnamese dependency treebank VnDT#UAS
Atari Games#Atari 2600 Video Pinball#Score
Image Classification#EMNIST-Balanced#Accuracy
Person Re-Identification#MARS#Rank-5
Image Clustering#MNIST-test#NMI
Semantic Similarity#SICK#Spearman Correlation
Person Re-Identification#MARS#Rank-1
Link Prediction#Yelp#nDCG@10
Neural Architecture Search#CIFAR-100#FLOPS
Question Answering#Quora Question Pairs#Accuracy
Word Sense Disambiguation#SemEval 2015 Task 13#F1
Speech Synthesis#North American English#Mean Opinion Score
Fine-Grained Image Classification#NABirds#Accuracy
Music Transcription#MusicNet#Number of params
Link Prediction#FB15k#MRR
Image Retrieval#Flickr30K 1K test#R@10
Mortality Prediction#MIMIC-III#Recall
Text Simplification#PWKP / WikiSmall#BLEU
Neural Architecture Search#CIFAR-100#PARAMS
Semantic Role Labeling (predicted predicates)#CoNLL 2012#F1
Fact-based Text Editing#WebEdit#DELETE
Grammatical Error Correction#CoNLL-2014 Shared Task#F0.5
Scene Text Detection#ICDAR 2015#Recall
3D Object Detection#KITTI Cars Hard#AP
Neural Architecture Search#CIFAR-100#Percentage Error
Cross-Lingual Document Classification#MLDoc Zero-Shot English-to-French#Accuracy
Semi-Supervised Video Object Segmentation#DAVIS 2017 (test-dev)#F-measure (Decay)
Aspect-Based Sentiment Analysis#SemEval 2014 Task 4 Laptop#F1
Node Classification#CiteSeer with Public Split: fixed 20 nodes per class#Accuracy
Temporal Action Localization#THUMOS’14#mAP IOU@0.2
Temporal Action Localization#THUMOS’14#mAP IOU@0.3
Subjectivity Analysis#SUBJ#Accuracy
Temporal Action Localization#THUMOS’14#mAP IOU@0.1
Temporal Action Localization#THUMOS’14#mAP IOU@0.6
Temporal Action Localization#THUMOS’14#mAP IOU@0.7
Temporal Action Localization#THUMOS’14#mAP IOU@0.4
Real-time Instance Segmentation#MSCOCO#APL
Temporal Action Localization#THUMOS’14#mAP IOU@0.5
Real-time Instance Segmentation#MSCOCO#APM
Question Answering#bAbi#Accuracy (trained on 10k)
Real-time Instance Segmentation#MSCOCO#APS
Speech Recognition#TIMIT#Percentage error
Visual Dialog#Visual Dialog v1.0 test-std#Mean
Graph Classification#NEURON-BINARY#Accuracy
Language Modelling#Penn Treebank (Word Level)#Test perplexity
Unsupervised Machine Translation#WMT2014 French-English#BLEU
Video Retrieval#MSVD#text-to-video R@5
RGB-D Salient Object Detection#NJU2K#Average MAE
Video Retrieval#MSVD#text-to-video R@1
text_classification#AG News#Error
Pose Estimation#MPII Human Pose#PCKh-0.5
Scene Text Detection#MSRA-TD500#Precision
3D Human Pose Estimation#3DPW#PA-MPJPE
Image Clustering#ImageNet-10#NMI
Face Alignment#WFLW#FR@0.1(%, all)
Image-to-Image Translation#COCO-Stuff Labels-to-Photos#FID
relationship_extraction#New York Times Corpus#P@30%
Fine-Grained Image Classification#Caltech-101#Top-1 Error Rate
Human-Object Interaction Detection#V-COCO#MAP
Conversational Response Selection#PolyAI Reddit#1-of-100 Accuracy
Semi-Supervised Semantic Segmentation#Cityscapes 12.5% labeled#Validation mIoU
Fact-based Text Editing#WebEdit#BLEU
Neural Architecture Search#NAS-Bench-201, ImageNet-16-120#Accuracy (Test)
Object Counting#Pascal VOC 2007 count-test#mRMSE-nz
Sentiment Analysis#IMDb#Accuracy
Image Generation#Binarized MNIST#nats
3D Object Detection#ScanNetV2#mAP@0.25
Lane Detection#CULane#F1 score
Unsupervised Domain Adaptation#Duke to MSMT#rank-10
Image Clustering#Imagenet-dog-15#NMI
Image Super-Resolution#Set14 - 3x upscaling#PSNR
Dialogue State Tracking#Wizard-of-Oz#Request
Pedestrian Detection#Caltech#Reasonable Miss Rate
Instance Segmentation#COCO minival#mask AP
Relation Extraction#ADE Corpus#RE+ Macro F1
Unsupervised Video Object Segmentation#DAVIS 2016#Jaccard (Decay)
Semi-Supervised Image Classification#SVHN, 1000 labels#Accuracy
Time Series Classification#KickvsPunch#NLL
Person Re-Identification#CUHK03 labeled#Rank-1
Semi-Supervised Video Object Segmentation#YouTube-VOS#F-Measure (Unseen)
JPEG Artifact Correction#LIVE1 (Quality 10 Color)#SSIM
Atari Games#Atari 2600 Tennis#Score
3D Object Reconstruction#Data3D−R2N2#Avg F1
Question Answering#QASent#MRR
Traffic Prediction#PeMS-M#MAE (60 min)
Constituency Grammar Induction#PTB#Max F1 (WSJ)
Conditional Image Generation#CIFAR-10#FID
Visual Question Answering#VQA v2 test-std#yes/no
Image Classification#Flowers-102#Accuracy
Image Super-Resolution#Set5 - 4x upscaling#SSIM
Recommendation Systems#MovieLens 1M#RMSE
Action Segmentation#Breakfast#F1@10%
Graph Classification#ENZYMES#Accuracy
Unsupervised Facial Landmark Detection#MAFL#NME
Keypoint Detection#COCO test-dev#AR50
Depth Completion#KITTI Depth Completion#Runtime [ms]
Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#PSNR
Image Super-Resolution#Urban100 - 4x upscaling#SSIM
Constituency Parsing#Penn Treebank#F1 score
Person Re-Identification#CUHK03 labeled#MAP
Keypoint Detection#COCO test-dev#AR75
Panoptic Segmentation#Cityscapes val#mIoU
Relation Extraction#ADE Corpus#NER Macro F1
Semi-Supervised Video Object Segmentation#YouTube#mIoU
Object Detection#UAVDT#mAP
Keypoint Detection#COCO test-challenge#ARL
Keypoint Detection#COCO test-challenge#ARM
Question Answering#WikiQA#MRR
Image Generation#Cityscapes#FID-10k-training-steps
Real-time Instance Segmentation#MSCOCO#Frame (fps)
Few-Shot Image Classification#FC100 5-way (5-shot)#Accuracy
word_segmentation#Chinese Treebank 6#F1
summarization#CNN / Daily Mail (Anonymized version)#ROUGE-2
summarization#CNN / Daily Mail (Anonymized version)#ROUGE-1
Cross-Lingual NER#CoNLL Dutch#F1
Natural Language Inference#FarsTail#% Test Accuracy
Scene Text Detection#Total-Text#Precision
Link Prediction#YAGO3-10#Hits@3
Link Prediction#YAGO3-10#Hits@1
Word Sense Disambiguation#SemEval 2007 Task 17#F1
Neural Architecture Search#CIFAR-10#Search Time (GPU days)
3D Object Detection#KITTI Pedestrians Hard#AP
word_segmentation#VLSP 2013 word segmentation shared task#F1
Image Clustering#Tiny-ImageNet#Accuracy
summarization#CNN / Daily Mail (Anonymized version)#ROUGE-L
Visual Question Answering#VQA-CP#Score
Node Classification#USA Air-Traffic#Accuracy
Image Clustering#CIFAR-10#ARI
Image/Document Clustering#pendigits#runtime (s)
Action Segmentation#GTEA#Edit
Weakly Supervised Action Localization#ActivityNet-1.3#mAP@0.5
Panoptic Segmentation#Cityscapes test#PQ
taxonomy_learning#SemEval 2018#MAP
AMR Parsing#LDC2014T12#F1 Full
sentiment_analysis#SemEval-2014 Task 4 subtask 2 Aspect Term Polarity#Laptop (acc)
Keypoint Detection#COCO test-challenge#APL
Fundus to Angiography Generation#Fundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic Patients#Kernel Inception Distance
Hate Speech Detection#HateXplain#Accuracy
Image Denoising#SIDD#SSIM (sRGB)
Document Summarization#CNN / Daily Mail#ROUGE-1
Document Summarization#CNN / Daily Mail#ROUGE-2
Few-Shot Object Detection#MS-COCO (10-shot)#AP
Time Series Classification#PenDigits#NLL
word_segmentation#MSR#F1
3D Human Pose Estimation#Human3.6M#Average MPJPE (mm)
Semantic Segmentation#SkyScapes-Dense#Mean IoU
Object Counting#COCO count-test#m-reIRMSE
Visual Question Answering#GQA Test2019#Accuracy
Speech Enhancement#DEMAND#PESQ
Node Classification#Cornell#Accuracy
Document Summarization#CNN / Daily Mail#ROUGE-L
Grammatical Error Correction#BEA-2019 (test)#F0.5
Visual Question Answering#GQA test-std#Accuracy
Click-Through Rate Prediction#Amazon#AUC
Multimodal Machine Translation#Multi30K#BLEU (EN-DE)
Skeleton Based Action Recognition#NTU RGB+D 120#Accuracy (Cross-Setup)
Weakly Supervised Action Localization#THUMOS14#avg-mAP (0.3-0.7)
Open-Domain Question Answering#SearchQA#N-gram F1
Keypoint Detection#COCO test-challenge#AR50
RGB-D Salient Object Detection#NJU2K#max E-Measure
Domain Adaptation#SYNSIG-to-GTSRB#Accuracy
Image Super-Resolution#FFHQ 256 x 256 - 4x upscaling#PSNR
Keypoint Detection#COCO test-challenge#AR75
Retinal Vessel Segmentation#STARE#AUC
Stochastic Optimization#CIFAR-100 WRN-28-10 - 200 Epochs#Accuracy
Spoken language identification#LRE07#3 sec
3D Semantic Segmentation#SemanticKITTI#mIoU
Text Summarization#arXiv#ROUGE-1
Text Summarization#arXiv#ROUGE-2
Image Matting#Composition-1K#SAD
Vision and Language Navigation#VLN Challenge#length
Object Counting#COCO count-test#mRMSE
Scene Text Recognition#SVT#Accuracy
Atari Games#Atari 2600 Demon Attack#Score
Lipreading#Lip Reading in the Wild#Top-1 Accuracy
Image Classification#Flowers-102#PARAMS
Time Series Classification#CharacterTrajectories#NLL
Text Summarization#arXiv#ROUGE-L
question_answering#CNN / Daily Mail#Accuracy on Daily Mail
Instance Segmentation#iSAID#Average Precision
Single Image Deraining#Test1200#PSNR
Visual Question Answering#VQA v1 test-dev#Accuracy
Word Sense Disambiguation#SemEval 2007 Task 7#F1
Multimodal Activity Recognition#EV-Action#Accuracy
Semi-Supervised Video Object Segmentation#DAVIS 2017 (val)#Jaccard (Decay)
Image Super-Resolution#FFHQ 1024 x 1024 - 4x upscaling#MS-SSIM
Entity Linking#WiC-TSV#Task 3 Accuracy: domain specific
relationship_extraction#SemEval-2010 Task 8#F1
Recommendation Systems#MovieLens 1M#HR@10
Named Entity Recognition#ACE 2004#F1
Node Classification#Facebook#Accuracy
Action Detection#Charades#mAP
Atari Games#Atari 2600 Amidar#Score
Image Classification#WebVision-1000#ImageNet Top-1 Accuracy
Scene Text Detection#ICDAR 2017 MLT#Precision
Fact-based Text Editing#WebEdit#KEEP
Visual Object Tracking#LaSOT#AUC
Image Classification#iNaturalist#Top 1 Accuracy
Graph Classification#UPFD-POL#Accuracy (%)
Skeleton Based Action Recognition#N-UCLA#Accuracy
Scene Text Detection#ICDAR 2017 MLT#Recall
Conditional Image Generation#ImageNet 128x128#FID
language_modeling#1B Words / Google Billion Word benchmark#Test perplexity
6D Pose Estimation#YCB-Video#ADDS AUC
Semi-Supervised Image Classification#CIFAR-10, 250 Labels#Accuracy
Semi-Supervised Video Object Segmentation#YouTube-VOS#F-Measure (Seen)
Image Super-Resolution#Manga109 - 4x upscaling#SSIM
Panoptic Segmentation#COCO panoptic#PQst
machine_translation#WMT 2014 EN-FR#BLEU
Entity Linking#WiC-TSV#Task 3 Accuracy: all
Pose Estimation#COCO test-dev#AP50
Few-Shot Image Classification#Stanford Dogs 5-way (5-shot)#Accuracy
Panoptic Segmentation#COCO panoptic#PQth
Atari Games#Atari 2600 Chopper Command#Score
Time Series Classification#PEMS#NLL
Question Answering#SQuAD2.0 dev#F1
Question Answering#SQuAD2.0 dev#EM
Natural Language Inference#MultiNLI#Matched
Dense Pixel Correspondence Estimation#HPatches#Viewpoint V AEPE
Unsupervised Domain Adaptation#Market to Duke#mAP
Time Series Classification#NetFlow#NLL
Node Classification#PPI#F1
Temporal Action Proposal Generation#ActivityNet-1.3#AR@100
Sequential Image Classification#Sequential MNIST#Permuted Accuracy
Click-Through Rate Prediction#Bing News#Log Loss
Neural Architecture Search#CIFAR-10 Image Classification#Percentage error
JPEG Artifact Correction#ICB (Quality 20 Color)#PSNR
Data-to-Text Generation#WebNLG Full#BLEU
Pose Estimation#Leeds Sports Poses#PCK
Person Re-Identification#Market-1501#Rank-5
Semantic Segmentation#COCO-Stuff test#mIoU
Person Re-Identification#Market-1501#Rank-1
JPEG Artifact Correction#LIVE1 (Quality 20 Grayscale)#PSNR
Conditional Image Generation#CIFAR-10#Inception score
Pose Estimation#COCO test-dev#AP75
Image Generation#CelebA 256x256#bpd
Object Detection#KITTI Cars Easy#AP
Reading Comprehension#RACE#Accuracy (Middle)
Unsupervised Domain Adaptation#Cityscapes to Foggy Cityscapes#mAP@0.5
Real-Time Semantic Segmentation#Cityscapes test#Time (ms)
Ad-Hoc Information Retrieval#TREC Robust04#MAP
Image Clustering#CIFAR-100#Accuracy
Image Clustering#USPS#Accuracy
Question Answering#CNN / Daily Mail#CNN
Image Retrieval#CARS196#R@1
Image Super-Resolution#Set5 - 8x upscaling#SSIM
Fine-Grained Image Classification#Oxford-IIIT Pets#Top-1 Error Rate
Neural Architecture Search#CIFAR-10#Top-1 Error Rate
Image Clustering#USPS#NMI
Real-Time Semantic Segmentation#NYU Depth v2#mIoU
Node Classification#Citeseer Full-supervised#Accuracy
Atari Games#Atari 2600 Battle Zone#Score
Graph Regression#Lipophilicity#RMSE
Video Instance Segmentation#YouTube-VIS validation#AP75
Image Classification#ImageNet V2#Top 1 Accuracy
Action Segmentation#Breakfast#Acc
Scene Text Recognition#ICDAR2013#Accuracy
Few-Shot Image Classification#Tiered ImageNet 10-way (1-shot)#Accuracy
Semantic Segmentation#S3DIS Area5#mAcc
Cross-Modal Retrieval#COCO 2014#Image-to-text R@10
Object Counting#Pascal VOC 2007 count-test#m-relRMSE
Link Prediction#FB15k-237#MR
Spoken language identification#LRE07#10 sec
Video Instance Segmentation#YouTube-VIS validation#AP50
Text Classification#R8#Accuracy
Node Classification#Wikipedia#Macro-F1
Atari Games#Atari 2600 Alien#Score
Atari Games#Atari 2600 Q*Bert#Score
Single Image Deraining#Rain100L#PSNR
Image Super-Resolution#Set14 - 8x upscaling#PSNR
Question Answering#NarrativeQA#METEOR
Single Image Deraining#Test2800#PSNR
3D Object Detection#nuScenes#mAP
Optical Flow Estimation#Sintel-clean#Average End-Point Error
Image Classification#Oxford-IIIT Pets#Accuracy
Object Detection#KITTI Cars Moderate#AP
Grayscale Image Denoising#Urban100 sigma50#PSNR
Atari Games#Atari 2600 Defender#Score
Zero-Shot Learning#SUN Attribute#average top-1 classification accuracy
Semantic Textual Similarity#SentEval#MRPC
Word Sense Disambiguation#WiC-TSV#Task 3 Accuracy: domain specific
Few-Shot Object Detection#MS-COCO (30-shot)#AP
relationship_extraction#New York Times Corpus#P@10%
Few-Shot Image Classification#Mini-Imagenet 5-way (1-shot)#Accuracy
3D Human Pose Estimation#MPI-INF-3DHP#MJPE
Graph Classification#HIV-fMRI-77#F1
Sentiment Analysis#TweetEval#ALL
Single Image Deraining#Rain100H#SSIM
Medical Image Segmentation#CVC-ClinicDB#mean Dice
Video Generation#UCF-101 16 frames, 64x64, Unconditional#Inception Score
question_answering#Quasar#EM (Quasar-T)
Person Re-Identification#Market-1501#Rank-10
Question Answering#CNN / Daily Mail#Daily Mail
Video Object Detection#ImageNet VID#MAP
Weakly Supervised Action Localization#THUMOS 2014#mAP@0.5
Humor Detection#200k Short Texts for Humor Detection#F1-score
Node Classification#Flickr#Accuracy
Multi-Object Tracking#MOT17#MOTA
Sentiment Analysis#Amazon Review Full#Accuracy
Language Modelling#Hutter Prize#Bit per Character (BPC)
Semantic Segmentation#ScanNet#3DIoU
Semantic Segmentation#ADE20K#Test Score
Crowd Counting#UCF-QNRF#MAE
word_sense_disambiguation#SemEval 2007#F1
Question Answering#WikiQA#MAP
Image-to-Image Translation#COCO-Stuff Labels-to-Photos#mIoU
Keypoint Detection#COCO test-dev#AP50
Semantic Segmentation#Nighttime Driving#mIoU
Semantic Textual Similarity#SICK#Spearman Correlation
Text-to-Image Generation#CUB#Inception score
Visual Dialog#Visual Dialog v1.0 test-std#R@10
Mortality Prediction#MIMIC-III#Precision
Keypoint Detection#COCO test-dev#AP75
Dependency Parsing#Penn Treebank#UAS
Graph Classification#NCI109#Accuracy
Text Summarization#X-Sum#ROUGE-3
Text Summarization#X-Sum#ROUGE-2
Text Summarization#X-Sum#ROUGE-1
Unsupervised Domain Adaptation#Duke to MSMT#rank-1
Person Search#CUHK-SYSU#MAP
Unsupervised Domain Adaptation#Duke to MSMT#rank-5
Semantic Role Labeling#OntoNotes#F1
Semantic Similarity#SICK#Pearson Correlation
Video Retrieval#LSMDC#text-to-video R@10
Image Classification#VTAB-1k#Top-1 Accuracy
Anomaly Detection#Unlabeled CIFAR-10 vs CIFAR-100#AUROC
Line Segment Detection#wireframe dataset#sAP5
Domain Adaptation#SVNH-to-MNIST#Accuracy
3D Point Cloud Classification#ScanObjectNN#Overall Accuracy
Vehicle Pose Estimation#KITTI Cars Hard#Average Orientation Similarity
Weakly Supervised Object Detection#PASCAL VOC 2012 test#MAP
Aspect-Based Sentiment Analysis#SemEval 2014 Task 4 Sub Task 2#Laptop (Acc)
Few-Shot Image Classification#OMNIGLOT - 1-Shot, 5-way#Accuracy
Language Modelling#WikiText-2#Test perplexity
Graph Classification#IMDb-B#Accuracy
sentiment_analysis#SST-2#Accuracy
Multi-tissue Nucleus Segmentation#Kumar#Hausdorff Distance (mm)
Hate Speech Detection#Ethos Binary#Precision
Time Series Classification#AUSLAN#Accuracy
Click-Through Rate Prediction#Dianping#AUC
Face Verification#Trillion Pairs Dataset#Accuracy
Sentiment Analysis#TweetEval#Irony
dependency_parsing#Penn Treebank#LAS
Sentiment Analysis#MR#Accuracy
Video Generation#UCF-101 16 frames, Unconditional, Single GPU#Inception Score
Unsupervised Machine Translation#WMT2016 English-German#BLEU
Node Classification#Wisconsin#Accuracy
Cross-Modal Retrieval#COCO 2014#Text-to-image R@5
Cross-Modal Retrieval#COCO 2014#Text-to-image R@1
Video Instance Segmentation#YouTube-VIS validation#AR1
Question Answering#NewsQA#F1
Visual Object Tracking#VOT2017#Expected Average Overlap (EAO)
Node Classification#Wikipedia#Accuracy
Action Classification#Kinetics-700#Top-1 Accuracy
Atari Games#Atari 2600 Kung-Fu Master#Score
Image Classification#CIFAR-100#Percentage correct
Machine Translation#WMT2014 German-English#BLEU score
Object Counting#Pascal VOC 2007 count-test#m-reIRMSE-nz
Trajectory Prediction#Stanford Drone#FDE-8/12 @K= 20
Zero-Shot Learning#CUB-200-2011#average top-1 classification accuracy
Word Sense Disambiguation#Supervised:#SemEval 2015
Named Entity Recognition#BC5CDR#F1
Word Sense Disambiguation#Supervised:#SemEval 2013
Word Sense Disambiguation#Supervised:#SemEval 2007
Language Modelling#WikiText-2#Number of params
Line Segment Detection#wireframe dataset#sAP15
Line Segment Detection#wireframe dataset#sAP10
Node Classification#Pubmed#Accuracy
Neural Architecture Search#CIFAR-10 Image Classification#FLOPS
Visual Object Tracking#GOT-10k#Success Rate 0.5
Retinal OCT Disease Classification#OCT2017#Acc
Lesion Segmentation#Anatomical Tracings of Lesions After Stroke (ATLAS)#Dice
Lane Detection#TuSimple#Accuracy
summarization#CNN / Daily Mail (Non-anonymized version)#METEOR
Image Clustering#CIFAR-10#Backbone
Neural Architecture Search#NAS-Bench-201, CIFAR-10#Accuracy (Test)
6D Pose Estimation using RGBD#LineMOD#Mean ADD
text_classification#DBpedia#Error
Person Re-Identification#MARS#mAP
Visual Question Answering#COCO Visual Question Answering (VQA) real images 1.0 multiple choice#Percentage correct
Time Series Classification#KickvsPunch#Accuracy
Hyperspectral Image Classification#Pavia University#Overall Accuracy
Text Simplification#TurkCorpus#SARI (EASSE>=0.2.1)
Graph Clustering#Cora#Accuracy
Vision and Language Navigation#VLN Challenge#spl
Crowd Counting#UCF CC 50#MAE
Keypoint Detection#COCO test-challenge#AP50
Video Retrieval#LSMDC#text-to-video Median Rank
Sentiment Analysis#TweetEval#Stance
chunking#Penn Treebank#F1
Keypoint Detection#COCO test-challenge#AP75
Relation Extraction#ACE 2004#NER Micro F1
Semi-Supervised Image Classification#ImageNet - 10% labeled data#Top 1 Accuracy
Atari Games#Atari 2600 HERO#Score
Multi-tissue Nucleus Segmentation#Kumar#Dice
Link Prediction#WN18#Hits@10
Semantic Segmentation#S3DIS#mAcc
Image Super-Resolution#BSD100 - 4x upscaling#SSIM
Image Classification#mini WebVision 1.0#ImageNet Top-1 Accuracy
Anomaly Detection#One-class ImageNet-30#AUROC
Few-Shot Image Classification#Tiered ImageNet 5-way (1-shot)#Accuracy
Neural Architecture Search#ImageNet#Params
Multimodal Activity Recognition#Moments in Time Dataset#Top-5 (%)
question_answering#SearchQA#EM
question_answering#SearchQA#F1
Image-to-Image Translation#Cityscapes Labels-to-Photo#Per-pixel Accuracy
Real-Time Semantic Segmentation#CamVid#Frame (fps)
Image Generation#CIFAR-10#Inception score
Click-Through Rate Prediction#MovieLens 20M#AUC
summarization#CNN / Daily Mail (Non-anonymized version)#ROUGE-L
Action Recognition#NTU RGB+D#Accuracy (CV)
Cross-Modal Retrieval#Flickr30k#Image-to-text R@5
Cross-Modal Retrieval#Flickr30k#Image-to-text R@1
Semantic Segmentation#ADE20K val#mIoU
Multi-Label Classification#PASCAL VOC 2007#mAP
Ad-Hoc Information Retrieval#TREC Robust04#nDCG@20
Scene Text Detection#Total-Text#Recall
Unsupervised Person Re-Identification#DukeMTMC-reID#Rank-1
Birds Eye View Object Detection#KITTI Cars Easy#AP
Emotion Recognition in Conversation#MELD#Weighted Macro-F1
Graph Classification#UPFD-GOS#Accuracy (%)
Named Entity Recognition#CoNLL 2003 (German)#F1
Person Re-Identification#MSMT17#mAP
Image Matting#Composition-1K#Grad
Birds Eye View Object Detection#KITTI Pedestrians Moderate#AP
Atari Games#Atari 2600 Space Invaders#Score
Real-Time Object Detection#PASCAL VOC 2007#MAP
Graph Regression#ZINC#MAE
Sentiment Analysis#Multi-Domain Sentiment Dataset#Electronics
Action Recognition#NTU RGB+D#Accuracy (CS)
Semantic Textual Similarity#SentEval#STS
Neural Architecture Search#NAS-Bench-201, CIFAR-100#Search time (s)
Node Classification#MAG240M-LSC#Test Accuracy
summarization#CNN / Daily Mail (Non-anonymized version)#ROUGE-1
summarization#CNN / Daily Mail (Non-anonymized version)#ROUGE-2
Retinal OCT Disease Classification#Srinivasan2014#Acc
Skeleton Based Action Recognition#SYSU 3D#Accuracy
Video Frame Interpolation#Middlebury#Interpolation Error
Word Sense Disambiguation#WiC-TSV#Task 3 Accuracy: all
Grammatical Error Correction#JFLEG#GLEU
Grayscale Image Denoising#BSD68 sigma50#PSNR
Facial Expression Recognition#AffectNet#Accuracy (8 emotion)
Text Summarization#CNN / Daily Mail (Anonymized)#ROUGE-L
Link Prediction#WN18RR#MRR
Text Summarization#CNN / Daily Mail (Anonymized)#ROUGE-2
Linguistic Acceptability#CoLA#Accuracy
Sentiment Analysis#Multi-Domain Sentiment Dataset#Average
Graph Classification#HIV-fMRI-77#Accuracy
Text Summarization#CNN / Daily Mail (Anonymized)#ROUGE-1
Monocular Depth Estimation#NYU-Depth V2#RMSE
Colorectal Gland Segmentation:#CRAG#F1-score
Video Retrieval#MSVD#text-to-video R@10
Fact-based Text Editing#WebEdit#Precision
Speech Recognition#MediaSpeech#WER for Spanish
Metric Learning#CARS196#R@1
Action Classification#Moments in Time#Top 1 Accuracy
Node Classification#Cora (0.5%)#Accuracy
Question Answering#SQuAD1.1 dev#F1
Question Answering#SQuAD1.1 dev#EM
Video Instance Segmentation#YouTube-VIS validation#AR10
Few-Shot Image Classification#Tiered ImageNet 10-way (5-shot)#Accuracy
Few-Shot Image Classification#Mini-ImageNet-CUB 5-way (1-shot)#Accuracy
Weakly Supervised Object Detection#PASCAL VOC 2007#MAP
Semi-Supervised Video Object Segmentation#DAVIS 2017 (test-dev)#Jaccard (Recall)
Image Retrieval#Par106k#mAP
Fake News Detection#FNC-1#Per-class Accuracy (Agree)
Fundus to Angiography Generation#Fundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic Patients#FID
Atari Games#Atari 2600 Centipede#Score
Image Generation#STL-10#FID
Image Clustering#CIFAR-100#Train Set
Weakly Supervised Object Detection#Charades#MAP
part-of-speech_tagging#Penn Treebank#Accuracy
word_sense_disambiguation#SemEval 2013#F1
Unsupervised Domain Adaptation#Duke to Market#mAP
Video Super-Resolution#Vid4 - 4x upscaling#SSIM
Speech Enhancement#Deep Noise Suppression (DNS) Challenge#PESQ-NB
JPEG Artifact Correction#ICB (Quality 10 Color)#SSIM
Few-Shot Image Classification#Mini-Imagenet 10-way (5-shot)#Accuracy
Multi-Person Pose Estimation#COCO test-dev#AP75
Image Denoising#SIDD#PSNR (sRGB)
RGB-D Salient Object Detection#NLPR#max F-Measure
Action Recognition#EPIC-KITCHENS-100#Noun@1
Node Classification#BlogCatalog#Accuracy
Speech Enhancement#DEMAND#COVL
Named Entity Recognition#CoNLL 2002 (Spanish)#F1
Multi-Person Pose Estimation#COCO test-dev#AP50
Time Series Classification#ArabicDigits#NLL
Referring Expression Segmentation#RefCOCO testA#IoU
Joint Entity and Relation Extraction#SciERC#Relation F1
Action Segmentation#Breakfast#F1@50%
Face Identification#Trillion Pairs Dataset#Accuracy
Neural Architecture Search#ImageNet#MACs
Sentiment Analysis#SST-2 Binary classification#Accuracy
Monocular 3D Human Pose Estimation#Human3.6M#Use Video Sequence
Relation Extraction#ChemProt#F1
Atari Games#Atari 2600 Double Dunk#Score
Node Classification#Citeseer#Validation
Semi-Supervised Image Classification#SVHN, 250 Labels#Accuracy
RGB-D Salient Object Detection#SIP#S-Measure
Data-to-Text Generation#MULTIWOZ 2.1#BLEU
Image Super-Resolution#Set14 - 2x upscaling#PSNR
Self-Supervised Action Recognition#HMDB51#Pre-Training Dataset
Video Retrieval#MSR-VTT-1kA#text-to-video R@5
Video Retrieval#MSR-VTT-1kA#text-to-video R@1
Instance Segmentation#COCO minival#AP50
Object Detection#COCO test-dev#APS
RGB-D Salient Object Detection#STERE#Average MAE
Scene Text Recognition#ICDAR 2003#Accuracy
Click-Through Rate Prediction#Criteo#AUC
Node Classification#Citeseer#Accuracy
JPEG Artifact Correction#Live1 (Quality 10 Grayscale)#PSNR-B
Speech Enhancement#Deep Noise Suppression (DNS) Challenge#PESQ-WB
Recommendation Systems#MovieLens 20M#Recall@20
Instance Segmentation#COCO minival#AP75
Sentiment Analysis#SemEval 2014 Task 4 Subtask 1+2#F1
Image Classification#mini WebVision 1.0#Top-5 Accuracy
Abstractive Text Summarization#CNN / Daily Mail#ROUGE-L
Neural Architecture Search#NAS-Bench-201, CIFAR-10#Accuracy (val)
Abstractive Text Summarization#CNN / Daily Mail#ROUGE-1
Abstractive Text Summarization#CNN / Daily Mail#ROUGE-2
Audio Classification#ESC-50#Top-1 Accuracy
Object Detection#COCO test-dev#APM
Object Detection#COCO test-dev#APL
Retinal Vessel Segmentation#DRIVE#F1 score
Music Modeling#Nottingham#NLL
Fine-Grained Image Classification#Food-101#Accuracy
Common Sense Reasoning#Winograd Schema Challenge#Score
language_modeling#Hutter Prize#Number of params
Quantization#ImageNet#Accuracy (%)
Language Modelling#Penn Treebank (Character Level)#Number of params
Music Source Separation#MUSDB18#SDR (drums)
Machine Translation#WMT2016 English-German#BLEU score
Link Prediction#OpenBioLink#Hits@10
Image Generation#ImageNet 64x64#Bits per dim
Few-Shot Image Classification#Mini-ImageNet-CUB 5-way (5-shot)#Accuracy
Fine-Grained Image Classification#Oxford-IIIT Pets#PARAMS
Grammatical Error Detection#CoNLL-2014 A1#F0.5
Object Counting#COCO count-test#m-reIRMSE-nz
Image Clustering#MNIST-full#Accuracy
Visual Object Tracking#OTB-2013#AUC
Bias Detection#StereoSet#ICAT Score
Line Segment Detection#wireframe dataset#F1 score
Image-to-Image Translation#ADE20K-Outdoor Labels-to-Photos#FID
Single Image Deraining#Test100#PSNR
Visual Dialog#Visual Dialog v1.0 test-std#NDCG (x 100)
JPEG Artifact Correction#LIVE1 (Quality 20 Color)#PSNR
Birds Eye View Object Detection#KITTI Cars Moderate#AP
Language Modelling#WikiText-2#Validation perplexity
Machine Translation#IWSLT2014 German-English#BLEU score
Graph Classification#REDDIT-B#Accuracy
Recommendation Systems#Netflix#nDCG@100
Image Classification#ImageNet#Top 1 Accuracy
Natural Language Inference#SciTail#Accuracy
Weakly Supervised Action Localization#THUMOS 2014#mAP@0.1:0.7
Weakly Supervised Action Localization#THUMOS 2014#mAP@0.1:0.5
Scene Text Recognition#ICDAR2015#Accuracy
Image Super-Resolution#Set5 - 3x upscaling#SSIM
Crowd Counting#ShanghaiTech A#MAE
Semi-Supervised Video Object Segmentation#YouTube-VOS#Overall
Recommendation Systems#Douban Monti#RMSE
Open-Domain Question Answering#Quasar#F1 (Quasar-T)
Instance Segmentation#COCO minival#APL
Instance Segmentation#COCO minival#APM
Instance Segmentation#COCO minival#APS
Semi-Supervised Video Object Segmentation#YouTube-VOS#Jaccard (Seen)
Object Detection#KITTI Cars Hard#AP
Task-Oriented Dialogue Systems#KVRET#Entity F1
3D Object Detection#KITTI Pedestrians Moderate#AP
Multi-Person Pose Estimation#CrowdPose#mAP @0.5:0.95
Motion Segmentation#Apolloscape#Accuracy
Semantic Segmentation#ADE20K#Validation mIoU
Action Recognition#EPIC-KITCHENS-100#Verb@1
Action Recognition#THUMOS’14#mAP@0.3
Action Recognition#THUMOS’14#mAP@0.4
Action Recognition#THUMOS’14#mAP@0.5
named_entity_recognition#Ontonotes v5 (English)#F1
Action Recognition#THUMOS’14#mAP@0.1
Action Recognition#THUMOS’14#mAP@0.2
Action Segmentation#GTEA#F1@10%
language_modeling#WikiText-103#Test perplexity
Image-to-Image Translation#GTAV-to-Cityscapes Labels#mIoU
Continual Learning#visual domain decathlon (10 tasks)#decathlon discipline (Score)
Aspect Sentiment Triplet Extraction#SemEval#F1
Image Super-Resolution#FFHQ 1024 x 1024 - 4x upscaling#SSIM
Video Generation#BAIR Robot Pushing#FVD score
Relation Extraction#ACE 2004#RE+ Micro F1
Multi-Person Pose Estimation#COCO test-dev#AP
Monocular Depth Estimation#KITTI Eigen split#absolute relative error
Atari Games#Atari 2600 Tutankham#Score
RGB-D Salient Object Detection#LFSD#Average MAE
Unsupervised Domain Adaptation#Duke to Market#rank-10
Dense Video Captioning#ActivityNet Captions#METEOR
Image Super-Resolution#Set14 - 4x upscaling#PSNR
Domain Adaptation#Office-31#Average Accuracy
3D Object Detection#KITTI Cyclists Moderate#AP
Reading Comprehension#RACE#Accuracy
Panoptic Segmentation#Cityscapes val#PQst
Scene Text Detection#SCUT-CTW1500#Precision
Speech Separation#wsj0-2mix#SI-SDRi
question_answering#SearchQA#Unigram Acc
Panoptic Segmentation#Cityscapes val#PQth
Self-Supervised Image Classification#ImageNet (finetuned)#Top 1 Accuracy
Unsupervised Domain Adaptation#Market to Duke#rank-10
Continuous Control#PyBullet HalfCheetah#Return
language_modeling#Penn Treebank#Bit per Character (BPC)
amr_parsing#LDC2014T12#F1 on Newswire
Time Series Classification#JapaneseVowels#Accuracy
Weakly-supervised 3D Human Pose Estimation#Human3.6M#Average MPJPE (mm)
Face Verification#IJB-C#TAR @ FAR=0.01
3D Human Pose Estimation#3DPW#MPJPE
Neural Architecture Search#ImageNet#Top-1 Error Rate
Fine-Grained Image Classification#Birdsnap#Accuracy
Fact-based Text Editing#WebEdit#ADD
Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#SSIM
Protein Secondary Structure Prediction#CB513#Q8
3D Object Detection#KITTI Cars Moderate val#AP
Action Recognition#UCF101#3-fold Accuracy
Dense Object Detection#SKU-110K#AP
Image Retrieval#Oxf105k#MAP
Skeleton Based Action Recognition#Varying-view RGB-D Action-Skeleton#Accuracy (AV I)
Sequential Image Classification#Sequential MNIST#Unpermuted Accuracy
Node Classification#Coauthor CS#Accuracy
Graph Classification#CIFAR10 100k#Accuracy (%)
RGB-D Salient Object Detection#DES#Average MAE
question_answering#SQuAD#F1
question_answering#SQuAD#EM
Image-to-Image Translation#Cityscapes Photo-to-Labels#Per-class Accuracy
Video Object Detection#ImageNet VID#runtime (ms)
Video Retrieval#MSR-VTT-1kA#text-to-video R@10
Real-Time Object Detection#COCO#MAP
Neural Architecture Search#NAS-Bench-201, ImageNet-16-120#Search time (s)
Temporal Action Proposal Generation#ActivityNet-1.3#AUC (val)
Aspect-Based Sentiment Analysis#SemEval 2014 Task 4 Sub Task 2#Restaurant (Acc)
Time Series Classification#ArabicDigits#Accuracy
Conditional Image Generation#ImageNet 128x128#Inception score
Face Alignment#WFLW#AUC@0.1 (all)
Image Classification#SVHN#Percentage error
Semantic Textual Similarity#STS14#Spearman Correlation
Multi-Person Pose Estimation#COCO test-dev#APL
Multi-Person Pose Estimation#COCO test-dev#APM
Neural Architecture Search#NAS-Bench-201, CIFAR-100#Accuracy (Test)
3D Instance Segmentation#S3DIS#mRec
Image Retrieval#In-Shop#R@1
Photo geolocation estimation#Im2GPS#Continent level (2500 km)
Graph Classification#MUTAG#Accuracy
Recommendation Systems#MovieLens 100K#RMSE (u1 Splits)
Word Sense Disambiguation#WiC-TSV#Task 1 Accuracy: general purpose
Real-Time Object Detection#COCO#inference time (ms)
3D Object Detection#KITTI Pedestrians Easy#AP
Real-time Instance Segmentation#MSCOCO#mask AP
Image Classification#MNIST#Accuracy
Image Clustering#CIFAR-10#Train set
Real-Time Object Detection#PASCAL VOC 2007#FPS
Pedestrian Detection#CityPersons#Bare MR^-2
Unsupervised Domain Adaptation#Duke to Market#rank-5
Semantic Segmentation#Cityscapes val#mIoU
Unsupervised Domain Adaptation#Duke to Market#rank-1
RGB Salient Object Detection#HKU-IS#MAE
Image Super-Resolution#Set5 - 4x upscaling#PSNR
Image Super-Resolution#FFHQ 256 x 256 - 4x upscaling#FID
Unsupervised Video Object Segmentation#DAVIS 2016#J&F
Crowd Counting#WorldExpo’10#Average MAE
Dense Object Detection#SKU-110K#AP75
Face Alignment#AFLW2000-3D#Mean NME
Generalized Zero-Shot Learning#SUN Attribute#Harmonic mean
Real-Time Semantic Segmentation#CamVid#Time (ms)
Emotion Recognition in Context#EMOTIC#mAP
Few-Shot Image Classification#OMNIGLOT - 1-Shot, 20-way#Accuracy
3D Human Pose Estimation#Human3.6M#Using 2D ground-truth joints
Spoken language identification#LRE07#30 sec
Recommendation Systems#MovieLens 20M#Recall@50
Stochastic Optimization#CIFAR-10 WRN-28-10 - 200 Epochs#Accuracy
Time Series Classification#PhysioNet Challenge 2012#AUC Stdev
Node Classification#PubMed with Public Split: fixed 20 nodes per class#Accuracy
summarization#DUC 2004 Task 1#ROUGE-L
6D Pose Estimation using RGB#LineMOD#Accuracy (ADD)
Person Search#CUHK-SYSU#Top-1
dependency_parsing#benchmark Vietnamese dependency treebank VnDT#LAS
3D Human Pose Estimation#MPI-INF-3DHP#3DPCK
summarization#DUC 2004 Task 1#ROUGE-2
summarization#DUC 2004 Task 1#ROUGE-1
Node Classification#PubMed (0.05%)#Accuracy
Link Prediction#WN18RR#Hits@10
Visual Question Answering#VCR (QA-R) test#Accuracy
Question Answering#Natural Questions (long)#F1
Person Re-Identification#CUHK03 detected#MAP
Atari Games#Atari 2600 Surround#Score
RGB-D Salient Object Detection#SIP#max F-Measure
Atari Games#Atari 2600 Boxing#Score
Visual Question Answering#DocVQA test#ANLS
Unsupervised Video Object Segmentation#DAVIS 2016#F-measure (Mean)
Traffic Prediction#METR-LA#MAE @ 12 step
Action Segmentation#GTEA#F1@25%
Person Re-Identification#PRID2011#Rank-20
Scene Text Detection#COCO-Text#F-Measure
Atari Games#Atari 2600 Bank Heist#Score
Node Classification#Cora (1%)#Accuracy
Monocular 3D Human Pose Estimation#Human3.6M#Average MPJPE (mm)
Neural Network Compression#CIFAR-10#Size (MB)
Object Counting#COCO count-test#mRMSE-nz
Question Answering#SQuAD2.0#EM
Facial Expression Recognition#FER2013#Accuracy
Image Classification#STL-10#Percentage correct
Question Answering#SQuAD2.0#F1
Unsupervised Domain Adaptation#Market to MSMT#mAP
machine_translation#The IWSLT 2015 Evaluation Campaign#BLEU
Scene Text Detection#ICDAR 2015#F-Measure
Text Classification#IMDb#Accuracy (2 classes)
Facial Landmark Detection#300W#NME
Unsupervised Domain Adaptation#Market to MSMT#rank-5
Language Modelling#Text8#Number of params
Unsupervised Domain Adaptation#Market to MSMT#rank-1
Link Prediction#FB15k#Hits@1
Node Classification#Texas#Accuracy
Atari Games#Atari 2600 River Raid#Score
Cross-View Image-to-Image Translation#Dayton (64×64) - aerial-to-ground#SSIM
Link Prediction#FB15k#Hits@3
Cross-Modal Retrieval#Flickr30k#Image-to-text R@10
Supervised Video Summarization#TvSum#F1-score (Canonical)
Few-Shot Image Classification#OMNIGLOT - 5-Shot, 5-way#Accuracy
Sequential Image Classification#Sequential CIFAR-10#Unpermuted Accuracy
Semi-Supervised Video Object Segmentation#DAVIS 2016#Jaccard (Recall)
Person Re-Identification#DukeMTMC-reID#Rank-1
Cross-Modal Retrieval#COCO 2014#Text-to-image R@10
Semantic Segmentation#Cityscapes test#Category mIoU
Person Re-Identification#DukeMTMC-reID#Rank-5
Image Super-Resolution#BSD100 - 2x upscaling#SSIM
Word Sense Disambiguation#Words in Context#Accuracy
Action Recognition#NTU RGB+D 120#Accuracy (Cross-Setup)
Node Classification#Pubmed#Training Split
Weakly Supervised Action Localization#THUMOS14#avg-mAP (0.1-0.9)
Layout-to-Image Generation#COCO-Stuff 64x64#Inception Score
Atari Games#Atari 2600 Venture#Score
Text Generation#MATH#Average Accuracy
Grayscale Image Denoising#BSD68 sigma15#PSNR
Visual Question Answering#VQA v2 test-std#other
Question Answering#CoQA#Out-of-domain
Semantic Textual Similarity#MRPC#Accuracy
Human-Object Interaction Detection#HICO-DET#Time Per Frame (ms)
Line Segment Detection#York Urban Dataset#sAP5
Recommendation Systems#MovieLens 20M#nDCG@100
Question Answering#RACE#RACE-h
Question Answering#RACE#RACE-m
Semantic Segmentation#Cityscapes test#Mean IoU (class)
Weakly Supervised Action Localization#THUMOS14#avg-mAP (0.1-0.5)
Superpixel Image Classification#75 Superpixel MNIST#Classification Error
Commonsense Reasoning for RL#commonsense-rl#Avg #Steps
Time Series Classification#PhysioNet Challenge 2012#AUC
Pose Transfer#Deep-Fashion#SSIM
Semi-Supervised Video Object Segmentation#DAVIS 2017 (val)#F-measure (Decay)
Image-to-Image Translation#Cityscapes Photo-to-Labels#Per-pixel Accuracy
text_classification#TREC#Error
Medical Image Segmentation#Kvasir-SEG#Average MAE
Speech Enhancement#CHiME-3#SDR
Head Pose Estimation#AFLW2000#MAE
Gesture-to-Gesture Translation#Senz3D#IS
Visual Question Answering#GQA Test2019#Plausibility
3D Object Detection#KITTI Cars Easy#AP
Image Clustering#MNIST-test#Accuracy
Time Series Classification#UWave#Accuracy
Visual Dialog#Visual Dialog v1.0 test-std#MRR (x 100)
Image-to-Image Translation#Cityscapes Photo-to-Labels#Class IOU
Task-Oriented Dialogue Systems#KVRET#BLEU
word_sense_disambiguation#SemEval 2015#F1
Image Relighting#VIDIT’20 validation set#LPIPS
Weakly-supervised 3D Human Pose Estimation#Human3.6M#Number of Views
JPEG Artifact Correction#ICB (Quality 10 Grayscale)#PSNR-B
Image Classification#ImageNet#Top 5 Accuracy
Image Clustering#CIFAR-10#Accuracy
Atari Games#Atari 2600 Up and Down#Score
Depth Estimation#NYU-Depth V2#RMS
Person Re-Identification#DukeMTMC-reID#MAP
Image Super-Resolution#WebFace - 8x upscaling#PSNR
Graph Classification#NCI1#Accuracy
Deblurring#GoPro#SSIM
Hate Speech Detection#HateXplain#Macro F1
Visual Question Answering#GQA Test2019#Validity
machine_translation#WMT 2014 EN-DE#BLEU
Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#LPIPS
Visual Dialog#VisDial v0.9 val#MRR
Keyword Spotting#Google Speech Commands#Google Speech Commands V2 12
Grammatical Error Detection#FCE#F0.5
Facial Expression Recognition#AffectNet#Accuracy (7 emotion)
Emotion Recognition in Conversation#IEMOCAP#F1
Link Prediction#FB15k#Hits@10
JPEG Artifact Correction#ICB (Quality 10 Grayscale)#PSNR
Semi-Supervised Image Classification#CIFAR-10, 1000 Labels#Accuracy
Relation Extraction#NYT#F1
Semi-Supervised Semantic Segmentation#Pascal VOC 2012 12.5% labeled#Validation mIoU
Scene Text Detection#COCO-Text#Precision
Keyword Spotting#Google Speech Commands#Google Speech Commands V2 35
Weakly Supervised Action Localization#THUMOS’14#mAP@0.5
Object Detection#COCO test-dev#box AP
Word Sense Disambiguation#WiC-TSV#Task 1 Accuracy: domain specific
Image Super-Resolution#BSD100 - 4x upscaling#PSNR
Atari Games#Atari 2600 Name This Game#Score
Relation Extraction#ACE 2005#NER Micro F1
Data-to-Text Generation#LDC2017T10#BLEU
Self-Supervised Action Recognition#UCF101#Pre-Training Dataset
Pose Estimation#COCO test-dev#AR
Pose Estimation#COCO test-dev#AP
Graph Classification#NEURON-MULTI#Accuracy
Relation Extraction#ACE 2005#Sentence Encoder
Image Generation#ImageNet 32x32#bpd
relation_prediction#FB15K-237#MRR
Action Recognition#HMDB-51#Average accuracy of 3 splits
Action Recognition#AVA v2.2#mAP
ccg_supertagging#CCGBank#Accuracy
Data-to-Text Generation#E2E NLG Challenge#BLEU
Atari Games#Atari 2600 Star Gunner#Score
Visual Question Answering#VCR (Q-A) test#Accuracy
Scene Text Detection#SCUT-CTW1500#F-Measure
Video Semantic Segmentation#Cityscapes val#mIoU
Action Recognition#Something-Something V1#Top 1 Accuracy
Link Prediction#FB15k-237#Hits@3
Link Prediction#FB15k-237#Hits@1
Text Classification#Yahoo! Answers#Accuracy
Partial Domain Adaptation#Office-Home#Accuracy (%)
6D Pose Estimation using RGB#Occlusion LineMOD#Mean ADD
Image Generation#CIFAR-10#bits/dimension
Graph Regression#ZINC-500k#MAE
Intent Detection#ATIS#F1
Human Part Segmentation#PASCAL-Part#mIoU
relation_prediction#WN18RR#H@10
Image Retrieval with Multi-Modal Query#MIT-States#Recall@10
Intent Detection#SNIPS#Slot F1 Score
taxonomy_learning#SemEval 2018#P@5
Video Instance Segmentation#YouTube-VIS validation#mask AP
Face Detection#WIDER Face (Hard)#AP
Image-to-Image Translation#ADE20K-Outdoor Labels-to-Photos#mIoU
Scene Text Detection#ICDAR 2013#Recall
Unsupervised Person Re-Identification#Market-1501#Rank-1
dependency_parsing#Penn Treebank#POS
question_answering#CNN / Daily Mail#Accuracy on CNN
Optical Flow Estimation#KITTI 2015#Fl-all
Semantic Segmentation#PASCAL VOC 2012 val#mIoU
Named Entity Recognition#CoNLL++#F1
Question Answering#bAbi#Accuracy (trained on 1k)
Time Series Classification#Libras#NLL
Dense Pixel Correspondence Estimation#HPatches#Viewpoint II AEPE
Image Clustering#MNIST-full#NMI
Machine Translation#WMT2015 English-German#BLEU score
3D Face Reconstruction#NoW Benchmark#Mean Reconstruction Error (mm)
Semantic Segmentation#PASCAL VOC 2012 test#Mean IoU
Relation Extraction#CoNLL04#RE+ Macro F1
Pose Estimation#UPenn Action#Mean PCK@0.2
Conversational Response Selection#DSTC7 Ubuntu#1-of-100 Accuracy
Image Classification#WebVision-1000#Top-1 Accuracy
Atari Games#Atari 2600 Yars Revenge#Score
JPEG Artifact Correction#ICB (Quality 10 Color)#PSNR-B
Temporal Action Localization#ActivityNet-1.3#mAP IOU@0.5
Unsupervised Video Object Segmentation#DAVIS 2016#Jaccard (Mean)
Image Super-Resolution#Urban100 - 2x upscaling#SSIM
Visual Question Answering#GQA Test2019#Open
Single Image Deraining#Rain100L#SSIM
Entity Linking#WiC-TSV#Task 3 Accuracy: general purpose
Scene Text Detection#MSRA-TD500#F-Measure
Mortality Prediction#MIMIC-III#F1 score
Video Retrieval#MSR-VTT-1kA#text-to-video Mean Rank
Node Classification#Actor#Accuracy
language_modeling#Penn Treebank#Test perplexity
Gesture-to-Gesture Translation#Senz3D#PSNR
Image Generation#CLEVR#FID-5k-training-steps
Self-Supervised Image Classification#ImageNet#Top 1 Accuracy (kNN, k=20)
Fine-Grained Image Classification#CUB-200-2011#Accuracy
Lung Nodule Classification#LIDC-IDRI#Accuracy
Link Prediction#Pubmed#AP
Pedestrian Detection#CityPersons#Reasonable MR^-2
Link Prediction#WN18#MRR
Face Identification#MegaFace#Accuracy
Domain Adaptation#VisDA2017#Accuracy
Face Verification#MegaFace#Accuracy
Question Answering#YahooCQA#MRR
Scene Text Detection#COCO-Text#Recall
Video Frame Interpolation#Vimeo90k#PSNR
RGB Salient Object Detection#DUT-OMRON#MAE
Image Retrieval with Multi-Modal Query#MIT-States#Recall@5
Image Retrieval with Multi-Modal Query#MIT-States#Recall@1
Gesture-to-Gesture Translation#NTU Hand Digit#PSNR
Image Retrieval#SOP#R@1
Multi-Label Classification#MS-COCO#mAP
Keyword Spotting#Google Speech Commands#Google Speech Commands V1 12
3D Human Pose Estimation#MPI-INF-3DHP#AUC
Lipreading#CAS-VSR-W1k (LRW-1000)#Top-1 Accuracy
Weakly-Supervised Semantic Segmentation#PASCAL VOC 2012 val#Mean IoU
Machine Translation#WMT2016 German-English#BLEU score
Video Retrieval#MSR-VTT#video-to-text R@5
Visual Question Answering#MSRVTT-QA#Accuracy
Domain Generalization#ImageNet-A#Top-1 accuracy %
Action Recognition#Jester#Val
Image Super-Resolution#Set5 - 8x upscaling#PSNR
Semi-Supervised Image Classification#STL-10, 1000 Labels#Accuracy
Image Super-Resolution#Manga109 - 8x upscaling#PSNR
Visual Question Answering#VQA v2 test-std#overall
RGB-D Salient Object Detection#DES#max F-Measure
Image Clustering#Fashion-MNIST#Accuracy
Semantic Segmentation#PASCAL Context#mIoU
Semantic Similarity#SICK#MSE
Retinal Vessel Segmentation#STARE#F1 score
Image Super-Resolution#FFHQ 1024 x 1024 - 4x upscaling#FID
Machine Translation#WMT2014 English-German#BLEU score
3D Object Detection#KITTI Cars Hard val#AP
Image Super-Resolution#Urban100 - 4x upscaling#PSNR
3D Human Pose Estimation#Human3.6M#Multi-View or Monocular
Relation Extraction#CoNLL04#NER Macro F1
Image Super-Resolution#BSD100 - 4x upscaling#MOS
Semi-Supervised Image Classification#ImageNet - 1% labeled data#Top 5 Accuracy
Weakly-Supervised Semantic Segmentation#PASCAL VOC 2012 test#Mean IoU
Node Classification#PATTERN 100k#Accuracy (%)
Node Classification#MAG240M-LSC#Validation Accuracy
Image Generation#FFHQ#FID-10k-training-steps
relation_prediction#WN18RR#MRR
Fine-Grained Image Classification#DF20#Top-1
Fine-Grained Image Classification#DF20#Top-3
Word Sense Disambiguation#WiC-TSV#Task 1 Accuracy: all
3D Multi-Person Pose Estimation (root-relative)#MuPoTS-3D#3DPCK
Medical Image Segmentation#Kvasir-SEG#mean Dice
Video Retrieval#MSR-VTT#text-to-video R@1
RGB-D Salient Object Detection#LFSD#S-Measure
Semantic Textual Similarity#STS16#Spearman Correlation
RGB-D Salient Object Detection#STERE#max F-Measure
Semi-Supervised Video Object Segmentation#DAVIS 2016#F-measure (Recall)
Sentiment Analysis#TweetEval#Emotion
Neural Architecture Search#CIFAR-10#FLOPS
Atari Games#Atari 2600 Kangaroo#Score
Lane Detection#TuSimple#F1 score
Session-Based Recommendations#Diginetica#Hit@20
Atari Games#Atari 2600 Seaquest#Score
Neural Architecture Search#NAS-Bench-201, CIFAR-10#Search time (s)
Graph Classification#PROTEINS#Accuracy
Common Sense Reasoning#SWAG#Test
Multi-Object Tracking#MOT16#MOTA
Semi-Supervised Video Object Segmentation#DAVIS 2016#Jaccard (Decay)
Visual Question Answering#VQA v2 test-std#number
Object Detection#COCO minival#APL
Object Detection#COCO minival#APM
Object Detection#COCO minival#APS
Atari Games#Atari 2600 Krull#Score
JPEG Artifact Correction#LIVE1 (Quality 10 Color)#PSNR
Cross-Lingual Document Classification#MLDoc Zero-Shot English-to-German#Accuracy
RGB-D Salient Object Detection#DES#max E-Measure
Node Classification#PubMed (0.1%)#Accuracy
Link Prediction#WN18#MR
Semi-Supervised Image Classification#CIFAR-10, 40 Labels#Percentage error
Scene Text Detection#ICDAR 2013#F-Measure
Image Super-Resolution#Set5 - 2x upscaling#SSIM
Transfer Learning#Office-Home#Accuracy
JPEG Artifact Correction#ICB (Quality 20 Color)#PSNR-B
Image Classification#smallNORB#Classification Error
Image Super-Resolution#Manga109 - 2x upscaling#SSIM
Object Detection#USB (Standard USB 1.0 protocol)#mCAP
Deblurring#RealBlur-J (trained on GoPro)#PSNR (sRGB)
JPEG Artifact Correction#ICB (Quality 20 Grayscale)#PSNR-B
Aspect-Based Sentiment Analysis#SemEval 2014 Task 4 Sub Task 2#Mean Acc (Restaurant + Laptop)
Node Classification#Chameleon#Accuracy
Question Answering#CoQA#Overall
Visual Object Tracking#VOT2017/18#Expected Average Overlap (EAO)
Hate Speech Detection#HateXplain#AUROC
Node Classification#CiteSeer (0.5%)#Accuracy
Age-Invariant Face Recognition#CACDVS#Accuracy
Layout-to-Image Generation#COCO-Stuff 64x64#FID
Image Clustering#STL-10#NMI
JPEG Artifact Correction#ICB (Quality 20 Grayscale)#SSIM
Graph Classification#D&D#Accuracy
Text Summarization#GigaWord#ROUGE-L
RGB Salient Object Detection#DUTS-TE#MAE
Natural Language Inference#SNLI#% Test Accuracy
Text Summarization#GigaWord#ROUGE-1
Text Summarization#GigaWord#ROUGE-2
Unsupervised Domain Adaptation#Market to MSMT#rank-10
Surgical tool detection#Cholec80#mAP
RGB-D Salient Object Detection#NLPR#S-Measure
Semantic Textual Similarity#STS15#Spearman Correlation
Named Entity Recognition#Ontonotes v5 (English)#F1
Unsupervised Domain Adaptation#Market to Duke#rank-1
Heterogeneous Node Classification#DBLP (PACT) 14k#Micro-F1 (20% training data)
Unsupervised Domain Adaptation#Market to Duke#rank-5
Atari Games#Atari 2600 Berzerk#Score
Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#LLE
Image Classification#ImageNet#Number of params
Face Detection#WIDER Face (Easy)#AP
Action Classification#Kinetics-600#Top-1 Accuracy
Image Super-Resolution#FFHQ 256 x 256 - 4x upscaling#SSIM
question_answering#Quasar#F1 (Quasar-T)
Visual Object Tracking#OTB-2015#AUC
Text Simplification#Newsela#SARI
Action Classification#Kinetics-700#Top-5 Accuracy
Language Modelling#Text8#Bit per Character (BPC)
Image Super-Resolution#Urban100 - 8x upscaling#PSNR
Out-of-Distribution Detection#STL-10#Percentage correct
Dense Pixel Correspondence Estimation#HPatches#Viewpoint I AEPE
Object Detection#COCO minival#AP50
Semi-Supervised Semantic Segmentation#Pascal VOC 2012 5% labeled#Validation mIoU
Node Classification#Cora#Accuracy
Aesthetics Quality Assessment#AVA#Accuracy
Named Entity Recognition#ACE 2005#F1
Instance Segmentation#COCO test-dev#APS
taxonomy_learning#SemEval 2018#MRR
Fake News Detection#FNC-1#Per-class Accuracy (Disagree)
Instance Segmentation#COCO test-dev#APM
Instance Segmentation#COCO test-dev#APL
Entity Alignment#DBP15k zh-en#Hits@1
Object Detection#COCO minival#AP75
language_modeling#1B Words / Google Billion Word benchmark#Number of params
Action Segmentation#GTEA#F1@50%
Action Classification#Moments in Time#Top 5 Accuracy
Question Answering#Children's Book Test#Accuracy-NE
Cross-Modal Retrieval#COCO 2014#Image-to-text R@1
Action Recognition#Sports-1M#Video hit@1
Action Recognition#Sports-1M#Video hit@5
Time Series Classification#PEMS#Accuracy
Real-Time Semantic Segmentation#NYU Depth v2#Speed(ms/f)
Cross-Modal Retrieval#COCO 2014#Image-to-text R@5
Word Sense Disambiguation#Supervised:#Senseval 3
Word Sense Disambiguation#Supervised:#Senseval 2
Image-to-Image Translation#Cityscapes Labels-to-Photo#Per-class Accuracy
Image Super-Resolution#Manga109 - 4x upscaling#PSNR
Retinal Vessel Segmentation#CHASE_DB1#AUC
Atari Games#Atari 2600 Frostbite#Score
Vision and Language Navigation#VLN Challenge#oracle success
Relation Extraction#WebNLG#F1
Drug Discovery#Tox21#AUC
Image Generation#FFHQ 256 x 256#FID
Question Answering#TriviaQA#F1
Semi-Supervised Semantic Segmentation#Pascal VOC 2012 2% labeled#Validation mIoU
Semantic Textual Similarity#STS12#Spearman Correlation
Fine-Grained Image Classification#DF20#F1 - macro
Few-Shot Image Classification#FC100 5-way (1-shot)#Accuracy
Speech Recognition#swb_hub_500 WER fullSWBCH#Percentage error
Speech Recognition#MediaSpeech#WER for French
Image Classification#EMNIST-Letters#Accuracy
Time Series Classification#NetFlow#Accuracy
Text Style Transfer#Yelp Review Dataset (Small)#G-Score (BLEU, Accuracy)
Self-Supervised Action Recognition#HMDB51#Top-1 Accuracy
Semantic Textual Similarity#STS13#Spearman Correlation
Link Prediction#Cora#AP
Relation Extraction#SemEval-2010 Task 8#F1
Incremental Learning#CIFAR-100 - 50 classes + 5 steps of 10 classes#Average Incremental Accuracy
Cross-View Image-to-Image Translation#cvusa#SSIM
Speech Recognition#MediaSpeech#WER for Arabic
Person Search#PRW#Top-1
Image Clustering#CIFAR-100#NMI
Face Verification#YouTube Faces DB#Accuracy
Named Entity Recognition#CoNLL 2002 (Dutch)#F1
Image Super-Resolution#VggFace2 - 8x upscaling#PSNR
Lesion Segmentation#Anatomical Tracings of Lesions After Stroke (ATLAS)#Recall
Synthetic-to-Real Translation#GTAV-to-Cityscapes Labels#mIoU
Fine-Grained Image Classification#Oxford-IIIT Pets#Accuracy
Image Classification#Fashion-MNIST#Percentage error
Question Answering#Children's Book Test#Accuracy-CN
Action Recognition#Something-Something V2#Top-5 Accuracy
Atari Games#Atari 2600 Fishing Derby#Score
Question Answering#NarrativeQA#BLEU-4
Question Answering#NarrativeQA#BLEU-1
Text Classification#20NEWS#Accuracy
Image Denoising#DND#PSNR (sRGB)
Visual Object Tracking#VOT2016#Expected Average Overlap (EAO)
Semi-Supervised Image Classification#SVHN, 500 Labels#Accuracy
sentiment_analysis#IMDb#Accuracy
Unsupervised Person Re-Identification#DukeMTMC-reID#Rank-10
Nested Mention Recognition#ACE 2005#F1
Domain Adaptation#SVHN-to-MNIST#Accuracy
Object Detection#COCO minival#box AP
Action Recognition#EPIC-KITCHENS-100#GFLOPs
Music Transcription#MusicNet#APS
Semi-Supervised Image Classification#CIFAR-10, 4000 Labels#Accuracy
Hate Speech Detection#Ethos MultiLabel#Hamming Loss
Action Classification#Kinetics-600#GFLOPs
Semi-Supervised Semantic Segmentation#Cityscapes 25% labeled#Validation mIoU
Face Alignment#300W#Fullset (public)
unknown