omar-araboghli committed on
Commit
733949b
1 Parent(s): 87a6813

adding gold labels

Files changed (1)
  1. labels.tsv +1851 -0
labels.tsv ADDED
@@ -0,0 +1,1851 @@
1
+ Single Image Deraining#Rain100H#PSNR
2
+ Question Answering#YahooCQA#P@1
3
+ Atari Games#Atari 2600 Private Eye#Score
4
+ Speech Recognition#MediaSpeech#WER for Turkish
5
+ 3D Point Cloud Classification#ModelNet40#Mean Accuracy
6
+ Image Clustering#STL-10#Train Split
7
+ Time Series Classification#WalkvsRun#NLL
8
+ language_modeling#Text8#Number of params
9
+ Cross-Lingual Document Classification#MLDoc Zero-Shot English-to-Chinese#Accuracy
10
+ Weakly-supervised 3D Human Pose Estimation#Human3.6M#3D Annotations
11
+ Semi-Supervised Video Object Segmentation#DAVIS 2017 (test-dev)#Jaccard (Decay)
12
+ Image-to-Image Translation#Cityscapes Labels-to-Photo#FID
13
+ Neural Architecture Search#ImageNet#Accuracy
14
+ Human Pose Forecasting#Human3.6M#MAR, walking, 400ms
15
+ Face Detection#WIDER Face (Medium)#AP
16
+ Incremental Learning#CIFAR-100 - 50 classes + 10 steps of 5 classes#Average Incremental Accuracy
17
+ Heterogeneous Node Classification#DBLP (PACT) 14k#Macro-F1 (60% training data)
18
+ Text Simplification#PWKP / WikiSmall#SARI
19
+ Network Pruning#ImageNet#Accuracy
20
+ Line Segment Detection#York Urban Dataset#sAP10
21
+ Visual Dialog#VisDial v0.9 val#R@10
22
+ Link Prediction#WN18RR#MR
23
+ Stereo-LiDAR Fusion#KITTI Depth Completion Validation#RMSE
24
+ Question Answering#WikiHop#Test
25
+ Colorectal Gland Segmentation:#CRAG#Dice
26
+ Image Super-Resolution#Set14 - 4x upscaling#MOS
27
+ Semantic Segmentation#NYU Depth v2#Mean IoU
28
+ Fine-Grained Image Classification#DF20 - Mini#F1 - macro
29
+ Node Classification#Squirrel#Accuracy
30
+ Recommendation Systems#Netflix#Recall@50
31
+ 6D Pose Estimation using RGB#LineMOD#Mean ADD
32
+ Unsupervised Machine Translation#WMT2016 German-English#BLEU
33
+ Video Retrieval#LSMDC#text-to-video R@5
34
+ Video Retrieval#LSMDC#text-to-video R@1
35
+ Semantic Segmentation#S3DIS#oAcc
36
+ Recommendation Systems#Netflix#Recall@20
37
+ Image Classification#ImageNet ReaL#Params
38
+ Natural Language Inference#SNLI#Parameters
39
+ Lesion Segmentation#Anatomical Tracings of Lesions After Stroke (ATLAS)#Precision
40
+ language_modeling#WikiText-2#Validation perplexity
41
+ Lipreading#LRS2#Word Error Rate (WER)
42
+ JPEG Artifact Correction#Live1 (Quality 10 Grayscale)#PSNR
43
+ Word Sense Disambiguation#WiC-TSV#Task 3 Accuracy: general purpose
44
+ Few-Shot Image Classification#Mini-ImageNet - 1-Shot Learning#Accuracy
45
+ Image Super-Resolution#Set14 - 3x upscaling#SSIM
46
+ Link Prediction#MovieLens 25M#Hits@10
47
+ Supervised Video Summarization#SumMe#F1-score (Canonical)
48
+ Fine-Grained Image Classification#Oxford 102 Flowers#Accuracy
49
+ Panoptic Segmentation#COCO panoptic#PQ
50
+ summarization#CNN / Daily Mail (Anonymized version)#METEOR
51
+ Link Prediction#Citeseer#AUC
52
+ Action Recognition#EPIC-KITCHENS-100#Action@1
53
+ Face Detection#Annotated Faces in the Wild#AP
54
+ Multimodal Machine Translation#Multi30K#Meteor (EN-DE)
55
+ Image-to-Image Translation#Cityscapes Labels-to-Photo#mIoU
56
+ Image Retrieval#Flickr30K 1K test#R@5
57
+ Image Retrieval#Flickr30K 1K test#R@1
58
+ Semi-Supervised Video Object Segmentation#DAVIS 2016#F-measure (Mean)
59
+ Pedestrian Detection#CityPersons#Heavy MR^-2
60
+ Data-to-Text Generation#E2E NLG Challenge#METEOR
61
+ Atari Games#Atari 2600 Skiing#Score
62
+ Deblurring#RealBlur-R (trained on GoPro)#PSNR (sRGB)
63
+ Semantic Retrieval#Contract Discovery#Soft-F1
64
+ Action Recognition#NTU RGB+D 120#Accuracy (Cross-Subject)
65
+ Language Modelling#WikiText-103#Number of params
66
+ Action Segmentation#50 Salads#F1@25%
67
+ Paraphrase Identification#Quora Question Pairs#Accuracy
68
+ Semi-Supervised Semantic Segmentation#Cityscapes 100 samples labeled#Validation mIoU
69
+ Image Generation#CelebA 64x64#FID
70
+ Time Series Classification#Libras#Accuracy
71
+ Weakly-supervised 3D Human Pose Estimation#Human3.6M#Number of Frames Per View
72
+ Robotic Grasping#Cornell Grasp Dataset#5 fold cross validation
73
+ Referring Expression Segmentation#RefCOCO testB#IoU
74
+ JPEG Artifact Correction#LIVE1 (Quality 20 Grayscale)#PSNR-B
75
+ Visual Navigation#Cooperative Vision-and-Dialogue Navigation#spl
76
+ Skeleton Based Action Recognition#Kinetics-Skeleton dataset#Accuracy
77
+ Semi-Supervised Video Object Segmentation#DAVIS 2017 (test-dev)#F-measure (Mean)
78
+ 3D Human Pose Estimation#3DPW#MPVPE
79
+ Action Recognition#Something-Something V1#Top 5 Accuracy
80
+ language_modeling#Text8#Bit per Character (BPC)
81
+ Image Generation#LSUN Bedroom 256 x 256#FID
82
+ Deblurring#RealBlur-J (trained on GoPro)#SSIM (sRGB)
83
+ Skeleton Based Action Recognition#NTU RGB+D#Accuracy (CS)
84
+ relation_prediction#FB15K-237#H@1
85
+ Video Captioning#YouCook2#METEOR
86
+ Semantic Textual Similarity#STS Benchmark#Pearson Correlation
87
+ Speech Recognition#LibriSpeech test-clean#Word Error Rate (WER)
88
+ Video Retrieval#MSR-VTT#text-to-video R@10
89
+ Knowledge Graph Completion#FB15k-237#Hits@10
90
+ Graph Regression#ZINC 100k#MAE
91
+ Open-Domain Question Answering#SearchQA#Unigram Acc
92
+ Chinese Named Entity Recognition#OntoNotes 4#F1
93
+ Scene Text Detection#Total-Text#F-Measure
94
+ Atari Games#Atari 2600 James Bond#Score
95
+ Time Series Classification#CMUsubject16#NLL
96
+ Skeleton Based Action Recognition#Varying-view RGB-D Action-Skeleton#Accuracy (CV I)
97
+ Text-to-Image Generation#Multi-Modal-CelebA-HQ#LPIPS
98
+ Graph Classification#IMDb-M#Accuracy
99
+ Skeleton Based Action Recognition#NTU RGB+D#Accuracy (CV)
100
+ Neural Architecture Search#CIFAR-10 Image Classification#Params
101
+ Nested Mention Recognition#ACE 2004#F1
102
+ JPEG Artifact Correction#LIVE1 (Quality 20 Color)#SSIM
103
+ Entity Linking#WiC-TSV#Task 1 Accuracy: all
104
+ Semi-Supervised Video Object Segmentation#DAVIS 2017 (test-dev)#F-measure (Recall)
105
+ Few-Shot Image Classification#CIFAR-FS 5-way (1-shot)#Accuracy
106
+ Deblurring#RealBlur-R (trained on GoPro)#SSIM (sRGB)
107
+ Action Recognition#Something-Something V2#GFLOPs
108
+ Unsupervised Video Object Segmentation#DAVIS 2016#Jaccard (Recall)
109
+ Conversational Response Selection#Ubuntu Dialogue (v1, Ranking)#R2@1
110
+ Music Source Separation#MUSDB18#SDR (bass)
111
+ Language Modelling#Penn Treebank (Word Level)#Params
112
+ Object Detection#PASCAL VOC 2007#MAP
113
+ Common Sense Reasoning#CommonsenseQA#Accuracy
114
+ JPEG Artifact Correction#ICB (Quality 20 Color)#SSIM
115
+ Person Re-Identification#CUHK03 detected#Rank-1
116
+ Image Generation#ImageNet 128x128#FID
117
+ Image Retrieval with Multi-Modal Query#Fashion200k#Recall@1
118
+ Dependency Parsing#Penn Treebank#LAS
119
+ Time Series Classification#AUSLAN#NLL
120
+ Language Modelling#Hutter Prize#Number of params
121
+ Hand Pose Estimation#NYU Hands#Average 3D Error
122
+ Conversational Response Selection#Ubuntu Dialogue (v1, Ranking)#R10@5
123
+ dependency_parsing#Penn Treebank#UAS
124
+ Visual Dialog#VisDial v0.9 val#Mean Rank
125
+ Conversational Response Selection#Ubuntu Dialogue (v1, Ranking)#R10@1
126
+ Conversational Response Selection#Ubuntu Dialogue (v1, Ranking)#R10@2
127
+ Semi-Supervised Video Object Segmentation#DAVIS 2016#F-measure (Decay)
128
+ Person Re-Identification#CUHK03#MAP
129
+ Retinal Vessel Segmentation#CHASE_DB1#F1 score
130
+ Grayscale Image Denoising#Urban100 sigma25#PSNR
131
+ Image-to-Image Translation#Cityscapes Labels-to-Photo#Class IOU
132
+ Action Recognition#Something-Something V2#Parameters
133
+ Question Answering#Natural Questions (short)#F1
134
+ Multivariate Time Series Forecasting#MIMIC-III#NegLL
135
+ Brain Tumor Segmentation#BRATS-2015#Dice Score
136
+ Paraphrase Identification#Quora Question Pairs#F1
137
+ Image Super-Resolution#BSD100 - 3x upscaling#PSNR
138
+ RGB-D Salient Object Detection#STERE#max E-Measure
139
+ language_modeling#Penn Treebank#Validation perplexity
140
+ Click-Through Rate Prediction#Criteo#Log Loss
141
+ Action Recognition#ActivityNet#mAP
142
+ Domain Generalization#ImageNet-R#Top-1 Error Rate
143
+ Domain Adaptation#USPS-to-MNIST#Accuracy
144
+ Atari Games#Atari 2600 Crazy Climber#Score
145
+ Heterogeneous Node Classification#DBLP (PACT) 14k#Macro-F1 (80% training data)
146
+ Open-Domain Question Answering#Quasar#EM (Quasar-T)
147
+ Question Answering#bAbi#Mean Error Rate
148
+ Keypoint Detection#COCO test-challenge#AR
149
+ Continuous Control#PyBullet Ant#Return
150
+ Semi-Supervised Video Object Segmentation#DAVIS 2017 (val)#J&F
151
+ Keypoint Detection#COCO test-challenge#AP
152
+ Text Classification#TREC-6#Error
153
+ Text Classification#Yelp-5#Accuracy
154
+ Atari Games#Atari 2600 Ms. Pacman#Score
155
+ Text Classification#AG News#Error
156
+ Named Entity Recognition#SciERC#F1
157
+ Image Classification#Kuzushiji-MNIST#Accuracy
158
+ Action Recognition#HACS#Top 5 Accuracy
159
+ Few-Shot Image Classification#Stanford Cars 5-way (5-shot)#Accuracy
160
+ Time Series Classification#CharacterTrajectories#Accuracy
161
+ Coreference Resolution#CoNLL 2012#Avg F1
162
+ JPEG Artifact Correction#Classic5 (Quality 10 Grayscale)#PSNR
163
+ Sentiment Analysis#Multi-Domain Sentiment Dataset#DVD
164
+ Text based Person Retrieval#CUHK-PEDES#R@1
165
+ Multi-Person Pose Estimation#COCO#Validation AP
166
+ Text based Person Retrieval#CUHK-PEDES#R@5
167
+ Language Modelling#WikiText-103#Validation perplexity
168
+ Image-to-Image Translation#ADE20K Labels-to-Photos#Accuracy
169
+ Recommendation Systems#Million Song Dataset#nDCG@100
170
+ Semi-Supervised Video Object Segmentation#DAVIS 2017 (val)#F-measure (Recall)
171
+ Instance Segmentation#COCO test-dev#mask AP
172
+ Extractive Text Summarization#CNN / Daily Mail#ROUGE-1
173
+ Action Classification#Kinetics-600#Top-5 Accuracy
174
+ Text-to-Image Generation#Multi-Modal-CelebA-HQ#Real
175
+ Action Segmentation#GTEA#Acc
176
+ Self-Supervised Action Recognition#UCF101#3-fold Accuracy
177
+ Extractive Text Summarization#CNN / Daily Mail#ROUGE-2
178
+ 3D Object Detection#KITTI Cyclists Easy#AP
179
+ Image Generation#STL-10#Inception score
180
+ Extractive Text Summarization#CNN / Daily Mail#ROUGE-L
181
+ Visual Dialog#VisDial v0.9 val#R@5
182
+ Visual Dialog#VisDial v0.9 val#R@1
183
+ JPEG Artifact Correction#LIVE1 (Quality 20 Grayscale)#SSIM
184
+ Text Summarization#DUC 2004 Task 1#ROUGE-1
185
+ Text Summarization#DUC 2004 Task 1#ROUGE-2
186
+ Grayscale Image Denoising#Urban100 sigma15#PSNR
187
+ Dense Pixel Correspondence Estimation#HPatches#Viewpoint III AEPE
188
+ 3D Part Segmentation#ShapeNet-Part#Class Average IoU
189
+ Text Summarization#DUC 2004 Task 1#ROUGE-L
190
+ Gesture-to-Gesture Translation#NTU Hand Digit#AMT
191
+ RGB-D Salient Object Detection#SIP#Average MAE
192
+ Nested Named Entity Recognition#ACE 2005#F1
193
+ Grayscale Image Denoising#BSD68 sigma25#PSNR
194
+ Question Answering#FQuAD#F1
195
+ Question Answering#FQuAD#EM
196
+ Atari Games#Atari 2600 Pong#Score
197
+ Skeleton Based Action Recognition#Varying-view RGB-D Action-Skeleton#Accuracy (AV II)
198
+ Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#MS-SSIM
199
+ Semi-Supervised Video Object Segmentation#DAVIS 2017 (val)#Jaccard (Mean)
200
+ Photo geolocation estimation#Im2GPS#Region level (200 km)
201
+ Skeleton Based Action Recognition#Varying-view RGB-D Action-Skeleton#Accuracy (CS)
202
+ Single Image Deraining#Test1200#SSIM
203
+ Chinese Named Entity Recognition#MSRA#F1
204
+ Text-to-Image Generation#Multi-Modal-CelebA-HQ#FID
205
+ Neural Architecture Search#NAS-Bench-201, ImageNet-16-120#Accuracy (val)
206
+ Depth Completion#KITTI Depth Completion#MAE
207
+ Few-Shot Image Classification#Mini-Imagenet 20-way (5-shot)#Accuracy
208
+ Person Re-Identification#Market-1501#MAP
209
+ Recommendation Systems#MovieLens 10M#RMSE
210
+ Action Classification#Kinetics-400#Vid acc@1
211
+ Semantic Segmentation#S3DIS Area5#mIoU
212
+ Action Classification#Kinetics-400#Vid acc@5
213
+ Image Super-Resolution#Set14 - 8x upscaling#SSIM
214
+ Anomaly Detection#One-class CIFAR-10#AUROC
215
+ Image Retrieval#CUB-200-2011#R@1
216
+ Node Classification#Cora#Validation
217
+ Time Series Classification#DigitShapes#NLL
218
+ Image Generation#CelebA-HQ 128x128#FID
219
+ Atari Games#Atari 2600 Breakout#Score
220
+ Action Segmentation#50 Salads#Acc
221
+ Self-Supervised Action Recognition#HMDB51 (finetuned)#Top-1 Accuracy
222
+ Emotion Recognition in Conversation#EmoryNLP#Weighted Macro-F1
223
+ Language Modelling#enwik8#Number of params
224
+ Node Classification#Brazil Air-Traffic#Accuracy
225
+ Music Source Separation#MUSDB18#SDR (other)
226
+ Unsupervised Video Object Segmentation#DAVIS 2016#F-measure (Recall)
227
+ Person Search#PRW#mAP
228
+ Sentiment Analysis#Amazon Review Polarity#Accuracy
229
+ Deblurring#GoPro#PSNR
230
+ Named Entity Recognition#JNLPBA#F1
231
+ Object Detection#CrowdHuman (full body)#mMR
232
+ Question Answering#CoQA#In-domain
233
+ Action Segmentation#50 Salads#F1@50%
234
+ Panoptic Segmentation#Cityscapes val#AP
235
+ Image-to-Image Translation#SYNTHIA-to-Cityscapes#mIoU (13 classes)
236
+ Keypoint Detection#COCO#Test AP
237
+ Photo geolocation estimation#Im2GPS#City level (25 km)
238
+ Fine-Grained Image Classification#Stanford Cars#Accuracy
239
+ Trajectory Prediction#ETH/UCY#ADE-8/12
240
+ question_answering#SearchQA#N-gram F1
241
+ Single Image Deraining#Test2800#SSIM
242
+ Breast Tumour Classification#PCam#AUC
243
+ Real-Time Semantic Segmentation#Cityscapes test#Frame (fps)
244
+ Person Re-Identification#MSMT17#Rank-1
245
+ JPEG Artifact Correction#ICB (Quality 10 Color)#PSNR
246
+ Unsupervised MNIST#MNIST#Accuracy
247
+ Vision and Language Navigation#VLN Challenge#success
248
+ 3D Object Detection#KITTI Cars Moderate#AP
249
+ Sentiment Analysis#TweetEval#Emoji
250
+ Object Detection#iSAID#Average Precision
251
+ language_modeling#WikiText-2#Test perplexity
252
+ Image Super-Resolution#Urban100 - 3x upscaling#PSNR
253
+ Panoptic Segmentation#COCO test-dev#PQ
254
+ 3D Instance Segmentation#S3DIS#mPrec
255
+ Atari Games#Atari-57#Medium Human-Normalized Score
256
+ Image Classification#Tiered ImageNet 5-way (5-shot)#Accuracy
257
+ Multi-Person Pose Estimation#MPII Multi-Person#AP
258
+ Atari Games#Atari 2600 Asteroids#Score
259
+ Instance Segmentation#COCO test-dev#AP75
260
+ Action Classification#AViD#Accuracy
261
+ Face Alignment#WFLW#ME (%, all)
262
+ Monocular 3D Human Pose Estimation#Human3.6M#Need Ground Truth 2D Pose
263
+ Denoising#Darmstadt Noise Dataset#PSNR
264
+ Atari Games#Atari 2600 Assault#Score
265
+ Atari Games#Atari 2600 Time Pilot#Score
266
+ Hand Pose Estimation#ICVL Hands#Average 3D Error
267
+ Atari Games#Atari 2600 Robotank#Score
268
+ Pose Estimation#COCO test-dev#APL
269
+ Pose Estimation#COCO test-dev#APM
270
+ Temporal Action Localization#ActivityNet-1.3#mAP IOU@0.95
271
+ Node Classification#Reddit#Accuracy
272
+ Face Verification#IJB-A#TAR @ FAR=0.01
273
+ Pose Transfer#Deep-Fashion#IS
274
+ Atari Games#Atari 2600 Gopher#Score
275
+ Natural Language Inference#WNLI#Accuracy
276
+ Visual Question Answering#GQA Test2019#Binary
277
+ Hand Pose Estimation#MSRA Hands#Average 3D Error
278
+ Heterogeneous Node Classification#DBLP (PACT) 14k#Micro-F1 (80% training data)
279
+ Image Matting#Composition-1K#MSE
280
+ named_entity_recognition#CoNLL 2003 (English)#F1
281
+ Node Classification#Europe Air-Traffic#Accuracy
282
+ Temporal Action Localization#ActivityNet-1.3#mAP IOU@0.75
283
+ Atari Games#Atari 2600 Montezuma's Revenge#Score
284
+ Unsupervised Video Object Segmentation#DAVIS 2016#F-measure (Decay)
285
+ Real-Time Semantic Segmentation#CamVid#mIoU
286
+ Semantic Segmentation#CamVid#Mean IoU
287
+ Instance Segmentation#COCO test-dev#AP50
288
+ Question Answering#OpenBookQA#Accuracy
289
+ Speech Recognition#LibriSpeech test-other#Word Error Rate (WER)
290
+ Link Prediction#WN18RR#Hits@3
291
+ Panoptic Segmentation#Cityscapes val#PQ
292
+ Link Prediction#WN18RR#Hits@1
293
+ Click-Through Rate Prediction#Company*#Log Loss
294
+ Video Retrieval#MSR-VTT#text-to-video Median Rank
295
+ Nested Named Entity Recognition#ACE 2004#F1
296
+ Color Image Denoising#Darmstadt Noise Dataset#PSNR (sRGB)
297
+ Deblurring#HIDE (trained on GOPRO)#PSNR (sRGB)
298
+ Image Generation#FFHQ#FID
299
+ Video Captioning#YouCook2#CIDEr
300
+ Session-Based Recommendations#Diginetica#MRR@20
301
+ Optical Flow Estimation#Sintel-final#Average End-Point Error
302
+ Skeleton Based Action Recognition#J-HMDB#Accuracy (RGB+pose)
303
+ Action Classification#Kinetics-400#Clip acc@5
304
+ Action Classification#Kinetics-400#Clip acc@1
305
+ RGB-D Salient Object Detection#NLPR#max E-Measure
306
+ 3D Object Detection#KITTI Cyclists Hard#AP
307
+ Multi-Frame Super-Resolution#PROBA-V#Normalized cPSNR
308
+ Recommendation Systems#Flixster Monti#RMSE
309
+ Semi-Supervised Video Object Segmentation#DAVIS 2016#Jaccard (Mean)
310
+ Image-to-Image Translation#COCO-Stuff Labels-to-Photos#Accuracy
311
+ Visual Question Answering#CLEVR#Accuracy
312
+ Egocentric Activity Recognition#EPIC-KITCHENS-55#Actions Top-1 (S2)
313
+ Self-Supervised Image Classification#ImageNet#Top 1 Accuracy
314
+ Click-Through Rate Prediction#Avazu#AUC
315
+ Few-Shot Image Classification#Meta-Dataset Rank#Mean Rank
316
+ Natural Language Inference#RTE#Accuracy
317
+ Time Series Classification#ECG#NLL
318
+ Image Relighting#VIDIT’20 validation set#Runtime(s)
319
+ Domain Adaptation#Office-Home#Accuracy
320
+ Click-Through Rate Prediction#Bing News#AUC
321
+ Domain Generalization#PACS#Average Accuracy
322
+ Image Super-Resolution#Set5 - 3x upscaling#PSNR
323
+ Multivariate Time Series Imputation#MuJoCo#MSE (10^2, 50% missing)
324
+ Color Image Denoising#Darmstadt Noise Dataset#SSIM (sRGB)
325
+ Scene Text Detection#ICDAR 2017 MLT#F-Measure
326
+ Image Clustering#STL-10#Accuracy
327
+ Few-Shot Image Classification#Tiered ImageNet 5-way (5-shot)#Accuracy
328
+ Emotion Recognition in Conversation#EC#Micro-F1
329
+ Video Alignment#UPenn Action#Kendall's Tau
330
+ Weakly Supervised Action Localization#ActivityNet-1.2#mAP@0.5
331
+ Keypoint Detection#MPII Multi-Person#mAP@0.5
332
+ Video Captioning#YouCook2#ROUGE-L
333
+ Link Prediction#WordNet#Accuracy
334
+ Image Classification#CIFAR-10#Percentage correct
335
+ Single Image Deraining#Test100#SSIM
336
+ Lesion Segmentation#Anatomical Tracings of Lesions After Stroke (ATLAS)#IoU
337
+ Reading Comprehension#RACE#Accuracy (High)
338
+ Object Detection#CrowdHuman (full body)#AP
339
+ Text-to-Image Generation#COCO#FID
340
+ Image Super-Resolution#FFHQ 1024 x 1024 - 4x upscaling#PSNR
341
+ Anomaly Detection#MVTec AD#Detection AUROC
342
+ Node Classification#Pubmed Full-supervised#Accuracy
343
+ Referring Expression Segmentation#RefCoCo val#IoU
344
+ Birds Eye View Object Detection#KITTI Cyclists Moderate#AP
345
+ Hand Pose Estimation#HANDS 2017#Average 3D Error
346
+ Grammatical Error Detection#CoNLL-2014 A2#F0.5
347
+ Image Super-Resolution#Set14 - 4x upscaling#SSIM
348
+ Continuous Control#PyBullet Hopper#Return
349
+ Aspect-Based Sentiment Analysis#SemEval 2014 Task 4 Subtask 1+2#F1
350
+ constituency_parsing#Penn Treebank#F1
351
+ Image Relighting#VIDIT’20 validation set#SSIM
352
+ Object Counting#CARPK#MAE
353
+ Atari Games#Atari 2600 Beam Rider#Score
354
+ Metric Learning#CUB-200-2011#R@1
355
+ Image Generation#LSUN Bedroom 256 x 256#FID-10k-training-steps
356
+ language_modeling#Hutter Prize#Bit per Character (BPC)
357
+ Fact-based Text Editing#WebEdit#Exact Match
358
+ Few-Shot Image Classification#CUB 200 5-way 5-shot#Accuracy
359
+ Video Retrieval#MSVD#text-to-video Median Rank
360
+ Visual Navigation#Cooperative Vision-and-Dialogue Navigation#dist_to_end_reduction
361
+ Domain Adaptation#ImageCLEF-DA#Accuracy
362
+ Fine-Grained Image Classification#DF20 - Mini#Top-1
363
+ Fine-Grained Image Classification#DF20 - Mini#Top-3
364
+ Part-Of-Speech Tagging#Penn Treebank#Accuracy
365
+ Action Spotting#SoccerNet#Average-mAP
366
+ Semi-Supervised Video Object Segmentation#YouTube-VOS#Jaccard (Unseen)
367
+ Semi-Supervised Video Object Segmentation#DAVIS 2017 (test-dev)#J&F
368
+ Face Detection#PASCAL Face#AP
369
+ Atari Games#Atari 2600 Pitfall!#Score
370
+ Image Super-Resolution#Set5 - 4x upscaling#MOS
371
+ Human Pose Forecasting#Human3.6M#MAR, walking, 1,000ms
372
+ Image Clustering#Extended Yale-B#NMI
373
+ Person Re-Identification#DukeMTMC-reID#Rank-10
374
+ Click-Through Rate Prediction#Company*#AUC
375
+ Link Prediction#YAGO3-10#MRR
376
+ Image-to-Image Translation#ADE20K Labels-to-Photos#mIoU
377
+ Text Simplification#ASSET#SARI (EASSE>=0.2.1)
378
+ word_segmentation#PKU#F1
379
+ Dense Pixel Correspondence Estimation#HPatches#Viewpoint IV AEPE
380
+ Human-Object Interaction Detection#HICO-DET#mAP
381
+ Constituency Grammar Induction#PTB#Mean F1 (WSJ)
382
+ Spoken language identification#LRE07#Average
383
+ word_sense_disambiguation#Senseval 2#F1
384
+ Node Classification#Cora Full-supervised#Accuracy
385
+ RGB Salient Object Detection#DUTS-TE#F-measure
386
+ Video Captioning#YouCook2#BLEU-4
387
+ Atari Games#Atari 2600 Zaxxon#Score
388
+ Image Classification#CINIC-10#Accuracy
389
+ Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#NIQE
390
+ Image Classification#WebVision-1000#Top-5 Accuracy
391
+ Time Series Classification#UWave#NLL
392
+ Data-to-Text Generation#E2E NLG Challenge#NIST
393
+ Semantic Segmentation#S3DIS Area5#oAcc
394
+ Monocular Depth Estimation#KITTI Eigen split unsupervised#absolute relative error
395
+ Reading Comprehension#ReClor#Test
396
+ Anomaly Detection#MVTec AD#Segmentation AUROC
397
+ Deblurring#HIDE (trained on GOPRO)#SSIM (sRGB)
398
+ Link Prediction#OpenBioLink#Hits@1
399
+ Text Classification#IMDb#Accuracy (10 classes)
400
+ Link Prediction#OpenBioLink#Hits@3
401
+ Pose Tracking#PoseTrack2017#mAP
402
+ Node Classification#Cora with Public Split: fixed 20 nodes per class#Accuracy
403
+ sentiment_analysis#SemEval-2014 Task 4 subtask 2 Aspect Term Polarity#Restaurant (acc)
404
+ Text-to-Image Generation#COCO#Inception score
405
+ Causal Inference#IDHP#Average Treatment Effect Error
406
+ 3D Part Segmentation#ShapeNet-Part#Instance Average IoU
407
+ Heterogeneous Node Classification#DBLP (PACT) 14k#Macro-F1 (20% training data)
408
+ Face Detection#FDDB#AP
409
+ Fine-Grained Image Classification#Oxford 102 Flowers#PARAMS
410
+ Natural Language Inference#MultiNLI#Mismatched
411
+ Curved Text Detection#SCUT-CTW1500#F-Measure
412
+ Photo geolocation estimation#Im2GPS#Street level (1 km)
413
+ Keypoint Detection#COCO#Validation AP
414
+ Fake News Detection#FNC-1#Per-class Accuracy (Discuss)
415
+ Cross-Modal Retrieval#Flickr30k#Text-to-image R@5
416
+ Cross-Modal Retrieval#Flickr30k#Text-to-image R@1
417
+ Domain Adaptation#SYNTHIA-to-Cityscapes#mIoU
418
+ Image Generation#LSUN Churches 256 x 256#FID
419
+ Visual Object Tracking#TrackingNet#Normalized Precision
420
+ JPEG Artifact Correction#LIVE1 (Quality 10 Color)#PSNR-B
421
+ AMR Parsing#LDC2017T10#Smatch
422
+ Time Series Classification#Shapes#NLL
423
+ Machine Translation#WMT2016 Romanian-English#BLEU score
424
+ Ad-Hoc Information Retrieval#TREC Robust04#P@20
425
+ Named Entity Recognition#CoNLL 2003 (English)#F1
426
+ Time Series Classification#PenDigits#Accuracy
427
+ JPEG Artifact Correction#LIVE1 (Quality 20 Color)#PSNR-B
428
+ Real-Time Semantic Segmentation#Cityscapes test#mIoU
429
+ Monocular 3D Human Pose Estimation#Human3.6M#Frames Needed
430
+ Question Answering#DROP Test#F1
431
+ Few-Shot Image Classification#Mini-Imagenet 10-way (1-shot)#Accuracy
432
+ Action Recognition#HACS#Top 1 Accuracy
433
+ language_modeling#WikiText-103#Validation perplexity
434
+ Intent Detection#ATIS#Accuracy
435
+ Scene Text Detection#SCUT-CTW1500#Recall
436
+ Image Super-Resolution#Set14 - 2x upscaling#SSIM
437
+ Node Classification#CiteSeer (1%)#Accuracy
438
+ 3D Human Pose Estimation#Total Capture#Average MPJPE (mm)
439
+ Automated Theorem Proving#HolStep (Conditional)#Classification Accuracy
440
+ Audio Classification#AudioSet#Test mAP
441
+ Fact-based Text Editing#WebEdit#SARI
442
+ Natural Language Inference#QNLI#Accuracy
443
+ Document Image Classification#RVL-CDIP#Accuracy
444
+ Natural Language Inference#ANLI test#A2
445
+ Natural Language Inference#ANLI test#A1
446
+ Natural Language Inference#ANLI test#A3
447
+ Question Answering#Quasart-T#EM
448
+ Image Super-Resolution#Manga109 - 3x upscaling#PSNR
449
+ Word Sense Disambiguation#SemEval 2013 Task 12#F1
450
+ Semantic Textual Similarity#MRPC#F1
451
+ Object Counting#CARPK#RMSE
452
+ Image Matting#Composition-1K#Conn
453
+ Self-Supervised Action Recognition#UCF101 (finetuned)#3-fold Accuracy
454
+ Multimodal Activity Recognition#Moments in Time Dataset#Top-1 (%)
455
+ 3D Semantic Instance Segmentation#ScanNetV2#mAP@0.50
456
+ Video Super-Resolution#Vid4 - 4x upscaling#PSNR
457
+ relation_prediction#WN18RR#H@1
458
+ Cross-View Image-to-Image Translation#Dayton (256×256) - aerial-to-ground#SSIM
459
+ Language Modelling#enwik8#Bit per Character (BPC)
460
+ Hyperspectral Image Classification#Indian Pines#Overall Accuracy
461
+ Language Modelling#One Billion Word#PPL
462
+ Chinese Named Entity Recognition#Weibo NER#F1
463
+ RGB-D Salient Object Detection#SIP#max E-Measure
464
+ Question Answering#SQuAD1.1#F1
465
+ Question Answering#SQuAD1.1#EM
466
+ Question Answering#NarrativeQA#Rouge-L
467
+ Person Re-Identification#PRID2011#Rank-5
468
+ Person Re-Identification#PRID2011#Rank-1
469
+ Language Modelling#One Billion Word#Number of params
470
+ Image Classification#Clothing1M#Accuracy
471
+ JPEG Artifact Correction#ICB (Quality 20 Grayscale)#PSNR
472
+ Node Classification#BlogCatalog#Macro-F1
473
+ Image Classification#iNaturalist 2018#Top-1 Accuracy
474
+ RGB-D Salient Object Detection#DES#S-Measure
475
+ Fake News Detection#FNC-1#Per-class Accuracy (Unrelated)
476
+ Text Classification#DBpedia#Error
477
+ Word Sense Disambiguation#SensEval 2#F1
478
+ Link Prediction#Pubmed#AUC
479
+ Image Denoising#DND#SSIM (sRGB)
480
+ Video Retrieval#MSR-VTT-1kA#text-to-video Median Rank
481
+ Image Clustering#CIFAR-10#NMI
482
+ Scene Text Detection#ICDAR 2013#Precision
483
+ summarization#Gigaword#ROUGE-1
484
+ Atari Games#Atari 2600 Ice Hockey#Score
485
+ summarization#Gigaword#ROUGE-2
486
+ Entity Linking#WiC-TSV#Task 1 Accuracy: domain specific
487
+ summarization#Gigaword#ROUGE-L
488
+ Image Relighting#VIDIT’20 validation set#PSNR
489
+ Point Cloud Registration#3DMatch Benchmark#Recall
490
+ Machine Translation#IWSLT2015 English-Vietnamese#BLEU
491
+ Lesion Segmentation#ISIC 2018#Dice Score
492
+ Atari Games#Atari 2600 Freeway#Score
493
+ Action Recognition#AVA v2.1#mAP (Val)
494
+ Grayscale Image Denoising#Set12 sigma50#PSNR
495
+ 3D Object Detection#nuScenes#NDS
496
+ Dialogue State Tracking#Wizard-of-Oz#Joint
497
+ Sentiment Analysis#Multi-Domain Sentiment Dataset#Books
498
+ Image Clustering#ImageNet-10#Accuracy
499
+ Semantic Segmentation#Semantic3D#mIoU
500
+ Image Clustering#Tiny-ImageNet#NMI
501
+ Image Relighting#VIDIT’20 validation set#MPS
502
+ Object Counting#Pascal VOC 2007 count-test#mRMSE
503
+ JPEG Artifact Correction#ICB (Quality 10 Grayscale)#SSIM
504
+ Crowd Counting#ShanghaiTech B#MAE
505
+ Human-Object Interaction Detection#V-COCO#Time Per Frame(ms)
506
+ Gesture-to-Gesture Translation#Senz3D#AMT
507
+ 3D Human Pose Estimation#3D Poses in the Wild Challenge#MPJPE
508
+ Keypoint Detection#COCO test-dev#AR
509
+ Image Retrieval#Par6k#mAP
510
+ Action Recognition#Something-Something V2#Top-1 Accuracy
511
+ Graph Regression#PCQM4M-LSC#Test MAE
512
+ Graph Classification#PTC#Accuracy
513
+ Visual Question Answering#VQA v2 test-dev#Accuracy
514
+ Anomaly Detection#Numenta Anomaly Benchmark#NAB score
515
+ Semantic Segmentation#S3DIS#Mean IoU
516
+ Sentiment Analysis#CR#Accuracy
517
+ Image Classification#CIFAR-10#PARAMS
518
+ Open-Domain Question Answering#SearchQA#EM
519
+ Fine-Grained Image Classification#FGVC Aircraft#Accuracy
520
+ Visual Object Tracking#TrackingNet#Precision
521
+ Music Source Separation#MUSDB18#SDR (vocals)
522
+ Text Summarization#Pubmed#ROUGE-L
523
+ Link Prediction#Citeseer#AP
524
+ Drug Discovery#QM9#Error ratio
525
+ Text Summarization#Pubmed#ROUGE-1
526
+ Text Summarization#Pubmed#ROUGE-2
527
+ Visual Object Tracking#GOT-10k#Average Overlap
528
+ Semi-Supervised Video Object Segmentation#DAVIS 2017 (val)#F-measure (Mean)
529
+ Pedestrian Detection#CityPersons#Partial MR^-2
530
+ Visual Object Tracking#TrackingNet#Accuracy
531
+ Multi-Person Pose Estimation#COCO#AP
532
+ Atari Games#Atari 2600 Asterix#Score
533
+ Image Classification#CIFAR-100#PARAMS
534
+ Few-Shot Image Classification#Mini-Imagenet 20-way (1-shot)#Accuracy
535
+ Cross-Lingual NER#CoNLL German#F1
536
+ RGB-D Salient Object Detection#STERE#S-Measure
537
+ Image Super-Resolution#Manga109 - 3x upscaling#SSIM
538
+ Temporal Action Localization#ActivityNet-1.3#mAP
539
+ Link Prediction#FB15k-237#Hits@10
540
+ 3D Human Pose Estimation#HumanEva-I#Mean Reconstruction Error (mm)
541
+ Atari Games#Atari 2600 Enduro#Score
542
+ Photo geolocation estimation#Im2GPS#Country level (750 km)
543
+ Scene Graph Generation#Visual Genome#Recall@50
544
+ Panoptic Segmentation#Mapillary val#PQ
545
+ 3D Instance Segmentation#ScanNet(v2)#Mean AP @ 0.5
546
+ Skeleton Based Action Recognition#Varying-view RGB-D Action-Skeleton#Accuracy (CV II)
547
+ Text Simplification#ASSET#BLEU
548
+ Image Clustering#coil-100#NMI
549
+ Skeleton Based Action Recognition#SBU#Accuracy
550
+ Colorectal Gland Segmentation:#CRAG#Hausdorff Distance (mm)
551
+ Image Super-Resolution#BSD100 - 2x upscaling#PSNR
552
+ 6D Pose Estimation using RGB#LineMOD#Accuracy
553
+ Speech Recognition#Switchboard + Hub500#Percentage error
554
+ Link Prediction#FB15k#MR
555
+ Text Simplification#Newsela#BLEU
556
+ Data-to-Text Generation#E2E NLG Challenge#ROUGE-L
557
+ Named Entity Recognition#GENIA#F1
558
+ Visual Question Answering#GQA Test2019#Distribution
559
+ Image Classification#iNaturalist 2019#Top-1 Accuracy
560
+ Image Classification#mini WebVision 1.0#ImageNet Top-5 Accuracy
561
+ Head Pose Estimation#BIWI#MAE (trained with other data)
562
+ Question Answering#TrecQA#MAP
563
+ Visual Question Answering#VQA v1 test-std#Accuracy
564
+ Sentiment Analysis#Yelp Fine-grained classification#Error
565
+ Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#FED
566
+ Image Super-Resolution#Manga109 - 8x upscaling#SSIM
567
+ part-of-speech_tagging#VLSP 2013 POS tagging shared task#Accuracy
568
+ Nested Named Entity Recognition#GENIA#F1
569
+ Hate Speech Detection#Ethos Binary#Classification Accuracy
570
+ Machine Translation#WMT2016 English-Romanian#BLEU score
571
+ Text based Person Retrieval#CUHK-PEDES#R@10
572
+ Visual Question Answering#GQA Test2019#Consistency
573
+ Image Classification#ImageNet ReaL#Accuracy
574
+ named_entity_recognition#VLSP 2016 NER shared task#F1
575
+ Atari Games#Atari 2600 Phoenix#Score
576
+ Natural Language Inference#SNLI#% Train Accuracy
577
+ Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#FID
578
+ Visual Question Answering#CLEVR-Humans#Accuracy
579
+ Image Clustering#STL-10#Backbone
580
+ Node Classification#PubMed (0.03%)#Accuracy
581
+ Sentiment Analysis#Yelp Binary classification#Error
582
+ Skeleton Based Action Recognition#NTU RGB+D 120#Accuracy (Cross-Subject)
583
+ Word Sense Disambiguation#SensEval 3 Task 1#F1
584
+ RGB-D Salient Object Detection#NLPR#Average MAE
585
+ Dependency Parsing#Penn Treebank#POS
586
+ Language Modelling#Penn Treebank (Character Level)#Bit per Character (BPC)
587
+ Few-Shot Image Classification#Mini-Imagenet 5-way (10-shot)#Accuracy
588
+ Graph Classification#NEURON-Average#Accuracy
589
+ Node Classification#Cora (3%)#Accuracy
590
+ sentiment_analysis#SUBJ#Accuracy
591
+ amr_parsing#LDC2015E86#Smatch
592
+ Part-Of-Speech Tagging#UD#Avg accuracy
593
+ Atari Games#Atari 2600 Wizard of Wor#Score
594
+ Pose Tracking#PoseTrack2017#MOTA
595
+ 3D Object Reconstruction#Data3D−R2N2#3DIoU
596
+ Real-time Instance Segmentation#MSCOCO#AP75
597
+ Visual Question Answering#MSVD-QA#Accuracy
598
+ Few-Shot Image Classification#Meta-Dataset#Accuracy
599
+ Sentiment Analysis#SST-5 Fine-grained classification#Accuracy
600
+ Image Classification#WebVision-1000#ImageNet Top-5 Accuracy
601
+ Atari Games#Atari 2600 Atlantis#Score
602
+ Atari Games#Atari 2600 Road Runner#Score
603
+ Image Super-Resolution#Urban100 - 2x upscaling#PSNR
604
+ Semantic Segmentation#LIP val#mIoU
605
+ Real-time Instance Segmentation#MSCOCO#AP50
606
+ Speech Recognition#WSJ eval92#Word Error Rate (WER)
607
+ Domain Adaptation#Office-Caltech#Average Accuracy
608
+ Relation Extraction#DocRED#F1
609
+ Node Classification#Wiki-Vote#Accuracy
610
+ Semi-Supervised Video Object Segmentation#DAVIS 2016#J&F
611
+ Language Modelling#Penn Treebank (Word Level)#Validation perplexity
612
+ 3D Point Cloud Classification#ModelNet40#Overall Accuracy
613
+ Retinal Vessel Segmentation#DRIVE#AUC
614
+ Face Alignment#300W#AUC0.08 private
615
+ Few-Shot Image Classification#CIFAR-FS 5-way (5-shot)#Accuracy
616
+ 3D Object Detection#ScanNetV2#mAP@0.5
617
+ Multivariate Time Series Forecasting#MuJoCo#MSE (10^-2, 50% missing)
618
+ Link Prediction#YAGO3-10#Hits@10
619
+ Graph Classification#RE-M5K#Accuracy
620
+ Image Clustering#coil-100#Accuracy
621
+ Text-to-Image Generation#Multi-Modal-CelebA-HQ#Acc
622
+ Multiple Object Tracking#KITTI Tracking test#MOTA
623
+ Document Classification#Cora#Accuracy
624
+ Semantic Textual Similarity#SentEval#SICK-R
625
+ Fake News Detection#FNC-1#Weighted Accuracy
626
+ Semi-Supervised Video Object Segmentation#DAVIS 2017 (test-dev)#Jaccard (Mean)
627
+ Semantic Textual Similarity#SentEval#SICK-E
628
+ Self-Supervised Image Classification#ImageNet#Number of Params
629
+ Object Detection#Waymo 2D detection all_ns f0val#COCO-style AP
630
+ Few-Shot Image Classification#OMNIGLOT - 5-Shot, 20-way#Accuracy
631
+ Question Answering#TrecQA#MRR
632
+ Image Classification#mini WebVision 1.0#Top-1 Accuracy
633
+ Neural Architecture Search#NAS-Bench-201, CIFAR-100#Accuracy (Val)
634
+ Fine-Grained Image Classification#Stanford Cars#PARAMS
635
+ Continuous Control#PyBullet Walker2D#Return
636
+ Image-to-Image Translation#ADE20K Labels-to-Photos#FID
637
+ Machine Translation#IWSLT2015 German-English#BLEU score
638
+ Image Retrieval with Multi-Modal Query#Fashion200k#Recall@10
639
+ Time Series Classification#Wafer#NLL
640
+ Self-Supervised Image Classification#ImageNet#Top 5 Accuracy
641
+ Dialogue Act Classification#Switchboard corpus#Accuracy
642
+ Time Series Classification#CMUsubject16#Accuracy
643
+ Atari Games#Atari 2600 Bowling#Score
644
+ Sentiment Analysis#TweetEval#Hate
645
+ language_modeling#WikiText-2#Number of params
646
+ Image Super-Resolution#FFHQ 256 x 256 - 4x upscaling#MS-SSIM
647
+ 3D Multi-Object Tracking#KITTI#MOTA
648
+ Graph Classification#COLLAB#Accuracy
649
+ Gesture-to-Gesture Translation#NTU Hand Digit#IS
650
+ 3D Multi-Object Tracking#KITTI#MOTP
651
+ Link Prediction#Cora#AUC
652
+ Sentiment Analysis#Multi-Domain Sentiment Dataset#Kitchen
653
+ Image Retrieval#Oxf5k#MAP
654
+ Text Classification#Ohsumed#Accuracy
655
+ RGB-D Salient Object Detection#NJU2K#S-Measure
656
+ Retinal OCT Disease Classification#OCT2017#Sensitivity
657
+ Data-to-Text Generation#WebNLG#BLEU
658
+ Image Retrieval with Multi-Modal Query#Fashion200k#Recall@50
659
+ 3D Object Detection#SUN-RGBD val#mAP@0.25
660
+ Machine Translation#WMT2014 English-German#SacreBLEU
661
+ Fact-based Text Editing#WebEdit#F1
662
+ Few-Shot Semantic Segmentation#PASCAL-5i (1-Shot)#Mean IoU
663
+ Time Series Classification#JapaneseVowels#NLL
664
+ Synthetic-to-Real Translation#Syn2Real-C#Accuracy
665
+ Few-Shot Image Classification#Stanford Cars 5-way (1-shot)#Accuracy
666
+ Image Classification#Stanford Cars#Accuracy
667
+ 3D Instance Segmentation#ScanNet(v2)#mAP
668
+ Coreference Resolution#OntoNotes#F1
669
+ Image Generation#CelebA-HQ 1024x1024#FID
670
+ Node Classification#Pubmed#Validation
671
+ Multivariate Time Series Forecasting#USHCN-Daily#MSE
672
+ Human-Object Interaction Detection#HICO#mAP
673
+ Panoptic Segmentation#COCO test-dev#PQst
674
+ Image Classification#MNIST#Percentage error
675
+ Code Generation#WikiSQL#Execution Accuracy
676
+ Image Super-Resolution#Urban100 - 8x upscaling#SSIM
677
+ Relation Extraction#DocRED#Ign F1
678
+ Panoptic Segmentation#COCO test-dev#PQth
679
+ Object Detection#Manga109-s 15test#COCO-style AP
680
+ Instance Segmentation#Cityscapes test#Average Precision
681
+ Action Classification#Charades#MAP
682
+ Interactive Segmentation#GrabCut#NoC@85
683
+ Action Classification#Kinetics-400#Flops x views
684
+ Image Clustering#Imagenet-dog-15#Accuracy
685
+ Real-Time Object Detection#COCO#FPS
686
+ Recommendation Systems#MovieLens 1M#nDCG@10
687
+ Speech Enhancement#DEMAND#CBAK
688
+ word_sense_disambiguation#Senseval 3#F1
689
+ Semi-Supervised Image Classification#ImageNet - 1% labeled data#Top 1 Accuracy
690
+ Recommendation Systems#Million Song Dataset#Recall@50
691
+ Named Entity Recognition#NCBI-disease#F1
692
+ Trajectory Prediction#Stanford Drone#ADE-8/12 @K = 20
693
+ Image Clustering#Fashion-MNIST#NMI
694
+ Relation Extraction#TACRED#F1
695
+ Fine-Grained Image Classification#Stanford Dogs#Accuracy
696
+ Link Prediction#Yelp#HR@10
697
+ Color Image Denoising#CBSD68 sigma50#PSNR
698
+ Action Segmentation#50 Salads#F1@10%
699
+ Cross-Lingual NER#CoNLL Spanish#F1
700
+ Machine Translation#WMT2014 English-French#BLEU score
701
+ 3D Multi-Person Pose Estimation (absolute)#MuPoTS-3D#3DPCK
702
+ Sentiment Analysis#TweetEval#Sentiment
703
+ RGB-D Salient Object Detection#NJU2K#max F-Measure
704
+ Atari Games#Atari 2600 Solaris#Score
705
+ Depth Completion#KITTI Depth Completion#RMSE
706
+ Entity Linking#WiC-TSV#Task 1 Accuracy: general purpose
707
+ Action Segmentation#50 Salads#Edit
708
+ Interactive Segmentation#GrabCut#NoC@90
709
+ Visual Dialog#Visual Dialog v1.0 test-std#R@5
710
+ Few-Shot Semantic Segmentation#PASCAL-5i (5-Shot)#Mean IoU
711
+ Visual Dialog#Visual Dialog v1.0 test-std#R@1
712
+ Keypoint Detection#COCO test-dev#ARM
713
+ Keypoint Detection#COCO test-dev#ARL
714
+ Link Prediction#MovieLens 25M#nDCG@10
715
+ Image Super-Resolution#Set5 - 2x upscaling#PSNR
716
+ Image Super-Resolution#Manga109 - 2x upscaling#PSNR
717
+ Keypoint Detection#COCO test-dev#APM
718
+ Question Answering#QASent#MAP
719
+ Keypoint Detection#COCO test-dev#APL
720
+ Unsupervised Domain Adaptation#Office-Home (RS-UT imbalance)#Average Per-Class Accuracy
721
+ Visual Question Answering#COCO Visual Question Answering (VQA) real images 1.0 open ended#Percentage correct
722
+ Hate Speech Detection#Ethos Binary#F1-score
723
+ Action Segmentation#Breakfast#F1@25%
724
+ relation_prediction#FB15K-237#H@10
725
+ Adversarial Defense#ImageNet (non-targeted PGD, max perturbation=4)#Accuracy
726
+ Action Segmentation#Breakfast#Edit
727
+ Domain Adaptation#MNIST-to-USPS#Accuracy
728
+ Language Modelling#WikiText-103#Test perplexity
729
+ Time Series Classification#Wafer#Accuracy
730
+ Link Prediction#WN18#Hits@3
731
+ Link Prediction#WN18#Hits@1
732
+ Spoken language identification#VoxForge European#Accuracy (%)
733
+ Birds Eye View Object Detection#KITTI Cars Hard#AP
734
+ Time Series Classification#ECG#Accuracy
735
+ Video Semantic Segmentation#CamVid#Mean IoU
736
+ Link Prediction#FB15k-237#MRR
737
+ Video Super-Resolution#Vid4 - 4x upscaling#MOVIE
738
+ Neural Architecture Search#CIFAR-10#Parameters
739
+ Face Verification#Labeled Faces in the Wild#Accuracy
740
+ Unsupervised Domain Adaptation#Duke to MSMT#mAP
741
+ Few-Shot Image Classification#CUB 200 5-way 1-shot#Accuracy
742
+ Scene Text Detection#MSRA-TD500#Recall
743
+ Machine Translation#IWSLT2015 English-German#BLEU score
744
+ Sentiment Analysis#TweetEval#Offensive
745
+ Cross-Lingual Document Classification#MLDoc Zero-Shot English-to-Spanish#Accuracy
746
+ Fact-based Text Editing#WebEdit#Recall
747
+ Semantic Textual Similarity#STS Benchmark#Spearman Correlation
748
+ Vision and Language Navigation#VLN Challenge#error
749
+ Image Clustering#Extended Yale-B#Accuracy
750
+ Object Detection#COCO test-dev#AP75
751
+ Cross-Modal Retrieval#Flickr30k#Text-to-image R@10
752
+ Interactive Segmentation#DAVIS#NoC@85
753
+ Person Re-Identification#CUHK03#Rank-1
754
+ Atari Games#Atari 2600 Gravitar#Score
755
+ Interactive Segmentation#DAVIS#NoC@90
756
+ Code Generation#WikiSQL#Exact Match Accuracy
757
+ Few-Shot Image Classification#Mini-Imagenet 5-way (5-shot)#Accuracy
758
+ Semi-Supervised Image Classification#cifar-100, 10000 Labels#Accuracy
759
+ Object Detection#COCO minival#oLRP
760
+ language_modeling#WikiText-103#Number of params
761
+ Chinese Named Entity Recognition#Resume NER#F1
762
+ Entity Disambiguation#AIDA-CoNLL#In-KB Accuracy
763
+ Speech Enhancement#DEMAND#CSIG
764
+ language_modeling#Penn Treebank#Number of params
765
+ Image Generation#CIFAR-10#FID
766
+ Object Detection#COCO test-dev#AP50
767
+ Grayscale Image Denoising#Set12 sigma15#PSNR
768
+ Semantic Role Labeling#CoNLL 2005#F1
769
+ JPEG Artifact Correction#Live1 (Quality 10 Grayscale)#SSIM
770
+ Unsupervised Machine Translation#WMT2014 English-French#BLEU
771
+ Semi-Supervised Video Object Segmentation#DAVIS 2017 (val)#Jaccard (Recall)
772
+ Question Generation#SQuAD1.1#BLEU-4
773
+ Scene Text Detection#ICDAR 2015#Precision
774
+ Cross-Lingual Document Classification#MLDoc Zero-Shot English-to-Russian#Accuracy
775
+ 3D Object Detection#KITTI Cars Easy val#AP
776
+ 3D Human Pose Estimation#3DPW#acceleration error
777
+ Text Simplification#TurkCorpus#BLEU
778
+ Semi-Supervised Image Classification#ImageNet - 10% labeled data#Top 5 Accuracy
779
+ Unsupervised Image Classification#MNIST#Accuracy
780
+ amr_parsing#LDC2014T12#F1 on Full
781
+ dependency_parsing#benchmark Vietnamese dependency treebank VnDT#UAS
782
+ Atari Games#Atari 2600 Video Pinball#Score
783
+ Image Classification#EMNIST-Balanced#Accuracy
784
+ Person Re-Identification#MARS#Rank-5
785
+ Image Clustering#MNIST-test#NMI
786
+ Semantic Similarity#SICK#Spearman Correlation
787
+ Person Re-Identification#MARS#Rank-1
788
+ Link Prediction#Yelp#nDCG@10
789
+ Neural Architecture Search#CIFAR-100#FLOPS
790
+ Question Answering#Quora Question Pairs#Accuracy
791
+ Word Sense Disambiguation#SemEval 2015 Task 13#F1
792
+ Speech Synthesis#North American English#Mean Opinion Score
793
+ Fine-Grained Image Classification#NABirds#Accuracy
794
+ Music Transcription#MusicNet#Number of params
795
+ Link Prediction#FB15k#MRR
796
+ Image Retrieval#Flickr30K 1K test#R@10
797
+ Mortality Prediction#MIMIC-III#Recall
798
+ Text Simplification#PWKP / WikiSmall#BLEU
799
+ Neural Architecture Search#CIFAR-100#PARAMS
800
+ Semantic Role Labeling (predicted predicates)#CoNLL 2012#F1
801
+ Fact-based Text Editing#WebEdit#DELETE
802
+ Grammatical Error Correction#CoNLL-2014 Shared Task#F0.5
803
+ Scene Text Detection#ICDAR 2015#Recall
804
+ 3D Object Detection#KITTI Cars Hard#AP
805
+ Neural Architecture Search#CIFAR-100#Percentage Error
806
+ Cross-Lingual Document Classification#MLDoc Zero-Shot English-to-French#Accuracy
807
+ Semi-Supervised Video Object Segmentation#DAVIS 2017 (test-dev)#F-measure (Decay)
808
+ Aspect-Based Sentiment Analysis#SemEval 2014 Task 4 Laptop#F1
809
+ Node Classification#CiteSeer with Public Split: fixed 20 nodes per class#Accuracy
810
+ Temporal Action Localization#THUMOS’14#mAP IOU@0.2
811
+ Temporal Action Localization#THUMOS’14#mAP IOU@0.3
812
+ Subjectivity Analysis#SUBJ#Accuracy
813
+ Temporal Action Localization#THUMOS’14#mAP IOU@0.1
814
+ Temporal Action Localization#THUMOS’14#mAP IOU@0.6
815
+ Temporal Action Localization#THUMOS’14#mAP IOU@0.7
816
+ Temporal Action Localization#THUMOS’14#mAP IOU@0.4
817
+ Real-time Instance Segmentation#MSCOCO#APL
818
+ Temporal Action Localization#THUMOS’14#mAP IOU@0.5
819
+ Real-time Instance Segmentation#MSCOCO#APM
820
+ Question Answering#bAbi#Accuracy (trained on 10k)
821
+ Real-time Instance Segmentation#MSCOCO#APS
822
+ Speech Recognition#TIMIT#Percentage error
823
+ Visual Dialog#Visual Dialog v1.0 test-std#Mean
824
+ Graph Classification#NEURON-BINARY#Accuracy
825
+ Language Modelling#Penn Treebank (Word Level)#Test perplexity
826
+ Unsupervised Machine Translation#WMT2014 French-English#BLEU
827
+ Video Retrieval#MSVD#text-to-video R@5
828
+ RGB-D Salient Object Detection#NJU2K#Average MAE
829
+ Video Retrieval#MSVD#text-to-video R@1
830
+ text_classification#AG News#Error
831
+ Pose Estimation#MPII Human Pose#PCKh-0.5
832
+ Scene Text Detection#MSRA-TD500#Precision
833
+ 3D Human Pose Estimation#3DPW#PA-MPJPE
834
+ Image Clustering#ImageNet-10#NMI
835
+ Face Alignment#WFLW#FR@0.1(%, all)
836
+ Image-to-Image Translation#COCO-Stuff Labels-to-Photos#FID
837
+ relationship_extraction#New York Times Corpus#P@30%
838
+ Fine-Grained Image Classification#Caltech-101#Top-1 Error Rate
839
+ Human-Object Interaction Detection#V-COCO#MAP
840
+ Conversational Response Selection#PolyAI Reddit#1-of-100 Accuracy
841
+ Semi-Supervised Semantic Segmentation#Cityscapes 12.5% labeled#Validation mIoU
842
+ Fact-based Text Editing#WebEdit#BLEU
843
+ Neural Architecture Search#NAS-Bench-201, ImageNet-16-120#Accuracy (Test)
844
+ Object Counting#Pascal VOC 2007 count-test#mRMSE-nz
845
+ Sentiment Analysis#IMDb#Accuracy
846
+ Image Generation#Binarized MNIST#nats
847
+ 3D Object Detection#ScanNetV2#mAP@0.25
848
+ Lane Detection#CULane#F1 score
849
+ Unsupervised Domain Adaptation#Duke to MSMT#rank-10
850
+ Image Clustering#Imagenet-dog-15#NMI
851
+ Image Super-Resolution#Set14 - 3x upscaling#PSNR
852
+ Dialogue State Tracking#Wizard-of-Oz#Request
853
+ Pedestrian Detection#Caltech#Reasonable Miss Rate
854
+ Instance Segmentation#COCO minival#mask AP
855
+ Relation Extraction#ADE Corpus#RE+ Macro F1
856
+ Unsupervised Video Object Segmentation#DAVIS 2016#Jaccard (Decay)
857
+ Semi-Supervised Image Classification#SVHN, 1000 labels#Accuracy
858
+ Time Series Classification#KickvsPunch#NLL
859
+ Person Re-Identification#CUHK03 labeled#Rank-1
860
+ Semi-Supervised Video Object Segmentation#YouTube-VOS#F-Measure (Unseen)
861
+ JPEG Artifact Correction#LIVE1 (Quality 10 Color)#SSIM
862
+ Atari Games#Atari 2600 Tennis#Score
863
+ 3D Object Reconstruction#Data3D−R2N2#Avg F1
864
+ Question Answering#QASent#MRR
865
+ Traffic Prediction#PeMS-M#MAE (60 min)
866
+ Constituency Grammar Induction#PTB#Max F1 (WSJ)
867
+ Conditional Image Generation#CIFAR-10#FID
868
+ Visual Question Answering#VQA v2 test-std#yes/no
869
+ Image Classification#Flowers-102#Accuracy
870
+ Image Super-Resolution#Set5 - 4x upscaling#SSIM
871
+ Recommendation Systems#MovieLens 1M#RMSE
872
+ Action Segmentation#Breakfast#F1@10%
873
+ Graph Classification#ENZYMES#Accuracy
874
+ Unsupervised Facial Landmark Detection#MAFL#NME
875
+ Keypoint Detection#COCO test-dev#AR50
876
+ Depth Completion#KITTI Depth Completion#Runtime [ms]
877
+ Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#PSNR
878
+ Image Super-Resolution#Urban100 - 4x upscaling#SSIM
879
+ Constituency Parsing#Penn Treebank#F1 score
880
+ Person Re-Identification#CUHK03 labeled#MAP
881
+ Keypoint Detection#COCO test-dev#AR75
882
+ Panoptic Segmentation#Cityscapes val#mIoU
883
+ Relation Extraction#ADE Corpus#NER Macro F1
884
+ Semi-Supervised Video Object Segmentation#YouTube#mIoU
885
+ Object Detection#UAVDT#mAP
886
+ Keypoint Detection#COCO test-challenge#ARL
887
+ Keypoint Detection#COCO test-challenge#ARM
888
+ Question Answering#WikiQA#MRR
889
+ Image Generation#Cityscapes#FID-10k-training-steps
890
+ Real-time Instance Segmentation#MSCOCO#Frame (fps)
891
+ Few-Shot Image Classification#FC100 5-way (5-shot)#Accuracy
892
+ word_segmentation#Chinese Treebank 6#F1
893
+ summarization#CNN / Daily Mail (Anonymized version)#ROUGE-2
894
+ summarization#CNN / Daily Mail (Anonymized version)#ROUGE-1
895
+ Cross-Lingual NER#CoNLL Dutch#F1
896
+ Natural Language Inference#FarsTail#% Test Accuracy
897
+ Scene Text Detection#Total-Text#Precision
898
+ Link Prediction#YAGO3-10#Hits@3
899
+ Link Prediction#YAGO3-10#Hits@1
900
+ Word Sense Disambiguation#SemEval 2007 Task 17#F1
901
+ Neural Architecture Search#CIFAR-10#Search Time (GPU days)
902
+ 3D Object Detection#KITTI Pedestrians Hard#AP
903
+ word_segmentation#VLSP 2013 word segmentation shared task#F1
904
+ Image Clustering#Tiny-ImageNet#Accuracy
905
+ summarization#CNN / Daily Mail (Anonymized version)#ROUGE-L
906
+ Visual Question Answering#VQA-CP#Score
907
+ Node Classification#USA Air-Traffic#Accuracy
908
+ Image Clustering#CIFAR-10#ARI
909
+ Image/Document Clustering#pendigits#runtime (s)
910
+ Action Segmentation#GTEA#Edit
911
+ Weakly Supervised Action Localization#ActivityNet-1.3#mAP@0.5
912
+ Panoptic Segmentation#Cityscapes test#PQ
913
+ taxonomy_learning#SemEval 2018#MAP
914
+ AMR Parsing#LDC2014T12#F1 Full
915
+ sentiment_analysis#SemEval-2014 Task 4 subtask 2 Aspect Term Polarity#Laptop (acc)
916
+ Keypoint Detection#COCO test-challenge#APL
917
+ Fundus to Angiography Generation#Fundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic Patients#Kernel Inception Distance
918
+ Hate Speech Detection#HateXplain#Accuracy
919
+ Image Denoising#SIDD#SSIM (sRGB)
920
+ Document Summarization#CNN / Daily Mail#ROUGE-1
921
+ Document Summarization#CNN / Daily Mail#ROUGE-2
922
+ Few-Shot Object Detection#MS-COCO (10-shot)#AP
923
+ Time Series Classification#PenDigits#NLL
924
+ word_segmentation#MSR#F1
925
+ 3D Human Pose Estimation#Human3.6M#Average MPJPE (mm)
926
+ Semantic Segmentation#SkyScapes-Dense#Mean IoU
927
+ Object Counting#COCO count-test#m-reIRMSE
928
+ Visual Question Answering#GQA Test2019#Accuracy
929
+ Speech Enhancement#DEMAND#PESQ
930
+ Node Classification#Cornell#Accuracy
931
+ Document Summarization#CNN / Daily Mail#ROUGE-L
932
+ Grammatical Error Correction#BEA-2019 (test)#F0.5
933
+ Visual Question Answering#GQA test-std#Accuracy
934
+ Click-Through Rate Prediction#Amazon#AUC
935
+ Multimodal Machine Translation#Multi30K#BLEU (EN-DE)
936
+ Skeleton Based Action Recognition#NTU RGB+D 120#Accuracy (Cross-Setup)
937
+ Weakly Supervised Action Localization#THUMOS14#avg-mAP (0.3-0.7)
938
+ Open-Domain Question Answering#SearchQA#N-gram F1
939
+ Keypoint Detection#COCO test-challenge#AR50
940
+ RGB-D Salient Object Detection#NJU2K#max E-Measure
941
+ Domain Adaptation#SYNSIG-to-GTSRB#Accuracy
942
+ Image Super-Resolution#FFHQ 256 x 256 - 4x upscaling#PSNR
943
+ Keypoint Detection#COCO test-challenge#AR75
944
+ Retinal Vessel Segmentation#STARE#AUC
945
+ Stochastic Optimization#CIFAR-100 WRN-28-10 - 200 Epochs#Accuracy
946
+ Spoken language identification#LRE07#3 sec
947
+ 3D Semantic Segmentation#SemanticKITTI#mIoU
948
+ Text Summarization#arXiv#ROUGE-1
949
+ Text Summarization#arXiv#ROUGE-2
950
+ Image Matting#Composition-1K#SAD
951
+ Vision and Language Navigation#VLN Challenge#length
952
+ Object Counting#COCO count-test#mRMSE
953
+ Scene Text Recognition#SVT#Accuracy
954
+ Atari Games#Atari 2600 Demon Attack#Score
955
+ Lipreading#Lip Reading in the Wild#Top-1 Accuracy
956
+ Image Classification#Flowers-102#PARAMS
957
+ Time Series Classification#CharacterTrajectories#NLL
958
+ Text Summarization#arXiv#ROUGE-L
959
+ question_answering#CNN / Daily Mail#Accuracy on Daily Mail
960
+ Instance Segmentation#iSAID#Average Precision
961
+ Single Image Deraining#Test1200#PSNR
962
+ Visual Question Answering#VQA v1 test-dev#Accuracy
963
+ Word Sense Disambiguation#SemEval 2007 Task 7#F1
964
+ Multimodal Activity Recognition#EV-Action#Accuracy
965
+ Semi-Supervised Video Object Segmentation#DAVIS 2017 (val)#Jaccard (Decay)
966
+ Image Super-Resolution#FFHQ 1024 x 1024 - 4x upscaling#MS-SSIM
967
+ Entity Linking#WiC-TSV#Task 3 Accuracy: domain specific
968
+ relationship_extraction#SemEval-2010 Task 8#F1
969
+ Recommendation Systems#MovieLens 1M#HR@10
970
+ Named Entity Recognition#ACE 2004#F1
971
+ Node Classification#Facebook#Accuracy
972
+ Action Detection#Charades#mAP
973
+ Atari Games#Atari 2600 Amidar#Score
974
+ Image Classification#WebVision-1000#ImageNet Top-1 Accuracy
975
+ Scene Text Detection#ICDAR 2017 MLT#Precision
976
+ Fact-based Text Editing#WebEdit#KEEP
977
+ Visual Object Tracking#LaSOT#AUC
978
+ Image Classification#iNaturalist#Top 1 Accuracy
979
+ Graph Classification#UPFD-POL#Accuracy (%)
980
+ Skeleton Based Action Recognition#N-UCLA#Accuracy
981
+ Scene Text Detection#ICDAR 2017 MLT#Recall
982
+ Conditional Image Generation#ImageNet 128x128#FID
983
+ language_modeling#1B Words / Google Billion Word benchmark#Test perplexity
984
+ 6D Pose Estimation#YCB-Video#ADDS AUC
985
+ Semi-Supervised Image Classification#CIFAR-10, 250 Labels#Accuracy
986
+ Semi-Supervised Video Object Segmentation#YouTube-VOS#F-Measure (Seen)
987
+ Image Super-Resolution#Manga109 - 4x upscaling#SSIM
988
+ Panoptic Segmentation#COCO panoptic#PQst
989
+ machine_translation#WMT 2014 EN-FR#BLEU
990
+ Entity Linking#WiC-TSV#Task 3 Accuracy: all
991
+ Pose Estimation#COCO test-dev#AP50
992
+ Few-Shot Image Classification#Stanford Dogs 5-way (5-shot)#Accuracy
993
+ Panoptic Segmentation#COCO panoptic#PQth
994
+ Atari Games#Atari 2600 Chopper Command#Score
995
+ Time Series Classification#PEMS#NLL
996
+ Question Answering#SQuAD2.0 dev#F1
997
+ Question Answering#SQuAD2.0 dev#EM
998
+ Natural Language Inference#MultiNLI#Matched
999
+ Dense Pixel Correspondence Estimation#HPatches#Viewpoint V AEPE
1000
+ Unsupervised Domain Adaptation#Market to Duke#mAP
1001
+ Time Series Classification#NetFlow#NLL
1002
+ Node Classification#PPI#F1
1003
+ Temporal Action Proposal Generation#ActivityNet-1.3#AR@100
1004
+ Sequential Image Classification#Sequential MNIST#Permuted Accuracy
1005
+ Click-Through Rate Prediction#Bing News#Log Loss
1006
+ Neural Architecture Search#CIFAR-10 Image Classification#Percentage error
1007
+ JPEG Artifact Correction#ICB (Quality 20 Color)#PSNR
1008
+ Data-to-Text Generation#WebNLG Full#BLEU
1009
+ Pose Estimation#Leeds Sports Poses#PCK
1010
+ Person Re-Identification#Market-1501#Rank-5
1011
+ Semantic Segmentation#COCO-Stuff test#mIoU
1012
+ Person Re-Identification#Market-1501#Rank-1
1013
+ JPEG Artifact Correction#LIVE1 (Quality 20 Grayscale)#PSNR
1014
+ Conditional Image Generation#CIFAR-10#Inception score
1015
+ Pose Estimation#COCO test-dev#AP75
1016
+ Image Generation#CelebA 256x256#bpd
1017
+ Object Detection#KITTI Cars Easy#AP
1018
+ Reading Comprehension#RACE#Accuracy (Middle)
1019
+ Unsupervised Domain Adaptation#Cityscapes to Foggy Cityscapes#mAP@0.5
1020
+ Real-Time Semantic Segmentation#Cityscapes test#Time (ms)
1021
+ Ad-Hoc Information Retrieval#TREC Robust04#MAP
1022
+ Image Clustering#CIFAR-100#Accuracy
1023
+ Image Clustering#USPS#Accuracy
1024
+ Question Answering#CNN / Daily Mail#CNN
1025
+ Image Retrieval#CARS196#R@1
1026
+ Image Super-Resolution#Set5 - 8x upscaling#SSIM
1027
+ Fine-Grained Image Classification#Oxford-IIIT Pets#Top-1 Error Rate
1028
+ Neural Architecture Search#CIFAR-10#Top-1 Error Rate
1029
+ Image Clustering#USPS#NMI
1030
+ Real-Time Semantic Segmentation#NYU Depth v2#mIoU
1031
+ Node Classification#Citeseer Full-supervised#Accuracy
1032
+ Atari Games#Atari 2600 Battle Zone#Score
1033
+ Graph Regression#Lipophilicity#RMSE
1034
+ Video Instance Segmentation#YouTube-VIS validation#AP75
1035
+ Image Classification#ImageNet V2#Top 1 Accuracy
1036
+ Action Segmentation#Breakfast#Acc
1037
+ Scene Text Recognition#ICDAR2013#Accuracy
1038
+ Few-Shot Image Classification#Tiered ImageNet 10-way (1-shot)#Accuracy
1039
+ Semantic Segmentation#S3DIS Area5#mAcc
1040
+ Cross-Modal Retrieval#COCO 2014#Image-to-text R@10
1041
+ Object Counting#Pascal VOC 2007 count-test#m-relRMSE
1042
+ Link Prediction#FB15k-237#MR
1043
+ Spoken language identification#LRE07#10 sec
1044
+ Video Instance Segmentation#YouTube-VIS validation#AP50
1045
+ Text Classification#R8#Accuracy
1046
+ Node Classification#Wikipedia#Macro-F1
1047
+ Atari Games#Atari 2600 Alien#Score
1048
+ Atari Games#Atari 2600 Q*Bert#Score
1049
+ Single Image Deraining#Rain100L#PSNR
1050
+ Image Super-Resolution#Set14 - 8x upscaling#PSNR
1051
+ Question Answering#NarrativeQA#METEOR
1052
+ Single Image Deraining#Test2800#PSNR
1053
+ 3D Object Detection#nuScenes#mAP
1054
+ Optical Flow Estimation#Sintel-clean#Average End-Point Error
1055
+ Image Classification#Oxford-IIIT Pets#Accuracy
1056
+ Object Detection#KITTI Cars Moderate#AP
1057
+ Grayscale Image Denoising#Urban100 sigma50#PSNR
1058
+ Atari Games#Atari 2600 Defender#Score
1059
+ Zero-Shot Learning#SUN Attribute#average top-1 classification accuracy
1060
+ Semantic Textual Similarity#SentEval#MRPC
1061
+ Word Sense Disambiguation#WiC-TSV#Task 3 Accuracy: domain specific
1062
+ Few-Shot Object Detection#MS-COCO (30-shot)#AP
1063
+ relationship_extraction#New York Times Corpus#P@10%
1064
+ Few-Shot Image Classification#Mini-Imagenet 5-way (1-shot)#Accuracy
1065
+ 3D Human Pose Estimation#MPI-INF-3DHP#MJPE
1066
+ Graph Classification#HIV-fMRI-77#F1
1067
+ Sentiment Analysis#TweetEval#ALL
1068
+ Single Image Deraining#Rain100H#SSIM
1069
+ Medical Image Segmentation#CVC-ClinicDB#mean Dice
1070
+ Video Generation#UCF-101 16 frames, 64x64, Unconditional#Inception Score
1071
+ question_answering#Quasar#EM (Quasar-T)
1072
+ Person Re-Identification#Market-1501#Rank-10
1073
+ Question Answering#CNN / Daily Mail#Daily Mail
1074
+ Video Object Detection#ImageNet VID#MAP
1075
+ Weakly Supervised Action Localization#THUMOS 2014#mAP@0.5
1076
+ Humor Detection#200k Short Texts for Humor Detection#F1-score
1077
+ Node Classification#Flickr#Accuracy
1078
+ Multi-Object Tracking#MOT17#MOTA
1079
+ Sentiment Analysis#Amazon Review Full#Accuracy
1080
+ Language Modelling#Hutter Prize#Bit per Character (BPC)
1081
+ Semantic Segmentation#ScanNet#3DIoU
1082
+ Semantic Segmentation#ADE20K#Test Score
1083
+ Crowd Counting#UCF-QNRF#MAE
1084
+ word_sense_disambiguation#SemEval 2007#F1
1085
+ Question Answering#WikiQA#MAP
1086
+ Image-to-Image Translation#COCO-Stuff Labels-to-Photos#mIoU
1087
+ Keypoint Detection#COCO test-dev#AP50
1088
+ Semantic Segmentation#Nighttime Driving#mIoU
1089
+ Semantic Textual Similarity#SICK#Spearman Correlation
1090
+ Text-to-Image Generation#CUB#Inception score
1091
+ Visual Dialog#Visual Dialog v1.0 test-std#R@10
1092
+ Mortality Prediction#MIMIC-III#Precision
1093
+ Keypoint Detection#COCO test-dev#AP75
1094
+ Dependency Parsing#Penn Treebank#UAS
1095
+ Graph Classification#NCI109#Accuracy
1096
+ Text Summarization#X-Sum#ROUGE-3
1097
+ Text Summarization#X-Sum#ROUGE-2
1098
+ Text Summarization#X-Sum#ROUGE-1
1099
+ Unsupervised Domain Adaptation#Duke to MSMT#rank-1
1100
+ Person Search#CUHK-SYSU#MAP
1101
+ Unsupervised Domain Adaptation#Duke to MSMT#rank-5
1102
+ Semantic Role Labeling#OntoNotes#F1
1103
+ Semantic Similarity#SICK#Pearson Correlation
1104
+ Video Retrieval#LSMDC#text-to-video R@10
1105
+ Image Classification#VTAB-1k#Top-1 Accuracy
1106
+ Anomaly Detection#Unlabeled CIFAR-10 vs CIFAR-100#AUROC
1107
+ Line Segment Detection#wireframe dataset#sAP5
1108
+ Domain Adaptation#SVNH-to-MNIST#Accuracy
1109
+ 3D Point Cloud Classification#ScanObjectNN#Overall Accuracy
1110
+ Vehicle Pose Estimation#KITTI Cars Hard#Average Orientation Similarity
1111
+ Weakly Supervised Object Detection#PASCAL VOC 2012 test#MAP
1112
+ Aspect-Based Sentiment Analysis#SemEval 2014 Task 4 Sub Task 2#Laptop (Acc)
1113
+ Few-Shot Image Classification#OMNIGLOT - 1-Shot, 5-way#Accuracy
1114
+ Language Modelling#WikiText-2#Test perplexity
1115
+ Graph Classification#IMDb-B#Accuracy
1116
+ sentiment_analysis#SST-2#Accuracy
1117
+ Multi-tissue Nucleus Segmentation#Kumar#Hausdorff Distance (mm)
1118
+ Hate Speech Detection#Ethos Binary#Precision
1119
+ Time Series Classification#AUSLAN#Accuracy
1120
+ Click-Through Rate Prediction#Dianping#AUC
1121
+ Face Verification#Trillion Pairs Dataset#Accuracy
1122
+ Sentiment Analysis#TweetEval#Irony
1123
+ dependency_parsing#Penn Treebank#LAS
1124
+ Sentiment Analysis#MR#Accuracy
1125
+ Video Generation#UCF-101 16 frames, Unconditional, Single GPU#Inception Score
1126
+ Unsupervised Machine Translation#WMT2016 English-German#BLEU
1127
+ Node Classification#Wisconsin#Accuracy
1128
+ Cross-Modal Retrieval#COCO 2014#Text-to-image R@5
1129
+ Cross-Modal Retrieval#COCO 2014#Text-to-image R@1
1130
+ Video Instance Segmentation#YouTube-VIS validation#AR1
1131
+ Question Answering#NewsQA#F1
1132
+ Visual Object Tracking#VOT2017#Expected Average Overlap (EAO)
1133
+ Node Classification#Wikipedia#Accuracy
1134
+ Action Classification#Kinetics-700#Top-1 Accuracy
1135
+ Atari Games#Atari 2600 Kung-Fu Master#Score
1136
+ Image Classification#CIFAR-100#Percentage correct
1137
+ Machine Translation#WMT2014 German-English#BLEU score
1138
+ Object Counting#Pascal VOC 2007 count-test#m-reIRMSE-nz
1139
+ Trajectory Prediction#Stanford Drone#FDE-8/12 @K= 20
1140
+ Zero-Shot Learning#CUB-200-2011#average top-1 classification accuracy
1141
+ Word Sense Disambiguation#Supervised:#SemEval 2015
1142
+ Named Entity Recognition#BC5CDR#F1
1143
+ Word Sense Disambiguation#Supervised:#SemEval 2013
1144
+ Word Sense Disambiguation#Supervised:#SemEval 2007
1145
+ Language Modelling#WikiText-2#Number of params
1146
+ Line Segment Detection#wireframe dataset#sAP15
1147
+ Line Segment Detection#wireframe dataset#sAP10
1148
+ Node Classification#Pubmed#Accuracy
1149
+ Neural Architecture Search#CIFAR-10 Image Classification#FLOPS
1150
+ Visual Object Tracking#GOT-10k#Success Rate 0.5
1151
+ Retinal OCT Disease Classification#OCT2017#Acc
1152
+ Lesion Segmentation#Anatomical Tracings of Lesions After Stroke (ATLAS)#Dice
1153
+ Lane Detection#TuSimple#Accuracy
1154
+ summarization#CNN / Daily Mail (Non-anonymized version)#METEOR
1155
+ Image Clustering#CIFAR-10#Backbone
1156
+ Neural Architecture Search#NAS-Bench-201, CIFAR-10#Accuracy (Test)
1157
+ 6D Pose Estimation using RGBD#LineMOD#Mean ADD
1158
+ text_classification#DBpedia#Error
1159
+ Person Re-Identification#MARS#mAP
1160
+ Visual Question Answering#COCO Visual Question Answering (VQA) real images 1.0 multiple choice#Percentage correct
1161
+ Time Series Classification#KickvsPunch#Accuracy
1162
+ Hyperspectral Image Classification#Pavia University#Overall Accuracy
1163
+ Text Simplification#TurkCorpus#SARI (EASSE>=0.2.1)
1164
+ Graph Clustering#Cora#Accuracy
1165
+ Vision and Language Navigation#VLN Challenge#spl
1166
+ Crowd Counting#UCF CC 50#MAE
1167
+ Keypoint Detection#COCO test-challenge#AP50
1168
+ Video Retrieval#LSMDC#text-to-video Median Rank
1169
+ Sentiment Analysis#TweetEval#Stance
1170
+ chunking#Penn Treebank#F1
1171
+ Keypoint Detection#COCO test-challenge#AP75
1172
+ Relation Extraction#ACE 2004#NER Micro F1
1173
+ Semi-Supervised Image Classification#ImageNet - 10% labeled data#Top 1 Accuracy
1174
+ Atari Games#Atari 2600 HERO#Score
1175
+ Multi-tissue Nucleus Segmentation#Kumar#Dice
1176
+ Link Prediction#WN18#Hits@10
1177
+ Semantic Segmentation#S3DIS#mAcc
1178
+ Image Super-Resolution#BSD100 - 4x upscaling#SSIM
1179
+ Image Classification#mini WebVision 1.0#ImageNet Top-1 Accuracy
1180
+ Anomaly Detection#One-class ImageNet-30#AUROC
1181
+ Few-Shot Image Classification#Tiered ImageNet 5-way (1-shot)#Accuracy
1182
+ Neural Architecture Search#ImageNet#Params
1183
+ Multimodal Activity Recognition#Moments in Time Dataset#Top-5 (%)
1184
+ question_answering#SearchQA#EM
1185
+ question_answering#SearchQA#F1
1186
+ Image-to-Image Translation#Cityscapes Labels-to-Photo#Per-pixel Accuracy
1187
+ Real-Time Semantic Segmentation#CamVid#Frame (fps)
1188
+ Image Generation#CIFAR-10#Inception score
1189
+ Click-Through Rate Prediction#MovieLens 20M#AUC
1190
+ summarization#CNN / Daily Mail (Non-anonymized version)#ROUGE-L
1191
+ Action Recognition#NTU RGB+D#Accuracy (CV)
1192
+ Cross-Modal Retrieval#Flickr30k#Image-to-text R@5
1193
+ Cross-Modal Retrieval#Flickr30k#Image-to-text R@1
1194
+ Semantic Segmentation#ADE20K val#mIoU
1195
+ Multi-Label Classification#PASCAL VOC 2007#mAP
1196
+ Ad-Hoc Information Retrieval#TREC Robust04#nDCG@20
1197
+ Scene Text Detection#Total-Text#Recall
1198
+ Unsupervised Person Re-Identification#DukeMTMC-reID#Rank-1
1199
+ Birds Eye View Object Detection#KITTI Cars Easy#AP
1200
+ Emotion Recognition in Conversation#MELD#Weighted Macro-F1
1201
+ Graph Classification#UPFD-GOS#Accuracy (%)
1202
+ Named Entity Recognition#CoNLL 2003 (German)#F1
1203
+ Person Re-Identification#MSMT17#mAP
1204
+ Image Matting#Composition-1K#Grad
1205
+ Birds Eye View Object Detection#KITTI Pedestrians Moderate#AP
1206
+ Atari Games#Atari 2600 Space Invaders#Score
1207
+ Real-Time Object Detection#PASCAL VOC 2007#MAP
1208
+ Graph Regression#ZINC#MAE
1209
+ Sentiment Analysis#Multi-Domain Sentiment Dataset#Electronics
1210
+ Action Recognition#NTU RGB+D#Accuracy (CS)
1211
+ Semantic Textual Similarity#SentEval#STS
1212
+ Neural Architecture Search#NAS-Bench-201, CIFAR-100#Search time (s)
1213
+ Node Classification#MAG240M-LSC#Test Accuracy
1214
+ summarization#CNN / Daily Mail (Non-anonymized version)#ROUGE-1
1215
+ summarization#CNN / Daily Mail (Non-anonymized version)#ROUGE-2
1216
+ Retinal OCT Disease Classification#Srinivasan2014#Acc
1217
+ Skeleton Based Action Recognition#SYSU 3D#Accuracy
1218
+ Video Frame Interpolation#Middlebury#Interpolation Error
1219
+ Word Sense Disambiguation#WiC-TSV#Task 3 Accuracy: all
1220
+ Grammatical Error Correction#JFLEG#GLEU
1221
+ Grayscale Image Denoising#BSD68 sigma50#PSNR
1222
+ Facial Expression Recognition#AffectNet#Accuracy (8 emotion)
1223
+ Text Summarization#CNN / Daily Mail (Anonymized)#ROUGE-L
1224
+ Link Prediction#WN18RR#MRR
1225
+ Text Summarization#CNN / Daily Mail (Anonymized)#ROUGE-2
1226
+ Linguistic Acceptability#CoLA#Accuracy
1227
+ Sentiment Analysis#Multi-Domain Sentiment Dataset#Average
1228
+ Graph Classification#HIV-fMRI-77#Accuracy
1229
+ Text Summarization#CNN / Daily Mail (Anonymized)#ROUGE-1
1230
+ Monocular Depth Estimation#NYU-Depth V2#RMSE
1231
+ Colorectal Gland Segmentation:#CRAG#F1-score
1232
+ Video Retrieval#MSVD#text-to-video R@10
1233
+ Fact-based Text Editing#WebEdit#Precision
1234
+ Speech Recognition#MediaSpeech#WER for Spanish
1235
+ Metric Learning#CARS196#R@1
1236
+ Action Classification#Moments in Time#Top 1 Accuracy
1237
+ Node Classification#Cora (0.5%)#Accuracy
1238
+ Question Answering#SQuAD1.1 dev#F1
1239
+ Question Answering#SQuAD1.1 dev#EM
1240
+ Video Instance Segmentation#YouTube-VIS validation#AR10
1241
+ Few-Shot Image Classification#Tiered ImageNet 10-way (5-shot)#Accuracy
1242
+ Few-Shot Image Classification#Mini-ImageNet-CUB 5-way (1-shot)#Accuracy
1243
+ Weakly Supervised Object Detection#PASCAL VOC 2007#MAP
1244
+ Semi-Supervised Video Object Segmentation#DAVIS 2017 (test-dev)#Jaccard (Recall)
1245
+ Image Retrieval#Par106k#mAP
1246
+ Fake News Detection#FNC-1#Per-class Accuracy (Agree)
1247
+ Fundus to Angiography Generation#Fundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic Patients#FID
1248
+ Atari Games#Atari 2600 Centipede#Score
1249
+ Image Generation#STL-10#FID
1250
+ Image Clustering#CIFAR-100#Train Set
1251
+ Weakly Supervised Object Detection#Charades#MAP
1252
+ part-of-speech_tagging#Penn Treebank#Accuracy
1253
+ word_sense_disambiguation#SemEval 2013#F1
1254
+ Unsupervised Domain Adaptation#Duke to Market#mAP
1255
+ Video Super-Resolution#Vid4 - 4x upscaling#SSIM
1256
+ Speech Enhancement#Deep Noise Suppression (DNS) Challenge#PESQ-NB
1257
+ JPEG Artifact Correction#ICB (Quality 10 Color)#SSIM
1258
+ Few-Shot Image Classification#Mini-Imagenet 10-way (5-shot)#Accuracy
1259
+ Multi-Person Pose Estimation#COCO test-dev#AP75
1260
+ Image Denoising#SIDD#PSNR (sRGB)
1261
+ RGB-D Salient Object Detection#NLPR#max F-Measure
1262
+ Action Recognition#EPIC-KITCHENS-100#Noun@1
1263
+ Node Classification#BlogCatalog#Accuracy
1264
+ Speech Enhancement#DEMAND#COVL
1265
+ Named Entity Recognition#CoNLL 2002 (Spanish)#F1
1266
+ Multi-Person Pose Estimation#COCO test-dev#AP50
1267
+ Time Series Classification#ArabicDigits#NLL
1268
+ Referring Expression Segmentation#RefCOCO testA#IoU
1269
+ Joint Entity and Relation Extraction#SciERC#Relation F1
1270
+ Action Segmentation#Breakfast#F1@50%
1271
+ Face Identification#Trillion Pairs Dataset#Accuracy
1272
+ Neural Architecture Search#ImageNet#MACs
1273
+ Sentiment Analysis#SST-2 Binary classification#Accuracy
1274
+ Monocular 3D Human Pose Estimation#Human3.6M#Use Video Sequence
1275
+ Relation Extraction#ChemProt#F1
1276
+ Atari Games#Atari 2600 Double Dunk#Score
1277
+ Node Classification#Citeseer#Validation
1278
+ Semi-Supervised Image Classification#SVHN, 250 Labels#Accuracy
1279
+ RGB-D Salient Object Detection#SIP#S-Measure
1280
+ Data-to-Text Generation#MULTIWOZ 2.1#BLEU
1281
+ Image Super-Resolution#Set14 - 2x upscaling#PSNR
1282
+ Self-Supervised Action Recognition#HMDB51#Pre-Training Dataset
1283
+ Video Retrieval#MSR-VTT-1kA#text-to-video R@5
1284
+ Video Retrieval#MSR-VTT-1kA#text-to-video R@1
1285
+ Instance Segmentation#COCO minival#AP50
1286
+ Object Detection#COCO test-dev#APS
1287
+ RGB-D Salient Object Detection#STERE#Average MAE
1288
+ Scene Text Recognition#ICDAR 2003#Accuracy
1289
+ Click-Through Rate Prediction#Criteo#AUC
1290
+ Node Classification#Citeseer#Accuracy
1291
+ JPEG Artifact Correction#Live1 (Quality 10 Grayscale)#PSNR-B
1292
+ Speech Enhancement#Deep Noise Suppression (DNS) Challenge#PESQ-WB
1293
+ Recommendation Systems#MovieLens 20M#Recall@20
1294
+ Instance Segmentation#COCO minival#AP75
1295
+ Sentiment Analysis#SemEval 2014 Task 4 Subtask 1+2#F1
1296
+ Image Classification#mini WebVision 1.0#Top-5 Accuracy
1297
+ Abstractive Text Summarization#CNN / Daily Mail#ROUGE-L
1298
+ Neural Architecture Search#NAS-Bench-201, CIFAR-10#Accuracy (val)
1299
+ Abstractive Text Summarization#CNN / Daily Mail#ROUGE-1
1300
+ Abstractive Text Summarization#CNN / Daily Mail#ROUGE-2
1301
+ Audio Classification#ESC-50#Top-1 Accuracy
1302
+ Object Detection#COCO test-dev#APM
1303
+ Object Detection#COCO test-dev#APL
1304
+ Retinal Vessel Segmentation#DRIVE#F1 score
1305
+ Music Modeling#Nottingham#NLL
1306
+ Fine-Grained Image Classification#Food-101#Accuracy
1307
+ Common Sense Reasoning#Winograd Schema Challenge#Score
1308
+ language_modeling#Hutter Prize#Number of params
1309
+ Quantization#ImageNet#Accuracy (%)
1310
+ Language Modelling#Penn Treebank (Character Level)#Number of params
1311
+ Music Source Separation#MUSDB18#SDR (drums)
1312
+ Machine Translation#WMT2016 English-German#BLEU score
1313
+ Link Prediction#OpenBioLink#Hits@10
1314
+ Image Generation#ImageNet 64x64#Bits per dim
1315
+ Few-Shot Image Classification#Mini-ImageNet-CUB 5-way (5-shot)#Accuracy
1316
+ Fine-Grained Image Classification#Oxford-IIIT Pets#PARAMS
1317
+ Grammatical Error Detection#CoNLL-2014 A1#F0.5
1318
+ Object Counting#COCO count-test#m-reIRMSE-nz
1319
+ Image Clustering#MNIST-full#Accuracy
1320
+ Visual Object Tracking#OTB-2013#AUC
1321
+ Bias Detection#StereoSet#ICAT Score
1322
+ Line Segment Detection#wireframe dataset#F1 score
1323
+ Image-to-Image Translation#ADE20K-Outdoor Labels-to-Photos#FID
1324
+ Single Image Deraining#Test100#PSNR
1325
+ Visual Dialog#Visual Dialog v1.0 test-std#NDCG (x 100)
1326
+ JPEG Artifact Correction#LIVE1 (Quality 20 Color)#PSNR
1327
+ Birds Eye View Object Detection#KITTI Cars Moderate#AP
1328
+ Language Modelling#WikiText-2#Validation perplexity
1329
+ Machine Translation#IWSLT2014 German-English#BLEU score
1330
+ Graph Classification#REDDIT-B#Accuracy
1331
+ Recommendation Systems#Netflix#nDCG@100
1332
+ Image Classification#ImageNet#Top 1 Accuracy
1333
+ Natural Language Inference#SciTail#Accuracy
1334
+ Weakly Supervised Action Localization#THUMOS 2014#mAP@0.1:0.7
1335
+ Weakly Supervised Action Localization#THUMOS 2014#mAP@0.1:0.5
1336
+ Scene Text Recognition#ICDAR2015#Accuracy
1337
+ Image Super-Resolution#Set5 - 3x upscaling#SSIM
1338
+ Crowd Counting#ShanghaiTech A#MAE
1339
+ Semi-Supervised Video Object Segmentation#YouTube-VOS#Overall
1340
+ Recommendation Systems#Douban Monti#RMSE
1341
+ Open-Domain Question Answering#Quasar#F1 (Quasar-T)
1342
+ Instance Segmentation#COCO minival#APL
1343
+ Instance Segmentation#COCO minival#APM
1344
+ Instance Segmentation#COCO minival#APS
1345
+ Semi-Supervised Video Object Segmentation#YouTube-VOS#Jaccard (Seen)
1346
+ Object Detection#KITTI Cars Hard#AP
1347
+ Task-Oriented Dialogue Systems#KVRET#Entity F1
1348
+ 3D Object Detection#KITTI Pedestrians Moderate#AP
1349
+ Multi-Person Pose Estimation#CrowdPose#mAP @0.5:0.95
1350
+ Motion Segmentation#Apolloscape#Accuracy
1351
+ Semantic Segmentation#ADE20K#Validation mIoU
1352
+ Action Recognition#EPIC-KITCHENS-100#Verb@1
1353
+ Action Recognition#THUMOS’14#mAP@0.3
1354
+ Action Recognition#THUMOS’14#mAP@0.4
1355
+ Action Recognition#THUMOS’14#mAP@0.5
1356
+ named_entity_recognition#Ontonotes v5 (English)#F1
1357
+ Action Recognition#THUMOS’14#mAP@0.1
1358
+ Action Recognition#THUMOS’14#mAP@0.2
1359
+ Action Segmentation#GTEA#F1@10%
1360
+ language_modeling#WikiText-103#Test perplexity
1361
+ Image-to-Image Translation#GTAV-to-Cityscapes Labels#mIoU
1362
+ Continual Learning#visual domain decathlon (10 tasks)#decathlon discipline (Score)
1363
+ Aspect Sentiment Triplet Extraction#SemEval#F1
1364
+ Image Super-Resolution#FFHQ 1024 x 1024 - 4x upscaling#SSIM
1365
+ Video Generation#BAIR Robot Pushing#FVD score
1366
+ Relation Extraction#ACE 2004#RE+ Micro F1
1367
+ Multi-Person Pose Estimation#COCO test-dev#AP
1368
+ Monocular Depth Estimation#KITTI Eigen split#absolute relative error
1369
+ Atari Games#Atari 2600 Tutankham#Score
1370
+ RGB-D Salient Object Detection#LFSD#Average MAE
1371
+ Unsupervised Domain Adaptation#Duke to Market#rank-10
1372
+ Dense Video Captioning#ActivityNet Captions#METEOR
1373
+ Image Super-Resolution#Set14 - 4x upscaling#PSNR
1374
+ Domain Adaptation#Office-31#Average Accuracy
1375
+ 3D Object Detection#KITTI Cyclists Moderate#AP
1376
+ Reading Comprehension#RACE#Accuracy
1377
+ Panoptic Segmentation#Cityscapes val#PQst
1378
+ Scene Text Detection#SCUT-CTW1500#Precision
1379
+ Speech Separation#wsj0-2mix#SI-SDRi
1380
+ question_answering#SearchQA#Unigram Acc
1381
+ Panoptic Segmentation#Cityscapes val#PQth
1382
+ Self-Supervised Image Classification#ImageNet (finetuned)#Top 1 Accuracy
1383
+ Unsupervised Domain Adaptation#Market to Duke#rank-10
1384
+ Continuous Control#PyBullet HalfCheetah#Return
1385
+ language_modeling#Penn Treebank#Bit per Character (BPC)
1386
+ amr_parsing#LDC2014T12#F1 on Newswire
1387
+ Time Series Classification#JapaneseVowels#Accuracy
1388
+ Weakly-supervised 3D Human Pose Estimation#Human3.6M#Average MPJPE (mm)
1389
+ Face Verification#IJB-C#TAR @ FAR=0.01
1390
+ 3D Human Pose Estimation#3DPW#MPJPE
1391
+ Neural Architecture Search#ImageNet#Top-1 Error Rate
1392
+ Fine-Grained Image Classification#Birdsnap#Accuracy
1393
+ Fact-based Text Editing#WebEdit#ADD
1394
+ Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#SSIM
1395
+ Protein Secondary Structure Prediction#CB513#Q8
1396
+ 3D Object Detection#KITTI Cars Moderate val#AP
1397
+ Action Recognition#UCF101#3-fold Accuracy
1398
+ Dense Object Detection#SKU-110K#AP
1399
+ Image Retrieval#Oxf105k#MAP
1400
+ Skeleton Based Action Recognition#Varying-view RGB-D Action-Skeleton#Accuracy (AV I)
1401
+ Sequential Image Classification#Sequential MNIST#Unpermuted Accuracy
1402
+ Node Classification#Coauthor CS#Accuracy
1403
+ Graph Classification#CIFAR10 100k#Accuracy (%)
1404
+ RGB-D Salient Object Detection#DES#Average MAE
1405
+ question_answering#SQuAD#F1
1406
+ question_answering#SQuAD#EM
1407
+ Image-to-Image Translation#Cityscapes Photo-to-Labels#Per-class Accuracy
1408
+ Video Object Detection#ImageNet VID#runtime (ms)
1409
+ Video Retrieval#MSR-VTT-1kA#text-to-video R@10
1410
+ Real-Time Object Detection#COCO#MAP
1411
+ Neural Architecture Search#NAS-Bench-201, ImageNet-16-120#Search time (s)
1412
+ Temporal Action Proposal Generation#ActivityNet-1.3#AUC (val)
1413
+ Aspect-Based Sentiment Analysis#SemEval 2014 Task 4 Sub Task 2#Restaurant (Acc)
1414
+ Time Series Classification#ArabicDigits#Accuracy
1415
+ Conditional Image Generation#ImageNet 128x128#Inception score
1416
+ Face Alignment#WFLW#AUC@0.1 (all)
1417
+ Image Classification#SVHN#Percentage error
1418
+ Semantic Textual Similarity#STS14#Spearman Correlation
1419
+ Multi-Person Pose Estimation#COCO test-dev#APL
1420
+ Multi-Person Pose Estimation#COCO test-dev#APM
1421
+ Neural Architecture Search#NAS-Bench-201, CIFAR-100#Accuracy (Test)
1422
+ 3D Instance Segmentation#S3DIS#mRec
1423
+ Image Retrieval#In-Shop#R@1
1424
+ Photo geolocation estimation#Im2GPS#Continent level (2500 km)
1425
+ Graph Classification#MUTAG#Accuracy
1426
+ Recommendation Systems#MovieLens 100K#RMSE (u1 Splits)
1427
+ Word Sense Disambiguation#WiC-TSV#Task 1 Accuracy: general purpose
1428
+ Real-Time Object Detection#COCO#inference time (ms)
1429
+ 3D Object Detection#KITTI Pedestrians Easy#AP
1430
+ Real-time Instance Segmentation#MSCOCO#mask AP
1431
+ Image Classification#MNIST#Accuracy
1432
+ Image Clustering#CIFAR-10#Train set
1433
+ Real-Time Object Detection#PASCAL VOC 2007#FPS
1434
+ Pedestrian Detection#CityPersons#Bare MR^-2
1435
+ Unsupervised Domain Adaptation#Duke to Market#rank-5
1436
+ Semantic Segmentation#Cityscapes val#mIoU
1437
+ Unsupervised Domain Adaptation#Duke to Market#rank-1
1438
+ RGB Salient Object Detection#HKU-IS#MAE
1439
+ Image Super-Resolution#Set5 - 4x upscaling#PSNR
1440
+ Image Super-Resolution#FFHQ 256 x 256 - 4x upscaling#FID
1441
+ Unsupervised Video Object Segmentation#DAVIS 2016#J&F
1442
+ Crowd Counting#WorldExpo’10#Average MAE
1443
+ Dense Object Detection#SKU-110K#AP75
1444
+ Face Alignment#AFLW2000-3D#Mean NME
1445
+ Generalized Zero-Shot Learning#SUN Attribute#Harmonic mean
1446
+ Real-Time Semantic Segmentation#CamVid#Time (ms)
1447
+ Emotion Recognition in Context#EMOTIC#mAP
1448
+ Few-Shot Image Classification#OMNIGLOT - 1-Shot, 20-way#Accuracy
1449
+ 3D Human Pose Estimation#Human3.6M#Using 2D ground-truth joints
1450
+ Spoken language identification#LRE07#30 sec
1451
+ Recommendation Systems#MovieLens 20M#Recall@50
1452
+ Stochastic Optimization#CIFAR-10 WRN-28-10 - 200 Epochs#Accuracy
1453
+ Time Series Classification#PhysioNet Challenge 2012#AUC Stdev
1454
+ Node Classification#PubMed with Public Split: fixed 20 nodes per class#Accuracy
1455
+ summarization#DUC 2004 Task 1#ROUGE-L
1456
+ 6D Pose Estimation using RGB#LineMOD#Accuracy (ADD)
1457
+ Person Search#CUHK-SYSU#Top-1
1458
+ dependency_parsing#benchmark Vietnamese dependency treebank VnDT#LAS
1459
+ 3D Human Pose Estimation#MPI-INF-3DHP#3DPCK
1460
+ summarization#DUC 2004 Task 1#ROUGE-2
1461
+ summarization#DUC 2004 Task 1#ROUGE-1
1462
+ Node Classification#PubMed (0.05%)#Accuracy
1463
+ Link Prediction#WN18RR#Hits@10
1464
+ Visual Question Answering#VCR (QA-R) test#Accuracy
1465
+ Question Answering#Natural Questions (long)#F1
1466
+ Person Re-Identification#CUHK03 detected#MAP
1467
+ Atari Games#Atari 2600 Surround#Score
1468
+ RGB-D Salient Object Detection#SIP#max F-Measure
1469
+ Atari Games#Atari 2600 Boxing#Score
1470
+ Visual Question Answering#DocVQA test#ANLS
1471
+ Unsupervised Video Object Segmentation#DAVIS 2016#F-measure (Mean)
1472
+ Traffic Prediction#METR-LA#MAE @ 12 step
1473
+ Action Segmentation#GTEA#F1@25%
1474
+ Person Re-Identification#PRID2011#Rank-20
1475
+ Scene Text Detection#COCO-Text#F-Measure
1476
+ Atari Games#Atari 2600 Bank Heist#Score
1477
+ Node Classification#Cora (1%)#Accuracy
1478
+ Monocular 3D Human Pose Estimation#Human3.6M#Average MPJPE (mm)
1479
+ Neural Network Compression#CIFAR-10#Size (MB)
1480
+ Object Counting#COCO count-test#mRMSE-nz
1481
+ Question Answering#SQuAD2.0#EM
1482
+ Facial Expression Recognition#FER2013#Accuracy
1483
+ Image Classification#STL-10#Percentage correct
1484
+ Question Answering#SQuAD2.0#F1
1485
+ Unsupervised Domain Adaptation#Market to MSMT#mAP
1486
+ machine_translation#The IWSLT 2015 Evaluation Campaign#BLEU
1487
+ Scene Text Detection#ICDAR 2015#F-Measure
1488
+ Text Classification#IMDb#Accuracy (2 classes)
1489
+ Facial Landmark Detection#300W#NME
1490
+ Unsupervised Domain Adaptation#Market to MSMT#rank-5
1491
+ Language Modelling#Text8#Number of params
1492
+ Unsupervised Domain Adaptation#Market to MSMT#rank-1
1493
+ Link Prediction#FB15k#Hits@1
1494
+ Node Classification#Texas#Accuracy
1495
+ Atari Games#Atari 2600 River Raid#Score
1496
+ Cross-View Image-to-Image Translation#Dayton (64×64) - aerial-to-ground#SSIM
1497
+ Link Prediction#FB15k#Hits@3
1498
+ Cross-Modal Retrieval#Flickr30k#Image-to-text R@10
1499
+ Supervised Video Summarization#TvSum#F1-score (Canonical)
1500
+ Few-Shot Image Classification#OMNIGLOT - 5-Shot, 5-way#Accuracy
1501
+ Sequential Image Classification#Sequential CIFAR-10#Unpermuted Accuracy
1502
+ Semi-Supervised Video Object Segmentation#DAVIS 2016#Jaccard (Recall)
1503
+ Person Re-Identification#DukeMTMC-reID#Rank-1
1504
+ Cross-Modal Retrieval#COCO 2014#Text-to-image R@10
1505
+ Semantic Segmentation#Cityscapes test#Category mIoU
1506
+ Person Re-Identification#DukeMTMC-reID#Rank-5
1507
+ Image Super-Resolution#BSD100 - 2x upscaling#SSIM
1508
+ Word Sense Disambiguation#Words in Context#Accuracy
1509
+ Action Recognition#NTU RGB+D 120#Accuracy (Cross-Setup)
1510
+ Node Classification#Pubmed#Training Split
1511
+ Weakly Supervised Action Localization#THUMOS14#avg-mAP (0.1-0.9)
1512
+ Layout-to-Image Generation#COCO-Stuff 64x64#Inception Score
1513
+ Atari Games#Atari 2600 Venture#Score
1514
+ Text Generation#MATH#Average Accuracy
1515
+ Grayscale Image Denoising#BSD68 sigma15#PSNR
1516
+ Visual Question Answering#VQA v2 test-std#other
1517
+ Question Answering#CoQA#Out-of-domain
1518
+ Semantic Textual Similarity#MRPC#Accuracy
1519
+ Human-Object Interaction Detection#HICO-DET#Time Per Frame (ms)
1520
+ Line Segment Detection#York Urban Dataset#sAP5
1521
+ Recommendation Systems#MovieLens 20M#nDCG@100
1522
+ Question Answering#RACE#RACE-h
1523
+ Question Answering#RACE#RACE-m
1524
+ Semantic Segmentation#Cityscapes test#Mean IoU (class)
1525
+ Weakly Supervised Action Localization#THUMOS14#avg-mAP (0.1-0.5)
1526
+ Superpixel Image Classification#75 Superpixel MNIST#Classification Error
1527
+ Commonsense Reasoning for RL#commonsense-rl#Avg #Steps
1528
+ Time Series Classification#PhysioNet Challenge 2012#AUC
1529
+ Pose Transfer#Deep-Fashion#SSIM
1530
+ Semi-Supervised Video Object Segmentation#DAVIS 2017 (val)#F-measure (Decay)
1531
+ Image-to-Image Translation#Cityscapes Photo-to-Labels#Per-pixel Accuracy
1532
+ text_classification#TREC#Error
1533
+ Medical Image Segmentation#Kvasir-SEG#Average MAE
1534
+ Speech Enhancement#CHiME-3#SDR
1535
+ Head Pose Estimation#AFLW2000#MAE
1536
+ Gesture-to-Gesture Translation#Senz3D#IS
1537
+ Visual Question Answering#GQA Test2019#Plausibility
1538
+ 3D Object Detection#KITTI Cars Easy#AP
1539
+ Image Clustering#MNIST-test#Accuracy
1540
+ Time Series Classification#UWave#Accuracy
1541
+ Visual Dialog#Visual Dialog v1.0 test-std#MRR (x 100)
1542
+ Image-to-Image Translation#Cityscapes Photo-to-Labels#Class IOU
1543
+ Task-Oriented Dialogue Systems#KVRET#BLEU
1544
+ word_sense_disambiguation#SemEval 2015#F1
1545
+ Image Relighting#VIDIT’20 validation set#LPIPS
1546
+ Weakly-supervised 3D Human Pose Estimation#Human3.6M#Number of Views
1547
+ JPEG Artifact Correction#ICB (Quality 10 Grayscale)#PSNR-B
1548
+ Image Classification#ImageNet#Top 5 Accuracy
1549
+ Image Clustering#CIFAR-10#Accuracy
1550
+ Atari Games#Atari 2600 Up and Down#Score
1551
+ Depth Estimation#NYU-Depth V2#RMS
1552
+ Person Re-Identification#DukeMTMC-reID#MAP
1553
+ Image Super-Resolution#WebFace - 8x upscaling#PSNR
1554
+ Graph Classification#NCI1#Accuracy
1555
+ Deblurring#GoPro#SSIM
1556
+ Hate Speech Detection#HateXplain#Macro F1
1557
+ Visual Question Answering#GQA Test2019#Validity
1558
+ machine_translation#WMT 2014 EN-DE#BLEU
1559
+ Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#LPIPS
1560
+ Visual Dialog#VisDial v0.9 val#MRR
1561
+ Keyword Spotting#Google Speech Commands#Google Speech Commands V2 12
1562
+ Grammatical Error Detection#FCE#F0.5
1563
+ Facial Expression Recognition#AffectNet#Accuracy (7 emotion)
1564
+ Emotion Recognition in Conversation#IEMOCAP#F1
1565
+ Link Prediction#FB15k#Hits@10
1566
+ JPEG Artifact Correction#ICB (Quality 10 Grayscale)#PSNR
1567
+ Semi-Supervised Image Classification#CIFAR-10, 1000 Labels#Accuracy
1568
+ Relation Extraction#NYT#F1
1569
+ Semi-Supervised Semantic Segmentation#Pascal VOC 2012 12.5% labeled#Validation mIoU
1570
+ Scene Text Detection#COCO-Text#Precision
1571
+ Keyword Spotting#Google Speech Commands#Google Speech Commands V2 35
1572
+ Weakly Supervised Action Localization#THUMOS’14#mAP@0.5
1573
+ Object Detection#COCO test-dev#box AP
1574
+ Word Sense Disambiguation#WiC-TSV#Task 1 Accuracy: domain specific
1575
+ Image Super-Resolution#BSD100 - 4x upscaling#PSNR
1576
+ Atari Games#Atari 2600 Name This Game#Score
1577
+ Relation Extraction#ACE 2005#NER Micro F1
1578
+ Data-to-Text Generation#LDC2017T10#BLEU
1579
+ Self-Supervised Action Recognition#UCF101#Pre-Training Dataset
1580
+ Pose Estimation#COCO test-dev#AR
1581
+ Pose Estimation#COCO test-dev#AP
1582
+ Graph Classification#NEURON-MULTI#Accuracy
1583
+ Relation Extraction#ACE 2005#Sentence Encoder
1584
+ Image Generation#ImageNet 32x32#bpd
1585
+ relation_prediction#FB15K-237#MRR
1586
+ Action Recognition#HMDB-51#Average accuracy of 3 splits
1587
+ Action Recognition#AVA v2.2#mAP
1588
+ ccg_supertagging#CCGBank#Accuracy
1589
+ Data-to-Text Generation#E2E NLG Challenge#BLEU
1590
+ Atari Games#Atari 2600 Star Gunner#Score
1591
+ Visual Question Answering#VCR (Q-A) test#Accuracy
1592
+ Scene Text Detection#SCUT-CTW1500#F-Measure
1593
+ Video Semantic Segmentation#Cityscapes val#mIoU
1594
+ Action Recognition#Something-Something V1#Top 1 Accuracy
1595
+ Link Prediction#FB15k-237#Hits@3
1596
+ Link Prediction#FB15k-237#Hits@1
1597
+ Text Classification#Yahoo! Answers#Accuracy
1598
+ Partial Domain Adaptation#Office-Home#Accuracy (%)
1599
+ 6D Pose Estimation using RGB#Occlusion LineMOD#Mean ADD
1600
+ Image Generation#CIFAR-10#bits/dimension
1601
+ Graph Regression#ZINC-500k#MAE
1602
+ Intent Detection#ATIS#F1
1603
+ Human Part Segmentation#PASCAL-Part#mIoU
1604
+ relation_prediction#WN18RR#H@10
1605
+ Image Retrieval with Multi-Modal Query#MIT-States#Recall@10
1606
+ Intent Detection#SNIPS#Slot F1 Score
1607
+ taxonomy_learning#SemEval 2018#P@5
1608
+ Video Instance Segmentation#YouTube-VIS validation#mask AP
1609
+ Face Detection#WIDER Face (Hard)#AP
1610
+ Image-to-Image Translation#ADE20K-Outdoor Labels-to-Photos#mIoU
1611
+ Scene Text Detection#ICDAR 2013#Recall
1612
+ Unsupervised Person Re-Identification#Market-1501#Rank-1
1613
+ dependency_parsing#Penn Treebank#POS
1614
+ question_answering#CNN / Daily Mail#Accuracy on CNN
1615
+ Optical Flow Estimation#KITTI 2015#Fl-all
1616
+ Semantic Segmentation#PASCAL VOC 2012 val#mIoU
1617
+ Named Entity Recognition#CoNLL++#F1
1618
+ Question Answering#bAbi#Accuracy (trained on 1k)
1619
+ Time Series Classification#Libras#NLL
1620
+ Dense Pixel Correspondence Estimation#HPatches#Viewpoint II AEPE
1621
+ Image Clustering#MNIST-full#NMI
1622
+ Machine Translation#WMT2015 English-German#BLEU score
1623
+ 3D Face Reconstruction#NoW Benchmark#Mean Reconstruction Error (mm)
1624
+ Semantic Segmentation#PASCAL VOC 2012 test#Mean IoU
1625
+ Relation Extraction#CoNLL04#RE+ Macro F1
1626
+ Pose Estimation#UPenn Action#Mean PCK@0.2
1627
+ Conversational Response Selection#DSTC7 Ubuntu#1-of-100 Accuracy
1628
+ Image Classification#WebVision-1000#Top-1 Accuracy
1629
+ Atari Games#Atari 2600 Yars Revenge#Score
1630
+ JPEG Artifact Correction#ICB (Quality 10 Color)#PSNR-B
1631
+ Temporal Action Localization#ActivityNet-1.3#mAP IOU@0.5
1632
+ Unsupervised Video Object Segmentation#DAVIS 2016#Jaccard (Mean)
1633
+ Image Super-Resolution#Urban100 - 2x upscaling#SSIM
1634
+ Visual Question Answering#GQA Test2019#Open
1635
+ Single Image Deraining#Rain100L#SSIM
1636
+ Entity Linking#WiC-TSV#Task 3 Accuracy: general purpose
1637
+ Scene Text Detection#MSRA-TD500#F-Measure
1638
+ Mortality Prediction#MIMIC-III#F1 score
1639
+ Video Retrieval#MSR-VTT-1kA#text-to-video Mean Rank
1640
+ Node Classification#Actor#Accuracy
1641
+ language_modeling#Penn Treebank#Test perplexity
1642
+ Gesture-to-Gesture Translation#Senz3D#PSNR
1643
+ Image Generation#CLEVR#FID-5k-training-steps
1644
+ Self-Supervised Image Classification#ImageNet#Top 1 Accuracy (kNN, k=20)
1645
+ Fine-Grained Image Classification#CUB-200-2011#Accuracy
1646
+ Lung Nodule Classification#LIDC-IDRI#Accuracy
1647
+ Link Prediction#Pubmed#AP
1648
+ Pedestrian Detection#CityPersons#Reasonable MR^-2
1649
+ Link Prediction#WN18#MRR
1650
+ Face Identification#MegaFace#Accuracy
1651
+ Domain Adaptation#VisDA2017#Accuracy
1652
+ Face Verification#MegaFace#Accuracy
1653
+ Question Answering#YahooCQA#MRR
1654
+ Scene Text Detection#COCO-Text#Recall
1655
+ Video Frame Interpolation#Vimeo90k#PSNR
1656
+ RGB Salient Object Detection#DUT-OMRON#MAE
1657
+ Image Retrieval with Multi-Modal Query#MIT-States#Recall@5
1658
+ Image Retrieval with Multi-Modal Query#MIT-States#Recall@1
1659
+ Gesture-to-Gesture Translation#NTU Hand Digit#PSNR
1660
+ Image Retrieval#SOP#R@1
1661
+ Multi-Label Classification#MS-COCO#mAP
1662
+ Keyword Spotting#Google Speech Commands#Google Speech Commands V1 12
1663
+ 3D Human Pose Estimation#MPI-INF-3DHP#AUC
1664
+ Lipreading#CAS-VSR-W1k (LRW-1000)#Top-1 Accuracy
1665
+ Weakly-Supervised Semantic Segmentation#PASCAL VOC 2012 val#Mean IoU
1666
+ Machine Translation#WMT2016 German-English#BLEU score
1667
+ Video Retrieval#MSR-VTT#video-to-text R@5
1668
+ Visual Question Answering#MSRVTT-QA#Accuracy
1669
+ Domain Generalization#ImageNet-A#Top-1 accuracy %
1670
+ Action Recognition#Jester#Val
1671
+ Image Super-Resolution#Set5 - 8x upscaling#PSNR
1672
+ Semi-Supervised Image Classification#STL-10, 1000 Labels#Accuracy
1673
+ Image Super-Resolution#Manga109 - 8x upscaling#PSNR
1674
+ Visual Question Answering#VQA v2 test-std#overall
1675
+ RGB-D Salient Object Detection#DES#max F-Measure
1676
+ Image Clustering#Fashion-MNIST#Accuracy
1677
+ Semantic Segmentation#PASCAL Context#mIoU
1678
+ Semantic Similarity#SICK#MSE
1679
+ Retinal Vessel Segmentation#STARE#F1 score
1680
+ Image Super-Resolution#FFHQ 1024 x 1024 - 4x upscaling#FID
1681
+ Machine Translation#WMT2014 English-German#BLEU score
1682
+ 3D Object Detection#KITTI Cars Hard val#AP
1683
+ Image Super-Resolution#Urban100 - 4x upscaling#PSNR
1684
+ 3D Human Pose Estimation#Human3.6M#Multi-View or Monocular
1685
+ Relation Extraction#CoNLL04#NER Macro F1
1686
+ Image Super-Resolution#BSD100 - 4x upscaling#MOS
1687
+ Semi-Supervised Image Classification#ImageNet - 1% labeled data#Top 5 Accuracy
1688
+ Weakly-Supervised Semantic Segmentation#PASCAL VOC 2012 test#Mean IoU
1689
+ Node Classification#PATTERN 100k#Accuracy (%)
1690
+ Node Classification#MAG240M-LSC#Validation Accuracy
1691
+ Image Generation#FFHQ#FID-10k-training-steps
1692
+ relation_prediction#WN18RR#MRR
1693
+ Fine-Grained Image Classification#DF20#Top-1
1694
+ Fine-Grained Image Classification#DF20#Top-3
1695
+ Word Sense Disambiguation#WiC-TSV#Task 1 Accuracy: all
1696
+ 3D Multi-Person Pose Estimation (root-relative)#MuPoTS-3D#3DPCK
1697
+ Medical Image Segmentation#Kvasir-SEG#mean Dice
1698
+ Video Retrieval#MSR-VTT#text-to-video R@1
1699
+ RGB-D Salient Object Detection#LFSD#S-Measure
1700
+ Semantic Textual Similarity#STS16#Spearman Correlation
1701
+ RGB-D Salient Object Detection#STERE#max F-Measure
1702
+ Semi-Supervised Video Object Segmentation#DAVIS 2016#F-measure (Recall)
1703
+ Sentiment Analysis#TweetEval#Emotion
1704
+ Neural Architecture Search#CIFAR-10#FLOPS
1705
+ Atari Games#Atari 2600 Kangaroo#Score
1706
+ Lane Detection#TuSimple#F1 score
1707
+ Session-Based Recommendations#Diginetica#Hit@20
1708
+ Atari Games#Atari 2600 Seaquest#Score
1709
+ Neural Architecture Search#NAS-Bench-201, CIFAR-10#Search time (s)
1710
+ Graph Classification#PROTEINS#Accuracy
1711
+ Common Sense Reasoning#SWAG#Test
1712
+ Multi-Object Tracking#MOT16#MOTA
1713
+ Semi-Supervised Video Object Segmentation#DAVIS 2016#Jaccard (Decay)
1714
+ Visual Question Answering#VQA v2 test-std#number
1715
+ Object Detection#COCO minival#APL
1716
+ Object Detection#COCO minival#APM
1717
+ Object Detection#COCO minival#APS
1718
+ Atari Games#Atari 2600 Krull#Score
1719
+ JPEG Artifact Correction#LIVE1 (Quality 10 Color)#PSNR
1720
+ Cross-Lingual Document Classification#MLDoc Zero-Shot English-to-German#Accuracy
1721
+ RGB-D Salient Object Detection#DES#max E-Measure
1722
+ Node Classification#PubMed (0.1%)#Accuracy
1723
+ Link Prediction#WN18#MR
1724
+ Semi-Supervised Image Classification#CIFAR-10, 40 Labels#Percentage error
1725
+ Scene Text Detection#ICDAR 2013#F-Measure
1726
+ Image Super-Resolution#Set5 - 2x upscaling#SSIM
1727
+ Transfer Learning#Office-Home#Accuracy
1728
+ JPEG Artifact Correction#ICB (Quality 20 Color)#PSNR-B
1729
+ Image Classification#smallNORB#Classification Error
1730
+ Image Super-Resolution#Manga109 - 2x upscaling#SSIM
1731
+ Object Detection#USB (Standard USB 1.0 protocol)#mCAP
1732
+ Deblurring#RealBlur-J (trained on GoPro)#PSNR (sRGB)
1733
+ JPEG Artifact Correction#ICB (Quality 20 Grayscale)#PSNR-B
1734
+ Aspect-Based Sentiment Analysis#SemEval 2014 Task 4 Sub Task 2#Mean Acc (Restaurant + Laptop)
1735
+ Node Classification#Chameleon#Accuracy
1736
+ Question Answering#CoQA#Overall
1737
+ Visual Object Tracking#VOT2017/18#Expected Average Overlap (EAO)
1738
+ Hate Speech Detection#HateXplain#AUROC
1739
+ Node Classification#CiteSeer (0.5%)#Accuracy
1740
+ Age-Invariant Face Recognition#CACDVS#Accuracy
1741
+ Layout-to-Image Generation#COCO-Stuff 64x64#FID
1742
+ Image Clustering#STL-10#NMI
1743
+ JPEG Artifact Correction#ICB (Quality 20 Grayscale)#SSIM
1744
+ Graph Classification#D&D#Accuracy
1745
+ Text Summarization#GigaWord#ROUGE-L
1746
+ RGB Salient Object Detection#DUTS-TE#MAE
1747
+ Natural Language Inference#SNLI#% Test Accuracy
1748
+ Text Summarization#GigaWord#ROUGE-1
1749
+ Text Summarization#GigaWord#ROUGE-2
1750
+ Unsupervised Domain Adaptation#Market to MSMT#rank-10
1751
+ Surgical tool detection#Cholec80#mAP
1752
+ RGB-D Salient Object Detection#NLPR#S-Measure
1753
+ Semantic Textual Similarity#STS15#Spearman Correlation
1754
+ Named Entity Recognition#Ontonotes v5 (English)#F1
1755
+ Unsupervised Domain Adaptation#Market to Duke#rank-1
1756
+ Heterogeneous Node Classification#DBLP (PACT) 14k#Micro-F1 (20% training data)
1757
+ Unsupervised Domain Adaptation#Market to Duke#rank-5
1758
+ Atari Games#Atari 2600 Berzerk#Score
1759
+ Image Super-Resolution#FFHQ 512 x 512 - 4x upscaling#LLE
1760
+ Image Classification#ImageNet#Number of params
1761
+ Face Detection#WIDER Face (Easy)#AP
1762
+ Action Classification#Kinetics-600#Top-1 Accuracy
1763
+ Image Super-Resolution#FFHQ 256 x 256 - 4x upscaling#SSIM
1764
+ question_answering#Quasar#F1 (Quasar-T)
1765
+ Visual Object Tracking#OTB-2015#AUC
1766
+ Text Simplification#Newsela#SARI
1767
+ Action Classification#Kinetics-700#Top-5 Accuracy
1768
+ Language Modelling#Text8#Bit per Character (BPC)
1769
+ Image Super-Resolution#Urban100 - 8x upscaling#PSNR
1770
+ Out-of-Distribution Detection#STL-10#Percentage correct
1771
+ Dense Pixel Correspondence Estimation#HPatches#Viewpoint I AEPE
1772
+ Object Detection#COCO minival#AP50
1773
+ Semi-Supervised Semantic Segmentation#Pascal VOC 2012 5% labeled#Validation mIoU
1774
+ Node Classification#Cora#Accuracy
1775
+ Aesthetics Quality Assessment#AVA#Accuracy
1776
+ Named Entity Recognition#ACE 2005#F1
1777
+ Instance Segmentation#COCO test-dev#APS
1778
+ taxonomy_learning#SemEval 2018#MRR
1779
+ Fake News Detection#FNC-1#Per-class Accuracy (Disagree)
1780
+ Instance Segmentation#COCO test-dev#APM
1781
+ Instance Segmentation#COCO test-dev#APL
1782
+ Entity Alignment#DBP15k zh-en#Hits@1
1783
+ Object Detection#COCO minival#AP75
1784
+ language_modeling#1B Words / Google Billion Word benchmark#Number of params
1785
+ Action Segmentation#GTEA#F1@50%
1786
+ Action Classification#Moments in Time#Top 5 Accuracy
1787
+ Question Answering#Children's Book Test#Accuracy-NE
1788
+ Cross-Modal Retrieval#COCO 2014#Image-to-text R@1
1789
+ Action Recognition#Sports-1M#Video hit@1
1790
+ Action Recognition#Sports-1M#Video hit@5
1791
+ Time Series Classification#PEMS#Accuracy
1792
+ Real-Time Semantic Segmentation#NYU Depth v2#Speed(ms/f)
1793
+ Cross-Modal Retrieval#COCO 2014#Image-to-text R@5
1794
+ Word Sense Disambiguation#Supervised:#Senseval 3
1795
+ Word Sense Disambiguation#Supervised:#Senseval 2
1796
+ Image-to-Image Translation#Cityscapes Labels-to-Photo#Per-class Accuracy
1797
+ Image Super-Resolution#Manga109 - 4x upscaling#PSNR
1798
+ Retinal Vessel Segmentation#CHASE_DB1#AUC
1799
+ Atari Games#Atari 2600 Frostbite#Score
1800
+ Vision and Language Navigation#VLN Challenge#oracle success
1801
+ Relation Extraction#WebNLG#F1
1802
+ Drug Discovery#Tox21#AUC
1803
+ Image Generation#FFHQ 256 x 256#FID
1804
+ Question Answering#TriviaQA#F1
1805
+ Semi-Supervised Semantic Segmentation#Pascal VOC 2012 2% labeled#Validation mIoU
1806
+ Semantic Textual Similarity#STS12#Spearman Correlation
1807
+ Fine-Grained Image Classification#DF20#F1 - macro
1808
+ Few-Shot Image Classification#FC100 5-way (1-shot)#Accuracy
1809
+ Speech Recognition#swb_hub_500 WER fullSWBCH#Percentage error
1810
+ Speech Recognition#MediaSpeech#WER for French
1811
+ Image Classification#EMNIST-Letters#Accuracy
1812
+ Time Series Classification#NetFlow#Accuracy
1813
+ Text Style Transfer#Yelp Review Dataset (Small)#G-Score (BLEU, Accuracy)
1814
+ Self-Supervised Action Recognition#HMDB51#Top-1 Accuracy
1815
+ Semantic Textual Similarity#STS13#Spearman Correlation
1816
+ Link Prediction#Cora#AP
1817
+ Relation Extraction#SemEval-2010 Task 8#F1
1818
+ Incremental Learning#CIFAR-100 - 50 classes + 5 steps of 10 classes#Average Incremental Accuracy
1819
+ Cross-View Image-to-Image Translation#cvusa#SSIM
1820
+ Speech Recognition#MediaSpeech#WER for Arabic
1821
+ Person Search#PRW#Top-1
1822
+ Image Clustering#CIFAR-100#NMI
1823
+ Face Verification#YouTube Faces DB#Accuracy
1824
+ Named Entity Recognition#CoNLL 2002 (Dutch)#F1
1825
+ Image Super-Resolution#VggFace2 - 8x upscaling#PSNR
1826
+ Lesion Segmentation#Anatomical Tracings of Lesions After Stroke (ATLAS)#Recall
1827
+ Synthetic-to-Real Translation#GTAV-to-Cityscapes Labels#mIoU
1828
+ Fine-Grained Image Classification#Oxford-IIIT Pets#Accuracy
1829
+ Image Classification#Fashion-MNIST#Percentage error
1830
+ Question Answering#Children's Book Test#Accuracy-CN
1831
+ Action Recognition#Something-Something V2#Top-5 Accuracy
1832
+ Atari Games#Atari 2600 Fishing Derby#Score
1833
+ Question Answering#NarrativeQA#BLEU-4
1834
+ Question Answering#NarrativeQA#BLEU-1
1835
+ Text Classification#20NEWS#Accuracy
1836
+ Image Denoising#DND#PSNR (sRGB)
1837
+ Visual Object Tracking#VOT2016#Expected Average Overlap (EAO)
1838
+ Semi-Supervised Image Classification#SVHN, 500 Labels#Accuracy
1839
+ sentiment_analysis#IMDb#Accuracy
1840
+ Unsupervised Person Re-Identification#DukeMTMC-reID#Rank-10
1841
+ Nested Mention Recognition#ACE 2005#F1
1842
+ Domain Adaptation#SVHN-to-MNIST#Accuracy
1843
+ Object Detection#COCO minival#box AP
1844
+ Action Recognition#EPIC-KITCHENS-100#GFLOPs
1845
+ Music Transcription#MusicNet#APS
1846
+ Semi-Supervised Image Classification#CIFAR-10, 4000 Labels#Accuracy
1847
+ Hate Speech Detection#Ethos MultiLabel#Hamming Loss
1848
+ Action Classification#Kinetics-600#GFLOPs
1849
+ Semi-Supervised Semantic Segmentation#Cityscapes 25% labeled#Validation mIoU
1850
+ Face Alignment#300W#Fullset (public)
1851
+ unknown