Trial 2
## What are you reporting:
**Contaminated model(s)**: GPT-4
**Contaminated corpora**:
conll2003
nyu-mll/glue
rajpurkar/squad_v2
https://catalog.ldc.upenn.edu/LDC2006T06
quac
natural_questions
google/boolq
**Contaminated split(s)**: If the dataset has Train, Development and/or Test splits please report the contaminated split(s). You can report a percentage of the dataset contaminated; if the entire dataset is compromised, report 100%.
The exact percentage is unclear; we only know that the model regurgitates training, validation, and test data and/or metadata for each dataset.
> You may also report instances where there is no contamination. In such cases, follow the previous instructions but report a contamination level of 0%.
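As a rough illustration of how a report like this ends up in `contamination_report.csv`, the sketch below parses one semicolon-delimited row using the header shown in this PR's diff. The helper names `parse_report` and `split_levels` are hypothetical, and treating an empty split cell as "not reported" is an assumption rather than a documented rule of the index.

```python
import csv
import io

# Header copied from contamination_report.csv as it appears in this PR's diff.
HEADER = ("Evaluation Dataset;Subset;Contaminated Source;Model or corpus;"
          "Train Split;Development Split;Test Split;Approach;Reference;PR")

def parse_report(text):
    """Parse semicolon-delimited report text into a list of dicts (hypothetical helper)."""
    reader = csv.DictReader(io.StringIO(text), delimiter=";")
    # DictReader already skips fully blank lines; also drop rows with no values.
    return [row for row in reader if any(row.values())]

def split_levels(row):
    """Return {split: float or None}. Assumption: an empty cell means 'not reported'."""
    levels = {}
    for name in ("Train Split", "Development Split", "Test Split"):
        cell = (row.get(name) or "").strip()
        levels[name] = float(cell) if cell else None
    return levels

# One of the rows added by this PR, reproduced verbatim.
sample = HEADER + "\n" + (
    "rajpurkar/squad_v2;;GPT-3.5;Model;100;100;0;model-based;"
    "https://hitz-zentroa.github.io/lm-contamination/blog/;7\n"
)
rows = parse_report(sample)
print(rows[0]["Evaluation Dataset"], split_levels(rows[0]))
```

A row with fewer than ten fields would come back with `None` for the missing columns, which makes short or truncated rows easy to flag before merging.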
## Briefly describe your method to detect data contamination
- [ ] Model-based approach
Description of your method, 3-4 sentences. Evidence of data contamination (Read below):
Prompt GPT and observe that it returns dataset metadata as well as training and validation/test examples on its own.
See more here:
https://hitz-zentroa.github.io/lm-contamination/blog/
#### Data-based approaches
Data-based approaches identify evidence of data contamination in a pre-training corpus by directly examining the dataset for instances of the evaluation data. This method involves algorithmically searching through a large pre-training dataset to find occurrences of the evaluation data. You should provide evidence of data contamination in the form: "dataset X appears in line N of corpus Y," "dataset X appears N times in corpus Y," or "N examples from dataset X appear in corpus Y."
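A minimal sketch of the data-based idea, assuming the corpus can be streamed line by line and that a match means the normalized evaluation example occurs as a substring of a corpus line. The function names and the normalization rule (lowercasing, collapsing whitespace) are illustrative assumptions, not the procedure used by any of the cited papers.

```python
import re

def normalize(text):
    """Lowercase and collapse whitespace so trivial formatting differences do not hide a match."""
    return re.sub(r"\s+", " ", text.lower()).strip()

def count_contaminated(eval_examples, corpus_lines):
    """Return (hits, evidence).

    hits     -- number of distinct evaluation examples found in the corpus
    evidence -- maps each found example to the 1-based corpus line of its first occurrence
    """
    targets = {normalize(e): e for e in eval_examples if e.strip()}
    evidence = {}
    for line_no, line in enumerate(corpus_lines, start=1):
        norm_line = normalize(line)
        for key, original in targets.items():
            if original not in evidence and key in norm_line:
                evidence[original] = line_no
    return len(evidence), evidence

# Toy data, invented for illustration only.
corpus = ["Some unrelated web text.", "Q: Is the sky blue?   A: yes", "More text."]
examples = ["is the sky blue?", "Does water boil at 50 C?"]
hits, evidence = count_contaminated(examples, corpus)
print(f"{hits} of {len(examples)} examples appear in the corpus: {evidence}")
```

The `evidence` mapping gives the "dataset X appears in line N of corpus Y" form, and `100 * hits / len(examples)` gives a percentage for the split columns. A real corpus would need an index or n-gram hashing rather than this quadratic scan.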
#### Model-based approaches
Model-based approaches, on the other hand, utilize heuristic algorithms to infer the presence of data contamination in a pre-trained model. These methods do not directly analyze the data but instead assess the model's behavior to predict data contamination. Examples include prompting the model to reproduce elements of an evaluation dataset to demonstrate memorization (e.g., https://hitz-zentroa.github.io/lm-contamination/blog/) or using perplexity measures to estimate data contamination. You should provide evidence of data contamination in the form of evaluation results of the algorithm from research papers, screenshots of model outputs that demonstrate memorization of a pre-training dataset, or any other form of evaluation that substantiates the method's effectiveness in detecting data contamination. You can provide a confidence score in your predictions.
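One way to turn a model output into a rough memorization score is sketched below. It assumes the model's response has already been collected as a string (no API call is made here) and measures how much of a reference example the response reproduces as one contiguous run of tokens. The function name and the whitespace tokenization are assumptions for illustration; this is not the scoring used in the linked blog post.

```python
from difflib import SequenceMatcher

def reproduced_fraction(model_output, reference):
    """Fraction of the reference's tokens covered by the longest run reproduced verbatim."""
    out_tokens = model_output.split()
    ref_tokens = reference.split()
    if not ref_tokens:
        return 0.0
    matcher = SequenceMatcher(None, out_tokens, ref_tokens, autojunk=False)
    match = matcher.find_longest_match(0, len(out_tokens), 0, len(ref_tokens))
    return match.size / len(ref_tokens)

# Invented strings standing in for a dataset example and two model responses.
reference = "the quick brown fox jumps over the lazy dog"
verbatim = "Sure, here is the first example: the quick brown fox jumps over the lazy dog ."
partial = "the quick red fox"

print(reproduced_fraction(verbatim, reference))
print(reproduced_fraction(partial, reference))
```

A score near 1.0 for many examples of a split suggests the split was memorized. Any cutoff used to call a dataset contaminated is a judgment call and could be reported alongside the confidence score.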
## Citation
Is there a paper that reports the data contamination or describes the method used to detect data contamination?
It is a blog post, not a paper, so we can create a BibTeX entry if we want.
URL: `https://hitz-zentroa.github.io/lm-contamination/blog/`
Citation: `@inproceedings{...`
*Important!* If you wish to be listed as an author in the final report, please complete this information for all the authors of this Pull Request.
- Full name: Leshem Choshen
- Institution: MIT-IBM Watson AI Lab, MIT
- Email: leshem.choshen@mail.huji.ac.il
- contamination_report.csv +214 -206
@@ -1,5 +1,12 @@
|
|
1 |
Evaluation Dataset;Subset;Contaminated Source;Model or corpus;Train Split;Development Split;Test Split;Approach;Reference;PR
|
2 |
-
|
3 |
lama;T-REx;allenai/c4;corpus;;;4.6;data-based;https://arxiv.org/abs/2104.08758;6
|
4 |
lama;Google-RE;allenai/c4;corpus;;;5.7;data-based;https://arxiv.org/abs/2104.08758;6
|
5 |
EdinburghNLP/xsum;;allenai/c4;corpus;;;15.49;data-based;https://arxiv.org/abs/2104.08758;6
|
@@ -15,9 +22,9 @@ nyu-mll/glue;MRPC-sentence-1;allenai/c4;corpus;;;2.7;data-based;https://arxiv.or
|
|
15 |
nyu-mll/glue;MRPC-sentence-2;allenai/c4;corpus;;;2.7;data-based;https://arxiv.org/abs/2104.08758;6
|
16 |
nyu-mll/glue;QNLI-sentence;allenai/c4;corpus;;;53.6;data-based;https://arxiv.org/abs/2104.08758;6
|
17 |
nyu-mll/glue;QNLI-question;allenai/c4;corpus;;;1.8;data-based;https://arxiv.org/abs/2104.08758;6
|
18 |
-
nyu-mll/glue;RTE-sentence-1;allenai/c4;corpus;;;6
|
19 |
nyu-mll/glue;RTE-sentence-2;allenai/c4;corpus;;;10.8;data-based;https://arxiv.org/abs/2104.08758;6
|
20 |
-
nyu-mll/glue;SST-2;allenai/c4;corpus;;;11
|
21 |
nyu-mll/glue;STS-B-sentence-1;allenai/c4;corpus;;;18.3;data-based;https://arxiv.org/abs/2104.08758;6
|
22 |
nyu-mll/glue;STS-B-sentence-2;allenai/c4;corpus;;;18.6;data-based;https://arxiv.org/abs/2104.08758;6
|
23 |
nyu-mll/glue;WNLI-sentence-1;allenai/c4;corpus;;;4.8;data-based;https://arxiv.org/abs/2104.08758;6
|
@@ -28,20 +35,20 @@ UCLNLP/adversarial_qa;adversarialQA;oscar-corpus/OSCAR-2301;corpus;;;0.03;data-b
|
|
28 |
UCLNLP/adversarial_qa;adversarialQA;EleutherAI/pile;corpus;;;0.03;data-based;https://arxiv.org/abs/2310.20707;2
|
29 |
UCLNLP/adversarial_qa;adversarialQA;togethercomputer/RedPajama-Data-V2;corpus;;;0.03;data-based;https://arxiv.org/abs/2310.20707;2
|
30 |
|
31 |
-
UCLNLP/adversarial_qa;dbert;allenai/c4;corpus;;;0
|
32 |
-
UCLNLP/adversarial_qa;dbert;oscar-corpus/OSCAR-2301;corpus;;;0
|
33 |
-
UCLNLP/adversarial_qa;dbert;EleutherAI/pile;corpus;;;0
|
34 |
-
UCLNLP/adversarial_qa;dbert;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
35 |
|
36 |
-
UCLNLP/adversarial_qa;dbidaf;allenai/c4;corpus;;;0
|
37 |
-
UCLNLP/adversarial_qa;dbidaf;oscar-corpus/OSCAR-2301;corpus;;;0
|
38 |
-
UCLNLP/adversarial_qa;dbidaf;EleutherAI/pile;corpus;;;0
|
39 |
-
UCLNLP/adversarial_qa;dbidaf;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
40 |
|
41 |
UCLNLP/adversarial_qa;droberta;allenai/c4;corpus;;;0.1;data-based;https://arxiv.org/abs/2310.20707;2
|
42 |
UCLNLP/adversarial_qa;droberta;oscar-corpus/OSCAR-2301;corpus;;;0.1;data-based;https://arxiv.org/abs/2310.20707;2
|
43 |
UCLNLP/adversarial_qa;droberta;EleutherAI/pile;corpus;;;0.1;data-based;https://arxiv.org/abs/2310.20707;2
|
44 |
-
UCLNLP/adversarial_qa;droberta;togethercomputer/RedPajama-Data-V2;corpus;;;0.1;data-based;https://arxiv.org/abs/2310.20707
|
45 |
|
46 |
aeslc;;allenai/c4;corpus;;;1.57;data-based;https://arxiv.org/abs/2310.20707;2
|
47 |
aeslc;;oscar-corpus/OSCAR-2301;corpus;;;0.31;data-based;https://arxiv.org/abs/2310.20707;2
|
@@ -49,7 +56,7 @@ aeslc;;EleutherAI/pile;corpus;;;45.49;data-based;https://arxiv.org/abs/2310.2070
|
|
49 |
aeslc;;togethercomputer/RedPajama-Data-V2;corpus;;;0.1;data-based;https://arxiv.org/abs/2310.20707;2
|
50 |
|
51 |
amazon_reviews_multi;;allenai/c4;corpus;;;2.28;data-based;https://arxiv.org/abs/2310.20707;2
|
52 |
-
amazon_reviews_multi;;oscar-corpus/OSCAR-2301;corpus;;;2.
|
53 |
amazon_reviews_multi;;EleutherAI/pile;corpus;;;1.48;data-based;https://arxiv.org/abs/2310.20707;2
|
54 |
amazon_reviews_multi;;togethercomputer/RedPajama-Data-V2;corpus;;;2.06;data-based;https://arxiv.org/abs/2310.20707;2
|
55 |
|
@@ -58,23 +65,23 @@ billsum;;oscar-corpus/OSCAR-2301;corpus;;;0.06;data-based;https://arxiv.org/abs/
|
|
58 |
billsum;;EleutherAI/pile;corpus;;;0.03;data-based;https://arxiv.org/abs/2310.20707;2
|
59 |
billsum;;togethercomputer/RedPajama-Data-V2;corpus;;;0.06;data-based;https://arxiv.org/abs/2310.20707;2
|
60 |
|
61 |
-
cosmos_qa;;allenai/c4;corpus;;;0
|
62 |
-
cosmos_qa;;oscar-corpus/OSCAR-2301;corpus;;;0
|
63 |
-
cosmos_qa;;EleutherAI/pile;corpus;;;0
|
64 |
-
cosmos_qa;;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
65 |
|
66 |
-
crows_pairs;;allenai/c4;corpus;;;0
|
67 |
crows_pairs;;oscar-corpus/OSCAR-2301;corpus;;;0.2;data-based;https://arxiv.org/abs/2310.20707;2
|
68 |
-
crows_pairs;;EleutherAI/pile;corpus;;;0
|
69 |
-
crows_pairs;;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
70 |
|
71 |
-
ibm/duorc;ParaphraseRC;allenai/c4;corpus;;;0
|
72 |
-
ibm/duorc;ParaphraseRC;oscar-corpus/OSCAR-2301;corpus;;;0
|
73 |
-
ibm/duorc;ParaphraseRC;EleutherAI/pile;corpus;;;0
|
74 |
-
ibm/duorc;ParaphraseRC;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
75 |
|
76 |
ibm/duorc;SelfRC;allenai/c4;corpus;;;0.01;data-based;https://arxiv.org/abs/2310.20707;2
|
77 |
-
ibm/duorc;SelfRC;oscar-corpus/OSCAR-2301;corpus;;;0
|
78 |
ibm/duorc;SelfRC;EleutherAI/pile;corpus;;;0.02;data-based;https://arxiv.org/abs/2310.20707;2
|
79 |
ibm/duorc;SelfRC;togethercomputer/RedPajama-Data-V2;corpus;;;0.02;data-based;https://arxiv.org/abs/2310.20707;2
|
80 |
|
@@ -104,7 +111,7 @@ nyu-mll/glue;mnli-mismatched;EleutherAI/pile;corpus;;;2.11;data-based;https://ar
|
|
104 |
nyu-mll/glue;mnli-mismatched;togethercomputer/RedPajama-Data-V2;corpus;;;2.17;data-based;https://arxiv.org/abs/2310.20707;2
|
105 |
|
106 |
nyu-mll/glue;mrpc;allenai/c4;corpus;;;0.06;data-based;https://arxiv.org/abs/2310.20707;2
|
107 |
-
nyu-mll/glue;mrpc;oscar-corpus/OSCAR-2301;corpus;;;0
|
108 |
nyu-mll/glue;mrpc;EleutherAI/pile;corpus;;;0.64;data-based;https://arxiv.org/abs/2310.20707;2
|
109 |
nyu-mll/glue;mrpc;togethercomputer/RedPajama-Data-V2;corpus;;;1.16;data-based;https://arxiv.org/abs/2310.20707;2
|
110 |
|
@@ -113,7 +120,7 @@ nyu-mll/glue;qnli;oscar-corpus/OSCAR-2301;corpus;;;0.04;data-based;https://arxiv
|
|
113 |
nyu-mll/glue;qnli;EleutherAI/pile;corpus;;;1.48;data-based;https://arxiv.org/abs/2310.20707;2
|
114 |
nyu-mll/glue;qnli;togethercomputer/RedPajama-Data-V2;corpus;;;1.21;data-based;https://arxiv.org/abs/2310.20707;2
|
115 |
|
116 |
-
nyu-mll/glue;rte;allenai/c4;corpus;;;0.
|
117 |
nyu-mll/glue;rte;oscar-corpus/OSCAR-2301;corpus;;;0.17;data-based;https://arxiv.org/abs/2310.20707;2
|
118 |
nyu-mll/glue;rte;EleutherAI/pile;corpus;;;0.13;data-based;https://arxiv.org/abs/2310.20707;2
|
119 |
nyu-mll/glue;rte;togethercomputer/RedPajama-Data-V2;corpus;;;67.47;data-based;https://arxiv.org/abs/2310.20707;2
|
@@ -123,9 +130,9 @@ nyu-mll/glue;stsb;oscar-corpus/OSCAR-2301;corpus;;;3.12;data-based;https://arxiv
|
|
123 |
nyu-mll/glue;stsb;EleutherAI/pile;corpus;;;11.09;data-based;https://arxiv.org/abs/2310.20707;2
|
124 |
nyu-mll/glue;stsb;togethercomputer/RedPajama-Data-V2;corpus;;;9.86;data-based;https://arxiv.org/abs/2310.20707;2
|
125 |
|
126 |
-
nyu-mll/glue;wnli;allenai/c4;corpus;;;0
|
127 |
-
nyu-mll/glue;wnli;oscar-corpus/OSCAR-2301;corpus;;;0
|
128 |
-
nyu-mll/glue;wnli;EleutherAI/pile;corpus;;;0
|
129 |
nyu-mll/glue;wnli;togethercomputer/RedPajama-Data-V2;corpus;;;2.05;data-based;https://arxiv.org/abs/2310.20707;2
|
130 |
|
131 |
head_qa;en;allenai/c4;corpus;;;5.22;data-based;https://arxiv.org/abs/2310.20707;2
|
@@ -134,57 +141,57 @@ head_qa;en;EleutherAI/pile;corpus;;;5.11;data-based;https://arxiv.org/abs/2310.2
|
|
134 |
head_qa;en;togethercomputer/RedPajama-Data-V2;corpus;;;5.94;data-based;https://arxiv.org/abs/2310.20707;2
|
135 |
|
136 |
health_fact;;allenai/c4;corpus;;;7.53;data-based;https://arxiv.org/abs/2310.20707;2
|
137 |
-
health_fact;;oscar-corpus/OSCAR-2301;corpus;;;3.
|
138 |
health_fact;;EleutherAI/pile;corpus;;;1.94;data-based;https://arxiv.org/abs/2310.20707;2
|
139 |
-
health_fact;;togethercomputer/RedPajama-Data-V2;corpus;;;18.
|
140 |
|
141 |
-
hlgd;;allenai/c4;corpus;;;0
|
142 |
-
hlgd;;oscar-corpus/OSCAR-2301;corpus;;;0
|
143 |
-
hlgd;;EleutherAI/pile;corpus;;;0
|
144 |
-
hlgd;;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
145 |
|
146 |
liar;;allenai/c4;corpus;;;29.23;data-based;https://arxiv.org/abs/2310.20707;2
|
147 |
liar;;oscar-corpus/OSCAR-2301;corpus;;;13.95;data-based;https://arxiv.org/abs/2310.20707;2
|
148 |
liar;;EleutherAI/pile;corpus;;;10.91;data-based;https://arxiv.org/abs/2310.20707;2
|
149 |
liar;;togethercomputer/RedPajama-Data-V2;corpus;;;45.05;data-based;https://arxiv.org/abs/2310.20707;2
|
150 |
|
151 |
-
math_dataset;algebra__linear_1d;allenai/c4;corpus;;;0
|
152 |
-
math_dataset;algebra__linear_1d;oscar-corpus/OSCAR-2301;corpus;;;0
|
153 |
-
math_dataset;algebra__linear_1d;EleutherAI/pile;corpus;;;0
|
154 |
-
math_dataset;algebra__linear_1d;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
155 |
|
156 |
-
math_dataset;algebra__linear_2d;allenai/c4;corpus;;;0
|
157 |
-
math_dataset;algebra__linear_2d;oscar-corpus/OSCAR-2301;corpus;;;0
|
158 |
-
math_dataset;algebra__linear_2d;EleutherAI/pile;corpus;;;0
|
159 |
-
math_dataset;algebra__linear_2d;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
160 |
|
161 |
-
math_dataset;algebra__linear_2d_composed;allenai/c4;corpus;;;0
|
162 |
-
math_dataset;algebra__linear_2d_composed;oscar-corpus/OSCAR-2301;corpus;;;0
|
163 |
-
math_dataset;algebra__linear_2d_composed;EleutherAI/pile;corpus;;;0
|
164 |
-
math_dataset;algebra__linear_2d_composed;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
165 |
|
166 |
math_qa;;allenai/c4;corpus;;;0.34;data-based;https://arxiv.org/abs/2310.20707;2
|
167 |
math_qa;;oscar-corpus/OSCAR-2301;corpus;;;0.03;data-based;https://arxiv.org/abs/2310.20707;2
|
168 |
-
math_qa;;EleutherAI/pile;corpus;;;0
|
169 |
math_qa;;togethercomputer/RedPajama-Data-V2;corpus;;;0.07;data-based;https://arxiv.org/abs/2310.20707;2
|
170 |
|
171 |
-
mc_taco;;allenai/c4;corpus;;;0
|
172 |
-
mc_taco;;oscar-corpus/OSCAR-2301;corpus;;;0
|
173 |
-
mc_taco;;EleutherAI/pile;corpus;;;0
|
174 |
mc_taco;;togethercomputer/RedPajama-Data-V2;corpus;;;0.14;data-based;https://arxiv.org/abs/2310.20707;2
|
175 |
|
176 |
-
mocha;;allenai/c4;corpus;;;0
|
177 |
-
mocha;;oscar-corpus/OSCAR-2301;corpus;;;0
|
178 |
-
mocha;;EleutherAI/pile;corpus;;;0
|
179 |
-
mocha;;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
180 |
|
181 |
-
openai_humaneval;;allenai/c4;corpus;;;0
|
182 |
openai_humaneval;;oscar-corpus/OSCAR-2301;corpus;;;1.22;data-based;https://arxiv.org/abs/2310.20707;2
|
183 |
-
openai_humaneval;;EleutherAI/pile;corpus;;;0
|
184 |
-
openai_humaneval;;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
185 |
|
186 |
paws-x;en;allenai/c4;corpus;;;0.05;data-based;https://arxiv.org/abs/2310.20707;2
|
187 |
-
paws-x;en;oscar-corpus/OSCAR-2301;corpus;;;0
|
188 |
paws-x;en;EleutherAI/pile;corpus;;;0.15;data-based;https://arxiv.org/abs/2310.20707;2
|
189 |
paws-x;en;togethercomputer/RedPajama-Data-V2;corpus;;;0.2;data-based;https://arxiv.org/abs/2310.20707;2
|
190 |
|
@@ -200,247 +207,248 @@ piqa;;togethercomputer/RedPajama-Data-V2;corpus;;;0.13;data-based;https://arxiv.
|
|
200 |
|
201 |
race;all;allenai/c4;corpus;;;0.14;data-based;https://arxiv.org/abs/2310.20707;2
|
202 |
race;all;oscar-corpus/OSCAR-2301;corpus;;;0.06;data-based;https://arxiv.org/abs/2310.20707;2
|
203 |
-
race;all;EleutherAI/pile;corpus;;;0
|
204 |
race;all;togethercomputer/RedPajama-Data-V2;corpus;;;0.28;data-based;https://arxiv.org/abs/2310.20707;2
|
205 |
|
206 |
race;high;allenai/c4;corpus;;;0.11;data-based;https://arxiv.org/abs/2310.20707;2
|
207 |
-
race;high;oscar-corpus/OSCAR-2301;corpus;;;0
|
208 |
-
race;high;EleutherAI/pile;corpus;;;0
|
209 |
race;high;togethercomputer/RedPajama-Data-V2;corpus;;;0.26;data-based;https://arxiv.org/abs/2310.20707;2
|
210 |
|
211 |
race;middle;allenai/c4;corpus;;;0.21;data-based;https://arxiv.org/abs/2310.20707;2
|
212 |
race;middle;oscar-corpus/OSCAR-2301;corpus;;;0.21;data-based;https://arxiv.org/abs/2310.20707;2
|
213 |
-
race;middle;EleutherAI/pile;corpus;;;0
|
214 |
race;middle;togethercomputer/RedPajama-Data-V2;corpus;;;0.35;data-based;https://arxiv.org/abs/2310.20707;2
|
215 |
|
216 |
-
allenai/ropes;;allenai/c4;corpus;;;0
|
217 |
-
allenai/ropes;;oscar-corpus/OSCAR-2301;corpus;;;0
|
218 |
-
allenai/ropes;;EleutherAI/pile;corpus;;;0
|
219 |
-
allenai/ropes;;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
220 |
|
221 |
-
samsum;;allenai/c4;corpus;;;0
|
222 |
-
samsum;;oscar-corpus/OSCAR-2301;corpus;;;0
|
223 |
-
samsum;;EleutherAI/pile;corpus;;;0
|
224 |
samsum;;togethercomputer/RedPajama-Data-V2;corpus;;;0.12;data-based;https://arxiv.org/abs/2310.20707;2
|
225 |
|
226 |
-
scan;addprim_jump;allenai/c4;corpus;;;0
|
227 |
-
scan;addprim_jump;oscar-corpus/OSCAR-2301;corpus;;;0
|
228 |
scan;addprim_jump;EleutherAI/pile;corpus;;;0.05;data-based;https://arxiv.org/abs/2310.20707;2
|
229 |
scan;addprim_jump;togethercomputer/RedPajama-Data-V2;corpus;;;0.16;data-based;https://arxiv.org/abs/2310.20707;2
|
230 |
|
231 |
-
scan;addprim_turn;allenai/c4;corpus;;;0
|
232 |
-
scan;addprim_turn;oscar-corpus/OSCAR-2301;corpus;;;0
|
233 |
scan;addprim_turn;EleutherAI/pile;corpus;;;0.08;data-based;https://arxiv.org/abs/2310.20707;2
|
234 |
-
scan;addprim_turn;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
235 |
-
|
236 |
-
scan;filler_num0;allenai/c4;corpus;;;0
|
237 |
-
scan;filler_num0;oscar-corpus/OSCAR-2301;corpus;;;0
|
238 |
-
scan;filler_num0;EleutherAI/pile;corpus;;;0
|
239 |
scan;filler_num0;togethercomputer/RedPajama-Data-V2;corpus;;;0.9;data-based;https://arxiv.org/abs/2310.20707;2
|
240 |
-
|
241 |
-
scan;length;allenai/c4;corpus;;;0
|
242 |
-
scan;length;oscar-corpus/OSCAR-2301;corpus;;;0
|
243 |
scan;length;EleutherAI/pile;corpus;;;0.03;data-based;https://arxiv.org/abs/2310.20707;2
|
244 |
-
scan;length;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
245 |
-
|
246 |
scan;simple;allenai/c4;corpus;;;0.02;data-based;https://arxiv.org/abs/2310.20707;2
|
247 |
-
scan;simple;oscar-corpus/OSCAR-2301;corpus;;;0
|
248 |
scan;simple;EleutherAI/pile;corpus;;;0.1;data-based;https://arxiv.org/abs/2310.20707;2
|
249 |
scan;simple;togethercomputer/RedPajama-Data-V2;corpus;;;0.26;data-based;https://arxiv.org/abs/2310.20707;2
|
250 |
-
|
251 |
-
scan;template_around;allenai/c4;corpus;;;0
|
252 |
-
scan;template_around;oscar-corpus/OSCAR-2301;corpus;;;0
|
253 |
-
scan;template_around;EleutherAI/pile;corpus;;;0
|
254 |
scan;template_around;togethercomputer/RedPajama-Data-V2;corpus;;;0.18;data-based;https://arxiv.org/abs/2310.20707;2
|
255 |
-
|
256 |
-
scan;template_jump;allenai/c4;corpus;;;0
|
257 |
-
scan;template_jump;oscar-corpus/OSCAR-2301;corpus;;;0
|
258 |
-
scan;template_jump;EleutherAI/pile;corpus;;;0
|
259 |
scan;template_jump;togethercomputer/RedPajama-Data-V2;corpus;;;0.9;data-based;https://arxiv.org/abs/2310.20707;2
|
260 |
-
|
261 |
-
scan;template_opposite;allenai/c4;corpus;;;0
|
262 |
-
scan;template_opposite;oscar-corpus/OSCAR-2301;corpus;;;0
|
263 |
scan;template_opposite;EleutherAI/pile;corpus;;;0.04;data-based;https://arxiv.org/abs/2310.20707;2
|
264 |
scan;template_opposite;togethercomputer/RedPajama-Data-V2;corpus;;;0.16;data-based;https://arxiv.org/abs/2310.20707;2
|
265 |
-
|
266 |
-
scan;template_right;allenai/c4;corpus;;;0
|
267 |
-
scan;template_right;oscar-corpus/OSCAR-2301;corpus;;;0
|
268 |
scan;template_right;EleutherAI/pile;corpus;;;0.11;data-based;https://arxiv.org/abs/2310.20707;2
|
269 |
scan;template_right;togethercomputer/RedPajama-Data-V2;corpus;;;0.16;data-based;https://arxiv.org/abs/2310.20707;2
|
270 |
-
|
271 |
allenai/scicite;;allenai/c4;corpus;;;1.78;data-based;https://arxiv.org/abs/2310.20707;2
|
272 |
allenai/scicite;;oscar-corpus/OSCAR-2301;corpus;;;1.51;data-based;https://arxiv.org/abs/2310.20707;2
|
273 |
allenai/scicite;;EleutherAI/pile;corpus;;;0.86;data-based;https://arxiv.org/abs/2310.20707;2
|
274 |
allenai/scicite;;togethercomputer/RedPajama-Data-V2;corpus;;;1.72;data-based;https://arxiv.org/abs/2310.20707;2
|
275 |
-
|
276 |
scitail;snli_format;allenai/c4;corpus;;;0.09;data-based;https://arxiv.org/abs/2310.20707;2
|
277 |
scitail;snli_format;oscar-corpus/OSCAR-2301;corpus;;;0.38;data-based;https://arxiv.org/abs/2310.20707;2
|
278 |
scitail;snli_format;EleutherAI/pile;corpus;;;0.28;data-based;https://arxiv.org/abs/2310.20707;2
|
279 |
scitail;snli_format;togethercomputer/RedPajama-Data-V2;corpus;;;0.71;data-based;https://arxiv.org/abs/2310.20707;2
|
280 |
-
|
281 |
scitail;tsv_format;allenai/c4;corpus;;;0.09;data-based;https://arxiv.org/abs/2310.20707;2
|
282 |
scitail;tsv_format;oscar-corpus/OSCAR-2301;corpus;;;0.38;data-based;https://arxiv.org/abs/2310.20707;2
|
283 |
scitail;tsv_format;EleutherAI/pile;corpus;;;0.28;data-based;https://arxiv.org/abs/2310.20707;2
|
284 |
scitail;tsv_format;togethercomputer/RedPajama-Data-V2;corpus;;;0.71;data-based;https://arxiv.org/abs/2310.20707;2
|
285 |
-
|
286 |
sem_eval_2014_task_1;;allenai/c4;corpus;;;0.35;data-based;https://arxiv.org/abs/2310.20707;2
|
287 |
sem_eval_2014_task_1;;oscar-corpus/OSCAR-2301;corpus;;;0.18;data-based;https://arxiv.org/abs/2310.20707;2
|
288 |
sem_eval_2014_task_1;;EleutherAI/pile;corpus;;;4.89;data-based;https://arxiv.org/abs/2310.20707;2
|
289 |
sem_eval_2014_task_1;;togethercomputer/RedPajama-Data-V2;corpus;;;52.81;data-based;https://arxiv.org/abs/2310.20707;2
|
290 |
-
|
291 |
sick;;allenai/c4;corpus;;;0.31;data-based;https://arxiv.org/abs/2310.20707;2
|
292 |
sick;;oscar-corpus/OSCAR-2301;corpus;;;0.18;data-based;https://arxiv.org/abs/2310.20707;2
|
293 |
sick;;EleutherAI/pile;corpus;;;4.79;data-based;https://arxiv.org/abs/2310.20707;2
|
294 |
sick;;togethercomputer/RedPajama-Data-V2;corpus;;;52.61;data-based;https://arxiv.org/abs/2310.20707;2
|
295 |
-
|
296 |
snli;;allenai/c4;corpus;;;0.04;data-based;https://arxiv.org/abs/2310.20707;2
|
297 |
snli;;oscar-corpus/OSCAR-2301;corpus;;;0.08;data-based;https://arxiv.org/abs/2310.20707;2
|
298 |
snli;;EleutherAI/pile;corpus;;;1.11;data-based;https://arxiv.org/abs/2310.20707;2
|
299 |
snli;;togethercomputer/RedPajama-Data-V2;corpus;;;1.22;data-based;https://arxiv.org/abs/2310.20707;2
|
300 |
-
|
301 |
-
squadshifts;amazon;allenai/c4;corpus;;;0
|
302 |
-
squadshifts;amazon;oscar-corpus/OSCAR-2301;corpus;;;0
|
303 |
-
squadshifts;amazon;EleutherAI/pile;corpus;;;0
|
304 |
-
squadshifts;amazon;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
305 |
-
|
306 |
squadshifts;new_wiki;allenai/c4;corpus;;;0.01;data-based;https://arxiv.org/abs/2310.20707;2
|
307 |
squadshifts;new_wiki;oscar-corpus/OSCAR-2301;corpus;;;0.01;data-based;https://arxiv.org/abs/2310.20707;2
|
308 |
squadshifts;new_wiki;EleutherAI/pile;corpus;;;0.01;data-based;https://arxiv.org/abs/2310.20707;2
|
309 |
squadshifts;new_wiki;togethercomputer/RedPajama-Data-V2;corpus;;;0.03;data-based;https://arxiv.org/abs/2310.20707;2
|
310 |
-
|
311 |
squadshifts;nyt;allenai/c4;corpus;;;0.01;data-based;https://arxiv.org/abs/2310.20707;2
|
312 |
squadshifts;nyt;oscar-corpus/OSCAR-2301;corpus;;;0.03;data-based;https://arxiv.org/abs/2310.20707;2
|
313 |
squadshifts;nyt;EleutherAI/pile;corpus;;;0.02;data-based;https://arxiv.org/abs/2310.20707;2
|
314 |
squadshifts;nyt;togethercomputer/RedPajama-Data-V2;corpus;;;0.04;data-based;https://arxiv.org/abs/2310.20707;2
|
315 |
-
|
316 |
stsb_multi_mt;;allenai/c4;corpus;;;3.48;data-based;https://arxiv.org/abs/2310.20707;2
|
317 |
stsb_multi_mt;;oscar-corpus/OSCAR-2301;corpus;;;3.12;data-based;https://arxiv.org/abs/2310.20707;2
|
318 |
stsb_multi_mt;;EleutherAI/pile;corpus;;;11.09;data-based;https://arxiv.org/abs/2310.20707;2
|
319 |
stsb_multi_mt;;togethercomputer/RedPajama-Data-V2;corpus;;;9.86;data-based;https://arxiv.org/abs/2310.20707;2
|
320 |
-
|
321 |
-
subjqa;books;allenai/c4;corpus;;;0
|
322 |
-
subjqa;books;oscar-corpus/OSCAR-2301;corpus;;;0
|
323 |
-
subjqa;books;EleutherAI/pile;corpus;;;0
|
324 |
-
subjqa;books;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
325 |
-
|
326 |
-
subjqa;grocery;allenai/c4;corpus;;;0
|
327 |
-
subjqa;grocery;oscar-corpus/OSCAR-2301;corpus;;;0
|
328 |
-
subjqa;grocery;EleutherAI/pile;corpus;;;0
|
329 |
-
subjqa;grocery;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
330 |
-
|
331 |
-
subjqa;movies;allenai/c4;corpus;;;0
|
332 |
-
subjqa;movies;oscar-corpus/OSCAR-2301;corpus;;;0
|
333 |
-
subjqa;movies;EleutherAI/pile;corpus;;;0
|
334 |
-
subjqa;movies;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
335 |
-
|
336 |
-
subjqa;restaurants;allenai/c4;corpus;;;0
|
337 |
-
subjqa;restaurants;oscar-corpus/OSCAR-2301;corpus;;;0
|
338 |
-
subjqa;restaurants;EleutherAI/pile;corpus;;;0
|
339 |
-
subjqa;restaurants;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
340 |
-
|
341 |
super_glue;axb;allenai/c4;corpus;;;1.99;data-based;https://arxiv.org/abs/2310.20707;2
|
342 |
super_glue;axb;oscar-corpus/OSCAR-2301;corpus;;;1.45;data-based;https://arxiv.org/abs/2310.20707;2
|
343 |
super_glue;axb;EleutherAI/pile;corpus;;;5.07;data-based;https://arxiv.org/abs/2310.20707;2
|
344 |
super_glue;axb;togethercomputer/RedPajama-Data-V2;corpus;;;6.16;data-based;https://arxiv.org/abs/2310.20707;2
|
345 |
-
|
346 |
-
super_glue;axg;allenai/c4;corpus;;;0
|
347 |
-
super_glue;axg;oscar-corpus/OSCAR-2301;corpus;;;0
|
348 |
super_glue;axg;EleutherAI/pile;corpus;;;0.28;data-based;https://arxiv.org/abs/2310.20707;2
|
349 |
-
super_glue;axg;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
350 |
-
|
351 |
-
super_glue;boolq;allenai/c4;corpus;;;0
|
352 |
super_glue;boolq;oscar-corpus/OSCAR-2301;corpus;;;3.05;data-based;https://arxiv.org/abs/2310.20707;2
|
353 |
-
super_glue;boolq;EleutherAI/pile;corpus;;;0
|
354 |
super_glue;boolq;togethercomputer/RedPajama-Data-V2;corpus;;;0.03;data-based;https://arxiv.org/abs/2310.20707;2
|
355 |
-
|
356 |
-
super_glue;cb;allenai/c4;corpus;;;0
|
357 |
-
super_glue;cb;oscar-corpus/OSCAR-2301;corpus;;;0
|
358 |
-
super_glue;cb;EleutherAI/pile;corpus;;;2
|
359 |
super_glue;cb;togethercomputer/RedPajama-Data-V2;corpus;;;1.6;data-based;https://arxiv.org/abs/2310.20707;2
|
360 |
-
|
361 |
super_glue;copa;allenai/c4;corpus;;;0.6;data-based;https://arxiv.org/abs/2310.20707;2
|
362 |
-
super_glue;copa;oscar-corpus/OSCAR-2301;corpus;;;1
|
363 |
super_glue;copa;EleutherAI/pile;corpus;;;1.2;data-based;https://arxiv.org/abs/2310.20707;2
|
364 |
-
super_glue;copa;togethercomputer/RedPajama-Data-V2;corpus;;;100
|
365 |
-
|
366 |
-
super_glue;multirc;allenai/c4;corpus;;;0
|
367 |
-
super_glue;multirc;oscar-corpus/OSCAR-2301;corpus;;;0
|
368 |
-
super_glue;multirc;EleutherAI/pile;corpus;;;0
|
369 |
-
super_glue;multirc;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
370 |
-
|
371 |
-
super_glue;record;allenai/c4;corpus;;;0
|
372 |
-
super_glue;record;oscar-corpus/OSCAR-2301;corpus;;;0
|
373 |
-
super_glue;record;EleutherAI/pile;corpus;;;0
|
374 |
-
super_glue;record;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
375 |
-
|
376 |
super_glue;rte;allenai/c4;corpus;;;0.2;data-based;https://arxiv.org/abs/2310.20707;2
|
377 |
super_glue;rte;oscar-corpus/OSCAR-2301;corpus;;;0.17;data-based;https://arxiv.org/abs/2310.20707;2
|
378 |
super_glue;rte;EleutherAI/pile;corpus;;;0.13;data-based;https://arxiv.org/abs/2310.20707;2
|
379 |
super_glue;rte;togethercomputer/RedPajama-Data-V2;corpus;;;67.47;data-based;https://arxiv.org/abs/2310.20707;2
|
380 |
-
|
381 |
super_glue;wic;allenai/c4;corpus;;;64.43;data-based;https://arxiv.org/abs/2310.20707;2
|
382 |
super_glue;wic;oscar-corpus/OSCAR-2301;corpus;;;49.43;data-based;https://arxiv.org/abs/2310.20707;2
|
383 |
super_glue;wic;EleutherAI/pile;corpus;;;18.57;data-based;https://arxiv.org/abs/2310.20707;2
|
384 |
super_glue;wic;togethercomputer/RedPajama-Data-V2;corpus;;;60.21;data-based;https://arxiv.org/abs/2310.20707;2
|
385 |
-
|
386 |
swag;regular;allenai/c4;corpus;;;2.48;data-based;https://arxiv.org/abs/2310.20707;2
|
387 |
swag;regular;oscar-corpus/OSCAR-2301;corpus;;;1.65;data-based;https://arxiv.org/abs/2310.20707;2
|
388 |
swag;regular;EleutherAI/pile;corpus;;;2.21;data-based;https://arxiv.org/abs/2310.20707;2
|
389 |
swag;regular;togethercomputer/RedPajama-Data-V2;corpus;;;2.79;data-based;https://arxiv.org/abs/2310.20707;2
|
390 |
-
|
391 |
-
tab_fact;tab_fact;allenai/c4;corpus;;;0
|
392 |
-
tab_fact;tab_fact;oscar-corpus/OSCAR-2301;corpus;;;0
|
393 |
-
tab_fact;tab_fact;EleutherAI/pile;corpus;;;0
|
394 |
-
tab_fact;tab_fact;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
395 |
-
|
396 |
wiki_qa;;allenai/c4;corpus;;;0.24;data-based;https://arxiv.org/abs/2310.20707;2
|
397 |
wiki_qa;;oscar-corpus/OSCAR-2301;corpus;;;0.18;data-based;https://arxiv.org/abs/2310.20707;2
|
398 |
wiki_qa;;EleutherAI/pile;corpus;;;0.19;data-based;https://arxiv.org/abs/2310.20707;2
|
399 |
wiki_qa;;togethercomputer/RedPajama-Data-V2;corpus;;;0.91;data-based;https://arxiv.org/abs/2310.20707;2
|
400 |
-
|
401 |
-
winograd_wsc;wsc273;allenai/c4;corpus;;;29.
|
402 |
-
winograd_wsc;wsc273;oscar-corpus/OSCAR-2301;corpus;;;30.
|
403 |
winograd_wsc;wsc273;EleutherAI/pile;corpus;;;32.23;data-based;https://arxiv.org/abs/2310.20707;2
|
404 |
winograd_wsc;wsc273;togethercomputer/RedPajama-Data-V2;corpus;;;58.24;data-based;https://arxiv.org/abs/2310.20707;2
|
405 |
-
|
406 |
-
winogrande;winogrande_xl;allenai/c4;corpus;;;0
|
407 |
-
winogrande;winogrande_xl;oscar-corpus/OSCAR-2301;corpus;;;0
|
408 |
-
winogrande;winogrande_xl;EleutherAI/pile;corpus;;;0
|
409 |
-
winogrande;winogrande_xl;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
410 |
-
|
411 |
xnli;en;allenai/c4;corpus;;;0.12;data-based;https://arxiv.org/abs/2310.20707;2
|
412 |
xnli;en;oscar-corpus/OSCAR-2301;corpus;;;0.24;data-based;https://arxiv.org/abs/2310.20707;2
|
413 |
xnli;en;EleutherAI/pile;corpus;;;0.36;data-based;https://arxiv.org/abs/2310.20707;2
|
414 |
xnli;en;togethercomputer/RedPajama-Data-V2;corpus;;;0.44;data-based;https://arxiv.org/abs/2310.20707;2
|
415 |
-
|
416 |
xsum;;allenai/c4;corpus;;;2.13;data-based;https://arxiv.org/abs/2310.20707;2
|
417 |
xsum;;oscar-corpus/OSCAR-2301;corpus;;;0.13;data-based;https://arxiv.org/abs/2310.20707;2
|
418 |
-
xsum;;EleutherAI/pile;corpus;;;3.
|
419 |
xsum;;togethercomputer/RedPajama-Data-V2;corpus;;;4.28;data-based;https://arxiv.org/abs/2310.20707;2
|
420 |
-
|
421 |
-
zest;;allenai/c4;corpus;;;0
|
422 |
-
zest;;oscar-corpus/OSCAR-2301;corpus;;;0
|
423 |
-
zest;;EleutherAI/pile;corpus;;;0
|
424 |
-
zest;;togethercomputer/RedPajama-Data-V2;corpus;;;0
|
425 |
-
|
426 |
-
|
427 |
-
imdb;;GPT-4;model;100
|
428 |
-
imdb;;GPT-3.5;model;0
|
429 |
-
|
430 |
-
ag_news;;GPT-4;model;100
|
431 |
-
ag_news;;GPT-3.5;model;0
|
432 |
-
|
433 |
-
yelp_review_full;;GPT-4;model;0
|
434 |
-
yelp_review_full;;GPT-3.5;model;0
|
435 |
-
|
436 |
-
nyu-mll/glue;rte;GPT-4;model;100
|
437 |
-
nyu-mll/glue;rte;GPT-3.5;model;0
|
438 |
-
|
439 |
-
nyu-mll/glue;wnli;GPT-4;model;100
|
440 |
-
nyu-mll/glue;wnli;GPT-3.5;model;0
|
441 |
-
|
442 |
-
samsum;;GPT-4;model;0
|
443 |
-
samsum;;GPT-3.5;model;0
|
444 |
-
|
445 |
-
EdinburghNLP/xsum;;GPT-4;model;0
|
446 |
-
EdinburghNLP/xsum;;GPT-3.5;model;0
|
|
|
|
1 |
Evaluation Dataset;Subset;Contaminated Source;Model or corpus;Train Split;Development Split;Test Split;Approach;Reference;PR
|
2 |
+
conll2003;;GPT-3.5;Model;100;100;100;model-based;https://hitz-zentroa.github.io/lm-contamination/blog/;7
|
3 |
+
nyu-mll/glue;mnli;GPT-3.5;Model;100;100;100;model-based;https://hitz-zentroa.github.io/lm-contamination/blog/;7
|
4 |
+
rajpurkar/squad_v2;;GPT-3.5;Model;100;100;0;model-based;https://hitz-zentroa.github.io/lm-contamination/blog/;7
|
5 |
+
https://catalog.ldc.upenn.edu/LDC2006T06;;GPT-3.5;Model;100;100;100;model-based;https://hitz-zentroa.github.io/lm-contamination/blog/;7
|
6 |
+
quac;;GPT-3.5;Model;100;100;0;model-based;https://hitz-zentroa.github.io/lm-contamination/blog/;7
|
7 |
+
natural_questions;;GPT-3.5;Model;100;100;0;model-based;https://hitz-zentroa.github.io/lm-contamination/blog/;7
|
8 |
+
google/boolq;;GPT-3.5;Model;100;100;0;model-based;https://hitz-zentroa.github.io/lm-contamination/blog/;7
|
9 |
+
|
10 |
lama;T-REx;allenai/c4;corpus;;;4.6;data-based;https://arxiv.org/abs/2104.08758;6
|
11 |
lama;Google-RE;allenai/c4;corpus;;;5.7;data-based;https://arxiv.org/abs/2104.08758;6
|
12 |
EdinburghNLP/xsum;;allenai/c4;corpus;;;15.49;data-based;https://arxiv.org/abs/2104.08758;6
|
|
|
22 |
nyu-mll/glue;MRPC-sentence-2;allenai/c4;corpus;;;2.7;data-based;https://arxiv.org/abs/2104.08758;6
|
23 |
nyu-mll/glue;QNLI-sentence;allenai/c4;corpus;;;53.6;data-based;https://arxiv.org/abs/2104.08758;6
|
24 |
nyu-mll/glue;QNLI-question;allenai/c4;corpus;;;1.8;data-based;https://arxiv.org/abs/2104.08758;6
|
25 |
+
nyu-mll/glue;RTE-sentence-1;allenai/c4;corpus;;;6;data-based;https://arxiv.org/abs/2104.08758;6
|
26 |
nyu-mll/glue;RTE-sentence-2;allenai/c4;corpus;;;10.8;data-based;https://arxiv.org/abs/2104.08758;6
|
27 |
+
nyu-mll/glue;SST-2;allenai/c4;corpus;;;11;data-based;https://arxiv.org/abs/2104.08758;6
|
28 |
nyu-mll/glue;STS-B-sentence-1;allenai/c4;corpus;;;18.3;data-based;https://arxiv.org/abs/2104.08758;6
|
29 |
nyu-mll/glue;STS-B-sentence-2;allenai/c4;corpus;;;18.6;data-based;https://arxiv.org/abs/2104.08758;6
|
30 |
nyu-mll/glue;WNLI-sentence-1;allenai/c4;corpus;;;4.8;data-based;https://arxiv.org/abs/2104.08758;6
|
|
|
35 |
UCLNLP/adversarial_qa;adversarialQA;EleutherAI/pile;corpus;;;0.03;data-based;https://arxiv.org/abs/2310.20707;2
|
36 |
UCLNLP/adversarial_qa;adversarialQA;togethercomputer/RedPajama-Data-V2;corpus;;;0.03;data-based;https://arxiv.org/abs/2310.20707;2
|
37 |
|
38 |
+
UCLNLP/adversarial_qa;dbert;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
39 |
+
UCLNLP/adversarial_qa;dbert;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
40 |
+
UCLNLP/adversarial_qa;dbert;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
41 |
+
UCLNLP/adversarial_qa;dbert;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
42 |
|
43 |
+
UCLNLP/adversarial_qa;dbidaf;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
44 |
+
UCLNLP/adversarial_qa;dbidaf;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
45 |
+
UCLNLP/adversarial_qa;dbidaf;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
46 |
+
UCLNLP/adversarial_qa;dbidaf;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
47 |
|
48 |
UCLNLP/adversarial_qa;droberta;allenai/c4;corpus;;;0.1;data-based;https://arxiv.org/abs/2310.20707;2
|
49 |
UCLNLP/adversarial_qa;droberta;oscar-corpus/OSCAR-2301;corpus;;;0.1;data-based;https://arxiv.org/abs/2310.20707;2
|
50 |
UCLNLP/adversarial_qa;droberta;EleutherAI/pile;corpus;;;0.1;data-based;https://arxiv.org/abs/2310.20707;2
|
51 |
+
UCLNLP/adversarial_qa;droberta;togethercomputer/RedPajama-Data-V2;corpus;;;0.1;data-based;https://arxiv.org/abs/2310.20707;
|
52 |
|
53 |
aeslc;;allenai/c4;corpus;;;1.57;data-based;https://arxiv.org/abs/2310.20707;2
|
54 |
aeslc;;oscar-corpus/OSCAR-2301;corpus;;;0.31;data-based;https://arxiv.org/abs/2310.20707;2
|
|
|
56 |
aeslc;;togethercomputer/RedPajama-Data-V2;corpus;;;0.1;data-based;https://arxiv.org/abs/2310.20707;2
|
57 |
|
58 |
amazon_reviews_multi;;allenai/c4;corpus;;;2.28;data-based;https://arxiv.org/abs/2310.20707;2
|
59 |
+
amazon_reviews_multi;;oscar-corpus/OSCAR-2301;corpus;;;2.1;data-based;https://arxiv.org/abs/2310.20707;2
|
60 |
amazon_reviews_multi;;EleutherAI/pile;corpus;;;1.48;data-based;https://arxiv.org/abs/2310.20707;2
|
61 |
amazon_reviews_multi;;togethercomputer/RedPajama-Data-V2;corpus;;;2.06;data-based;https://arxiv.org/abs/2310.20707;2
|
62 |
|
|
|
65 |
billsum;;EleutherAI/pile;corpus;;;0.03;data-based;https://arxiv.org/abs/2310.20707;2
|
66 |
billsum;;togethercomputer/RedPajama-Data-V2;corpus;;;0.06;data-based;https://arxiv.org/abs/2310.20707;2
|
67 |
|
68 |
+
cosmos_qa;;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
69 |
+
cosmos_qa;;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
70 |
+
cosmos_qa;;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
71 |
+
cosmos_qa;;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
72 |
|
73 |
+
crows_pairs;;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
74 |
crows_pairs;;oscar-corpus/OSCAR-2301;corpus;;;0.2;data-based;https://arxiv.org/abs/2310.20707;2
|
75 |
+
crows_pairs;;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
76 |
+
crows_pairs;;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
77 |
|
78 |
+
ibm/duorc;ParaphraseRC;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
79 |
+
ibm/duorc;ParaphraseRC;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
80 |
+
ibm/duorc;ParaphraseRC;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
81 |
+
ibm/duorc;ParaphraseRC;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
82 |
|
83 |
ibm/duorc;SelfRC;allenai/c4;corpus;;;0.01;data-based;https://arxiv.org/abs/2310.20707;2
|
84 |
+
ibm/duorc;SelfRC;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
85 |
ibm/duorc;SelfRC;EleutherAI/pile;corpus;;;0.02;data-based;https://arxiv.org/abs/2310.20707;2
|
86 |
ibm/duorc;SelfRC;togethercomputer/RedPajama-Data-V2;corpus;;;0.02;data-based;https://arxiv.org/abs/2310.20707;2
|
87 |
|
|
|
111 |
nyu-mll/glue;mnli-mismatched;togethercomputer/RedPajama-Data-V2;corpus;;;2.17;data-based;https://arxiv.org/abs/2310.20707;2
|
112 |
|
113 |
nyu-mll/glue;mrpc;allenai/c4;corpus;;;0.06;data-based;https://arxiv.org/abs/2310.20707;2
|
114 |
+
nyu-mll/glue;mrpc;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
115 |
nyu-mll/glue;mrpc;EleutherAI/pile;corpus;;;0.64;data-based;https://arxiv.org/abs/2310.20707;2
|
116 |
nyu-mll/glue;mrpc;togethercomputer/RedPajama-Data-V2;corpus;;;1.16;data-based;https://arxiv.org/abs/2310.20707;2
|
117 |
|
|
|
120 |
nyu-mll/glue;qnli;EleutherAI/pile;corpus;;;1.48;data-based;https://arxiv.org/abs/2310.20707;2
|
121 |
nyu-mll/glue;qnli;togethercomputer/RedPajama-Data-V2;corpus;;;1.21;data-based;https://arxiv.org/abs/2310.20707;2
|
122 |
|
123 |
+
nyu-mll/glue;rte;allenai/c4;corpus;;;0.2;data-based;https://arxiv.org/abs/2310.20707;2
|
124 |
nyu-mll/glue;rte;oscar-corpus/OSCAR-2301;corpus;;;0.17;data-based;https://arxiv.org/abs/2310.20707;2
|
125 |
nyu-mll/glue;rte;EleutherAI/pile;corpus;;;0.13;data-based;https://arxiv.org/abs/2310.20707;2
|
126 |
nyu-mll/glue;rte;togethercomputer/RedPajama-Data-V2;corpus;;;67.47;data-based;https://arxiv.org/abs/2310.20707;2
|
|
|
130 |
nyu-mll/glue;stsb;EleutherAI/pile;corpus;;;11.09;data-based;https://arxiv.org/abs/2310.20707;2
|
131 |
nyu-mll/glue;stsb;togethercomputer/RedPajama-Data-V2;corpus;;;9.86;data-based;https://arxiv.org/abs/2310.20707;2
|
132 |
|
133 |
+
nyu-mll/glue;wnli;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
134 |
+
nyu-mll/glue;wnli;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
135 |
+
nyu-mll/glue;wnli;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
136 |
nyu-mll/glue;wnli;togethercomputer/RedPajama-Data-V2;corpus;;;2.05;data-based;https://arxiv.org/abs/2310.20707;2
|
137 |
|
138 |
head_qa;en;allenai/c4;corpus;;;5.22;data-based;https://arxiv.org/abs/2310.20707;2
|
|
|
141 |
head_qa;en;togethercomputer/RedPajama-Data-V2;corpus;;;5.94;data-based;https://arxiv.org/abs/2310.20707;2
|
142 |
|
143 |
health_fact;;allenai/c4;corpus;;;7.53;data-based;https://arxiv.org/abs/2310.20707;2
|
144 |
+
health_fact;;oscar-corpus/OSCAR-2301;corpus;;;3.4;data-based;https://arxiv.org/abs/2310.20707;2
|
145 |
health_fact;;EleutherAI/pile;corpus;;;1.94;data-based;https://arxiv.org/abs/2310.20707;2
|
146 |
+
health_fact;;togethercomputer/RedPajama-Data-V2;corpus;;;18.7;data-based;https://arxiv.org/abs/2310.20707;2
|
147 |
|
148 |
+
hlgd;;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
149 |
+
hlgd;;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
150 |
+
hlgd;;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
151 |
+
hlgd;;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
152 |
|
153 |
liar;;allenai/c4;corpus;;;29.23;data-based;https://arxiv.org/abs/2310.20707;2
|
154 |
liar;;oscar-corpus/OSCAR-2301;corpus;;;13.95;data-based;https://arxiv.org/abs/2310.20707;2
|
155 |
liar;;EleutherAI/pile;corpus;;;10.91;data-based;https://arxiv.org/abs/2310.20707;2
|
156 |
liar;;togethercomputer/RedPajama-Data-V2;corpus;;;45.05;data-based;https://arxiv.org/abs/2310.20707;2
|
157 |
|
158 |
+
math_dataset;algebra__linear_1d;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
159 |
+
math_dataset;algebra__linear_1d;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
160 |
+
math_dataset;algebra__linear_1d;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
161 |
+
math_dataset;algebra__linear_1d;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
162 |
|
163 |
+
math_dataset;algebra__linear_2d;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
164 |
+
math_dataset;algebra__linear_2d;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
165 |
+
math_dataset;algebra__linear_2d;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
166 |
+
math_dataset;algebra__linear_2d;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
167 |
|
168 |
+
math_dataset;algebra__linear_2d_composed;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
169 |
+
math_dataset;algebra__linear_2d_composed;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
170 |
+
math_dataset;algebra__linear_2d_composed;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
171 |
+
math_dataset;algebra__linear_2d_composed;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
172 |
|
173 |
math_qa;;allenai/c4;corpus;;;0.34;data-based;https://arxiv.org/abs/2310.20707;2
|
174 |
math_qa;;oscar-corpus/OSCAR-2301;corpus;;;0.03;data-based;https://arxiv.org/abs/2310.20707;2
|
175 |
+
math_qa;;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
176 |
math_qa;;togethercomputer/RedPajama-Data-V2;corpus;;;0.07;data-based;https://arxiv.org/abs/2310.20707;2
|
177 |
|
178 |
+
mc_taco;;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
179 |
+
mc_taco;;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
180 |
+
mc_taco;;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
181 |
mc_taco;;togethercomputer/RedPajama-Data-V2;corpus;;;0.14;data-based;https://arxiv.org/abs/2310.20707;2
|
182 |
|
183 |
+
mocha;;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
184 |
+
mocha;;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
185 |
+
mocha;;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
186 |
+
mocha;;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
187 |
|
188 |
+
openai_humaneval;;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
189 |
openai_humaneval;;oscar-corpus/OSCAR-2301;corpus;;;1.22;data-based;https://arxiv.org/abs/2310.20707;2
|
190 |
+
openai_humaneval;;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
191 |
+
openai_humaneval;;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
192 |
|
193 |
paws-x;en;allenai/c4;corpus;;;0.05;data-based;https://arxiv.org/abs/2310.20707;2
|
194 |
+
paws-x;en;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
195 |
paws-x;en;EleutherAI/pile;corpus;;;0.15;data-based;https://arxiv.org/abs/2310.20707;2
|
196 |
paws-x;en;togethercomputer/RedPajama-Data-V2;corpus;;;0.2;data-based;https://arxiv.org/abs/2310.20707;2
|
197 |
|
|
|
207 |
|
208 |
race;all;allenai/c4;corpus;;;0.14;data-based;https://arxiv.org/abs/2310.20707;2
|
209 |
race;all;oscar-corpus/OSCAR-2301;corpus;;;0.06;data-based;https://arxiv.org/abs/2310.20707;2
|
210 |
+
race;all;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
211 |
race;all;togethercomputer/RedPajama-Data-V2;corpus;;;0.28;data-based;https://arxiv.org/abs/2310.20707;2
|
212 |
|
213 |
race;high;allenai/c4;corpus;;;0.11;data-based;https://arxiv.org/abs/2310.20707;2
|
214 |
+
race;high;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
215 |
+
race;high;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
216 |
race;high;togethercomputer/RedPajama-Data-V2;corpus;;;0.26;data-based;https://arxiv.org/abs/2310.20707;2
|
217 |
|
218 |
race;middle;allenai/c4;corpus;;;0.21;data-based;https://arxiv.org/abs/2310.20707;2
|
219 |
race;middle;oscar-corpus/OSCAR-2301;corpus;;;0.21;data-based;https://arxiv.org/abs/2310.20707;2
|
220 |
+
race;middle;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
221 |
race;middle;togethercomputer/RedPajama-Data-V2;corpus;;;0.35;data-based;https://arxiv.org/abs/2310.20707;2
|
222 |
|
223 |
+
allenai/ropes;;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
224 |
+
allenai/ropes;;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
225 |
+
allenai/ropes;;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
226 |
+
allenai/ropes;;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
227 |
|
228 |
+
samsum;;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
229 |
+
samsum;;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
230 |
+
samsum;;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
231 |
samsum;;togethercomputer/RedPajama-Data-V2;corpus;;;0.12;data-based;https://arxiv.org/abs/2310.20707;2
|
232 |
|
233 |
+
scan;addprim_jump;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
234 |
+
scan;addprim_jump;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
235 |
scan;addprim_jump;EleutherAI/pile;corpus;;;0.05;data-based;https://arxiv.org/abs/2310.20707;2
|
236 |
scan;addprim_jump;togethercomputer/RedPajama-Data-V2;corpus;;;0.16;data-based;https://arxiv.org/abs/2310.20707;2
|
237 |
|
238 |
+
scan;addprim_turn;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
239 |
+
scan;addprim_turn;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
240 |
scan;addprim_turn;EleutherAI/pile;corpus;;;0.08;data-based;https://arxiv.org/abs/2310.20707;2
|
241 |
+
scan;addprim_turn;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
242 |
+
|
243 |
+
scan;filler_num0;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
244 |
+
scan;filler_num0;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
245 |
+
scan;filler_num0;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
246 |
scan;filler_num0;togethercomputer/RedPajama-Data-V2;corpus;;;0.9;data-based;https://arxiv.org/abs/2310.20707;2
|
247 |
+
|
248 |
+
scan;length;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
249 |
+
scan;length;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
250 |
scan;length;EleutherAI/pile;corpus;;;0.03;data-based;https://arxiv.org/abs/2310.20707;2
|
251 |
+
scan;length;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
252 |
+
|
253 |
scan;simple;allenai/c4;corpus;;;0.02;data-based;https://arxiv.org/abs/2310.20707;2
|
254 |
+
scan;simple;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
255 |
scan;simple;EleutherAI/pile;corpus;;;0.1;data-based;https://arxiv.org/abs/2310.20707;2
|
256 |
scan;simple;togethercomputer/RedPajama-Data-V2;corpus;;;0.26;data-based;https://arxiv.org/abs/2310.20707;2
|
257 |
+
|
258 |
+
scan;template_around;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
259 |
+
scan;template_around;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
260 |
+
scan;template_around;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
261 |
scan;template_around;togethercomputer/RedPajama-Data-V2;corpus;;;0.18;data-based;https://arxiv.org/abs/2310.20707;2
|
262 |
+
|
263 |
+
scan;template_jump;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
264 |
+
scan;template_jump;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
265 |
+
scan;template_jump;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
266 |
scan;template_jump;togethercomputer/RedPajama-Data-V2;corpus;;;0.9;data-based;https://arxiv.org/abs/2310.20707;2
|
267 |
+
|
268 |
+
scan;template_opposite;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
269 |
+
scan;template_opposite;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
270 |
scan;template_opposite;EleutherAI/pile;corpus;;;0.04;data-based;https://arxiv.org/abs/2310.20707;2
|
271 |
scan;template_opposite;togethercomputer/RedPajama-Data-V2;corpus;;;0.16;data-based;https://arxiv.org/abs/2310.20707;2
|
272 |
+
|
273 |
+
scan;template_right;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
274 |
+
scan;template_right;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
275 |
scan;template_right;EleutherAI/pile;corpus;;;0.11;data-based;https://arxiv.org/abs/2310.20707;2
|
276 |
scan;template_right;togethercomputer/RedPajama-Data-V2;corpus;;;0.16;data-based;https://arxiv.org/abs/2310.20707;2
|
277 |
+
|
278 |
allenai/scicite;;allenai/c4;corpus;;;1.78;data-based;https://arxiv.org/abs/2310.20707;2
|
279 |
allenai/scicite;;oscar-corpus/OSCAR-2301;corpus;;;1.51;data-based;https://arxiv.org/abs/2310.20707;2
|
280 |
allenai/scicite;;EleutherAI/pile;corpus;;;0.86;data-based;https://arxiv.org/abs/2310.20707;2
|
281 |
allenai/scicite;;togethercomputer/RedPajama-Data-V2;corpus;;;1.72;data-based;https://arxiv.org/abs/2310.20707;2
|
282 |
+
|
283 |
scitail;snli_format;allenai/c4;corpus;;;0.09;data-based;https://arxiv.org/abs/2310.20707;2
|
284 |
scitail;snli_format;oscar-corpus/OSCAR-2301;corpus;;;0.38;data-based;https://arxiv.org/abs/2310.20707;2
|
285 |
scitail;snli_format;EleutherAI/pile;corpus;;;0.28;data-based;https://arxiv.org/abs/2310.20707;2
|
286 |
scitail;snli_format;togethercomputer/RedPajama-Data-V2;corpus;;;0.71;data-based;https://arxiv.org/abs/2310.20707;2
|
287 |
+
|
288 |
scitail;tsv_format;allenai/c4;corpus;;;0.09;data-based;https://arxiv.org/abs/2310.20707;2
|
289 |
scitail;tsv_format;oscar-corpus/OSCAR-2301;corpus;;;0.38;data-based;https://arxiv.org/abs/2310.20707;2
|
290 |
scitail;tsv_format;EleutherAI/pile;corpus;;;0.28;data-based;https://arxiv.org/abs/2310.20707;2
|
291 |
scitail;tsv_format;togethercomputer/RedPajama-Data-V2;corpus;;;0.71;data-based;https://arxiv.org/abs/2310.20707;2
|
292 |
+
|
293 |
sem_eval_2014_task_1;;allenai/c4;corpus;;;0.35;data-based;https://arxiv.org/abs/2310.20707;2
|
294 |
sem_eval_2014_task_1;;oscar-corpus/OSCAR-2301;corpus;;;0.18;data-based;https://arxiv.org/abs/2310.20707;2
|
295 |
sem_eval_2014_task_1;;EleutherAI/pile;corpus;;;4.89;data-based;https://arxiv.org/abs/2310.20707;2
|
296 |
sem_eval_2014_task_1;;togethercomputer/RedPajama-Data-V2;corpus;;;52.81;data-based;https://arxiv.org/abs/2310.20707;2
|
297 |
+
|
298 |
sick;;allenai/c4;corpus;;;0.31;data-based;https://arxiv.org/abs/2310.20707;2
|
299 |
sick;;oscar-corpus/OSCAR-2301;corpus;;;0.18;data-based;https://arxiv.org/abs/2310.20707;2
|
300 |
sick;;EleutherAI/pile;corpus;;;4.79;data-based;https://arxiv.org/abs/2310.20707;2
|
301 |
sick;;togethercomputer/RedPajama-Data-V2;corpus;;;52.61;data-based;https://arxiv.org/abs/2310.20707;2
|
302 |
+
|
303 |
snli;;allenai/c4;corpus;;;0.04;data-based;https://arxiv.org/abs/2310.20707;2
|
304 |
snli;;oscar-corpus/OSCAR-2301;corpus;;;0.08;data-based;https://arxiv.org/abs/2310.20707;2
|
305 |
snli;;EleutherAI/pile;corpus;;;1.11;data-based;https://arxiv.org/abs/2310.20707;2
|
306 |
snli;;togethercomputer/RedPajama-Data-V2;corpus;;;1.22;data-based;https://arxiv.org/abs/2310.20707;2
|
307 |
+
|
308 |
+
squadshifts;amazon;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
309 |
+
squadshifts;amazon;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
310 |
+
squadshifts;amazon;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
311 |
+
squadshifts;amazon;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
312 |
+
|
313 |
squadshifts;new_wiki;allenai/c4;corpus;;;0.01;data-based;https://arxiv.org/abs/2310.20707;2
|
314 |
squadshifts;new_wiki;oscar-corpus/OSCAR-2301;corpus;;;0.01;data-based;https://arxiv.org/abs/2310.20707;2
|
315 |
squadshifts;new_wiki;EleutherAI/pile;corpus;;;0.01;data-based;https://arxiv.org/abs/2310.20707;2
|
316 |
squadshifts;new_wiki;togethercomputer/RedPajama-Data-V2;corpus;;;0.03;data-based;https://arxiv.org/abs/2310.20707;2
|
317 |
+
|
318 |
squadshifts;nyt;allenai/c4;corpus;;;0.01;data-based;https://arxiv.org/abs/2310.20707;2
|
319 |
squadshifts;nyt;oscar-corpus/OSCAR-2301;corpus;;;0.03;data-based;https://arxiv.org/abs/2310.20707;2
|
320 |
squadshifts;nyt;EleutherAI/pile;corpus;;;0.02;data-based;https://arxiv.org/abs/2310.20707;2
|
321 |
squadshifts;nyt;togethercomputer/RedPajama-Data-V2;corpus;;;0.04;data-based;https://arxiv.org/abs/2310.20707;2
|
322 |
+
|
323 |
stsb_multi_mt;;allenai/c4;corpus;;;3.48;data-based;https://arxiv.org/abs/2310.20707;2
|
324 |
stsb_multi_mt;;oscar-corpus/OSCAR-2301;corpus;;;3.12;data-based;https://arxiv.org/abs/2310.20707;2
|
325 |
stsb_multi_mt;;EleutherAI/pile;corpus;;;11.09;data-based;https://arxiv.org/abs/2310.20707;2
|
326 |
stsb_multi_mt;;togethercomputer/RedPajama-Data-V2;corpus;;;9.86;data-based;https://arxiv.org/abs/2310.20707;2
|
327 |
+
|
328 |
+
subjqa;books;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
329 |
+
subjqa;books;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
330 |
+
subjqa;books;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
331 |
+
subjqa;books;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
332 |
+
|
333 |
+
subjqa;grocery;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
334 |
+
subjqa;grocery;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
335 |
+
subjqa;grocery;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
336 |
+
subjqa;grocery;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
337 |
+
|
338 |
+
subjqa;movies;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
339 |
+
subjqa;movies;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
340 |
+
subjqa;movies;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
341 |
+
subjqa;movies;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
342 |
+
|
343 |
+
subjqa;restaurants;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
344 |
+
subjqa;restaurants;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
345 |
+
subjqa;restaurants;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
346 |
+
subjqa;restaurants;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
347 |
+
|
348 |
super_glue;axb;allenai/c4;corpus;;;1.99;data-based;https://arxiv.org/abs/2310.20707;2
|
349 |
super_glue;axb;oscar-corpus/OSCAR-2301;corpus;;;1.45;data-based;https://arxiv.org/abs/2310.20707;2
|
350 |
super_glue;axb;EleutherAI/pile;corpus;;;5.07;data-based;https://arxiv.org/abs/2310.20707;2
|
351 |
super_glue;axb;togethercomputer/RedPajama-Data-V2;corpus;;;6.16;data-based;https://arxiv.org/abs/2310.20707;2
|
352 |
+
|
353 |
+
super_glue;axg;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
354 |
+
super_glue;axg;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
355 |
super_glue;axg;EleutherAI/pile;corpus;;;0.28;data-based;https://arxiv.org/abs/2310.20707;2
|
356 |
+
super_glue;axg;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
357 |
+
|
358 |
+
super_glue;boolq;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
359 |
super_glue;boolq;oscar-corpus/OSCAR-2301;corpus;;;3.05;data-based;https://arxiv.org/abs/2310.20707;2
|
360 |
+
super_glue;boolq;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
361 |
super_glue;boolq;togethercomputer/RedPajama-Data-V2;corpus;;;0.03;data-based;https://arxiv.org/abs/2310.20707;2
|
362 |
+
|
363 |
+
super_glue;cb;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
364 |
+
super_glue;cb;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
365 |
+
super_glue;cb;EleutherAI/pile;corpus;;;2;data-based;https://arxiv.org/abs/2310.20707;2
|
366 |
super_glue;cb;togethercomputer/RedPajama-Data-V2;corpus;;;1.6;data-based;https://arxiv.org/abs/2310.20707;2
|
367 |
+
|
368 |
super_glue;copa;allenai/c4;corpus;;;0.6;data-based;https://arxiv.org/abs/2310.20707;2
|
369 |
+
super_glue;copa;oscar-corpus/OSCAR-2301;corpus;;;1;data-based;https://arxiv.org/abs/2310.20707;2
|
370 |
super_glue;copa;EleutherAI/pile;corpus;;;1.2;data-based;https://arxiv.org/abs/2310.20707;2
|
371 |
+
super_glue;copa;togethercomputer/RedPajama-Data-V2;corpus;;;100;data-based;https://arxiv.org/abs/2310.20707;2
|
372 |
+
|
373 |
+
super_glue;multirc;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
374 |
+
super_glue;multirc;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
375 |
+
super_glue;multirc;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
376 |
+
super_glue;multirc;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
377 |
+
|
378 |
+
super_glue;record;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
379 |
+
super_glue;record;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
380 |
+
super_glue;record;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
381 |
+
super_glue;record;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
382 |
+
|
383 |
super_glue;rte;allenai/c4;corpus;;;0.2;data-based;https://arxiv.org/abs/2310.20707;2
|
384 |
super_glue;rte;oscar-corpus/OSCAR-2301;corpus;;;0.17;data-based;https://arxiv.org/abs/2310.20707;2
|
385 |
super_glue;rte;EleutherAI/pile;corpus;;;0.13;data-based;https://arxiv.org/abs/2310.20707;2
|
386 |
super_glue;rte;togethercomputer/RedPajama-Data-V2;corpus;;;67.47;data-based;https://arxiv.org/abs/2310.20707;2
|
387 |
+
|
388 |
super_glue;wic;allenai/c4;corpus;;;64.43;data-based;https://arxiv.org/abs/2310.20707;2
|
389 |
super_glue;wic;oscar-corpus/OSCAR-2301;corpus;;;49.43;data-based;https://arxiv.org/abs/2310.20707;2
|
390 |
super_glue;wic;EleutherAI/pile;corpus;;;18.57;data-based;https://arxiv.org/abs/2310.20707;2
|
391 |
super_glue;wic;togethercomputer/RedPajama-Data-V2;corpus;;;60.21;data-based;https://arxiv.org/abs/2310.20707;2
|
392 |
+
|
393 |
swag;regular;allenai/c4;corpus;;;2.48;data-based;https://arxiv.org/abs/2310.20707;2
|
394 |
swag;regular;oscar-corpus/OSCAR-2301;corpus;;;1.65;data-based;https://arxiv.org/abs/2310.20707;2
|
395 |
swag;regular;EleutherAI/pile;corpus;;;2.21;data-based;https://arxiv.org/abs/2310.20707;2
|
396 |
swag;regular;togethercomputer/RedPajama-Data-V2;corpus;;;2.79;data-based;https://arxiv.org/abs/2310.20707;2
|
397 |
+
|
398 |
+
tab_fact;tab_fact;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
399 |
+
tab_fact;tab_fact;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
400 |
+
tab_fact;tab_fact;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
401 |
+
tab_fact;tab_fact;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
402 |
+
|
403 |
wiki_qa;;allenai/c4;corpus;;;0.24;data-based;https://arxiv.org/abs/2310.20707;2
|
404 |
wiki_qa;;oscar-corpus/OSCAR-2301;corpus;;;0.18;data-based;https://arxiv.org/abs/2310.20707;2
|
405 |
wiki_qa;;EleutherAI/pile;corpus;;;0.19;data-based;https://arxiv.org/abs/2310.20707;2
|
406 |
wiki_qa;;togethercomputer/RedPajama-Data-V2;corpus;;;0.91;data-based;https://arxiv.org/abs/2310.20707;2
|
407 |
+
|
408 |
+
winograd_wsc;wsc273;allenai/c4;corpus;;;29.3;data-based;https://arxiv.org/abs/2310.20707;2
|
409 |
+
winograd_wsc;wsc273;oscar-corpus/OSCAR-2301;corpus;;;30.4;data-based;https://arxiv.org/abs/2310.20707;2
|
410 |
winograd_wsc;wsc273;EleutherAI/pile;corpus;;;32.23;data-based;https://arxiv.org/abs/2310.20707;2
|
411 |
winograd_wsc;wsc273;togethercomputer/RedPajama-Data-V2;corpus;;;58.24;data-based;https://arxiv.org/abs/2310.20707;2
|
412 |
+
|
413 |
+
winogrande;winogrande_xl;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
414 |
+
winogrande;winogrande_xl;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
415 |
+
winogrande;winogrande_xl;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
416 |
+
winogrande;winogrande_xl;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
417 |
+
|
418 |
xnli;en;allenai/c4;corpus;;;0.12;data-based;https://arxiv.org/abs/2310.20707;2
|
419 |
xnli;en;oscar-corpus/OSCAR-2301;corpus;;;0.24;data-based;https://arxiv.org/abs/2310.20707;2
|
420 |
xnli;en;EleutherAI/pile;corpus;;;0.36;data-based;https://arxiv.org/abs/2310.20707;2
|
421 |
xnli;en;togethercomputer/RedPajama-Data-V2;corpus;;;0.44;data-based;https://arxiv.org/abs/2310.20707;2
|
422 |
+
|
423 |
xsum;;allenai/c4;corpus;;;2.13;data-based;https://arxiv.org/abs/2310.20707;2
|
424 |
xsum;;oscar-corpus/OSCAR-2301;corpus;;;0.13;data-based;https://arxiv.org/abs/2310.20707;2
|
425 |
+
xsum;;EleutherAI/pile;corpus;;;3.3;data-based;https://arxiv.org/abs/2310.20707;2
|
426 |
xsum;;togethercomputer/RedPajama-Data-V2;corpus;;;4.28;data-based;https://arxiv.org/abs/2310.20707;2
|
427 |
+
|
428 |
+
zest;;allenai/c4;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
429 |
+
zest;;oscar-corpus/OSCAR-2301;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
430 |
+
zest;;EleutherAI/pile;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
431 |
+
zest;;togethercomputer/RedPajama-Data-V2;corpus;;;0;data-based;https://arxiv.org/abs/2310.20707;2
|
432 |
+
|
433 |
+
|
434 |
+
imdb;;GPT-4;model;100;;0;model-based;https://arxiv.org/pdf/2308.08493;3
|
435 |
+
imdb;;GPT-3.5;model;0;;0;model-based;https://arxiv.org/pdf/2308.08493;3
|
436 |
+
|
437 |
+
ag_news;;GPT-4;model;100;;100;model-based;https://arxiv.org/pdf/2308.08493;3
|
438 |
+
ag_news;;GPT-3.5;model;0;;0;model-based;https://arxiv.org/pdf/2308.08493;3
|
439 |
+
|
440 |
+
yelp_review_full;;GPT-4;model;0;;0;model-based;https://arxiv.org/pdf/2308.08493;3
|
441 |
+
yelp_review_full;;GPT-3.5;model;0;;0;model-based;https://arxiv.org/pdf/2308.08493;3
|
442 |
+
|
443 |
+
nyu-mll/glue;rte;GPT-4;model;100;;0;model-based;https://arxiv.org/pdf/2308.08493;3
|
444 |
+
nyu-mll/glue;rte;GPT-3.5;model;0;;0;model-based;https://arxiv.org/pdf/2308.08493;3
|
445 |
+
|
446 |
+
nyu-mll/glue;wnli;GPT-4;model;100;;100;model-based;https://arxiv.org/pdf/2308.08493;3
|
447 |
+
nyu-mll/glue;wnli;GPT-3.5;model;0;;0;model-based;https://arxiv.org/pdf/2308.08493;3
|
448 |
+
|
449 |
+
samsum;;GPT-4;model;0;;0;model-based;https://arxiv.org/pdf/2308.08493;3
|
450 |
+
samsum;;GPT-3.5;model;0;;0;model-based;https://arxiv.org/pdf/2308.08493;3
|
451 |
+
|
452 |
+
EdinburghNLP/xsum;;GPT-4;model;0;;100;model-based;https://arxiv.org/pdf/2308.08493;3
|
453 |
+
EdinburghNLP/xsum;;GPT-3.5;model;0;;100;model-based;https://arxiv.org/pdf/2308.08493;3
|
454 |
+
|