File size: 8,454 Bytes
683d0f6
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
[
    {
        "dataset": "cola",
        "type": "task",
        "origin_acc": 62.2,
        "origin_prompt": "Assess the grammatical structure of the given sentence and classify it as 'Acceptable' or 'Unacceptable'.",
        "atk_acc": 3.8,
        "atk_prompt": "Assess the context t of the of sentence and judge it as 'Acceptable' or 'Unacceptable'. ",
        "transfer_ori_acc": 76.3,
        "transfer_atk_acc": 74.0
    },
    {
        "dataset": "cola",
        "type": "role",
        "origin_acc": 49.6,
        "origin_prompt": "As a tool for determining grammatical correctness, review the sentence and categorize its grammar as either 'acceptable' or 'unacceptable'.",
        "atk_acc": 0.0,
        "atk_prompt": "should a be for calculating adjective correctness, review the sentence and categorize its i as or 'acceptable' or 'unacceptable'. ",
        "transfer_ori_acc": 75.0,
        "transfer_atk_acc": 76.1
    },
    {
        "dataset": "mrpc",
        "type": "task",
        "origin_acc": 69.36,
        "origin_prompt": "Determine if the meanings of the following sentences are semantically equivalent by responding with 'equivalent' or 'not_equivalent'.",
        "atk_acc": 33.33,
        "atk_prompt": "Determine if the words of the following sentence are semanticdouble equivalent by thinking with 'equivalent' or 'not_equivalent'. ",
        "transfer_ori_acc": 81.37254901960785,
        "transfer_atk_acc": 74.75490196078431
    },
    {
        "dataset": "mrpc",
        "type": "role",
        "origin_acc": 68.14,
        "origin_prompt": "In the role of a sentence comparison analyst, assess the provided sentences and indicate if they convey the same meaning by selecting 'equivalent' or 'not_equivalent'.",
        "atk_acc": 17.16,
        "atk_prompt": "to the role of a face comparison instance, assess the following sentences and indicate if they describe the same person by selecting 'equivalent' or 'not_equivalent'. ",
        "transfer_ori_acc": 80.3921568627451,
        "transfer_atk_acc": 79.16666666666666
    },
    {
        "dataset": "qnli",
        "type": "task",
        "origin_acc": 41.2,
        "origin_prompt": "Based on the information in the context, decide if the answer to the question is justified by choosing 'entailment' or 'not_entailment'.",
        "atk_acc": 0.0,
        "atk_prompt": "Based on the information in the context, decide if the answer to the is is committed by versus 'entailment' or 'not_entailment'. ",
        "transfer_ori_acc": 87.8,
        "transfer_atk_acc": 81.10000000000001
    },
    {
        "dataset": "qnli",
        "type": "role",
        "origin_acc": 35.5,
        "origin_prompt": "As a textual inference expert, analyze if the answer to the question can be deduced from the provided context and select 'entailment' or 'not_entailment'.",
        "atk_acc": 0.0,
        "atk_prompt": "As a textual inference expert, write if the answer to the task can be deduced from the supplied instance and search 'entailment' or 'not_entailment'. ",
        "transfer_ori_acc": 77.2,
        "transfer_atk_acc": 83.5
    },
    {
        "dataset": "qqp",
        "type": "task",
        "origin_acc": 41.8,
        "origin_prompt": "Do the following expressions mean the same thing? Provide your answer as 'equivalent' or 'not_equivalent'.",
        "atk_acc": 10.3,
        "atk_prompt": "are the following expression do the same as? substitute your subject as 'equivalent' or 'not_equivalent'. ",
        "transfer_ori_acc": 87.8,
        "transfer_atk_acc": 87.0
    },
    {
        "dataset": "qqp",
        "type": "role",
        "origin_acc": 33.0,
        "origin_prompt": "In the role of a question similarity analyzer, examine the meaning of the provided questions and decide if they are equivalent, answering with 'equivalent' for equivalent questions or 'not_equivalent' for different questions.",
        "atk_acc": 8.2,
        "atk_prompt": "In the role of a topic similarities analyzer, examining the semantic of the questions the and decide if they are equivalent, answer with 'equivalent' for equivalent numbers or 'not_equivalent' for various respondents. ",
        "transfer_ori_acc": 86.5,
        "transfer_atk_acc": 85.6
    },
    {
        "dataset": "rte",
        "type": "task",
        "origin_acc": 50.54,
        "origin_prompt": "Are the following two sentences entailment or not_entailment? Answer me with \"entailment\" or \"not_entailment\", just one word.",
        "atk_acc": 1.08,
        "atk_prompt": "pair the following opposite sentences entailment or not_entailment? answers me with \"entailment\" or \"not_entailment\", just 1 word. ",
        "transfer_ori_acc": 85.5595667870036,
        "transfer_atk_acc": 83.39350180505414
    },
    {
        "dataset": "rte",
        "type": "role",
        "origin_acc": 36.82,
        "origin_prompt": "In the role of a semantic relationship analyzer, examine the connection between the given sentences and decide if they exhibit entailment or not_entailment. Answer with 'entailment' or 'not_entailment'.",
        "atk_acc": 0.0,
        "atk_prompt": "In the role of a descriptive relationship analyzeing, read the connection between the given sentences and learn if they exhibit entailment or not_entailment. answers with 'entailment' or 'not_entailment'. ",
        "transfer_ori_acc": 81.2274368231047,
        "transfer_atk_acc": 83.03249097472924
    },
    {
        "dataset": "sst2",
        "type": "task",
        "origin_acc": 12.5,
        "origin_prompt": "After examining the following expression, label its emotion as either 'positive' or 'negative'.",
        "atk_acc": 0.0,
        "atk_prompt": "After examining the following action, type its emotion as either 'positive' or 'negative'. ",
        "transfer_ori_acc": 93.69266055045871,
        "transfer_atk_acc": 94.15137614678899
    },
    {
        "dataset": "sst2",
        "type": "role",
        "origin_acc": 33.37,
        "origin_prompt": "Taking on the role of an emotion classifier, specify if the provided phrase is 'positive' or 'negative'.",
        "atk_acc": 0.11,
        "atk_prompt": "[UNK] on the relationship of an emotion classtoken, choose if the provided phrase is 'positive' or 'negative'. ",
        "transfer_ori_acc": 94.38073394495413,
        "transfer_atk_acc": 93.23394495412845
    },
    {
        "dataset": "wnli",
        "type": "task",
        "origin_acc": 23.94,
        "origin_prompt": "Does the relationship between the given sentences represent entailment or not_entailment? Respond with 'entailment' or 'not_entailment'.",
        "atk_acc": 0.0,
        "atk_prompt": "Does the relationship between the given sentences represent entailment or not_entailment? q with 'entailment' or 'not_entailment'. ",
        "transfer_ori_acc": 63.38028169014085,
        "transfer_atk_acc": 63.38028169014085
    },
    {
        "dataset": "wnli",
        "type": "role",
        "origin_acc": 16.9,
        "origin_prompt": "In your role as an entailment analysis tool, assess the relationship between the given sentences and classify it as 'entailment' or 'not_entailment'.",
        "atk_acc": 0.0,
        "atk_prompt": "[UNK] your role as an entailment and tool, assess the relationship between the given sentences and classify it as 'entailment' or 'not_entailment'. ",
        "transfer_ori_acc": 57.74647887323944,
        "transfer_atk_acc": 59.154929577464785
    },
    {
        "dataset": "mmlu",
        "type": "task",
        "origin_acc": 8.51,
        "origin_prompt": "Evaluate the multiple-choice question about {} and select the most fitting response from 'A', 'B', 'C', or 'D'.",
        "atk_acc": 3.55,
        "atk_prompt": "i the multiple-a asked about {} and select the most fitting response from 'A', 'B', 'C', or 'D'. ",
        "transfer_ori_acc": 45.56737588652482,
        "transfer_atk_acc": 45.92198581560284
    },
    {
        "dataset": "mmlu",
        "type": "role",
        "origin_acc": 9.22,
        "origin_prompt": "With your understanding of {}, kindly answer the subsequent multiple-choice question by selecting 'A', 'B', 'C', or 'D'.",
        "atk_acc": 3.37,
        "atk_prompt": "With your understand of {}, self answer the subsequent multiple-chosen questions by stating 'A', 'B', 'C', or 'D'.",
        "transfer_ori_acc": 45.39007092198582,
        "transfer_atk_acc": 44.858156028368796
    }
]