[
    {
        "dataset": "cola",
        "type": "task",
        "origin_acc": 69.0,
        "origin_prompt": "Assess the following sentence and determine if it is grammatically correct. Respond with 'Acceptable' or 'Unacceptable':",
        "atk_acc": 0.0,
        "atk_prompt": "Evaluating the later sen𝚝ence and determine if it is grammatically correct. Respoոd with 'Acceptable' or 'Unacceptable':",
        "transfer_ori_acc": 86.2,
        "transfer_atk_acc": 86.1
    },
    {
        "dataset": "cola",
        "type": "role",
        "origin_acc": 54.9,
        "origin_prompt": "As a tool for determining grammatical correctness, review the sentence and categorize its grammar as either 'acceptable' or 'unacceptable':",
        "atk_acc": 0.0,
        "atk_prompt": "Aѕ a tool for determining grammatical regularity, reviewing the penalties and categorizе its grammar as ei𝚝her 'acceptable' or 'unacceptable':",
        "transfer_ori_acc": 85.7,
        "transfer_atk_acc": 85.0
    },
    {
        "dataset": "mrpc",
        "type": "task",
        "origin_acc": 58.09,
        "origin_prompt": "Compare the following sentences and determine if they share the same semantic meaning by responding with 'equivalent' or 'not_equivalent'.",
        "atk_acc": 0.49,
        "atk_prompt": "Comрare the later sentences and determine if they share the same semantic meaning by respoոding with 'equivalent' or 'not_equivalent'. ",
        "transfer_ori_acc": 86.02941176470588,
        "transfer_atk_acc": 86.27450980392157
    },
    {
        "dataset": "mrpc",
        "type": "role",
        "origin_acc": 53.43,
        "origin_prompt": "In the role of a sentence comparison analyst, assess the provided sentences and indicate if they convey the same meaning by selecting 'equivalent' or 'not_equivalent'.",
        "atk_acc": 0.0,
        "atk_prompt": "Iո the rloe of a sentence comparison analyst, asess the provided sentences and indicate if they convey the same meaning by selec𝚝ing 'equivalent' or 'not_equivalent'. ",
        "transfer_ori_acc": 86.27450980392157,
        "transfer_atk_acc": 86.02941176470588
    },
    {
        "dataset": "qnli",
        "type": "task",
        "origin_acc": 30.5,
        "origin_prompt": "Review the given context and question, and decide if the context contains enough information to support the answer by selecting 'entailment' or 'not_entailment'.",
        "atk_acc": 0.0,
        "atk_prompt": "Revisited the afforded contеxt and question, and decide if the context contains enough information to support the answer by selec𝚝ing 'entailment' or 'not_entailment'. ",
        "transfer_ori_acc": 94.1,
        "transfer_atk_acc": 94.19999999999999
    },
    {
        "dataset": "qnli",
        "type": "role",
        "origin_acc": 38.4,
        "origin_prompt": "As a textual inference expert, analyze if the answer to the question can be deduced from the provided context and select 'entailment' or 'not_entailment'.",
        "atk_acc": 0.1,
        "atk_prompt": "Aѕ a textual deduced expeⲅt, analyzes if the replying to the question can be deduced from the provided context and selec𝚝 'entailment' or 'not_entailment'. ",
        "transfer_ori_acc": 93.30000000000001,
        "transfer_atk_acc": 93.5
    },
    {
        "dataset": "qqp",
        "type": "task",
        "origin_acc": 23.9,
        "origin_prompt": "Can these two statements be considered equal in meaning? Answer with 'equivalent' or 'not_equivalent'.",
        "atk_acc": 0.2,
        "atk_prompt": "Possible these two statemen𝚝s be considered equal in meaing? Answer with 'equivalent' or 'not_equivalent'. ",
        "transfer_ori_acc": 87.4,
        "transfer_atk_acc": 88.2
    },
    {
        "dataset": "qqp",
        "type": "role",
        "origin_acc": 16.9,
        "origin_prompt": "In the role of a question similarity analyzer, examine the meaning of the provided questions and decide if they are equivalent, answering with 'equivalent' for equivalent questions or 'not_equivalent' for different questions.",
        "atk_acc": 0.0,
        "atk_prompt": "Iո the role of a question similarity analyzer, examine the meaning of the provided questions and decide if they are equivalent, answering with 'equivalent' for equivalent questions or 'not_equivalent' for different ques𝚝ions. ",
        "transfer_ori_acc": 88.1,
        "transfer_atk_acc": 87.8
    },
    {
        "dataset": "rte",
        "type": "task",
        "origin_acc": 45.13,
        "origin_prompt": "Does the relationship between the given sentences represent entailment or not_entailment? Respond with 'entailment' or 'not_entailment'.",
        "atk_acc": 1.08,
        "atk_prompt": "Ai the relationship between the given sentence represent entailment or not_entailment? Answer with 'entailment' or 'not_entailment'.",
        "transfer_ori_acc": 93.50180505415162,
        "transfer_atk_acc": 93.86281588447653
    },
    {
        "dataset": "rte",
        "type": "role",
        "origin_acc": 51.26,
        "origin_prompt": "In the role of a semantic relationship analyzer, examine the connection between the given sentences and decide if they exhibit entailment or not_entailment. Answer with 'entailment' or 'not_entailment':",
        "atk_acc": 0.36,
        "atk_prompt": "Onto the role of a semantic relationship offing, e×amine the connection between the given seոtences and decіde if they exhibit entailment or not_entailment. Answer with 'entailment' or 'not_entailment':",
        "transfer_ori_acc": 92.4187725631769,
        "transfer_atk_acc": 92.05776173285199
    },
    {
        "dataset": "sst2",
        "type": "task",
        "origin_acc": 48.05,
        "origin_prompt": "After examining the following expression, label its emotion as either 'positive' or 'negative':",
        "atk_acc": 2.06,
        "atk_prompt": "After examining the following expression, label its emo𝚝ion as neither 'positive' or 'negative': ",
        "transfer_ori_acc": 91.97247706422019,
        "transfer_atk_acc": 88.30275229357798
    },
    {
        "dataset": "sst2",
        "type": "role",
        "origin_acc": 37.5,
        "origin_prompt": "As an emotion detector, determine if the provided passage conveys a 'positive' or 'negative' sentiment. Classify:",
        "atk_acc": 1.03,
        "atk_prompt": "Because an emotion detector, determine if the povided passgae convey a 'positive' or 'negative' ambience. Classi𝚏y: ",
        "transfer_ori_acc": 95.87155963302753,
        "transfer_atk_acc": 95.18348623853211
    },
    {
        "dataset": "wnli",
        "type": "task",
        "origin_acc": 43.66,
        "origin_prompt": "Does the relationship between the given sentences represent entailment or not_entailment? Respond with 'entailment' or 'not_entailment'.",
        "atk_acc": 1.41,
        "atk_prompt": "Deos the relationship between the given sentences represent entailment or not_entailment? Answering with 'entailment' or 'not_entailment'.",
        "transfer_ori_acc": 78.87323943661971,
        "transfer_atk_acc": 78.87323943661971
    },
    {
        "dataset": "wnli",
        "type": "role",
        "origin_acc": 43.66,
        "origin_prompt": "In your role as an entailment analysis tool, assess the relationship between the given sentences and classify it as 'entailment' or 'not_entailment':",
        "atk_acc": 0.0,
        "atk_prompt": "During your roⅼe as an entailment analysis tool, evaluating the relationship between the given sentences and classifying it as 'entailment' or 'not_entailment':",
        "transfer_ori_acc": 78.87323943661971,
        "transfer_atk_acc": 80.28169014084507
    },
    {
        "dataset": "mmlu",
        "type": "task",
        "origin_acc": 25.67,
        "origin_prompt": "Evaluate the multiple-choice question about {} and select the most fitting response from 'A', 'B', 'C', or 'D'.",
        "atk_acc": 9.92,
        "atk_prompt": "Assess the mul𝚝iple-choice question about {} and seⅼect the most fitting response from 'A', 'B', 'C', or 'D'. ",
        "transfer_ori_acc": 53.01418439716312,
        "transfer_atk_acc": 53.54609929078015
    },
    {
        "dataset": "mmlu",
        "type": "role",
        "origin_acc": 24.39,
        "origin_prompt": "With your knowledge of {}, tackle the following multiple-choice question by choosing 'A', 'B', 'C', or 'D'.",
        "atk_acc": 9.57,
        "atk_prompt": "With your knowledge of {}, tackle the 𝚏ollowing multiple-choіce problem by choosing 'A', 'B', 'C', or 'D'.",
        "transfer_ori_acc": 52.12765957446809,
        "transfer_atk_acc": 53.01418439716312
    }
]