File size: 9,550 Bytes
ea585ab
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
Visible device: cuda
Seed used: 0
Batch size: 64
Epochs: 40
Learning rate: 1e-05
Entropy weight: 0.01
Regularization weight: 0.0
Only use multiwoz like domains: False
Vectorizer: Data set used is multiwoz21
We filter state by active domains: True
Vectorizer: Data set used is multiwoz21
Embedding semantic descriptions: True
Embedded descriptions successfully. Size: torch.Size([338, 768])
Data set used for descriptions: multiwoz21
We use Roberta to embed actions.
Didnt load a model
Start training
Epoch: 0
Precision: 0
Recall: 0
F1: 0
Best Precision: 0.0
Best Recall: 0.0
Best F1: 0.0
Epoch: 1
Precision: 0
Recall: 0
F1: 0
Best Precision: 0.0
Best Recall: 0.0
Best F1: 0.0
Epoch: 2
Average actions: 2.4348959922790527
Average target actions: 2.28125
Precision: 0.043010752688172046
Recall: 0.0425531914893617
F1: 0.04278074866310161
<<dialog policy>> epoch 2: saved network to mdl
Best Precision: 0.043010752688172046
Best Recall: 0.0425531914893617
Best F1: 0.04278074866310161
Epoch: 3
Precision: 0.043010752688172046
Recall: 0.0425531914893617
F1: 0.04278074866310161
Best Precision: 0.043010752688172046
Best Recall: 0.0425531914893617
Best F1: 0.04278074866310161
Epoch: 4
Average actions: 2.4114584922790527
Average target actions: 2.7890625
Precision: 0.07058823529411765
Recall: 0.06382978723404255
F1: 0.06703910614525138
<<dialog policy>> epoch 4: saved network to mdl
Best Precision: 0.07058823529411765
Best Recall: 0.06382978723404255
Best F1: 0.06703910614525138
Epoch: 5
Precision: 0.07058823529411765
Recall: 0.06382978723404255
F1: 0.06703910614525138
Best Precision: 0.07058823529411765
Best Recall: 0.06382978723404255
Best F1: 0.06703910614525138
Epoch: 6
Average actions: 2.1536459922790527
Average target actions: 2.5859375
Precision: 0.049079754601226995
Recall: 0.0425531914893617
F1: 0.045584045584045586
Best Precision: 0.07058823529411765
Best Recall: 0.06382978723404255
Best F1: 0.06703910614525138
Epoch: 7
Precision: 0.049079754601226995
Recall: 0.0425531914893617
F1: 0.045584045584045586
Best Precision: 0.07058823529411765
Best Recall: 0.06382978723404255
Best F1: 0.06703910614525138
Epoch: 8
Average actions: 2.15625
Average target actions: 2.5520834922790527
Precision: 0.07547169811320754
Recall: 0.06382978723404255
F1: 0.06916426512968299
<<dialog policy>> epoch 8: saved network to mdl
Best Precision: 0.07547169811320754
Best Recall: 0.06382978723404255
Best F1: 0.06916426512968299
Epoch: 9
Precision: 0.07547169811320754
Recall: 0.06382978723404255
F1: 0.06916426512968299
Best Precision: 0.07547169811320754
Best Recall: 0.06382978723404255
Best F1: 0.06916426512968299
Epoch: 10
Average actions: 2.0572915077209473
Average target actions: 2.3489584922790527
Precision: 0.04516129032258064
Recall: 0.03723404255319149
F1: 0.04081632653061224
Best Precision: 0.07547169811320754
Best Recall: 0.06382978723404255
Best F1: 0.06916426512968299
Epoch: 11
Precision: 0.04516129032258064
Recall: 0.03723404255319149
F1: 0.04081632653061224
Best Precision: 0.07547169811320754
Best Recall: 0.06382978723404255
Best F1: 0.06916426512968299
Epoch: 12
Average actions: 1.984375
Average target actions: 2.5520834922790527
Precision: 0.08666666666666667
Recall: 0.06914893617021277
F1: 0.07692307692307691
<<dialog policy>> epoch 12: saved network to mdl
Best Precision: 0.08666666666666667
Best Recall: 0.06914893617021277
Best F1: 0.07692307692307691
Epoch: 13
Precision: 0.08666666666666667
Recall: 0.06914893617021277
F1: 0.07692307692307691
Best Precision: 0.08666666666666667
Best Recall: 0.06914893617021277
Best F1: 0.07692307692307691
Epoch: 14
Average actions: 2.0416665077209473
Average target actions: 2.3828125
Precision: 0.05228758169934641
Recall: 0.0425531914893617
F1: 0.046920821114369494
Best Precision: 0.08666666666666667
Best Recall: 0.06914893617021277
Best F1: 0.07692307692307691
Epoch: 15
Precision: 0.05228758169934641
Recall: 0.0425531914893617
F1: 0.046920821114369494
Best Precision: 0.08666666666666667
Best Recall: 0.06914893617021277
Best F1: 0.07692307692307691
Epoch: 16
Average actions: 2.1666665077209473
Average target actions: 2.2135417461395264
Precision: 0.1346153846153846
Recall: 0.11170212765957446
F1: 0.12209302325581395
<<dialog policy>> epoch 16: saved network to mdl
Best Precision: 0.1346153846153846
Best Recall: 0.11170212765957446
Best F1: 0.12209302325581395
Epoch: 17
Precision: 0.1346153846153846
Recall: 0.11170212765957446
F1: 0.12209302325581395
Best Precision: 0.1346153846153846
Best Recall: 0.11170212765957446
Best F1: 0.12209302325581395
Epoch: 18
Average actions: 1.7734375
Average target actions: 2.5520834922790527
Precision: 0.0661764705882353
Recall: 0.047872340425531915
F1: 0.05555555555555556
Best Precision: 0.1346153846153846
Best Recall: 0.11170212765957446
Best F1: 0.12209302325581395
Epoch: 19
Precision: 0.0661764705882353
Recall: 0.047872340425531915
F1: 0.05555555555555556
Best Precision: 0.1346153846153846
Best Recall: 0.11170212765957446
Best F1: 0.12209302325581395
Epoch: 20
Average actions: 2.1328125
Average target actions: 2.6197917461395264
Precision: 0.1346153846153846
Recall: 0.11170212765957446
F1: 0.12209302325581395
Best Precision: 0.1346153846153846
Best Recall: 0.11170212765957446
Best F1: 0.12209302325581395
Epoch: 21
Precision: 0.1346153846153846
Recall: 0.11170212765957446
F1: 0.12209302325581395
Best Precision: 0.1346153846153846
Best Recall: 0.11170212765957446
Best F1: 0.12209302325581395
Epoch: 22
Average actions: 1.9296875
Average target actions: 2.1119792461395264
Precision: 0.08391608391608392
Recall: 0.06382978723404255
F1: 0.07250755287009063
Best Precision: 0.1346153846153846
Best Recall: 0.11170212765957446
Best F1: 0.12209302325581395
Epoch: 23
Precision: 0.08391608391608392
Recall: 0.06382978723404255
F1: 0.07250755287009063
Best Precision: 0.1346153846153846
Best Recall: 0.11170212765957446
Best F1: 0.12209302325581395
Epoch: 24
Average actions: 2.2213540077209473
Average target actions: 2.3151042461395264
Precision: 0.09815950920245399
Recall: 0.0851063829787234
F1: 0.09116809116809117
Best Precision: 0.1346153846153846
Best Recall: 0.11170212765957446
Best F1: 0.12209302325581395
Epoch: 25
Precision: 0.09815950920245399
Recall: 0.0851063829787234
F1: 0.09116809116809117
Best Precision: 0.1346153846153846
Best Recall: 0.11170212765957446
Best F1: 0.12209302325581395
Epoch: 26
Average actions: 2.1171875
Average target actions: 2.7890625
Precision: 0.12987012987012986
Recall: 0.10638297872340426
F1: 0.11695906432748537
Best Precision: 0.1346153846153846
Best Recall: 0.11170212765957446
Best F1: 0.12209302325581395
Epoch: 27
Precision: 0.12987012987012986
Recall: 0.10638297872340426
F1: 0.11695906432748537
Best Precision: 0.1346153846153846
Best Recall: 0.11170212765957446
Best F1: 0.12209302325581395
Epoch: 28
Average actions: 1.7734375
Average target actions: 2.484375
Precision: 0.08823529411764706
Recall: 0.06382978723404255
F1: 0.07407407407407407
Best Precision: 0.1346153846153846
Best Recall: 0.11170212765957446
Best F1: 0.12209302325581395
Epoch: 29
Precision: 0.08823529411764706
Recall: 0.06382978723404255
F1: 0.07407407407407407
Best Precision: 0.1346153846153846
Best Recall: 0.11170212765957446
Best F1: 0.12209302325581395
Epoch: 30
Average actions: 2.1822915077209473
Average target actions: 2.3489584922790527
Precision: 0.10126582278481013
Recall: 0.0851063829787234
F1: 0.09248554913294797
Best Precision: 0.1346153846153846
Best Recall: 0.11170212765957446
Best F1: 0.12209302325581395
Epoch: 31
Precision: 0.10126582278481013
Recall: 0.0851063829787234
F1: 0.09248554913294797
Best Precision: 0.1346153846153846
Best Recall: 0.11170212765957446
Best F1: 0.12209302325581395
Epoch: 32
Average actions: 2.0442707538604736
Average target actions: 2.6197917461395264
Precision: 0.12345679012345678
Recall: 0.10638297872340426
F1: 0.11428571428571428
Best Precision: 0.1346153846153846
Best Recall: 0.11170212765957446
Best F1: 0.12209302325581395
Epoch: 33
Precision: 0.12345679012345678
Recall: 0.10638297872340426
F1: 0.11428571428571428
Best Precision: 0.1346153846153846
Best Recall: 0.11170212765957446
Best F1: 0.12209302325581395
Epoch: 34
Average actions: 1.8307292461395264
Average target actions: 2.5859375
Precision: 0.11510791366906475
Recall: 0.0851063829787234
F1: 0.09785932721712538
Best Precision: 0.1346153846153846
Best Recall: 0.11170212765957446
Best F1: 0.12209302325581395
Epoch: 35
Precision: 0.11510791366906475
Recall: 0.0851063829787234
F1: 0.09785932721712538
Best Precision: 0.1346153846153846
Best Recall: 0.11170212765957446
Best F1: 0.12209302325581395
Epoch: 36
Average actions: 2.2838540077209473
Average target actions: 2.3489584922790527
Precision: 0.1286549707602339
Recall: 0.11702127659574468
F1: 0.12256267409470752
<<dialog policy>> epoch 36: saved network to mdl
Best Precision: 0.1346153846153846
Best Recall: 0.11702127659574468
Best F1: 0.12256267409470752
Epoch: 37
Precision: 0.1286549707602339
Recall: 0.11702127659574468
F1: 0.12256267409470752
Best Precision: 0.1346153846153846
Best Recall: 0.11702127659574468
Best F1: 0.12256267409470752
Epoch: 38
Average actions: 1.9479167461395264
Average target actions: 2.7552084922790527
Precision: 0.12337662337662338
Recall: 0.10106382978723404
F1: 0.1111111111111111
Best Precision: 0.1346153846153846
Best Recall: 0.11702127659574468
Best F1: 0.12256267409470752
Epoch: 39
Precision: 0.12337662337662338
Recall: 0.10106382978723404
F1: 0.1111111111111111
Best Precision: 0.1346153846153846
Best Recall: 0.11702127659574468
Best F1: 0.12256267409470752