Commit 638eca8 (verified) by IrwinD · 1 parent: 9ca0a8f

End of training

Files changed (2)
  1. README.md +107 -107
  2. model.safetensors +1 -1
README.md CHANGED
@@ -22,7 +22,7 @@ model-index:
  metrics:
  - name: Rouge1
  type: rouge
- value: 0.4164
  ---
 
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,12 +32,12 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the hdfs_log_summary_dataset dataset.
  It achieves the following results on the evaluation set:
- - Loss: 2.1319
- - Rouge1: 0.4164
- - Rouge2: 0.1024
- - Rougel: 0.3062
- - Rougelsum: 0.2972
- - Gen Len: 19.0
 
  ## Model description
 
@@ -69,106 +69,106 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
- | No log | 1.0 | 12 | 2.7041 | 0.2055 | 0.0262 | 0.1605 | 0.1605 | 19.0 |
- | No log | 2.0 | 24 | 2.5221 | 0.2579 | 0.043 | 0.1681 | 0.188 | 18.4 |
- | No log | 3.0 | 36 | 2.3734 | 0.2267 | 0.0312 | 0.1471 | 0.142 | 15.0 |
- | No log | 4.0 | 48 | 2.2547 | 0.3859 | 0.0906 | 0.284 | 0.2976 | 19.0 |
- | No log | 5.0 | 60 | 2.1711 | 0.4009 | 0.1082 | 0.3073 | 0.3049 | 17.2 |
- | No log | 6.0 | 72 | 2.1276 | 0.451 | 0.1003 | 0.2929 | 0.3133 | 19.0 |
- | No log | 7.0 | 84 | 2.0695 | 0.4656 | 0.1099 | 0.2987 | 0.3184 | 19.0 |
- | No log | 8.0 | 96 | 2.0238 | 0.4656 | 0.1199 | 0.2987 | 0.3184 | 19.0 |
- | No log | 9.0 | 108 | 1.9969 | 0.4656 | 0.1292 | 0.308 | 0.3288 | 19.0 |
- | No log | 10.0 | 120 | 1.9705 | 0.4646 | 0.1422 | 0.3081 | 0.3262 | 19.0 |
- | No log | 11.0 | 132 | 1.9569 | 0.4364 | 0.1155 | 0.3061 | 0.3259 | 19.0 |
- | No log | 12.0 | 144 | 1.9347 | 0.4459 | 0.1249 | 0.2999 | 0.3207 | 19.0 |
- | No log | 13.0 | 156 | 1.8836 | 0.4459 | 0.1249 | 0.2999 | 0.3207 | 19.0 |
- | No log | 14.0 | 168 | 1.8488 | 0.4245 | 0.0965 | 0.289 | 0.3086 | 19.0 |
- | No log | 15.0 | 180 | 1.8547 | 0.4253 | 0.1076 | 0.3125 | 0.3101 | 19.0 |
- | No log | 16.0 | 192 | 1.8332 | 0.452 | 0.1166 | 0.2952 | 0.3096 | 19.0 |
- | No log | 17.0 | 204 | 1.8252 | 0.4578 | 0.1247 | 0.3185 | 0.3285 | 19.0 |
- | No log | 18.0 | 216 | 1.8042 | 0.4656 | 0.1343 | 0.3267 | 0.3333 | 19.0 |
- | No log | 19.0 | 228 | 1.7856 | 0.4578 | 0.1247 | 0.3185 | 0.3285 | 19.0 |
- | No log | 20.0 | 240 | 1.8086 | 0.4451 | 0.1308 | 0.3241 | 0.3328 | 19.0 |
- | No log | 21.0 | 252 | 1.8156 | 0.433 | 0.1389 | 0.3247 | 0.3279 | 19.0 |
- | No log | 22.0 | 264 | 1.7810 | 0.4429 | 0.1135 | 0.3005 | 0.3155 | 19.0 |
- | No log | 23.0 | 276 | 1.7715 | 0.3852 | 0.072 | 0.2633 | 0.2655 | 19.0 |
- | No log | 24.0 | 288 | 1.8142 | 0.4176 | 0.1092 | 0.2922 | 0.2922 | 19.0 |
- | No log | 25.0 | 300 | 1.8024 | 0.4111 | 0.11 | 0.2811 | 0.2811 | 19.0 |
- | No log | 26.0 | 312 | 1.7650 | 0.404 | 0.1024 | 0.2776 | 0.2756 | 19.0 |
- | No log | 27.0 | 324 | 1.7557 | 0.4032 | 0.0963 | 0.2769 | 0.2733 | 19.0 |
- | No log | 28.0 | 336 | 1.7856 | 0.4282 | 0.1475 | 0.3136 | 0.3129 | 19.0 |
- | No log | 29.0 | 348 | 1.7468 | 0.4325 | 0.1374 | 0.3167 | 0.3167 | 19.0 |
- | No log | 30.0 | 360 | 1.7433 | 0.4258 | 0.1562 | 0.3007 | 0.3007 | 19.0 |
- | No log | 31.0 | 372 | 1.7651 | 0.4325 | 0.1556 | 0.3253 | 0.3253 | 19.0 |
- | No log | 32.0 | 384 | 1.7467 | 0.3914 | 0.0991 | 0.2751 | 0.2751 | 19.0 |
- | No log | 33.0 | 396 | 1.7758 | 0.3914 | 0.0991 | 0.2751 | 0.2751 | 19.0 |
- | No log | 34.0 | 408 | 1.7551 | 0.3858 | 0.0991 | 0.269 | 0.269 | 19.0 |
- | No log | 35.0 | 420 | 1.7500 | 0.3999 | 0.1255 | 0.2931 | 0.2931 | 19.0 |
- | No log | 36.0 | 432 | 1.7631 | 0.4176 | 0.1446 | 0.3276 | 0.3273 | 19.0 |
- | No log | 37.0 | 444 | 1.7702 | 0.406 | 0.1277 | 0.2883 | 0.2883 | 19.0 |
- | No log | 38.0 | 456 | 1.8084 | 0.3933 | 0.1088 | 0.2771 | 0.2771 | 19.0 |
- | No log | 39.0 | 468 | 1.8104 | 0.3999 | 0.1308 | 0.3018 | 0.3018 | 19.0 |
- | No log | 40.0 | 480 | 1.8087 | 0.3864 | 0.1097 | 0.2785 | 0.2785 | 19.0 |
- | No log | 41.0 | 492 | 1.8254 | 0.3974 | 0.1277 | 0.2883 | 0.2883 | 19.0 |
- | 1.3176 | 42.0 | 504 | 1.8406 | 0.4042 | 0.1385 | 0.2955 | 0.2955 | 19.0 |
- | 1.3176 | 43.0 | 516 | 1.8620 | 0.3864 | 0.1097 | 0.2785 | 0.2785 | 19.0 |
- | 1.3176 | 44.0 | 528 | 1.8932 | 0.3855 | 0.108 | 0.2872 | 0.2867 | 19.0 |
- | 1.3176 | 45.0 | 540 | 1.8810 | 0.3911 | 0.108 | 0.2994 | 0.2994 | 19.0 |
- | 1.3176 | 46.0 | 552 | 1.8600 | 0.3985 | 0.1095 | 0.298 | 0.2969 | 19.0 |
- | 1.3176 | 47.0 | 564 | 1.8706 | 0.3937 | 0.1178 | 0.2932 | 0.2932 | 19.0 |
- | 1.3176 | 48.0 | 576 | 1.8394 | 0.4061 | 0.1178 | 0.306 | 0.3056 | 19.0 |
- | 1.3176 | 49.0 | 588 | 1.8910 | 0.3929 | 0.0813 | 0.2833 | 0.2822 | 19.0 |
- | 1.3176 | 50.0 | 600 | 1.9152 | 0.3808 | 0.0807 | 0.281 | 0.2805 | 19.0 |
- | 1.3176 | 51.0 | 612 | 1.9092 | 0.3883 | 0.0918 | 0.289 | 0.289 | 19.0 |
- | 1.3176 | 52.0 | 624 | 1.8571 | 0.3877 | 0.1009 | 0.2971 | 0.2971 | 19.0 |
- | 1.3176 | 53.0 | 636 | 1.8913 | 0.3985 | 0.1254 | 0.3069 | 0.3052 | 19.0 |
- | 1.3176 | 54.0 | 648 | 1.9744 | 0.3985 | 0.1254 | 0.3069 | 0.3052 | 19.0 |
- | 1.3176 | 55.0 | 660 | 1.9156 | 0.3975 | 0.1024 | 0.292 | 0.2837 | 19.0 |
- | 1.3176 | 56.0 | 672 | 1.8886 | 0.3937 | 0.1183 | 0.306 | 0.303 | 19.0 |
- | 1.3176 | 57.0 | 684 | 1.9325 | 0.3883 | 0.1178 | 0.306 | 0.3056 | 19.0 |
- | 1.3176 | 58.0 | 696 | 1.9252 | 0.3994 | 0.1183 | 0.3175 | 0.3158 | 19.0 |
- | 1.3176 | 59.0 | 708 | 1.9159 | 0.3883 | 0.1178 | 0.306 | 0.3056 | 19.0 |
- | 1.3176 | 60.0 | 720 | 2.0071 | 0.4097 | 0.1369 | 0.3281 | 0.3261 | 19.0 |
- | 1.3176 | 61.0 | 732 | 1.9834 | 0.4164 | 0.1358 | 0.3266 | 0.3247 | 19.0 |
- | 1.3176 | 62.0 | 744 | 1.9928 | 0.4378 | 0.1185 | 0.3342 | 0.3261 | 19.0 |
- | 1.3176 | 63.0 | 756 | 1.9718 | 0.4267 | 0.1149 | 0.3306 | 0.3239 | 19.0 |
- | 1.3176 | 64.0 | 768 | 1.9513 | 0.4267 | 0.1149 | 0.3306 | 0.3239 | 19.0 |
- | 1.3176 | 65.0 | 780 | 1.9836 | 0.4067 | 0.1021 | 0.2976 | 0.29 | 19.0 |
- | 1.3176 | 66.0 | 792 | 1.9588 | 0.4134 | 0.1015 | 0.3069 | 0.2992 | 19.0 |
- | 1.3176 | 67.0 | 804 | 1.9513 | 0.4098 | 0.101 | 0.3142 | 0.3072 | 19.0 |
- | 1.3176 | 68.0 | 816 | 2.0276 | 0.4146 | 0.1015 | 0.3142 | 0.3045 | 19.0 |
- | 1.3176 | 69.0 | 828 | 2.0201 | 0.4134 | 0.1015 | 0.3069 | 0.2992 | 19.0 |
- | 1.3176 | 70.0 | 840 | 2.0082 | 0.4001 | 0.1022 | 0.3051 | 0.3051 | 19.0 |
- | 1.3176 | 71.0 | 852 | 2.0198 | 0.4004 | 0.1022 | 0.3124 | 0.3115 | 19.0 |
- | 1.3176 | 72.0 | 864 | 2.0386 | 0.4209 | 0.1015 | 0.3251 | 0.3172 | 19.0 |
- | 1.3176 | 73.0 | 876 | 2.0227 | 0.4057 | 0.1022 | 0.3221 | 0.3221 | 19.0 |
- | 1.3176 | 74.0 | 888 | 2.0413 | 0.4014 | 0.1031 | 0.3045 | 0.3024 | 19.0 |
- | 1.3176 | 75.0 | 900 | 2.0415 | 0.4014 | 0.1031 | 0.3045 | 0.3024 | 19.0 |
- | 1.3176 | 76.0 | 912 | 2.0913 | 0.4014 | 0.1031 | 0.3045 | 0.3024 | 19.0 |
- | 1.3176 | 77.0 | 924 | 2.0887 | 0.4014 | 0.1031 | 0.3045 | 0.3024 | 19.0 |
- | 1.3176 | 78.0 | 936 | 2.0997 | 0.4004 | 0.1022 | 0.3124 | 0.3115 | 19.0 |
- | 1.3176 | 79.0 | 948 | 2.0907 | 0.4014 | 0.1031 | 0.3045 | 0.3024 | 19.0 |
- | 1.3176 | 80.0 | 960 | 2.1398 | 0.4164 | 0.1024 | 0.3062 | 0.2972 | 19.0 |
- | 1.3176 | 81.0 | 972 | 2.1364 | 0.4164 | 0.1024 | 0.3062 | 0.2972 | 19.0 |
- | 1.3176 | 82.0 | 984 | 2.1417 | 0.4164 | 0.1024 | 0.3062 | 0.2972 | 19.0 |
- | 1.3176 | 83.0 | 996 | 2.1454 | 0.4014 | 0.1031 | 0.3045 | 0.3024 | 19.0 |
- | 0.5445 | 84.0 | 1008 | 2.1506 | 0.4014 | 0.1031 | 0.3045 | 0.3024 | 19.0 |
- | 0.5445 | 85.0 | 1020 | 2.1224 | 0.4151 | 0.1184 | 0.3106 | 0.3036 | 19.0 |
- | 0.5445 | 86.0 | 1032 | 2.0857 | 0.4151 | 0.1184 | 0.3106 | 0.3036 | 19.0 |
- | 0.5445 | 87.0 | 1044 | 2.0810 | 0.3939 | 0.103 | 0.2969 | 0.2898 | 19.0 |
- | 0.5445 | 88.0 | 1056 | 2.0854 | 0.4007 | 0.103 | 0.2969 | 0.2898 | 19.0 |
- | 0.5445 | 89.0 | 1068 | 2.1048 | 0.4098 | 0.103 | 0.2969 | 0.2898 | 19.0 |
- | 0.5445 | 90.0 | 1080 | 2.1153 | 0.4007 | 0.103 | 0.2969 | 0.2898 | 19.0 |
- | 0.5445 | 91.0 | 1092 | 2.1200 | 0.3971 | 0.1018 | 0.2939 | 0.2873 | 19.0 |
- | 0.5445 | 92.0 | 1104 | 2.1221 | 0.3971 | 0.1018 | 0.2939 | 0.2873 | 19.0 |
- | 0.5445 | 93.0 | 1116 | 2.1291 | 0.4007 | 0.103 | 0.2969 | 0.2898 | 19.0 |
- | 0.5445 | 94.0 | 1128 | 2.1419 | 0.4164 | 0.1024 | 0.3062 | 0.2972 | 19.0 |
- | 0.5445 | 95.0 | 1140 | 2.1438 | 0.4073 | 0.1024 | 0.3062 | 0.2972 | 19.0 |
- | 0.5445 | 96.0 | 1152 | 2.1381 | 0.4164 | 0.1024 | 0.3062 | 0.2972 | 19.0 |
- | 0.5445 | 97.0 | 1164 | 2.1349 | 0.4164 | 0.1024 | 0.3062 | 0.2972 | 19.0 |
- | 0.5445 | 98.0 | 1176 | 2.1347 | 0.4164 | 0.1024 | 0.3062 | 0.2972 | 19.0 |
- | 0.5445 | 99.0 | 1188 | 2.1322 | 0.4164 | 0.1024 | 0.3062 | 0.2972 | 19.0 |
- | 0.5445 | 100.0 | 1200 | 2.1319 | 0.4164 | 0.1024 | 0.3062 | 0.2972 | 19.0 |
 
 
  ### Framework versions
 
  metrics:
  - name: Rouge1
  type: rouge
+ value: 0.3829
  ---
 
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
 
  This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the hdfs_log_summary_dataset dataset.
  It achieves the following results on the evaluation set:
+ - Loss: 0.4037
+ - Rouge1: 0.3829
+ - Rouge2: 0.1129
+ - Rougel: 0.2968
+ - Rougelsum: 0.3082
+ - Gen Len: 20.0
 
  ## Model description
 
 
 
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
+ | No log | 1.0 | 12 | 0.5375 | 0.0672 | 0.0252 | 0.07 | 0.0672 | 20.0 |
+ | No log | 2.0 | 24 | 0.4705 | 0.1964 | 0.0256 | 0.1555 | 0.1555 | 19.8 |
+ | No log | 3.0 | 36 | 0.4266 | 0.2449 | 0.0295 | 0.214 | 0.2211 | 19.8 |
+ | No log | 4.0 | 48 | 0.3889 | 0.3408 | 0.1154 | 0.2913 | 0.2985 | 19.8 |
+ | No log | 5.0 | 60 | 0.3618 | 0.4363 | 0.2003 | 0.3342 | 0.3588 | 19.8 |
+ | No log | 6.0 | 72 | 0.3456 | 0.4409 | 0.2189 | 0.3272 | 0.3591 | 20.0 |
+ | No log | 7.0 | 84 | 0.3328 | 0.4264 | 0.2258 | 0.3262 | 0.3521 | 20.0 |
+ | No log | 8.0 | 96 | 0.3233 | 0.4264 | 0.2258 | 0.3262 | 0.3521 | 20.0 |
+ | No log | 9.0 | 108 | 0.3185 | 0.4192 | 0.2258 | 0.3262 | 0.3433 | 20.0 |
+ | No log | 10.0 | 120 | 0.3142 | 0.4273 | 0.2077 | 0.332 | 0.3645 | 20.0 |
+ | No log | 11.0 | 132 | 0.3141 | 0.4495 | 0.2203 | 0.3423 | 0.3867 | 20.0 |
+ | No log | 12.0 | 144 | 0.3143 | 0.4609 | 0.2256 | 0.3671 | 0.4064 | 20.0 |
+ | No log | 13.0 | 156 | 0.3106 | 0.477 | 0.2492 | 0.3589 | 0.3989 | 20.0 |
+ | No log | 14.0 | 168 | 0.3085 | 0.432 | 0.2089 | 0.307 | 0.3448 | 20.0 |
+ | No log | 15.0 | 180 | 0.3080 | 0.4391 | 0.2207 | 0.3181 | 0.3533 | 20.0 |
+ | No log | 16.0 | 192 | 0.3063 | 0.4179 | 0.2089 | 0.3027 | 0.3337 | 20.0 |
+ | No log | 17.0 | 204 | 0.3072 | 0.4179 | 0.2089 | 0.3027 | 0.3337 | 20.0 |
+ | No log | 18.0 | 216 | 0.3104 | 0.4462 | 0.2281 | 0.3247 | 0.3589 | 20.0 |
+ | No log | 19.0 | 228 | 0.3094 | 0.4462 | 0.2281 | 0.3247 | 0.3589 | 20.0 |
+ | No log | 20.0 | 240 | 0.3119 | 0.4457 | 0.2029 | 0.3242 | 0.3452 | 20.0 |
+ | No log | 21.0 | 252 | 0.3158 | 0.4657 | 0.2356 | 0.3445 | 0.3749 | 20.0 |
+ | No log | 22.0 | 264 | 0.3153 | 0.4657 | 0.2356 | 0.3445 | 0.3749 | 20.0 |
+ | No log | 23.0 | 276 | 0.3160 | 0.4657 | 0.2356 | 0.3445 | 0.3749 | 20.0 |
+ | No log | 24.0 | 288 | 0.3173 | 0.4752 | 0.2416 | 0.3594 | 0.39 | 20.0 |
+ | No log | 25.0 | 300 | 0.3173 | 0.4325 | 0.1942 | 0.3352 | 0.3508 | 20.0 |
+ | No log | 26.0 | 312 | 0.3185 | 0.4624 | 0.2042 | 0.3527 | 0.3704 | 20.0 |
+ | No log | 27.0 | 324 | 0.3202 | 0.4798 | 0.2486 | 0.3656 | 0.3883 | 20.0 |
+ | No log | 28.0 | 336 | 0.3243 | 0.4763 | 0.2284 | 0.3779 | 0.4026 | 20.0 |
+ | No log | 29.0 | 348 | 0.3241 | 0.4763 | 0.2284 | 0.3779 | 0.4026 | 20.0 |
+ | No log | 30.0 | 360 | 0.3276 | 0.4726 | 0.2038 | 0.3505 | 0.3674 | 20.0 |
+ | No log | 31.0 | 372 | 0.3250 | 0.4726 | 0.2038 | 0.3579 | 0.3795 | 20.0 |
+ | No log | 32.0 | 384 | 0.3273 | 0.4741 | 0.2293 | 0.3661 | 0.3896 | 20.0 |
+ | No log | 33.0 | 396 | 0.3282 | 0.4726 | 0.2267 | 0.3594 | 0.3768 | 20.0 |
+ | No log | 34.0 | 408 | 0.3341 | 0.4726 | 0.2267 | 0.3594 | 0.3768 | 20.0 |
+ | No log | 35.0 | 420 | 0.3356 | 0.4726 | 0.2038 | 0.3505 | 0.3674 | 20.0 |
+ | No log | 36.0 | 432 | 0.3326 | 0.4726 | 0.2038 | 0.3505 | 0.3674 | 20.0 |
+ | No log | 37.0 | 444 | 0.3356 | 0.4726 | 0.2038 | 0.3505 | 0.3674 | 20.0 |
+ | No log | 38.0 | 456 | 0.3374 | 0.4785 | 0.1735 | 0.3409 | 0.3656 | 20.0 |
+ | No log | 39.0 | 468 | 0.3395 | 0.4356 | 0.1714 | 0.3394 | 0.3658 | 20.0 |
+ | No log | 40.0 | 480 | 0.3497 | 0.4273 | 0.1714 | 0.3322 | 0.3482 | 20.0 |
+ | No log | 41.0 | 492 | 0.3479 | 0.4276 | 0.1714 | 0.3586 | 0.372 | 20.0 |
+ | 0.2605 | 42.0 | 504 | 0.3442 | 0.4617 | 0.2074 | 0.3749 | 0.3948 | 20.0 |
+ | 0.2605 | 43.0 | 516 | 0.3450 | 0.4617 | 0.2074 | 0.3749 | 0.3948 | 20.0 |
+ | 0.2605 | 44.0 | 528 | 0.3443 | 0.4325 | 0.1824 | 0.3352 | 0.358 | 20.0 |
+ | 0.2605 | 45.0 | 540 | 0.3553 | 0.4277 | 0.1714 | 0.3322 | 0.355 | 20.0 |
+ | 0.2605 | 46.0 | 552 | 0.3527 | 0.4299 | 0.163 | 0.3335 | 0.3563 | 20.0 |
+ | 0.2605 | 47.0 | 564 | 0.3561 | 0.4227 | 0.163 | 0.3335 | 0.349 | 20.0 |
+ | 0.2605 | 48.0 | 576 | 0.3554 | 0.4398 | 0.1834 | 0.3212 | 0.3403 | 20.0 |
+ | 0.2605 | 49.0 | 588 | 0.3511 | 0.433 | 0.1757 | 0.3147 | 0.3307 | 20.0 |
+ | 0.2605 | 50.0 | 600 | 0.3620 | 0.4011 | 0.1232 | 0.2925 | 0.3011 | 20.0 |
+ | 0.2605 | 51.0 | 612 | 0.3549 | 0.3804 | 0.0873 | 0.2922 | 0.2873 | 20.0 |
+ | 0.2605 | 52.0 | 624 | 0.3562 | 0.4124 | 0.163 | 0.313 | 0.3285 | 20.0 |
+ | 0.2605 | 53.0 | 636 | 0.3616 | 0.4398 | 0.1834 | 0.3212 | 0.3403 | 20.0 |
+ | 0.2605 | 54.0 | 648 | 0.3645 | 0.4398 | 0.192 | 0.3404 | 0.3618 | 20.0 |
+ | 0.2605 | 55.0 | 660 | 0.3704 | 0.4398 | 0.1834 | 0.3293 | 0.3507 | 20.0 |
+ | 0.2605 | 56.0 | 672 | 0.3656 | 0.4128 | 0.1331 | 0.2987 | 0.3115 | 20.0 |
+ | 0.2605 | 57.0 | 684 | 0.3632 | 0.3748 | 0.1134 | 0.2818 | 0.2929 | 20.0 |
+ | 0.2605 | 58.0 | 696 | 0.3690 | 0.3829 | 0.1129 | 0.2905 | 0.3 | 20.0 |
+ | 0.2605 | 59.0 | 708 | 0.3729 | 0.3829 | 0.1129 | 0.2905 | 0.3 | 20.0 |
+ | 0.2605 | 60.0 | 720 | 0.3763 | 0.4124 | 0.163 | 0.313 | 0.3285 | 20.0 |
+ | 0.2605 | 61.0 | 732 | 0.3743 | 0.4124 | 0.163 | 0.313 | 0.3285 | 20.0 |
+ | 0.2605 | 62.0 | 744 | 0.3715 | 0.4124 | 0.163 | 0.313 | 0.3285 | 20.0 |
+ | 0.2605 | 63.0 | 756 | 0.3725 | 0.4398 | 0.1834 | 0.3212 | 0.3403 | 20.0 |
+ | 0.2605 | 64.0 | 768 | 0.3650 | 0.4184 | 0.1735 | 0.344 | 0.3514 | 20.0 |
+ | 0.2605 | 65.0 | 780 | 0.3673 | 0.4197 | 0.163 | 0.313 | 0.3358 | 20.0 |
+ | 0.2605 | 66.0 | 792 | 0.3813 | 0.4124 | 0.163 | 0.313 | 0.3285 | 20.0 |
+ | 0.2605 | 67.0 | 804 | 0.3874 | 0.4124 | 0.163 | 0.313 | 0.3285 | 20.0 |
+ | 0.2605 | 68.0 | 816 | 0.3852 | 0.4124 | 0.163 | 0.313 | 0.3285 | 20.0 |
+ | 0.2605 | 69.0 | 828 | 0.3895 | 0.4124 | 0.163 | 0.313 | 0.3285 | 20.0 |
+ | 0.2605 | 70.0 | 840 | 0.3947 | 0.3961 | 0.1129 | 0.3205 | 0.3287 | 20.0 |
+ | 0.2605 | 71.0 | 852 | 0.3826 | 0.4168 | 0.123 | 0.3365 | 0.3552 | 20.0 |
+ | 0.2605 | 72.0 | 864 | 0.3811 | 0.4168 | 0.123 | 0.3365 | 0.3552 | 20.0 |
+ | 0.2605 | 73.0 | 876 | 0.3836 | 0.3961 | 0.1129 | 0.3205 | 0.3287 | 20.0 |
+ | 0.2605 | 74.0 | 888 | 0.3820 | 0.4398 | 0.1834 | 0.3212 | 0.3403 | 20.0 |
+ | 0.2605 | 75.0 | 900 | 0.3866 | 0.4124 | 0.163 | 0.313 | 0.3285 | 20.0 |
+ | 0.2605 | 76.0 | 912 | 0.3906 | 0.4227 | 0.163 | 0.3335 | 0.349 | 20.0 |
+ | 0.2605 | 77.0 | 924 | 0.3936 | 0.4124 | 0.163 | 0.313 | 0.3285 | 20.0 |
+ | 0.2605 | 78.0 | 936 | 0.3906 | 0.4124 | 0.163 | 0.313 | 0.3285 | 20.0 |
+ | 0.2605 | 79.0 | 948 | 0.3931 | 0.4128 | 0.1331 | 0.3082 | 0.3165 | 20.0 |
+ | 0.2605 | 80.0 | 960 | 0.3966 | 0.4128 | 0.1331 | 0.3082 | 0.3165 | 20.0 |
+ | 0.2605 | 81.0 | 972 | 0.3935 | 0.4128 | 0.1331 | 0.2987 | 0.3115 | 20.0 |
+ | 0.2605 | 82.0 | 984 | 0.3951 | 0.4128 | 0.1331 | 0.2987 | 0.3115 | 20.0 |
+ | 0.2605 | 83.0 | 996 | 0.3954 | 0.3829 | 0.1129 | 0.2905 | 0.3 | 20.0 |
+ | 0.0959 | 84.0 | 1008 | 0.3960 | 0.3829 | 0.1129 | 0.2905 | 0.3 | 20.0 |
+ | 0.0959 | 85.0 | 1020 | 0.3980 | 0.3829 | 0.1237 | 0.3008 | 0.3135 | 20.0 |
+ | 0.0959 | 86.0 | 1032 | 0.4003 | 0.3829 | 0.1129 | 0.2905 | 0.3 | 20.0 |
+ | 0.0959 | 87.0 | 1044 | 0.4018 | 0.3829 | 0.1129 | 0.2905 | 0.3 | 20.0 |
+ | 0.0959 | 88.0 | 1056 | 0.4056 | 0.3829 | 0.1129 | 0.2905 | 0.3 | 20.0 |
+ | 0.0959 | 89.0 | 1068 | 0.4086 | 0.4128 | 0.1448 | 0.304 | 0.3193 | 20.0 |
+ | 0.0959 | 90.0 | 1080 | 0.4081 | 0.4128 | 0.1448 | 0.3161 | 0.3275 | 20.0 |
+ | 0.0959 | 91.0 | 1092 | 0.4060 | 0.3961 | 0.1246 | 0.3284 | 0.3398 | 20.0 |
+ | 0.0959 | 92.0 | 1104 | 0.4045 | 0.3829 | 0.1246 | 0.3068 | 0.3193 | 20.0 |
+ | 0.0959 | 93.0 | 1116 | 0.4051 | 0.3829 | 0.1129 | 0.2968 | 0.3082 | 20.0 |
+ | 0.0959 | 94.0 | 1128 | 0.4038 | 0.3829 | 0.1129 | 0.2968 | 0.3082 | 20.0 |
+ | 0.0959 | 95.0 | 1140 | 0.4025 | 0.3829 | 0.1129 | 0.2968 | 0.3082 | 20.0 |
+ | 0.0959 | 96.0 | 1152 | 0.4026 | 0.3829 | 0.1129 | 0.2968 | 0.3082 | 20.0 |
+ | 0.0959 | 97.0 | 1164 | 0.4034 | 0.3829 | 0.1129 | 0.2968 | 0.3082 | 20.0 |
+ | 0.0959 | 98.0 | 1176 | 0.4032 | 0.3829 | 0.1129 | 0.2968 | 0.3082 | 20.0 |
+ | 0.0959 | 99.0 | 1188 | 0.4036 | 0.3829 | 0.1129 | 0.2968 | 0.3082 | 20.0 |
+ | 0.0959 | 100.0 | 1200 | 0.4037 | 0.3829 | 0.1129 | 0.2968 | 0.3082 | 20.0 |
 
 
  ### Framework versions
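
In the added results table, validation loss bottoms out at 0.3063 (epoch 16) while the final checkpoint at epoch 100 reports 0.4037, and Rouge1 peaks at 0.4798 (epoch 27) against a final 0.3829. The sketch below shows one way such rows could be scanned for the best epoch. The names `parse_row` and `sample_rows` are hypothetical helpers introduced for illustration, and only four rows copied from the table are included for brevity.

```python
def parse_row(line):
    """Parse one data row of the training-results markdown table into a dict."""
    # Drop an optional leading diff marker ("+" or "-") and surrounding spaces.
    text = line.strip().lstrip("+-").strip()
    cells = [cell.strip() for cell in text.strip("|").split("|")]
    train_loss, epoch, step, val_loss, rouge1, rouge2, rougel, rougelsum, gen_len = cells
    return {
        # The Trainer prints "No log" until the first training-loss logging step.
        "train_loss": None if train_loss == "No log" else float(train_loss),
        "epoch": float(epoch),
        "step": int(step),
        "val_loss": float(val_loss),
        "rouge1": float(rouge1),
        "rouge2": float(rouge2),
        "rougel": float(rougel),
        "rougelsum": float(rougelsum),
        "gen_len": float(gen_len),
    }


# A four-row excerpt of the added table (hypothetical variable name).
sample_rows = [
    "+ | No log | 1.0 | 12 | 0.5375 | 0.0672 | 0.0252 | 0.07 | 0.0672 | 20.0 |",
    "+ | No log | 16.0 | 192 | 0.3063 | 0.4179 | 0.2089 | 0.3027 | 0.3337 | 20.0 |",
    "+ | No log | 27.0 | 324 | 0.3202 | 0.4798 | 0.2486 | 0.3656 | 0.3883 | 20.0 |",
    "+ | 0.0959 | 100.0 | 1200 | 0.4037 | 0.3829 | 0.1129 | 0.2968 | 0.3082 | 20.0 |",
]

rows = [parse_row(line) for line in sample_rows]
best_by_loss = min(rows, key=lambda r: r["val_loss"])
best_by_rouge1 = max(rows, key=lambda r: r["rouge1"])

print("lowest validation loss:", best_by_loss["epoch"], best_by_loss["val_loss"])
print("highest Rouge1:", best_by_rouge1["epoch"], best_by_rouge1["rouge1"])
```

If the run used the Hugging Face `Trainer`, the `load_best_model_at_end` and `metric_for_best_model` options of `TrainingArguments` are one way to keep the best checkpoint rather than the last one. The commit does not say whether they were set.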
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:4064c739ed5ba02570b416899cad8f2cf941e340311fdc195486dba97b956b15
  size 990345064
 
  version https://git-lfs.github.com/spec/v1
+ oid sha256:bfc9c4a35d335be8263efb98aee1105c4517ce3b5e826ef3ca02e15692398422
  size 990345064
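
The `model.safetensors` change touches only the Git LFS pointer: the `oid` differs while `size` stays at 990345064 bytes. This is what one would expect when the weights are retrained without changing the architecture. A minimal sketch of parsing such a pointer and checking a downloaded file against it follows. The function names `parse_lfs_pointer` and `file_matches_pointer` are hypothetical, and the sketch assumes the three-field pointer layout shown in this diff.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Split a Git LFS pointer file into its version, hash algorithm, digest and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def file_matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and digest the pointer records."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with open(path, "rb") as handle:
        # Hash in chunks so a ~1 GB weights file is never held in memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# The pointer added by this commit, copied from the diff.
new_pointer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:bfc9c4a35d335be8263efb98aee1105c4517ce3b5e826ef3ca02e15692398422
size 990345064
""")
print(new_pointer["algo"], new_pointer["size"])
```

Calling `file_matches_pointer("model.safetensors", new_pointer)` on a local download would then confirm that the file corresponds to this commit rather than its parent.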