
# bedus-creation/mbart-small-dataset-ii-eng-to-lim-005

This model is a fine-tuned version of mbart-50 on an unknown dataset. At the final training epoch it achieves the following results:

- Train Loss: 4.7245
- Validation Loss: 6.1589
- Epoch: 349

## Model description

More information needed

## Intended uses & limitations

More information needed
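
In the absence of further documentation, here is a minimal inference sketch. It assumes the repository ships TensorFlow weights loadable via `TFAutoModelForSeq2SeqLM` and that the bundled tokenizer handles the English-to-Limbu direction out of the box; neither point is confirmed by this card.

```python
# Hypothetical usage sketch -- repository contents and TF weight
# availability are assumptions, not documented in this card.
from transformers import AutoTokenizer, TFAutoModelForSeq2SeqLM

repo_id = "bedus-creation/mbart-small-dataset-ii-eng-to-lim-005"
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = TFAutoModelForSeq2SeqLM.from_pretrained(repo_id)

# Translate an English sentence; decoding settings are illustrative.
inputs = tokenizer("Good morning.", return_tensors="tf")
outputs = model.generate(**inputs, max_new_tokens=64, num_beams=4)
print(tokenizer.batch_decode(outputs, skip_special_tokens=True)[0])
```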

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training (see the optimizer sketch after this list):

- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': 2e-05, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False, 'weight_decay_rate': 0.01}
- training_precision: float32
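
The optimizer settings above correspond to the `AdamWeightDecay` class that ships with the TensorFlow side of `transformers`. A sketch of reconstructing it, assuming the TensorFlow 2.13 / Transformers 4.33 stack listed under framework versions (the `decay: 0.0` entry is the legacy Keras learning-rate decay and is already the default):

```python
# Sketch: rebuilding the training optimizer from the listed hyperparameters.
from transformers import AdamWeightDecay  # TF-only optimizer class

optimizer = AdamWeightDecay(
    learning_rate=2e-05,
    weight_decay_rate=0.01,
    beta_1=0.9,
    beta_2=0.999,
    epsilon=1e-07,
    amsgrad=False,
)
# model.compile(optimizer=optimizer)  # seq2seq loss comes from the model itself
```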

### Training results

| Train Loss | Validation Loss | Epoch |
|:----------:|:---------------:|:-----:|
| 8.4366 | 7.8649 | 0 |
| 7.8684 | 7.6440 | 1 |
| 7.7002 | 7.5328 | 2 |
| 7.5948 | 7.4486 | 3 |
| 7.5176 | 7.3868 | 4 |
| 7.4560 | 7.3324 | 5 |
| 7.4044 | 7.2855 | 6 |
| 7.3559 | 7.2365 | 7 |
| 7.3105 | 7.1809 | 8 |
| 7.2556 | 7.1305 | 9 |
| 7.2074 | 7.0882 | 10 |
| 7.1645 | 7.0523 | 11 |
| 7.1267 | 7.0236 | 12 |
| 7.0951 | 6.9883 | 13 |
| 7.0593 | 6.9593 | 14 |
| 7.0349 | 6.9400 | 15 |
| 7.0110 | 6.9160 | 16 |
| 6.9824 | 6.8902 | 17 |
| 6.9607 | 6.8716 | 18 |
| 6.9412 | 6.8525 | 19 |
| 6.9182 | 6.8337 | 20 |
| 6.8982 | 6.8178 | 21 |
| 6.8824 | 6.7984 | 22 |
| 6.8617 | 6.7825 | 23 |
| 6.8442 | 6.7660 | 24 |
| 6.8259 | 6.7494 | 25 |
| 6.8097 | 6.7386 | 26 |
| 6.7982 | 6.7210 | 27 |
| 6.7809 | 6.7095 | 28 |
| 6.7623 | 6.7007 | 29 |
| 6.7463 | 6.6821 | 30 |
| 6.7365 | 6.6703 | 31 |
| 6.7197 | 6.6623 | 32 |
| 6.7048 | 6.6462 | 33 |
| 6.6967 | 6.6421 | 34 |
| 6.6796 | 6.6343 | 35 |
| 6.6644 | 6.6172 | 36 |
| 6.6519 | 6.6143 | 37 |
| 6.6419 | 6.5981 | 38 |
| 6.6274 | 6.5878 | 39 |
| 6.6165 | 6.5824 | 40 |
| 6.6036 | 6.5701 | 41 |
| 6.5878 | 6.5622 | 42 |
| 6.5831 | 6.5504 | 43 |
| 6.5689 | 6.5434 | 44 |
| 6.5584 | 6.5383 | 45 |
| 6.5399 | 6.5246 | 46 |
| 6.5335 | 6.5189 | 47 |
| 6.5220 | 6.5079 | 48 |
| 6.5128 | 6.4998 | 49 |
| 6.5000 | 6.4904 | 50 |
| 6.4916 | 6.4851 | 51 |
| 6.4780 | 6.4783 | 52 |
| 6.4646 | 6.4720 | 53 |
| 6.4613 | 6.4552 | 54 |
| 6.4490 | 6.4510 | 55 |
| 6.4343 | 6.4442 | 56 |
| 6.4277 | 6.4371 | 57 |
| 6.4194 | 6.4313 | 58 |
| 6.4047 | 6.4199 | 59 |
| 6.3960 | 6.4106 | 60 |
| 6.3860 | 6.4075 | 61 |
| 6.3724 | 6.4045 | 62 |
| 6.3687 | 6.4019 | 63 |
| 6.3549 | 6.3878 | 64 |
| 6.3448 | 6.3807 | 65 |
| 6.3413 | 6.3781 | 66 |
| 6.3290 | 6.3738 | 67 |
| 6.3190 | 6.3642 | 68 |
| 6.3131 | 6.3598 | 69 |
| 6.2984 | 6.3536 | 70 |
| 6.2902 | 6.3422 | 71 |
| 6.2861 | 6.3377 | 72 |
| 6.2722 | 6.3377 | 73 |
| 6.2680 | 6.3278 | 74 |
| 6.2566 | 6.3217 | 75 |
| 6.2483 | 6.3172 | 76 |
| 6.2423 | 6.3098 | 77 |
| 6.2298 | 6.3081 | 78 |
| 6.2227 | 6.3011 | 79 |
| 6.2144 | 6.2932 | 80 |
| 6.2101 | 6.2905 | 81 |
| 6.1995 | 6.2877 | 82 |
| 6.1914 | 6.2838 | 83 |
| 6.1854 | 6.2800 | 84 |
| 6.1717 | 6.2722 | 85 |
| 6.1653 | 6.2689 | 86 |
| 6.1523 | 6.2678 | 87 |
| 6.1478 | 6.2577 | 88 |
| 6.1426 | 6.2567 | 89 |
| 6.1373 | 6.2535 | 90 |
| 6.1280 | 6.2511 | 91 |
| 6.1219 | 6.2371 | 92 |
| 6.1153 | 6.2373 | 93 |
| 6.1040 | 6.2347 | 94 |
| 6.0969 | 6.2340 | 95 |
| 6.0923 | 6.2320 | 96 |
| 6.0803 | 6.2222 | 97 |
| 6.0725 | 6.2178 | 98 |
| 6.0729 | 6.2144 | 99 |
| 6.0577 | 6.2236 | 100 |
| 6.0550 | 6.2041 | 101 |
| 6.0484 | 6.2030 | 102 |
| 6.0361 | 6.2051 | 103 |
| 6.0302 | 6.1977 | 104 |
| 6.0218 | 6.1937 | 105 |
| 6.0174 | 6.1935 | 106 |
| 6.0073 | 6.1899 | 107 |
| 6.0060 | 6.1883 | 108 |
| 5.9978 | 6.1783 | 109 |
| 5.9896 | 6.1827 | 110 |
| 5.9777 | 6.1770 | 111 |
| 5.9778 | 6.1693 | 112 |
| 5.9708 | 6.1707 | 113 |
| 5.9673 | 6.1590 | 114 |
| 5.9527 | 6.1713 | 115 |
| 5.9481 | 6.1604 | 116 |
| 5.9424 | 6.1603 | 117 |
| 5.9370 | 6.1547 | 118 |
| 5.9304 | 6.1574 | 119 |
| 5.9178 | 6.1506 | 120 |
| 5.9134 | 6.1478 | 121 |
| 5.9063 | 6.1440 | 122 |
| 5.8979 | 6.1406 | 123 |
| 5.8954 | 6.1384 | 124 |
| 5.8916 | 6.1418 | 125 |
| 5.8832 | 6.1362 | 126 |
| 5.8768 | 6.1319 | 127 |
| 5.8658 | 6.1348 | 128 |
| 5.8624 | 6.1318 | 129 |
| 5.8533 | 6.1196 | 130 |
| 5.8543 | 6.1273 | 131 |
| 5.8467 | 6.1118 | 132 |
| 5.8442 | 6.1191 | 133 |
| 5.8304 | 6.1320 | 134 |
| 5.8203 | 6.1158 | 135 |
| 5.8213 | 6.1142 | 136 |
| 5.8104 | 6.1116 | 137 |
| 5.8094 | 6.1126 | 138 |
| 5.7985 | 6.1105 | 139 |
| 5.7935 | 6.1018 | 140 |
| 5.7890 | 6.0984 | 141 |
| 5.7830 | 6.1016 | 142 |
| 5.7746 | 6.0977 | 143 |
| 5.7674 | 6.0997 | 144 |
| 5.7672 | 6.1080 | 145 |
| 5.7610 | 6.1039 | 146 |
| 5.7481 | 6.0915 | 147 |
| 5.7424 | 6.0873 | 148 |
| 5.7376 | 6.1008 | 149 |
| 5.7373 | 6.0831 | 150 |
| 5.7297 | 6.0911 | 151 |
| 5.7246 | 6.0920 | 152 |
| 5.7212 | 6.0897 | 153 |
| 5.7130 | 6.0784 | 154 |
| 5.7075 | 6.0794 | 155 |
| 5.6996 | 6.0880 | 156 |
| 5.6904 | 6.0793 | 157 |
| 5.6885 | 6.0713 | 158 |
| 5.6852 | 6.0854 | 159 |
| 5.6778 | 6.0719 | 160 |
| 5.6744 | 6.0712 | 161 |
| 5.6658 | 6.0784 | 162 |
| 5.6502 | 6.0747 | 163 |
| 5.6529 | 6.0715 | 164 |
| 5.6495 | 6.0735 | 165 |
| 5.6423 | 6.0722 | 166 |
| 5.6295 | 6.0707 | 167 |
| 5.6348 | 6.0691 | 168 |
| 5.6265 | 6.0762 | 169 |
| 5.6196 | 6.0679 | 170 |
| 5.6145 | 6.0675 | 171 |
| 5.6079 | 6.0622 | 172 |
| 5.6054 | 6.0676 | 173 |
| 5.5981 | 6.0658 | 174 |
| 5.5913 | 6.0607 | 175 |
| 5.5825 | 6.0546 | 176 |
| 5.5814 | 6.0588 | 177 |
| 5.5798 | 6.0482 | 178 |
| 5.5649 | 6.0603 | 179 |
| 5.5668 | 6.0510 | 180 |
| 5.5597 | 6.0643 | 181 |
| 5.5475 | 6.0641 | 182 |
| 5.5528 | 6.0585 | 183 |
| 5.5409 | 6.0620 | 184 |
| 5.5352 | 6.0466 | 185 |
| 5.5403 | 6.0507 | 186 |
| 5.5293 | 6.0510 | 187 |
| 5.5201 | 6.0662 | 188 |
| 5.5154 | 6.0554 | 189 |
| 5.5134 | 6.0430 | 190 |
| 5.5063 | 6.0596 | 191 |
| 5.4987 | 6.0458 | 192 |
| 5.4974 | 6.0416 | 193 |
| 5.4857 | 6.0499 | 194 |
| 5.4817 | 6.0659 | 195 |
| 5.4750 | 6.0540 | 196 |
| 5.4719 | 6.0493 | 197 |
| 5.4618 | 6.0423 | 198 |
| 5.4644 | 6.0460 | 199 |
| 5.4526 | 6.0523 | 200 |
| 5.4507 | 6.0451 | 201 |
| 5.4504 | 6.0430 | 202 |
| 5.4412 | 6.0421 | 203 |
| 5.4377 | 6.0492 | 204 |
| 5.4367 | 6.0482 | 205 |
| 5.4190 | 6.0259 | 206 |
| 5.4210 | 6.0281 | 207 |
| 5.4191 | 6.0418 | 208 |
| 5.4090 | 6.0383 | 209 |
| 5.4051 | 6.0445 | 210 |
| 5.3975 | 6.0565 | 211 |
| 5.3942 | 6.0581 | 212 |
| 5.3930 | 6.0509 | 213 |
| 5.3825 | 6.0506 | 214 |
| 5.3811 | 6.0428 | 215 |
| 5.3722 | 6.0368 | 216 |
| 5.3676 | 6.0392 | 217 |
| 5.3655 | 6.0460 | 218 |
| 5.3577 | 6.0488 | 219 |
| 5.3539 | 6.0431 | 220 |
| 5.3497 | 6.0410 | 221 |
| 5.3433 | 6.0381 | 222 |
| 5.3437 | 6.0376 | 223 |
| 5.3369 | 6.0409 | 224 |
| 5.3283 | 6.0320 | 225 |
| 5.3231 | 6.0516 | 226 |
| 5.3160 | 6.0432 | 227 |
| 5.3075 | 6.0544 | 228 |
| 5.3095 | 6.0537 | 229 |
| 5.3025 | 6.0458 | 230 |
| 5.2969 | 6.0451 | 231 |
| 5.2807 | 6.0449 | 232 |
| 5.2925 | 6.0455 | 233 |
| 5.2767 | 6.0551 | 234 |
| 5.2778 | 6.0392 | 235 |
| 5.2713 | 6.0419 | 236 |
| 5.2691 | 6.0435 | 237 |
| 5.2570 | 6.0495 | 238 |
| 5.2574 | 6.0301 | 239 |
| 5.2521 | 6.0362 | 240 |
| 5.2458 | 6.0449 | 241 |
| 5.2352 | 6.0462 | 242 |
| 5.2389 | 6.0425 | 243 |
| 5.2265 | 6.0372 | 244 |
| 5.2297 | 6.0372 | 245 |
| 5.2244 | 6.0580 | 246 |
| 5.2181 | 6.0523 | 247 |
| 5.2061 | 6.0487 | 248 |
| 5.2100 | 6.0475 | 249 |
| 5.1985 | 6.0405 | 250 |
| 5.1945 | 6.0451 | 251 |
| 5.1911 | 6.0552 | 252 |
| 5.1839 | 6.0503 | 253 |
| 5.1829 | 6.0510 | 254 |
| 5.1797 | 6.0456 | 255 |
| 5.1747 | 6.0627 | 256 |
| 5.1652 | 6.0384 | 257 |
| 5.1659 | 6.0546 | 258 |
| 5.1449 | 6.0503 | 259 |
| 5.1592 | 6.0514 | 260 |
| 5.1448 | 6.0491 | 261 |
| 5.1405 | 6.0556 | 262 |
| 5.1391 | 6.0594 | 263 |
| 5.1346 | 6.0362 | 264 |
| 5.1275 | 6.0367 | 265 |
| 5.1218 | 6.0447 | 266 |
| 5.1144 | 6.0636 | 267 |
| 5.1152 | 6.0556 | 268 |
| 5.1083 | 6.0503 | 269 |
| 5.1046 | 6.0597 | 270 |
| 5.0923 | 6.0726 | 271 |
| 5.0988 | 6.0692 | 272 |
| 5.0926 | 6.0654 | 273 |
| 5.0892 | 6.0757 | 274 |
| 5.0772 | 6.0547 | 275 |
| 5.0774 | 6.0703 | 276 |
| 5.0696 | 6.0715 | 277 |
| 5.0645 | 6.0838 | 278 |
| 5.0599 | 6.0687 | 279 |
| 5.0565 | 6.0621 | 280 |
| 5.0535 | 6.0846 | 281 |
| 5.0409 | 6.0779 | 282 |
| 5.0413 | 6.0753 | 283 |
| 5.0380 | 6.0609 | 284 |
| 5.0336 | 6.0889 | 285 |
| 5.0248 | 6.0762 | 286 |
| 5.0230 | 6.0876 | 287 |
| 5.0155 | 6.0588 | 288 |
| 5.0121 | 6.0788 | 289 |
| 5.0035 | 6.0777 | 290 |
| 5.0067 | 6.0848 | 291 |
| 5.0016 | 6.0831 | 292 |
| 4.9929 | 6.0991 | 293 |
| 4.9889 | 6.1011 | 294 |
| 4.9837 | 6.0805 | 295 |
| 4.9777 | 6.0858 | 296 |
| 4.9738 | 6.0803 | 297 |
| 4.9708 | 6.0757 | 298 |
| 4.9677 | 6.0886 | 299 |
| 4.9630 | 6.0828 | 300 |
| 4.9541 | 6.0883 | 301 |
| 4.9541 | 6.1026 | 302 |
| 4.9453 | 6.0925 | 303 |
| 4.9385 | 6.0854 | 304 |
| 4.9337 | 6.1038 | 305 |
| 4.9290 | 6.0854 | 306 |
| 4.9287 | 6.1008 | 307 |
| 4.9214 | 6.1174 | 308 |
| 4.9151 | 6.1056 | 309 |
| 4.9118 | 6.0934 | 310 |
| 4.9087 | 6.0919 | 311 |
| 4.8985 | 6.1064 | 312 |
| 4.9003 | 6.1010 | 313 |
| 4.8951 | 6.1118 | 314 |
| 4.8824 | 6.1020 | 315 |
| 4.8834 | 6.1020 | 316 |
| 4.8764 | 6.1173 | 317 |
| 4.8704 | 6.1189 | 318 |
| 4.8690 | 6.0976 | 319 |
| 4.8662 | 6.1058 | 320 |
| 4.8586 | 6.1060 | 321 |
| 4.8571 | 6.1026 | 322 |
| 4.8514 | 6.1102 | 323 |
| 4.8426 | 6.1298 | 324 |
| 4.8375 | 6.1047 | 325 |
| 4.8341 | 6.1111 | 326 |
| 4.8303 | 6.1144 | 327 |
| 4.8320 | 6.1271 | 328 |
| 4.8190 | 6.1221 | 329 |
| 4.8214 | 6.1342 | 330 |
| 4.8055 | 6.1497 | 331 |
| 4.8082 | 6.1288 | 332 |
| 4.7967 | 6.1218 | 333 |
| 4.7966 | 6.1433 | 334 |
| 4.7859 | 6.1117 | 335 |
| 4.7841 | 6.1447 | 336 |
| 4.7871 | 6.1406 | 337 |
| 4.7743 | 6.1606 | 338 |
| 4.7696 | 6.1391 | 339 |
| 4.7652 | 6.1216 | 340 |
| 4.7684 | 6.1420 | 341 |
| 4.7607 | 6.1365 | 342 |
| 4.7596 | 6.1462 | 343 |
| 4.7539 | 6.1352 | 344 |
| 4.7382 | 6.1507 | 345 |
| 4.7425 | 6.1461 | 346 |
| 4.7299 | 6.1556 | 347 |
| 4.7268 | 6.1298 | 348 |
| 4.7245 | 6.1589 | 349 |
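
Note that the validation loss reaches its minimum (≈6.026 at epoch 206) and drifts upward thereafter while the training loss keeps falling, which suggests the model overfits over the last ~140 epochs. If retraining, an early-stopping callback could capture the best checkpoint; a minimal Keras sketch follows (the callback and its patience value are illustrative assumptions, not part of the original run):

```python
# Illustrative sketch: halt training once validation loss stops improving
# and restore the best weights. Not part of the documented training setup.
import tensorflow as tf

early_stop = tf.keras.callbacks.EarlyStopping(
    monitor="val_loss",
    patience=20,                 # epochs to wait for an improvement
    restore_best_weights=True,
)
# model.fit(train_ds, validation_data=val_ds, epochs=350, callbacks=[early_stop])
```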

### Framework versions

- Transformers 4.33.3
- TensorFlow 2.13.0
- Datasets 2.14.5
- Tokenizers 0.13.3