metadata
license: mit
base_model: gpt2
tags:
  - generated_from_trainer
model-index:
  - name: k3-Entity-Relationship-GPT2
    results: []

k3-Entity-Relationship-GPT2

This model is a fine-tuned version of gpt2. The training dataset is not specified. After 500 epochs it achieves the following result on the evaluation set:

  • Loss: 0.0051

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.0003
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • gradient_accumulation_steps: 10
  • total_train_batch_size: 80
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: cosine
  • num_epochs: 500
  • mixed_precision_training: Native AMP
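The relationship between these values can be checked with a short sketch. It assumes a single device (8 × 10 = 80 is consistent with that, but the card does not say so), no warmup steps (none are listed), and 500 total optimizer steps as in the results table. The helper names `effective_batch_size` and `cosine_lr` are illustrative, not part of any library.

```python
import math


def effective_batch_size(per_device_batch, grad_accum_steps, num_devices=1):
    """Examples consumed per optimizer step (num_devices=1 is an assumption)."""
    return per_device_batch * grad_accum_steps * num_devices


def cosine_lr(step, total_steps, base_lr):
    """Half-cosine decay from base_lr to 0, assuming zero warmup steps."""
    progress = min(max(step / total_steps, 0.0), 1.0)
    return base_lr * 0.5 * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    # train_batch_size=8 and gradient_accumulation_steps=10 from the list above
    print("total_train_batch_size:", effective_batch_size(8, 10))
    # learning_rate=0.0003 decayed over the 500 steps shown in the results
    for step in (0, 125, 250, 375, 500):
        print(step, f"{cosine_lr(step, 500, 3e-4):.2e}")
```

Under these assumptions the learning rate is half its initial value at step 250 and reaches zero at step 500, which may help explain why the validation loss barely moves in the last hundred or so epochs.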

Training results

Training Loss Epoch Step Validation Loss
No log 1.0 1 0.2858
No log 2.0 2 0.1609
No log 3.0 3 0.1687
No log 4.0 4 0.1693
No log 5.0 5 0.1588
No log 6.0 6 0.1355
No log 7.0 7 0.1201
No log 8.0 8 0.1189
No log 9.0 9 0.1168
0.0164 10.0 10 0.1135
0.0164 11.0 11 0.1092
0.0164 12.0 12 0.1015
0.0164 13.0 13 0.0965
0.0164 14.0 14 0.0925
0.0164 15.0 15 0.0875
0.0164 16.0 16 0.0834
0.0164 17.0 17 0.0800
0.0164 18.0 18 0.0770
0.0164 19.0 19 0.0741
0.0097 20.0 20 0.0719
0.0097 21.0 21 0.0689
0.0097 22.0 22 0.0663
0.0097 23.0 23 0.0642
0.0097 24.0 24 0.0616
0.0097 25.0 25 0.0593
0.0097 26.0 26 0.0575
0.0097 27.0 27 0.0558
0.0097 28.0 28 0.0537
0.0097 29.0 29 0.0519
0.0065 30.0 30 0.0503
0.0065 31.0 31 0.0492
0.0065 32.0 32 0.0480
0.0065 33.0 33 0.0463
0.0065 34.0 34 0.0447
0.0065 35.0 35 0.0438
0.0065 36.0 36 0.0431
0.0065 37.0 37 0.0419
0.0065 38.0 38 0.0406
0.0065 39.0 39 0.0397
0.0048 40.0 40 0.0390
0.0048 41.0 41 0.0381
0.0048 42.0 42 0.0372
0.0048 43.0 43 0.0362
0.0048 44.0 44 0.0354
0.0048 45.0 45 0.0347
0.0048 46.0 46 0.0340
0.0048 47.0 47 0.0332
0.0048 48.0 48 0.0323
0.0048 49.0 49 0.0317
0.0037 50.0 50 0.0312
0.0037 51.0 51 0.0307
0.0037 52.0 52 0.0301
0.0037 53.0 53 0.0295
0.0037 54.0 54 0.0290
0.0037 55.0 55 0.0285
0.0037 56.0 56 0.0281
0.0037 57.0 57 0.0276
0.0037 58.0 58 0.0271
0.0037 59.0 59 0.0265
0.0031 60.0 60 0.0262
0.0031 61.0 61 0.0259
0.0031 62.0 62 0.0256
0.0031 63.0 63 0.0252
0.0031 64.0 64 0.0247
0.0031 65.0 65 0.0241
0.0031 66.0 66 0.0237
0.0031 67.0 67 0.0234
0.0031 68.0 68 0.0231
0.0031 69.0 69 0.0228
0.0026 70.0 70 0.0225
0.0026 71.0 71 0.0222
0.0026 72.0 72 0.0219
0.0026 73.0 73 0.0217
0.0026 74.0 74 0.0213
0.0026 75.0 75 0.0210
0.0026 76.0 76 0.0207
0.0026 77.0 77 0.0205
0.0026 78.0 78 0.0202
0.0026 79.0 79 0.0200
0.0023 80.0 80 0.0197
0.0023 81.0 81 0.0195
0.0023 82.0 82 0.0192
0.0023 83.0 83 0.0190
0.0023 84.0 84 0.0188
0.0023 85.0 85 0.0186
0.0023 86.0 86 0.0184
0.0023 87.0 87 0.0181
0.0023 88.0 88 0.0179
0.0023 89.0 89 0.0177
0.002 90.0 90 0.0175
0.002 91.0 91 0.0174
0.002 92.0 92 0.0173
0.002 93.0 93 0.0171
0.002 94.0 94 0.0169
0.002 95.0 95 0.0167
0.002 96.0 96 0.0166
0.002 97.0 97 0.0164
0.002 98.0 98 0.0162
0.002 99.0 99 0.0161
0.0018 100.0 100 0.0159
0.0018 101.0 101 0.0158
0.0018 102.0 102 0.0156
0.0018 103.0 103 0.0155
0.0018 104.0 104 0.0153
0.0018 105.0 105 0.0151
0.0018 106.0 106 0.0149
0.0018 107.0 107 0.0148
0.0018 108.0 108 0.0146
0.0018 109.0 109 0.0145
0.0016 110.0 110 0.0144
0.0016 111.0 111 0.0142
0.0016 112.0 112 0.0141
0.0016 113.0 113 0.0140
0.0016 114.0 114 0.0139
0.0016 115.0 115 0.0138
0.0016 116.0 116 0.0137
0.0016 117.0 117 0.0135
0.0016 118.0 118 0.0135
0.0016 119.0 119 0.0133
0.0015 120.0 120 0.0132
0.0015 121.0 121 0.0131
0.0015 122.0 122 0.0130
0.0015 123.0 123 0.0129
0.0015 124.0 124 0.0128
0.0015 125.0 125 0.0127
0.0015 126.0 126 0.0126
0.0015 127.0 127 0.0125
0.0015 128.0 128 0.0124
0.0015 129.0 129 0.0122
0.0014 130.0 130 0.0121
0.0014 131.0 131 0.0121
0.0014 132.0 132 0.0121
0.0014 133.0 133 0.0120
0.0014 134.0 134 0.0119
0.0014 135.0 135 0.0118
0.0014 136.0 136 0.0117
0.0014 137.0 137 0.0116
0.0014 138.0 138 0.0115
0.0014 139.0 139 0.0114
0.0013 140.0 140 0.0113
0.0013 141.0 141 0.0112
0.0013 142.0 142 0.0111
0.0013 143.0 143 0.0111
0.0013 144.0 144 0.0110
0.0013 145.0 145 0.0109
0.0013 146.0 146 0.0109
0.0013 147.0 147 0.0108
0.0013 148.0 148 0.0107
0.0013 149.0 149 0.0107
0.0012 150.0 150 0.0106
0.0012 151.0 151 0.0106
0.0012 152.0 152 0.0105
0.0012 153.0 153 0.0104
0.0012 154.0 154 0.0103
0.0012 155.0 155 0.0102
0.0012 156.0 156 0.0102
0.0012 157.0 157 0.0101
0.0012 158.0 158 0.0101
0.0012 159.0 159 0.0100
0.0011 160.0 160 0.0100
0.0011 161.0 161 0.0099
0.0011 162.0 162 0.0099
0.0011 163.0 163 0.0098
0.0011 164.0 164 0.0097
0.0011 165.0 165 0.0097
0.0011 166.0 166 0.0096
0.0011 167.0 167 0.0096
0.0011 168.0 168 0.0095
0.0011 169.0 169 0.0094
0.001 170.0 170 0.0094
0.001 171.0 171 0.0093
0.001 172.0 172 0.0092
0.001 173.0 173 0.0092
0.001 174.0 174 0.0091
0.001 175.0 175 0.0091
0.001 176.0 176 0.0090
0.001 177.0 177 0.0090
0.001 178.0 178 0.0090
0.001 179.0 179 0.0089
0.001 180.0 180 0.0089
0.001 181.0 181 0.0088
0.001 182.0 182 0.0088
0.001 183.0 183 0.0087
0.001 184.0 184 0.0087
0.001 185.0 185 0.0087
0.001 186.0 186 0.0086
0.001 187.0 187 0.0086
0.001 188.0 188 0.0086
0.001 189.0 189 0.0085
0.0009 190.0 190 0.0085
0.0009 191.0 191 0.0084
0.0009 192.0 192 0.0084
0.0009 193.0 193 0.0083
0.0009 194.0 194 0.0083
0.0009 195.0 195 0.0082
0.0009 196.0 196 0.0082
0.0009 197.0 197 0.0082
0.0009 198.0 198 0.0081
0.0009 199.0 199 0.0081
0.0009 200.0 200 0.0081
0.0009 201.0 201 0.0080
0.0009 202.0 202 0.0080
0.0009 203.0 203 0.0080
0.0009 204.0 204 0.0079
0.0009 205.0 205 0.0079
0.0009 206.0 206 0.0079
0.0009 207.0 207 0.0078
0.0009 208.0 208 0.0078
0.0009 209.0 209 0.0077
0.0009 210.0 210 0.0077
0.0009 211.0 211 0.0077
0.0009 212.0 212 0.0077
0.0009 213.0 213 0.0076
0.0009 214.0 214 0.0076
0.0009 215.0 215 0.0075
0.0009 216.0 216 0.0075
0.0009 217.0 217 0.0075
0.0009 218.0 218 0.0075
0.0009 219.0 219 0.0075
0.0008 220.0 220 0.0074
0.0008 221.0 221 0.0074
0.0008 222.0 222 0.0074
0.0008 223.0 223 0.0073
0.0008 224.0 224 0.0073
0.0008 225.0 225 0.0073
0.0008 226.0 226 0.0073
0.0008 227.0 227 0.0072
0.0008 228.0 228 0.0072
0.0008 229.0 229 0.0072
0.0008 230.0 230 0.0071
0.0008 231.0 231 0.0071
0.0008 232.0 232 0.0071
0.0008 233.0 233 0.0071
0.0008 234.0 234 0.0070
0.0008 235.0 235 0.0070
0.0008 236.0 236 0.0070
0.0008 237.0 237 0.0070
0.0008 238.0 238 0.0069
0.0008 239.0 239 0.0069
0.0008 240.0 240 0.0069
0.0008 241.0 241 0.0069
0.0008 242.0 242 0.0069
0.0008 243.0 243 0.0069
0.0008 244.0 244 0.0069
0.0008 245.0 245 0.0068
0.0008 246.0 246 0.0068
0.0008 247.0 247 0.0068
0.0008 248.0 248 0.0068
0.0008 249.0 249 0.0067
0.0007 250.0 250 0.0067
0.0007 251.0 251 0.0067
0.0007 252.0 252 0.0067
0.0007 253.0 253 0.0067
0.0007 254.0 254 0.0066
0.0007 255.0 255 0.0066
0.0007 256.0 256 0.0066
0.0007 257.0 257 0.0066
0.0007 258.0 258 0.0065
0.0007 259.0 259 0.0065
0.0007 260.0 260 0.0065
0.0007 261.0 261 0.0065
0.0007 262.0 262 0.0065
0.0007 263.0 263 0.0065
0.0007 264.0 264 0.0064
0.0007 265.0 265 0.0064
0.0007 266.0 266 0.0064
0.0007 267.0 267 0.0064
0.0007 268.0 268 0.0064
0.0007 269.0 269 0.0064
0.0007 270.0 270 0.0064
0.0007 271.0 271 0.0063
0.0007 272.0 272 0.0063
0.0007 273.0 273 0.0063
0.0007 274.0 274 0.0063
0.0007 275.0 275 0.0063
0.0007 276.0 276 0.0063
0.0007 277.0 277 0.0063
0.0007 278.0 278 0.0063
0.0007 279.0 279 0.0062
0.0007 280.0 280 0.0062
0.0007 281.0 281 0.0062
0.0007 282.0 282 0.0062
0.0007 283.0 283 0.0061
0.0007 284.0 284 0.0061
0.0007 285.0 285 0.0061
0.0007 286.0 286 0.0061
0.0007 287.0 287 0.0060
0.0007 288.0 288 0.0060
0.0007 289.0 289 0.0060
0.0007 290.0 290 0.0060
0.0007 291.0 291 0.0060
0.0007 292.0 292 0.0060
0.0007 293.0 293 0.0060
0.0007 294.0 294 0.0060
0.0007 295.0 295 0.0060
0.0007 296.0 296 0.0060
0.0007 297.0 297 0.0059
0.0007 298.0 298 0.0059
0.0007 299.0 299 0.0059
0.0006 300.0 300 0.0059
0.0006 301.0 301 0.0059
0.0006 302.0 302 0.0059
0.0006 303.0 303 0.0059
0.0006 304.0 304 0.0059
0.0006 305.0 305 0.0058
0.0006 306.0 306 0.0058
0.0006 307.0 307 0.0058
0.0006 308.0 308 0.0058
0.0006 309.0 309 0.0058
0.0006 310.0 310 0.0058
0.0006 311.0 311 0.0058
0.0006 312.0 312 0.0058
0.0006 313.0 313 0.0058
0.0006 314.0 314 0.0058
0.0006 315.0 315 0.0057
0.0006 316.0 316 0.0057
0.0006 317.0 317 0.0057
0.0006 318.0 318 0.0057
0.0006 319.0 319 0.0057
0.0006 320.0 320 0.0057
0.0006 321.0 321 0.0057
0.0006 322.0 322 0.0057
0.0006 323.0 323 0.0057
0.0006 324.0 324 0.0057
0.0006 325.0 325 0.0056
0.0006 326.0 326 0.0056
0.0006 327.0 327 0.0056
0.0006 328.0 328 0.0056
0.0006 329.0 329 0.0056
0.0006 330.0 330 0.0056
0.0006 331.0 331 0.0056
0.0006 332.0 332 0.0056
0.0006 333.0 333 0.0056
0.0006 334.0 334 0.0056
0.0006 335.0 335 0.0056
0.0006 336.0 336 0.0056
0.0006 337.0 337 0.0055
0.0006 338.0 338 0.0055
0.0006 339.0 339 0.0055
0.0006 340.0 340 0.0055
0.0006 341.0 341 0.0055
0.0006 342.0 342 0.0055
0.0006 343.0 343 0.0055
0.0006 344.0 344 0.0055
0.0006 345.0 345 0.0055
0.0006 346.0 346 0.0055
0.0006 347.0 347 0.0055
0.0006 348.0 348 0.0055
0.0006 349.0 349 0.0054
0.0006 350.0 350 0.0054
0.0006 351.0 351 0.0054
0.0006 352.0 352 0.0054
0.0006 353.0 353 0.0054
0.0006 354.0 354 0.0054
0.0006 355.0 355 0.0054
0.0006 356.0 356 0.0054
0.0006 357.0 357 0.0054
0.0006 358.0 358 0.0054
0.0006 359.0 359 0.0054
0.0006 360.0 360 0.0054
0.0006 361.0 361 0.0054
0.0006 362.0 362 0.0054
0.0006 363.0 363 0.0054
0.0006 364.0 364 0.0054
0.0006 365.0 365 0.0054
0.0006 366.0 366 0.0054
0.0006 367.0 367 0.0054
0.0006 368.0 368 0.0053
0.0006 369.0 369 0.0053
0.0006 370.0 370 0.0053
0.0006 371.0 371 0.0053
0.0006 372.0 372 0.0053
0.0006 373.0 373 0.0053
0.0006 374.0 374 0.0053
0.0006 375.0 375 0.0053
0.0006 376.0 376 0.0053
0.0006 377.0 377 0.0053
0.0006 378.0 378 0.0053
0.0006 379.0 379 0.0053
0.0006 380.0 380 0.0053
0.0006 381.0 381 0.0053
0.0006 382.0 382 0.0053
0.0006 383.0 383 0.0053
0.0006 384.0 384 0.0053
0.0006 385.0 385 0.0053
0.0006 386.0 386 0.0052
0.0006 387.0 387 0.0052
0.0006 388.0 388 0.0052
0.0006 389.0 389 0.0052
0.0006 390.0 390 0.0052
0.0006 391.0 391 0.0052
0.0006 392.0 392 0.0052
0.0006 393.0 393 0.0052
0.0006 394.0 394 0.0052
0.0006 395.0 395 0.0052
0.0006 396.0 396 0.0052
0.0006 397.0 397 0.0052
0.0006 398.0 398 0.0052
0.0006 399.0 399 0.0052
0.0006 400.0 400 0.0052
0.0006 401.0 401 0.0052
0.0006 402.0 402 0.0052
0.0006 403.0 403 0.0052
0.0006 404.0 404 0.0052
0.0006 405.0 405 0.0052
0.0006 406.0 406 0.0052
0.0006 407.0 407 0.0052
0.0006 408.0 408 0.0052
0.0006 409.0 409 0.0052
0.0006 410.0 410 0.0052
0.0006 411.0 411 0.0052
0.0006 412.0 412 0.0052
0.0006 413.0 413 0.0052
0.0006 414.0 414 0.0052
0.0006 415.0 415 0.0052
0.0006 416.0 416 0.0052
0.0006 417.0 417 0.0052
0.0006 418.0 418 0.0052
0.0006 419.0 419 0.0052
0.0006 420.0 420 0.0052
0.0006 421.0 421 0.0052
0.0006 422.0 422 0.0052
0.0006 423.0 423 0.0051
0.0006 424.0 424 0.0052
0.0006 425.0 425 0.0051
0.0006 426.0 426 0.0051
0.0006 427.0 427 0.0051
0.0006 428.0 428 0.0051
0.0006 429.0 429 0.0051
0.0006 430.0 430 0.0051
0.0006 431.0 431 0.0051
0.0006 432.0 432 0.0051
0.0006 433.0 433 0.0051
0.0006 434.0 434 0.0051
0.0006 435.0 435 0.0051
0.0006 436.0 436 0.0051
0.0006 437.0 437 0.0051
0.0006 438.0 438 0.0051
0.0006 439.0 439 0.0051
0.0006 440.0 440 0.0051
0.0006 441.0 441 0.0051
0.0006 442.0 442 0.0051
0.0006 443.0 443 0.0051
0.0006 444.0 444 0.0051
0.0006 445.0 445 0.0051
0.0006 446.0 446 0.0051
0.0006 447.0 447 0.0051
0.0006 448.0 448 0.0051
0.0006 449.0 449 0.0051
0.0006 450.0 450 0.0051
0.0006 451.0 451 0.0051
0.0006 452.0 452 0.0051
0.0006 453.0 453 0.0051
0.0006 454.0 454 0.0051
0.0006 455.0 455 0.0051
0.0006 456.0 456 0.0051
0.0006 457.0 457 0.0051
0.0006 458.0 458 0.0051
0.0006 459.0 459 0.0051
0.0006 460.0 460 0.0051
0.0006 461.0 461 0.0051
0.0006 462.0 462 0.0051
0.0006 463.0 463 0.0051
0.0006 464.0 464 0.0051
0.0006 465.0 465 0.0051
0.0006 466.0 466 0.0051
0.0006 467.0 467 0.0051
0.0006 468.0 468 0.0051
0.0006 469.0 469 0.0051
0.0006 470.0 470 0.0051
0.0006 471.0 471 0.0051
0.0006 472.0 472 0.0051
0.0006 473.0 473 0.0051
0.0006 474.0 474 0.0051
0.0006 475.0 475 0.0051
0.0006 476.0 476 0.0051
0.0006 477.0 477 0.0051
0.0006 478.0 478 0.0051
0.0006 479.0 479 0.0051
0.0006 480.0 480 0.0051
0.0006 481.0 481 0.0051
0.0006 482.0 482 0.0051
0.0006 483.0 483 0.0051
0.0006 484.0 484 0.0051
0.0006 485.0 485 0.0051
0.0006 486.0 486 0.0051
0.0006 487.0 487 0.0051
0.0006 488.0 488 0.0051
0.0006 489.0 489 0.0051
0.0006 490.0 490 0.0051
0.0006 491.0 491 0.0051
0.0006 492.0 492 0.0051
0.0006 493.0 493 0.0051
0.0006 494.0 494 0.0051
0.0006 495.0 495 0.0051
0.0006 496.0 496 0.0051
0.0006 497.0 497 0.0051
0.0006 498.0 498 0.0051
0.0006 499.0 499 0.0051
0.0006 500.0 500 0.0051
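
The rows above can be read programmatically with a small parser. This is a minimal sketch, assuming each row is whitespace-separated with the last three fields being epoch, step, and validation loss; `parse_row` and `first_epoch_at_or_below` are hypothetical helper names, and `SAMPLE_ROWS` is a handful of rows copied from the table rather than the full log. The "No log" entries and the training loss changing every 10 steps would be consistent with a logging interval of 10 steps, but that is an inference from the table, not something the card states.

```python
def parse_row(line):
    """Split one flattened table row into typed fields."""
    # Split from the right so the two-word "No log" stays in one piece.
    train_loss, epoch, step, val_loss = line.rsplit(None, 3)
    return {
        "train_loss": None if train_loss == "No log" else float(train_loss),
        "epoch": float(epoch),
        "step": int(step),
        "val_loss": float(val_loss),
    }


def first_epoch_at_or_below(rows, threshold):
    """Return the first epoch whose validation loss is <= threshold, else None."""
    for row in rows:
        if row["val_loss"] <= threshold:
            return row["epoch"]
    return None


# A few rows copied verbatim from the table above.
SAMPLE_ROWS = [
    "No log 1.0 1 0.2858",
    "0.0164 10.0 10 0.1135",
    "0.0006 422.0 422 0.0052",
    "0.0006 423.0 423 0.0051",
    "0.0006 424.0 424 0.0052",
    "0.0006 500.0 500 0.0051",
]

rows = [parse_row(line) for line in SAMPLE_ROWS]

if __name__ == "__main__":
    print(first_epoch_at_or_below(rows, 0.0051))
```

On these sample rows the reported final loss of 0.0051 is first reached at epoch 423, so the remaining epochs bring no improvement visible at four decimal places.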

Framework versions

  • Transformers 4.41.2
  • Pytorch 2.3.0+cu121
  • Datasets 2.20.0
  • Tokenizers 0.19.1