bart-cnn-pubmed-arxiv-pubmed-v3-e64
This model is a fine-tuned version of theojolliffe/bart-cnn-pubmed-arxiv-pubmed on an unknown dataset. It achieves the following results on the evaluation set:
- Loss: 1.0630
- Rouge1: 58.7
- Rouge2: 47.8042
- Rougel: 50.6967
- Rougelsum: 57.5543
- Gen Len: 142.0
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 2
- eval_batch_size: 2
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 64
- mixed_precision_training: Native AMP
Training results
Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
---|---|---|---|---|---|---|---|---|
No log | 1.0 | 398 | 0.9499 | 53.8396 | 34.0954 | 35.6734 | 51.3453 | 142.0 |
1.1219 | 2.0 | 796 | 0.8223 | 53.0414 | 33.3193 | 35.7448 | 50.1675 | 142.0 |
0.6681 | 3.0 | 1194 | 0.7689 | 53.6684 | 35.3651 | 37.7087 | 51.1441 | 142.0 |
0.4393 | 4.0 | 1592 | 0.7694 | 53.9066 | 35.3925 | 38.8917 | 51.6172 | 142.0 |
0.4393 | 5.0 | 1990 | 0.7597 | 54.0746 | 36.1026 | 39.1318 | 51.9272 | 142.0 |
0.2947 | 6.0 | 2388 | 0.8284 | 53.1168 | 34.7428 | 38.0573 | 50.9563 | 142.0 |
0.2016 | 7.0 | 2786 | 0.7951 | 55.7222 | 39.0458 | 42.5265 | 53.5359 | 142.0 |
0.1422 | 8.0 | 3184 | 0.7793 | 56.2376 | 40.3348 | 43.435 | 54.3228 | 142.0 |
0.1096 | 9.0 | 3582 | 0.8260 | 55.0372 | 39.0552 | 42.5403 | 53.0694 | 142.0 |
0.1096 | 10.0 | 3980 | 0.8397 | 53.849 | 37.519 | 40.674 | 52.1357 | 141.7037 |
0.0881 | 11.0 | 4378 | 0.8504 | 56.4835 | 41.0484 | 44.9407 | 54.3557 | 142.0 |
0.0693 | 12.0 | 4776 | 0.8285 | 55.7705 | 39.8585 | 43.722 | 53.7607 | 142.0 |
0.0572 | 13.0 | 5174 | 0.8327 | 57.932 | 43.5378 | 46.8233 | 55.8739 | 142.0 |
0.0461 | 14.0 | 5572 | 0.8720 | 57.6733 | 42.9742 | 45.8698 | 56.018 | 142.0 |
0.0461 | 15.0 | 5970 | 0.8723 | 57.6072 | 42.6946 | 45.2551 | 55.8486 | 142.0 |
0.0416 | 16.0 | 6368 | 0.8764 | 57.1973 | 43.1931 | 46.4492 | 55.3842 | 142.0 |
0.0343 | 17.0 | 6766 | 0.8638 | 57.4474 | 43.3544 | 46.3026 | 55.7863 | 142.0 |
0.03 | 18.0 | 7164 | 0.9234 | 57.9166 | 43.8551 | 46.6473 | 56.3895 | 142.0 |
0.0252 | 19.0 | 7562 | 0.9393 | 58.2908 | 45.2321 | 47.1398 | 56.6618 | 142.0 |
0.0252 | 20.0 | 7960 | 0.8966 | 59.2798 | 46.381 | 49.3514 | 57.6061 | 142.0 |
0.024 | 21.0 | 8358 | 0.9056 | 57.8409 | 44.2048 | 47.3329 | 56.2568 | 142.0 |
0.0195 | 22.0 | 8756 | 0.9424 | 57.551 | 44.6847 | 47.2771 | 56.2391 | 142.0 |
0.0182 | 23.0 | 9154 | 0.9361 | 59.1078 | 46.4704 | 49.4178 | 57.6796 | 142.0 |
0.0169 | 24.0 | 9552 | 0.9456 | 56.7966 | 43.3135 | 46.4208 | 55.4646 | 142.0 |
0.0169 | 25.0 | 9950 | 0.9867 | 59.5561 | 47.4638 | 50.0725 | 58.2388 | 141.8519 |
0.0147 | 26.0 | 10348 | 0.9727 | 58.2574 | 44.9904 | 47.2701 | 56.4274 | 142.0 |
0.0125 | 27.0 | 10746 | 0.9589 | 58.6792 | 45.8465 | 48.0781 | 57.0755 | 142.0 |
0.0117 | 28.0 | 11144 | 0.9635 | 59.1118 | 46.6614 | 50.0552 | 57.6153 | 142.0 |
0.0103 | 29.0 | 11542 | 0.9623 | 58.2517 | 45.6401 | 48.5888 | 56.7733 | 142.0 |
0.0103 | 30.0 | 11940 | 0.9752 | 59.0707 | 47.203 | 49.7992 | 57.6216 | 142.0 |
0.0096 | 31.0 | 12338 | 0.9610 | 57.6781 | 44.0504 | 47.6718 | 56.1201 | 142.0 |
0.0089 | 32.0 | 12736 | 0.9705 | 58.5592 | 45.7397 | 48.681 | 57.0302 | 142.0 |
0.008 | 33.0 | 13134 | 0.9989 | 58.1997 | 45.6345 | 48.2551 | 56.8571 | 141.7778 |
0.0075 | 34.0 | 13532 | 0.9880 | 57.9632 | 44.7845 | 47.8763 | 56.3979 | 142.0 |
0.0075 | 35.0 | 13930 | 1.0041 | 58.1316 | 46.2737 | 49.5986 | 56.8263 | 142.0 |
0.0061 | 36.0 | 14328 | 0.9923 | 58.4686 | 46.1735 | 49.1299 | 57.0331 | 142.0 |
0.0066 | 37.0 | 14726 | 1.0157 | 58.4277 | 45.6559 | 49.1739 | 56.8198 | 141.6481 |
0.0052 | 38.0 | 15124 | 1.0220 | 58.5166 | 46.3883 | 50.0964 | 57.0104 | 142.0 |
0.0049 | 39.0 | 15522 | 0.9949 | 59.3697 | 47.0609 | 50.2733 | 58.1388 | 142.0 |
0.0049 | 40.0 | 15920 | 1.0368 | 59.9537 | 48.4059 | 51.8185 | 58.8002 | 142.0 |
0.0039 | 41.0 | 16318 | 1.0228 | 58.2093 | 46.4807 | 49.54 | 56.9994 | 142.0 |
0.0041 | 42.0 | 16716 | 1.0218 | 57.6376 | 45.4951 | 49.003 | 56.4606 | 142.0 |
0.0035 | 43.0 | 17114 | 1.0381 | 57.2845 | 43.9593 | 46.779 | 55.6106 | 142.0 |
0.0059 | 44.0 | 17512 | 1.0316 | 58.5506 | 46.2111 | 49.4844 | 56.9506 | 142.0 |
0.0059 | 45.0 | 17910 | 1.0388 | 58.8383 | 47.6053 | 50.6187 | 57.7125 | 142.0 |
0.0028 | 46.0 | 18308 | 1.0068 | 59.3198 | 47.6888 | 50.2478 | 58.0 | 142.0 |
0.0028 | 47.0 | 18706 | 1.0446 | 58.8938 | 46.7524 | 49.5642 | 57.3659 | 142.0 |
0.0022 | 48.0 | 19104 | 1.0347 | 59.8253 | 48.3871 | 51.3949 | 58.5652 | 142.0 |
0.0024 | 49.0 | 19502 | 1.0294 | 60.655 | 50.2339 | 53.1662 | 59.3333 | 142.0 |
0.0024 | 50.0 | 19900 | 1.0225 | 58.5131 | 47.3009 | 50.1642 | 57.2287 | 142.0 |
0.0022 | 51.0 | 20298 | 1.0320 | 59.6101 | 47.4104 | 50.5291 | 58.075 | 142.0 |
0.0018 | 52.0 | 20696 | 1.0507 | 58.7957 | 46.8893 | 50.2996 | 57.3662 | 142.0 |
0.0015 | 53.0 | 21094 | 1.0599 | 58.9064 | 47.9433 | 51.3082 | 57.6871 | 142.0 |
0.0015 | 54.0 | 21492 | 1.0636 | 59.6607 | 48.5737 | 51.2361 | 58.333 | 142.0 |
0.0013 | 55.0 | 21890 | 1.0452 | 58.7026 | 46.5286 | 49.9672 | 57.2521 | 142.0 |
0.0012 | 56.0 | 22288 | 1.0418 | 58.9452 | 47.7209 | 50.657 | 57.7103 | 142.0 |
0.0011 | 57.0 | 22686 | 1.0578 | 58.485 | 46.0691 | 49.811 | 57.2591 | 142.0 |
0.0009 | 58.0 | 23084 | 1.0561 | 59.2268 | 48.1987 | 50.1948 | 57.8871 | 142.0 |
0.0009 | 59.0 | 23482 | 1.0548 | 59.6307 | 48.1778 | 50.9934 | 58.2098 | 142.0 |
0.0009 | 60.0 | 23880 | 1.0498 | 59.5054 | 48.8866 | 51.5977 | 58.1868 | 142.0 |
0.0008 | 61.0 | 24278 | 1.0583 | 60.0232 | 49.2518 | 52.2297 | 58.6774 | 142.0 |
0.0007 | 62.0 | 24676 | 1.0659 | 59.1755 | 48.4144 | 51.5157 | 58.0416 | 142.0 |
0.0007 | 63.0 | 25074 | 1.0622 | 59.1023 | 47.74 | 50.5188 | 57.9707 | 142.0 |
0.0007 | 64.0 | 25472 | 1.0630 | 58.7 | 47.8042 | 50.6967 | 57.5543 | 142.0 |
Framework versions
- Transformers 4.18.0
- Pytorch 1.11.0+cu113
- Datasets 2.1.0
- Tokenizers 0.12.1
- Downloads last month
- 15
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.