theojolliffe commited on
Commit
75a6348
1 Parent(s): 284bf7b

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +45 -22
README.md CHANGED
@@ -16,12 +16,12 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [theojolliffe/bart-cnn-pubmed-arxiv-pubmed-arxiv](https://huggingface.co/theojolliffe/bart-cnn-pubmed-arxiv-pubmed-arxiv) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 0.8007
20
- - Rouge1: 52.4033
21
- - Rouge2: 34.5747
22
- - Rougel: 37.1754
23
- - Rougelsum: 50.116
24
- - Gen Len: 141.6481
25
 
26
  ## Model description
27
 
@@ -46,28 +46,51 @@ The following hyperparameters were used during training:
46
  - seed: 42
47
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
  - lr_scheduler_type: linear
49
- - num_epochs: 20
50
  - mixed_precision_training: Native AMP
51
 
52
  ### Training results
53
 
54
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
55
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:--------:|
56
- | No log | 0.31 | 125 | 1.2189 | 52.135 | 32.1909 | 33.2568 | 49.212 | 142.0 |
57
- | No log | 0.63 | 250 | 1.0848 | 51.8506 | 31.8383 | 34.0225 | 48.9427 | 141.8889 |
58
- | No log | 0.94 | 375 | 0.9838 | 52.0218 | 31.5571 | 32.7594 | 49.1574 | 142.0 |
59
- | 1.1603 | 1.26 | 500 | 0.9675 | 51.5844 | 31.8092 | 33.2469 | 48.5004 | 142.0 |
60
- | 1.1603 | 1.57 | 625 | 0.9470 | 52.3383 | 32.5252 | 34.4442 | 49.5981 | 142.0 |
61
- | 1.1603 | 1.88 | 750 | 0.8849 | 53.1715 | 34.2133 | 35.3615 | 50.857 | 141.8148 |
62
- | 1.1603 | 2.2 | 875 | 0.8490 | 53.5919 | 34.3111 | 36.5608 | 50.8721 | 141.6296 |
63
- | 0.688 | 2.51 | 1000 | 0.8434 | 52.5115 | 33.3104 | 35.8243 | 50.0625 | 142.0 |
64
- | 0.688 | 2.83 | 1125 | 0.8089 | 53.3029 | 33.258 | 35.3429 | 50.2641 | 141.963 |
65
- | 0.688 | 3.14 | 1250 | 0.8768 | 53.2829 | 33.6257 | 36.3661 | 50.5444 | 142.0 |
66
- | 0.688 | 3.45 | 1375 | 0.8256 | 53.5736 | 34.7489 | 36.4858 | 51.1342 | 141.8889 |
67
- | 0.4551 | 3.77 | 1500 | 0.7884 | 54.0105 | 35.051 | 37.4089 | 51.2838 | 141.8889 |
68
- | 0.4551 | 4.08 | 1625 | 0.8145 | 52.6526 | 34.173 | 37.4877 | 50.3849 | 141.0 |
69
- | 0.4551 | 4.4 | 1750 | 0.8358 | 54.8493 | 36.3011 | 38.7691 | 51.951 | 142.0 |
70
- | 0.4551 | 4.71 | 1875 | 0.8007 | 52.4033 | 34.5747 | 37.1754 | 50.116 | 141.6481 |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
71
 
72
 
73
  ### Framework versions
 
16
 
17
  This model is a fine-tuned version of [theojolliffe/bart-cnn-pubmed-arxiv-pubmed-arxiv](https://huggingface.co/theojolliffe/bart-cnn-pubmed-arxiv-pubmed-arxiv) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 0.8794
20
+ - Rouge1: 55.9136
21
+ - Rouge2: 40.6124
22
+ - Rougel: 43.8806
23
+ - Rougelsum: 54.2039
24
+ - Gen Len: 142.0
25
 
26
  ## Model description
27
 
 
46
  - seed: 42
47
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
  - lr_scheduler_type: linear
49
+ - num_epochs: 1000
50
  - mixed_precision_training: Native AMP
51
 
52
  ### Training results
53
 
54
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
55
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:--------:|
56
+ | No log | 0.31 | 125 | 1.2057 | 50.9436 | 30.6436 | 32.6348 | 48.0674 | 141.3519 |
57
+ | No log | 0.63 | 250 | 1.0933 | 52.0677 | 31.2561 | 32.8008 | 49.0282 | 141.9815 |
58
+ | No log | 0.94 | 375 | 0.9685 | 51.6623 | 32.148 | 34.0536 | 48.9779 | 141.5556 |
59
+ | 1.1594 | 1.26 | 500 | 0.9725 | 50.4646 | 30.6781 | 32.1995 | 47.3852 | 142.0 |
60
+ | 1.1594 | 1.57 | 625 | 0.9342 | 52.2146 | 32.2166 | 33.7256 | 49.2233 | 142.0 |
61
+ | 1.1594 | 1.88 | 750 | 0.8715 | 52.2443 | 33.66 | 36.0575 | 49.7769 | 141.6481 |
62
+ | 1.1594 | 2.2 | 875 | 0.8334 | 53.0976 | 33.9638 | 36.0616 | 50.7417 | 141.8889 |
63
+ | 0.6845 | 2.51 | 1000 | 0.8241 | 52.3152 | 32.8571 | 35.302 | 49.6273 | 142.0 |
64
+ | 0.6845 | 2.83 | 1125 | 0.7986 | 54.075 | 35.0318 | 37.4544 | 51.4955 | 142.0 |
65
+ | 0.6845 | 3.14 | 1250 | 0.8532 | 52.1242 | 32.5844 | 34.6821 | 49.6048 | 141.7037 |
66
+ | 0.6845 | 3.45 | 1375 | 0.8319 | 52.0714 | 32.8862 | 35.3255 | 49.3984 | 141.7593 |
67
+ | 0.4488 | 3.77 | 1500 | 0.8033 | 53.2189 | 34.7029 | 37.5627 | 50.8068 | 142.0 |
68
+ | 0.4488 | 4.08 | 1625 | 0.8322 | 53.1666 | 34.8916 | 37.733 | 50.9602 | 142.0 |
69
+ | 0.4488 | 4.4 | 1750 | 0.7985 | 51.8809 | 32.9926 | 36.3812 | 49.6592 | 142.0 |
70
+ | 0.4488 | 4.71 | 1875 | 0.8049 | 54.2959 | 36.648 | 39.2174 | 52.2153 | 141.8148 |
71
+ | 0.3017 | 5.03 | 2000 | 0.8148 | 53.1564 | 35.2561 | 38.4413 | 50.9793 | 141.7778 |
72
+ | 0.3017 | 5.34 | 2125 | 0.8153 | 53.5528 | 35.217 | 37.9034 | 51.3596 | 141.0741 |
73
+ | 0.3017 | 5.65 | 2250 | 0.8009 | 52.4906 | 34.9253 | 37.9829 | 50.3951 | 141.6111 |
74
+ | 0.3017 | 5.97 | 2375 | 0.7509 | 54.3645 | 37.5095 | 40.5725 | 52.1743 | 142.0 |
75
+ | 0.2052 | 6.28 | 2500 | 0.8019 | 54.5817 | 36.5587 | 40.0273 | 52.5349 | 142.0 |
76
+ | 0.2052 | 6.6 | 2625 | 0.8176 | 55.3618 | 38.556 | 41.5709 | 53.5806 | 142.0 |
77
+ | 0.2052 | 6.91 | 2750 | 0.7956 | 55.5057 | 38.0122 | 40.8857 | 53.1755 | 141.9815 |
78
+ | 0.2052 | 7.22 | 2875 | 0.7966 | 54.4586 | 37.4821 | 40.7638 | 52.4144 | 142.0 |
79
+ | 0.1465 | 7.54 | 3000 | 0.8311 | 54.3973 | 37.1016 | 40.2977 | 52.3982 | 142.0 |
80
+ | 0.1465 | 7.85 | 3125 | 0.8227 | 53.9072 | 36.5277 | 39.0963 | 51.9937 | 141.8889 |
81
+ | 0.1465 | 8.17 | 3250 | 0.7947 | 54.7043 | 38.5848 | 41.2942 | 52.8724 | 142.0 |
82
+ | 0.1465 | 8.48 | 3375 | 0.7954 | 54.5769 | 37.8265 | 40.6915 | 52.6429 | 141.9444 |
83
+ | 0.115 | 8.79 | 3500 | 0.8433 | 54.7883 | 38.0489 | 41.414 | 52.3718 | 142.0 |
84
+ | 0.115 | 9.11 | 3625 | 0.8416 | 56.5204 | 41.3216 | 44.451 | 54.7371 | 142.0 |
85
+ | 0.115 | 9.42 | 3750 | 0.8164 | 55.2908 | 39.0328 | 41.5761 | 53.4643 | 142.0 |
86
+ | 0.115 | 9.74 | 3875 | 0.8363 | 55.2659 | 39.4302 | 42.1691 | 53.7407 | 141.8889 |
87
+ | 0.0912 | 10.05 | 4000 | 0.8850 | 55.7855 | 40.6168 | 43.1968 | 54.2718 | 142.0 |
88
+ | 0.0912 | 10.36 | 4125 | 0.8268 | 56.1701 | 40.7518 | 42.987 | 54.1229 | 141.9259 |
89
+ | 0.0912 | 10.68 | 4250 | 0.8564 | 55.4179 | 39.6097 | 42.3691 | 53.4582 | 141.8889 |
90
+ | 0.0912 | 10.99 | 4375 | 0.8557 | 56.1136 | 41.4924 | 45.8591 | 54.6113 | 141.6667 |
91
+ | 0.0707 | 11.31 | 4500 | 0.8432 | 55.0109 | 39.3858 | 42.0807 | 53.4629 | 142.0 |
92
+ | 0.0707 | 11.62 | 4625 | 0.8377 | 54.3239 | 37.7401 | 40.4619 | 52.4602 | 142.0 |
93
+ | 0.0707 | 11.93 | 4750 | 0.8794 | 55.9136 | 40.6124 | 43.8806 | 54.2039 | 142.0 |
94
 
95
 
96
  ### Framework versions