theojolliffe
commited on
Commit
•
75a6348
1
Parent(s):
284bf7b
update model card README.md
Browse files
README.md
CHANGED
@@ -16,12 +16,12 @@ should probably proofread and complete it, then remove this comment. -->
|
|
16 |
|
17 |
This model is a fine-tuned version of [theojolliffe/bart-cnn-pubmed-arxiv-pubmed-arxiv](https://huggingface.co/theojolliffe/bart-cnn-pubmed-arxiv-pubmed-arxiv) on an unknown dataset.
|
18 |
It achieves the following results on the evaluation set:
|
19 |
-
- Loss: 0.
|
20 |
-
- Rouge1:
|
21 |
-
- Rouge2:
|
22 |
-
- Rougel:
|
23 |
-
- Rougelsum:
|
24 |
-
- Gen Len:
|
25 |
|
26 |
## Model description
|
27 |
|
@@ -46,28 +46,51 @@ The following hyperparameters were used during training:
|
|
46 |
- seed: 42
|
47 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
48 |
- lr_scheduler_type: linear
|
49 |
-
- num_epochs:
|
50 |
- mixed_precision_training: Native AMP
|
51 |
|
52 |
### Training results
|
53 |
|
54 |
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
|
55 |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:--------:|
|
56 |
-
| No log | 0.31 | 125 | 1.
|
57 |
-
| No log | 0.63 | 250 | 1.
|
58 |
-
| No log | 0.94 | 375 | 0.
|
59 |
-
| 1.
|
60 |
-
| 1.
|
61 |
-
| 1.
|
62 |
-
| 1.
|
63 |
-
| 0.
|
64 |
-
| 0.
|
65 |
-
| 0.
|
66 |
-
| 0.
|
67 |
-
| 0.
|
68 |
-
| 0.
|
69 |
-
| 0.
|
70 |
-
| 0.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
71 |
|
72 |
|
73 |
### Framework versions
|
|
|
16 |
|
17 |
This model is a fine-tuned version of [theojolliffe/bart-cnn-pubmed-arxiv-pubmed-arxiv](https://huggingface.co/theojolliffe/bart-cnn-pubmed-arxiv-pubmed-arxiv) on an unknown dataset.
|
18 |
It achieves the following results on the evaluation set:
|
19 |
+
- Loss: 0.8794
|
20 |
+
- Rouge1: 55.9136
|
21 |
+
- Rouge2: 40.6124
|
22 |
+
- Rougel: 43.8806
|
23 |
+
- Rougelsum: 54.2039
|
24 |
+
- Gen Len: 142.0
|
25 |
|
26 |
## Model description
|
27 |
|
|
|
46 |
- seed: 42
|
47 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
48 |
- lr_scheduler_type: linear
|
49 |
+
- num_epochs: 1000
|
50 |
- mixed_precision_training: Native AMP
|
51 |
|
52 |
### Training results
|
53 |
|
54 |
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
|
55 |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:--------:|
|
56 |
+
| No log | 0.31 | 125 | 1.2057 | 50.9436 | 30.6436 | 32.6348 | 48.0674 | 141.3519 |
|
57 |
+
| No log | 0.63 | 250 | 1.0933 | 52.0677 | 31.2561 | 32.8008 | 49.0282 | 141.9815 |
|
58 |
+
| No log | 0.94 | 375 | 0.9685 | 51.6623 | 32.148 | 34.0536 | 48.9779 | 141.5556 |
|
59 |
+
| 1.1594 | 1.26 | 500 | 0.9725 | 50.4646 | 30.6781 | 32.1995 | 47.3852 | 142.0 |
|
60 |
+
| 1.1594 | 1.57 | 625 | 0.9342 | 52.2146 | 32.2166 | 33.7256 | 49.2233 | 142.0 |
|
61 |
+
| 1.1594 | 1.88 | 750 | 0.8715 | 52.2443 | 33.66 | 36.0575 | 49.7769 | 141.6481 |
|
62 |
+
| 1.1594 | 2.2 | 875 | 0.8334 | 53.0976 | 33.9638 | 36.0616 | 50.7417 | 141.8889 |
|
63 |
+
| 0.6845 | 2.51 | 1000 | 0.8241 | 52.3152 | 32.8571 | 35.302 | 49.6273 | 142.0 |
|
64 |
+
| 0.6845 | 2.83 | 1125 | 0.7986 | 54.075 | 35.0318 | 37.4544 | 51.4955 | 142.0 |
|
65 |
+
| 0.6845 | 3.14 | 1250 | 0.8532 | 52.1242 | 32.5844 | 34.6821 | 49.6048 | 141.7037 |
|
66 |
+
| 0.6845 | 3.45 | 1375 | 0.8319 | 52.0714 | 32.8862 | 35.3255 | 49.3984 | 141.7593 |
|
67 |
+
| 0.4488 | 3.77 | 1500 | 0.8033 | 53.2189 | 34.7029 | 37.5627 | 50.8068 | 142.0 |
|
68 |
+
| 0.4488 | 4.08 | 1625 | 0.8322 | 53.1666 | 34.8916 | 37.733 | 50.9602 | 142.0 |
|
69 |
+
| 0.4488 | 4.4 | 1750 | 0.7985 | 51.8809 | 32.9926 | 36.3812 | 49.6592 | 142.0 |
|
70 |
+
| 0.4488 | 4.71 | 1875 | 0.8049 | 54.2959 | 36.648 | 39.2174 | 52.2153 | 141.8148 |
|
71 |
+
| 0.3017 | 5.03 | 2000 | 0.8148 | 53.1564 | 35.2561 | 38.4413 | 50.9793 | 141.7778 |
|
72 |
+
| 0.3017 | 5.34 | 2125 | 0.8153 | 53.5528 | 35.217 | 37.9034 | 51.3596 | 141.0741 |
|
73 |
+
| 0.3017 | 5.65 | 2250 | 0.8009 | 52.4906 | 34.9253 | 37.9829 | 50.3951 | 141.6111 |
|
74 |
+
| 0.3017 | 5.97 | 2375 | 0.7509 | 54.3645 | 37.5095 | 40.5725 | 52.1743 | 142.0 |
|
75 |
+
| 0.2052 | 6.28 | 2500 | 0.8019 | 54.5817 | 36.5587 | 40.0273 | 52.5349 | 142.0 |
|
76 |
+
| 0.2052 | 6.6 | 2625 | 0.8176 | 55.3618 | 38.556 | 41.5709 | 53.5806 | 142.0 |
|
77 |
+
| 0.2052 | 6.91 | 2750 | 0.7956 | 55.5057 | 38.0122 | 40.8857 | 53.1755 | 141.9815 |
|
78 |
+
| 0.2052 | 7.22 | 2875 | 0.7966 | 54.4586 | 37.4821 | 40.7638 | 52.4144 | 142.0 |
|
79 |
+
| 0.1465 | 7.54 | 3000 | 0.8311 | 54.3973 | 37.1016 | 40.2977 | 52.3982 | 142.0 |
|
80 |
+
| 0.1465 | 7.85 | 3125 | 0.8227 | 53.9072 | 36.5277 | 39.0963 | 51.9937 | 141.8889 |
|
81 |
+
| 0.1465 | 8.17 | 3250 | 0.7947 | 54.7043 | 38.5848 | 41.2942 | 52.8724 | 142.0 |
|
82 |
+
| 0.1465 | 8.48 | 3375 | 0.7954 | 54.5769 | 37.8265 | 40.6915 | 52.6429 | 141.9444 |
|
83 |
+
| 0.115 | 8.79 | 3500 | 0.8433 | 54.7883 | 38.0489 | 41.414 | 52.3718 | 142.0 |
|
84 |
+
| 0.115 | 9.11 | 3625 | 0.8416 | 56.5204 | 41.3216 | 44.451 | 54.7371 | 142.0 |
|
85 |
+
| 0.115 | 9.42 | 3750 | 0.8164 | 55.2908 | 39.0328 | 41.5761 | 53.4643 | 142.0 |
|
86 |
+
| 0.115 | 9.74 | 3875 | 0.8363 | 55.2659 | 39.4302 | 42.1691 | 53.7407 | 141.8889 |
|
87 |
+
| 0.0912 | 10.05 | 4000 | 0.8850 | 55.7855 | 40.6168 | 43.1968 | 54.2718 | 142.0 |
|
88 |
+
| 0.0912 | 10.36 | 4125 | 0.8268 | 56.1701 | 40.7518 | 42.987 | 54.1229 | 141.9259 |
|
89 |
+
| 0.0912 | 10.68 | 4250 | 0.8564 | 55.4179 | 39.6097 | 42.3691 | 53.4582 | 141.8889 |
|
90 |
+
| 0.0912 | 10.99 | 4375 | 0.8557 | 56.1136 | 41.4924 | 45.8591 | 54.6113 | 141.6667 |
|
91 |
+
| 0.0707 | 11.31 | 4500 | 0.8432 | 55.0109 | 39.3858 | 42.0807 | 53.4629 | 142.0 |
|
92 |
+
| 0.0707 | 11.62 | 4625 | 0.8377 | 54.3239 | 37.7401 | 40.4619 | 52.4602 | 142.0 |
|
93 |
+
| 0.0707 | 11.93 | 4750 | 0.8794 | 55.9136 | 40.6124 | 43.8806 | 54.2039 | 142.0 |
|
94 |
|
95 |
|
96 |
### Framework versions
|