nandavikas16 commited on
Commit
321b64f
1 Parent(s): b13ea2b

Model save

Browse files
Files changed (2) hide show
  1. README.md +16 -16
  2. generation_config.json +1 -1
README.md CHANGED
@@ -17,11 +17,11 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [facebook/bart-large-cnn](https://huggingface.co/facebook/bart-large-cnn) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 0.1309
21
- - Rouge1: 52.6236
22
- - Rouge2: 39.8632
23
- - Rougel: 43.4607
24
- - Rougelsum: 43.3561
25
 
26
  ## Model description
27
 
@@ -52,21 +52,21 @@ The following hyperparameters were used during training:
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
54
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
55
- | 0.5104 | 1.0 | 34 | 0.2207 | 41.732 | 26.7717 | 31.2807 | 31.3611 |
56
- | 0.2181 | 2.0 | 68 | 0.2001 | 44.5268 | 30.0523 | 34.7912 | 35.0095 |
57
- | 0.1824 | 3.0 | 102 | 0.1995 | 45.4038 | 32.6808 | 36.3856 | 36.4004 |
58
- | 0.1851 | 4.0 | 136 | 0.1728 | 48.85 | 35.9202 | 39.2826 | 39.1813 |
59
- | 0.1692 | 5.0 | 170 | 0.1663 | 47.1374 | 34.5505 | 37.8192 | 37.8176 |
60
- | 0.164 | 6.0 | 204 | 0.1594 | 50.3895 | 37.8751 | 40.4181 | 40.3778 |
61
- | 0.1534 | 7.0 | 238 | 0.1526 | 50.7178 | 38.8207 | 41.5719 | 41.6111 |
62
- | 0.1421 | 8.0 | 272 | 0.1424 | 51.3382 | 38.6796 | 40.4545 | 40.3891 |
63
- | 0.1423 | 9.0 | 306 | 0.1354 | 53.8161 | 41.0736 | 45.1571 | 45.0427 |
64
- | 0.1336 | 10.0 | 340 | 0.1309 | 52.6236 | 39.8632 | 43.4607 | 43.3561 |
65
 
66
 
67
  ### Framework versions
68
 
69
- - Transformers 4.41.2
70
  - Pytorch 2.3.1+cu121
71
  - Datasets 2.20.0
72
  - Tokenizers 0.19.1
 
17
 
18
  This model is a fine-tuned version of [facebook/bart-large-cnn](https://huggingface.co/facebook/bart-large-cnn) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 0.1476
21
+ - Rouge1: 54.2211
22
+ - Rouge2: 40.3114
23
+ - Rougel: 43.3623
24
+ - Rougelsum: 43.4852
25
 
26
  ## Model description
27
 
 
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
54
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
55
+ | 0.6594 | 1.0 | 36 | 0.2550 | 42.7603 | 25.8385 | 30.1054 | 30.2185 |
56
+ | 0.26 | 2.0 | 72 | 0.2210 | 45.6198 | 28.3586 | 32.8142 | 32.9316 |
57
+ | 0.225 | 3.0 | 108 | 0.2101 | 49.9583 | 35.7905 | 36.2053 | 36.2483 |
58
+ | 0.2142 | 4.0 | 144 | 0.1959 | 49.3039 | 34.5635 | 37.9744 | 38.0919 |
59
+ | 0.2 | 5.0 | 180 | 0.1864 | 51.5603 | 36.3023 | 38.3249 | 38.5464 |
60
+ | 0.1974 | 6.0 | 216 | 0.1780 | 53.5739 | 37.9308 | 40.2562 | 40.3355 |
61
+ | 0.1843 | 7.0 | 252 | 0.1678 | 55.1556 | 39.5639 | 42.9197 | 43.0779 |
62
+ | 0.1788 | 8.0 | 288 | 0.1579 | 55.8788 | 41.1155 | 45.2609 | 45.4263 |
63
+ | 0.1596 | 9.0 | 324 | 0.1497 | 56.0775 | 41.672 | 44.8889 | 45.1741 |
64
+ | 0.1545 | 10.0 | 360 | 0.1476 | 54.2211 | 40.3114 | 43.3623 | 43.4852 |
65
 
66
 
67
  ### Framework versions
68
 
69
+ - Transformers 4.42.4
70
  - Pytorch 2.3.1+cu121
71
  - Datasets 2.20.0
72
  - Tokenizers 0.19.1
generation_config.json CHANGED
@@ -11,6 +11,6 @@
11
  "no_repeat_ngram_size": 3,
12
  "num_beams": 4,
13
  "pad_token_id": 1,
14
- "transformers_version": "4.41.2",
15
  "use_cache": false
16
  }
 
11
  "no_repeat_ngram_size": 3,
12
  "num_beams": 4,
13
  "pad_token_id": 1,
14
+ "transformers_version": "4.42.4",
15
  "use_cache": false
16
  }