Commit
•
321b64f
1
Parent(s):
b13ea2b
Model save
Browse files- README.md +16 -16
- generation_config.json +1 -1
README.md
CHANGED
@@ -17,11 +17,11 @@ should probably proofread and complete it, then remove this comment. -->
|
|
17 |
|
18 |
This model is a fine-tuned version of [facebook/bart-large-cnn](https://huggingface.co/facebook/bart-large-cnn) on an unknown dataset.
|
19 |
It achieves the following results on the evaluation set:
|
20 |
-
- Loss: 0.
|
21 |
-
- Rouge1:
|
22 |
-
- Rouge2:
|
23 |
-
- Rougel: 43.
|
24 |
-
- Rougelsum: 43.
|
25 |
|
26 |
## Model description
|
27 |
|
@@ -52,21 +52,21 @@ The following hyperparameters were used during training:
|
|
52 |
|
53 |
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
|
54 |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
|
55 |
-
| 0.
|
56 |
-
| 0.
|
57 |
-
| 0.
|
58 |
-
| 0.
|
59 |
-
| 0.
|
60 |
-
| 0.
|
61 |
-
| 0.
|
62 |
-
| 0.
|
63 |
-
| 0.
|
64 |
-
| 0.
|
65 |
|
66 |
|
67 |
### Framework versions
|
68 |
|
69 |
-
- Transformers 4.
|
70 |
- Pytorch 2.3.1+cu121
|
71 |
- Datasets 2.20.0
|
72 |
- Tokenizers 0.19.1
|
|
|
17 |
|
18 |
This model is a fine-tuned version of [facebook/bart-large-cnn](https://huggingface.co/facebook/bart-large-cnn) on an unknown dataset.
|
19 |
It achieves the following results on the evaluation set:
|
20 |
+
- Loss: 0.1476
|
21 |
+
- Rouge1: 54.2211
|
22 |
+
- Rouge2: 40.3114
|
23 |
+
- Rougel: 43.3623
|
24 |
+
- Rougelsum: 43.4852
|
25 |
|
26 |
## Model description
|
27 |
|
|
|
52 |
|
53 |
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
|
54 |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
|
55 |
+
| 0.6594 | 1.0 | 36 | 0.2550 | 42.7603 | 25.8385 | 30.1054 | 30.2185 |
|
56 |
+
| 0.26 | 2.0 | 72 | 0.2210 | 45.6198 | 28.3586 | 32.8142 | 32.9316 |
|
57 |
+
| 0.225 | 3.0 | 108 | 0.2101 | 49.9583 | 35.7905 | 36.2053 | 36.2483 |
|
58 |
+
| 0.2142 | 4.0 | 144 | 0.1959 | 49.3039 | 34.5635 | 37.9744 | 38.0919 |
|
59 |
+
| 0.2 | 5.0 | 180 | 0.1864 | 51.5603 | 36.3023 | 38.3249 | 38.5464 |
|
60 |
+
| 0.1974 | 6.0 | 216 | 0.1780 | 53.5739 | 37.9308 | 40.2562 | 40.3355 |
|
61 |
+
| 0.1843 | 7.0 | 252 | 0.1678 | 55.1556 | 39.5639 | 42.9197 | 43.0779 |
|
62 |
+
| 0.1788 | 8.0 | 288 | 0.1579 | 55.8788 | 41.1155 | 45.2609 | 45.4263 |
|
63 |
+
| 0.1596 | 9.0 | 324 | 0.1497 | 56.0775 | 41.672 | 44.8889 | 45.1741 |
|
64 |
+
| 0.1545 | 10.0 | 360 | 0.1476 | 54.2211 | 40.3114 | 43.3623 | 43.4852 |
|
65 |
|
66 |
|
67 |
### Framework versions
|
68 |
|
69 |
+
- Transformers 4.42.4
|
70 |
- Pytorch 2.3.1+cu121
|
71 |
- Datasets 2.20.0
|
72 |
- Tokenizers 0.19.1
|
generation_config.json
CHANGED
@@ -11,6 +11,6 @@
|
|
11 |
"no_repeat_ngram_size": 3,
|
12 |
"num_beams": 4,
|
13 |
"pad_token_id": 1,
|
14 |
-
"transformers_version": "4.
|
15 |
"use_cache": false
|
16 |
}
|
|
|
11 |
"no_repeat_ngram_size": 3,
|
12 |
"num_beams": 4,
|
13 |
"pad_token_id": 1,
|
14 |
+
"transformers_version": "4.42.4",
|
15 |
"use_cache": false
|
16 |
}
|