achimoraites committed on
Commit
3daef6c
1 Parent(s): ad45030

End of training

README.md CHANGED
@@ -21,7 +21,7 @@ model-index:
  metrics:
  - name: Rouge1
  type: rouge
- value: 33.7152
+ value: 32.3503
  ---
 
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -31,12 +31,12 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the xsum dataset.
  It achieves the following results on the evaluation set:
- - Loss: 2.0726
- - Rouge1: 33.7152
- - Rouge2: 12.5057
- - Rougel: 27.3989
- - Rougelsum: 27.3994
- - Gen Len: 18.7527
+ - Loss: 2.0798
+ - Rouge1: 32.3503
+ - Rouge2: 10.8909
+ - Rougel: 25.9346
+ - Rougelsum: 25.9216
+ - Gen Len: 18.8494
 
  ## Model description
 
@@ -61,18 +61,22 @@ The following hyperparameters were used during training:
  - seed: 42
  - optimizer: Adafactor
  - lr_scheduler_type: linear
- - num_epochs: 1
+ - num_epochs: 5
 
  ### Training results
 
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
- | 1.3744 | 1.0 | 5102 | 2.0726 | 33.7152 | 12.5057 | 27.3989 | 27.3994 | 18.7527 |
+ | 2.335 | 1.0 | 1417 | 2.0823 | 31.3453 | 10.2077 | 25.0051 | 25.008 | 18.8259 |
+ | 1.8642 | 2.0 | 2834 | 2.0798 | 32.3503 | 10.8909 | 25.9346 | 25.9216 | 18.8494 |
+ | 1.5208 | 3.0 | 4251 | 2.1272 | 32.6743 | 11.3394 | 26.3776 | 26.3724 | 18.8435 |
+ | 1.2628 | 4.0 | 5668 | 2.2110 | 32.695 | 11.3273 | 26.3215 | 26.322 | 18.8306 |
+ | 1.0649 | 5.0 | 7085 | 2.3143 | 32.5287 | 11.3662 | 26.274 | 26.2741 | 18.8345 |
 
 
  ### Framework versions
 
  - Transformers 4.26.1
  - Pytorch 1.13.1+cu116
- - Datasets 2.9.0
+ - Datasets 2.10.0
  - Tokenizers 0.13.2
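
A note on the README.md numbers: the headline metrics in the new model card (Loss 2.0798, Rouge1 32.3503) match the epoch 2 row of the training results table, not the final epoch. The sketch below reproduces that observation from the table values. The function names are hypothetical, and the reading that the best checkpoint by validation loss was kept (for example through the Trainer's `load_best_model_at_end` option) is an assumption; the commit does not state how the reported row was chosen.

```python
# Per-epoch rows copied from the "Training results" table in the new README.md.
# Columns: (epoch, training_loss, validation_loss, rouge1)
ROWS = [
    (1, 2.335, 2.0823, 31.3453),
    (2, 1.8642, 2.0798, 32.3503),
    (3, 1.5208, 2.1272, 32.6743),
    (4, 1.2628, 2.2110, 32.695),
    (5, 1.0649, 2.3143, 32.5287),
]


def best_by_val_loss(rows):
    """Return the row with the lowest validation loss (hypothetical helper)."""
    return min(rows, key=lambda row: row[2])


def best_by_rouge1(rows):
    """Return the row with the highest Rouge1 score (hypothetical helper)."""
    return max(rows, key=lambda row: row[3])


loss_row = best_by_val_loss(ROWS)
rouge_row = best_by_rouge1(ROWS)
print(f"lowest validation loss: epoch {loss_row[0]} (loss {loss_row[2]}, Rouge1 {loss_row[3]})")
print(f"highest Rouge1: epoch {rouge_row[0]} (loss {rouge_row[2]}, Rouge1 {rouge_row[3]})")
```

Validation loss rises after epoch 2 while training loss keeps falling, so the two selection criteria disagree: Rouge1 peaks at epoch 4, but the card reports the epoch 2 row.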
logs/events.out.tfevents.1677232215.de363b89a23d.3558.0 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:5e1a8ed4ab6c7c517bac33706b0c7dde8e4a757def1e40e39d7de73e4a2e20fc
- size 9602
+ oid sha256:2c7fe28088477090f81402aa24c64bb860a1e6d6f76d35931832aba8e8b02fd3
+ size 9956
logs/events.out.tfevents.1677241045.de363b89a23d.3558.2 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:e16a74fb960fe41b791fcb2b298f730041321536dd7aa2f89af7e6e996a5e488
+ size 565
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:ea3428fbee20caeb15bdbf97d441c26f8425988d838f0e5fc0cc20cee9c6d666
  size 990408885
+ oid sha256:61f971cb758681d37595a053961983acbd5bddcde161e2e68629cdfa6c325de0
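
The pytorch_model.bin change only touches a Git LFS pointer file, so the diff shows a new `oid` with an unchanged `size`. As a minimal sketch, the pointer text can be parsed and compared as below. `parse_lfs_pointer` is a hypothetical helper written for this note, not part of any Git LFS tooling, and it assumes the three-line `key value` layout seen in this diff.

```python
def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer file (hypothetical helper)."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Old and new pointer contents, copied from the pytorch_model.bin diff.
OLD_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:ea3428fbee20caeb15bdbf97d441c26f8425988d838f0e5fc0cc20cee9c6d666
size 990408885
"""
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:61f971cb758681d37595a053961983acbd5bddcde161e2e68629cdfa6c325de0
size 990408885
"""

old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)
print("same size:", old["size"] == new["size"])
print("same content hash:", old["digest"] == new["digest"])
```

An identical byte size with a different hash is what one would expect when weights of the same architecture and dtype are retrained: the stored values change but the file layout does not.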
tokenizer.json CHANGED
@@ -2,13 +2,13 @@
  "version": "1.0",
  "truncation": {
  "direction": "Right",
- "max_length": 154,
+ "max_length": 160,
  "strategy": "LongestFirst",
  "stride": 0
  },
  "padding": {
  "strategy": {
- "Fixed": 154
+ "Fixed": 160
  },
  "direction": "Right",
  "pad_to_multiple_of": null,
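
The tokenizer.json change raises both the truncation limit and the fixed padding length from 154 to 160 tokens. The sketch below illustrates what those two settings mean for a single sequence of token ids, using only the keys visible in the diff. `truncate_and_pad` is a hypothetical helper, not the tokenizers library API; the closing braces on the JSON fragment are added here so it parses; and `pad_id=0` is an assumption, since the pad token id is not shown in this hunk.

```python
import json

# Fragment of the new tokenizer.json, limited to the keys shown in the diff.
# The closing braces are added so the fragment is valid JSON.
CONFIG = json.loads("""
{
  "version": "1.0",
  "truncation": {
    "direction": "Right",
    "max_length": 160,
    "strategy": "LongestFirst",
    "stride": 0
  },
  "padding": {
    "strategy": {"Fixed": 160},
    "direction": "Right",
    "pad_to_multiple_of": null
  }
}
""")


def truncate_and_pad(ids, config, pad_id=0):
    """Right-truncate, then right-pad to the fixed length (hypothetical helper).

    pad_id=0 is an assumption; the diff does not show the pad token id.
    """
    max_length = config["truncation"]["max_length"]
    fixed_length = config["padding"]["strategy"]["Fixed"]
    kept = list(ids)[:max_length]  # "Right" truncation drops tokens from the end
    return kept + [pad_id] * (fixed_length - len(kept))  # "Right" padding appends


long_ids = truncate_and_pad(range(1, 301), CONFIG)
short_ids = truncate_and_pad([5, 6, 7], CONFIG)
print(len(long_ids), len(short_ids))
```

With both values set to 160, every encoded sequence comes out at the same length, whether it was originally longer or shorter than the limit.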