End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -21,7 +21,7 @@ model-index:
     metrics:
     - name: Rouge1
       type: rouge
-      value: 33.7152
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -31,12 +31,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the xsum dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.0726
-- Rouge1: 33.7152
-- Rouge2: 12.5057
-- Rougel: 27.3989
-- Rougelsum: 27.3994
-- Gen Len: 18.7527
 ## Model description
@@ -61,18 +61,22 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adafactor
 - lr_scheduler_type: linear
-- num_epochs: 1
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
-| 1.3744        | 1.0   | 5102 | 2.0726          | 33.7152 | 12.5057 | 27.3989 | 27.3994   | 18.7527 |
 ### Framework versions
 - Transformers 4.26.1
 - Pytorch 1.13.1+cu116
-- Datasets 2.9.0
 - Tokenizers 0.13.2

     metrics:
     - name: Rouge1
       type: rouge
+      value: 32.3503
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the xsum dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.0798
+- Rouge1: 32.3503
+- Rouge2: 10.8909
+- Rougel: 25.9346
+- Rougelsum: 25.9216
+- Gen Len: 18.8494
 ## Model description
 - seed: 42
 - optimizer: Adafactor
 - lr_scheduler_type: linear
+- num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
+| 2.335         | 1.0   | 1417 | 2.0823          | 31.3453 | 10.2077 | 25.0051 | 25.008    | 18.8259 |
+| 1.8642        | 2.0   | 2834 | 2.0798          | 32.3503 | 10.8909 | 25.9346 | 25.9216   | 18.8494 |
+| 1.5208        | 3.0   | 4251 | 2.1272          | 32.6743 | 11.3394 | 26.3776 | 26.3724   | 18.8435 |
+| 1.2628        | 4.0   | 5668 | 2.2110          | 32.695  | 11.3273 | 26.3215 | 26.322    | 18.8306 |
+| 1.0649        | 5.0   | 7085 | 2.3143          | 32.5287 | 11.3662 | 26.274  | 26.2741   | 18.8345 |
 ### Framework versions
 - Transformers 4.26.1
 - Pytorch 1.13.1+cu116
+- Datasets 2.10.0
 - Tokenizers 0.13.2

logs/events.out.tfevents.1677232215.de363b89a23d.3558.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5e1a8ed4ab6c7c517bac33706b0c7dde8e4a757def1e40e39d7de73e4a2e20fc
-size 9602

 version https://git-lfs.github.com/spec/v1
+oid sha256:2c7fe28088477090f81402aa24c64bb860a1e6d6f76d35931832aba8e8b02fd3
+size 9956

logs/events.out.tfevents.1677241045.de363b89a23d.3558.2 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:e16a74fb960fe41b791fcb2b298f730041321536dd7aa2f89af7e6e996a5e488
+size 565

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ea3428fbee20caeb15bdbf97d441c26f8425988d838f0e5fc0cc20cee9c6d666
 size 990408885

 version https://git-lfs.github.com/spec/v1
+oid sha256:61f971cb758681d37595a053961983acbd5bddcde161e2e68629cdfa6c325de0
 size 990408885

tokenizer.json CHANGED Viewed

@@ -2,13 +2,13 @@
   "version": "1.0",
   "truncation": {
     "direction": "Right",
-    "max_length": 154,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
     "strategy": {
-      "Fixed": 154
     },
     "direction": "Right",
     "pad_to_multiple_of": null,

   "version": "1.0",
   "truncation": {
     "direction": "Right",
+    "max_length": 160,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
     "strategy": {
+      "Fixed": 160
     },
     "direction": "Right",
     "pad_to_multiple_of": null,