eglkan1
/

flan-t5-small-lit-simplif

@@ -17,12 +17,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.3804
-- Rouge1: 15.4651
-- Rouge2: 7.9672
-- Rougel: 14.8556
-- Rougelsum: 15.0201
-- Gen Len: 19.0
 ## Model description
@@ -51,13 +51,13 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2 | Rougel  | Rougelsum | Gen Len |
-|:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|:-------:|
-| No log        | 1.0   | 92   | 1.5738          | 14.8098 | 7.3808 | 14.3159 | 14.2841   | 18.8913 |
-| No log        | 2.0   | 184  | 1.4534          | 15.0032 | 7.7761 | 14.4801 | 14.5351   | 19.0    |
-| No log        | 3.0   | 276  | 1.4216          | 15.4651 | 7.9672 | 14.8556 | 15.0201   | 19.0    |
-| No log        | 4.0   | 368  | 1.3949          | 15.4651 | 7.9672 | 14.8556 | 15.0201   | 19.0    |
-| No log        | 5.0   | 460  | 1.3804          | 15.4651 | 7.9672 | 14.8556 | 15.0201   | 19.0    |
 ### Framework versions

 This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.8832
+- Rouge1: 57.5616
+- Rouge2: 43.0588
+- Rougel: 54.6246
+- Rougelsum: 54.8382
+- Gen Len: 18.4914
 ## Model description
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
+| 1.3943        | 1.0   | 698  | 1.0480          | 57.3763 | 43.1329 | 54.1155 | 54.4964   | 18.6571 |
+| 1.1857        | 2.0   | 1396 | 0.9521          | 57.315  | 43.2483 | 54.4032 | 54.7664   | 18.6771 |
+| 1.0406        | 3.0   | 2094 | 0.9075          | 57.6951 | 43.4451 | 54.8174 | 55.0469   | 18.5343 |
+| 0.9861        | 4.0   | 2792 | 0.8873          | 57.8533 | 43.409  | 54.7583 | 55.0156   | 18.5971 |
+| 0.9592        | 5.0   | 3490 | 0.8832          | 57.5616 | 43.0588 | 54.6246 | 54.8382   | 18.4914 |
 ### Framework versions

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e4b027d1afe792c4bc6f75ceedf9f060d59e4ff91a1394e525ce90667ba3a469
 size 307910149

 version https://git-lfs.github.com/spec/v1
+oid sha256:76d3350d856209ad0a53d2a66731e00d7915b8979858a35657bccfc5ed600149
 size 307910149

tokenizer.json CHANGED Viewed

@@ -2,13 +2,13 @@
   "version": "1.0",
   "truncation": {
     "direction": "Right",
-    "max_length": 512,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
     "strategy": {
-      "Fixed": 512
     },
     "direction": "Right",
     "pad_to_multiple_of": null,

   "version": "1.0",
   "truncation": {
     "direction": "Right",
+    "max_length": 80,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
     "strategy": {
+      "Fixed": 80
     },
     "direction": "Right",
     "pad_to_multiple_of": null,