End of training

Files changed (2)

README.md CHANGED

@@ -16,17 +16,17 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/mt5-base](https://huggingface.co/google/mt5-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.4040
-- Rouge-1 R: 0.3241
-- Rouge-1 P: 0.3123
-- Rouge-1 F: 0.3097
-- Rouge-2 R: 0.169
-- Rouge-2 P: 0.1616
-- Rouge-2 F: 0.1599
-- Rouge-l R: 0.3157
-- Rouge-l P: 0.3043
-- Rouge-l F: 0.3017
-- Gen Len: 15.8317
+- Loss: 1.3685
+- Rouge-1 R: 0.3321
+- Rouge-1 P: 0.3218
+- Rouge-1 F: 0.3186
+- Rouge-2 R: 0.1761
+- Rouge-2 P: 0.1703
+- Rouge-2 F: 0.1677
+- Rouge-l R: 0.3234
+- Rouge-l P: 0.3133
+- Rouge-l F: 0.3102
+- Gen Len: 15.7133
 ## Model description
@@ -51,14 +51,15 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 2
+- num_epochs: 3
 ### Training results
 | Training Loss | Epoch | Step  | Validation Loss | Rouge-1 R | Rouge-1 P | Rouge-1 F | Rouge-2 R | Rouge-2 P | Rouge-2 F | Rouge-l R | Rouge-l P | Rouge-l F | Gen Len |
 |:-------------:|:-----:|:-----:|:---------------:|:---------:|:---------:|:---------:|:---------:|:---------:|:---------:|:---------:|:---------:|:---------:|:-------:|
-| 1.8489        | 1.0   | 10191 | 1.4722          | 0.3127    | 0.3174    | 0.3056    | 0.1591    | 0.1613    | 0.1543    | 0.3046    | 0.3091    | 0.2976    | 15.2001 |
-| 1.7273        | 2.0   | 20382 | 1.4040          | 0.3241    | 0.3123    | 0.3097    | 0.169     | 0.1616    | 0.1599    | 0.3157    | 0.3043    | 0.3017    | 15.8317 |
+| 1.8844        | 1.0   | 10191 | 1.4867          | 0.31      | 0.3133    | 0.3024    | 0.1576    | 0.1605    | 0.1531    | 0.3015    | 0.3048    | 0.2942    | 15.2667 |
+| 1.7381        | 2.0   | 20382 | 1.4401          | 0.3203    | 0.3104    | 0.3068    | 0.1675    | 0.162     | 0.1592    | 0.3121    | 0.3026    | 0.299     | 15.699  |
+| 1.6401        | 3.0   | 30573 | 1.3685          | 0.3321    | 0.3218    | 0.3186    | 0.1761    | 0.1703    | 0.1677    | 0.3234    | 0.3133    | 0.3102    | 15.7133 |
 ### Framework versions

runs/Jul20_03-11-00_4f90f64d3f48/events.out.tfevents.1721445062.4f90f64d3f48.24.0 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:fbdad2085aa02f8f166b073c2c5716de34df4243eecb0c052ea8abf032da7387
-size 19801
+oid sha256:efedb4d9d3e4179b2a7e0147deb5dd862b3ff226ed223b5eea801c220f420313
+size 20975