shivraj221
/

mt5-small-finetuned-news-summary-kaggle

@@ -7,22 +7,22 @@ tags:
 metrics:
 - rouge
 model-index:
-- name: mt5-small-finetuned-news-summary-kaggle
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# mt5-small-finetuned-news-summary-kaggle
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.5691
-- Rouge1: 29.8633
-- Rouge2: 11.698
-- Rougel: 26.8739
-- Rougelsum: 26.8536
 ## Model description
@@ -41,26 +41,30 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 5.6e-05
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 8
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum |
-|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
-| 8.1234        | 1.0   | 440  | 3.3123          | 18.1738 | 5.9811  | 16.7457 | 16.7126   |
-| 4.2107        | 2.0   | 880  | 2.8404          | 23.009  | 8.3824  | 20.9074 | 20.8962   |
-| 3.738         | 3.0   | 1320 | 2.7354          | 26.5696 | 10.1059 | 23.9321 | 24.0214   |
-| 3.4864        | 4.0   | 1760 | 2.6756          | 27.193  | 10.1971 | 24.4763 | 24.4933   |
-| 3.3642        | 5.0   | 2200 | 2.6224          | 28.7842 | 11.5323 | 26.317  | 26.3211   |
-| 3.269         | 6.0   | 2640 | 2.5883          | 29.6579 | 11.8043 | 26.8824 | 26.8692   |
-| 3.212         | 7.0   | 3080 | 2.5677          | 29.7513 | 11.6639 | 26.6042 | 26.64     |
-| 3.186         | 8.0   | 3520 | 2.5691          | 29.8633 | 11.698  | 26.8739 | 26.8536   |
 ### Framework versions

 metrics:
 - rouge
 model-index:
+- name: mt5-small-finetuned-news-summary-model-2
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# mt5-small-finetuned-news-summary-model-2
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.5813
+- Rouge1: 29.4322
+- Rouge2: 11.4361
+- Rougel: 26.3875
+- Rougelsum: 26.297
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 4e-05
+- train_batch_size: 10
+- eval_batch_size: 10
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 12
 ### Training results
+| Training Loss | Epoch   | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum |
+|:-------------:|:-------:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
+| 9.2632        | 0.9972  | 351  | 3.7059          | 17.3365 | 5.2307  | 15.438  | 15.3776   |
+| 4.6719        | 1.9943  | 702  | 3.0896          | 19.5787 | 6.8278  | 18.0637 | 18.0255   |
+| 4.1356        | 2.9915  | 1053 | 2.8713          | 22.5668 | 8.2899  | 20.551  | 20.5232   |
+| 3.7852        | 3.9886  | 1404 | 2.7729          | 25.7974 | 9.9158  | 23.2398 | 23.2198   |
+| 3.6194        | 4.9858  | 1755 | 2.7038          | 26.2572 | 10.0034 | 24.0326 | 23.9956   |
+| 3.4864        | 5.9830  | 2106 | 2.6714          | 26.8149 | 9.9056  | 24.2704 | 24.1399   |
+| 3.3965        | 6.9801  | 2457 | 2.6361          | 27.5399 | 10.3609 | 24.8286 | 24.7628   |
+| 3.3422        | 7.9773  | 2808 | 2.6194          | 28.0298 | 10.6938 | 25.1678 | 25.0924   |
+| 3.2879        | 8.9744  | 3159 | 2.5976          | 28.2324 | 10.6412 | 25.2803 | 25.1804   |
+| 3.2391        | 9.9716  | 3510 | 2.5894          | 29.0155 | 11.174  | 25.9995 | 25.8843   |
+| 3.2128        | 10.9688 | 3861 | 2.5854          | 29.3283 | 11.477  | 26.2235 | 26.1278   |
+| 3.2214        | 11.9659 | 4212 | 2.5813          | 29.4322 | 11.4361 | 26.3875 | 26.297    |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0c6a51a85ef8a35bd08569b68519e656f820733d333c1219e54e2c5b76508e49
 size 1200729512

 version https://git-lfs.github.com/spec/v1
+oid sha256:8ca8fd76fcc632f4fe26ab7e92addc98f6134cc016d4e316f9260cd1542dc811
 size 1200729512

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:29102fba08ecb0d89f128ba27346ffe9cd734e9599318962dbf69cb0c3db039a
 size 5304

 version https://git-lfs.github.com/spec/v1
+oid sha256:b2680773dcdab95b5b3b01bd56a6a2d36df425c74a3a59d0ca0d7af8e59ea38a
 size 5304