Areeb123
/

mt5-small-finetuned_samsum_summarization_model

@@ -23,7 +23,7 @@ model-index:
     metrics:
     - name: Rouge1
       type: rouge
-      value: 38.4852
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -33,11 +33,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the samsum dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.0164
-- Rouge1: 38.4852
-- Rouge2: 16.4292
-- Rougel: 32.9585
-- Rougelsum: 36.0185
 ## Model description
@@ -62,17 +62,20 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
-| 4.9849        | 1.0   | 1050 | 2.2071          | 34.8128 | 14.0544 | 29.8982 | 32.2776   |
-| 2.7097        | 2.0   | 2100 | 2.1157          | 37.7348 | 15.9587 | 32.2724 | 35.2982   |
-| 2.5305        | 3.0   | 3150 | 2.0553          | 38.4581 | 16.4518 | 32.7643 | 35.936    |
-| 2.451         | 4.0   | 4200 | 2.0253          | 38.3972 | 16.3508 | 32.7684 | 35.9072   |
-| 2.4132        | 5.0   | 5250 | 2.0164          | 38.4852 | 16.4292 | 32.9585 | 36.0185   |
 ### Framework versions

     metrics:
     - name: Rouge1
       type: rouge
+      value: 39.9323
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the samsum dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.9328
+- Rouge1: 39.9323
+- Rouge2: 18.0293
+- Rougel: 34.3611
+- Rougelsum: 37.3087
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 8
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
+| 4.5012        | 1.0   | 1050 | 2.1992          | 34.6608 | 14.0886 | 29.8674 | 32.1737   |
+| 2.6852        | 2.0   | 2100 | 2.1014          | 38.1793 | 16.0747 | 32.5426 | 35.4332   |
+| 2.4933        | 3.0   | 3150 | 2.0319          | 38.4414 | 16.4993 | 32.6973 | 35.8539   |
+| 2.3933        | 4.0   | 4200 | 1.9910          | 39.2966 | 17.1718 | 33.5556 | 36.802    |
+| 2.3273        | 5.0   | 5250 | 1.9764          | 39.7619 | 17.7287 | 33.9838 | 37.1345   |
+| 2.2783        | 6.0   | 6300 | 1.9503          | 39.9351 | 17.8312 | 34.2641 | 37.2625   |
+| 2.2543        | 7.0   | 7350 | 1.9350          | 39.9551 | 17.918  | 34.3361 | 37.2039   |
+| 2.2383        | 8.0   | 8400 | 1.9328          | 39.9323 | 18.0293 | 34.3611 | 37.3087   |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3e8c494fbfa93714a15208eb94dff042e6ff578204f3d94d464836b45d632148
 size 1200729512

 version https://git-lfs.github.com/spec/v1
+oid sha256:d5af00fe5ef76f46d2f429eb1702f62cc317d6750030f40c36de18c3ebdf4a22
 size 1200729512

runs/Nov30_13-50-10_13547b54126b/events.out.tfevents.1701352225.13547b54126b.48239.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5ce5b66c7183609a2d7ad76256d84f62c7e39f28153242a99695d1756cd6b37d
-size 9003

 version https://git-lfs.github.com/spec/v1
+oid sha256:f306b9d6275a82801e283dbf2d6d8736277a1f63214a293fb5c5c21b87fa45ef
+size 9988