Areeb123 commited on
Commit
6b3a489
1 Parent(s): ec04cb7

Training complete

Browse files
README.md CHANGED
@@ -23,7 +23,7 @@ model-index:
23
  metrics:
24
  - name: Rouge1
25
  type: rouge
26
- value: 38.4852
27
  ---
28
 
29
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -33,11 +33,11 @@ should probably proofread and complete it, then remove this comment. -->
33
 
34
  This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the samsum dataset.
35
  It achieves the following results on the evaluation set:
36
- - Loss: 2.0164
37
- - Rouge1: 38.4852
38
- - Rouge2: 16.4292
39
- - Rougel: 32.9585
40
- - Rougelsum: 36.0185
41
 
42
  ## Model description
43
 
@@ -62,17 +62,20 @@ The following hyperparameters were used during training:
62
  - seed: 42
63
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
64
  - lr_scheduler_type: linear
65
- - num_epochs: 5
66
 
67
  ### Training results
68
 
69
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
70
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
71
- | 4.9849 | 1.0 | 1050 | 2.2071 | 34.8128 | 14.0544 | 29.8982 | 32.2776 |
72
- | 2.7097 | 2.0 | 2100 | 2.1157 | 37.7348 | 15.9587 | 32.2724 | 35.2982 |
73
- | 2.5305 | 3.0 | 3150 | 2.0553 | 38.4581 | 16.4518 | 32.7643 | 35.936 |
74
- | 2.451 | 4.0 | 4200 | 2.0253 | 38.3972 | 16.3508 | 32.7684 | 35.9072 |
75
- | 2.4132 | 5.0 | 5250 | 2.0164 | 38.4852 | 16.4292 | 32.9585 | 36.0185 |
 
 
 
76
 
77
 
78
  ### Framework versions
 
23
  metrics:
24
  - name: Rouge1
25
  type: rouge
26
+ value: 39.9323
27
  ---
28
 
29
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
33
 
34
  This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the samsum dataset.
35
  It achieves the following results on the evaluation set:
36
+ - Loss: 1.9328
37
+ - Rouge1: 39.9323
38
+ - Rouge2: 18.0293
39
+ - Rougel: 34.3611
40
+ - Rougelsum: 37.3087
41
 
42
  ## Model description
43
 
 
62
  - seed: 42
63
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
64
  - lr_scheduler_type: linear
65
+ - num_epochs: 8
66
 
67
  ### Training results
68
 
69
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
70
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
71
+ | 4.5012 | 1.0 | 1050 | 2.1992 | 34.6608 | 14.0886 | 29.8674 | 32.1737 |
72
+ | 2.6852 | 2.0 | 2100 | 2.1014 | 38.1793 | 16.0747 | 32.5426 | 35.4332 |
73
+ | 2.4933 | 3.0 | 3150 | 2.0319 | 38.4414 | 16.4993 | 32.6973 | 35.8539 |
74
+ | 2.3933 | 4.0 | 4200 | 1.9910 | 39.2966 | 17.1718 | 33.5556 | 36.802 |
75
+ | 2.3273 | 5.0 | 5250 | 1.9764 | 39.7619 | 17.7287 | 33.9838 | 37.1345 |
76
+ | 2.2783 | 6.0 | 6300 | 1.9503 | 39.9351 | 17.8312 | 34.2641 | 37.2625 |
77
+ | 2.2543 | 7.0 | 7350 | 1.9350 | 39.9551 | 17.918 | 34.3361 | 37.2039 |
78
+ | 2.2383 | 8.0 | 8400 | 1.9328 | 39.9323 | 18.0293 | 34.3611 | 37.3087 |
79
 
80
 
81
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3e8c494fbfa93714a15208eb94dff042e6ff578204f3d94d464836b45d632148
3
  size 1200729512
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d5af00fe5ef76f46d2f429eb1702f62cc317d6750030f40c36de18c3ebdf4a22
3
  size 1200729512
runs/Nov30_13-50-10_13547b54126b/events.out.tfevents.1701352225.13547b54126b.48239.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5ce5b66c7183609a2d7ad76256d84f62c7e39f28153242a99695d1756cd6b37d
3
- size 9003
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f306b9d6275a82801e283dbf2d6d8736277a1f63214a293fb5c5c21b87fa45ef
3
+ size 9988