DaJulster commited on
Commit
683bdb4
1 Parent(s): 998998e

End of training

Browse files
README.md CHANGED
@@ -17,11 +17,11 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [google-t5/t5-base](https://huggingface.co/google-t5/t5-base) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 2.6565
21
- - Rouge1: 0.0957
22
- - Rouge2: 0.0189
23
- - Rougel: 0.0745
24
- - Rougelsum: 0.0745
25
  - Gen Len: 19.0
26
 
27
  ## Model description
@@ -47,15 +47,18 @@ The following hyperparameters were used during training:
47
  - seed: 42
48
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
49
  - lr_scheduler_type: linear
50
- - num_epochs: 2
51
  - mixed_precision_training: Native AMP
52
 
53
  ### Training results
54
 
55
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
56
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
57
- | 3.4822 | 1.0 | 658 | 2.9374 | 0.0738 | 0.0103 | 0.0627 | 0.0627 | 19.0 |
58
- | 2.8397 | 2.0 | 1316 | 2.6565 | 0.0957 | 0.0189 | 0.0745 | 0.0745 | 19.0 |
 
 
 
59
 
60
 
61
  ### Framework versions
 
17
 
18
  This model is a fine-tuned version of [google-t5/t5-base](https://huggingface.co/google-t5/t5-base) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 2.1482
21
+ - Rouge1: 0.1071
22
+ - Rouge2: 0.0265
23
+ - Rougel: 0.0821
24
+ - Rougelsum: 0.0823
25
  - Gen Len: 19.0
26
 
27
  ## Model description
 
47
  - seed: 42
48
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
49
  - lr_scheduler_type: linear
50
+ - num_epochs: 5
51
  - mixed_precision_training: Native AMP
52
 
53
  ### Training results
54
 
55
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
56
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
57
+ | 2.3526 | 1.0 | 658 | 2.7749 | 0.089 | 0.0141 | 0.0707 | 0.0707 | 19.0 |
58
+ | 2.0919 | 2.0 | 1316 | 2.5361 | 0.1003 | 0.0202 | 0.0777 | 0.0778 | 19.0 |
59
+ | 2.1006 | 3.0 | 1974 | 2.3129 | 0.1084 | 0.0209 | 0.0807 | 0.0808 | 19.0 |
60
+ | 1.3701 | 4.0 | 2632 | 2.2004 | 0.1025 | 0.0213 | 0.0779 | 0.078 | 19.0 |
61
+ | 1.0634 | 5.0 | 3290 | 2.1482 | 0.1071 | 0.0265 | 0.0821 | 0.0823 | 19.0 |
62
 
63
 
64
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b8d2516a5e32c66777813ec790b3ceff844e25cc612d8b7a4472ff071a830be6
3
  size 891644712
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fd3cb89f2a7f05f673cd2765687ca073a43ab172dbc6a5eb2ac244255763a5b0
3
  size 891644712
runs/Mar31_05-13-08_a5b5045d4ed5/events.out.tfevents.1711861989.a5b5045d4ed5.35.1 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:fd90b43f425b729ade130f063c1d3ea317159d037d81604e290d8db34c673888
3
- size 8978
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:15597fe0678eeb51d3392779cd0e0f8d79b3a250844b393cb741b7dea3c4bfed
3
+ size 9857