wgcv committed on
Commit
e3d594f
1 Parent(s): b6bf329

End of training

README.md CHANGED
@@ -17,12 +17,12 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [google-t5/t5-small](https://huggingface.co/google-t5/t5-small) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.0120
- - Rouge1: 0.7424
- - Rouge2: 0.6202
- - Rougel: 0.7365
- - Rougelsum: 0.7347
- - Gen Len: 7.0385
+ - Loss: 0.9966
+ - Rouge1: 0.7313
+ - Rouge2: 0.6085
+ - Rougel: 0.7292
+ - Rougelsum: 0.73
+ - Gen Len: 6.9487
 
  ## Model description
 
@@ -47,20 +47,15 @@ The following hyperparameters were used during training:
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 8
+ - num_epochs: 12
 
  ### Training results
 
- | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
- |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
- | No log | 1.0 | 132 | 1.2896 | 0.6823 | 0.5884 | 0.6755 | 0.6759 | 8.9744 |
- | No log | 2.0 | 264 | 1.1210 | 0.7277 | 0.6094 | 0.7187 | 0.7176 | 7.5342 |
- | No log | 3.0 | 396 | 1.0676 | 0.7358 | 0.6136 | 0.7287 | 0.7273 | 7.3077 |
- | 1.5149 | 4.0 | 528 | 1.0409 | 0.7462 | 0.6233 | 0.7386 | 0.7367 | 7.1368 |
- | 1.5149 | 5.0 | 660 | 1.0277 | 0.7464 | 0.6233 | 0.7387 | 0.7367 | 7.1068 |
- | 1.5149 | 6.0 | 792 | 1.0186 | 0.7463 | 0.6229 | 0.7383 | 0.7362 | 7.0769 |
- | 1.5149 | 7.0 | 924 | 1.0135 | 0.7423 | 0.6202 | 0.7364 | 0.7346 | 7.0427 |
- | 1.0709 | 8.0 | 1056 | 1.0120 | 0.7424 | 0.6202 | 0.7365 | 0.7347 | 7.0385 |
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
+ |:-------------:|:-------:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
+ | 1.692 | 3.7879 | 500 | 1.0552 | 0.7273 | 0.6085 | 0.7224 | 0.7248 | 7.1709 |
+ | 1.0841 | 7.5758 | 1000 | 1.0062 | 0.7346 | 0.6104 | 0.7318 | 0.7326 | 7.0342 |
+ | 1.0105 | 11.3636 | 1500 | 0.9966 | 0.7313 | 0.6085 | 0.7292 | 0.73 | 6.9487 |
 
 
  ### Framework versions
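
The fractional values in the Epoch column of the newer results table can be reproduced from the step counts. The sketch below assumes the number of optimizer steps per epoch is unchanged from the earlier run, where step 132 coincides with epoch 1.0; the function names are hypothetical and only illustrate the arithmetic.

```python
# Steps per epoch, read off the earlier table (step 132 at epoch 1.0).
# Assumption: the dataset and batch size did not change between the two runs.
STEPS_PER_EPOCH = 132


def step_to_epoch(step: int, steps_per_epoch: int = STEPS_PER_EPOCH) -> float:
    """Convert an optimizer step count to a fractional epoch (4 decimal places)."""
    return round(step / steps_per_epoch, 4)


def total_steps(num_epochs: int, steps_per_epoch: int = STEPS_PER_EPOCH) -> int:
    """Number of optimizer steps in a full run of num_epochs epochs."""
    return num_epochs * steps_per_epoch


if __name__ == "__main__":
    # Steps at which the newer run logged evaluation results.
    for step in (500, 1000, 1500):
        print(step, step_to_epoch(step))
    # Length of the 8-epoch and 12-epoch runs under the same assumption.
    print(total_steps(8), total_steps(12))
```

Under that assumption a 12-epoch run has 1584 steps, so the last logged row (step 1500, epoch 11.3636) falls before the end of training. The summary metrics at the top of the card match that step-1500 row.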
runs/Jul09_16-27-58_c5eadc05cc54/events.out.tfevents.1720542479.c5eadc05cc54.3138.2 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:f452b20603c9aee65a67c2e39fb8213f7ec0263487f34500ecba7d8ecf03896a
- size 8046
+ oid sha256:ebef8fc9d18ebfbb80f731723a4dec1bf964aeb8e33f2909026bf5661124632b
+ size 8400
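
The TensorBoard event file is tracked with Git LFS, so the diff shows only its pointer: the `oid` is the SHA-256 hex digest of the file content and `size` is its length in bytes (8046 before this commit, 8400 after). As a minimal sketch of how such a pointer is derived, assuming the content is available as bytes, the hypothetical helper `lfs_pointer` below builds the same three-line text. It is an illustration, not the Git LFS client's own code.

```python
import hashlib


def lfs_pointer(payload: bytes) -> str:
    """Build the text of a Git LFS v1 pointer for the given file content."""
    oid = hashlib.sha256(payload).hexdigest()  # hex digest of the raw bytes
    return (
        "version https://git-lfs.github.com/spec/v1\n"
        f"oid sha256:{oid}\n"
        f"size {len(payload)}\n"  # size is the byte length of the content
    )


if __name__ == "__main__":
    # Placeholder content; a real event file would be read with open(path, "rb").
    print(lfs_pointer(b"hello"), end="")
```

Comparing the output for a local copy of the event file against the pointer in the diff is one way to check that the download matches what was committed.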