mgundik committed on
Commit c420252 · verified · 1 Parent(s): 61a18f9

End of training

README.md CHANGED
@@ -17,12 +17,12 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 3.3114
- - Rouge1: 13.4323
- - Rouge2: 2.5384
- - Rougel: 10.3894
- - Rougelsum: 12.3641
- - Gen Len: 18.9838
 
  ## Model description
 
@@ -47,17 +47,19 @@ The following hyperparameters were used during training:
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 4
  - mixed_precision_training: Native AMP
 
  ### Training results
 
- | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
- |:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|:-------:|
- | 3.5251 | 1.0 | 600 | 3.3437 | 13.1074 | 2.3866 | 10.2062 | 12.0447 | 18.9879 |
- | 3.4929 | 2.0 | 1200 | 3.3240 | 13.3594 | 2.4474 | 10.2984 | 12.3059 | 18.9717 |
- | 3.4717 | 3.0 | 1800 | 3.3144 | 13.3894 | 2.5074 | 10.3539 | 12.3039 | 18.9838 |
- | 3.453 | 4.0 | 2400 | 3.3114 | 13.4323 | 2.5384 | 10.3894 | 12.3641 | 18.9838 |
 
 
  ### Framework versions
 
 
  This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
  It achieves the following results on the evaluation set:
+ - Loss: 3.3115
+ - Rouge1: 20.5085
+ - Rouge2: 2.9908
+ - Rougel: 13.8508
+ - Rougelsum: 18.4822
+ - Gen Len: 228.1577
 
  ## Model description
 
 
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
+ - num_epochs: 6
  - mixed_precision_training: Native AMP
 
  ### Training results
 
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
+ |:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|:--------:|
+ | 3.7585 | 1.0 | 600 | 3.3969 | 17.6639 | 2.5697 | 12.3838 | 15.9802 | 307.0236 |
+ | 3.5518 | 2.0 | 1200 | 3.3526 | 20.0469 | 2.9053 | 13.6055 | 18.0492 | 248.2581 |
+ | 3.5108 | 3.0 | 1800 | 3.3318 | 20.0243 | 2.8879 | 13.5558 | 17.9889 | 245.3416 |
+ | 3.4798 | 4.0 | 2400 | 3.3202 | 20.1501 | 2.9346 | 13.6819 | 18.1977 | 232.3801 |
+ | 3.4542 | 5.0 | 3000 | 3.3134 | 20.6061 | 3.0311 | 13.9844 | 18.5832 | 217.8302 |
+ | 3.453 | 6.0 | 3600 | 3.3115 | 20.5085 | 2.9908 | 13.8508 | 18.4822 | 228.1577 |
 
 
  ### Framework versions
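
To read the two README versions side by side, the final evaluation metrics can be compared directly. The following is a minimal sketch, not part of the commit: the numbers are copied from the removed and added lines above, and the helper name `metric_deltas` is hypothetical. Note that Gen Len changes by roughly a factor of twelve between the runs, so the ROUGE differences may partly reflect longer generated outputs rather than a better model.

```python
# Final evaluation metrics copied from the old (-) and new (+) README lines.
old = {"Loss": 3.3114, "Rouge1": 13.4323, "Rouge2": 2.5384,
       "Rougel": 10.3894, "Rougelsum": 12.3641, "Gen Len": 18.9838}
new = {"Loss": 3.3115, "Rouge1": 20.5085, "Rouge2": 2.9908,
       "Rougel": 13.8508, "Rougelsum": 18.4822, "Gen Len": 228.1577}


def metric_deltas(before, after):
    """Return after - before for every metric, rounded to 4 decimals."""
    return {name: round(after[name] - before[name], 4) for name in before}


deltas = metric_deltas(old, new)
for name, delta in deltas.items():
    print(f"{name:>10}: {old[name]:>9.4f} -> {new[name]:>9.4f} ({delta:+.4f})")
```
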
generation_config.json CHANGED
@@ -1,6 +1,7 @@
  {
  "decoder_start_token_id": 0,
  "eos_token_id": 1,
  "pad_token_id": 0,
  "transformers_version": "4.42.4"
  }
 
  {
  "decoder_start_token_id": 0,
  "eos_token_id": 1,
+ "max_new_tokens": 1000,
  "pad_token_id": 0,
  "transformers_version": "4.42.4"
  }
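
The only change to `generation_config.json` is the added `max_new_tokens` key. The sketch below, which is illustrative and not part of the commit, checks this by diffing the two JSON objects shown above using only the standard library; the helper name `changed_keys` is hypothetical. As far as I know, `transformers` falls back to a default maximum length of about 20 tokens when no limit is configured, which would be consistent with the earlier Gen Len of about 19, but that link is an assumption rather than something the commit states.

```python
import json

# The two versions of generation_config.json, copied from the diff above.
old_config = json.loads("""
{
  "decoder_start_token_id": 0,
  "eos_token_id": 1,
  "pad_token_id": 0,
  "transformers_version": "4.42.4"
}
""")
new_config = json.loads("""
{
  "decoder_start_token_id": 0,
  "eos_token_id": 1,
  "max_new_tokens": 1000,
  "pad_token_id": 0,
  "transformers_version": "4.42.4"
}
""")


def changed_keys(before, after):
    """Return (added, removed, modified) key mappings between two dicts."""
    added = {k: after[k] for k in after.keys() - before.keys()}
    removed = {k: before[k] for k in before.keys() - after.keys()}
    modified = {k: (before[k], after[k])
                for k in before.keys() & after.keys()
                if before[k] != after[k]}
    return added, removed, modified


added, removed, modified = changed_keys(old_config, new_config)
print("added:", added)
print("removed:", removed)
print("modified:", modified)
```
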
runs/Jul28_19-08-25_15f254a06cfb/events.out.tfevents.1722193727.15f254a06cfb.302.0 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:081b789e7be69b73f2d1a5522926afdc1b9a714a7b1d05efa1cd720825cf1f18
- size 11754
 
  version https://git-lfs.github.com/spec/v1
+ oid sha256:1472b5fb709a83541e715a92778f6dcda29b683b8acbeca646443a820b38ae0a
+ size 12633