bprateek committed · Commit 14775ae · 1 Parent(s): 203a723

update model card README.md

Files changed (1)
  1. README.md +17 -11
README.md CHANGED
@@ -16,12 +16,12 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.0241
-- Rouge1: 0.1639
+- Loss: 3.3303
+- Rouge1: 0.1597
 - Rouge2: 0.0
-- Rougel: 0.1337
-- Rougelsum: 0.1357
-- Gen Len: 11.4
+- Rougel: 0.1349
+- Rougelsum: 0.1334
+- Gen Len: 18.7
 
 ## Model description
 
@@ -46,22 +46,28 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 4
+- num_epochs: 10
 - mixed_precision_training: Native AMP
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
-| No log        | 1.0   | 6    | 4.2311          | 0.1365 | 0.0    | 0.1103 | 0.1102    | 12.1    |
-| No log        | 2.0   | 12   | 4.1437          | 0.1668 | 0.0    | 0.1321 | 0.1332    | 13.2    |
-| No log        | 3.0   | 18   | 4.0572          | 0.143  | 0.0    | 0.1152 | 0.1152    | 11.8    |
-| No log        | 4.0   | 24   | 4.0241          | 0.1639 | 0.0    | 0.1337 | 0.1357    | 11.4    |
+| No log        | 1.0   | 6    | 3.5039          | 0.185  | 0.0105 | 0.1573 | 0.1576    | 15.7    |
+| No log        | 2.0   | 12   | 3.4680          | 0.1915 | 0.0105 | 0.1747 | 0.174     | 16.9    |
+| No log        | 3.0   | 18   | 3.4331          | 0.1579 | 0.0105 | 0.1308 | 0.1282    | 17.4    |
+| No log        | 4.0   | 24   | 3.4049          | 0.1579 | 0.0105 | 0.1308 | 0.1282    | 17.8    |
+| No log        | 5.0   | 30   | 3.3817          | 0.1716 | 0.0091 | 0.1476 | 0.1434    | 18.5    |
+| No log        | 6.0   | 36   | 3.3638          | 0.1323 | 0.0    | 0.1176 | 0.116     | 17.1    |
+| No log        | 7.0   | 42   | 3.3497          | 0.1597 | 0.0    | 0.1349 | 0.1334    | 18.7    |
+| No log        | 8.0   | 48   | 3.3394          | 0.1597 | 0.0    | 0.1349 | 0.1334    | 18.7    |
+| No log        | 9.0   | 54   | 3.3332          | 0.1597 | 0.0    | 0.1349 | 0.1334    | 18.7    |
+| No log        | 10.0  | 60   | 3.3303          | 0.1597 | 0.0    | 0.1349 | 0.1334    | 18.7    |
 
 
 ### Framework versions
 
-- Transformers 4.28.1
+- Transformers 4.28.0
 - Pytorch 2.0.0+cu118
 - Datasets 2.12.0
 - Tokenizers 0.13.3
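
For context on the training setup recorded above, here is a minimal sketch of a `Seq2SeqTrainer` configuration that matches the hyperparameters in the updated card (seed 42, Adam with betas=(0.9,0.999) and epsilon=1e-08, a linear scheduler, 10 epochs, native AMP). The learning rate, batch sizes, dataset, and output path are not part of this diff, so they are left as placeholders rather than presented as the author's actual settings.

```python
# Sketch only: reproduces the hyperparameters visible in the card, with
# placeholders for everything this diff does not show.
from transformers import (
    AutoModelForSeq2SeqLM,
    AutoTokenizer,
    DataCollatorForSeq2Seq,
    Seq2SeqTrainer,
    Seq2SeqTrainingArguments,
)

model_name = "t5-small"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

training_args = Seq2SeqTrainingArguments(
    output_dir="t5-small-finetuned",   # placeholder output path
    num_train_epochs=10,               # num_epochs: 10
    lr_scheduler_type="linear",        # lr_scheduler_type: linear
    seed=42,                           # seed: 42
    adam_beta1=0.9,                    # optimizer: Adam with betas=(0.9,0.999)
    adam_beta2=0.999,
    adam_epsilon=1e-08,                # and epsilon=1e-08
    fp16=True,                         # mixed_precision_training: Native AMP
    evaluation_strategy="epoch",       # the card reports metrics once per epoch
    predict_with_generate=True,        # required so ROUGE sees generated text
)

# trainer = Seq2SeqTrainer(
#     model=model,
#     args=training_args,
#     train_dataset=train_ds,          # dataset is listed as "None" in the card
#     eval_dataset=eval_ds,
#     tokenizer=tokenizer,
#     data_collator=DataCollatorForSeq2Seq(tokenizer, model=model),
#     compute_metrics=compute_metrics, # see the ROUGE sketch below
# )
# trainer.train()
```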
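The Rouge1, Rouge2, Rougel, Rougelsum, and Gen Len columns in the results table follow the standard summarization fine-tuning recipe. The sketch below shows how such metrics are typically computed with the `evaluate` library; it mirrors the common `compute_metrics` pattern rather than the exact function used for this run, and it assumes the `tokenizer` from the sketch above.

```python
# Typical ROUGE + generation-length metrics for a Seq2SeqTrainer; not
# necessarily the exact implementation behind the numbers in the card.
import numpy as np
import evaluate

rouge = evaluate.load("rouge")

def compute_metrics(eval_preds):
    preds, labels = eval_preds
    if isinstance(preds, tuple):
        preds = preds[0]
    # Replace label padding (-100) so the tokenizer can decode references
    labels = np.where(labels != -100, labels, tokenizer.pad_token_id)
    decoded_preds = tokenizer.batch_decode(preds, skip_special_tokens=True)
    decoded_labels = tokenizer.batch_decode(labels, skip_special_tokens=True)

    result = rouge.compute(
        predictions=decoded_preds, references=decoded_labels, use_stemmer=True
    )
    # "Gen Len": mean number of non-padding tokens in the generated outputs
    pred_lens = [np.count_nonzero(p != tokenizer.pad_token_id) for p in preds]
    result["gen_len"] = float(np.mean(pred_lens))
    return {k: round(v, 4) for k, v in result.items()}
```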
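Once the checkpoint behind this commit is on the Hub, it can be loaded like any other seq2seq model. The repo id below is a placeholder (this page only shows the user name), and treating it as a summarization pipeline is an assumption based on the ROUGE and Gen Len metrics reported in the card.

```python
from transformers import pipeline

# Placeholder repo id: substitute the actual model id published from this commit.
summarizer = pipeline("summarization", model="bprateek/<model-repo>")
# The Gen Len of ~18.7 in the card suggests fairly short outputs.
print(summarizer("Text to summarize goes here...", max_length=20))
```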