nacielo commited on
Commit
e1b83a1
1 Parent(s): 2c8281c

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +20 -15
README.md CHANGED
@@ -16,12 +16,12 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 2.9734
20
- - Rouge1: 23.9184
21
- - Rouge2: 7.161
22
- - Rougel: 18.3653
23
- - Rougelsum: 18.3289
24
- - Gen Len: 45.86
25
 
26
  ## Model description
27
 
@@ -40,23 +40,28 @@ More information needed
40
  ### Training hyperparameters
41
 
42
  The following hyperparameters were used during training:
43
- - learning_rate: 1e-05
44
  - train_batch_size: 4
45
  - eval_batch_size: 4
46
  - seed: 42
47
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
  - lr_scheduler_type: linear
49
- - num_epochs: 5
50
 
51
  ### Training results
52
 
53
- | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
54
- |:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|:-------:|
55
- | 4.3541 | 1.0 | 1361 | 3.7286 | 31.3443 | 8.5461 | 22.9152 | 22.8633 | 44.4 |
56
- | 3.6252 | 2.0 | 2722 | 3.2950 | 25.3846 | 6.7268 | 19.4638 | 19.3924 | 40.12 |
57
- | 3.3709 | 3.0 | 4083 | 3.1073 | 26.3761 | 7.1947 | 19.4663 | 19.4253 | 44.76 |
58
- | 3.2605 | 4.0 | 5444 | 3.0024 | 23.8355 | 7.1978 | 18.3755 | 18.3194 | 45.61 |
59
- | 3.2417 | 5.0 | 6805 | 2.9734 | 23.9184 | 7.161 | 18.3653 | 18.3289 | 45.86 |
 
 
 
 
 
60
 
61
 
62
  ### Framework versions
 
16
 
17
  This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 2.9826
20
+ - Rouge1: 41.7849
21
+ - Rouge2: 16.827
22
+ - Rougel: 30.2876
23
+ - Rougelsum: 30.3452
24
+ - Gen Len: 33.09
25
 
26
  ## Model description
27
 
 
40
  ### Training hyperparameters
41
 
42
  The following hyperparameters were used during training:
43
+ - learning_rate: 1e-06
44
  - train_batch_size: 4
45
  - eval_batch_size: 4
46
  - seed: 42
47
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
  - lr_scheduler_type: linear
49
+ - num_epochs: 10
50
 
51
  ### Training results
52
 
53
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
54
+ |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
55
+ | 3.934 | 1.0 | 983 | 3.2674 | 39.1931 | 15.4549 | 29.0372 | 29.0798 | 33.06 |
56
+ | 3.4613 | 2.0 | 1966 | 3.1359 | 39.1184 | 16.7176 | 30.4367 | 30.485 | 30.61 |
57
+ | 3.3732 | 3.0 | 2949 | 3.0870 | 37.745 | 15.4165 | 29.4667 | 29.4787 | 28.73 |
58
+ | 3.3173 | 4.0 | 3932 | 3.0563 | 42.9662 | 18.0767 | 31.2519 | 31.3076 | 33.5 |
59
+ | 3.308 | 5.0 | 4915 | 3.0260 | 42.4731 | 17.6266 | 30.6927 | 30.7491 | 33.75 |
60
+ | 3.2768 | 6.0 | 5898 | 3.0119 | 41.9782 | 17.0076 | 30.2951 | 30.3719 | 33.35 |
61
+ | 3.2744 | 7.0 | 6881 | 2.9986 | 42.209 | 17.1482 | 30.7224 | 30.7751 | 33.41 |
62
+ | 3.2581 | 8.0 | 7864 | 2.9907 | 42.1508 | 17.2078 | 30.638 | 30.7119 | 33.24 |
63
+ | 3.236 | 9.0 | 8847 | 2.9853 | 41.8212 | 17.0602 | 30.3253 | 30.3585 | 33.08 |
64
+ | 3.2526 | 10.0 | 9830 | 2.9826 | 41.7849 | 16.827 | 30.2876 | 30.3452 | 33.09 |
65
 
66
 
67
  ### Framework versions