lvwerra HF staff commited on
Commit
8791cbe
1 Parent(s): 93a7017

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -6
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [google/pegasus-cnn_dailymail](https://huggingface.co/google/pegasus-cnn_dailymail) on the samsum dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 1.4678
19
 
20
  ## Model description
21
 
@@ -41,17 +41,23 @@ The following hyperparameters were used during training:
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
  - lr_scheduler_warmup_steps: 500
44
- - num_epochs: 0.2
45
 
46
  ### Training results
47
 
48
  | Training Loss | Epoch | Step | Validation Loss |
49
  |:-------------:|:-----:|:----:|:---------------:|
50
  | 1.6092 | 0.03 | 500 | 1.6488 |
51
- | 1.9246 | 0.07 | 1000 | 1.5353 |
52
- | 1.6674 | 0.1 | 1500 | 1.5057 |
53
- | 1.9026 | 0.14 | 2000 | 1.4848 |
54
- | 1.2869 | 0.17 | 2500 | 1.4678 |
 
 
 
 
 
 
55
 
56
 
57
  ### Framework versions
15
 
16
  This model is a fine-tuned version of [google/pegasus-cnn_dailymail](https://huggingface.co/google/pegasus-cnn_dailymail) on the samsum dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 1.4177
19
 
20
  ## Model description
21
 
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
  - lr_scheduler_warmup_steps: 500
44
+ - num_epochs: 0.4
45
 
46
  ### Training results
47
 
48
  | Training Loss | Epoch | Step | Validation Loss |
49
  |:-------------:|:-----:|:----:|:---------------:|
50
  | 1.6092 | 0.03 | 500 | 1.6488 |
51
+ | 1.9715 | 0.07 | 1000 | 1.5444 |
52
+ | 1.8325 | 0.1 | 1500 | 1.5093 |
53
+ | 1.876 | 0.14 | 2000 | 1.4890 |
54
+ | 1.3081 | 0.17 | 2500 | 1.4737 |
55
+ | 1.7769 | 0.2 | 3000 | 1.4496 |
56
+ | 1.6276 | 0.24 | 3500 | 1.4430 |
57
+ | 1.6624 | 0.27 | 4000 | 1.4288 |
58
+ | 1.9202 | 0.31 | 4500 | 1.4235 |
59
+ | 1.4404 | 0.34 | 5000 | 1.4189 |
60
+ | 1.8016 | 0.37 | 5500 | 1.4177 |
61
 
62
 
63
  ### Framework versions