satjawat committed on
Commit c0634e7
1 Parent(s): 318ace7

Update README.md

Files changed (1)
  1. README.md +16 -15
README.md CHANGED
@@ -83,21 +83,22 @@ print("Generated Summary:", summary)
 
  # Training hyperparameters
  The following hyperparameters were used during training:
- <ul>
- <li>accumulation_steps:2</li>
- <li>num_epochs:20</li>
- <li>num_beams:6</li>
- <li>learning_rate:lr=5e-5</li>
- <li>optimizer:AdamW with betas=(0.9,0.999) and epsilon=1e-08</li>
- <li>activation_function:gelu</li>
- <li>add_bias_logits:True</li>
- <li>normalize_embedding:True</li>
- <li>add_final_layer_norm:False </li>
- <li>normalize_before:False</li>
- </ul>
-
-
-
+ - accumulation_steps: 2
+ - num_epochs: 20
+ - num_beams: 6
+ - learning_rate: lr=5e-5
+ - optimizer: AdamW with betas=(0.9,0.999) and epsilon=1e-08
+ - activation_function: gelu
+ - add_bias_logits: True
+ - normalize_embedding: True
+ - add_final_layer_norm: False
+ - normalize_before: False
+
+ # Score
+ Evaluate the model with the test dataset of <a href="https://github.com/nakhunchumpolsathien/ThaiSum">ThaiSum</a>, consisting of a total of 11,000 articles, with the following scores:
+ - Rouge1: 0.490279
+ - Rouge2: 0.289839
+ - Rougel: 0.489334
 
 
 
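
For readers unfamiliar with the Rouge1 and Rouge2 figures added in this commit, the following is a minimal sketch of how a ROUGE-N score can be computed for one reference/candidate pair. It assumes the reported values are F1 scores and that the text is already split into tokens; the README states neither, and the function names `ngrams` and `rouge_n_f1` and the English sample sentences are hypothetical. It is not the evaluation script used for the model, and it does not cover RougeL, which is based on the longest common subsequence rather than fixed-size n-grams.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return a multiset (Counter) of the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n_f1(reference_tokens, candidate_tokens, n=1):
    """ROUGE-N F1: harmonic mean of clipped n-gram precision and recall."""
    ref = ngrams(reference_tokens, n)
    cand = ngrams(candidate_tokens, n)
    # Counter intersection keeps the minimum count per n-gram (clipped matches).
    overlap = sum((ref & cand).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


# Hypothetical pre-tokenized pair. Thai text has no spaces between words,
# so a word segmenter would be needed before this step (assumption).
reference = ["the", "cat", "sat", "on", "the", "mat"]
candidate = ["the", "cat", "lay", "on", "the", "mat"]

print("ROUGE-1 F1:", rouge_n_f1(reference, candidate, n=1))
print("ROUGE-2 F1:", rouge_n_f1(reference, candidate, n=2))
```

A corpus-level figure such as the ones listed under Score would typically be an aggregate of per-article scores over the test set; the exact aggregation and tokenizer are not given in the README.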