data-silence committed on
Commit 5bc36ce
1 Parent(s): dee9e68

Update README.md

Files changed (1)
  1. README.md +15 -1
README.md CHANGED
@@ -197,4 +197,18 @@ The following hyperparameters were used during training:
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
  - lr_scheduler_warmup_steps: 500
- - num_epochs: 4
+ - num_epochs: 4
+
+ ### Training results at last epoch:
+
+ | Training Loss | Epoch | Step | Validation Loss |
+ |:-------------:|:-----:|:-----:|:---------------:|
+ | 0.4487 | 4.0 | 20496 | 0.2799 |
+
+
+ ### Framework versions
+
+ - Transformers 4.42.4
+ - Pytorch 2.3.1+cu121
+ - Datasets 2.21.0
+ - Tokenizers 0.19.1
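
The hyperparameters in this hunk (`lr_scheduler_type: linear`, `lr_scheduler_warmup_steps: 500`, `num_epochs: 4`) and the results row (step 20496 at epoch 4.0) are enough to sketch the shape of the learning-rate schedule. The following is a minimal, dependency-free sketch, not the training code behind this commit. It assumes that the logged step 20496 is the final optimizer step, and that the schedule follows the usual linear warmup then linear decay to zero. The function name `linear_lr_multiplier` and the constant names are hypothetical, and the result is a multiplier on the base learning rate, which is not shown in this hunk.

```python
# Values taken from the README diff above.
WARMUP_STEPS = 500    # lr_scheduler_warmup_steps
NUM_EPOCHS = 4        # num_epochs
TOTAL_STEPS = 20496   # assumption: the step logged at epoch 4.0 is the last step

# Optimizer steps per epoch implied by the results row.
STEPS_PER_EPOCH = TOTAL_STEPS // NUM_EPOCHS


def linear_lr_multiplier(step: int,
                         warmup_steps: int = WARMUP_STEPS,
                         total_steps: int = TOTAL_STEPS) -> float:
    """Return the factor applied to the base learning rate at `step`.

    Hypothetical helper: linear ramp from 0 to 1 during warmup,
    then linear decay from 1 to 0 at `total_steps`.
    """
    if step < warmup_steps:
        return step / max(1, warmup_steps)
    remaining = total_steps - step
    return max(0.0, remaining / max(1, total_steps - warmup_steps))


if __name__ == "__main__":
    print("steps per epoch:", STEPS_PER_EPOCH)
    for s in (0, 250, 500, STEPS_PER_EPOCH, TOTAL_STEPS):
        print(s, round(linear_lr_multiplier(s), 4))
```

Under these assumptions the multiplier reaches its peak after 500 steps, which is early in the first epoch, and decays for the remainder of training.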