bilalhsp committed on
Commit
4ab40cc
1 Parent(s): 4e4d2e5

End of training

Files changed (1)
  1. README.md +1 -4
README.md CHANGED
@@ -38,9 +38,6 @@ The following hyperparameters were used during training:
  - eval_batch_size: 16
  - seed: 42
  - distributed_type: multi-GPU
- - num_devices: 4
- - total_train_batch_size: 64
- - total_eval_batch_size: 64
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
  - lr_scheduler_warmup_ratio: 0.1
@@ -51,7 +48,7 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss | Contrastive Loss | Diversity Loss |
  |:-------------:|:-----:|:----:|:---------------:|:----------------:|:--------------:|
- | No log | 1.0 | 2 | 1793.5885 | 1763.9588 | 296.2975 |
+ | No log | 1.0 | 8 | 1965.5140 | 1933.3790 | 321.3505 |
 
 
  ### Framework versions
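
Note on the two hunks: the removed lines report 4 devices and a total eval batch size of 64, which matches an `eval_batch_size` of 16 per device times 4 devices. The step count at epoch 1.0 also rises from 2 to 8. That would be consistent with the effective batch size shrinking by the same factor of 4, for example if the run was repeated on a single device with the same data, but the diff does not confirm this. The sketch below only checks that arithmetic. The function names, the dataset size of 128, and the assumption of no gradient accumulation are illustrative and do not come from the commit.

```python
import math


def total_batch_size(per_device_batch_size: int, num_devices: int,
                     grad_accum_steps: int = 1) -> int:
    """Effective batch size across devices and accumulation steps."""
    return per_device_batch_size * num_devices * grad_accum_steps


def steps_per_epoch(num_examples: int, total_batch: int) -> int:
    """Optimizer steps needed to see every example once."""
    return math.ceil(num_examples / total_batch)


# Values taken from the removed README lines.
NUM_DEVICES = 4
TOTAL_TRAIN_BATCH = 64
EVAL_BATCH_PER_DEVICE = 16

# Per-device train batch size implied by the removed lines,
# ASSUMING no gradient accumulation (not stated in the diff).
per_device_train = TOTAL_TRAIN_BATCH // NUM_DEVICES

# HYPOTHETICAL dataset size, chosen only so the arithmetic
# reproduces the step counts shown in the results table.
num_examples = 128

total_eval = total_batch_size(EVAL_BATCH_PER_DEVICE, NUM_DEVICES)
steps_multi_gpu = steps_per_epoch(num_examples, TOTAL_TRAIN_BATCH)
steps_single_device = steps_per_epoch(
    num_examples, total_batch_size(per_device_train, 1)
)

print(total_eval, steps_multi_gpu, steps_single_device)
```

Because the step counts differ, the two table rows come from runs with different numbers of optimizer updates, so their validation losses are not a like-for-like comparison.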