Update README.md
Browse files
README.md
CHANGED
@@ -49,7 +49,7 @@ The following hyperparameters were used during training:
|
|
49 |
- optimizer: adamw_bnb_8bit
|
50 |
- lr_scheduler_type: linear
|
51 |
- lr_scheduler_warmup_steps: 15000
|
52 |
-
- training_steps:
|
53 |
- mixed_precision_training: True
|
54 |
|
55 |
## Acknowledgement
|
|
|
49 |
- optimizer: adamw_bnb_8bit
|
50 |
- lr_scheduler_type: linear
|
51 |
- lr_scheduler_warmup_steps: 15000
|
52 |
+
- training_steps: 35808 (terminated upon convergence. Initially set to 89520 steps)
|
53 |
- mixed_precision_training: True
|
54 |
|
55 |
## Acknowledgement
|