Update README.md
README.md
CHANGED
@@ -19,6 +19,7 @@ Hyperparameters:
 - Cosine schedule with a warmup of 5400 steps
 - Batch size of 4 (2 real x 2 accumulated)
 - Maximum of 5 epochs, early stopping (visual observation), stopped after 3
+- Gradient clipping norm value of 1.0
 
 Training prompt format:
 
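For readers who want to see how the listed hyperparameters fit together, here is a minimal pure-Python sketch. It is not the author's training code. The linear warmup shape, the decay to zero, the use of the L2 norm for clipping, and the names `lr_at`, `clip_by_global_norm`, `effective_batch_size`, `base_lr`, and `total_steps` are all assumptions made for illustration. Only the warmup length (5400), the batch split (2 x 2), and the clipping value (1.0) come from the README.

```python
import math

# Values taken from the README hyperparameter list.
WARMUP_STEPS = 5400   # "warmup of 5400 steps"
MAX_GRAD_NORM = 1.0   # "Gradient clipping norm value of 1.0"
MICRO_BATCH = 2       # "2 real"
ACCUM_STEPS = 2       # "2 accumulated"


def effective_batch_size(micro_batch=MICRO_BATCH, accum_steps=ACCUM_STEPS):
    """Samples contributing to one optimizer step under gradient accumulation."""
    return micro_batch * accum_steps


def lr_at(step, total_steps, base_lr, warmup_steps=WARMUP_STEPS):
    """Learning rate at `step`.

    Assumption: linear warmup from 0 to base_lr, then cosine decay to 0.
    base_lr and total_steps are hypothetical; the README excerpt does not give them.
    """
    if step < warmup_steps:
        return base_lr * step / warmup_steps
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    progress = min(1.0, progress)
    return base_lr * 0.5 * (1.0 + math.cos(math.pi * progress))


def clip_by_global_norm(grads, max_norm=MAX_GRAD_NORM):
    """Rescale a flat list of gradient values so its L2 norm is at most max_norm.

    Returns the (possibly rescaled) gradients and the norm before clipping.
    """
    total = math.sqrt(sum(g * g for g in grads))
    if total <= max_norm or total == 0.0:
        return list(grads), total
    scale = max_norm / total
    return [g * scale for g in grads], total
```

In a real training loop a framework utility would normally do the clipping once per optimizer step, after the accumulated micro-batches have been summed. The function above only shows the arithmetic.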