euclaise commited on
Commit
029d608
·
1 Parent(s): ac382e2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -0
README.md CHANGED
@@ -19,6 +19,7 @@ Hyperparameters:
19
  - Cosine schedule with a warmup of 5400 steps
20
  - Batch size of 4 (2 real x 2 accumulated)
21
  - Maximum of 5 epochs, early stopping (visual observation), stopped after 3
 
22
 
23
  Training prompt format:
24
 
 
19
  - Cosine schedule with a warmup of 5400 steps
20
  - Batch size of 4 (2 real x 2 accumulated)
21
  - Maximum of 5 epochs, early stopping (visual observation), stopped after 3
22
+ - Gradient clipping norm value of 1.0
23
 
24
  Training prompt format:
25