Joelito kapllan commited on
Commit
0af19e4
1 Parent(s): dfd4e9c

The learning rate was not displayed as it should. (#3)

Browse files

- The learning rate was not displayed as it should. (52b9f0d034e7c0c2627e41a35fc6b5d31b6cb8ca)


Co-authored-by: kapllan <kapllan@users.noreply.huggingface.co>

Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -107,7 +107,7 @@ For further details see [Niklaus et al. 2023](https://arxiv.org/abs/2306.02069?u
107
  - batche size: 512 samples
108
  - Number of steps: 1M/500K for the base/large model
109
  - Warm-up steps for the first 5\% of the total training steps
110
- - Learning rate: (linearly increasing up to) $1e\!-\!4$
111
  - Word masking: increased 20/30\% masking rate for base/large models respectively
112
 
113
  ## Evaluation
 
107
  - batche size: 512 samples
108
  - Number of steps: 1M/500K for the base/large model
109
  - Warm-up steps for the first 5\% of the total training steps
110
+ - Learning rate: (linearly increasing up to) 1e-4
111
  - Word masking: increased 20/30\% masking rate for base/large models respectively
112
 
113
  ## Evaluation