Update README.md
Browse files
README.md
CHANGED
@@ -8,7 +8,7 @@ per_device_train_batch_size=16
|
|
8 |
gradient_accumulation_steps=1
|
9 |
total train batch size (w. parallel, distributed & accumulation) = 16
|
10 |
learning_rate=3e-5
|
11 |
-
num_train_epochs=
|
12 |
```
|
13 |
|
14 |
## Evaluation results
|
|
|
8 |
gradient_accumulation_steps=1
|
9 |
total train batch size (w. parallel, distributed & accumulation) = 16
|
10 |
learning_rate=3e-5
|
11 |
+
num_train_epochs=3
|
12 |
```
|
13 |
|
14 |
## Evaluation results
|