andreaskoepf
commited on
Commit
•
78cc424
1
Parent(s):
38da841
Update README.md
Browse files
README.md
CHANGED
@@ -56,4 +56,6 @@ pythia-6.9b-pretrain:
|
|
56 |
per_device_eval_batch_size: 8
|
57 |
num_train_epochs: 1
|
58 |
save_total_limit: 2
|
59 |
-
```
|
|
|
|
56 |
per_device_eval_batch_size: 8
|
57 |
num_train_epochs: 1
|
58 |
save_total_limit: 2
|
59 |
+
```
|
60 |
+
|
61 |
+
command: `deepspeed trainer_sft.py --configs defaults pretrain pythia-6.9b-pretrain --cache_dir .cache/ --output_dir .saved_models/pythia-6.9b-pre --residual_dropout 0.0 --deepspeed`
|