danfu09 commited on
Commit
c076b6d
1 Parent(s): 870f6da

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -121,7 +121,7 @@ Please refer to [togethercomputer/RedPajama-Data-1T](https://huggingface.co/data
121
 
122
  - **Hardware:** 512 nodes of 6xV100 (IBM Power9), on the OLCF Summit cluster
123
  - **Optimizer:** Apex FusedAdam
124
- - **Parallelism:** Pipeline parallel 12, model parallel 2
125
  - **Gradient Accumulations**: 8 (global batch size 4M tokens)
126
  - **Num of Tokens:** 800B Tokens
127
  - **Learning rate:** 0.00012
 
121
 
122
  - **Hardware:** 512 nodes of 6xV100 (IBM Power9), on the OLCF Summit cluster
123
  - **Optimizer:** Apex FusedAdam
124
+ - **Parallelism:** Pipeline parallel 12, tensor parallel 2
125
  - **Gradient Accumulations**: 8 (global batch size 4M tokens)
126
  - **Num of Tokens:** 800B Tokens
127
  - **Learning rate:** 0.00012