lightonai
/

alfred-40b-0723

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

WeightsnWizardry commited on Jul 31, 2023

Commit

73bed7a

·

1 Parent(s): feb92bd

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -96,7 +96,7 @@ Alfred-40B-0723 was trained on a mixture of publicly available and in-house cura
 ### Training Procedure
-`Alfred-40B-0723` was trained on 128 A100 40GB GPUs, using a 3D parallelism strategy (TP=8, PP=4, DP=4) combined with ZeRO.
 #### Preprocessing

 ### Training Procedure
+`Alfred-40B-0723` was trained on 128 A100 40GB GPUs, using a 3D parallelism strategy (TP=8, PP=4, DP=4) combined with ZeRO. The value model is initialized from the reward model and does not have any shared parameters with the policy network.
 #### Preprocessing