- **License:** llama2
- **Context window length:** 4096 tokens

### Training details

For training, we apply the standard recipe: learning rate 1e-5, per-GPU batch size 6, and the AdamW optimizer without weight decay. The model is trained with ZeRO-3 on a cluster of 64 H100 GPUs.
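As a rough illustration of the recipe above, the following sketches what a DeepSpeed-style ZeRO-3 configuration with these hyperparameters could look like. The actual configuration file has not been released, so everything beyond the stated values (lr 1e-5, per-GPU batch size 6, AdamW without weight decay, ZeRO-3, 64 GPUs) — including the bf16 setting and the absence of gradient accumulation — is an assumption.

```python
# Illustrative DeepSpeed-style ZeRO-3 config mirroring the hyperparameters
# stated in the README; not the released GOAT-70B-Storytelling config.

NUM_GPUS = 64       # 64 x H100 cluster
PER_GPU_BATCH = 6   # micro-batch size per GPU

ds_config = {
    "train_micro_batch_size_per_gpu": PER_GPU_BATCH,
    # Effective global batch: 6 * 64 = 384 (assumes no gradient accumulation).
    "train_batch_size": PER_GPU_BATCH * NUM_GPUS,
    "optimizer": {
        "type": "AdamW",
        "params": {
            "lr": 1e-5,
            "weight_decay": 0.0,  # AdamW without weight decay, as stated
        },
    },
    "zero_optimization": {
        # ZeRO stage 3: shard parameters, gradients, and optimizer states
        # across all data-parallel ranks.
        "stage": 3,
    },
    "bf16": {"enabled": True},  # assumption: bf16 is typical on H100
}

print(ds_config["train_batch_size"])  # → 384
```

With ZeRO-3, the 70B parameter, gradient, and optimizer-state tensors are partitioned across the 64 ranks rather than replicated, which is what makes full fine-tuning of a model this size feasible on a single cluster.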

### Learn more

- **Blog:** TBA