Commit 48d184f by blair-johnson (parent: 8e49fdb)

Update README.md

Files changed (1):
  1. README.md +1 -1
README.md CHANGED
@@ -41,7 +41,7 @@ TODO: add example inference usage.
 
 ## Training Resources
 
-GALPACA 30B was fine-tuned in about 6 hours using 16 A100 80GB GPUs at an effective batch size of 1024 and with a maximum context window of 384 tokens. This model was trained using DeepSpeed Stage 3 optimizations.
+GALPACA 30B was fine-tuned in about 6 hours using 16 A100 80GB GPUs with 16-bit mixed precision, at an effective batch size of 1024 and with a maximum context window of 384 tokens. This model was trained using DeepSpeed Stage 3 optimizations.
 
 ## Performance and Limitations
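The training setup the diff describes (DeepSpeed ZeRO Stage 3, 16-bit mixed precision, effective batch size 1024 on 16 GPUs) maps onto a DeepSpeed JSON config along these lines. This is an illustrative sketch, not the authors' actual configuration: the micro-batch size, gradient accumulation split, and the choice of bf16 over fp16 are assumptions (the README only says "16-bit mixed-precision").

```json
{
  "bf16": { "enabled": true },
  "zero_optimization": {
    "stage": 3,
    "overlap_comm": true,
    "contiguous_gradients": true
  },
  "train_batch_size": 1024,
  "train_micro_batch_size_per_gpu": 4,
  "gradient_accumulation_steps": 16
}
```

With 16 GPUs, a micro-batch of 4 per GPU and 16 accumulation steps gives 16 × 4 × 16 = 1024, matching the stated effective batch size; other splits that multiply out to 1024 would work equally well.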