erfanzar/PGT-1B-2EP


erfanzar committed on Apr 26, 2023

Commit 950c245 (1 parent: ad8d01a)

Update README.md

Files changed (1)

README.md +10 -0


@@ -22,6 +22,16 @@ this model is only 1B but you can call it somehow an SOTA
 this model can also run on 4 GB GPU RAM and know dialogs as well
+### Train Parameters
+- learning-rate : 2e-4
+- scheduler : cosine lr
+- device : T4 GPU * 4
+- batch-size: AutoFind
+- train time : 12 hours
+- max sequence length: 1024
+- epochs : 2
 ## Usage Code
 ```python