armahlovis/Finetuned

armahlovis committed on Feb 16, 2023

Commit 68929ff • 1 Parent(s): e611c8d

Update README.md

Files changed (1)

README.md +10 -3

@@ -15,15 +15,15 @@ This model is fine tunned on GPT2 to  generate text following  the writings of W
 ## Model Description
-The model is designed to be finned tunning with writting from Historical black black writers who are wrote on freedom and emancipation. The first version is  fine tunned
-on GPT2 to  generate text following  the writings of W. E. Burghardt Du Bois.
 - **Developed by:** [More Information Needed]
 - **Shared by [optional]:** [More Information Needed]
 - **Model type:** [More Information Needed]
-- **Language(s) (NLP):** [More Information Needed]
 - **License:** [More Information Needed]
 - **Finetuned from model [optional]:** [More Information Needed]
@@ -99,6 +99,13 @@ Use the code below to get started with the model.
 ### Training Hyperparameters
 - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
 ### Speeds, Sizes, Times [optional]

 ## Model Description
+The model is designed to be fine-tuned on writings by historical Black writers who wrote on freedom and emancipation. In this first version, GPT2
+is fine-tuned on the writings of W. E. Burghardt Du Bois.
 - **Developed by:** [More Information Needed]
 - **Shared by [optional]:** [More Information Needed]
 - **Model type:** [More Information Needed]
+- **Language(s) (NLP):** English
 - **License:** [More Information Needed]
 - **Finetuned from model [optional]:** [More Information Needed]
 ### Training Hyperparameters
 - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
+- Num examples = 1005
+- Num Epochs = 3
+- Instantaneous batch size per device = 8
+- Total train batch size (w. parallel, distributed & accumulation) = 8
+- Gradient Accumulation steps = 1
+- Total optimization steps = 378
+- Number of trainable parameters = 124439808
 ### Speeds, Sizes, Times [optional]
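
The training figures added in this commit can be cross-checked with a little arithmetic. The sketch below is not the author's training script. It assumes that the last partial batch of each epoch is kept and that the base checkpoint uses the standard small GPT-2 dimensions (vocabulary 50257, 1024 positions, width 768, 12 layers, tied output embeddings), none of which the README states explicitly. The function names are hypothetical.

```python
import math


def total_optimization_steps(num_examples, total_batch_size, num_epochs):
    """Optimizer steps, assuming the final partial batch of each epoch is kept."""
    steps_per_epoch = math.ceil(num_examples / total_batch_size)
    return steps_per_epoch * num_epochs


def gpt2_parameter_count(vocab_size=50257, n_positions=1024, n_embd=768, n_layer=12):
    """Parameter count for a GPT-2 style decoder with tied input/output embeddings.

    The default dimensions are an assumption (small GPT-2), not taken from the README.
    """
    # Token and position embedding tables.
    embeddings = vocab_size * n_embd + n_positions * n_embd
    # One layer norm holds a weight and a bias vector.
    layer_norm = 2 * n_embd
    # Attention: fused QKV projection plus output projection, each with bias.
    attention = (n_embd * 3 * n_embd + 3 * n_embd) + (n_embd * n_embd + n_embd)
    # Feed-forward: expand to 4 * n_embd, then project back, each with bias.
    mlp = (n_embd * 4 * n_embd + 4 * n_embd) + (4 * n_embd * n_embd + n_embd)
    per_layer = 2 * layer_norm + attention + mlp
    # Final layer norm after the last block.
    return embeddings + n_layer * per_layer + layer_norm


steps = total_optimization_steps(num_examples=1005, total_batch_size=8, num_epochs=3)
params = gpt2_parameter_count()
print(f"steps={steps}, params={params}")
```

Under these assumptions both values agree with the logged numbers (378 optimization steps and 124439808 trainable parameters), which suggests the full small GPT-2 model was trained with no frozen layers.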