cmarkea/bloomz-560m-sft-chat

Cyrile committed on Sep 20, 2023

Commit 87b0708 · 1 Parent(s): 51577f4

Update README.md

Files changed (1)

README.md +2 -2

@@ -53,7 +53,7 @@ Here is the table summarizing the architecture used for training, along with the
 |     Hyperparameter    |    Value   |
 |:---------------------:|:----------:|
 |       label smoothing | 0.05       |
-|              optimize | AdamW      |
+|             optimizer | AdamW      |
 |                 betas | 0.9, 0.999 |
 |               AMSGrad | True       |
 |         learning rate | 5e-4       |
@@ -112,6 +112,6 @@ Citation
   AUTHOR = {Cyrile Delestre},
   URL = {https://huggingface.co/cmarkea/bloomz-560m-sft-chat},
   YEAR = {2023},
-  KEYWORDS = {NLP ; Transformers ; Bloomz},
+  KEYWORDS = {NLP ; Transformers ; LLM ; Bloomz},
 }
 ```