Mirelle/opt-125M-pt-br-finetuned

Mirelle committed on May 21, 2023

Commit bfb82c3 • 1 Parent(s): aa0bb09

Update README.md

Files changed (1)

README.md +9 -2

@@ -15,9 +15,16 @@ pipeline_tag: text-generation
 ---
 # OPT-125M finetuned Portuguese
-Fine-tuning the [OPT-125M](https://huggingface.co/facebook/opt-125m) model on a reduced corpus of MC4-Portuguese with approximately 300M tokens.
-In this training a sequence length of 512 tokens was used, batch of 32 for 2 epochs.
+Fine-tuning the [OPT-125M](https://huggingface.co/facebook/opt-125m) model on a reduced corpus of mc4-Portuguese with approximately 300M tokens.
+###### Hyper-parameters
+- learning_rate = 5e-5
+- batch_size = 32
+- warmup = 500
+- seq_length = 512
+- num_train_epochs = 2.0
 With an A100 with 40GB of RAM, the training took around 3 hours
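
To put the hyper-parameters added in this commit in context, the following sketch estimates the training schedule they imply. It assumes several things the README does not state: that the roughly 300M-token corpus is packed into full 512-token sequences, that `batch_size` is the global batch with no gradient accumulation, and that `warmup` counts optimizer steps. `FinetuneConfig` and `estimate_schedule` are hypothetical names used only for illustration; they are not part of the model repository.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FinetuneConfig:
    """Hyper-parameters copied from the README (hypothetical container class)."""
    learning_rate: float = 5e-5
    batch_size: int = 32           # assumed: global batch, no gradient accumulation
    warmup: int = 500              # assumed: number of warmup optimizer steps
    seq_length: int = 512
    num_train_epochs: float = 2.0


def estimate_schedule(cfg, corpus_tokens=300_000_000, train_hours=3.0):
    """Rough schedule estimate from the README's approximate figures.

    corpus_tokens (~300M) and train_hours (~3) are the approximate values
    quoted in the README, so every result here is an estimate.
    """
    tokens_per_step = cfg.batch_size * cfg.seq_length
    # Assumed: the corpus is packed into full sequences; the remainder is dropped.
    steps_per_epoch = corpus_tokens // tokens_per_step
    total_steps = int(steps_per_epoch * cfg.num_train_epochs)
    tokens_seen = total_steps * tokens_per_step
    return {
        "tokens_per_step": tokens_per_step,
        "steps_per_epoch": steps_per_epoch,
        "total_steps": total_steps,
        "warmup_fraction": cfg.warmup / total_steps,
        "tokens_per_second": tokens_seen / (train_hours * 3600),
    }


if __name__ == "__main__":
    for name, value in estimate_schedule(FinetuneConfig()).items():
        print(f"{name}: {value}")
```

Under these assumptions the run amounts to a few tens of thousands of optimizer steps, with warmup covering a little over 1% of them.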