Update README.md
README.md CHANGED

@@ -28,4 +28,5 @@ We finetuned the `wte` and `wpe` layers of GPT-2 (while freezing the parameters
 - max_eval_samples: 5000
 ```
 
-Setup: 8 RTX-3090 GPUs, trained for seven days (total training steps: 110500, effective train batch size: 64, tokens per batch: 1024)
+Setup: 8 RTX-3090 GPUs, trained for seven days (total training steps: 110500, effective train batch size: 64, tokens per batch: 1024)
+Final checkpoint: checkpoint-111500
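For context on the setup the README describes, here is a minimal sketch of the freezing step, assuming the Hugging Face transformers `GPT2LMHeadModel` (the repository's actual training script is not shown in this diff): only the `wte` and `wpe` embedding layers are left trainable.

```python
# Minimal sketch, assuming Hugging Face transformers: freeze all GPT-2
# parameters except the token embeddings (`wte`) and position embeddings
# (`wpe`), as described in the README.
from transformers import GPT2LMHeadModel

model = GPT2LMHeadModel.from_pretrained("gpt2")

for name, param in model.named_parameters():
    # Keep only the embedding matrices trainable; freeze everything else.
    param.requires_grad = name.startswith(("transformer.wte.", "transformer.wpe."))

trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
total = sum(p.numel() for p in model.parameters())
print(f"Trainable parameters: {trainable} / {total}")
```

Note that GPT-2 ties `lm_head` to `wte`, so the output projection is updated together with the token embeddings.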