Update README.md
README.md CHANGED
```diff
@@ -27,14 +27,14 @@ Tokenizer:
 Training details:
 
 * Training started on step 360K (bs 16) ppl 21 of earlier model trained with Adam optimizer.
-* Training at step 1100K of 2082K (
+* Training at step 1100K (53%) of 2082K (bs 32) ppl 15,1
 * Block size: 512
 * Optimizer: adafactor
 * Learning rate: 3.3e-5
 * Batch size: 32
 * Warmup steps: 5000
 
-
+Jan 2022
 
 * Many thanks to the [Google TPU Research Cloud](https://sites.research.google/trc/about/) for providing access to a TPU cluster!
 * Thanks to @gsarti for creating the [t5-flax-gcp
```
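As a quick sanity check of the figures in the updated line (a minimal sketch; `ppl 15,1` is read here as perplexity 15.1 written with a decimal comma, which is an interpretation, not stated in the README), the quoted 53% progress and the cross-entropy loss implied by the perplexity can be recomputed:

```python
import math

# Progress quoted in the README diff: step 1100K of 2082K total steps.
step, total_steps = 1_100_000, 2_082_000
progress = step / total_steps
print(f"{progress:.0%}")  # matches the "(53%)" in the updated line

# Perplexity is exp(cross-entropy loss), so ppl 15.1 implies loss ln(15.1).
loss = math.log(15.1)
print(round(loss, 2))
```

This also illustrates why the drop from ppl 21 (the earlier Adam-trained checkpoint) to ppl 15.1 is meaningful: it corresponds to a loss reduction from about ln(21) ≈ 3.04 to about 2.71 nats per token.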