raduion/bert-medium-luxembourgish


raduion committed on Jan 9, 2022

Commit f8edff3 · 1 Parent(s): ec2adc0

Update README.md

Files changed (1)

README.md +7 -1


@@ -3,4 +3,10 @@
 Created from a dataset with 1M Luxembourgish sentences from Wikipedia. Corpus has approx. 16M words.
 MLM objective was trained.
 The BERT model has parameters `L=8` and `H=512`.
-Vocabulary has 70K word pieces.

+Vocabulary has 70K word pieces.
+Final loss scores, after 3 epochs:
+Final train loss: 4.230
+  Final train perplexity: 68.726
+  Final validation loss: 4.074
+  Final validation perplexity: 58.765
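
The reported perplexities look consistent with the usual relation perplexity = exp(loss). The sketch below checks this, assuming the loss is a mean per-token cross-entropy in nats; the README does not state this, and the helper name `perplexity` is illustrative only. Because the losses are rounded to three decimals, the recomputed values only match the reported ones approximately.

```python
import math


def perplexity(loss: float) -> float:
    """Return exp(loss), assuming loss is a mean cross-entropy in nats."""
    return math.exp(loss)


# Loss values as reported in the README (rounded to three decimals).
train_ppl = perplexity(4.230)
val_ppl = perplexity(4.074)

# Compare against the reported perplexities of 68.726 and 58.765.
print(f"train perplexity: {train_ppl:.2f}")
print(f"validation perplexity: {val_ppl:.2f}")
```

Both recomputed values fall within about 0.05 of the reported figures, which is what rounding the loss to three decimals would explain.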