Gorka Urbizu Garmendia committed
Commit e2f7ea6
1 Parent(s): 5b3edfc
Update README.md
README.md CHANGED
@@ -21,7 +21,7 @@ To train ElhBERTeu, we collected different corpora sources from several domains:
 |Others | 7M   |
 |Total  | 575M |
 
-ElhBERTeu is a base, uncased monolingual BERT model for Basque, with a vocab size of 50K
+ElhBERTeu is a base, uncased monolingual BERT model for Basque, with a vocab size of 50K, which has 124M parameters in total.
 
 ElhBERTeu was trained following the design decisions for [BERTeus](https://huggingface.co/ixa-ehu/berteus-base-cased). The tokenizer and the hyper-parameter settings remained the same, with the only difference being that the full pre-training of the model (1M steps) was performed with a sequence length of 512 on a v3-8 TPU.
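For context, a minimal sketch of how the figures added in this commit (50K vocabulary, 124M parameters) could be checked from Python, assuming the checkpoint is published on the Hugging Face Hub; the model ID `orai-nlp/ElhBERTeu` below is an assumption, not something stated in this commit:

```python
# Hedged sketch: load the checkpoint and recompute the numbers quoted in the
# README diff. The Hub ID below is assumed, not taken from this commit.
from transformers import AutoModel, AutoTokenizer

model_id = "orai-nlp/ElhBERTeu"  # assumed Hub ID; adjust to the real location

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModel.from_pretrained(model_id)

# Vocabulary size should come out around 50K.
print("vocab size:", tokenizer.vocab_size)

# Total parameter count should come out around 124M.
n_params = sum(p.numel() for p in model.parameters())
print(f"parameters: {n_params:,}")
```

The 124M total is consistent with BERT-base dimensions: the roughly 86M-parameter transformer body of a 12-layer, 768-hidden model plus a 50,000 × 768 ≈ 38M token-embedding matrix, which is why the count sits above the ~110M of an English BERT-base with its ~30K vocabulary.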