---
license: apache-2.0
language:
- yo
---

# oyo-teams-base-discriminator
OYO-BERT (or Oyo-dialect of Yoruba BERT) was created by pre-training a [TEAMS model based on the ELECTRA architecture](https://aclanthology.org/2021.findings-acl.219/) on Yoruba-language texts for about 100K steps.
It was trained with the ELECTRA-base architecture configuration using the [TensorFlow Model Garden](https://github.com/tensorflow/models/tree/master/official/projects).

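As a rough illustration of the replaced-token-detection objective that ELECTRA-style models such as TEAMS are trained on, the toy sketch below corrupts some input tokens and builds the per-position labels a discriminator learns to predict. This is not the actual training code; the function name, the sampling scheme, and the tiny Yoruba vocabulary are all illustrative.

```python
import random

def make_rtd_example(tokens, vocab, mask_rate=0.15, seed=0):
    """Build one replaced-token-detection (RTD) training pair:
    a corrupted copy of `tokens` plus per-position labels saying
    which tokens the discriminator should flag as replaced."""
    rng = random.Random(seed)
    corrupted, labels = [], []
    for tok in tokens:
        if rng.random() < mask_rate:
            # In ELECTRA/TEAMS a small generator proposes a plausible
            # substitute; here we just sample a different vocab item.
            corrupted.append(rng.choice([v for v in vocab if v != tok]))
            labels.append(1)  # replaced -> discriminator target 1
        else:
            corrupted.append(tok)
            labels.append(0)  # kept -> discriminator target 0
    return corrupted, labels

# Toy example (illustrative tokens, not the real tokenizer output)
tokens = "mo nífẹ̀ẹ́ èdè yorùbá gan-an".split()
vocab = tokens + ["ilé", "omi", "ọjà"]
corrupted, labels = make_rtd_example(tokens, vocab, mask_rate=0.5, seed=3)
```

The discriminator released here is the network trained to output those 0/1 labels; unlike BERT's masked-language-modeling loss, every position contributes to the objective, which is what makes ELECTRA-style pre-training sample-efficient.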
### Pre-training corpus
A mix of WURA, Wikipedia, and MT560 Yoruba data.

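One simple way to combine several sources like these into a single pre-training stream is round-robin interleaving. The sketch below is only illustrative of that idea, not the pipeline actually used for this model, and the corpus lines are made up.

```python
def interleave_corpora(*corpora):
    """Round-robin over several line/document sources until all are
    exhausted, yielding one mixed pre-training stream in which no
    single corpus dominates the start of training."""
    iterators = [iter(c) for c in corpora]
    while iterators:
        for it in list(iterators):
            try:
                yield next(it)
            except StopIteration:
                iterators.remove(it)  # this corpus is used up

# Illustrative stand-ins for the three corpora
wura = ["wura-1", "wura-2", "wura-3"]
wiki = ["wiki-1"]
mt560 = ["mt560-1", "mt560-2"]
mixed = list(interleave_corpora(wura, wiki, mt560))
```

Every line from each source appears exactly once in `mixed`, with the sources alternating until the shorter ones run out.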
### Acknowledgment
We thank [@stefan-it](https://github.com/stefan-it) for providing the pre-processing and pre-training scripts. Finally, we would like to thank Google Cloud for giving us access to a TPU v3-8 through free cloud credits. The model was trained using Flax before being converted to PyTorch.

### BibTeX entry and citation info