fgaim/tielectra-small


Fitsum Gaim committed on Oct 15, 2021

Commit 8eb3c17 • 1 Parent(s): 1f0a04c

Add model card

Files changed (1)

README.md +22 -0

README.md ADDED

	@@ -0,0 +1,22 @@

+---
+language: ti
+widget:
+- text: "ዓቕሚ መንእሰይ ኤርትራ [MASK] ተራእዩ"
+---
+# Pre-trained ELECTRA small for Tigrinya Language
+We pre-trained ELECTRA small on the [TLMD](https://zenodo.org/record/5139094) dataset, which contains over 40 million tokens.
+This repository contains the trained Flax and PyTorch models.
+## Hyperparameters
+The hyperparameters of the model are as follows:
+| Model Size | L  | AH | HS  | FFN  | P    | Seq  |
+|------------|----|----|-----|------|------|------|
+| SMALL      | 12 | 4  | 256 | 1024 | 14M  | 512  |
+(L = number of layers; AH = number of attention heads; HS = hidden size; FFN = feedforward network dimension; P = number of parameters; Seq = maximum sequence length.)
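
As a rough sanity check on the table, the parameter count can be estimated from the listed hyperparameters. The sketch below is not taken from the model card: `estimate_encoder_params` is a hypothetical helper, and the vocabulary size (30,522) and embedding size (128) are assumptions borrowed from the original English ELECTRA small setup, since the card states neither. It counts only the encoder weights, and the actual Tigrinya vocabulary may differ, so the result only shows that the row is in the right range for P.

```python
def estimate_encoder_params(layers, hidden, ffn, max_seq,
                            vocab_size, embed_size, type_vocab=2):
    """Estimate the weights in an ELECTRA-style Transformer encoder (no task heads)."""
    # Token, position, and segment embedding tables plus one LayerNorm (weight + bias).
    embeddings = (vocab_size + max_seq + type_vocab) * embed_size + 2 * embed_size
    # Linear projection from embedding size to hidden size, used when the two differ.
    projection = embed_size * hidden + hidden if embed_size != hidden else 0
    # Self-attention: query, key, value, and output projections, each with a bias.
    # The number of attention heads (AH) does not change this count.
    attention = 4 * (hidden * hidden + hidden)
    # Feed-forward block: hidden -> ffn -> hidden, each with a bias.
    feed_forward = (hidden * ffn + ffn) + (ffn * hidden + hidden)
    # Two LayerNorms per layer (weight + bias each).
    layer_norms = 2 * (2 * hidden)
    return embeddings + projection + layers * (attention + feed_forward + layer_norms)


# L, HS, FFN, and Seq come from the table; vocab_size and embed_size are ASSUMED.
total = estimate_encoder_params(layers=12, hidden=256, ffn=1024, max_seq=512,
                                vocab_size=30522, embed_size=128)
print(f"{total / 1e6:.1f}M parameters")  # prints "13.5M parameters"
```

Under these assumptions the estimate lands close to the 14M listed in the P column; a larger or smaller vocabulary shifts the embedding term by 128 weights per token.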