fgaim commited on
Commit
7a26103
1 Parent(s): 4a55b70

Add README

Browse files
Files changed (1) hide show
  1. README.md +15 -0
README.md ADDED
@@ -0,0 +1,15 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # RoBERTa Pretrained for Tigrinya Language
2
+
3
+ We pretrain RoBERTa base on a relatively small dataset for Tigrinya (34M tokens).
4
+
5
+
6
+ ## Hyperparameters
7
+
8
+ The hyperparameters corresponding to model sizes mentioned above are as follows:
9
+ | Model Size | L | AH | HS | FFN | P |
10
+ |------------|----|----|-----|------|------|
11
+ | BASE | 12 | 12 | 768 | 3072 | 125M |
12
+
13
+ (AH = number of attention heads; HS = hidden size; FFN = feedforward network dimension; P = number of parameters.)
14
+
15
+