Update README.md
README.md CHANGED
@@ -1,5 +1,7 @@
 ---
 license: cc
+language:
+- en
 ---
 
 # Bio-ELECTRA Base 1m (cased)
@@ -34,7 +36,7 @@ to the properly tokenized and segmented sentences.
 ## Pretraining
 
 The model is pretrained on a single 8 core version 3 tensor processing unit (TPU) with 128 GB of RAM for 1,000,000 steps
-with a batch size of 256. The training
+with a batch size of 256. The training parameters were the same as the original ELECTRA base model. The model has 110M parameters,
 12 transformers layers with hidden layer size of 768 and 12 attention heads.
 
 
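For reference, a minimal sketch of the architecture described in the updated pretraining paragraph, expressed as a Hugging Face `transformers` config. The 12 layers, hidden size 768, and 12 attention heads are stated above; `vocab_size`, `embedding_size`, and `intermediate_size` are assumptions carried over from the original ELECTRA base configuration, which the README says was reused, and are not confirmed by this commit:

```python
from transformers import ElectraConfig, ElectraModel

# Architecture from the updated paragraph: 12 transformer layers,
# hidden size 768, 12 attention heads.
config = ElectraConfig(
    vocab_size=30522,        # assumption: ELECTRA base default
    embedding_size=768,      # assumption: ELECTRA base default
    hidden_size=768,
    num_hidden_layers=12,
    num_attention_heads=12,
    intermediate_size=3072,  # assumption: ELECTRA base default
)

# Instantiating the model lets us sanity-check the stated size:
# the total comes out at roughly 110M parameters, as the README claims.
model = ElectraModel(config)
print(sum(p.numel() for p in model.parameters()))
```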