File size: 568 Bytes
db03705
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
 ---
inference: false
language: id
---

# IndoConvBERT Base Model

IndoConvBERT is a ConvBERT model pretrained on Indo4B.

## Pretraining details

We follow a different training procedure: instead of using a two-phase approach, that pre-trains the model for 90% with 128 sequence length and 10% with 512 sequence length, we pre-train the model with 512 sequence length for 1M steps on a v3-8 TPU.

The current version of the model is trained on Indo4B and small Twitter dump.

## Acknowledgement

Big thanks to TFRC (TensorFlow Research Cloud) for providing free TPU.