Indonesian T5 Base

A T5 (Text-to-Text Transfer Transformer) model pretrained on the Indonesian portion of mC4 with extra filtering. This checkpoint is pretrained only and must be fine-tuned before it can be used for a specific task.
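Because the checkpoint is pretrained only, the usual first step is to load it and compute a supervised loss on task data. The following is a minimal sketch, not the author's recipe. It assumes the Hugging Face `transformers` library with PyTorch, and that the repository `Wikidepia/IndoT5-base` provides PyTorch-compatible weights. The task prefix `ringkas` and the helper names `format_example`, `load_model`, and `demo_loss` are hypothetical and chosen only for illustration.

```python
MODEL_ID = "Wikidepia/IndoT5-base"  # repository name of this model card


def format_example(task_prefix: str, source: str, target: str) -> dict:
    """Cast one supervised example into T5's text-to-text form.

    The prefix is an arbitrary, user-chosen task marker (hypothetical here);
    a pretrained-only checkpoint has not learned any prefix yet.
    """
    return {
        "input_text": f"{task_prefix}: {source.strip()}",
        "target_text": target.strip(),
    }


def load_model(model_id: str = MODEL_ID):
    """Load tokenizer and model (assumes transformers + PyTorch are installed)."""
    # Imported lazily so format_example works without transformers installed.
    from transformers import AutoTokenizer, T5ForConditionalGeneration

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = T5ForConditionalGeneration.from_pretrained(model_id)
    return tokenizer, model


def demo_loss(source: str, target: str, task_prefix: str = "ringkas") -> float:
    """Compute the sequence-to-sequence loss for a single example."""
    tokenizer, model = load_model()
    example = format_example(task_prefix, source, target)
    batch = tokenizer(
        example["input_text"], return_tensors="pt", truncation=True, max_length=512
    )
    labels = tokenizer(
        example["target_text"], return_tensors="pt", truncation=True, max_length=64
    ).input_ids
    # Passing labels makes the model return the cross-entropy training loss.
    return float(model(**batch, labels=labels).loss)
```

Calling `demo_loss(...)` downloads the checkpoint on first use. In a real fine-tuning run this loss would be minimized over a full dataset with an optimizer or a training loop of your choice.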

Pretraining Details

Trained for 1M steps, following the setup of google/t5-v1_1-base.
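This card does not spell out the pretraining objective. T5 v1.1 checkpoints are pretrained with span corruption, so the sketch below assumes this model used the same objective. It illustrates the input/target format only: the span positions are hand-picked rather than sampled, whitespace splitting stands in for the real tokenizer, and `corrupt_spans` is a hypothetical helper, not code from the training pipeline.

```python
def corrupt_spans(tokens, spans):
    """Replace each span with a sentinel and build the matching target.

    `spans` must be sorted, non-overlapping, half-open (start, end) index pairs.
    Returns (input_tokens, target_tokens).
    """
    inputs, targets = [], []
    cursor = 0
    for i, (start, end) in enumerate(spans):
        sentinel = f"<extra_id_{i}>"
        inputs.extend(tokens[cursor:start])  # keep text before the span
        inputs.append(sentinel)              # mask the span in the input
        targets.append(sentinel)             # announce the span in the target
        targets.extend(tokens[start:end])    # followed by the dropped tokens
        cursor = end
    inputs.extend(tokens[cursor:])           # keep the remaining text
    targets.append(f"<extra_id_{len(spans)}>")  # closing sentinel
    return inputs, targets


tokens = "saya suka makan nasi goreng di pagi hari".split()
inp, tgt = corrupt_spans(tokens, [(1, 2), (4, 6)])
print(" ".join(inp))  # saya <extra_id_0> makan nasi <extra_id_1> pagi hari
print(" ".join(tgt))  # <extra_id_0> suka <extra_id_1> goreng di <extra_id_2>
```

A model trained this way learns to fill in masked spans, which is why it needs task-specific fine-tuning before it produces useful task output.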

Model Performance

TBD

Limitations and bias

Like other language models trained on a large-scale corpus, this model can produce biased, unethical, or harmful output that reflects biases in its training data. Assume that this can happen, and use the model only in applications where such output will not cause harm.

Acknowledgement

Thanks to TensorFlow Research Cloud for providing TPU v3-8s.
