
Pretrained DebertaV2 Base on Malaysian text with a 512-token masked language modeling context length.

Special thanks to https://github.com/aisyahrzk for pretraining DebertaV2 Base.

WandB logs at https://wandb.ai/aisyahrazak/deberta-base?nw=nwuseraisyahrazak
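
A minimal usage sketch, assuming the checkpoint loads with the standard `transformers` masked language modeling classes; the Malay example sentence below is illustrative only.

```python
from transformers import AutoTokenizer, AutoModelForMaskedLM, pipeline

# Load the tokenizer and masked-LM head from the Hugging Face Hub.
tokenizer = AutoTokenizer.from_pretrained('mesolitica/malaysian-debertav2-base')
model = AutoModelForMaskedLM.from_pretrained('mesolitica/malaysian-debertav2-base')

# Fill in a masked token in a Malay sentence.
fill_mask = pipeline('fill-mask', model=model, tokenizer=tokenizer)
print(fill_mask(f'Kuala Lumpur ialah ibu {tokenizer.mask_token} Malaysia.'))
```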

Model size: 114M parameters, stored as BF16 safetensors.