Model description

This is a RoBERTa base model pretrained on the Vietnamese portion of the OSCAR dataset.

How to use

You can use this model for masked language modeling as follows:

from transformers import AutoTokenizer, AutoModelForMaskedLM

# Load the tokenizer and the model with its masked-LM head from the Hugging Face Hub
tokenizer = AutoTokenizer.from_pretrained("NlpHUST/roberta-base-vn")
model = AutoModelForMaskedLM.from_pretrained("NlpHUST/roberta-base-vn")
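
To see actual predictions for a masked position, one option is the `fill-mask` pipeline from `transformers`. The sketch below is illustrative only: the helper names `load_fill_mask` and `top_predictions` and the example sentence are assumptions, not part of this model card. It assumes the sentence contains exactly one `<mask>` token.

```python
def load_fill_mask(model_name="NlpHUST/roberta-base-vn"):
    """Build a fill-mask pipeline for the model (hypothetical helper).

    transformers is imported here so that the helper below can be
    used and tested without loading the library or the model.
    """
    from transformers import pipeline
    return pipeline("fill-mask", model=model_name)


def top_predictions(fill_mask, sentence, mask_token="<mask>", top_k=5):
    """Return (token, score) pairs for the single masked position.

    `fill_mask` is any callable with the fill-mask pipeline interface:
    it takes a sentence and `top_k`, and returns a list of dicts with
    at least the keys "token_str" and "score".
    """
    if sentence.count(mask_token) != 1:
        raise ValueError(f"expected exactly one {mask_token} in the sentence")
    results = fill_mask(sentence, top_k=top_k)
    # token_str may carry a leading space from the tokenizer; strip it
    return [(r["token_str"].strip(), float(r["score"])) for r in results]


# Example usage (downloads the model on first run):
#   fill_mask = load_fill_mask()
#   for token, score in top_predictions(fill_mask, "Hà Nội là thủ đô của <mask>."):
#       print(token, round(score, 4))
```

Passing the pipeline in as an argument keeps the model loaded once and reused across calls.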

You can fine-tune this model on downstream tasks.
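
As a starting point for fine-tuning, the pretrained encoder can be loaded with a freshly initialized task head. The sketch below assumes a text classification task; the task choice, the label names, and the helpers `build_label_maps` and `load_classifier` are hypothetical and only illustrate the setup, not a full training recipe.

```python
def build_label_maps(labels):
    """Map each distinct label to an integer id and back (hypothetical helper)."""
    label2id = {label: i for i, label in enumerate(sorted(set(labels)))}
    id2label = {i: label for label, i in label2id.items()}
    return label2id, id2label


def load_classifier(labels, model_name="NlpHUST/roberta-base-vn"):
    """Load the tokenizer and the encoder with a new classification head.

    The head's weights are randomly initialized, so the model needs
    training on labeled data before its predictions are meaningful.
    """
    from transformers import AutoTokenizer, AutoModelForSequenceClassification

    label2id, id2label = build_label_maps(labels)
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSequenceClassification.from_pretrained(
        model_name,
        num_labels=len(label2id),
        id2label=id2label,
        label2id=label2id,
    )
    return tokenizer, model


# Example usage (downloads the model on first run; labels are made up):
#   tokenizer, model = load_classifier(["negative", "positive"])
```

From here the model can be trained with a standard PyTorch loop or the `Trainer` class from `transformers`.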
Mask token: <mask>