BERT for Vietnamese Law

Applied to Task 1, Legal Document Retrieval, on the ALQAC 2021 dataset.

The model scored 0.80 on the leaderboard (the first-place score was 0.88).

We use vibert4news as the base model and fine-tune it on our own Vietnamese law dataset.

We use word-level SentencePiece with basic BERT tokenization, and the same configuration as BERT base, with lowercase = False.
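
To illustrate what the lowercase = False setting preserves: the standard uncased BERT preprocessing lowercases text and also strips accent marks, which removes Vietnamese diacritics. The sketch below uses only the Python standard library and is a simplified stand-in for BERT's basic tokenizer, not this model's actual tokenizer. The function names `strip_accents` and `basic_tokenize` are illustrative assumptions, and punctuation handling is reduced to Unicode punctuation categories.

```python
import unicodedata
from typing import List


def strip_accents(text: str) -> str:
    """Remove combining marks after NFD decomposition (what uncased BERT does)."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")


def basic_tokenize(text: str, lowercase: bool) -> List[str]:
    """Split on whitespace and punctuation; optionally lowercase and strip accents."""
    if lowercase:
        text = strip_accents(text.lower())
    tokens: List[str] = []
    current: List[str] = []
    for ch in text:
        if ch.isspace():
            # Whitespace ends the current token.
            if current:
                tokens.append("".join(current))
                current = []
        elif unicodedata.category(ch).startswith("P"):
            # Punctuation ends the current token and becomes its own token.
            if current:
                tokens.append("".join(current))
                current = []
            tokens.append(ch)
        else:
            current.append(ch)
    if current:
        tokens.append("".join(current))
    return tokens


sentence = "Bộ luật Dân sự."
cased = basic_tokenize(sentence, lowercase=False)    # diacritics and case kept
uncased = basic_tokenize(sentence, lowercase=True)   # ['bo', 'luat', 'dan', 'su', '.']
print(cased)
print(uncased)
```

With lowercase=True the diacritics are lost, so distinct Vietnamese words can collapse into the same token; with lowercase=False the text is passed through unchanged before subword splitting.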

