Edit model card

Vietnamese Legal Text BERT

Table of contents

  1. Introduction
  2. Using Vietnamese Legal Text BERT

Using Vietnamese Legal Text BERT hmthanh/VietnamLegalText-SBERT

Using Vietnamese Legal Text BERT transformers

Installation

  • Install transformers with pip:

pip install transformers

  • Install tokenizers with pip:

pip install tokenizers

Pre-trained models

Model #params Arch. Max length Pre-training data
hmthanh/VietnamLegalText-SBERT 135M base 256 20GB of texts

Example usage

import torch
from transformers import AutoModel, AutoTokenizer

phobert = AutoModel.from_pretrained("hmthanh/VietnamLegalText-SBERT")
tokenizer = AutoTokenizer.from_pretrained("hmthanh/VietnamLegalText-SBERT")

sentence = 'Vượt đèn đỏ bị phạt bao nhiêu tiền?'  

input_ids = torch.tensor([tokenizer.encode(sentence)])

with torch.no_grad():
    features = phobert(input_ids)  # Models outputs are now tuples
Downloads last month
386
Safetensors
Model size
135M params
Tensor type
I64
·
F32
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.