Model Card for roberta-base-cuad

Model Details

Model Description

Uses

Direct Use

This model can be used for the task of Question Answering on Legal Documents.

Training Details

Read: CUAD: An Expert-Annotated NLP Dataset for Legal Contract Review for detailed information on training procedure, dataset preprocessing and evaluation.

Training Data, Procedure, Preprocessing, etc.

See CUAD dataset card for more information.

Evaluation

Testing Data, Factors & Metrics

Testing Data

See CUAD dataset card for more information.

Software

Python, Transformers

Citation

BibTeX:

@article{hendrycks2021cuad,
     title={CUAD: An Expert-Annotated NLP Dataset for Legal Contract Review}, 
     author={Dan Hendrycks and Collin Burns and Anya Chen and Spencer Ball},
     journal={NeurIPS},
     year={2021}
}

How to Get Started with the Model

Use the code below to get started with the model.

Click to expand
from transformers import AutoTokenizer, AutoModelForQuestionAnswering
 
tokenizer = AutoTokenizer.from_pretrained("mgigena/cuad-roberta-base")
 
model = AutoModelForQuestionAnswering.from_pretrained("mgigena/cuad-roberta-base")
Downloads last month
8
Safetensors
Model size
124M params
Tensor type
I64
·
F32
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Dataset used to train mgigena/roberta-base-cuad