Model Overview
This is a RoBERTa-Large QA Model trained from https://huggingface.co/roberta-large in two stages. First, it is trained on synthetic adversarial data generated using a BART-Large question generator on Wikipedia passages from SQuAD as well as Wikipedia passages external to SQuAD, and then it is trained on SQuAD and AdversarialQA (https://arxiv.org/abs/2002.00293) in a second stage of fine-tuning.
Data
Training data: SQuAD + AdversarialQA Evaluation data: SQuAD + AdversarialQA
Training Process
Approx. 1 training epoch on the synthetic data and 2 training epochs on the manually-curated data.
Additional Information
Please refer to https://arxiv.org/abs/2104.08678 for full details.
- Downloads last month
- 14
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.
Datasets used to train mbartolo/roberta-large-synqa-ext
Evaluation results
- Exact Match on adversarial_qavalidation set self-reported53.200
- F1 on adversarial_qavalidation set self-reported64.627