Edit model card

deberta-v3-large-snli_mnli_fever_anli_R1_R2_R3-nli

Datasets

This model was trained on the snli-v1.0, multi-nli-1.0, nli-fever and anli-1.0-r1/anli-1.0-r2/anli-1.0-r3 datasets with the training weights of 1,1,1,10,20,10 respectively.
The training codes are mostly referenced from: https://github.com/facebookresearch/anli

Hyperparameters

learning_rate: 1e-5
max_length: 156
batch_size: 16
warmup_ratio: 0.1
weight_decay: 0.0
num_epochs: 2

Dev results

snli-v1.0 multi-nli-1.0-m multi-nli-1.0-mm anli-1.0-r1 anli-1.0-r2 anli-1.0-r3
0.938 0.914 0.912 0.796 0.627 0.610

Test results

snli-v1.0 anli-1.0-r1 anli-1.0-r2 anli-1.0-r3
0.929 0.775 0.636 0.612
Downloads last month
55