Update README.md
Browse files
README.md
CHANGED
@@ -19,6 +19,7 @@ datasets:
|
|
19 |
Based on microsoft/deberta-v3-large, this model was trained on the snli-v1.0, multi-nli-1.0, nli-fever and anli-1.0-r1/anli-1.0-r2/anli-1.0-r3 datasets, with the training weights of 1,1,1,10,20,10 respectively.
|
20 |
The training codes are mostly referenced from: https://github.com/facebookresearch/anli
|
21 |
|
|
|
22 |
#### Hyperparameters
|
23 |
learning_rate: 1e-5
|
24 |
max_length: 156
|
@@ -27,13 +28,14 @@ warmup_ratio: 0.1
|
|
27 |
weight_decay: 0.0
|
28 |
num_epochs: 2
|
29 |
|
|
|
30 |
#### Dev results
|
31 |
snli-v1.0 | multi-nli-1.0-m | multi-nli-1.0-mm | anli-1.0-r1 | anli-1.0-r2 | anli-1.0-r3
|
32 |
----------|-----------------|------------------|-------------|-------------|------------
|
33 |
0.938 | 0.914 | 0.912 | 0.796 | 0.627 | 0.610
|
34 |
|
|
|
35 |
#### Test results
|
36 |
-
Results of the test sets are shown together with some other official pre-trained model checkpoints.
|
37 |
Model | snli-v1.0 | anli-1.0-r1 | anli-1.0-r2 | anli-1.0-r3
|
38 |
------|-----------|-------------|-------------|------------
|
39 |
ynie/roberta-large-snli_mnli_fever_anli_R1_R2_R3-nli | - | 0.736 | 0.493 | 0.455
|
|
|
19 |
Based on microsoft/deberta-v3-large, this model was trained on the snli-v1.0, multi-nli-1.0, nli-fever and anli-1.0-r1/anli-1.0-r2/anli-1.0-r3 datasets, with the training weights of 1,1,1,10,20,10 respectively.
|
20 |
The training codes are mostly referenced from: https://github.com/facebookresearch/anli
|
21 |
|
22 |
+
|
23 |
#### Hyperparameters
|
24 |
learning_rate: 1e-5
|
25 |
max_length: 156
|
|
|
28 |
weight_decay: 0.0
|
29 |
num_epochs: 2
|
30 |
|
31 |
+
|
32 |
#### Dev results
|
33 |
snli-v1.0 | multi-nli-1.0-m | multi-nli-1.0-mm | anli-1.0-r1 | anli-1.0-r2 | anli-1.0-r3
|
34 |
----------|-----------------|------------------|-------------|-------------|------------
|
35 |
0.938 | 0.914 | 0.912 | 0.796 | 0.627 | 0.610
|
36 |
|
37 |
+
|
38 |
#### Test results
|
|
|
39 |
Model | snli-v1.0 | anli-1.0-r1 | anli-1.0-r2 | anli-1.0-r3
|
40 |
------|-----------|-------------|-------------|------------
|
41 |
ynie/roberta-large-snli_mnli_fever_anli_R1_R2_R3-nli | - | 0.736 | 0.493 | 0.455
|