Update README.md

README.md +27 -10

@@ -232,21 +232,21 @@ All models trained with max length 512 and batch size 8, using the CoNLL 2002 da
 ## PAWS-X
-All models trained with max length 512 and batch size 8. Even though this model has been run several times, it looks like some of the values reported may not be correct due to clerical errors (particularly the repeated 0.5765 values), so a new run is ongoing.
 <figure>
 | Model                                              | Accuracy |
 |----------------------------------------------------|----------|
 | bert-base-multilingual-cased                       | 0.5765   |
-| dccuchile/bert-base-spanish-wwm-cased              | 0.5765   |
 | BSC-TeMU/roberta-base-bne                          | 0.5765   |
-| bertin-project/bertin-roberta-base-spanish         | 0.6550   |
-| bertin-project/bertin-base-random                  | 0.8665   |
-| bertin-project/bertin-base-stepwise                | 0.8610   |
-| bertin-project/bertin-base-gaussian                | **0.8800**   |
-| bertin-project/bertin-base-random-exp-512seqlen    | 0.5765   |
-| bertin-project/bertin-base-gaussian-exp-512seqlen  |  **0.875**   |
 <caption>Table 5. Results for PAWS-X.</caption>
@@ -254,7 +254,6 @@ All models trained with max length 512 and batch size 8. Even though this model
 ## XNLI
-All models trained with max length 256 and batch size 32. (A set of runs with max length 512 is in progress.)
 <figure>
@@ -270,7 +269,25 @@ All models trained with max length 256 and batch size 32. (A set of runs with ma
 | bertin-project/bertin-base-gaussian-exp-512seqlen  | 0.7878   |
-<caption>Table 6. Results for XNLI.</caption>
 </figure>
 # Conclusions

 ## PAWS-X
+All models trained with max length 512 and batch size 8. These numbers are surprising both for the repeated instances of 0.5765 accuracy and for the large differences in performance. However, experiments have been repeated several times and the results are consistent.
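
One way to probe the repeated 0.5765 value: several unrelated models landing on exactly the same accuracy is what would be observed if each of them collapsed to predicting a single class, since accuracy then equals that class's share of the test labels. This is an assumption, not something the README states. The sketch below uses hypothetical helper names (`majority_baseline`, `looks_collapsed`) and toy labels rather than PAWS-X data; applying it to the real predictions would be needed to confirm or rule out the explanation.

```python
from collections import Counter


def majority_baseline(labels):
    """Accuracy obtained by always predicting the most frequent label."""
    counts = Counter(labels)
    return counts.most_common(1)[0][1] / len(labels)


def looks_collapsed(predictions, labels, tol=1e-9):
    """True if predictions are constant and accuracy equals the majority baseline."""
    accuracy = sum(p == y for p, y in zip(predictions, labels)) / len(labels)
    is_constant = len(set(predictions)) == 1
    return is_constant and abs(accuracy - majority_baseline(labels)) <= tol


# Toy labels (hypothetical, NOT PAWS-X data): 6 of 10 examples are class 0.
toy_labels = [0, 0, 0, 0, 0, 0, 1, 1, 1, 1]
collapsed_preds = [0] * len(toy_labels)  # a model that always answers 0

print(majority_baseline(toy_labels))
print(looks_collapsed(collapsed_preds, toy_labels))
```

If the check comes back true for the models stuck at 0.5765, the fine-tuning setup (learning rate, seed, number of epochs) is a more likely culprit than a clerical error.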
 <figure>
 | Model                                              | Accuracy |
 |----------------------------------------------------|----------|
 | bert-base-multilingual-cased                       | 0.5765   |
+| dccuchile/bert-base-spanish-wwm-cased              | 0.8720   |
 | BSC-TeMU/roberta-base-bne                          | 0.5765   |
+| bertin-project/bertin-roberta-base-spanish         | 0.5765   |
+| bertin-project/bertin-base-random                  | 0.8800   |
+| bertin-project/bertin-base-stepwise                | 0.8825   |
+| bertin-project/bertin-base-gaussian                | 0.8875   |
+| bertin-project/bertin-base-random-exp-512seqlen    | 0.6735   |
+| bertin-project/bertin-base-gaussian-exp-512seqlen  |  **0.8965**   |
 <caption>Table 5. Results for PAWS-X.</caption>
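
To make the size of this update easier to see, the following sketch computes the per-model change in PAWS-X accuracy between the removed and added rows of Table 5. The values are copied from the diff; the dictionary and function names are illustrative only.

```python
# Accuracy values copied from the removed (-) and added (+) rows of Table 5.
# Rows shown as unchanged context in the diff appear with the same value twice.
old_accuracy = {
    "bert-base-multilingual-cased": 0.5765,
    "dccuchile/bert-base-spanish-wwm-cased": 0.5765,
    "BSC-TeMU/roberta-base-bne": 0.5765,
    "bertin-project/bertin-roberta-base-spanish": 0.6550,
    "bertin-project/bertin-base-random": 0.8665,
    "bertin-project/bertin-base-stepwise": 0.8610,
    "bertin-project/bertin-base-gaussian": 0.8800,
    "bertin-project/bertin-base-random-exp-512seqlen": 0.5765,
    "bertin-project/bertin-base-gaussian-exp-512seqlen": 0.875,
}
new_accuracy = {
    "bert-base-multilingual-cased": 0.5765,
    "dccuchile/bert-base-spanish-wwm-cased": 0.8720,
    "BSC-TeMU/roberta-base-bne": 0.5765,
    "bertin-project/bertin-roberta-base-spanish": 0.5765,
    "bertin-project/bertin-base-random": 0.8800,
    "bertin-project/bertin-base-stepwise": 0.8825,
    "bertin-project/bertin-base-gaussian": 0.8875,
    "bertin-project/bertin-base-random-exp-512seqlen": 0.6735,
    "bertin-project/bertin-base-gaussian-exp-512seqlen": 0.8965,
}


def accuracy_changes(old, new):
    """Return {model: new - old} for models whose accuracy changed."""
    return {
        model: round(new[model] - old[model], 4)
        for model in old
        if model in new and new[model] != old[model]
    }


changes = accuracy_changes(old_accuracy, new_accuracy)
# Print the largest gains first.
for model, delta in sorted(changes.items(), key=lambda item: -item[1]):
    print(f"{model:<52} {delta:+.4f}")
```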
 ## XNLI
 <figure>
 | bertin-project/bertin-base-gaussian-exp-512seqlen  | 0.7878   |
+<caption>Table 6. Results for XNLI with sequence length 256 and batch size 32.</caption>
+</figure>
+<figure>
+| Model                                              | Accuracy |
+|----------------------------------------------------|----------|
+| bert-base-multilingual-cased                       | WIP   |
+| dccuchile/bert-base-spanish-wwm-cased              | WIP   |
+| BSC-TeMU/roberta-base-bne                          | WIP   |
+| bertin-project/bertin-base-random                  | WIP   |
+| bertin-project/bertin-base-stepwise                | WIP   |
+| bertin-project/bertin-base-gaussian                | WIP   |
+| bertin-project/bertin-base-random-exp-512seqlen    | 0.7799   |
+| bertin-project/bertin-base-gaussian-exp-512seqlen  | 0.7843   |
+<caption>Table 7. Results for XNLI with sequence length 512 and batch size 16.</caption>
 </figure>
 # Conclusions