Update README.md
README.md (changed)
```diff
@@ -161,28 +161,73 @@ We are currently in the process of applying our language models to downstream ta
 **SQUAD-es**
 Using sequence length 128, we have achieved an exact match of 50.96 and an F1 of 68.74.
 
-**POS**
+**POS**
+All models trained with max length 512 and batch size 8, using the CoNLL 2002 dataset.
 
 <figure>
 
-| Model                                              | F1       |
-|----------------------------------------------------|----------|
-| bert-base-multilingual-cased                       | 0.9629   |
-| dccuchile/bert-base-spanish-wwm-cased              | 0.9642   |
-| BSC-TeMU/roberta-base-bne                          | 0.9659   |
-| flax-community/bertin-roberta-large-spanish        | 0.9646   |
-| bertin-project/bertin-roberta-base-spanish         | 0.9638   |
-| bertin-project/bertin-base-random                  | 0.9656   |
-| bertin-project/bertin-base-stepwise                | 0.9656   |
-| bertin-project/bertin-base-gaussian                | **0.9662** |
-| bertin-project/bertin-base-random-exp-512seqlen    | 0.9660   |
-| bertin-project/bertin-base-gaussian-exp-512seqlen  | **0.9662** |
+| Model                                              | F1         | Accuracy   |
+|----------------------------------------------------|------------|------------|
+| bert-base-multilingual-cased                       | 0.9629     | 0.9687     |
+| dccuchile/bert-base-spanish-wwm-cased              | 0.9642     | 0.9700     |
+| BSC-TeMU/roberta-base-bne                          | 0.9659     | 0.9707     |
+| flax-community/bertin-roberta-large-spanish        | 0.9646     | 0.9697     |
+| bertin-project/bertin-roberta-base-spanish         | 0.9638     | 0.9690     |
+| bertin-project/bertin-base-random                  | 0.9656     | 0.9704     |
+| bertin-project/bertin-base-stepwise                | 0.9656     | 0.9707     |
+| bertin-project/bertin-base-gaussian                | **0.9662** | 0.9709     |
+| bertin-project/bertin-base-random-exp-512seqlen    | 0.9660     | 0.9707     |
+| bertin-project/bertin-base-gaussian-exp-512seqlen  | **0.9662** | **0.9714** |
 
 
 <caption>Table 2. Results for POS.</caption>
 </figure>
 
-
+
+**NER**
+All models trained with max length 512 and batch size 8, using the CoNLL 2002 dataset.
+
+<figure>
+
+| Model                                              | F1         | Accuracy   |
+|----------------------------------------------------|------------|------------|
+| bert-base-multilingual-cased                       | 0.8539     | 0.9779     |
+| dccuchile/bert-base-spanish-wwm-cased              | 0.8579     | 0.9783     |
+| BSC-TeMU/roberta-base-bne                          | 0.8700     | 0.9807     |
+| flax-community/bertin-roberta-large-spanish        | 0.8735     | 0.9806     |
+| bertin-project/bertin-roberta-base-spanish         | 0.8725     | 0.9812     |
+| bertin-project/bertin-base-random                  | 0.8704     | 0.9807     |
+| bertin-project/bertin-base-stepwise                | 0.8705     | 0.9809     |
+| bertin-project/bertin-base-gaussian                | **0.8792** | **0.9816** |
+| bertin-project/bertin-base-random-exp-512seqlen    | 0.8616     | 0.9803     |
+| bertin-project/bertin-base-gaussian-exp-512seqlen  | **0.8764** | **0.9819** |
+
+
+<caption>Table 3. Results for NER.</caption>
+</figure>
+
+
+**PAWS-X**
+All models trained with max length 512 and batch size 8. The accuracy values in this case are surprising (some models score below 0.60 while others are close to 0.90), so these experiments were run 3 times with very similar results; the reported metrics are from the last run.
+
+<figure>
+
+| Model                                              | Accuracy   |
+|----------------------------------------------------|------------|
+| bert-base-multilingual-cased                       | 0.5765     |
+| dccuchile/bert-base-spanish-wwm-cased              | 0.5765     |
+| BSC-TeMU/roberta-base-bne                          | 0.5765     |
+| flax-community/bertin-roberta-large-spanish        | 0.5765     |
+| bertin-project/bertin-roberta-base-spanish         | 0.6550     |
+| bertin-project/bertin-base-random                  | 0.8665     |
+| bertin-project/bertin-base-stepwise                | 0.8610     |
+| bertin-project/bertin-base-gaussian                | **0.8800** |
+| bertin-project/bertin-base-random-exp-512seqlen    | 0.5765     |
+| bertin-project/bertin-base-gaussian-exp-512seqlen  | **0.875**  |
 
 # Conclusions
 
```
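For context on the NER numbers: the F1 reported for CoNLL 2002 is conventionally entity-level (exact span and type match), not per-token. Below is a minimal illustrative re-implementation of that metric over BIO tags; it is not the evaluation code used for these results (a library such as seqeval is normally used), and the example sequences are made up.

```python
def extract_spans(tags):
    """Collect (start, end, type) entity spans from one BIO tag sequence."""
    spans, start, etype = [], None, None
    for i, tag in enumerate(tags):
        if tag.startswith("B-") or (tag.startswith("I-") and etype != tag[2:]):
            if etype is not None:          # close the span in progress
                spans.append((start, i, etype))
            start, etype = i, tag[2:]      # open a new span
        elif tag == "O":
            if etype is not None:
                spans.append((start, i, etype))
            start, etype = None, None
        # an I- tag matching the open span's type simply continues it
    if etype is not None:
        spans.append((start, len(tags), etype))
    return set(spans)

def entity_f1(gold_seqs, pred_seqs):
    """Micro-averaged F1 over exact entity-span matches across sequences."""
    tp = fp = fn = 0
    for gold, pred in zip(gold_seqs, pred_seqs):
        g, p = extract_spans(gold), extract_spans(pred)
        tp += len(g & p)
        fp += len(p - g)
        fn += len(g - p)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    return 2 * precision * recall / denom if denom else 0.0

# Toy example: the model finds the PER span exactly but misses the LOC span.
gold = [["B-PER", "I-PER", "O", "B-LOC"]]
pred = [["B-PER", "I-PER", "O", "O"]]
print(round(entity_f1(gold, pred), 4))  # precision 1.0, recall 0.5 -> 0.6667
```

Note that a prediction with the right tokens but the wrong boundary or entity type counts as both a false positive and a false negative under this metric, which is why entity-level F1 (Table 3) sits well below token-level accuracy.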