semindan
/

xnli_m_bert_only_en

@@ -22,7 +22,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: 0.810843373493976
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) on the xnli dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5306
-- Accuracy: 0.8108
 ## Model description
@@ -53,90 +53,24 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
-- train_batch_size: 16
-- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3.0
 ### Training results
 | Training Loss | Epoch | Step  | Validation Loss | Accuracy |
 |:-------------:|:-----:|:-----:|:---------------:|:--------:|
-| 0.8884        | 0.04  | 1000  | 0.7743          | 0.6703   |
-| 0.782         | 0.08  | 2000  | 0.7029          | 0.7060   |
-| 0.7479        | 0.12  | 3000  | 0.7366          | 0.6880   |
-| 0.7348        | 0.16  | 4000  | 0.6722          | 0.7285   |
-| 0.721         | 0.2   | 5000  | 0.6802          | 0.7237   |
-| 0.7097        | 0.24  | 6000  | 0.6801          | 0.7217   |
-| 0.6978        | 0.29  | 7000  | 0.6051          | 0.7643   |
-| 0.6924        | 0.33  | 8000  | 0.6793          | 0.7357   |
-| 0.6807        | 0.37  | 9000  | 0.6604          | 0.7502   |
-| 0.6636        | 0.41  | 10000 | 0.6309          | 0.7430   |
-| 0.6616        | 0.45  | 11000 | 0.6039          | 0.7490   |
-| 0.6561        | 0.49  | 12000 | 0.6051          | 0.7610   |
-| 0.6545        | 0.53  | 13000 | 0.6354          | 0.7454   |
-| 0.644         | 0.57  | 14000 | 0.6064          | 0.7466   |
-| 0.6446        | 0.61  | 15000 | 0.6052          | 0.7554   |
-| 0.6414        | 0.65  | 16000 | 0.6365          | 0.7422   |
-| 0.6311        | 0.69  | 17000 | 0.6118          | 0.7546   |
-| 0.6187        | 0.73  | 18000 | 0.5973          | 0.7538   |
-| 0.619         | 0.77  | 19000 | 0.5863          | 0.7570   |
-| 0.6108        | 0.81  | 20000 | 0.6212          | 0.7490   |
-| 0.6136        | 0.86  | 21000 | 0.5810          | 0.7695   |
-| 0.6018        | 0.9   | 22000 | 0.5799          | 0.7731   |
-| 0.6198        | 0.94  | 23000 | 0.5548          | 0.7723   |
-| 0.6047        | 0.98  | 24000 | 0.5964          | 0.7622   |
-| 0.5636        | 1.02  | 25000 | 0.5805          | 0.7851   |
-| 0.5267        | 1.06  | 26000 | 0.5540          | 0.7795   |
-| 0.5067        | 1.1   | 27000 | 0.5388          | 0.7855   |
-| 0.5304        | 1.14  | 28000 | 0.5482          | 0.7799   |
-| 0.5332        | 1.18  | 29000 | 0.5290          | 0.7859   |
-| 0.5154        | 1.22  | 30000 | 0.5475          | 0.7799   |
-| 0.524         | 1.26  | 31000 | 0.5305          | 0.7900   |
-| 0.5236        | 1.3   | 32000 | 0.5691          | 0.7871   |
-| 0.5154        | 1.34  | 33000 | 0.5642          | 0.7739   |
-| 0.5248        | 1.39  | 34000 | 0.5590          | 0.7643   |
-| 0.5077        | 1.43  | 35000 | 0.6064          | 0.7715   |
-| 0.5147        | 1.47  | 36000 | 0.5343          | 0.7948   |
-| 0.5041        | 1.51  | 37000 | 0.5375          | 0.7867   |
-| 0.5054        | 1.55  | 38000 | 0.5660          | 0.7727   |
-| 0.5053        | 1.59  | 39000 | 0.5479          | 0.7859   |
-| 0.5009        | 1.63  | 40000 | 0.5080          | 0.7960   |
-| 0.5081        | 1.67  | 41000 | 0.5139          | 0.7920   |
-| 0.5013        | 1.71  | 42000 | 0.5385          | 0.7904   |
-| 0.4972        | 1.75  | 43000 | 0.5257          | 0.7928   |
-| 0.4987        | 1.79  | 44000 | 0.5056          | 0.8020   |
-| 0.4863        | 1.83  | 45000 | 0.5030          | 0.8004   |
-| 0.5           | 1.87  | 46000 | 0.5157          | 0.7980   |
-| 0.4926        | 1.91  | 47000 | 0.5505          | 0.7924   |
-| 0.4893        | 1.96  | 48000 | 0.5286          | 0.8004   |
-| 0.4755        | 2.0   | 49000 | 0.5216          | 0.8036   |
-| 0.3855        | 2.04  | 50000 | 0.6087          | 0.7884   |
-| 0.3731        | 2.08  | 51000 | 0.5485          | 0.8064   |
-| 0.3698        | 2.12  | 52000 | 0.5398          | 0.8080   |
-| 0.3702        | 2.16  | 53000 | 0.5454          | 0.8      |
-| 0.3688        | 2.2   | 54000 | 0.5512          | 0.8068   |
-| 0.3683        | 2.24  | 55000 | 0.5423          | 0.8060   |
-| 0.3704        | 2.28  | 56000 | 0.5383          | 0.8084   |
-| 0.3758        | 2.32  | 57000 | 0.5398          | 0.8161   |
-| 0.3781        | 2.36  | 58000 | 0.5481          | 0.8088   |
-| 0.3697        | 2.4   | 59000 | 0.5465          | 0.8056   |
-| 0.3706        | 2.44  | 60000 | 0.5488          | 0.7988   |
-| 0.3704        | 2.49  | 61000 | 0.5395          | 0.8052   |
-| 0.3648        | 2.53  | 62000 | 0.5463          | 0.8068   |
-| 0.36          | 2.57  | 63000 | 0.5400          | 0.8052   |
-| 0.3661        | 2.61  | 64000 | 0.5542          | 0.8068   |
-| 0.3555        | 2.65  | 65000 | 0.5424          | 0.8044   |
-| 0.3551        | 2.69  | 66000 | 0.5269          | 0.8124   |
-| 0.3608        | 2.73  | 67000 | 0.5382          | 0.8129   |
-| 0.35          | 2.77  | 68000 | 0.5461          | 0.8108   |
-| 0.3457        | 2.81  | 69000 | 0.5477          | 0.8084   |
-| 0.3516        | 2.85  | 70000 | 0.5345          | 0.8104   |
-| 0.3499        | 2.89  | 71000 | 0.5344          | 0.8129   |
-| 0.3513        | 2.93  | 72000 | 0.5279          | 0.8120   |
-| 0.3442        | 2.97  | 73000 | 0.5306          | 0.8108   |
 ### Framework versions

     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.8076305220883534
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) on the xnli dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.0082
+- Accuracy: 0.8076
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
+- train_batch_size: 128
+- eval_batch_size: 128
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 7
 ### Training results
 | Training Loss | Epoch | Step  | Validation Loss | Accuracy |
 |:-------------:|:-----:|:-----:|:---------------:|:--------:|
+| 0.3328        | 1.0   | 3068  | 0.5433          | 0.8036   |
+| 0.259         | 2.0   | 6136  | 0.5708          | 0.8008   |
+| 0.2023        | 3.0   | 9204  | 0.6475          | 0.8048   |
+| 0.1362        | 4.0   | 12272 | 0.7661          | 0.7972   |
+| 0.0945        | 5.0   | 15340 | 0.8333          | 0.8008   |
+| 0.0665        | 6.0   | 18408 | 0.9312          | 0.8092   |
+| 0.0463        | 7.0   | 21476 | 1.0082          | 0.8076   |
 ### Framework versions