semindan/xnli_xlm_r_only_ur

@@ -22,7 +22,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: 0.41847389558232934
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [xlm-roberta-base](https://huggingface.co/xlm-roberta-base) on the xnli dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.0581
-- Accuracy: 0.4185
 ## Model description
@@ -52,29 +52,29 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
-- train_batch_size: 128
-- eval_batch_size: 128
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 100
 - num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step  | Validation Loss | Accuracy |
 |:-------------:|:-----:|:-----:|:---------------:|:--------:|
-| 1.0996        | 1.0   | 3068  | 1.0988          | 0.3333   |
-| 1.1           | 2.0   | 6136  | 1.0986          | 0.3333   |
-| 1.0999        | 3.0   | 9204  | 1.0986          | 0.3333   |
-| 1.0998        | 4.0   | 12272 | 1.0988          | 0.3333   |
-| 1.0998        | 5.0   | 15340 | 1.0986          | 0.3333   |
-| 1.0994        | 6.0   | 18408 | 1.0987          | 0.3333   |
-| 1.0994        | 7.0   | 21476 | 1.0987          | 0.3333   |
-| 1.0993        | 8.0   | 24544 | 1.0987          | 0.3333   |
-| 1.0932        | 9.0   | 27612 | 1.0692          | 0.4177   |
-| 1.075         | 10.0  | 30680 | 1.0581          | 0.4185   |
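
The first eight epochs of this earlier run stay at accuracy 0.3333 with a validation loss of about 1.0986, which matches what a classifier predicting a uniform distribution over three labels would score. A minimal check of that arithmetic, assuming the three-way XNLI label set (entailment, neutral, contradiction) and a class-balanced evaluation split:

```python
import math

# Assumption: XNLI is a 3-class task (entailment / neutral / contradiction).
NUM_CLASSES = 3

# Cross-entropy of a uniform prediction over K classes: -log(1 / K) = log(K).
chance_loss = -math.log(1.0 / NUM_CLASSES)

# On a class-balanced split, always guessing one label is right 1/K of the time.
chance_accuracy = 1.0 / NUM_CLASSES

print(f"chance loss     = {chance_loss:.4f}")
print(f"chance accuracy = {chance_accuracy:.4f}")
```

Under these assumptions the table suggests the earlier run did not move away from chance until epoch 9.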
 ### Framework versions

     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.6526104417670683
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [xlm-roberta-base](https://huggingface.co/xlm-roberta-base) on the xnli dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.8165
+- Accuracy: 0.6526
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1.5e-05
+- train_batch_size: 192
+- eval_batch_size: 192
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 1000
 - num_epochs: 10
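
To make the `linear` scheduler and the warmup setting concrete, here is a small sketch of the usual shape: a linear ramp from zero to the peak learning rate over the warmup steps, then a linear decay to zero at the last step. It is plain Python rather than the Trainer's own scheduler, and `TOTAL_STEPS` is an assumption taken from the final step in the Training results table (10 epochs of 2046 steps).

```python
PEAK_LR = 1.5e-05      # learning_rate from the hyperparameter list
WARMUP_STEPS = 1000    # lr_scheduler_warmup_steps from the hyperparameter list
TOTAL_STEPS = 20460    # assumption: last step in the results table (10 x 2046)


def linear_lr(step: int) -> float:
    """Learning rate at `step`: linear warmup, then linear decay to zero."""
    if step < WARMUP_STEPS:
        return PEAK_LR * step / WARMUP_STEPS
    remaining = max(0, TOTAL_STEPS - step)
    return PEAK_LR * remaining / (TOTAL_STEPS - WARMUP_STEPS)


# Sample the schedule at the start, mid-warmup, peak, mid-decay, and end.
for step in (0, 500, 1000, 10730, 20460):
    print(step, linear_lr(step))
```

With these numbers the warmup covers roughly the first half of epoch 1, so almost all of training happens on the decaying part of the schedule.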
 ### Training results
 | Training Loss | Epoch | Step  | Validation Loss | Accuracy |
 |:-------------:|:-----:|:-----:|:---------------:|:--------:|
+| 1.0253        | 1.0   | 2046  | 0.8330          | 0.6382   |
+| 0.9659        | 2.0   | 4092  | 0.8105          | 0.6530   |
+| 0.9445        | 3.0   | 6138  | 0.7978          | 0.6558   |
+| 0.9254        | 4.0   | 8184  | 0.7791          | 0.6594   |
+| 0.9075        | 5.0   | 10230 | 0.7792          | 0.6614   |
+| 0.8892        | 6.0   | 12276 | 0.7812          | 0.6554   |
+| 0.8728        | 7.0   | 14322 | 0.7762          | 0.6538   |
+| 0.8565        | 8.0   | 16368 | 0.8019          | 0.6494   |
+| 0.8427        | 9.0   | 18414 | 0.8067          | 0.6558   |
+| 0.8332        | 10.0  | 20460 | 0.8165          | 0.6526   |
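
The headline numbers in this card (loss 0.8165, accuracy 0.6526) are the final-epoch row, while some earlier epochs score better on the evaluation set. A short sketch that scans the table for the best epoch by each metric; the rows are copied from the table above, and whether checkpoints for those epochs were kept is not stated in the card.

```python
# (epoch, validation_loss, accuracy), copied from the results table.
ROWS = [
    (1, 0.8330, 0.6382),
    (2, 0.8105, 0.6530),
    (3, 0.7978, 0.6558),
    (4, 0.7791, 0.6594),
    (5, 0.7792, 0.6614),
    (6, 0.7812, 0.6554),
    (7, 0.7762, 0.6538),
    (8, 0.8019, 0.6494),
    (9, 0.8067, 0.6558),
    (10, 0.8165, 0.6526),
]

# Lowest validation loss and highest accuracy can fall on different epochs.
best_by_loss = min(ROWS, key=lambda row: row[1])
best_by_accuracy = max(ROWS, key=lambda row: row[2])

print("lowest validation loss:", best_by_loss)
print("highest accuracy:      ", best_by_accuracy)
```

Validation loss rises after epoch 7 while training loss keeps falling, which is consistent with mild overfitting in the last three epochs.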
 ### Framework versions