liuyanchen1015
/

roberta-base-mnli_ChcE

@@ -3,19 +3,19 @@ license: mit
 tags:
 - generated_from_trainer
 model-index:
-- name: mnli_ChcE
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# mnli_ChcE
 This model is a fine-tuned version of [WillHeld/roberta-base-mnli](https://huggingface.co/WillHeld/roberta-base-mnli) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5320
-- Acc: 0.8721
 ## Model description
@@ -40,29 +40,41 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3
 ### Training results
 | Training Loss | Epoch | Step  | Validation Loss | Acc    |
 |:-------------:|:-----:|:-----:|:---------------:|:------:|
-| 0.3096        | 0.17  | 2000  | 0.4133          | 0.8553 |
-| 0.3028        | 0.33  | 4000  | 0.3985          | 0.8665 |
-| 0.3005        | 0.5   | 6000  | 0.3883          | 0.8671 |
-| 0.2973        | 0.67  | 8000  | 0.3959          | 0.8648 |
-| 0.2921        | 0.83  | 10000 | 0.4101          | 0.8662 |
-| 0.2934        | 1.0   | 12000 | 0.4040          | 0.8675 |
-| 0.2034        | 1.17  | 14000 | 0.4498          | 0.8651 |
-| 0.2021        | 1.33  | 16000 | 0.4749          | 0.8646 |
-| 0.2027        | 1.5   | 18000 | 0.4369          | 0.8685 |
-| 0.2001        | 1.67  | 20000 | 0.4337          | 0.8699 |
-| 0.2014        | 1.83  | 22000 | 0.4438          | 0.8672 |
-| 0.1945        | 2.0   | 24000 | 0.4623          | 0.8679 |
-| 0.1371        | 2.17  | 26000 | 0.5456          | 0.8705 |
-| 0.1404        | 2.33  | 28000 | 0.5174          | 0.8687 |
-| 0.1384        | 2.5   | 30000 | 0.5245          | 0.8708 |
-| 0.1347        | 2.67  | 32000 | 0.5402          | 0.8711 |
-| 0.137         | 2.84  | 34000 | 0.5320          | 0.8721 |
 ### Framework versions

 tags:
 - generated_from_trainer
 model-index:
+- name: roberta-base-mnli_ChcE
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# roberta-base-mnli_ChcE
 This model is a fine-tuned version of [WillHeld/roberta-base-mnli](https://huggingface.co/WillHeld/roberta-base-mnli) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.7094
+- Acc: 0.8698
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step  | Validation Loss | Acc    |
 |:-------------:|:-----:|:-----:|:---------------:|:------:|
+| 0.3102        | 0.17  | 2000  | 0.4135          | 0.8545 |
+| 0.3046        | 0.33  | 4000  | 0.4024          | 0.8645 |
+| 0.3038        | 0.5   | 6000  | 0.3936          | 0.8668 |
+| 0.3012        | 0.67  | 8000  | 0.4007          | 0.8625 |
+| 0.2979        | 0.83  | 10000 | 0.4235          | 0.8620 |
+| 0.2997        | 1.0   | 12000 | 0.4031          | 0.8644 |
+| 0.2099        | 1.17  | 14000 | 0.4393          | 0.8633 |
+| 0.2114        | 1.33  | 16000 | 0.4662          | 0.8628 |
+| 0.2147        | 1.5   | 18000 | 0.4331          | 0.8648 |
+| 0.2122        | 1.67  | 20000 | 0.4166          | 0.8702 |
+| 0.2156        | 1.83  | 22000 | 0.4463          | 0.8633 |
+| 0.2117        | 2.0   | 24000 | 0.4637          | 0.8680 |
+| 0.1469        | 2.17  | 26000 | 0.5211          | 0.8681 |
+| 0.1526        | 2.33  | 28000 | 0.5206          | 0.8620 |
+| 0.1494        | 2.5   | 30000 | 0.5168          | 0.8664 |
+| 0.1519        | 2.67  | 32000 | 0.4830          | 0.8700 |
+| 0.152         | 2.84  | 34000 | 0.5465          | 0.8636 |
+| 0.1498        | 3.0   | 36000 | 0.5550          | 0.8680 |
+| 0.1131        | 3.17  | 38000 | 0.6764          | 0.8602 |
+| 0.1135        | 3.34  | 40000 | 0.6200          | 0.8657 |
+| 0.1175        | 3.5   | 42000 | 0.5889          | 0.8671 |
+| 0.1156        | 3.67  | 44000 | 0.6300          | 0.8663 |
+| 0.1104        | 3.84  | 46000 | 0.6045          | 0.8690 |
+| 0.1111        | 4.0   | 48000 | 0.6413          | 0.8694 |
+| 0.086         | 4.17  | 50000 | 0.7271          | 0.8658 |
+| 0.0895        | 4.34  | 52000 | 0.7274          | 0.8683 |
+| 0.0867        | 4.5   | 54000 | 0.7226          | 0.8658 |
+| 0.0886        | 4.67  | 56000 | 0.7182          | 0.8691 |
+| 0.0849        | 4.84  | 58000 | 0.7094          | 0.8698 |
 ### Framework versions