anirudh21
/

albert-large-v2-finetuned-rte

Text Classification

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

anirudh21 commited on Jan 27, 2022

Commit

2caa23a

·

1 Parent(s): ac7e91a

update model card README.md

Files changed (1) hide show

README.md +10 -10

README.md CHANGED Viewed

@@ -19,7 +19,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: 0.5342960288808665
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -29,8 +29,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [albert-large-v2](https://huggingface.co/albert-large-v2) on the glue dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.6914
-- Accuracy: 0.5343
 ## Model description
@@ -50,8 +50,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -61,11 +61,11 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| No log        | 1.0   | 63   | 0.7635          | 0.4729   |
-| No log        | 2.0   | 126  | 0.7056          | 0.4549   |
-| No log        | 3.0   | 189  | 0.7028          | 0.4910   |
-| No log        | 4.0   | 252  | 0.6914          | 0.5343   |
-| No log        | 5.0   | 315  | 0.6961          | 0.5126   |
 ### Framework versions

     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.631768953068592
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [albert-large-v2](https://huggingface.co/albert-large-v2) on the glue dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6843
+- Accuracy: 0.6318
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
+| No log        | 1.0   | 13   | 0.6771          | 0.5740   |
+| No log        | 2.0   | 26   | 0.6675          | 0.6029   |
+| No log        | 3.0   | 39   | 0.7132          | 0.5632   |
+| No log        | 4.0   | 52   | 0.6843          | 0.6318   |
+| No log        | 5.0   | 65   | 0.7010          | 0.6282   |
 ### Framework versions