gokuls
/

hBERTv1_new_pretrain_w_init__mrpc

@@ -1,6 +1,4 @@
 ---
-language:
-- en
 tags:
 - generated_from_trainer
 datasets:
@@ -15,7 +13,7 @@ model-index:
       name: Text Classification
       type: text-classification
     dataset:
-      name: GLUE MRPC
       type: glue
       config: mrpc
       split: validation
@@ -23,10 +21,10 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: 0.6838235294117647
     - name: F1
       type: f1
-      value: 0.8122270742358079
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -34,12 +32,12 @@ should probably proofread and complete it, then remove this comment. -->
 # hBERTv1_new_pretrain_w_init__mrpc
-This model is a fine-tuned version of [gokuls/bert_12_layer_model_v1_complete_training_new_wt_init](https://huggingface.co/gokuls/bert_12_layer_model_v1_complete_training_new_wt_init) on the GLUE MRPC dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.6237
-- Accuracy: 0.6838
-- F1: 0.8122
-- Combined Score: 0.7480
 ## Model description
@@ -58,7 +56,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0005
 - train_batch_size: 128
 - eval_batch_size: 128
 - seed: 10
@@ -71,15 +69,15 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     | Combined Score |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:--------------:|
-| 3.2785        | 1.0   | 29   | 0.6238          | 0.6838   | 0.8122 | 0.7480         |
-| 0.7343        | 2.0   | 58   | 0.6786          | 0.6838   | 0.8122 | 0.7480         |
-| 0.6377        | 3.0   | 87   | 0.6245          | 0.6838   | 0.8122 | 0.7480         |
-| 0.6353        | 4.0   | 116  | 0.6237          | 0.6838   | 0.8122 | 0.7480         |
-| 0.6344        | 5.0   | 145  | 0.6244          | 0.6838   | 0.8122 | 0.7480         |
-| 0.6314        | 6.0   | 174  | 0.6324          | 0.6838   | 0.8122 | 0.7480         |
-| 0.6431        | 7.0   | 203  | 0.6402          | 0.6838   | 0.8122 | 0.7480         |
-| 0.6347        | 8.0   | 232  | 0.6336          | 0.6838   | 0.8122 | 0.7480         |
-| 0.6343        | 9.0   | 261  | 0.6258          | 0.6838   | 0.8122 | 0.7480         |
 ### Framework versions

 ---
 tags:
 - generated_from_trainer
 datasets:
       name: Text Classification
       type: text-classification
     dataset:
+      name: glue
       type: glue
       config: mrpc
       split: validation
     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.5735294117647058
     - name: F1
       type: f1
+      value: 0.65748031496063
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # hBERTv1_new_pretrain_w_init__mrpc
+This model is a fine-tuned version of [gokuls/bert_12_layer_model_v1_complete_training_new_wt_init](https://huggingface.co/gokuls/bert_12_layer_model_v1_complete_training_new_wt_init) on the glue dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.3076
+- Accuracy: 0.5735
+- F1: 0.6575
+- Combined Score: 0.6155
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 4e-05
 - train_batch_size: 128
 - eval_batch_size: 128
 - seed: 10
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     | Combined Score |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:--------------:|
+| 0.7111        | 1.0   | 29   | 0.6564          | 0.6838   | 0.8122 | 0.7480         |
+| 0.6641        | 2.0   | 58   | 0.6160          | 0.6838   | 0.8122 | 0.7480         |
+| 0.6156        | 3.0   | 87   | 0.6354          | 0.6838   | 0.8122 | 0.7480         |
+| 0.5817        | 4.0   | 116  | 0.6082          | 0.6863   | 0.7895 | 0.7379         |
+| 0.5091        | 5.0   | 145  | 0.7812          | 0.5074   | 0.5157 | 0.5115         |
+| 0.3973        | 6.0   | 174  | 0.7949          | 0.6544   | 0.7565 | 0.7054         |
+| 0.2966        | 7.0   | 203  | 1.0388          | 0.6078   | 0.6887 | 0.6483         |
+| 0.2024        | 8.0   | 232  | 1.0065          | 0.6201   | 0.7124 | 0.6663         |
+| 0.1621        | 9.0   | 261  | 1.3076          | 0.5735   | 0.6575 | 0.6155         |
 ### Framework versions