gokuls
/

hBERTv1_new_pretrain_sst2

@@ -1,6 +1,4 @@
 ---
-language:
-- en
 tags:
 - generated_from_trainer
 datasets:
@@ -14,7 +12,7 @@ model-index:
       name: Text Classification
       type: text-classification
     dataset:
-      name: GLUE SST2
       type: glue
       config: sst2
       split: validation
@@ -22,7 +20,7 @@ model-index:
     metrics:
     - name: Accuracy
       type: accuracy
-      value: 0.5091743119266054
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -30,10 +28,10 @@ should probably proofread and complete it, then remove this comment. -->
 # hBERTv1_new_pretrain_sst2
-This model is a fine-tuned version of [gokuls/bert_12_layer_model_v1_complete_training_new](https://huggingface.co/gokuls/bert_12_layer_model_v1_complete_training_new) on the GLUE SST2 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 15.8277
-- Accuracy: 0.5092
 ## Model description
@@ -52,7 +50,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0005
 - train_batch_size: 128
 - eval_batch_size: 128
 - seed: 10
@@ -65,23 +63,14 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 23.7143       | 1.0   | 527  | 22.5487         | 0.5092   |
-| 20.5968       | 2.0   | 1054 | 21.7701         | 0.5092   |
-| 19.7199       | 3.0   | 1581 | 18.5677         | 0.5092   |
-| 19.5252       | 4.0   | 2108 | 21.5728         | 0.5092   |
-| 19.812        | 5.0   | 2635 | 15.8377         | 0.5092   |
-| 18.4467       | 6.0   | 3162 | 15.8291         | 0.5092   |
-| 18.04         | 7.0   | 3689 | 15.8572         | 0.5092   |
-| 18.0932       | 8.0   | 4216 | 15.8288         | 0.5092   |
-| 18.1005       | 9.0   | 4743 | 15.8288         | 0.5092   |
-| 18.0769       | 10.0  | 5270 | 15.8291         | 0.5092   |
-| 17.912        | 11.0  | 5797 | 15.8291         | 0.5092   |
-| 17.887        | 12.0  | 6324 | 15.8277         | 0.5092   |
-| 18.1205       | 13.0  | 6851 | 15.8288         | 0.5092   |
-| 18.0703       | 14.0  | 7378 | 15.8291         | 0.5092   |
-| 18.044        | 15.0  | 7905 | 15.8294         | 0.5092   |
-| 18.0354       | 16.0  | 8432 | 15.8294         | 0.5092   |
-| 17.8629       | 17.0  | 8959 | 15.8297         | 0.5092   |
 ### Framework versions

 ---
 tags:
 - generated_from_trainer
 datasets:
       name: Text Classification
       type: text-classification
     dataset:
+      name: glue
       type: glue
       config: sst2
       split: validation
     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.7878440366972477
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # hBERTv1_new_pretrain_sst2
+This model is a fine-tuned version of [gokuls/bert_12_layer_model_v1_complete_training_new](https://huggingface.co/gokuls/bert_12_layer_model_v1_complete_training_new) on the glue dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6715
+- Accuracy: 0.7878
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 4e-05
 - train_batch_size: 128
 - eval_batch_size: 128
 - seed: 10
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
+| 0.4258        | 1.0   | 527  | 0.4994          | 0.8062   |
+| 0.2652        | 2.0   | 1054 | 0.5633          | 0.8005   |
+| 0.2214        | 3.0   | 1581 | 0.4752          | 0.7878   |
+| 0.2014        | 4.0   | 2108 | 0.5329          | 0.7890   |
+| 0.1813        | 5.0   | 2635 | 0.5410          | 0.7924   |
+| 0.1679        | 6.0   | 3162 | 0.5857          | 0.8085   |
+| 0.1526        | 7.0   | 3689 | 0.7654          | 0.8039   |
+| 0.1405        | 8.0   | 4216 | 0.6715          | 0.7878   |
 ### Framework versions