gokuls
/

hBERTv1_new_pretrain_w_init__stsb

@@ -1,6 +1,4 @@
 ---
-language:
-- en
 tags:
 - generated_from_trainer
 datasets:
@@ -14,7 +12,7 @@ model-index:
       name: Text Classification
       type: text-classification
     dataset:
-      name: GLUE STSB
       type: glue
       config: stsb
       split: validation
@@ -22,7 +20,7 @@ model-index:
     metrics:
     - name: Spearmanr
       type: spearmanr
-      value: 0.06376297357455894
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -30,12 +28,12 @@ should probably proofread and complete it, then remove this comment. -->
 # hBERTv1_new_pretrain_w_init__stsb
-This model is a fine-tuned version of [gokuls/bert_12_layer_model_v1_complete_training_new_wt_init](https://huggingface.co/gokuls/bert_12_layer_model_v1_complete_training_new_wt_init) on the GLUE STSB dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.2578
-- Pearson: 0.0646
-- Spearmanr: 0.0638
-- Combined Score: 0.0642
 ## Model description
@@ -54,7 +52,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0005
 - train_batch_size: 128
 - eval_batch_size: 128
 - seed: 10
@@ -67,14 +65,12 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Pearson | Spearmanr | Combined Score |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:---------:|:--------------:|
-| 61.8804       | 1.0   | 45   | 2.3512          | -0.0440 | 0.0212    | -0.0114        |
-| 2.6713        | 2.0   | 90   | 3.0563          | 0.0479  | 0.0125    | 0.0302         |
-| 2.3803        | 3.0   | 135  | 2.2578          | 0.0646  | 0.0638    | 0.0642         |
-| 2.2516        | 4.0   | 180  | 2.3252          | 0.0646  | 0.0638    | 0.0642         |
-| 2.243         | 5.0   | 225  | 2.2928          | 0.0646  | 0.0638    | 0.0642         |
-| 2.1764        | 6.0   | 270  | 2.2826          | 0.0632  | 0.0058    | 0.0345         |
-| 2.1933        | 7.0   | 315  | 2.3738          | 0.0495  | 0.0497    | 0.0496         |
-| 2.2067        | 8.0   | 360  | 2.2834          | 0.0372  | -0.0148   | 0.0112         |
 ### Framework versions

 ---
 tags:
 - generated_from_trainer
 datasets:
       name: Text Classification
       type: text-classification
     dataset:
+      name: glue
       type: glue
       config: stsb
       split: validation
     metrics:
     - name: Spearmanr
       type: spearmanr
+      value: 0.4406348486823185
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # hBERTv1_new_pretrain_w_init__stsb
+This model is a fine-tuned version of [gokuls/bert_12_layer_model_v1_complete_training_new_wt_init](https://huggingface.co/gokuls/bert_12_layer_model_v1_complete_training_new_wt_init) on the glue dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.8004
+- Pearson: 0.4571
+- Spearmanr: 0.4406
+- Combined Score: 0.4489
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 4e-05
 - train_batch_size: 128
 - eval_batch_size: 128
 - seed: 10
 | Training Loss | Epoch | Step | Validation Loss | Pearson | Spearmanr | Combined Score |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:---------:|:--------------:|
+| 2.5056        | 1.0   | 45   | 2.2584          | 0.0949  | 0.0892    | 0.0920         |
+| 2.1254        | 2.0   | 90   | 2.6871          | 0.1250  | 0.1231    | 0.1241         |
+| 1.9839        | 3.0   | 135  | 2.2709          | 0.1790  | 0.1840    | 0.1815         |
+| 1.6299        | 4.0   | 180  | 2.5115          | 0.2691  | 0.2797    | 0.2744         |
+| 1.3155        | 5.0   | 225  | 2.4555          | 0.3453  | 0.3437    | 0.3445         |
+| 0.9686        | 6.0   | 270  | 2.8004          | 0.4571  | 0.4406    | 0.4489         |
 ### Framework versions