rurupang/roberta-base-finetuned-sts

rurupang committed on Mar 24, 2022

Commit ca10106 • 1 Parent(s): 79bc713

update model card README.md

Files changed (1): README.md +36 -22

@@ -1,11 +1,24 @@
 ---
 tags:
 - generated_from_trainer
 metrics:
 - pearsonr
 model-index:
 - name: roberta-base-finetuned-sts
-  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -13,10 +26,10 @@ should probably proofread and complete it, then remove this comment. -->
 # roberta-base-finetuned-sts
-This model is a fine-tuned version of [klue/roberta-base](https://huggingface.co/klue/roberta-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2066
-- Pearsonr: 0.9571
 ## Model description
@@ -35,12 +48,13 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
-- train_batch_size: 128
-- eval_batch_size: 128
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 15
 - mixed_precision_training: Native AMP
@@ -48,21 +62,21 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Pearsonr |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| No log        | 1.0   | 82   | 0.2086          | 0.9500   |
-| No log        | 2.0   | 164  | 0.2020          | 0.9548   |
-| No log        | 3.0   | 246  | 0.2583          | 0.9521   |
-| No log        | 4.0   | 328  | 0.1784          | 0.9554   |
-| No log        | 5.0   | 410  | 0.1956          | 0.9530   |
-| No log        | 6.0   | 492  | 0.2147          | 0.9565   |
-| 0.1027        | 7.0   | 574  | 0.1954          | 0.9556   |
-| 0.1027        | 8.0   | 656  | 0.2360          | 0.9546   |
-| 0.1027        | 9.0   | 738  | 0.2066          | 0.9571   |
-| 0.1027        | 10.0  | 820  | 0.1884          | 0.9566   |
-| 0.1027        | 11.0  | 902  | 0.2237          | 0.9551   |
-| 0.1027        | 12.0  | 984  | 0.2175          | 0.9562   |
-| 0.0393        | 13.0  | 1066 | 0.2132          | 0.9553   |
-| 0.0393        | 14.0  | 1148 | 0.2032          | 0.9559   |
-| 0.0393        | 15.0  | 1230 | 0.2035          | 0.9552   |
 ### Framework versions

 ---
 tags:
 - generated_from_trainer
+datasets:
+- klue
 metrics:
 - pearsonr
 model-index:
 - name: roberta-base-finetuned-sts
+  results:
+  - task:
+      name: Text Classification
+      type: text-classification
+    dataset:
+      name: klue
+      type: klue
+      args: sts
+    metrics:
+    - name: Pearsonr
+      type: pearsonr
+      value: 0.956039443806831
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # roberta-base-finetuned-sts
+This model is a fine-tuned version of [klue/roberta-base](https://huggingface.co/klue/roberta-base) on the klue dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1999
+- Pearsonr: 0.9560
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1e-05
+- train_batch_size: 32
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 200
 - num_epochs: 15
 - mixed_precision_training: Native AMP
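
The updated hyperparameters combine `lr_scheduler_type: linear` with `lr_scheduler_warmup_steps: 200`. As a rough illustration of what that implies, the sketch below computes the learning rate per optimizer step in plain Python. It assumes the usual linear-warmup, linear-decay-to-zero shape, and it takes the total of 4935 steps from the last row of the results table; the function name `linear_lr_with_warmup` is hypothetical and not part of the model card.

```python
def linear_lr_with_warmup(step, base_lr=1e-05, warmup_steps=200, total_steps=4935):
    """Learning rate at a given optimizer step.

    Assumption: the rate rises linearly from 0 to base_lr over the warmup
    steps, then falls linearly to 0 at total_steps.
    """
    if step < warmup_steps:
        # Warmup phase: scale base_lr by the fraction of warmup completed.
        return base_lr * step / max(1, warmup_steps)
    # Decay phase: scale base_lr by the fraction of post-warmup steps remaining.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    # Print the rate at a few points: start, mid-warmup, peak, mid-run, end.
    for step in (0, 100, 200, 2500, 4935):
        print(step, linear_lr_with_warmup(step))
```

Under these assumptions the peak rate of 1e-05 is reached at step 200, which falls inside the first epoch (329 steps).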
 | Training Loss | Epoch | Step | Validation Loss | Pearsonr |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
+| No log        | 1.0   | 329  | 0.2462          | 0.9478   |
+| 1.2505        | 2.0   | 658  | 0.1671          | 0.9530   |
+| 1.2505        | 3.0   | 987  | 0.1890          | 0.9525   |
+| 0.133         | 4.0   | 1316 | 0.2360          | 0.9548   |
+| 0.0886        | 5.0   | 1645 | 0.2265          | 0.9528   |
+| 0.0886        | 6.0   | 1974 | 0.2097          | 0.9518   |
+| 0.0687        | 7.0   | 2303 | 0.2281          | 0.9523   |
+| 0.0539        | 8.0   | 2632 | 0.2212          | 0.9542   |
+| 0.0539        | 9.0   | 2961 | 0.1843          | 0.9532   |
+| 0.045         | 10.0  | 3290 | 0.1999          | 0.9560   |
+| 0.0378        | 11.0  | 3619 | 0.2357          | 0.9533   |
+| 0.0378        | 12.0  | 3948 | 0.2134          | 0.9541   |
+| 0.033         | 13.0  | 4277 | 0.2273          | 0.9540   |
+| 0.03          | 14.0  | 4606 | 0.2148          | 0.9533   |
+| 0.03          | 15.0  | 4935 | 0.2207          | 0.9534   |
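
The evaluation figures reported at the top of the updated card (Loss 0.1999, Pearsonr 0.9560) can be cross-checked against the table above. The sketch below copies the epoch, validation loss, and Pearsonr columns by hand and looks up the best row under each criterion. The card does not say how the final checkpoint was selected, so reading this as "best Pearsonr wins" is an assumption.

```python
# (epoch, validation_loss, pearsonr), copied from the updated results table.
rows = [
    (1, 0.2462, 0.9478),
    (2, 0.1671, 0.9530),
    (3, 0.1890, 0.9525),
    (4, 0.2360, 0.9548),
    (5, 0.2265, 0.9528),
    (6, 0.2097, 0.9518),
    (7, 0.2281, 0.9523),
    (8, 0.2212, 0.9542),
    (9, 0.1843, 0.9532),
    (10, 0.1999, 0.9560),
    (11, 0.2357, 0.9533),
    (12, 0.2134, 0.9541),
    (13, 0.2273, 0.9540),
    (14, 0.2148, 0.9533),
    (15, 0.2207, 0.9534),
]

# Row with the highest Pearson correlation on the evaluation set.
best_by_pearson = max(rows, key=lambda r: r[2])
# Row with the lowest validation loss, for comparison.
best_by_loss = min(rows, key=lambda r: r[1])

print("best Pearsonr:", best_by_pearson)
print("lowest loss:  ", best_by_loss)
```

The highest Pearsonr is the epoch 10 row, which matches the reported Loss and Pearsonr, while the lowest validation loss occurs at epoch 2. The two criteria therefore pick different checkpoints for this run.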
 ### Framework versions