daisyshim/kogpt2-base-v2-finetuned-klue-ner

@@ -21,7 +21,7 @@ model-index:
     metrics:
     - name: F1
       type: f1
-      value: 0.41163625019497735
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -31,8 +31,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [skt/kogpt2-base-v2](https://huggingface.co/skt/kogpt2-base-v2) on the klue dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.0877
-- F1: 0.4116
 ## Model description
@@ -57,42 +57,15 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 30
 ### Training results
-| Training Loss | Epoch | Step  | Validation Loss | F1     |
-|:-------------:|:-----:|:-----:|:---------------:|:------:|
-| 0.6183        | 1.0   | 876   | 0.5527          | 0.1741 |
-| 0.4278        | 2.0   | 1752  | 0.5237          | 0.2522 |
-| 0.3488        | 3.0   | 2628  | 0.4521          | 0.3091 |
-| 0.2997        | 4.0   | 3504  | 0.4651          | 0.3192 |
-| 0.2662        | 5.0   | 4380  | 0.4429          | 0.3323 |
-| 0.2393        | 6.0   | 5256  | 0.4527          | 0.3588 |
-| 0.2161        | 7.0   | 6132  | 0.4748          | 0.3580 |
-| 0.1974        | 8.0   | 7008  | 0.4957          | 0.3369 |
-| 0.1807        | 9.0   | 7884  | 0.4911          | 0.3673 |
-| 0.1684        | 10.0  | 8760  | 0.5136          | 0.3597 |
-| 0.1531        | 11.0  | 9636  | 0.4972          | 0.3681 |
-| 0.1405        | 12.0  | 10512 | 0.5515          | 0.3766 |
-| 0.1274        | 13.0  | 11388 | 0.6285          | 0.3617 |
-| 0.1177        | 14.0  | 12264 | 0.5866          | 0.3857 |
-| 0.1078        | 15.0  | 13140 | 0.5878          | 0.3810 |
-| 0.0979        | 16.0  | 14016 | 0.6423          | 0.3764 |
-| 0.0897        | 17.0  | 14892 | 0.6334          | 0.3850 |
-| 0.0805        | 18.0  | 15768 | 0.7045          | 0.3904 |
-| 0.0731        | 19.0  | 16644 | 0.7177          | 0.3941 |
-| 0.0653        | 20.0  | 17520 | 0.7584          | 0.3826 |
-| 0.0594        | 21.0  | 18396 | 0.8027          | 0.3805 |
-| 0.0556        | 22.0  | 19272 | 0.8706          | 0.3736 |
-| 0.0505        | 23.0  | 20148 | 0.8978          | 0.3979 |
-| 0.0472        | 24.0  | 21024 | 0.9166          | 0.3968 |
-| 0.0432        | 25.0  | 21900 | 0.9277          | 0.3978 |
-| 0.0392        | 26.0  | 22776 | 0.9799          | 0.3973 |
-| 0.0372        | 27.0  | 23652 | 1.0087          | 0.4115 |
-| 0.0344        | 28.0  | 24528 | 1.0642          | 0.3996 |
-| 0.032         | 29.0  | 25404 | 1.0861          | 0.4037 |
-| 0.0307        | 30.0  | 26280 | 1.0877          | 0.4116 |
 ### Framework versions

     metrics:
     - name: F1
       type: f1
+      value: 0.37298165525403665
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [skt/kogpt2-base-v2](https://huggingface.co/skt/kogpt2-base-v2) on the klue dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4076
+- F1: 0.3730
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 3
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | F1     |
+|:-------------:|:-----:|:----:|:---------------:|:------:|
+| 0.6084        | 1.0   | 876  | 0.5353          | 0.2118 |
+| 0.3911        | 2.0   | 1752 | 0.4691          | 0.3041 |
+| 0.2855        | 3.0   | 2628 | 0.4076          | 0.3730 |
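
As a rough sanity check on the two result tables in this diff, the sketch below recomputes the steps per epoch from the last logged step and compares the final-epoch metrics of the 30-epoch run (removed) with the 3-epoch run (added). The dictionary and function names are illustrative and not part of the model card; the numbers are copied from the final rows of the tables above.

```python
# Final-epoch rows copied from the two tables in this diff.
# The names old_run / new_run are illustrative only.
old_run = {"epochs": 30, "step": 26280, "val_loss": 1.0877, "f1": 0.4116}
new_run = {"epochs": 3, "step": 2628, "val_loss": 0.4076, "f1": 0.3730}


def steps_per_epoch(run):
    """Optimizer steps per epoch, derived from the last logged step."""
    return run["step"] // run["epochs"]


def delta(metric):
    """Change in a metric going from the old run to the new run."""
    return round(new_run[metric] - old_run[metric], 4)


print("steps per epoch:", steps_per_epoch(old_run), steps_per_epoch(new_run))
print("validation loss change:", delta("val_loss"))
print("F1 change:", delta("f1"))
```

On these numbers both runs take 876 steps per epoch. The 3-epoch run ends with a validation loss 0.6801 lower than the 30-epoch run, and an F1 0.0386 lower.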
 ### Framework versions