wonlk/kogpt2-base-v2-finetuned-klue-ner

Token Classification

Generated from Trainer

text-generation-inference

Inference Endpoints

wonlk committed on May 6, 2023

Commit f992cc0 • 1 Parent(s): dc2cc27

update model card README.md

Files changed (1)

README.md +10 -8

@@ -21,7 +21,7 @@ model-index:
     metrics:
     - name: F1
       type: f1
-      value: 0.37298165525403665
+      value: 0.3963993723676604
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -31,8 +31,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [skt/kogpt2-base-v2](https://huggingface.co/skt/kogpt2-base-v2) on the klue dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4076
-- F1: 0.3730
+- Loss: 0.4119
+- F1: 0.3964
 ## Model description
@@ -51,21 +51,23 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 5e-05
+- learning_rate: 3e-05
 - train_batch_size: 24
 - eval_batch_size: 24
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3
+- num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
-| 0.6084        | 1.0   | 876  | 0.5353          | 0.2118 |
-| 0.3911        | 2.0   | 1752 | 0.4691          | 0.3041 |
-| 0.2855        | 3.0   | 2628 | 0.4076          | 0.3730 |
+| 0.6174        | 1.0   | 876  | 0.5155          | 0.2295 |
+| 0.4045        | 2.0   | 1752 | 0.4996          | 0.2834 |
+| 0.3157        | 3.0   | 2628 | 0.4186          | 0.3107 |
+| 0.2569        | 4.0   | 3504 | 0.4066          | 0.3805 |
+| 0.2148        | 5.0   | 4380 | 0.4119          | 0.3964 |
 ### Framework versions
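
The two runs in this diff can be compared directly from their "Training results" rows. The following is a minimal sketch, not part of the commit: the names `OLD_RUN`, `NEW_RUN`, and `summarize` are hypothetical, and the numbers are copied by hand from the removed and added table rows.

```python
# Rows copied from the "Training results" tables in the diff.
# Column order: (training_loss, epoch, step, validation_loss, f1)
# OLD_RUN / NEW_RUN / summarize are illustrative names, not from the repository.
OLD_RUN = [  # learning_rate 5e-05, num_epochs 3
    (0.6084, 1, 876, 0.5353, 0.2118),
    (0.3911, 2, 1752, 0.4691, 0.3041),
    (0.2855, 3, 2628, 0.4076, 0.3730),
]
NEW_RUN = [  # learning_rate 3e-05, num_epochs 5
    (0.6174, 1, 876, 0.5155, 0.2295),
    (0.4045, 2, 1752, 0.4996, 0.2834),
    (0.3157, 3, 2628, 0.4186, 0.3107),
    (0.2569, 4, 3504, 0.4066, 0.3805),
    (0.2148, 5, 4380, 0.4119, 0.3964),
]


def summarize(rows):
    """Return a few summary figures for one training run."""
    first, last = rows[0], rows[-1]
    best_loss_row = min(rows, key=lambda r: r[3])  # lowest validation loss
    best_f1_row = max(rows, key=lambda r: r[4])    # highest F1
    return {
        "steps_per_epoch": first[2] // first[1],
        "final_val_loss": last[3],
        "final_f1": last[4],
        "best_val_loss_epoch": best_loss_row[1],
        "best_f1_epoch": best_f1_row[1],
    }


if __name__ == "__main__":
    for name, rows in (("old", OLD_RUN), ("new", NEW_RUN)):
        print(name, summarize(rows))
    # Difference between the final F1 scores reported in the two model cards.
    print("F1 change:", round(NEW_RUN[-1][4] - OLD_RUN[-1][4], 4))
```

On these rows, the new run reports a higher final F1 (0.3964 vs 0.3730) but a slightly higher final validation loss (0.4119 vs 0.4076). Its lowest validation loss occurs at epoch 4 (0.4066), one epoch before its F1 peak. Both runs log 876 steps per epoch, which is consistent with the unchanged `train_batch_size` of 24.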