MingMingBang98
/

kogpt2-base-v2

Token Classification

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

MingMingBang98 commited on May 7, 2023

Commit

5612b19

•

1 Parent(s): a3f9228

update model card README.md

Files changed (1) hide show

README.md +8 -8

README.md CHANGED Viewed

@@ -21,7 +21,7 @@ model-index:
     metrics:
     - name: F1
       type: f1
-      value: 0.37298165525403665
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -31,8 +31,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [skt/kogpt2-base-v2](https://huggingface.co/skt/kogpt2-base-v2) on the klue dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4076
-- F1: 0.3730
 ## Model description
@@ -52,8 +52,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
-- train_batch_size: 24
-- eval_batch_size: 24
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -63,9 +63,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
-| 0.6084        | 1.0   | 876  | 0.5353          | 0.2118 |
-| 0.3911        | 2.0   | 1752 | 0.4691          | 0.3041 |
-| 0.2855        | 3.0   | 2628 | 0.4076          | 0.3730 |
 ### Framework versions

     metrics:
     - name: F1
       type: f1
+      value: 0.37186865267433983
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [skt/kogpt2-base-v2](https://huggingface.co/skt/kogpt2-base-v2) on the klue dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4123
+- F1: 0.3719
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 | Training Loss | Epoch | Step | Validation Loss | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
+| 0.6113        | 1.0   | 1313 | 0.5320          | 0.2107 |
+| 0.3942        | 2.0   | 2626 | 0.4888          | 0.2891 |
+| 0.2845        | 3.0   | 3939 | 0.4123          | 0.3719 |
 ### Framework versions