Doowon96/roberta-base-finetuned-ynat_bench

Text Classification

Generated from Trainer

Inference Endpoints


Doowon96 committed on Jan 23

Commit ab63ecf • 1 Parent(s): bb8122e

End of training

Files changed (3)

README.md +14 -9
model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED

@@ -16,8 +16,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [klue/roberta-base](https://huggingface.co/klue/roberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3744
-- F1: 0.8712
 ## Model description
@@ -37,22 +37,27 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 256
-- eval_batch_size: 256
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
-| No log        | 1.0   | 179  | 0.4018          | 0.8662 |
-| No log        | 2.0   | 358  | 0.3744          | 0.8712 |
-| 0.4101        | 3.0   | 537  | 0.3726          | 0.8694 |
-| 0.4101        | 4.0   | 716  | 0.3811          | 0.8686 |
-| 0.4101        | 5.0   | 895  | 0.3760          | 0.8707 |
 ### Framework versions

 This model is a fine-tuned version of [klue/roberta-base](https://huggingface.co/klue/roberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3938
+- F1: 0.8672
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 512
+- eval_batch_size: 512
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 200
 - num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
+| 0.3162        | 0.56  | 50   | 0.4069          | 0.8610 |
+| 0.276         | 1.11  | 100  | 0.4176          | 0.8587 |
+| 0.2706        | 1.67  | 150  | 0.4036          | 0.8631 |
+| 0.2941        | 2.22  | 200  | 0.4232          | 0.8590 |
+| 0.2778        | 2.78  | 250  | 0.3994          | 0.8623 |
+| 0.2575        | 3.33  | 300  | 0.3979          | 0.8628 |
+| 0.2389        | 3.89  | 350  | 0.4008          | 0.8652 |
+| 0.2258        | 4.44  | 400  | 0.3950          | 0.8653 |
+| 0.2097        | 5.0   | 450  | 0.3938          | 0.8672 |
 ### Framework versions
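
Note on the step counts in this README diff: the old run ends at step 895 over 5 epochs (179 steps per epoch), while the new run ends at step 450 (90 steps per epoch). That is about half, which is consistent with the batch size doubling from 256 to 512. The sketch below rechecks the Epoch column of the new table and illustrates the learning rate that a linear schedule with 200 warmup steps would produce. The total of 450 steps is read off the last table row, and the schedule shape (linear warmup to the base rate, then linear decay to zero) is an assumption based on `lr_scheduler_type: linear`; the function names are hypothetical and the commit does not show the scheduler that was actually instantiated.

```python
# Recheck the Epoch column of the new training table and sketch the
# learning-rate curve implied by the listed hyperparameters.
# Assumption: 450 total optimizer steps over 5 epochs (last table row).

TOTAL_STEPS = 450
NUM_EPOCHS = 5
STEPS_PER_EPOCH = TOTAL_STEPS / NUM_EPOCHS  # 90.0
BASE_LR = 2e-05        # learning_rate from the README
WARMUP_STEPS = 200     # lr_scheduler_warmup_steps from the README


def epoch_at(step: int) -> float:
    """Fractional epoch reached at a given step, rounded as in the table."""
    return round(step / STEPS_PER_EPOCH, 2)


def linear_warmup_lr(step: int) -> float:
    """Assumed schedule: linear warmup to BASE_LR, then linear decay to 0."""
    if step < WARMUP_STEPS:
        return BASE_LR * step / WARMUP_STEPS
    remaining = max(0, TOTAL_STEPS - step)
    return BASE_LR * remaining / (TOTAL_STEPS - WARMUP_STEPS)


if __name__ == "__main__":
    # One line per evaluation step logged in the new table (every 50 steps).
    for step in range(50, TOTAL_STEPS + 1, 50):
        print(f"step {step:3d}  epoch {epoch_at(step):.2f}  "
              f"lr {linear_warmup_lr(step):.2e}")
```

Under these assumptions the warmup covers 200 of 450 steps, so the base learning rate would only be reached partway through the third epoch. This may be worth keeping in mind when comparing the two runs' validation numbers.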

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0601af144e1dde25f7ae83ea3a203dfe17f774fe95fa59ba9de3acada2c4870f
 size 442518124

 version https://git-lfs.github.com/spec/v1
+oid sha256:b3fadcecac10404d64f1be2617f4248eee4d5710ae0a05bb5d35eba20fb0826f
 size 442518124

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:15756f198a4e4540ba4a4699d9c3d5c5c38bcf894d59479515a1ee4e6223285a
 size 4728

 version https://git-lfs.github.com/spec/v1
+oid sha256:c3d7e2f768130a7caf7a2b60b45fb790577a85b1c6c4e57e9a538a2984f58e6c
 size 4728
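
Note on the two binary files: both are stored as Git LFS pointers, so the diff shows only a changed `oid` while `size` stays the same. A minimal sketch of reading such a pointer follows, to make the "same size, different content" observation explicit. The helper name `parse_lfs_pointer` is hypothetical, and the parser handles only the three keys that appear in this diff.

```python
# Parse the Git LFS pointer text shown in the diff and compare old vs. new.
# Only the keys visible in this commit (version, oid, size) are handled.

OLD_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:0601af144e1dde25f7ae83ea3a203dfe17f774fe95fa59ba9de3acada2c4870f
size 442518124
"""

NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:b3fadcecac10404d64f1be2617f4248eee4d5710ae0a05bb5d35eba20fb0826f
size 442518124
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line, then separate the hash algorithm and digest."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


if __name__ == "__main__":
    old, new = parse_lfs_pointer(OLD_POINTER), parse_lfs_pointer(NEW_POINTER)
    print("size unchanged:", old["size"] == new["size"])
    print("content changed:", old["digest"] != new["digest"])
```

An unchanged size with a new digest is what one would expect when the same architecture is retrained: the tensor shapes are identical and only the weight values differ.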