Doowon96 commited on
Commit
ab63ecf
1 Parent(s): bb8122e

End of training

Browse files
Files changed (3) hide show
  1. README.md +14 -9
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -16,8 +16,8 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [klue/roberta-base](https://huggingface.co/klue/roberta-base) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 0.3744
20
- - F1: 0.8712
21
 
22
  ## Model description
23
 
@@ -37,22 +37,27 @@ More information needed
37
 
38
  The following hyperparameters were used during training:
39
  - learning_rate: 2e-05
40
- - train_batch_size: 256
41
- - eval_batch_size: 256
42
  - seed: 42
43
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
44
  - lr_scheduler_type: linear
 
45
  - num_epochs: 5
46
 
47
  ### Training results
48
 
49
  | Training Loss | Epoch | Step | Validation Loss | F1 |
50
  |:-------------:|:-----:|:----:|:---------------:|:------:|
51
- | No log | 1.0 | 179 | 0.4018 | 0.8662 |
52
- | No log | 2.0 | 358 | 0.3744 | 0.8712 |
53
- | 0.4101 | 3.0 | 537 | 0.3726 | 0.8694 |
54
- | 0.4101 | 4.0 | 716 | 0.3811 | 0.8686 |
55
- | 0.4101 | 5.0 | 895 | 0.3760 | 0.8707 |
 
 
 
 
56
 
57
 
58
  ### Framework versions
 
16
 
17
  This model is a fine-tuned version of [klue/roberta-base](https://huggingface.co/klue/roberta-base) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 0.3938
20
+ - F1: 0.8672
21
 
22
  ## Model description
23
 
 
37
 
38
  The following hyperparameters were used during training:
39
  - learning_rate: 2e-05
40
+ - train_batch_size: 512
41
+ - eval_batch_size: 512
42
  - seed: 42
43
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
44
  - lr_scheduler_type: linear
45
+ - lr_scheduler_warmup_steps: 200
46
  - num_epochs: 5
47
 
48
  ### Training results
49
 
50
  | Training Loss | Epoch | Step | Validation Loss | F1 |
51
  |:-------------:|:-----:|:----:|:---------------:|:------:|
52
+ | 0.3162 | 0.56 | 50 | 0.4069 | 0.8610 |
53
+ | 0.276 | 1.11 | 100 | 0.4176 | 0.8587 |
54
+ | 0.2706 | 1.67 | 150 | 0.4036 | 0.8631 |
55
+ | 0.2941 | 2.22 | 200 | 0.4232 | 0.8590 |
56
+ | 0.2778 | 2.78 | 250 | 0.3994 | 0.8623 |
57
+ | 0.2575 | 3.33 | 300 | 0.3979 | 0.8628 |
58
+ | 0.2389 | 3.89 | 350 | 0.4008 | 0.8652 |
59
+ | 0.2258 | 4.44 | 400 | 0.3950 | 0.8653 |
60
+ | 0.2097 | 5.0 | 450 | 0.3938 | 0.8672 |
61
 
62
 
63
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0601af144e1dde25f7ae83ea3a203dfe17f774fe95fa59ba9de3acada2c4870f
3
  size 442518124
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b3fadcecac10404d64f1be2617f4248eee4d5710ae0a05bb5d35eba20fb0826f
3
  size 442518124
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:15756f198a4e4540ba4a4699d9c3d5c5c38bcf894d59479515a1ee4e6223285a
3
  size 4728
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c3d7e2f768130a7caf7a2b60b45fb790577a85b1c6c4e57e9a538a2984f58e6c
3
  size 4728