madatnlp committed on
Commit
a27ad53
1 Parent(s): 43639dc

End of training

Files changed (2)
  1. README.md +19 -4
  2. tf_model.h5 +1 -1
README.md CHANGED
@@ -14,9 +14,9 @@ probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [skt/kogpt2-base-v2](https://huggingface.co/skt/kogpt2-base-v2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 2.7142
-- Validation Loss: 1.8683
-- Epoch: 0
+- Train Loss: 0.3184
+- Validation Loss: 1.4046
+- Epoch: 15
 
 ## Model description
 
@@ -35,7 +35,7 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- optimizer: {'name': 'Adam', 'learning_rate': 5e-05, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
+- optimizer: {'name': 'Adam', 'learning_rate': 2.2999999e-05, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
 - training_precision: float32
 
 ### Training results
@@ -43,6 +43,21 @@ The following hyperparameters were used during training:
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
 | 2.7142 | 1.8683 | 0 |
+| 1.6077 | 1.4417 | 1 |
+| 1.2458 | 1.3161 | 2 |
+| 1.0396 | 1.2704 | 3 |
+| 0.8848 | 1.2818 | 4 |
+| 0.7634 | 1.2579 | 5 |
+| 0.6699 | 1.2724 | 6 |
+| 0.5948 | 1.2718 | 7 |
+| 0.5306 | 1.3300 | 8 |
+| 0.4832 | 1.3377 | 9 |
+| 0.4401 | 1.3038 | 10 |
+| 0.4053 | 1.3622 | 11 |
+| 0.3782 | 1.3577 | 12 |
+| 0.3550 | 1.3696 | 13 |
+| 0.3347 | 1.3682 | 14 |
+| 0.3184 | 1.4046 | 15 |
 
 
 ### Framework versions
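
Note on the training results: the train loss falls at every epoch, while the validation loss reaches its minimum well before the final epoch that this commit reports. The following is a minimal sketch for checking that from the table rows; `history` and `best_epoch` are illustrative names chosen here, not anything defined in the repository.

```python
# Per-epoch (epoch, train_loss, val_loss) rows, copied from the
# "Training results" table in the updated README.md.
history = [
    (0, 2.7142, 1.8683),
    (1, 1.6077, 1.4417),
    (2, 1.2458, 1.3161),
    (3, 1.0396, 1.2704),
    (4, 0.8848, 1.2818),
    (5, 0.7634, 1.2579),
    (6, 0.6699, 1.2724),
    (7, 0.5948, 1.2718),
    (8, 0.5306, 1.3300),
    (9, 0.4832, 1.3377),
    (10, 0.4401, 1.3038),
    (11, 0.4053, 1.3622),
    (12, 0.3782, 1.3577),
    (13, 0.3550, 1.3696),
    (14, 0.3347, 1.3682),
    (15, 0.3184, 1.4046),
]


def best_epoch(rows):
    """Return the row with the lowest validation loss (hypothetical helper)."""
    return min(rows, key=lambda row: row[2])


best = best_epoch(history)
final = history[-1]
gap = final[2] - best[2]  # how far the final checkpoint sits above the minimum

print(f"lowest validation loss: {best[2]:.4f} at epoch {best[0]}")
print(f"final validation loss:  {final[2]:.4f} at epoch {final[0]} (+{gap:.4f})")
```

On these numbers the minimum is at epoch 5 (1.2579), and the epoch-15 checkpoint is about 0.147 higher. Whether that gap matters depends on the dataset and task, which the README does not describe.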
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f11a6a5bdf24a589cec9ca2b2c1ec0943316ac4ddcf2585d3f34d06cd5670cd1
+oid sha256:a0e3685ca3da085c7a5adf77d1a4e8bea7f2ddaec2630aa54ebf78cec23844d2
 size 658153136
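
Note on the pointer change: `tf_model.h5` is stored through Git LFS, so the diff shows only the pointer file. The `oid` is the SHA-256 of the file contents, and `size` is its length in bytes, which is identical before and after this commit. Below is a rough sketch of checking a downloaded file against a pointer. `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, and the local path in the usage comment is an assumption.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def matches_pointer(path, pointer_text, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and SHA-256 named by the pointer."""
    fields = parse_lfs_pointer(pointer_text)
    algo, _, expected = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo!r}")
    path = Path(path)
    # Cheap check first: a size mismatch means the hash cannot match.
    if path.stat().st_size != int(fields["size"]):
        return False
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Hash in chunks so a ~650 MB weights file is not read into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected


# Pointer contents from the new side of this commit.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:a0e3685ca3da085c7a5adf77d1a4e8bea7f2ddaec2630aa54ebf78cec23844d2
size 658153136
"""

# Usage, assuming the weights were downloaded to ./tf_model.h5:
# print(matches_pointer("tf_model.h5", NEW_POINTER))
```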