Maximofn commited on
Commit
d107b3b
1 Parent(s): b82aa3d

End of training

Browse files
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [openai-community/gpt2](https://huggingface.co/openai-community/gpt2) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 3.2082
19
 
20
  ## Model description
21
 
@@ -35,7 +35,7 @@ More information needed
35
 
36
  The following hyperparameters were used during training:
37
  - learning_rate: 2e-05
38
- - train_batch_size: 32
39
  - eval_batch_size: 32
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
@@ -49,9 +49,9 @@ The following hyperparameters were used during training:
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:-----:|:-----:|:---------------:|
52
- | 3.381 | 1.0 | 6516 | 3.2650 |
53
- | 3.2617 | 2.0 | 13032 | 3.2063 |
54
- | 3.2142 | 3.0 | 19548 | 3.1986 |
55
 
56
 
57
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [openai-community/gpt2](https://huggingface.co/openai-community/gpt2) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 3.2013
19
 
20
  ## Model description
21
 
 
35
 
36
  The following hyperparameters were used during training:
37
  - learning_rate: 2e-05
38
+ - train_batch_size: 28
39
  - eval_batch_size: 32
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:-----:|:-----:|:---------------:|
52
+ | 3.3866 | 1.0 | 7447 | 3.2590 |
53
+ | 3.2599 | 2.0 | 14894 | 3.1997 |
54
+ | 3.2126 | 3.0 | 22341 | 3.1920 |
55
 
56
 
57
  ### Framework versions
runs/Jul13_10-22-19_8de3af1b431d/events.out.tfevents.1720875425.8de3af1b431d.6946.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:da3103d50b57c7d4f1fd2a7c96cfbc729e27b09961d79af745275eb492659b77
3
+ size 364