Commit 278cf5d by Redhotchilipoppy
1 Parent(s): 0cdb665

End of training

README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
  
  This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.9267
+ - Loss: 0.8843
  
  ## Model description
  
@@ -46,21 +46,21 @@ The following hyperparameters were used during training:
  
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:-----:|:---------------:|
- | 1.4649 | 1.0 | 1090 | 1.2665 |
- | 1.2725 | 2.0 | 2180 | 1.1424 |
- | 1.1813 | 3.0 | 3270 | 1.0741 |
- | 1.1291 | 4.0 | 4360 | 1.0381 |
- | 1.0842 | 5.0 | 5450 | 1.0118 |
- | 1.0461 | 6.0 | 6540 | 0.9905 |
- | 1.0236 | 7.0 | 7630 | 0.9721 |
- | 1.0033 | 8.0 | 8720 | 0.9618 |
- | 0.9879 | 9.0 | 9810 | 0.9499 |
- | 0.9731 | 10.0 | 10900 | 0.9465 |
- | 0.9568 | 11.0 | 11990 | 0.9392 |
- | 0.9509 | 12.0 | 13080 | 0.9326 |
- | 0.9429 | 13.0 | 14170 | 0.9300 |
- | 0.9371 | 14.0 | 15260 | 0.9290 |
- | 0.9319 | 15.0 | 16350 | 0.9267 |
+ | 0.8369 | 1.0 | 1090 | 0.8848 |
+ | 0.8257 | 2.0 | 2180 | 0.8879 |
+ | 0.8085 | 3.0 | 3270 | 0.8750 |
+ | 0.7996 | 4.0 | 4360 | 0.8720 |
+ | 0.7861 | 5.0 | 5450 | 0.8789 |
+ | 0.7781 | 6.0 | 6540 | 0.8777 |
+ | 0.768 | 7.0 | 7630 | 0.8798 |
+ | 0.7585 | 8.0 | 8720 | 0.8789 |
+ | 0.7475 | 9.0 | 9810 | 0.8774 |
+ | 0.739 | 10.0 | 10900 | 0.8828 |
+ | 0.7285 | 11.0 | 11990 | 0.8817 |
+ | 0.7269 | 12.0 | 13080 | 0.8813 |
+ | 0.7202 | 13.0 | 14170 | 0.8801 |
+ | 0.7156 | 14.0 | 15260 | 0.8839 |
+ | 0.7106 | 15.0 | 16350 | 0.8843 |
  
  
  ### Framework versions
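A note on reading the updated table: the reported evaluation loss (0.8843) is the value at the last epoch, while the lowest validation loss in the table occurs earlier. The following is a minimal sketch, not part of the commit, that copies the new rows by hand and picks out the row with the lowest validation loss. The names `ROWS` and `best_epoch` are hypothetical and exist only for this illustration.

```python
# Rows copied by hand from the updated README table:
# (training loss, epoch, step, validation loss).
ROWS = [
    (0.8369, 1, 1090, 0.8848),
    (0.8257, 2, 2180, 0.8879),
    (0.8085, 3, 3270, 0.8750),
    (0.7996, 4, 4360, 0.8720),
    (0.7861, 5, 5450, 0.8789),
    (0.7781, 6, 6540, 0.8777),
    (0.768, 7, 7630, 0.8798),
    (0.7585, 8, 8720, 0.8789),
    (0.7475, 9, 9810, 0.8774),
    (0.739, 10, 10900, 0.8828),
    (0.7285, 11, 11990, 0.8817),
    (0.7269, 12, 13080, 0.8813),
    (0.7202, 13, 14170, 0.8801),
    (0.7156, 14, 15260, 0.8839),
    (0.7106, 15, 16350, 0.8843),
]


def best_epoch(rows):
    """Return the row with the lowest validation loss (hypothetical helper)."""
    return min(rows, key=lambda row: row[3])


best = best_epoch(ROWS)
final = ROWS[-1]
print(f"best epoch: {best[1]} (validation loss {best[3]:.4f})")    # best epoch: 4 (validation loss 0.8720)
print(f"final epoch: {final[1]} (validation loss {final[3]:.4f})")  # final epoch: 15 (validation loss 0.8843)
```

Training loss falls at every epoch while validation loss stays roughly flat after epoch 4, a pattern that may indicate the additional epochs are mostly fitting the training set. Whether the uploaded weights correspond to the last or the best checkpoint is not stated in the commit.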
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:68e05d90f0ef50f949eb84ec05739f87e19d6100cf1782e052a251263de4a34e
+ oid sha256:36e64da8f53ad199df931a08abecad17353fcc5ab75e05f179b23e44630eee34
  size 327657928
runs/Jan22_07-36-55_a7ddfede0210/events.out.tfevents.1705913091.a7ddfede0210.26.1 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:a601c54e6996939399d8e823bd77c29e36d8414c76748927edfdaba833216887
- size 359
+ oid sha256:59e77ce0569844876d4d27c29895c43471364fd67c875d8e66d6d2dd1dcd9f25
+ size 14715
runs/Jan22_07-36-55_a7ddfede0210/events.out.tfevents.1705917420.a7ddfede0210.26.2 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:da06183ef85effe9b99201719198d678e08d1e60a172f1969674fb7c33a521e2
+ size 14173
runs/Jan22_07-36-55_a7ddfede0210/events.out.tfevents.1705919386.a7ddfede0210.26.3 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:146d0d2433aa3a258c152e4e40fb45f7540497e413660f300ca6083a2e777e54
+ size 359
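
The binary files in this commit appear only as Git LFS pointers: three `key value` lines giving the spec version, a SHA-256 object id, and the size in bytes. As a rough sketch (not part of the commit), the pointers can be parsed and compared to see what changed. The function name `parse_lfs_pointer` and the constants `OLD` and `NEW` are hypothetical; the pointer text is copied from the `model.safetensors` diff above.

```python
def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid field has the form "<algorithm>:<hex digest>".
    algorithm, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algorithm": algorithm,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Old and new pointers for model.safetensors, copied from the diff.
OLD = """version https://git-lfs.github.com/spec/v1
oid sha256:68e05d90f0ef50f949eb84ec05739f87e19d6100cf1782e052a251263de4a34e
size 327657928
"""
NEW = """version https://git-lfs.github.com/spec/v1
oid sha256:36e64da8f53ad199df931a08abecad17353fcc5ab75e05f179b23e44630eee34
size 327657928
"""

old, new = parse_lfs_pointer(OLD), parse_lfs_pointer(NEW)
content_changed = old["digest"] != new["digest"]
size_delta = new["size"] - old["size"]
print(content_changed, size_delta)  # True 0
```

An unchanged size with a different digest is what one would expect when the weights are retrained but the architecture, and therefore the tensor shapes, stay the same.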