Mithil commited on
Commit
058bce2
1 Parent(s): a87558e

End of training

Browse files
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 4.6477
19
 
20
  ## Model description
21
 
@@ -48,26 +48,26 @@ The following hyperparameters were used during training:
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:-----:|:----:|:---------------:|
51
- | No log | 0.98 | 6 | 10.0906 |
52
- | 9.1486 | 1.96 | 12 | 6.7033 |
53
- | 9.1486 | 2.94 | 18 | 5.8931 |
54
- | 6.8583 | 3.92 | 24 | 5.6861 |
55
- | 6.6191 | 4.9 | 30 | 5.6068 |
56
- | 6.6191 | 5.88 | 36 | 5.5426 |
57
- | 6.3742 | 6.86 | 42 | 5.4848 |
58
- | 6.3742 | 8.0 | 49 | 5.3937 |
59
- | 6.3911 | 8.98 | 55 | 5.3093 |
60
- | 6.3445 | 9.96 | 61 | 5.2252 |
61
- | 6.3445 | 10.94 | 67 | 5.1060 |
62
- | 6.1275 | 11.92 | 73 | 4.9838 |
63
- | 6.1275 | 12.9 | 79 | 4.8939 |
64
- | 6.108 | 13.88 | 85 | 4.8286 |
65
- | 5.9782 | 14.86 | 91 | 4.7518 |
66
- | 5.9782 | 16.0 | 98 | 4.7024 |
67
- | 5.9191 | 16.98 | 104 | 4.6739 |
68
- | 5.869 | 17.96 | 110 | 4.6599 |
69
- | 5.869 | 18.94 | 116 | 4.6488 |
70
- | 5.7407 | 19.59 | 120 | 4.6477 |
71
 
72
 
73
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 4.3524
19
 
20
  ## Model description
21
 
 
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:-----:|:----:|:---------------:|
51
+ | No log | 0.98 | 6 | 8.2912 |
52
+ | 9.0416 | 1.96 | 12 | 6.3294 |
53
+ | 9.0416 | 2.94 | 18 | 5.7787 |
54
+ | 6.7966 | 3.92 | 24 | 5.4608 |
55
+ | 6.5563 | 4.9 | 30 | 5.3249 |
56
+ | 6.5563 | 5.88 | 36 | 5.1962 |
57
+ | 6.2756 | 6.86 | 42 | 5.1209 |
58
+ | 6.2756 | 8.0 | 49 | 4.9701 |
59
+ | 6.3126 | 8.98 | 55 | 4.8793 |
60
+ | 6.237 | 9.96 | 61 | 4.7837 |
61
+ | 6.237 | 10.94 | 67 | 4.7102 |
62
+ | 5.9722 | 11.92 | 73 | 4.5721 |
63
+ | 5.9722 | 12.9 | 79 | 4.5170 |
64
+ | 5.9883 | 13.88 | 85 | 4.4562 |
65
+ | 5.8828 | 14.86 | 91 | 4.4168 |
66
+ | 5.8828 | 16.0 | 98 | 4.3880 |
67
+ | 5.8493 | 16.98 | 104 | 4.3684 |
68
+ | 5.8112 | 17.96 | 110 | 4.3570 |
69
+ | 5.8112 | 18.94 | 116 | 4.3528 |
70
+ | 5.6628 | 19.59 | 120 | 4.3524 |
71
 
72
 
73
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:bfa4447b378eec9c913be4daa5273c384c53af848a654bdcee1908a69e8dd35f
3
  size 497777280
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b672c8c562af88d3224a2bbb5365350a8a62de023117f2d96c094b72b9299da4
3
  size 497777280
runs/May21_09-50-09_e261ca047b2e/events.out.tfevents.1716285014.e261ca047b2e.34.4 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:13cc9bd2ffaeb35d4c5ca59bfba132acfeeda1b42d4026c130e53ca38f9d1208
3
+ size 17754
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:54bb2bda263ecf9eff76d0435dfea29cce1f553560be1f5a38f4365f73b97087
3
  size 4920
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7d814d2398436b0823107338673b666c0f55a1ce0d76410bc11af67edf461fe6
3
  size 4920