LordAbsurd2137 committed on
Commit 3b58d12
1 Parent(s): c500ced

End of training

README.md CHANGED
@@ -1,4 +1,5 @@
  ---
  tags:
  - generated_from_trainer
  model-index:
@@ -11,9 +12,9 @@ should probably proofread and complete it, then remove this comment. -->
 
  # calculator_model_test2
 
- This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.1318
 
  ## Model description
 
@@ -38,52 +39,62 @@ The following hyperparameters were used during training:
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 40
 
  ### Training results
 
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | 2.9409 | 1.0 | 6 | 2.2604 |
- | 2.0221 | 2.0 | 12 | 1.7164 |
- | 1.5231 | 3.0 | 18 | 1.2971 |
- | 1.1926 | 4.0 | 24 | 1.0828 |
- | 1.0328 | 5.0 | 30 | 0.9510 |
- | 0.9177 | 6.0 | 36 | 0.8256 |
- | 0.8079 | 7.0 | 42 | 0.7383 |
- | 0.7261 | 8.0 | 48 | 0.6878 |
- | 0.6844 | 9.0 | 54 | 0.6245 |
- | 0.6421 | 10.0 | 60 | 0.5833 |
- | 0.6088 | 11.0 | 66 | 0.5801 |
- | 0.5638 | 12.0 | 72 | 0.5270 |
- | 0.5326 | 13.0 | 78 | 0.4975 |
- | 0.5134 | 14.0 | 84 | 0.5070 |
- | 0.5135 | 15.0 | 90 | 0.4415 |
- | 0.468 | 16.0 | 96 | 0.4325 |
- | 0.4442 | 17.0 | 102 | 0.4200 |
- | 0.4214 | 18.0 | 108 | 0.4241 |
- | 0.4115 | 19.0 | 114 | 0.3691 |
- | 0.3885 | 20.0 | 120 | 0.3460 |
- | 0.3641 | 21.0 | 126 | 0.3261 |
- | 0.3445 | 22.0 | 132 | 0.2990 |
- | 0.3198 | 23.0 | 138 | 0.2776 |
- | 0.3043 | 24.0 | 144 | 0.2610 |
- | 0.2885 | 25.0 | 150 | 0.2424 |
- | 0.2709 | 26.0 | 156 | 0.2312 |
- | 0.2537 | 27.0 | 162 | 0.2321 |
- | 0.2475 | 28.0 | 168 | 0.2040 |
- | 0.231 | 29.0 | 174 | 0.1949 |
- | 0.2228 | 30.0 | 180 | 0.1797 |
- | 0.2015 | 31.0 | 186 | 0.1713 |
- | 0.202 | 32.0 | 192 | 0.1616 |
- | 0.1793 | 33.0 | 198 | 0.1583 |
- | 0.1849 | 34.0 | 204 | 0.1512 |
- | 0.1726 | 35.0 | 210 | 0.1464 |
- | 0.1703 | 36.0 | 216 | 0.1451 |
- | 0.1611 | 37.0 | 222 | 0.1394 |
- | 0.166 | 38.0 | 228 | 0.1353 |
- | 0.1595 | 39.0 | 234 | 0.1326 |
- | 0.1526 | 40.0 | 240 | 0.1318 |
 
 
  ### Framework versions
 
  ---
+ base_model: LordAbsurd2137/calculator_model_test2
  tags:
  - generated_from_trainer
  model-index:
 
 
  # calculator_model_test2
 
+ This model is a fine-tuned version of [LordAbsurd2137/calculator_model_test2](https://huggingface.co/LordAbsurd2137/calculator_model_test2) on the None dataset.
  It achieves the following results on the evaluation set:
+ - Loss: 0.0222
 
  ## Model description
 
 
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
+ - num_epochs: 50
 
  ### Training results
 
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
+ | 1.0953 | 1.0 | 6 | 0.6430 |
+ | 0.5025 | 2.0 | 12 | 0.4289 |
+ | 0.3633 | 3.0 | 18 | 0.3352 |
+ | 0.3426 | 4.0 | 24 | 0.2911 |
+ | 0.284 | 5.0 | 30 | 0.2464 |
+ | 0.2408 | 6.0 | 36 | 0.1998 |
+ | 0.2168 | 7.0 | 42 | 0.1874 |
+ | 0.1815 | 8.0 | 48 | 0.1592 |
+ | 0.1684 | 9.0 | 54 | 0.1547 |
+ | 0.1604 | 10.0 | 60 | 0.1707 |
+ | 0.1603 | 11.0 | 66 | 0.1516 |
+ | 0.1498 | 12.0 | 72 | 0.1220 |
+ | 0.1307 | 13.0 | 78 | 0.1079 |
+ | 0.1154 | 14.0 | 84 | 0.1270 |
+ | 0.1157 | 15.0 | 90 | 0.0997 |
+ | 0.1038 | 16.0 | 96 | 0.0919 |
+ | 0.1005 | 17.0 | 102 | 0.1005 |
+ | 0.097 | 18.0 | 108 | 0.1103 |
+ | 0.1019 | 19.0 | 114 | 0.1285 |
+ | 0.1067 | 20.0 | 120 | 0.1202 |
+ | 0.0995 | 21.0 | 126 | 0.0810 |
+ | 0.0799 | 22.0 | 132 | 0.0774 |
+ | 0.0723 | 23.0 | 138 | 0.0678 |
+ | 0.0656 | 24.0 | 144 | 0.0655 |
+ | 0.0666 | 25.0 | 150 | 0.0637 |
+ | 0.0606 | 26.0 | 156 | 0.0541 |
+ | 0.0562 | 27.0 | 162 | 0.0497 |
+ | 0.0524 | 28.0 | 168 | 0.0519 |
+ | 0.0556 | 29.0 | 174 | 0.0616 |
+ | 0.0584 | 30.0 | 180 | 0.0448 |
+ | 0.0496 | 31.0 | 186 | 0.0451 |
+ | 0.0484 | 32.0 | 192 | 0.0413 |
+ | 0.0454 | 33.0 | 198 | 0.0444 |
+ | 0.0411 | 34.0 | 204 | 0.0423 |
+ | 0.0395 | 35.0 | 210 | 0.0376 |
+ | 0.0383 | 36.0 | 216 | 0.0358 |
+ | 0.0373 | 37.0 | 222 | 0.0326 |
+ | 0.035 | 38.0 | 228 | 0.0286 |
+ | 0.0323 | 39.0 | 234 | 0.0307 |
+ | 0.0306 | 40.0 | 240 | 0.0279 |
+ | 0.029 | 41.0 | 246 | 0.0265 |
+ | 0.0296 | 42.0 | 252 | 0.0259 |
+ | 0.0263 | 43.0 | 258 | 0.0259 |
+ | 0.0244 | 44.0 | 264 | 0.0232 |
+ | 0.0264 | 45.0 | 270 | 0.0234 |
+ | 0.0263 | 46.0 | 276 | 0.0226 |
+ | 0.0223 | 47.0 | 282 | 0.0227 |
+ | 0.0225 | 48.0 | 288 | 0.0224 |
+ | 0.0218 | 49.0 | 294 | 0.0222 |
+ | 0.0216 | 50.0 | 300 | 0.0222 |
 
 
  ### Framework versions
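
Reviewer note on the README hyperparameters above: the Step column advances by 6 per epoch, so raising `num_epochs` from 40 to 50 moves the end of the `linear` schedule from step 240 to step 300. The sketch below shows how a linear decay behaves over that range. The base learning rate and the warmup length are not visible in this diff, so `BASE_LR` and `warmup_steps=0` are assumptions for illustration only, and `linear_lr` is a hypothetical helper, not the trainer's own code.

```python
def linear_lr(step: int, total_steps: int, base_lr: float, warmup_steps: int = 0) -> float:
    """Learning rate under linear warmup followed by linear decay to zero."""
    if step < warmup_steps:
        # Ramp up from 0 to base_lr over the warmup phase.
        return base_lr * step / max(1, warmup_steps)
    # Decay linearly so that the rate reaches 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


STEPS_PER_EPOCH = 6   # from the Step column: step 6 at epoch 1.0
NUM_EPOCHS = 50       # from the updated hyperparameter list
TOTAL_STEPS = STEPS_PER_EPOCH * NUM_EPOCHS

BASE_LR = 1e-3        # ASSUMPTION: the real learning rate is not shown in this diff

for epoch in (0, 25, 40, 50):
    step = epoch * STEPS_PER_EPOCH
    print(f"epoch {epoch:2d}  step {step:3d}  lr {linear_lr(step, TOTAL_STEPS, BASE_LR):.6f}")
```

Under these assumptions, epoch 40 of the new run still trains at a fifth of the base rate, whereas the earlier 40-epoch run had already decayed to zero at that point.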
config.json CHANGED
@@ -1,4 +1,5 @@
  {
  "architectures": [
  "EncoderDecoderModel"
  ],
 
  {
+ "_name_or_path": "LordAbsurd2137/calculator_model_test2",
  "architectures": [
  "EncoderDecoderModel"
  ],
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:101d61414e7dafdc1b411412bbf65a09c0d4c1e2ff60ecfad0e487dc6a6f0e0b
  size 31207604
 
  version https://git-lfs.github.com/spec/v1
+ oid sha256:2a7b90e224c02b0c007748233e03d38e1e41f23ddc6fea0be25961145c3ad10b
  size 31207604
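
Reviewer note: the file changed here is a Git LFS pointer, not the weights themselves. Only the `oid` differs while `size` stays at 31207604 bytes, which is what one would expect when the same architecture is retrained. A minimal sketch for comparing two pointers follows; `parse_lfs_pointer` and the keys of the dictionary it returns are hypothetical names chosen for this example.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form '<hash algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


old = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:101d61414e7dafdc1b411412bbf65a09c0d4c1e2ff60ecfad0e487dc6a6f0e0b
size 31207604
""")
new = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:2a7b90e224c02b0c007748233e03d38e1e41f23ddc6fea0be25961145c3ad10b
size 31207604
""")

print("content changed:", old["digest"] != new["digest"])
print("size unchanged:", old["size"] == new["size"])
```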
runs/Mar04_14-37-56_5079440c9743/events.out.tfevents.1709563077.5079440c9743.368.2 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:d84ab2f7ffc63043acb525f633080beaf60d8182c78abf3cae65897d75a543e0
+ size 32960
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:777b82b562884d850ec3b9f6912b7ef277eedb2ea334502598862d2b6e00f6aa
  size 5112
 
  version https://git-lfs.github.com/spec/v1
+ oid sha256:5cf2a1d2265e44bc1e1badf337d46f5cc01385657ef4996c00756191e1659d8b
  size 5112