BartekSadlej/calculator_model_test

@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.8846
 ## Model description
@@ -44,46 +44,46 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 3.7375        | 1.0   | 14   | 2.8446          |
-| 2.5309        | 2.0   | 28   | 2.3889          |
-| 2.3406        | 3.0   | 42   | 2.3073          |
-| 2.2691        | 4.0   | 56   | 2.2098          |
-| 2.1412        | 5.0   | 70   | 2.0464          |
-| 1.9372        | 6.0   | 84   | 1.7744          |
-| 1.6761        | 7.0   | 98   | 1.5399          |
-| 1.4725        | 8.0   | 112  | 1.3886          |
-| 1.368         | 9.0   | 126  | 1.3246          |
-| 1.33          | 10.0  | 140  | 1.3355          |
-| 1.3119        | 11.0  | 154  | 1.2886          |
-| 1.2836        | 12.0  | 168  | 1.2712          |
-| 1.2668        | 13.0  | 182  | 1.2703          |
-| 1.2526        | 14.0  | 196  | 1.2477          |
-| 1.2292        | 15.0  | 210  | 1.2339          |
-| 1.203         | 16.0  | 224  | 1.1997          |
-| 1.1686        | 17.0  | 238  | 1.1764          |
-| 1.1308        | 18.0  | 252  | 1.1424          |
-| 1.0866        | 19.0  | 266  | 1.1034          |
-| 1.0355        | 20.0  | 280  | 1.0546          |
-| 1.0031        | 21.0  | 294  | 1.0241          |
-| 0.9608        | 22.0  | 308  | 0.9925          |
-| 0.924         | 23.0  | 322  | 0.9673          |
-| 0.9022        | 24.0  | 336  | 0.9555          |
-| 0.8733        | 25.0  | 350  | 0.9381          |
-| 0.8549        | 26.0  | 364  | 0.9394          |
-| 0.8363        | 27.0  | 378  | 0.9274          |
-| 0.8129        | 28.0  | 392  | 0.9211          |
-| 0.7894        | 29.0  | 406  | 0.9149          |
-| 0.7705        | 30.0  | 420  | 0.9042          |
-| 0.7509        | 31.0  | 434  | 0.8962          |
-| 0.7363        | 32.0  | 448  | 0.9003          |
-| 0.7261        | 33.0  | 462  | 0.8935          |
-| 0.7135        | 34.0  | 476  | 0.8923          |
-| 0.6988        | 35.0  | 490  | 0.8961          |
-| 0.6883        | 36.0  | 504  | 0.8883          |
-| 0.6768        | 37.0  | 518  | 0.8905          |
-| 0.6686        | 38.0  | 532  | 0.8885          |
-| 0.6625        | 39.0  | 546  | 0.8865          |
-| 0.6566        | 40.0  | 560  | 0.8846          |
 ### Framework versions

 This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3798
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 2.8946        | 1.0   | 41   | 2.3454          |
+| 2.1015        | 2.0   | 82   | 1.9329          |
+| 1.8297        | 3.0   | 123  | 1.6538          |
+| 1.4394        | 4.0   | 164  | 1.2883          |
+| 1.2328        | 5.0   | 205  | 1.1711          |
+| 1.1398        | 6.0   | 246  | 1.0309          |
+| 1.0575        | 7.0   | 287  | 0.9585          |
+| 0.9607        | 8.0   | 328  | 0.9029          |
+| 0.8955        | 9.0   | 369  | 0.8200          |
+| 0.8318        | 10.0  | 410  | 0.7741          |
+| 0.7961        | 11.0  | 451  | 0.7525          |
+| 0.7713        | 12.0  | 492  | 0.7437          |
+| 0.7477        | 13.0  | 533  | 0.6924          |
+| 0.7197        | 14.0  | 574  | 0.6796          |
+| 0.6971        | 15.0  | 615  | 0.6514          |
+| 0.6734        | 16.0  | 656  | 0.6209          |
+| 0.6593        | 17.0  | 697  | 0.6080          |
+| 0.6396        | 18.0  | 738  | 0.5799          |
+| 0.6208        | 19.0  | 779  | 0.5706          |
+| 0.6004        | 20.0  | 820  | 0.5619          |
+| 0.5805        | 21.0  | 861  | 0.5368          |
+| 0.5765        | 22.0  | 902  | 0.5237          |
+| 0.5591        | 23.0  | 943  | 0.5110          |
+| 0.5462        | 24.0  | 984  | 0.5035          |
+| 0.5345        | 25.0  | 1025 | 0.4991          |
+| 0.5208        | 26.0  | 1066 | 0.4734          |
+| 0.5064        | 27.0  | 1107 | 0.4680          |
+| 0.4989        | 28.0  | 1148 | 0.4560          |
+| 0.4892        | 29.0  | 1189 | 0.4560          |
+| 0.4821        | 30.0  | 1230 | 0.4438          |
+| 0.4726        | 31.0  | 1271 | 0.4383          |
+| 0.4659        | 32.0  | 1312 | 0.4314          |
+| 0.453         | 33.0  | 1353 | 0.4122          |
+| 0.4466        | 34.0  | 1394 | 0.4115          |
+| 0.4393        | 35.0  | 1435 | 0.3996          |
+| 0.4315        | 36.0  | 1476 | 0.4007          |
+| 0.4266        | 37.0  | 1517 | 0.3949          |
+| 0.4219        | 38.0  | 1558 | 0.3878          |
+| 0.416         | 39.0  | 1599 | 0.3816          |
+| 0.4133        | 40.0  | 1640 | 0.3798          |
 ### Framework versions
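
To put the two result tables side by side, the short sketch below compares the final rows of the old and new runs. The numbers are copied from the tables above; the helper names `steps_per_epoch` and `relative_reduction` are illustrative and not part of the repository. The diff does not show why the step count per epoch changed (a different batch size or dataset size would both explain it), so the code only reports the numbers.

```python
# Final rows of the two training tables, copied from the diff above.
old_final = {"epoch": 40, "step": 560, "val_loss": 0.8846}
new_final = {"epoch": 40, "step": 1640, "val_loss": 0.3798}


def steps_per_epoch(row):
    """Optimizer steps per epoch, derived from the cumulative step count."""
    return row["step"] // row["epoch"]


def relative_reduction(old, new):
    """Fractional drop from `old` to `new` (0.25 means 25% lower)."""
    return (old - new) / old


if __name__ == "__main__":
    print("steps per epoch (old, new):",
          steps_per_epoch(old_final), steps_per_epoch(new_final))
    drop = relative_reduction(old_final["val_loss"], new_final["val_loss"])
    print(f"validation loss reduction: {drop:.1%}")
```

Under these numbers the new run takes 41 steps per epoch instead of 14, and its final validation loss is a bit over half as large as before.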

model.safetensors

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1b2d492815e4c9d2a12d3ae79a8d6055683a98c9381a94ecf0c06589e143796a
 size 31521568

 version https://git-lfs.github.com/spec/v1
+oid sha256:1e4ad507054e5cc2e2a29d9ab44620bc49d1651769f82a07ef40c9cbb134a55b
 size 31521568

runs/Mar04_10-13-42_f9b5e148b874/events.out.tfevents.1709547223.f9b5e148b874.6804.0

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a53132516ab004c73bb8b2cb45acaaac999e7b48b354a3af839872d626390e49
-size 25879

 version https://git-lfs.github.com/spec/v1
+oid sha256:5e0b1a3ed477f0c6f00457f93e9a316ee573528540c335da9e21d6638210d565
+size 28161
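
The `version` / `oid` / `size` triples above are Git LFS pointer files: the repository stores only the pointer, and the real file is identified by its SHA-256 digest and byte size. As a minimal sketch, a downloaded file could be checked against a pointer as shown below. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical helpers written for this illustration, not part of Git LFS or the Hugging Face tooling, and the local path in the usage comment is an assumption.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer):
    """Return True if the file at `path` has the size and SHA-256 in `pointer`."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    hasher = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read in 1 MiB chunks so large weight files are not loaded at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# New pointer for model.safetensors, copied from the diff above.
new_pointer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:1e4ad507054e5cc2e2a29d9ab44620bc49d1651769f82a07ef40c9cbb134a55b
size 31521568
""")

# Usage, assuming the weights were downloaded to the working directory:
# matches_pointer("model.safetensors", new_pointer)
```

Note that both versions of `model.safetensors` have the same size (31521568 bytes) and differ only in their digest, which is what one would expect when the weights change but the architecture does not.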