BartekSadlej commited on
Commit
6acaadb
1 Parent(s): 27d6dc5

End of training

Browse files
README.md CHANGED
@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
13
 
14
  This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
15
  It achieves the following results on the evaluation set:
16
- - Loss: 0.8846
17
 
18
  ## Model description
19
 
@@ -44,46 +44,46 @@ The following hyperparameters were used during training:
44
 
45
  | Training Loss | Epoch | Step | Validation Loss |
46
  |:-------------:|:-----:|:----:|:---------------:|
47
- | 3.7375 | 1.0 | 14 | 2.8446 |
48
- | 2.5309 | 2.0 | 28 | 2.3889 |
49
- | 2.3406 | 3.0 | 42 | 2.3073 |
50
- | 2.2691 | 4.0 | 56 | 2.2098 |
51
- | 2.1412 | 5.0 | 70 | 2.0464 |
52
- | 1.9372 | 6.0 | 84 | 1.7744 |
53
- | 1.6761 | 7.0 | 98 | 1.5399 |
54
- | 1.4725 | 8.0 | 112 | 1.3886 |
55
- | 1.368 | 9.0 | 126 | 1.3246 |
56
- | 1.33 | 10.0 | 140 | 1.3355 |
57
- | 1.3119 | 11.0 | 154 | 1.2886 |
58
- | 1.2836 | 12.0 | 168 | 1.2712 |
59
- | 1.2668 | 13.0 | 182 | 1.2703 |
60
- | 1.2526 | 14.0 | 196 | 1.2477 |
61
- | 1.2292 | 15.0 | 210 | 1.2339 |
62
- | 1.203 | 16.0 | 224 | 1.1997 |
63
- | 1.1686 | 17.0 | 238 | 1.1764 |
64
- | 1.1308 | 18.0 | 252 | 1.1424 |
65
- | 1.0866 | 19.0 | 266 | 1.1034 |
66
- | 1.0355 | 20.0 | 280 | 1.0546 |
67
- | 1.0031 | 21.0 | 294 | 1.0241 |
68
- | 0.9608 | 22.0 | 308 | 0.9925 |
69
- | 0.924 | 23.0 | 322 | 0.9673 |
70
- | 0.9022 | 24.0 | 336 | 0.9555 |
71
- | 0.8733 | 25.0 | 350 | 0.9381 |
72
- | 0.8549 | 26.0 | 364 | 0.9394 |
73
- | 0.8363 | 27.0 | 378 | 0.9274 |
74
- | 0.8129 | 28.0 | 392 | 0.9211 |
75
- | 0.7894 | 29.0 | 406 | 0.9149 |
76
- | 0.7705 | 30.0 | 420 | 0.9042 |
77
- | 0.7509 | 31.0 | 434 | 0.8962 |
78
- | 0.7363 | 32.0 | 448 | 0.9003 |
79
- | 0.7261 | 33.0 | 462 | 0.8935 |
80
- | 0.7135 | 34.0 | 476 | 0.8923 |
81
- | 0.6988 | 35.0 | 490 | 0.8961 |
82
- | 0.6883 | 36.0 | 504 | 0.8883 |
83
- | 0.6768 | 37.0 | 518 | 0.8905 |
84
- | 0.6686 | 38.0 | 532 | 0.8885 |
85
- | 0.6625 | 39.0 | 546 | 0.8865 |
86
- | 0.6566 | 40.0 | 560 | 0.8846 |
87
 
88
 
89
  ### Framework versions
 
13
 
14
  This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
15
  It achieves the following results on the evaluation set:
16
+ - Loss: 0.3798
17
 
18
  ## Model description
19
 
 
44
 
45
  | Training Loss | Epoch | Step | Validation Loss |
46
  |:-------------:|:-----:|:----:|:---------------:|
47
+ | 2.8946 | 1.0 | 41 | 2.3454 |
48
+ | 2.1015 | 2.0 | 82 | 1.9329 |
49
+ | 1.8297 | 3.0 | 123 | 1.6538 |
50
+ | 1.4394 | 4.0 | 164 | 1.2883 |
51
+ | 1.2328 | 5.0 | 205 | 1.1711 |
52
+ | 1.1398 | 6.0 | 246 | 1.0309 |
53
+ | 1.0575 | 7.0 | 287 | 0.9585 |
54
+ | 0.9607 | 8.0 | 328 | 0.9029 |
55
+ | 0.8955 | 9.0 | 369 | 0.8200 |
56
+ | 0.8318 | 10.0 | 410 | 0.7741 |
57
+ | 0.7961 | 11.0 | 451 | 0.7525 |
58
+ | 0.7713 | 12.0 | 492 | 0.7437 |
59
+ | 0.7477 | 13.0 | 533 | 0.6924 |
60
+ | 0.7197 | 14.0 | 574 | 0.6796 |
61
+ | 0.6971 | 15.0 | 615 | 0.6514 |
62
+ | 0.6734 | 16.0 | 656 | 0.6209 |
63
+ | 0.6593 | 17.0 | 697 | 0.6080 |
64
+ | 0.6396 | 18.0 | 738 | 0.5799 |
65
+ | 0.6208 | 19.0 | 779 | 0.5706 |
66
+ | 0.6004 | 20.0 | 820 | 0.5619 |
67
+ | 0.5805 | 21.0 | 861 | 0.5368 |
68
+ | 0.5765 | 22.0 | 902 | 0.5237 |
69
+ | 0.5591 | 23.0 | 943 | 0.5110 |
70
+ | 0.5462 | 24.0 | 984 | 0.5035 |
71
+ | 0.5345 | 25.0 | 1025 | 0.4991 |
72
+ | 0.5208 | 26.0 | 1066 | 0.4734 |
73
+ | 0.5064 | 27.0 | 1107 | 0.4680 |
74
+ | 0.4989 | 28.0 | 1148 | 0.4560 |
75
+ | 0.4892 | 29.0 | 1189 | 0.4560 |
76
+ | 0.4821 | 30.0 | 1230 | 0.4438 |
77
+ | 0.4726 | 31.0 | 1271 | 0.4383 |
78
+ | 0.4659 | 32.0 | 1312 | 0.4314 |
79
+ | 0.453 | 33.0 | 1353 | 0.4122 |
80
+ | 0.4466 | 34.0 | 1394 | 0.4115 |
81
+ | 0.4393 | 35.0 | 1435 | 0.3996 |
82
+ | 0.4315 | 36.0 | 1476 | 0.4007 |
83
+ | 0.4266 | 37.0 | 1517 | 0.3949 |
84
+ | 0.4219 | 38.0 | 1558 | 0.3878 |
85
+ | 0.416 | 39.0 | 1599 | 0.3816 |
86
+ | 0.4133 | 40.0 | 1640 | 0.3798 |
87
 
88
 
89
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1b2d492815e4c9d2a12d3ae79a8d6055683a98c9381a94ecf0c06589e143796a
3
  size 31521568
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1e4ad507054e5cc2e2a29d9ab44620bc49d1651769f82a07ef40c9cbb134a55b
3
  size 31521568
runs/Mar04_10-13-42_f9b5e148b874/events.out.tfevents.1709547223.f9b5e148b874.6804.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a53132516ab004c73bb8b2cb45acaaac999e7b48b354a3af839872d626390e49
3
- size 25879
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5e0b1a3ed477f0c6f00457f93e9a316ee573528540c335da9e21d6638210d565
3
+ size 28161