SimoneJLaudani committed on
Commit
34af40f
1 Parent(s): ed86c5b

End of training

README.md CHANGED
@@ -20,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [distilbert-base-cased](https://huggingface.co/distilbert-base-cased) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.0935
- - Precision: 0.8398
- - Recall: 0.8319
- - F1: 0.8316
- - Accuracy: 0.8319
+ - Loss: 1.0027
+ - Precision: 0.8141
+ - Recall: 0.8067
+ - F1: 0.8073
+ - Accuracy: 0.8067

  ## Model description

@@ -49,41 +49,49 @@ The following hyperparameters were used during training:
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 4
+ - num_epochs: 5

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
  |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
- | 0.0672 | 0.14 | 30 | 0.9287 | 0.8186 | 0.8123 | 0.8094 | 0.8123 |
- | 0.0262 | 0.27 | 60 | 1.0312 | 0.8147 | 0.8067 | 0.8077 | 0.8067 |
- | 0.1314 | 0.41 | 90 | 0.9548 | 0.8156 | 0.8123 | 0.8122 | 0.8123 |
- | 0.0639 | 0.54 | 120 | 0.9111 | 0.8298 | 0.8207 | 0.8195 | 0.8207 |
- | 0.0598 | 0.68 | 150 | 1.1390 | 0.8145 | 0.8011 | 0.8022 | 0.8011 |
- | 0.1492 | 0.81 | 180 | 0.9996 | 0.8365 | 0.8319 | 0.8315 | 0.8319 |
- | 0.1065 | 0.95 | 210 | 1.0810 | 0.8316 | 0.8179 | 0.8167 | 0.8179 |
- | 0.0976 | 1.08 | 240 | 1.0728 | 0.8315 | 0.8263 | 0.8257 | 0.8263 |
- | 0.0618 | 1.22 | 270 | 1.0524 | 0.8414 | 0.8319 | 0.8319 | 0.8319 |
- | 0.0901 | 1.35 | 300 | 1.0568 | 0.8251 | 0.8151 | 0.8163 | 0.8151 |
- | 0.0471 | 1.49 | 330 | 1.0259 | 0.8292 | 0.8179 | 0.8184 | 0.8179 |
- | 0.028 | 1.62 | 360 | 1.0824 | 0.8284 | 0.8235 | 0.8223 | 0.8235 |
- | 0.0334 | 1.76 | 390 | 1.1277 | 0.8264 | 0.8179 | 0.8178 | 0.8179 |
- | 0.0136 | 1.89 | 420 | 1.0544 | 0.8379 | 0.8291 | 0.8270 | 0.8291 |
- | 0.0134 | 2.03 | 450 | 1.0709 | 0.8408 | 0.8347 | 0.8327 | 0.8347 |
- | 0.0323 | 2.16 | 480 | 1.1359 | 0.8399 | 0.8263 | 0.8261 | 0.8263 |
- | 0.0182 | 2.3 | 510 | 1.1279 | 0.8344 | 0.8235 | 0.8243 | 0.8235 |
- | 0.0211 | 2.43 | 540 | 1.0813 | 0.8443 | 0.8375 | 0.8380 | 0.8375 |
- | 0.0009 | 2.57 | 570 | 1.1202 | 0.8399 | 0.8319 | 0.8318 | 0.8319 |
- | 0.0227 | 2.7 | 600 | 1.1772 | 0.8255 | 0.8151 | 0.8153 | 0.8151 |
- | 0.0007 | 2.84 | 630 | 1.1644 | 0.8299 | 0.8179 | 0.8182 | 0.8179 |
- | 0.0017 | 2.97 | 660 | 1.1172 | 0.8406 | 0.8319 | 0.8311 | 0.8319 |
- | 0.0005 | 3.11 | 690 | 1.1139 | 0.8403 | 0.8319 | 0.8311 | 0.8319 |
- | 0.0094 | 3.24 | 720 | 1.0901 | 0.8363 | 0.8291 | 0.8287 | 0.8291 |
- | 0.0011 | 3.38 | 750 | 1.0993 | 0.8436 | 0.8347 | 0.8349 | 0.8347 |
- | 0.0005 | 3.51 | 780 | 1.0997 | 0.8436 | 0.8347 | 0.8349 | 0.8347 |
- | 0.0005 | 3.65 | 810 | 1.1004 | 0.8372 | 0.8291 | 0.8288 | 0.8291 |
- | 0.0005 | 3.78 | 840 | 1.1025 | 0.8372 | 0.8291 | 0.8288 | 0.8291 |
- | 0.0025 | 3.92 | 870 | 1.0964 | 0.8372 | 0.8291 | 0.8288 | 0.8291 |
+ | 1.9463 | 0.14 | 30 | 1.8631 | 0.1245 | 0.1625 | 0.0819 | 0.1625 |
+ | 1.7589 | 0.27 | 60 | 1.4567 | 0.4725 | 0.5098 | 0.4483 | 0.5098 |
+ | 1.389 | 0.41 | 90 | 1.2228 | 0.6230 | 0.5714 | 0.5547 | 0.5714 |
+ | 1.2009 | 0.54 | 120 | 1.0306 | 0.7264 | 0.6835 | 0.6666 | 0.6835 |
+ | 1.0999 | 0.68 | 150 | 0.8052 | 0.7808 | 0.7647 | 0.7625 | 0.7647 |
+ | 0.8848 | 0.81 | 180 | 0.7826 | 0.7499 | 0.7283 | 0.7191 | 0.7283 |
+ | 0.685 | 0.95 | 210 | 0.7337 | 0.7765 | 0.7591 | 0.7587 | 0.7591 |
+ | 0.5562 | 1.08 | 240 | 0.6653 | 0.7897 | 0.7871 | 0.7863 | 0.7871 |
+ | 0.4662 | 1.22 | 270 | 0.7158 | 0.7895 | 0.7535 | 0.7539 | 0.7535 |
+ | 0.3985 | 1.35 | 300 | 0.6552 | 0.8160 | 0.8011 | 0.8024 | 0.8011 |
+ | 0.317 | 1.49 | 330 | 0.7378 | 0.7902 | 0.7843 | 0.7836 | 0.7843 |
+ | 0.4177 | 1.62 | 360 | 0.6983 | 0.8085 | 0.8039 | 0.8028 | 0.8039 |
+ | 0.383 | 1.76 | 390 | 0.7612 | 0.7979 | 0.7759 | 0.7640 | 0.7759 |
+ | 0.2906 | 1.89 | 420 | 0.7369 | 0.7914 | 0.7759 | 0.7761 | 0.7759 |
+ | 0.3305 | 2.03 | 450 | 0.7302 | 0.7904 | 0.7787 | 0.7791 | 0.7787 |
+ | 0.1398 | 2.16 | 480 | 0.7798 | 0.8169 | 0.8095 | 0.8084 | 0.8095 |
+ | 0.0988 | 2.3 | 510 | 0.9284 | 0.7902 | 0.7815 | 0.7799 | 0.7815 |
+ | 0.1449 | 2.43 | 540 | 0.8863 | 0.8196 | 0.8123 | 0.8133 | 0.8123 |
+ | 0.2552 | 2.57 | 570 | 0.8396 | 0.8227 | 0.8179 | 0.8177 | 0.8179 |
+ | 0.1616 | 2.7 | 600 | 0.8182 | 0.8172 | 0.8123 | 0.8128 | 0.8123 |
+ | 0.2163 | 2.84 | 630 | 0.8075 | 0.8031 | 0.7983 | 0.7994 | 0.7983 |
+ | 0.2134 | 2.97 | 660 | 0.9430 | 0.8190 | 0.8067 | 0.8080 | 0.8067 |
+ | 0.1255 | 3.11 | 690 | 0.8907 | 0.8166 | 0.8123 | 0.8116 | 0.8123 |
+ | 0.0969 | 3.24 | 720 | 0.8805 | 0.8009 | 0.7983 | 0.7977 | 0.7983 |
+ | 0.0649 | 3.38 | 750 | 0.9065 | 0.7957 | 0.7843 | 0.7846 | 0.7843 |
+ | 0.0328 | 3.51 | 780 | 0.9083 | 0.8141 | 0.8095 | 0.8093 | 0.8095 |
+ | 0.0274 | 3.65 | 810 | 0.8894 | 0.8096 | 0.8011 | 0.8011 | 0.8011 |
+ | 0.0906 | 3.78 | 840 | 0.9425 | 0.8166 | 0.8095 | 0.8101 | 0.8095 |
+ | 0.0906 | 3.92 | 870 | 0.9333 | 0.8066 | 0.8011 | 0.8011 | 0.8011 |
+ | 0.0641 | 4.05 | 900 | 0.9052 | 0.8108 | 0.8067 | 0.8063 | 0.8067 |
+ | 0.0246 | 4.19 | 930 | 0.9993 | 0.8017 | 0.7955 | 0.7946 | 0.7955 |
+ | 0.0551 | 4.32 | 960 | 0.9899 | 0.8174 | 0.8123 | 0.8122 | 0.8123 |
+ | 0.0084 | 4.46 | 990 | 0.9954 | 0.8127 | 0.8067 | 0.8066 | 0.8067 |
+ | 0.0049 | 4.59 | 1020 | 0.9912 | 0.8145 | 0.8095 | 0.8093 | 0.8095 |
+ | 0.0217 | 4.73 | 1050 | 0.9957 | 0.8128 | 0.8067 | 0.8067 | 0.8067 |
+ | 0.0144 | 4.86 | 1080 | 1.0042 | 0.8164 | 0.8095 | 0.8100 | 0.8095 |
+ | 0.0276 | 5.0 | 1110 | 1.0027 | 0.8141 | 0.8067 | 0.8073 | 0.8067 |


  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:0b3a32bb504e4e98c4ea232588153dbc20807fcc7d10e3a7882211cb16b53066
+ oid sha256:098e20c330fc9c6fc9e29ac4e68724325b0ddab6f2c605c40292d0121e0fc192
  size 263160068
runs/Apr03_15-46-05_d508c0ef2f9f/events.out.tfevents.1712159217.d508c0ef2f9f.9555.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:6f8319b72c60f3e67ea9c3ff172134dd9c50d26bc6335d590ee8563520661dbc
+ size 30349
runs/Apr03_15-46-05_d508c0ef2f9f/events.out.tfevents.1712174436.d508c0ef2f9f.9555.1 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:00203007aa53631da6be485ef5ce45ee2ffe31451a7ad81f63e1a180283259f2
+ size 560
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:84990d5f97ee8ef3737cb22b4b2951f699ba3948a106d8c797a84b8d582a58f0
+ oid sha256:c564b281f4d86a09a862aa9e6561baff85bf27abedcee7201f2b7210f26de2eb
  size 4920
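
A note on the metrics in the README.md results table: the Recall and Accuracy columns are identical in every row, on both sides of the diff. That pattern is what support-weighted recall produces in single-label classification, since weighted recall reduces algebraically to accuracy. The commit does not say how the metrics were computed, so weighted averaging is an assumption here. The sketch below illustrates the identity in plain Python; the function names and the toy labels are hypothetical and are not taken from the training script.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of examples whose predicted label matches the true label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def weighted_recall(y_true, y_pred):
    """Per-class recall, averaged with each class weighted by its support."""
    support = Counter(y_true)  # number of true examples per class
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)  # true positives per class
    total = len(y_true)
    # (support_c / total) * (hits_c / support_c) simplifies to hits_c / total,
    # so the sum over classes is exactly the accuracy.
    return sum((support[c] / total) * (hits[c] / support[c]) for c in support)


# Toy labels, made up for illustration only.
y_true = [0, 0, 0, 1, 1, 2, 2, 2, 2, 3]
y_pred = [0, 0, 1, 1, 2, 2, 2, 2, 0, 3]

acc = accuracy(y_true, y_pred)
wrec = weighted_recall(y_true, y_pred)
print(f"accuracy={acc:.4f} weighted_recall={wrec:.4f}")
```

If the metrics were instead macro-averaged, Recall and Accuracy would generally differ whenever the classes are imbalanced, so the matching columns are at least consistent with weighted averaging.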