sakren commited on
Commit
ddad4a4
1 Parent(s): a7e1bc2

sakren/debert-imeocap

Browse files
README.md CHANGED
@@ -20,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->
20
 
21
  This model is a fine-tuned version of [microsoft/deberta-v3-base](https://huggingface.co/microsoft/deberta-v3-base) on the None dataset.
22
  It achieves the following results on the evaluation set:
23
- - Loss: 1.3914
24
- - F1: 0.6372
25
- - Precision: 0.6448
26
- - Recall: 0.6365
27
- - Accuracy: 0.6365
28
 
29
  ## Model description
30
 
@@ -55,21 +55,21 @@ The following hyperparameters were used during training:
55
 
56
  | Training Loss | Epoch | Step | Validation Loss | F1 | Precision | Recall | Accuracy |
57
  |:-------------:|:-----:|:----:|:---------------:|:------:|:---------:|:------:|:--------:|
58
- | 1.5405 | 1.0 | 74 | 1.4488 | 0.3206 | 0.2386 | 0.4885 | 0.4885 |
59
- | 1.3156 | 2.0 | 148 | 1.1964 | 0.5541 | 0.5627 | 0.575 | 0.575 |
60
- | 1.0728 | 3.0 | 222 | 1.1077 | 0.6001 | 0.6189 | 0.5981 | 0.5981 |
61
- | 0.9239 | 4.0 | 296 | 1.0742 | 0.6324 | 0.6361 | 0.6365 | 0.6365 |
62
- | 0.7802 | 5.0 | 370 | 1.0834 | 0.6073 | 0.6333 | 0.6058 | 0.6058 |
63
- | 0.661 | 6.0 | 444 | 1.1733 | 0.5984 | 0.6166 | 0.5962 | 0.5962 |
64
- | 0.602 | 7.0 | 518 | 1.1786 | 0.5911 | 0.6193 | 0.5885 | 0.5885 |
65
- | 0.5391 | 8.0 | 592 | 1.2171 | 0.6156 | 0.6251 | 0.6154 | 0.6154 |
66
- | 0.4815 | 9.0 | 666 | 1.2566 | 0.6259 | 0.6399 | 0.625 | 0.625 |
67
- | 0.4548 | 10.0 | 740 | 1.2927 | 0.6233 | 0.6417 | 0.6212 | 0.6212 |
68
- | 0.4538 | 11.0 | 814 | 1.2969 | 0.6385 | 0.6461 | 0.6385 | 0.6385 |
69
- | 0.4119 | 12.0 | 888 | 1.3455 | 0.6376 | 0.6464 | 0.6365 | 0.6365 |
70
- | 0.3968 | 13.0 | 962 | 1.3709 | 0.6304 | 0.6413 | 0.6288 | 0.6288 |
71
- | 0.352 | 14.0 | 1036 | 1.3823 | 0.6246 | 0.6360 | 0.6231 | 0.6231 |
72
- | 0.3551 | 15.0 | 1110 | 1.3914 | 0.6372 | 0.6448 | 0.6365 | 0.6365 |
73
 
74
 
75
  ### Framework versions
 
20
 
21
  This model is a fine-tuned version of [microsoft/deberta-v3-base](https://huggingface.co/microsoft/deberta-v3-base) on the None dataset.
22
  It achieves the following results on the evaluation set:
23
+ - Loss: 1.8660
24
+ - F1: 0.6185
25
+ - Precision: 0.6337
26
+ - Recall: 0.6154
27
+ - Accuracy: 0.6154
28
 
29
  ## Model description
30
 
 
55
 
56
  | Training Loss | Epoch | Step | Validation Loss | F1 | Precision | Recall | Accuracy |
57
  |:-------------:|:-----:|:----:|:---------------:|:------:|:---------:|:------:|:--------:|
58
+ | 0.4637 | 1.0 | 74 | 1.3864 | 0.6129 | 0.6262 | 0.6115 | 0.6115 |
59
+ | 0.3815 | 2.0 | 148 | 1.3801 | 0.6193 | 0.6348 | 0.6173 | 0.6173 |
60
+ | 0.3363 | 3.0 | 222 | 1.6944 | 0.6077 | 0.6297 | 0.6077 | 0.6077 |
61
+ | 0.31 | 4.0 | 296 | 1.6945 | 0.5995 | 0.6285 | 0.5942 | 0.5942 |
62
+ | 0.2885 | 5.0 | 370 | 1.5945 | 0.6218 | 0.6306 | 0.6192 | 0.6192 |
63
+ | 0.2594 | 6.0 | 444 | 1.7662 | 0.6279 | 0.6396 | 0.625 | 0.625 |
64
+ | 0.2319 | 7.0 | 518 | 1.7093 | 0.6210 | 0.6321 | 0.6173 | 0.6173 |
65
+ | 0.2306 | 8.0 | 592 | 1.8068 | 0.6279 | 0.6341 | 0.6288 | 0.6288 |
66
+ | 0.2167 | 9.0 | 666 | 1.7306 | 0.6376 | 0.6444 | 0.6346 | 0.6346 |
67
+ | 0.2158 | 10.0 | 740 | 1.8745 | 0.6262 | 0.6318 | 0.6269 | 0.6269 |
68
+ | 0.222 | 11.0 | 814 | 1.8323 | 0.6200 | 0.6348 | 0.6173 | 0.6173 |
69
+ | 0.2152 | 12.0 | 888 | 1.8576 | 0.6246 | 0.6363 | 0.6212 | 0.6212 |
70
+ | 0.226 | 13.0 | 962 | 1.8880 | 0.6343 | 0.6411 | 0.6308 | 0.6308 |
71
+ | 0.2097 | 14.0 | 1036 | 1.8884 | 0.6152 | 0.6326 | 0.6115 | 0.6115 |
72
+ | 0.2192 | 15.0 | 1110 | 1.8660 | 0.6185 | 0.6337 | 0.6154 | 0.6154 |
73
 
74
 
75
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8ac45bf244570330a2f8f86b6c710b9b12511dc260d8ca87c461a8e14688ccaf
3
  size 737731584
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:57e205298d17fc0458087d58499f8633058cf00d94b7e6c8d118ccbc3336d373
3
  size 737731584
runs/May15_18-08-01_95a092faa389/events.out.tfevents.1715796514.95a092faa389.35.9 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:435449e7015d71bb471b320d0a2f20138ced45cdd7e9afa3116bf3c90f167e08
3
- size 13973
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4d5bfc7cfd1e8f6f5ab27a5294d1826af4f68513fa7e0d51b509665552eb994c
3
+ size 15693
runs/May15_18-08-01_95a092faa389/events.out.tfevents.1715797115.95a092faa389.35.10 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:dbf853198fc0647eaf28f420cd436d9b11952a4e54ca02fd34fcbc7b56f83006
3
+ size 560