dtorber commited on
Commit
5723a0b
1 Parent(s): ea918de

Model save

Browse files
README.md CHANGED
@@ -14,10 +14,10 @@ should probably proofread and complete it, then remove this comment. -->
14
 
15
  This model is a fine-tuned version of [NLP-LTU/bertweet-large-sexism-detector](https://huggingface.co/NLP-LTU/bertweet-large-sexism-detector) on the None dataset.
16
  It achieves the following results on the evaluation set:
17
- - Loss: 1.6489
18
- - Icm: 0.0711
19
- - Icmnorm: 0.5361
20
- - Fmeasure: 0.6886
21
 
22
  ## Model description
23
 
@@ -36,23 +36,32 @@ More information needed
36
  ### Training hyperparameters
37
 
38
  The following hyperparameters were used during training:
39
- - learning_rate: 2e-05
40
  - train_batch_size: 8
41
  - eval_batch_size: 8
42
  - seed: 42
43
  - distributed_type: multi-GPU
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
46
- - num_epochs: 3
47
  - mixed_precision_training: Native AMP
48
 
49
  ### Training results
50
 
51
  | Training Loss | Epoch | Step | Validation Loss | Icm | Icmnorm | Fmeasure |
52
  |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|:--------:|
53
- | 0.6499 | 1.0 | 771 | 0.5820 | 0.0093 | 0.5047 | 0.6732 |
54
- | 0.4108 | 2.0 | 1542 | 0.8449 | 0.0401 | 0.5204 | 0.6798 |
55
- | 0.2617 | 3.0 | 2313 | 1.6489 | 0.0711 | 0.5361 | 0.6886 |
 
 
 
 
 
 
 
 
 
56
 
57
 
58
  ### Framework versions
 
14
 
15
  This model is a fine-tuned version of [NLP-LTU/bertweet-large-sexism-detector](https://huggingface.co/NLP-LTU/bertweet-large-sexism-detector) on the None dataset.
16
  It achieves the following results on the evaluation set:
17
+ - Loss: 0.7233
18
+ - Icm: 0.1739
19
+ - Icmnorm: 0.5882
20
+ - Fmeasure: 0.7274
21
 
22
  ## Model description
23
 
 
36
  ### Training hyperparameters
37
 
38
  The following hyperparameters were used during training:
39
+ - learning_rate: 1e-06
40
  - train_batch_size: 8
41
  - eval_batch_size: 8
42
  - seed: 42
43
  - distributed_type: multi-GPU
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
46
+ - num_epochs: 12
47
  - mixed_precision_training: Native AMP
48
 
49
  ### Training results
50
 
51
  | Training Loss | Epoch | Step | Validation Loss | Icm | Icmnorm | Fmeasure |
52
  |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|:--------:|
53
+ | No log | 1.0 | 407 | 0.5823 | 0.0856 | 0.5434 | 0.6977 |
54
+ | 0.7182 | 2.0 | 814 | 0.5689 | 0.0427 | 0.5217 | 0.6843 |
55
+ | 0.5513 | 3.0 | 1221 | 0.5684 | 0.0904 | 0.5459 | 0.6999 |
56
+ | 0.4941 | 4.0 | 1628 | 0.5851 | 0.1166 | 0.5591 | 0.7073 |
57
+ | 0.4613 | 5.0 | 2035 | 0.6129 | 0.1405 | 0.5713 | 0.7164 |
58
+ | 0.4613 | 6.0 | 2442 | 0.6221 | 0.1381 | 0.5701 | 0.7152 |
59
+ | 0.4221 | 7.0 | 2849 | 0.6397 | 0.1548 | 0.5785 | 0.7207 |
60
+ | 0.3863 | 8.0 | 3256 | 0.6702 | 0.1691 | 0.5858 | 0.7249 |
61
+ | 0.3673 | 9.0 | 3663 | 0.6797 | 0.1572 | 0.5797 | 0.7219 |
62
+ | 0.35 | 10.0 | 4070 | 0.7050 | 0.1572 | 0.5797 | 0.7219 |
63
+ | 0.35 | 11.0 | 4477 | 0.7273 | 0.1715 | 0.5870 | 0.7262 |
64
+ | 0.3388 | 12.0 | 4884 | 0.7233 | 0.1739 | 0.5882 | 0.7274 |
65
 
66
 
67
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d4d5681026b7c543afd6beaa5d130e2ca742bddd1d497da142b18576c02e0030
3
  size 1421495416
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b11ccaf8adb829ca545369400bc0f5e730086173fbedbfaa56036a435a6f8d69
3
  size 1421495416
runs/Apr25_15-50-44_tardis/events.out.tfevents.1714053047.tardis.149981.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a85d5c918f5fe806dd9f07f3a750cf43bc0182a3f237f3177c1796754e3fef17
3
- size 11335
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8efb0cb88303f23c3ea1f1bec2597e1be45f621de0ddd99a156adff5bf3e6473
3
+ size 12321
runs/Apr25_15-50-44_tardis/events.out.tfevents.1714054784.tardis.149981.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:04609063a82b58bd8af36fc2c51fd66eb436964c390cfe5d7d928e57a909497d
3
+ size 509