joseph10 commited on
Commit
8ca8b04
1 Parent(s): c877d7a

End of training

Browse files
Files changed (2) hide show
  1. README.md +34 -14
  2. model.safetensors +1 -1
README.md CHANGED
@@ -1,24 +1,39 @@
1
  ---
2
  license: apache-2.0
3
- base_model: google/bert_uncased_L-2_H-128_A-2
4
  tags:
5
  - generated_from_trainer
 
 
6
  metrics:
7
  - accuracy
8
  model-index:
9
- - name: berttiny-hateXplain-parentpretrained
10
- results: []
 
 
 
 
 
 
 
 
 
 
 
 
 
11
  ---
12
 
13
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
  should probably proofread and complete it, then remove this comment. -->
15
 
16
- # berttiny-hateXplain-parentpretrained
17
 
18
- This model is a fine-tuned version of [google/bert_uncased_L-2_H-128_A-2](https://huggingface.co/google/bert_uncased_L-2_H-128_A-2) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 0.2286
21
- - Accuracy: 0.7601
22
 
23
  ## Model description
24
 
@@ -37,21 +52,26 @@ More information needed
37
  ### Training hyperparameters
38
 
39
  The following hyperparameters were used during training:
40
- - learning_rate: 0.0001286744242350192
41
- - train_batch_size: 128
42
- - eval_batch_size: 128
43
  - seed: 33
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
46
- - num_epochs: 7
47
 
48
  ### Training results
49
 
50
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
51
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
52
- | 0.236 | 1.0 | 121 | 0.2296 | 0.7581 |
53
- | 0.2239 | 2.0 | 242 | 0.2274 | 0.7591 |
54
- | 0.2174 | 3.0 | 363 | 0.2286 | 0.7601 |
 
 
 
 
 
55
 
56
 
57
  ### Framework versions
 
1
  ---
2
  license: apache-2.0
3
+ base_model: agvidit1/DistilledBert_HateSpeech_pretrain
4
  tags:
5
  - generated_from_trainer
6
+ datasets:
7
+ - hate_speech18
8
  metrics:
9
  - accuracy
10
  model-index:
11
+ - name: distilbert-hate_speech18
12
+ results:
13
+ - task:
14
+ name: Text Classification
15
+ type: text-classification
16
+ dataset:
17
+ name: hate_speech18
18
+ type: hate_speech18
19
+ config: default
20
+ split: train
21
+ args: default
22
+ metrics:
23
+ - name: Accuracy
24
+ type: accuracy
25
+ value: 0.8587751371115173
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
29
  should probably proofread and complete it, then remove this comment. -->
30
 
31
+ # distilbert-hate_speech18
32
 
33
+ This model is a fine-tuned version of [agvidit1/DistilledBert_HateSpeech_pretrain](https://huggingface.co/agvidit1/DistilledBert_HateSpeech_pretrain) on the hate_speech18 dataset.
34
  It achieves the following results on the evaluation set:
35
+ - Loss: 0.4684
36
+ - Accuracy: 0.8588
37
 
38
  ## Model description
39
 
 
52
  ### Training hyperparameters
53
 
54
  The following hyperparameters were used during training:
55
+ - learning_rate: 5.050180626898551e-06
56
+ - train_batch_size: 32
57
+ - eval_batch_size: 16
58
  - seed: 33
59
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
60
  - lr_scheduler_type: linear
61
+ - num_epochs: 8
62
 
63
  ### Training results
64
 
65
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
66
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
67
+ | 0.4404 | 1.0 | 240 | 0.4670 | 0.8533 |
68
+ | 0.4356 | 2.0 | 480 | 0.4642 | 0.8675 |
69
+ | 0.4303 | 3.0 | 720 | 0.4649 | 0.8748 |
70
+ | 0.4282 | 4.0 | 960 | 0.4694 | 0.8592 |
71
+ | 0.4273 | 5.0 | 1200 | 0.4638 | 0.8729 |
72
+ | 0.4256 | 6.0 | 1440 | 0.4651 | 0.8679 |
73
+ | 0.425 | 7.0 | 1680 | 0.4682 | 0.8560 |
74
+ | 0.4227 | 8.0 | 1920 | 0.4684 | 0.8588 |
75
 
76
 
77
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a2ff1680e642eb08587fcaffbf551ece771841388927ece8dde6fe736be1e30e
3
  size 437964800
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4a8b1d69966685411324f5615071195388adbe0defffff357b2f2a960939a6e5
3
  size 437964800