joseph10 commited on
Commit
67468c5
1 Parent(s): 9cf405e

End of training

Browse files
Files changed (2) hide show
  1. README.md +34 -14
  2. model.safetensors +1 -1
README.md CHANGED
@@ -1,29 +1,39 @@
1
  ---
2
  license: apache-2.0
3
- base_model: google/bert_uncased_L-2_H-128_A-2
4
  tags:
5
  - generated_from_trainer
6
  datasets:
7
  - hate_speech18
 
 
8
  model-index:
9
- - name: bert-tiny-distilled
10
- results: []
 
 
 
 
 
 
 
 
 
 
 
 
 
11
  ---
12
 
13
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
  should probably proofread and complete it, then remove this comment. -->
15
 
16
- # bert-tiny-distilled
17
 
18
- This model is a fine-tuned version of [google/bert_uncased_L-2_H-128_A-2](https://huggingface.co/google/bert_uncased_L-2_H-128_A-2) on the hate_speech18 dataset.
19
  It achieves the following results on the evaluation set:
20
- - eval_loss: 0.4745
21
- - eval_accuracy: 0.8556
22
- - eval_runtime: 20.4101
23
- - eval_samples_per_second: 107.202
24
- - eval_steps_per_second: 0.882
25
- - epoch: 1.0
26
- - step: 479
27
 
28
  ## Model description
29
 
@@ -42,14 +52,24 @@ More information needed
42
  ### Training hyperparameters
43
 
44
  The following hyperparameters were used during training:
45
- - learning_rate: 7.474176155523857e-05
46
  - train_batch_size: 16
47
  - eval_batch_size: 128
48
- - seed: 23
49
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
50
  - lr_scheduler_type: linear
51
  - num_epochs: 4
52
 
 
 
 
 
 
 
 
 
 
 
53
  ### Framework versions
54
 
55
  - Transformers 4.36.0.dev0
 
1
  ---
2
  license: apache-2.0
3
+ base_model: agvidit1/GoogleTinyBert_HateSpeech_pretrain
4
  tags:
5
  - generated_from_trainer
6
  datasets:
7
  - hate_speech18
8
+ metrics:
9
+ - accuracy
10
  model-index:
11
+ - name: berttiny-hate_speech18-bothpretrained
12
+ results:
13
+ - task:
14
+ name: Text Classification
15
+ type: text-classification
16
+ dataset:
17
+ name: hate_speech18
18
+ type: hate_speech18
19
+ config: default
20
+ split: train
21
+ args: default
22
+ metrics:
23
+ - name: Accuracy
24
+ type: accuracy
25
+ value: 0.879798903107861
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
29
  should probably proofread and complete it, then remove this comment. -->
30
 
31
+ # berttiny-hate_speech18-bothpretrained
32
 
33
+ This model is a fine-tuned version of [agvidit1/GoogleTinyBert_HateSpeech_pretrain](https://huggingface.co/agvidit1/GoogleTinyBert_HateSpeech_pretrain) on the hate_speech18 dataset.
34
  It achieves the following results on the evaluation set:
35
+ - Loss: 0.4682
36
+ - Accuracy: 0.8798
 
 
 
 
 
37
 
38
  ## Model description
39
 
 
52
  ### Training hyperparameters
53
 
54
  The following hyperparameters were used during training:
55
+ - learning_rate: 7.12739424754752e-05
56
  - train_batch_size: 16
57
  - eval_batch_size: 128
58
+ - seed: 34
59
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
60
  - lr_scheduler_type: linear
61
  - num_epochs: 4
62
 
63
+ ### Training results
64
+
65
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy |
66
+ |:-------------:|:-----:|:----:|:---------------:|:--------:|
67
+ | 0.4804 | 1.0 | 479 | 0.4778 | 0.8624 |
68
+ | 0.4592 | 2.0 | 958 | 0.4688 | 0.8761 |
69
+ | 0.4471 | 3.0 | 1437 | 0.4681 | 0.8780 |
70
+ | 0.4405 | 4.0 | 1916 | 0.4682 | 0.8798 |
71
+
72
+
73
  ### Framework versions
74
 
75
  - Transformers 4.36.0.dev0
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7f38383209172b269b8c6ad7965b4d91808345b775de2bb900e2ae03e046066f
3
  size 17550344
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a5cc6cf3fafefe79240f2a01b1f7f238e985c6d29b175003aa8e05df47f9645a
3
  size 17550344