vsombhane committed
Commit 1845cfe
1 Parent(s): 90f133f

update model card README.md

Files changed (1):
  1. README.md +109 -0
README.md ADDED
---
license: apache-2.0
base_model: distilbert-base-uncased
tags:
- generated_from_trainer
datasets:
- wnut_17
metrics:
- precision
- recall
- f1
- accuracy
model-index:
- name: my_awesome_wnut_model
  results:
  - task:
      name: Token Classification
      type: token-classification
    dataset:
      name: wnut_17
      type: wnut_17
      config: wnut_17
      split: test
      args: wnut_17
    metrics:
    - name: Precision
      type: precision
      value: 0.5524652338811631
    - name: Recall
      type: recall
      value: 0.40500463392029656
    - name: F1
      type: f1
      value: 0.467379679144385
    - name: Accuracy
      type: accuracy
      value: 0.9464751400111154
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# my_awesome_wnut_model

This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the wnut_17 dataset.
It achieves the following results on the evaluation set:
- Loss: 0.4068
- Precision: 0.5525
- Recall: 0.4050
- F1: 0.4674
- Accuracy: 0.9465

## Model description

A [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) encoder with a token-classification head, fine-tuned for named entity recognition. DistilBERT is a smaller, faster distilled variant of BERT, which keeps fine-tuning and inference comparatively lightweight.

## Intended uses & limitations

The model is intended for named entity recognition over noisy, user-generated text such as tweets, matching the WNUT 17 domain. Recall on the test split is modest (0.4050), which is common in the emerging-entity setting, so predictions are best treated as candidates for review rather than final annotations; behaviour on formal text or other domains has not been evaluated here.

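A minimal inference sketch using the `transformers` pipeline API; the model id `my_awesome_wnut_model` is taken from this card's name and would need to be replaced with the full Hub repo id or a local checkpoint directory:

```python
from transformers import pipeline

# Load the fine-tuned checkpoint for token classification.
# "my_awesome_wnut_model" is a placeholder: substitute the full Hub repo id
# (e.g. "<user>/my_awesome_wnut_model") or a local output directory.
ner = pipeline(
    "token-classification",
    model="my_awesome_wnut_model",
    aggregation_strategy="simple",  # merge word pieces into whole entity spans
)

for entity in ner("Heading to the WNUT workshop in Copenhagen with the Hugging Face team!"):
    print(entity["entity_group"], entity["word"], round(float(entity["score"]), 3))
```
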
## Training and evaluation data

Training used the [wnut_17](https://huggingface.co/datasets/wnut_17) dataset (the WNUT 2017 Emerging and Rare Entities shared task), which annotates user-generated text with six entity types: corporation, creative-work, group, location, person, and product. Per the model index above, the reported metrics were computed on the test split.

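For orientation, a short sketch of inspecting the dataset with the `datasets` library; the label names are read from the dataset schema rather than hard-coded:

```python
from datasets import load_dataset

wnut = load_dataset("wnut_17")
print(wnut)  # DatasetDict with train / validation / test splits

# Each example pairs whitespace-split tokens with integer IOB2 tags.
labels = wnut["train"].features["ner_tags"].feature.names
print(labels)  # ['O', 'B-corporation', 'I-corporation', 'B-creative-work', ...]
```
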
## Training procedure

### Training hyperparameters

The following hyperparameters were used during training (a sketch of the corresponding `TrainingArguments` follows the list):
- learning_rate: 2e-05
- train_batch_size: 16
- eval_batch_size: 16
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 20

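As referenced above, a hedged reconstruction of these settings as `TrainingArguments` (API as of Transformers 4.31.0, per the versions below); the output directory and per-epoch `evaluation_strategy` are assumptions inferred from the card, not read from the original training script:

```python
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="my_awesome_wnut_model",  # assumed; matches the model name
    learning_rate=2e-5,
    per_device_train_batch_size=16,
    per_device_eval_batch_size=16,
    num_train_epochs=20,
    seed=42,
    # Assumed from the per-epoch rows in the results table below:
    evaluation_strategy="epoch",
    # Adam betas=(0.9, 0.999), epsilon=1e-8 and the linear LR schedule are
    # the transformers defaults, so no explicit flags are needed.
)
```
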
### Training results

| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
|:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
| No log        | 1.0   | 213  | 0.2742          | 0.5216    | 0.3355 | 0.4083 | 0.9421   |
| No log        | 2.0   | 426  | 0.2810          | 0.6107    | 0.3503 | 0.4452 | 0.9455   |
| 0.0744        | 3.0   | 639  | 0.3305          | 0.6560    | 0.3411 | 0.4488 | 0.9456   |
| 0.0744        | 4.0   | 852  | 0.3382          | 0.5480    | 0.3596 | 0.4342 | 0.9443   |
| 0.0295        | 5.0   | 1065 | 0.3461          | 0.5635    | 0.3865 | 0.4585 | 0.9454   |
| 0.0295        | 6.0   | 1278 | 0.3823          | 0.5744    | 0.3828 | 0.4594 | 0.9454   |
| 0.0295        | 7.0   | 1491 | 0.3404          | 0.5080    | 0.4096 | 0.4536 | 0.9445   |
| 0.0128        | 8.0   | 1704 | 0.3926          | 0.5302    | 0.3744 | 0.4389 | 0.9441   |
| 0.0128        | 9.0   | 1917 | 0.3505          | 0.5033    | 0.4226 | 0.4594 | 0.9449   |
| 0.0071        | 10.0  | 2130 | 0.3825          | 0.5685    | 0.3846 | 0.4588 | 0.9456   |
| 0.0071        | 11.0  | 2343 | 0.3806          | 0.5155    | 0.4171 | 0.4611 | 0.9451   |
| 0.0044        | 12.0  | 2556 | 0.4035          | 0.5422    | 0.3985 | 0.4594 | 0.9454   |
| 0.0044        | 13.0  | 2769 | 0.4106          | 0.5940    | 0.3865 | 0.4683 | 0.9465   |
| 0.0044        | 14.0  | 2982 | 0.4069          | 0.5485    | 0.4032 | 0.4647 | 0.9457   |
| 0.0032        | 15.0  | 3195 | 0.4280          | 0.6029    | 0.3800 | 0.4662 | 0.9466   |
| 0.0032        | 16.0  | 3408 | 0.4049          | 0.5798    | 0.4208 | 0.4876 | 0.9472   |
| 0.0026        | 17.0  | 3621 | 0.4129          | 0.5758    | 0.4013 | 0.4730 | 0.9470   |
| 0.0026        | 18.0  | 3834 | 0.4131          | 0.5731    | 0.4069 | 0.4759 | 0.9469   |
| 0.0021        | 19.0  | 4047 | 0.4074          | 0.5557    | 0.4022 | 0.4667 | 0.9465   |
| 0.0021        | 20.0  | 4260 | 0.4068          | 0.5525    | 0.4050 | 0.4674 | 0.9465   |


### Framework versions

- Transformers 4.31.0
- PyTorch 2.0.1+cu118
- Datasets 2.14.4
- Tokenizers 0.13.3