Isotonic commited on
Commit
daaddbc
1 Parent(s): c748939

Model save

Browse files
README.md ADDED
@@ -0,0 +1,84 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ base_model: microsoft/deberta-v3-base
4
+ tags:
5
+ - generated_from_trainer
6
+ model-index:
7
+ - name: deberta-v3-base_finetuned_bluegennx_run2
8
+ results: []
9
+ ---
10
+
11
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
12
+ should probably proofread and complete it, then remove this comment. -->
13
+
14
+ # deberta-v3-base_finetuned_bluegennx_run2
15
+
16
+ This model is a fine-tuned version of [microsoft/deberta-v3-base](https://huggingface.co/microsoft/deberta-v3-base) on an unknown dataset.
17
+ It achieves the following results on the evaluation set:
18
+ - Loss: 0.0737
19
+ - Overall Precision: 0.7273
20
+ - Overall Recall: 0.7428
21
+ - Overall F1: 0.7350
22
+ - Overall Accuracy: 0.9752
23
+ - Aadhar F1: 0.8128
24
+ - Age F1: 0.4700
25
+ - City F1: 0.7686
26
+ - Country F1: 0.7226
27
+ - Creditcardcvv F1: 0.7531
28
+ - Creditcardnumber F1: 0.8109
29
+ - Date F1: 0.7126
30
+ - Dateofbirth F1: 0.7262
31
+ - Email F1: 0.6935
32
+ - Expiry F1: 0.6621
33
+ - Organization F1: 0.7623
34
+ - Pan F1: 0.7772
35
+ - Person F1: 0.7568
36
+ - Phonenumber F1: 0.8194
37
+ - Secondary F1: 0.6278
38
+ - State F1: 0.7735
39
+ - Time F1: 0.7856
40
+ - Url F1: 0.5824
41
+
42
+ ## Model description
43
+
44
+ More information needed
45
+
46
+ ## Intended uses & limitations
47
+
48
+ More information needed
49
+
50
+ ## Training and evaluation data
51
+
52
+ More information needed
53
+
54
+ ## Training procedure
55
+
56
+ ### Training hyperparameters
57
+
58
+ The following hyperparameters were used during training:
59
+ - learning_rate: 5e-05
60
+ - train_batch_size: 8
61
+ - eval_batch_size: 8
62
+ - seed: 42
63
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
64
+ - lr_scheduler_type: cosine_with_restarts
65
+ - lr_scheduler_warmup_ratio: 0.2
66
+ - num_epochs: 5
67
+
68
+ ### Training results
69
+
70
+ | Training Loss | Epoch | Step | Validation Loss | Overall Precision | Overall Recall | Overall F1 | Overall Accuracy | Aadhar F1 | Age F1 | City F1 | Country F1 | Creditcardcvv F1 | Creditcardnumber F1 | Date F1 | Dateofbirth F1 | Email F1 | Expiry F1 | Organization F1 | Pan F1 | Person F1 | Phonenumber F1 | Secondary F1 | State F1 | Time F1 | Url F1 |
71
+ |:-------------:|:-----:|:-----:|:---------------:|:-----------------:|:--------------:|:----------:|:----------------:|:---------:|:------:|:-------:|:----------:|:----------------:|:-------------------:|:-------:|:--------------:|:--------:|:---------:|:---------------:|:------:|:---------:|:--------------:|:------------:|:--------:|:-------:|:------:|
72
+ | 0.1576 | 1.0 | 3893 | 0.1289 | 0.5166 | 0.5445 | 0.5302 | 0.9559 | 0.6073 | 0.1745 | 0.5790 | 0.5463 | 0.5707 | 0.6816 | 0.4834 | 0.4489 | 0.4808 | 0.5009 | 0.6085 | 0.5667 | 0.5383 | 0.5811 | 0.4273 | 0.6592 | 0.5824 | 0.2314 |
73
+ | 0.1075 | 2.0 | 7786 | 0.1151 | 0.5991 | 0.6001 | 0.5996 | 0.9610 | 0.7012 | 0.2439 | 0.6649 | 0.5689 | 0.6735 | 0.6950 | 0.5229 | 0.6065 | 0.5176 | 0.4904 | 0.6910 | 0.7248 | 0.5493 | 0.6810 | 0.5406 | 0.6382 | 0.6816 | 0.3492 |
74
+ | 0.0804 | 3.0 | 11679 | 0.0841 | 0.6783 | 0.7045 | 0.6911 | 0.9709 | 0.7826 | 0.3554 | 0.7372 | 0.6909 | 0.7276 | 0.7621 | 0.6459 | 0.7272 | 0.6303 | 0.6235 | 0.7329 | 0.7324 | 0.6816 | 0.7855 | 0.5912 | 0.7620 | 0.7529 | 0.4652 |
75
+ | 0.0532 | 4.0 | 15572 | 0.0737 | 0.7273 | 0.7428 | 0.7350 | 0.9752 | 0.8128 | 0.4700 | 0.7686 | 0.7226 | 0.7531 | 0.8109 | 0.7126 | 0.7262 | 0.6935 | 0.6621 | 0.7623 | 0.7772 | 0.7568 | 0.8194 | 0.6278 | 0.7735 | 0.7856 | 0.5824 |
76
+ | 0.0381 | 5.0 | 19465 | 0.0753 | 0.7372 | 0.7589 | 0.7479 | 0.9768 | 0.8278 | 0.4925 | 0.7705 | 0.7185 | 0.7832 | 0.8258 | 0.7231 | 0.7605 | 0.7027 | 0.6676 | 0.7700 | 0.8011 | 0.7591 | 0.8305 | 0.6558 | 0.7828 | 0.7978 | 0.6144 |
77
+
78
+
79
+ ### Framework versions
80
+
81
+ - Transformers 4.38.2
82
+ - Pytorch 2.1.0+cu121
83
+ - Datasets 2.18.0
84
+ - Tokenizers 0.15.2
all_results.json ADDED
@@ -0,0 +1,36 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "epoch": 5.0,
3
+ "eval_AADHAR_f1": 0.8128376231331428,
4
+ "eval_AGE_f1": 0.4700162074554295,
5
+ "eval_CITY_f1": 0.7685916078105525,
6
+ "eval_COUNTRY_f1": 0.7226140103432064,
7
+ "eval_CREDITCARDCVV_f1": 0.75306823760432,
8
+ "eval_CREDITCARDNUMBER_f1": 0.8108559498956159,
9
+ "eval_DATEOFBIRTH_f1": 0.7261687917425622,
10
+ "eval_DATE_f1": 0.7126321087065928,
11
+ "eval_EMAIL_f1": 0.6934642528382939,
12
+ "eval_EXPIRY_f1": 0.6621376811594203,
13
+ "eval_ORGANIZATION_f1": 0.7623318385650225,
14
+ "eval_PAN_f1": 0.7771563143950544,
15
+ "eval_PERSON_f1": 0.7567690557451651,
16
+ "eval_PHONENUMBER_f1": 0.8193832599118942,
17
+ "eval_SECONDARY_f1": 0.6278339727938611,
18
+ "eval_STATE_f1": 0.7735243269943195,
19
+ "eval_TIME_f1": 0.7856,
20
+ "eval_URL_f1": 0.5823902842947833,
21
+ "eval_loss": 0.07368036359548569,
22
+ "eval_overall_accuracy": 0.9751581288663377,
23
+ "eval_overall_f1": 0.7349767992514655,
24
+ "eval_overall_precision": 0.7273242630385488,
25
+ "eval_overall_recall": 0.7427920799722104,
26
+ "eval_runtime": 85.8812,
27
+ "eval_samples": 7785,
28
+ "eval_samples_per_second": 90.648,
29
+ "eval_steps_per_second": 11.341,
30
+ "total_flos": 2.3463823353089932e+16,
31
+ "train_loss": 0.13554635967780837,
32
+ "train_runtime": 4621.5139,
33
+ "train_samples": 31139,
34
+ "train_samples_per_second": 33.689,
35
+ "train_steps_per_second": 4.212
36
+ }
eval_results.json ADDED
@@ -0,0 +1,30 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "epoch": 5.0,
3
+ "eval_AADHAR_f1": 0.8128376231331428,
4
+ "eval_AGE_f1": 0.4700162074554295,
5
+ "eval_CITY_f1": 0.7685916078105525,
6
+ "eval_COUNTRY_f1": 0.7226140103432064,
7
+ "eval_CREDITCARDCVV_f1": 0.75306823760432,
8
+ "eval_CREDITCARDNUMBER_f1": 0.8108559498956159,
9
+ "eval_DATEOFBIRTH_f1": 0.7261687917425622,
10
+ "eval_DATE_f1": 0.7126321087065928,
11
+ "eval_EMAIL_f1": 0.6934642528382939,
12
+ "eval_EXPIRY_f1": 0.6621376811594203,
13
+ "eval_ORGANIZATION_f1": 0.7623318385650225,
14
+ "eval_PAN_f1": 0.7771563143950544,
15
+ "eval_PERSON_f1": 0.7567690557451651,
16
+ "eval_PHONENUMBER_f1": 0.8193832599118942,
17
+ "eval_SECONDARY_f1": 0.6278339727938611,
18
+ "eval_STATE_f1": 0.7735243269943195,
19
+ "eval_TIME_f1": 0.7856,
20
+ "eval_URL_f1": 0.5823902842947833,
21
+ "eval_loss": 0.07368036359548569,
22
+ "eval_overall_accuracy": 0.9751581288663377,
23
+ "eval_overall_f1": 0.7349767992514655,
24
+ "eval_overall_precision": 0.7273242630385488,
25
+ "eval_overall_recall": 0.7427920799722104,
26
+ "eval_runtime": 85.8812,
27
+ "eval_samples": 7785,
28
+ "eval_samples_per_second": 90.648,
29
+ "eval_steps_per_second": 11.341
30
+ }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9325917e4fe4e9569a12e1e777211d68a8e0adce8460b91582f8bdd4357d409d
3
  size 735464404
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:91af9d520039b009f3684f3a46629166788e37b7f20ab5538f9f566ad665ad01
3
  size 735464404
train_results.json ADDED
@@ -0,0 +1,9 @@
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "epoch": 5.0,
3
+ "total_flos": 2.3463823353089932e+16,
4
+ "train_loss": 0.13554635967780837,
5
+ "train_runtime": 4621.5139,
6
+ "train_samples": 31139,
7
+ "train_samples_per_second": 33.689,
8
+ "train_steps_per_second": 4.212
9
+ }
trainer_state.json ADDED
@@ -0,0 +1,476 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "best_metric": 0.07368036359548569,
3
+ "best_model_checkpoint": "deberta-v3-base_finetuned_bluegennx_run2/checkpoint-15572",
4
+ "epoch": 5.0,
5
+ "eval_steps": 500,
6
+ "global_step": 19465,
7
+ "is_hyper_param_search": false,
8
+ "is_local_process_zero": true,
9
+ "is_world_process_zero": true,
10
+ "log_history": [
11
+ {
12
+ "epoch": 0.13,
13
+ "grad_norm": 4.968059539794922,
14
+ "learning_rate": 6.421782686873877e-06,
15
+ "loss": 1.5207,
16
+ "step": 500
17
+ },
18
+ {
19
+ "epoch": 0.26,
20
+ "grad_norm": 1.833338975906372,
21
+ "learning_rate": 1.2843565373747755e-05,
22
+ "loss": 0.3548,
23
+ "step": 1000
24
+ },
25
+ {
26
+ "epoch": 0.39,
27
+ "grad_norm": 3.3004016876220703,
28
+ "learning_rate": 1.926534806062163e-05,
29
+ "loss": 0.2465,
30
+ "step": 1500
31
+ },
32
+ {
33
+ "epoch": 0.51,
34
+ "grad_norm": 1.2553385496139526,
35
+ "learning_rate": 2.568713074749551e-05,
36
+ "loss": 0.2083,
37
+ "step": 2000
38
+ },
39
+ {
40
+ "epoch": 0.64,
41
+ "grad_norm": 1.1466569900512695,
42
+ "learning_rate": 3.210891343436938e-05,
43
+ "loss": 0.1793,
44
+ "step": 2500
45
+ },
46
+ {
47
+ "epoch": 0.77,
48
+ "grad_norm": 1.0831718444824219,
49
+ "learning_rate": 3.853069612124326e-05,
50
+ "loss": 0.1638,
51
+ "step": 3000
52
+ },
53
+ {
54
+ "epoch": 0.9,
55
+ "grad_norm": 0.8673099279403687,
56
+ "learning_rate": 4.495247880811714e-05,
57
+ "loss": 0.1576,
58
+ "step": 3500
59
+ },
60
+ {
61
+ "epoch": 1.0,
62
+ "eval_AADHAR_f1": 0.6073298429319373,
63
+ "eval_AGE_f1": 0.17452006980802792,
64
+ "eval_CITY_f1": 0.5790040957102823,
65
+ "eval_COUNTRY_f1": 0.5463393054633932,
66
+ "eval_CREDITCARDCVV_f1": 0.570738773539353,
67
+ "eval_CREDITCARDNUMBER_f1": 0.6815920398009951,
68
+ "eval_DATEOFBIRTH_f1": 0.44886975242195915,
69
+ "eval_DATE_f1": 0.48340248962655596,
70
+ "eval_EMAIL_f1": 0.4808362369337979,
71
+ "eval_EXPIRY_f1": 0.5009433962264151,
72
+ "eval_ORGANIZATION_f1": 0.6085458304617505,
73
+ "eval_PAN_f1": 0.5666666666666667,
74
+ "eval_PERSON_f1": 0.5382635150947812,
75
+ "eval_PHONENUMBER_f1": 0.5810865191146882,
76
+ "eval_SECONDARY_f1": 0.4273062730627306,
77
+ "eval_STATE_f1": 0.6591928251121076,
78
+ "eval_TIME_f1": 0.5823575331772053,
79
+ "eval_URL_f1": 0.2313599207332177,
80
+ "eval_loss": 0.1289273202419281,
81
+ "eval_overall_accuracy": 0.9558582980005097,
82
+ "eval_overall_f1": 0.530216476247745,
83
+ "eval_overall_precision": 0.5166440839345223,
84
+ "eval_overall_recall": 0.5445212088463468,
85
+ "eval_runtime": 86.6617,
86
+ "eval_samples_per_second": 89.832,
87
+ "eval_steps_per_second": 11.239,
88
+ "step": 3893
89
+ },
90
+ {
91
+ "epoch": 1.03,
92
+ "grad_norm": 1.2435287237167358,
93
+ "learning_rate": 4.9994175325525815e-05,
94
+ "loss": 0.1456,
95
+ "step": 4000
96
+ },
97
+ {
98
+ "epoch": 1.16,
99
+ "grad_norm": 2.9733986854553223,
100
+ "learning_rate": 4.981277857643215e-05,
101
+ "loss": 0.1351,
102
+ "step": 4500
103
+ },
104
+ {
105
+ "epoch": 1.28,
106
+ "grad_norm": 0.6477993130683899,
107
+ "learning_rate": 4.9379116203601614e-05,
108
+ "loss": 0.1314,
109
+ "step": 5000
110
+ },
111
+ {
112
+ "epoch": 1.41,
113
+ "grad_norm": 0.5392288565635681,
114
+ "learning_rate": 4.869759714933127e-05,
115
+ "loss": 0.1209,
116
+ "step": 5500
117
+ },
118
+ {
119
+ "epoch": 1.54,
120
+ "grad_norm": 1.1792165040969849,
121
+ "learning_rate": 4.77751502559023e-05,
122
+ "loss": 0.1198,
123
+ "step": 6000
124
+ },
125
+ {
126
+ "epoch": 1.67,
127
+ "grad_norm": 0.7067626714706421,
128
+ "learning_rate": 4.662115382168699e-05,
129
+ "loss": 0.1123,
130
+ "step": 6500
131
+ },
132
+ {
133
+ "epoch": 1.8,
134
+ "grad_norm": 2.738388776779175,
135
+ "learning_rate": 4.524734025421763e-05,
136
+ "loss": 0.108,
137
+ "step": 7000
138
+ },
139
+ {
140
+ "epoch": 1.93,
141
+ "grad_norm": 0.5796131491661072,
142
+ "learning_rate": 4.366767678958705e-05,
143
+ "loss": 0.1075,
144
+ "step": 7500
145
+ },
146
+ {
147
+ "epoch": 2.0,
148
+ "eval_AADHAR_f1": 0.7011530071673419,
149
+ "eval_AGE_f1": 0.24390243902439024,
150
+ "eval_CITY_f1": 0.6648512777545037,
151
+ "eval_COUNTRY_f1": 0.5688849321166258,
152
+ "eval_CREDITCARDCVV_f1": 0.673469387755102,
153
+ "eval_CREDITCARDNUMBER_f1": 0.6950354609929078,
154
+ "eval_DATEOFBIRTH_f1": 0.6064516129032257,
155
+ "eval_DATE_f1": 0.522875816993464,
156
+ "eval_EMAIL_f1": 0.5175817396668723,
157
+ "eval_EXPIRY_f1": 0.4903722721437741,
158
+ "eval_ORGANIZATION_f1": 0.6910175177040625,
159
+ "eval_PAN_f1": 0.7247842903897649,
160
+ "eval_PERSON_f1": 0.5492757739278614,
161
+ "eval_PHONENUMBER_f1": 0.6809864757358791,
162
+ "eval_SECONDARY_f1": 0.5405599425699928,
163
+ "eval_STATE_f1": 0.6381724392041267,
164
+ "eval_TIME_f1": 0.6816214088941362,
165
+ "eval_URL_f1": 0.3492149431510558,
166
+ "eval_loss": 0.115125373005867,
167
+ "eval_overall_accuracy": 0.9609531753203123,
168
+ "eval_overall_f1": 0.5995951026703943,
169
+ "eval_overall_precision": 0.5990522422561257,
170
+ "eval_overall_recall": 0.6001389478559573,
171
+ "eval_runtime": 85.274,
172
+ "eval_samples_per_second": 91.294,
173
+ "eval_steps_per_second": 11.422,
174
+ "step": 7786
175
+ },
176
+ {
177
+ "epoch": 2.05,
178
+ "grad_norm": 2.7306056022644043,
179
+ "learning_rate": 4.189822349087813e-05,
180
+ "loss": 0.0973,
181
+ "step": 8000
182
+ },
183
+ {
184
+ "epoch": 2.18,
185
+ "grad_norm": 0.8707060813903809,
186
+ "learning_rate": 3.99569699693187e-05,
187
+ "loss": 0.0872,
188
+ "step": 8500
189
+ },
190
+ {
191
+ "epoch": 2.31,
192
+ "grad_norm": 1.4237110614776611,
193
+ "learning_rate": 3.786365248817888e-05,
194
+ "loss": 0.0892,
195
+ "step": 9000
196
+ },
197
+ {
198
+ "epoch": 2.44,
199
+ "grad_norm": 0.9175803065299988,
200
+ "learning_rate": 3.563955330887217e-05,
201
+ "loss": 0.0809,
202
+ "step": 9500
203
+ },
204
+ {
205
+ "epoch": 2.57,
206
+ "grad_norm": 2.0962719917297363,
207
+ "learning_rate": 3.330728431926079e-05,
208
+ "loss": 0.0816,
209
+ "step": 10000
210
+ },
211
+ {
212
+ "epoch": 2.7,
213
+ "grad_norm": 0.6415413618087769,
214
+ "learning_rate": 3.089055714396487e-05,
215
+ "loss": 0.0818,
216
+ "step": 10500
217
+ },
218
+ {
219
+ "epoch": 2.83,
220
+ "grad_norm": 1.0105106830596924,
221
+ "learning_rate": 2.8413942073909294e-05,
222
+ "loss": 0.0833,
223
+ "step": 11000
224
+ },
225
+ {
226
+ "epoch": 2.95,
227
+ "grad_norm": 0.6718979477882385,
228
+ "learning_rate": 2.5902618266014323e-05,
229
+ "loss": 0.0804,
230
+ "step": 11500
231
+ },
232
+ {
233
+ "epoch": 3.0,
234
+ "eval_AADHAR_f1": 0.7825545171339564,
235
+ "eval_AGE_f1": 0.3553875236294896,
236
+ "eval_CITY_f1": 0.7371663244353183,
237
+ "eval_COUNTRY_f1": 0.6909004514136373,
238
+ "eval_CREDITCARDCVV_f1": 0.7276190476190476,
239
+ "eval_CREDITCARDNUMBER_f1": 0.7621000820344543,
240
+ "eval_DATEOFBIRTH_f1": 0.7271589486858574,
241
+ "eval_DATE_f1": 0.645884072089625,
242
+ "eval_EMAIL_f1": 0.6303360581289738,
243
+ "eval_EXPIRY_f1": 0.6235446313065977,
244
+ "eval_ORGANIZATION_f1": 0.7328818660647103,
245
+ "eval_PAN_f1": 0.7323537885335638,
246
+ "eval_PERSON_f1": 0.6816084377059987,
247
+ "eval_PHONENUMBER_f1": 0.7855153203342619,
248
+ "eval_SECONDARY_f1": 0.5911859548548907,
249
+ "eval_STATE_f1": 0.7619970916141541,
250
+ "eval_TIME_f1": 0.7529128163921254,
251
+ "eval_URL_f1": 0.4651810584958217,
252
+ "eval_loss": 0.0840676948428154,
253
+ "eval_overall_accuracy": 0.9709089224068025,
254
+ "eval_overall_f1": 0.6911280245370895,
255
+ "eval_overall_precision": 0.6782860752907949,
256
+ "eval_overall_recall": 0.7044656297039639,
257
+ "eval_runtime": 86.0556,
258
+ "eval_samples_per_second": 90.465,
259
+ "eval_steps_per_second": 11.318,
260
+ "step": 11679
261
+ },
262
+ {
263
+ "epoch": 3.08,
264
+ "grad_norm": 0.6749424338340759,
265
+ "learning_rate": 2.3382117752690293e-05,
266
+ "loss": 0.0678,
267
+ "step": 12000
268
+ },
269
+ {
270
+ "epoch": 3.21,
271
+ "grad_norm": 0.7351276278495789,
272
+ "learning_rate": 2.0878065863731047e-05,
273
+ "loss": 0.0629,
274
+ "step": 12500
275
+ },
276
+ {
277
+ "epoch": 3.34,
278
+ "grad_norm": 2.0415396690368652,
279
+ "learning_rate": 1.841592069967507e-05,
280
+ "loss": 0.0589,
281
+ "step": 13000
282
+ },
283
+ {
284
+ "epoch": 3.47,
285
+ "grad_norm": 2.5976686477661133,
286
+ "learning_rate": 1.602071430534669e-05,
287
+ "loss": 0.055,
288
+ "step": 13500
289
+ },
290
+ {
291
+ "epoch": 3.6,
292
+ "grad_norm": 0.5724570155143738,
293
+ "learning_rate": 1.3716798175004491e-05,
294
+ "loss": 0.0567,
295
+ "step": 14000
296
+ },
297
+ {
298
+ "epoch": 3.72,
299
+ "grad_norm": 0.33804842829704285,
300
+ "learning_rate": 1.1527595676485656e-05,
301
+ "loss": 0.0551,
302
+ "step": 14500
303
+ },
304
+ {
305
+ "epoch": 3.85,
306
+ "grad_norm": 0.36113670468330383,
307
+ "learning_rate": 9.47536391139105e-06,
308
+ "loss": 0.0534,
309
+ "step": 15000
310
+ },
311
+ {
312
+ "epoch": 3.98,
313
+ "grad_norm": 0.837962806224823,
314
+ "learning_rate": 7.580967432421968e-06,
315
+ "loss": 0.0532,
316
+ "step": 15500
317
+ },
318
+ {
319
+ "epoch": 4.0,
320
+ "eval_AADHAR_f1": 0.8128376231331428,
321
+ "eval_AGE_f1": 0.4700162074554295,
322
+ "eval_CITY_f1": 0.7685916078105525,
323
+ "eval_COUNTRY_f1": 0.7226140103432064,
324
+ "eval_CREDITCARDCVV_f1": 0.75306823760432,
325
+ "eval_CREDITCARDNUMBER_f1": 0.8108559498956159,
326
+ "eval_DATEOFBIRTH_f1": 0.7261687917425622,
327
+ "eval_DATE_f1": 0.7126321087065928,
328
+ "eval_EMAIL_f1": 0.6934642528382939,
329
+ "eval_EXPIRY_f1": 0.6621376811594203,
330
+ "eval_ORGANIZATION_f1": 0.7623318385650225,
331
+ "eval_PAN_f1": 0.7771563143950544,
332
+ "eval_PERSON_f1": 0.7567690557451651,
333
+ "eval_PHONENUMBER_f1": 0.8193832599118942,
334
+ "eval_SECONDARY_f1": 0.6278339727938611,
335
+ "eval_STATE_f1": 0.7735243269943195,
336
+ "eval_TIME_f1": 0.7856,
337
+ "eval_URL_f1": 0.5823902842947833,
338
+ "eval_loss": 0.07368036359548569,
339
+ "eval_overall_accuracy": 0.9751581288663377,
340
+ "eval_overall_f1": 0.7349767992514655,
341
+ "eval_overall_precision": 0.7273242630385488,
342
+ "eval_overall_recall": 0.7427920799722104,
343
+ "eval_runtime": 85.6115,
344
+ "eval_samples_per_second": 90.934,
345
+ "eval_steps_per_second": 11.377,
346
+ "step": 15572
347
+ },
348
+ {
349
+ "epoch": 4.11,
350
+ "grad_norm": 0.788645327091217,
351
+ "learning_rate": 5.863666118430778e-06,
352
+ "loss": 0.0442,
353
+ "step": 16000
354
+ },
355
+ {
356
+ "epoch": 4.24,
357
+ "grad_norm": 0.3516280949115753,
358
+ "learning_rate": 4.340919363809101e-06,
359
+ "loss": 0.0406,
360
+ "step": 16500
361
+ },
362
+ {
363
+ "epoch": 4.37,
364
+ "grad_norm": 0.3513507544994354,
365
+ "learning_rate": 3.028208572973809e-06,
366
+ "loss": 0.0402,
367
+ "step": 17000
368
+ },
369
+ {
370
+ "epoch": 4.5,
371
+ "grad_norm": 1.3249739408493042,
372
+ "learning_rate": 1.938879764607007e-06,
373
+ "loss": 0.0388,
374
+ "step": 17500
375
+ },
376
+ {
377
+ "epoch": 4.62,
378
+ "grad_norm": 0.6633570194244385,
379
+ "learning_rate": 1.0840078858554203e-06,
380
+ "loss": 0.0414,
381
+ "step": 18000
382
+ },
383
+ {
384
+ "epoch": 4.75,
385
+ "grad_norm": 0.98424232006073,
386
+ "learning_rate": 4.7228421597436956e-07,
387
+ "loss": 0.0414,
388
+ "step": 18500
389
+ },
390
+ {
391
+ "epoch": 4.88,
392
+ "grad_norm": 0.834293007850647,
393
+ "learning_rate": 1.0992800415684512e-07,
394
+ "loss": 0.0381,
395
+ "step": 19000
396
+ },
397
+ {
398
+ "epoch": 5.0,
399
+ "eval_AADHAR_f1": 0.8277830637488106,
400
+ "eval_AGE_f1": 0.4925124792013311,
401
+ "eval_CITY_f1": 0.7704582651391162,
402
+ "eval_COUNTRY_f1": 0.7185407527129993,
403
+ "eval_CREDITCARDCVV_f1": 0.7832031250000001,
404
+ "eval_CREDITCARDNUMBER_f1": 0.8258333333333333,
405
+ "eval_DATEOFBIRTH_f1": 0.7604938271604939,
406
+ "eval_DATE_f1": 0.7231004589495156,
407
+ "eval_EMAIL_f1": 0.7027190332326284,
408
+ "eval_EXPIRY_f1": 0.6675651392632524,
409
+ "eval_ORGANIZATION_f1": 0.7700453857791226,
410
+ "eval_PAN_f1": 0.8010501750291715,
411
+ "eval_PERSON_f1": 0.7590744101633393,
412
+ "eval_PHONENUMBER_f1": 0.8305220883534136,
413
+ "eval_SECONDARY_f1": 0.655794587092297,
414
+ "eval_STATE_f1": 0.7827788649706457,
415
+ "eval_TIME_f1": 0.797752808988764,
416
+ "eval_URL_f1": 0.6144478272903404,
417
+ "eval_loss": 0.07529246807098389,
418
+ "eval_overall_accuracy": 0.976808924723709,
419
+ "eval_overall_f1": 0.7478842972063214,
420
+ "eval_overall_precision": 0.7371597810602084,
421
+ "eval_overall_recall": 0.7589254699139295,
422
+ "eval_runtime": 85.7339,
423
+ "eval_samples_per_second": 90.804,
424
+ "eval_steps_per_second": 11.361,
425
+ "step": 19465
426
+ },
427
+ {
428
+ "epoch": 5.0,
429
+ "step": 19465,
430
+ "total_flos": 2.3463823353089932e+16,
431
+ "train_loss": 0.13554635967780837,
432
+ "train_runtime": 4621.5139,
433
+ "train_samples_per_second": 33.689,
434
+ "train_steps_per_second": 4.212
435
+ },
436
+ {
437
+ "epoch": 5.0,
438
+ "eval_AADHAR_f1": 0.8128376231331428,
439
+ "eval_AGE_f1": 0.4700162074554295,
440
+ "eval_CITY_f1": 0.7685916078105525,
441
+ "eval_COUNTRY_f1": 0.7226140103432064,
442
+ "eval_CREDITCARDCVV_f1": 0.75306823760432,
443
+ "eval_CREDITCARDNUMBER_f1": 0.8108559498956159,
444
+ "eval_DATEOFBIRTH_f1": 0.7261687917425622,
445
+ "eval_DATE_f1": 0.7126321087065928,
446
+ "eval_EMAIL_f1": 0.6934642528382939,
447
+ "eval_EXPIRY_f1": 0.6621376811594203,
448
+ "eval_ORGANIZATION_f1": 0.7623318385650225,
449
+ "eval_PAN_f1": 0.7771563143950544,
450
+ "eval_PERSON_f1": 0.7567690557451651,
451
+ "eval_PHONENUMBER_f1": 0.8193832599118942,
452
+ "eval_SECONDARY_f1": 0.6278339727938611,
453
+ "eval_STATE_f1": 0.7735243269943195,
454
+ "eval_TIME_f1": 0.7856,
455
+ "eval_URL_f1": 0.5823902842947833,
456
+ "eval_loss": 0.07368036359548569,
457
+ "eval_overall_accuracy": 0.9751581288663377,
458
+ "eval_overall_f1": 0.7349767992514655,
459
+ "eval_overall_precision": 0.7273242630385488,
460
+ "eval_overall_recall": 0.7427920799722104,
461
+ "eval_runtime": 85.8812,
462
+ "eval_samples_per_second": 90.648,
463
+ "eval_steps_per_second": 11.341,
464
+ "step": 19465
465
+ }
466
+ ],
467
+ "logging_steps": 500,
468
+ "max_steps": 19465,
469
+ "num_input_tokens_seen": 0,
470
+ "num_train_epochs": 5,
471
+ "save_steps": 500,
472
+ "total_flos": 2.3463823353089932e+16,
473
+ "train_batch_size": 8,
474
+ "trial_name": null,
475
+ "trial_params": null
476
+ }