adrianeboyd commited on
Commit
07168d0
1 Parent(s): e3b3969

Update spaCy pipeline

Browse files
Files changed (14) hide show
  1. README.md +20 -20
  2. accuracy.json +200 -200
  3. en_core_web_sm-any-py3-none-any.whl +2 -2
  4. meta.json +204 -203
  5. ner/model +1 -1
  6. ner/moves +1 -1
  7. parser/model +1 -1
  8. parser/moves +1 -2
  9. senter/model +1 -1
  10. tagger/cfg +1 -0
  11. tagger/model +2 -2
  12. tok2vec/model +1 -1
  13. tokenizer +0 -0
  14. vocab/strings.json +2 -2
README.md CHANGED
@@ -14,41 +14,41 @@ model-index:
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
- value: 0.8508041869
18
  - name: NER Recall
19
  type: recall
20
- value: 0.8344851763
21
  - name: NER F Score
22
  type: f_score
23
- value: 0.8425656714
24
  - task:
25
  name: TAG
26
  type: token-classification
27
  metrics:
28
  - name: TAG (XPOS) Accuracy
29
  type: accuracy
30
- value: 0.9726545475
31
  - task:
32
  name: UNLABELED_DEPENDENCIES
33
  type: token-classification
34
  metrics:
35
  - name: Unlabeled Attachment Score (UAS)
36
  type: f_score
37
- value: 0.9180803841
38
  - task:
39
  name: LABELED_DEPENDENCIES
40
  type: token-classification
41
  metrics:
42
  - name: Labeled Attachment Score (LAS)
43
  type: f_score
44
- value: 0.8996666011
45
  - task:
46
  name: SENTS
47
  type: token-classification
48
  metrics:
49
  - name: Sentences F-Score
50
  type: f_score
51
- value: 0.9060200669
52
  ---
53
  ### Details: https://spacy.io/models/en#en_core_web_sm
54
 
@@ -57,8 +57,8 @@ English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter,
57
  | Feature | Description |
58
  | --- | --- |
59
  | **Name** | `en_core_web_sm` |
60
- | **Version** | `3.3.0` |
61
- | **spaCy** | `>=3.3.0.dev0,<3.4.0` |
62
  | **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
63
  | **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
64
  | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
@@ -70,11 +70,11 @@ English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter,
70
 
71
  <details>
72
 
73
- <summary>View label scheme (112 labels for 3 components)</summary>
74
 
75
  | Component | Labels |
76
  | --- | --- |
77
- | **`tagger`** | `$`, `''`, `,`, `-LRB-`, `-RRB-`, `.`, `:`, `ADD`, `AFX`, `CC`, `CD`, `DT`, `EX`, `FW`, `HYPH`, `IN`, `JJ`, `JJR`, `JJS`, `LS`, `MD`, `NFP`, `NN`, `NNP`, `NNPS`, `NNS`, `PDT`, `POS`, `PRP`, `PRP$`, `RB`, `RBR`, `RBS`, `RP`, `SYM`, `TO`, `UH`, `VB`, `VBD`, `VBG`, `VBN`, `VBP`, `VBZ`, `WDT`, `WP`, `WP$`, `WRB`, `XX`, ```` |
78
  | **`parser`** | `ROOT`, `acl`, `acomp`, `advcl`, `advmod`, `agent`, `amod`, `appos`, `attr`, `aux`, `auxpass`, `case`, `cc`, `ccomp`, `compound`, `conj`, `csubj`, `csubjpass`, `dative`, `dep`, `det`, `dobj`, `expl`, `intj`, `mark`, `meta`, `neg`, `nmod`, `npadvmod`, `nsubj`, `nsubjpass`, `nummod`, `oprd`, `parataxis`, `pcomp`, `pobj`, `poss`, `preconj`, `predet`, `prep`, `prt`, `punct`, `quantmod`, `relcl`, `xcomp` |
79
  | **`ner`** | `CARDINAL`, `DATE`, `EVENT`, `FAC`, `GPE`, `LANGUAGE`, `LAW`, `LOC`, `MONEY`, `NORP`, `ORDINAL`, `ORG`, `PERCENT`, `PERSON`, `PRODUCT`, `QUANTITY`, `TIME`, `WORK_OF_ART` |
80
 
@@ -88,12 +88,12 @@ English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter,
88
  | `TOKEN_P` | 99.57 |
89
  | `TOKEN_R` | 99.58 |
90
  | `TOKEN_F` | 99.57 |
91
- | `TAG_ACC` | 97.27 |
92
- | `SENTS_P` | 91.89 |
93
- | `SENTS_R` | 89.35 |
94
- | `SENTS_F` | 90.60 |
95
- | `DEP_UAS` | 91.81 |
96
- | `DEP_LAS` | 89.97 |
97
- | `ENTS_P` | 85.08 |
98
- | `ENTS_R` | 83.45 |
99
- | `ENTS_F` | 84.26 |
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
+ value: 0.8565043157
18
  - name: NER Recall
19
  type: recall
20
+ value: 0.8348858173
21
  - name: NER F Score
22
  type: f_score
23
+ value: 0.8455569081
24
  - task:
25
  name: TAG
26
  type: token-classification
27
  metrics:
28
  - name: TAG (XPOS) Accuracy
29
  type: accuracy
30
+ value: 0.9726250474
31
  - task:
32
  name: UNLABELED_DEPENDENCIES
33
  type: token-classification
34
  metrics:
35
  - name: Unlabeled Attachment Score (UAS)
36
  type: f_score
37
+ value: 0.9165718428
38
  - task:
39
  name: LABELED_DEPENDENCIES
40
  type: token-classification
41
  metrics:
42
  - name: Labeled Attachment Score (LAS)
43
  type: f_score
44
+ value: 0.8978441095
45
  - task:
46
  name: SENTS
47
  type: token-classification
48
  metrics:
49
  - name: Sentences F-Score
50
  type: f_score
51
+ value: 0.9038596962
52
  ---
53
  ### Details: https://spacy.io/models/en#en_core_web_sm
54
 
57
  | Feature | Description |
58
  | --- | --- |
59
  | **Name** | `en_core_web_sm` |
60
+ | **Version** | `3.4.0` |
61
+ | **spaCy** | `>=3.4.0,<3.5.0` |
62
  | **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
63
  | **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
64
  | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
70
 
71
  <details>
72
 
73
+ <summary>View label scheme (113 labels for 3 components)</summary>
74
 
75
  | Component | Labels |
76
  | --- | --- |
77
+ | **`tagger`** | `$`, `''`, `,`, `-LRB-`, `-RRB-`, `.`, `:`, `ADD`, `AFX`, `CC`, `CD`, `DT`, `EX`, `FW`, `HYPH`, `IN`, `JJ`, `JJR`, `JJS`, `LS`, `MD`, `NFP`, `NN`, `NNP`, `NNPS`, `NNS`, `PDT`, `POS`, `PRP`, `PRP$`, `RB`, `RBR`, `RBS`, `RP`, `SYM`, `TO`, `UH`, `VB`, `VBD`, `VBG`, `VBN`, `VBP`, `VBZ`, `WDT`, `WP`, `WP$`, `WRB`, `XX`, `_SP`, ```` |
78
  | **`parser`** | `ROOT`, `acl`, `acomp`, `advcl`, `advmod`, `agent`, `amod`, `appos`, `attr`, `aux`, `auxpass`, `case`, `cc`, `ccomp`, `compound`, `conj`, `csubj`, `csubjpass`, `dative`, `dep`, `det`, `dobj`, `expl`, `intj`, `mark`, `meta`, `neg`, `nmod`, `npadvmod`, `nsubj`, `nsubjpass`, `nummod`, `oprd`, `parataxis`, `pcomp`, `pobj`, `poss`, `preconj`, `predet`, `prep`, `prt`, `punct`, `quantmod`, `relcl`, `xcomp` |
79
  | **`ner`** | `CARDINAL`, `DATE`, `EVENT`, `FAC`, `GPE`, `LANGUAGE`, `LAW`, `LOC`, `MONEY`, `NORP`, `ORDINAL`, `ORG`, `PERCENT`, `PERSON`, `PRODUCT`, `QUANTITY`, `TIME`, `WORK_OF_ART` |
80
 
88
  | `TOKEN_P` | 99.57 |
89
  | `TOKEN_R` | 99.58 |
90
  | `TOKEN_F` | 99.57 |
91
+ | `TAG_ACC` | 97.26 |
92
+ | `SENTS_P` | 91.92 |
93
+ | `SENTS_R` | 88.90 |
94
+ | `SENTS_F` | 90.39 |
95
+ | `DEP_UAS` | 91.66 |
96
+ | `DEP_LAS` | 89.78 |
97
+ | `ENTS_P` | 85.65 |
98
+ | `ENTS_R` | 83.49 |
99
+ | `ENTS_F` | 84.56 |
accuracy.json CHANGED
@@ -3,328 +3,328 @@
3
  "token_p": 0.9956819193,
4
  "token_r": 0.9957659295,
5
  "token_f": 0.9957239226,
6
- "tag_acc": 0.9726545475,
7
- "sents_p": 0.9188657486,
8
- "sents_r": 0.8935285969,
9
- "sents_f": 0.9060200669,
10
- "dep_uas": 0.9180803841,
11
- "dep_las": 0.8996666011,
12
  "dep_las_per_type": {
13
  "prep": {
14
- "p": 0.8545638793,
15
- "r": 0.8639347372,
16
- "f": 0.8592237589
17
  },
18
  "det": {
19
- "p": 0.9767252604,
20
- "r": 0.9787164642,
21
- "f": 0.9777198485
22
  },
23
  "pobj": {
24
- "p": 0.9624306803,
25
- "r": 0.9677596701,
26
- "f": 0.9650878189
27
  },
28
  "nsubj": {
29
- "p": 0.9570163789,
30
- "r": 0.9471631982,
31
- "f": 0.9520642959
32
  },
33
  "aux": {
34
- "p": 0.9800017776,
35
- "r": 0.9815721535,
36
- "f": 0.980786337
37
  },
38
  "advmod": {
39
- "p": 0.8575166709,
40
- "r": 0.8547030119,
41
- "f": 0.8561075296
42
  },
43
  "relcl": {
44
- "p": 0.7630824373,
45
- "r": 0.7724963716,
46
- "f": 0.7677605481
47
  },
48
  "root": {
49
- "p": 0.9153887909,
50
- "r": 0.8899663566,
51
- "f": 0.9024985785
52
  },
53
  "xcomp": {
54
- "p": 0.8755679832,
55
- "r": 0.8991385499,
56
- "f": 0.8871967416
57
  },
58
  "amod": {
59
- "p": 0.9161676647,
60
- "r": 0.9119533528,
61
- "f": 0.9140556512
62
  },
63
  "compound": {
64
- "p": 0.9153947513,
65
- "r": 0.9285475607,
66
- "f": 0.9219242466
67
  },
68
  "poss": {
69
- "p": 0.9729241877,
70
- "r": 0.9764492754,
71
- "f": 0.9746835443
72
  },
73
  "ccomp": {
74
- "p": 0.7720643231,
75
- "r": 0.8409368635,
76
- "f": 0.8050302203
77
  },
78
  "attr": {
79
- "p": 0.9093147312,
80
- "r": 0.9318755257,
81
- "f": 0.9204569055
82
  },
83
  "case": {
84
- "p": 0.977799704,
85
- "r": 0.991991992,
86
- "f": 0.9848447205
87
  },
88
  "mark": {
89
- "p": 0.9035921817,
90
- "r": 0.9064652888,
91
- "f": 0.905026455
92
  },
93
  "intj": {
94
- "p": 0.6711250983,
95
- "r": 0.6249084249,
96
- "f": 0.6471927162
97
  },
98
  "advcl": {
99
- "p": 0.6756965944,
100
- "r": 0.6595316041,
101
- "f": 0.6675162482
102
  },
103
  "cc": {
104
- "p": 0.8375451264,
105
- "r": 0.8324363114,
106
- "f": 0.8349829044
107
  },
108
  "neg": {
109
- "p": 0.9480778832,
110
- "r": 0.9528349222,
111
- "f": 0.9504504505
112
  },
113
  "conj": {
114
- "p": 0.7654719087,
115
- "r": 0.77693857,
116
- "f": 0.7711626164
117
  },
118
  "nsubjpass": {
119
- "p": 0.9168804515,
120
- "r": 0.9164102564,
121
- "f": 0.9166452937
122
  },
123
  "auxpass": {
124
- "p": 0.9459821429,
125
- "r": 0.9653758542,
126
- "f": 0.9555806088
127
  },
128
  "dobj": {
129
- "p": 0.9223308565,
130
- "r": 0.9396764682,
131
- "f": 0.9309228705
132
  },
133
  "nummod": {
134
- "p": 0.9368956743,
135
- "r": 0.9297979798,
136
- "f": 0.9333333333
137
  },
138
  "npadvmod": {
139
- "p": 0.7781178271,
140
- "r": 0.7225577265,
141
- "f": 0.7493092651
142
  },
143
  "prt": {
144
- "p": 0.816091954,
145
- "r": 0.8906810036,
146
- "f": 0.851756641
147
  },
148
  "pcomp": {
149
- "p": 0.8699300699,
150
- "r": 0.8711484594,
151
- "f": 0.8705388383
152
  },
153
  "expl": {
154
- "p": 0.983014862,
155
  "r": 0.9914346895,
156
- "f": 0.987206823
157
  },
158
  "acl": {
159
- "p": 0.7332949309,
160
- "r": 0.6944899073,
161
- "f": 0.7133650883
162
  },
163
  "agent": {
164
- "p": 0.8885135135,
165
- "r": 0.9426523297,
166
- "f": 0.9147826087
167
  },
168
  "dative": {
169
- "p": 0.7847769029,
170
- "r": 0.6857798165,
171
- "f": 0.7319461444
172
  },
173
  "acomp": {
174
- "p": 0.9046746104,
175
- "r": 0.8952380952,
176
- "f": 0.8999316161
177
  },
178
  "dep": {
179
- "p": 0.4151624549,
180
- "r": 0.1866883117,
181
- "f": 0.2575587906
182
  },
183
  "csubj": {
184
- "p": 0.6476683938,
185
- "r": 0.7396449704,
186
- "f": 0.6906077348
187
  },
188
  "quantmod": {
189
- "p": 0.8682310469,
190
- "r": 0.7814784728,
191
- "f": 0.8225737495
192
  },
193
  "nmod": {
194
- "p": 0.741078208,
195
- "r": 0.5947592931,
196
- "f": 0.6599053414
197
  },
198
  "appos": {
199
- "p": 0.7215189873,
200
- "r": 0.6676789588,
201
- "f": 0.6935556557
202
  },
203
  "predet": {
204
- "p": 0.8395061728,
205
- "r": 0.8755364807,
206
- "f": 0.8571428571
207
  },
208
  "preconj": {
209
- "p": 0.5544554455,
210
- "r": 0.6511627907,
211
- "f": 0.5989304813
212
  },
213
  "oprd": {
214
- "p": 0.8205980066,
215
- "r": 0.7373134328,
216
- "f": 0.7767295597
217
  },
218
  "parataxis": {
219
- "p": 0.6121883657,
220
- "r": 0.4793926247,
221
- "f": 0.5377128954
222
  },
223
  "meta": {
224
- "p": 0.7407407407,
225
- "r": 0.3846153846,
226
- "f": 0.5063291139
227
  },
228
  "csubjpass": {
229
- "p": 0.7142857143,
230
- "r": 0.8333333333,
231
- "f": 0.7692307692
232
  }
233
  },
234
- "ents_p": 0.8508041869,
235
- "ents_r": 0.8344851763,
236
- "ents_f": 0.8425656714,
237
  "ents_per_type": {
238
  "DATE": {
239
- "p": 0.8732394366,
240
- "r": 0.866031746,
241
- "f": 0.8696206567
242
  },
243
  "GPE": {
244
- "p": 0.9154443486,
245
- "r": 0.8878661088,
246
- "f": 0.90144435
247
  },
248
  "ORDINAL": {
249
- "p": 0.7927927928,
250
- "r": 0.8198757764,
251
- "f": 0.8061068702
252
- },
253
- "FAC": {
254
- "p": 0.4049586777,
255
- "r": 0.3769230769,
256
- "f": 0.390438247
257
  },
258
  "ORG": {
259
- "p": 0.8038601982,
260
- "r": 0.8170731707,
261
- "f": 0.810412832
262
  },
263
- "CARDINAL": {
264
- "p": 0.8222477064,
265
- "r": 0.8525564804,
266
- "f": 0.8371278459
267
  },
268
- "LOC": {
269
- "p": 0.714801444,
270
- "r": 0.6305732484,
271
- "f": 0.6700507614
272
  },
273
  "PERSON": {
274
- "p": 0.8572793883,
275
- "r": 0.8782637076,
276
- "f": 0.8676446881
277
  },
278
  "NORP": {
279
- "p": 0.918652424,
280
- "r": 0.8944,
281
- "f": 0.9063640049
282
  },
283
  "TIME": {
284
- "p": 0.7436708861,
285
- "r": 0.6871345029,
286
- "f": 0.7142857143
 
 
 
 
 
287
  },
288
  "QUANTITY": {
289
- "p": 0.8308823529,
290
- "r": 0.6208791209,
291
- "f": 0.7106918239
292
  },
293
  "EVENT": {
294
- "p": 0.5533980583,
295
- "r": 0.3275862069,
296
- "f": 0.4115523466
297
  },
298
  "WORK_OF_ART": {
299
- "p": 0.4926470588,
300
- "r": 0.3453608247,
301
- "f": 0.4060606061
302
  },
303
  "LAW": {
304
- "p": 0.58,
305
  "r": 0.453125,
306
- "f": 0.5087719298
307
  },
308
  "MONEY": {
309
- "p": 0.9198564593,
310
- "r": 0.9079102715,
311
- "f": 0.9138443256
312
  },
313
  "PERCENT": {
314
- "p": 0.9153354633,
315
- "r": 0.8774885145,
316
- "f": 0.8960125098
317
- },
318
- "LANGUAGE": {
319
- "p": 0.7857142857,
320
- "r": 0.6875,
321
- "f": 0.7333333333
322
  },
323
  "PRODUCT": {
324
- "p": 0.5795454545,
325
- "r": 0.2417061611,
326
- "f": 0.3411371237
 
 
 
 
 
327
  }
328
  },
329
- "speed": 9738.3022066337
330
  }
3
  "token_p": 0.9956819193,
4
  "token_r": 0.9957659295,
5
  "token_f": 0.9957239226,
6
+ "tag_acc": 0.9726250474,
7
+ "sents_p": 0.9191788296,
8
+ "sents_r": 0.8890428129,
9
+ "sents_f": 0.9038596962,
10
+ "dep_uas": 0.9165718428,
11
+ "dep_las": 0.8978441095,
12
  "dep_las_per_type": {
13
  "prep": {
14
+ "p": 0.8546376267,
15
+ "r": 0.8635553026,
16
+ "f": 0.8590733226
17
  },
18
  "det": {
19
+ "p": 0.9768701389,
20
+ "r": 0.9781048683,
21
+ "f": 0.9774871137
22
  },
23
  "pobj": {
24
+ "p": 0.9605592002,
25
+ "r": 0.9659532692,
26
+ "f": 0.9632486833
27
  },
28
  "nsubj": {
29
+ "p": 0.9545715675,
30
+ "r": 0.9463745893,
31
+ "f": 0.9504554055
32
  },
33
  "aux": {
34
+ "p": 0.9798401421,
35
+ "r": 0.9821953174,
36
+ "f": 0.9810163162
37
  },
38
  "advmod": {
39
+ "p": 0.8545653823,
40
+ "r": 0.8527679623,
41
+ "f": 0.8536657262
42
  },
43
  "relcl": {
44
+ "p": 0.7656695157,
45
+ "r": 0.7801161103,
46
+ "f": 0.7728253055
47
  },
48
  "root": {
49
+ "p": 0.91776518,
50
+ "r": 0.8864041164,
51
+ "f": 0.9018120805
52
  },
53
  "xcomp": {
54
+ "p": 0.882290562,
55
+ "r": 0.895908112,
56
+ "f": 0.889047195
57
  },
58
  "amod": {
59
+ "p": 0.9154102213,
60
+ "r": 0.908649174,
61
+ "f": 0.9120171674
62
  },
63
  "compound": {
64
+ "p": 0.9121588361,
65
+ "r": 0.9287703275,
66
+ "f": 0.9203896355
67
  },
68
  "poss": {
69
+ "p": 0.9735258724,
70
+ "r": 0.9770531401,
71
+ "f": 0.9752863171
72
  },
73
  "ccomp": {
74
+ "p": 0.7657557167,
75
+ "r": 0.8389002037,
76
+ "f": 0.8006609
77
  },
78
  "attr": {
79
+ "p": 0.9032126881,
80
+ "r": 0.9339781329,
81
+ "f": 0.9183378127
82
  },
83
  "case": {
84
+ "p": 0.9773063641,
85
+ "r": 0.9914914915,
86
+ "f": 0.9843478261
87
  },
88
  "mark": {
89
+ "p": 0.9002893975,
90
+ "r": 0.9067302597,
91
+ "f": 0.9034983498
92
  },
93
  "intj": {
94
+ "p": 0.6514555468,
95
+ "r": 0.6065934066,
96
+ "f": 0.6282245827
97
  },
98
  "advcl": {
99
+ "p": 0.6653050804,
100
+ "r": 0.6562578696,
101
+ "f": 0.6607505071
102
  },
103
  "cc": {
104
+ "p": 0.8285611165,
105
+ "r": 0.8237052984,
106
+ "f": 0.8261260721
107
  },
108
  "neg": {
109
+ "p": 0.9452191235,
110
+ "r": 0.9523331661,
111
+ "f": 0.9487628093
112
  },
113
  "conj": {
114
+ "p": 0.7577601192,
115
+ "r": 0.7682527694,
116
+ "f": 0.7629703713
117
  },
118
  "nsubjpass": {
119
+ "p": 0.9216589862,
120
+ "r": 0.9230769231,
121
+ "f": 0.9223674097
122
  },
123
  "auxpass": {
124
+ "p": 0.946875,
125
+ "r": 0.9662870159,
126
+ "f": 0.9564825254
127
  },
128
  "dobj": {
129
+ "p": 0.919205298,
130
+ "r": 0.940154594,
131
+ "f": 0.9295619288
132
  },
133
  "nummod": {
134
+ "p": 0.9373886485,
135
+ "r": 0.9300505051,
136
+ "f": 0.9337051591
137
  },
138
  "npadvmod": {
139
+ "p": 0.7748549323,
140
+ "r": 0.7115452931,
141
+ "f": 0.7418518519
142
  },
143
  "prt": {
144
+ "p": 0.8097199341,
145
+ "r": 0.8808243728,
146
+ "f": 0.843776824
147
  },
148
  "pcomp": {
149
+ "p": 0.8756183746,
150
+ "r": 0.8676470588,
151
+ "f": 0.8716144917
152
  },
153
  "expl": {
154
+ "p": 0.9809322034,
155
  "r": 0.9914346895,
156
+ "f": 0.9861554846
157
  },
158
  "acl": {
159
+ "p": 0.7327887981,
160
+ "r": 0.6852154937,
161
+ "f": 0.7082041162
162
  },
163
  "agent": {
164
+ "p": 0.8959044369,
165
+ "r": 0.9408602151,
166
+ "f": 0.9178321678
167
  },
168
  "dative": {
169
+ "p": 0.7846153846,
170
+ "r": 0.7018348624,
171
+ "f": 0.7409200969
172
  },
173
  "acomp": {
174
+ "p": 0.9127423823,
175
+ "r": 0.8965986395,
176
+ "f": 0.90459849
177
  },
178
  "dep": {
179
+ "p": 0.3786764706,
180
+ "r": 0.1672077922,
181
+ "f": 0.231981982
182
  },
183
  "csubj": {
184
+ "p": 0.7393939394,
185
+ "r": 0.7218934911,
186
+ "f": 0.7305389222
187
  },
188
  "quantmod": {
189
+ "p": 0.8694493783,
190
+ "r": 0.7952883834,
191
+ "f": 0.8307170132
192
  },
193
  "nmod": {
194
+ "p": 0.7147169811,
195
+ "r": 0.577087142,
196
+ "f": 0.6385704653
197
  },
198
  "appos": {
199
+ "p": 0.7100509495,
200
+ "r": 0.6650759219,
201
+ "f": 0.686827957
202
  },
203
  "predet": {
204
+ "p": 0.8636363636,
205
+ "r": 0.8969957082,
206
+ "f": 0.88
207
  },
208
  "preconj": {
209
+ "p": 0.5769230769,
210
+ "r": 0.6976744186,
211
+ "f": 0.6315789474
212
  },
213
  "oprd": {
214
+ "p": 0.8160535117,
215
+ "r": 0.728358209,
216
+ "f": 0.7697160883
217
  },
218
  "parataxis": {
219
+ "p": 0.5835694051,
220
+ "r": 0.4468546638,
221
+ "f": 0.5061425061
222
  },
223
  "meta": {
224
+ "p": 0.8666666667,
225
+ "r": 0.5,
226
+ "f": 0.6341463415
227
  },
228
  "csubjpass": {
229
+ "p": 0.5,
230
+ "r": 0.6666666667,
231
+ "f": 0.5714285714
232
  }
233
  },
234
+ "ents_p": 0.8565043157,
235
+ "ents_r": 0.8348858173,
236
+ "ents_f": 0.8455569081,
237
  "ents_per_type": {
238
  "DATE": {
239
+ "p": 0.8804768041,
240
+ "r": 0.8676190476,
241
+ "f": 0.8740006396
242
  },
243
  "GPE": {
244
+ "p": 0.9239884393,
245
+ "r": 0.8917712692,
246
+ "f": 0.9075940383
247
  },
248
  "ORDINAL": {
249
+ "p": 0.7910447761,
250
+ "r": 0.8229813665,
251
+ "f": 0.8066971081
 
 
 
 
 
252
  },
253
  "ORG": {
254
+ "p": 0.8107606679,
255
+ "r": 0.8109756098,
256
+ "f": 0.8108681246
257
  },
258
+ "FAC": {
259
+ "p": 0.3902439024,
260
+ "r": 0.3692307692,
261
+ "f": 0.3794466403
262
  },
263
+ "CARDINAL": {
264
+ "p": 0.8266978923,
265
+ "r": 0.8394768133,
266
+ "f": 0.8330383481
267
  },
268
  "PERSON": {
269
+ "p": 0.8648820905,
270
+ "r": 0.885770235,
271
+ "f": 0.8752015479
272
  },
273
  "NORP": {
274
+ "p": 0.9130787977,
275
+ "r": 0.8992,
276
+ "f": 0.9060862555
277
  },
278
  "TIME": {
279
+ "p": 0.7492163009,
280
+ "r": 0.6988304094,
281
+ "f": 0.7231467474
282
+ },
283
+ "LOC": {
284
+ "p": 0.7158273381,
285
+ "r": 0.6337579618,
286
+ "f": 0.6722972973
287
  },
288
  "QUANTITY": {
289
+ "p": 0.7971014493,
290
+ "r": 0.6043956044,
291
+ "f": 0.6875
292
  },
293
  "EVENT": {
294
+ "p": 0.6373626374,
295
+ "r": 0.3333333333,
296
+ "f": 0.4377358491
297
  },
298
  "WORK_OF_ART": {
299
+ "p": 0.5230769231,
300
+ "r": 0.3505154639,
301
+ "f": 0.4197530864
302
  },
303
  "LAW": {
304
+ "p": 0.6304347826,
305
  "r": 0.453125,
306
+ "f": 0.5272727273
307
  },
308
  "MONEY": {
309
+ "p": 0.9179548157,
310
+ "r": 0.9114521842,
311
+ "f": 0.9146919431
312
  },
313
  "PERCENT": {
314
+ "p": 0.9171974522,
315
+ "r": 0.8820826953,
316
+ "f": 0.8992974239
 
 
 
 
 
317
  },
318
  "PRODUCT": {
319
+ "p": 0.5,
320
+ "r": 0.2274881517,
321
+ "f": 0.3127035831
322
+ },
323
+ "LANGUAGE": {
324
+ "p": 0.8,
325
+ "r": 0.625,
326
+ "f": 0.701754386
327
  }
328
  },
329
+ "speed": 9012.0225085527
330
  }
en_core_web_sm-any-py3-none-any.whl CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:84d7d8059bfbf53c09b39139782f76cd6ac7064851e7799dcc685c06ebf5fd4f
3
- size 12799907
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:35365d49d389ecb19ba33119dca9ab9ff80b36fbeb063fa5d44008197fead8fa
3
+ size 12803016
meta.json CHANGED
@@ -1,14 +1,14 @@
1
  {
2
  "lang":"en",
3
  "name":"core_web_sm",
4
- "version":"3.3.0",
5
  "description":"English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"MIT",
10
- "spacy_version":">=3.3.0.dev0,<3.4.0",
11
- "spacy_git_version":"849bef2de",
12
  "vectors":{
13
  "width":0,
14
  "vectors":0,
@@ -68,6 +68,7 @@
68
  "WP$",
69
  "WRB",
70
  "XX",
 
71
  "``"
72
  ],
73
  "parser":[
@@ -169,330 +170,330 @@
169
  "token_p":0.9956819193,
170
  "token_r":0.9957659295,
171
  "token_f":0.9957239226,
172
- "tag_acc":0.9726545475,
173
- "sents_p":0.9188657486,
174
- "sents_r":0.8935285969,
175
- "sents_f":0.9060200669,
176
- "dep_uas":0.9180803841,
177
- "dep_las":0.8996666011,
178
  "dep_las_per_type":{
179
  "prep":{
180
- "p":0.8545638793,
181
- "r":0.8639347372,
182
- "f":0.8592237589
183
  },
184
  "det":{
185
- "p":0.9767252604,
186
- "r":0.9787164642,
187
- "f":0.9777198485
188
  },
189
  "pobj":{
190
- "p":0.9624306803,
191
- "r":0.9677596701,
192
- "f":0.9650878189
193
  },
194
  "nsubj":{
195
- "p":0.9570163789,
196
- "r":0.9471631982,
197
- "f":0.9520642959
198
  },
199
  "aux":{
200
- "p":0.9800017776,
201
- "r":0.9815721535,
202
- "f":0.980786337
203
  },
204
  "advmod":{
205
- "p":0.8575166709,
206
- "r":0.8547030119,
207
- "f":0.8561075296
208
  },
209
  "relcl":{
210
- "p":0.7630824373,
211
- "r":0.7724963716,
212
- "f":0.7677605481
213
  },
214
  "root":{
215
- "p":0.9153887909,
216
- "r":0.8899663566,
217
- "f":0.9024985785
218
  },
219
  "xcomp":{
220
- "p":0.8755679832,
221
- "r":0.8991385499,
222
- "f":0.8871967416
223
  },
224
  "amod":{
225
- "p":0.9161676647,
226
- "r":0.9119533528,
227
- "f":0.9140556512
228
  },
229
  "compound":{
230
- "p":0.9153947513,
231
- "r":0.9285475607,
232
- "f":0.9219242466
233
  },
234
  "poss":{
235
- "p":0.9729241877,
236
- "r":0.9764492754,
237
- "f":0.9746835443
238
  },
239
  "ccomp":{
240
- "p":0.7720643231,
241
- "r":0.8409368635,
242
- "f":0.8050302203
243
  },
244
  "attr":{
245
- "p":0.9093147312,
246
- "r":0.9318755257,
247
- "f":0.9204569055
248
  },
249
  "case":{
250
- "p":0.977799704,
251
- "r":0.991991992,
252
- "f":0.9848447205
253
  },
254
  "mark":{
255
- "p":0.9035921817,
256
- "r":0.9064652888,
257
- "f":0.905026455
258
  },
259
  "intj":{
260
- "p":0.6711250983,
261
- "r":0.6249084249,
262
- "f":0.6471927162
263
  },
264
  "advcl":{
265
- "p":0.6756965944,
266
- "r":0.6595316041,
267
- "f":0.6675162482
268
  },
269
  "cc":{
270
- "p":0.8375451264,
271
- "r":0.8324363114,
272
- "f":0.8349829044
273
  },
274
  "neg":{
275
- "p":0.9480778832,
276
- "r":0.9528349222,
277
- "f":0.9504504505
278
  },
279
  "conj":{
280
- "p":0.7654719087,
281
- "r":0.77693857,
282
- "f":0.7711626164
283
  },
284
  "nsubjpass":{
285
- "p":0.9168804515,
286
- "r":0.9164102564,
287
- "f":0.9166452937
288
  },
289
  "auxpass":{
290
- "p":0.9459821429,
291
- "r":0.9653758542,
292
- "f":0.9555806088
293
  },
294
  "dobj":{
295
- "p":0.9223308565,
296
- "r":0.9396764682,
297
- "f":0.9309228705
298
  },
299
  "nummod":{
300
- "p":0.9368956743,
301
- "r":0.9297979798,
302
- "f":0.9333333333
303
  },
304
  "npadvmod":{
305
- "p":0.7781178271,
306
- "r":0.7225577265,
307
- "f":0.7493092651
308
  },
309
  "prt":{
310
- "p":0.816091954,
311
- "r":0.8906810036,
312
- "f":0.851756641
313
  },
314
  "pcomp":{
315
- "p":0.8699300699,
316
- "r":0.8711484594,
317
- "f":0.8705388383
318
  },
319
  "expl":{
320
- "p":0.983014862,
321
  "r":0.9914346895,
322
- "f":0.987206823
323
  },
324
  "acl":{
325
- "p":0.7332949309,
326
- "r":0.6944899073,
327
- "f":0.7133650883
328
  },
329
  "agent":{
330
- "p":0.8885135135,
331
- "r":0.9426523297,
332
- "f":0.9147826087
333
  },
334
  "dative":{
335
- "p":0.7847769029,
336
- "r":0.6857798165,
337
- "f":0.7319461444
338
  },
339
  "acomp":{
340
- "p":0.9046746104,
341
- "r":0.8952380952,
342
- "f":0.8999316161
343
  },
344
  "dep":{
345
- "p":0.4151624549,
346
- "r":0.1866883117,
347
- "f":0.2575587906
348
  },
349
  "csubj":{
350
- "p":0.6476683938,
351
- "r":0.7396449704,
352
- "f":0.6906077348
353
  },
354
  "quantmod":{
355
- "p":0.8682310469,
356
- "r":0.7814784728,
357
- "f":0.8225737495
358
  },
359
  "nmod":{
360
- "p":0.741078208,
361
- "r":0.5947592931,
362
- "f":0.6599053414
363
  },
364
  "appos":{
365
- "p":0.7215189873,
366
- "r":0.6676789588,
367
- "f":0.6935556557
368
  },
369
  "predet":{
370
- "p":0.8395061728,
371
- "r":0.8755364807,
372
- "f":0.8571428571
373
  },
374
  "preconj":{
375
- "p":0.5544554455,
376
- "r":0.6511627907,
377
- "f":0.5989304813
378
  },
379
  "oprd":{
380
- "p":0.8205980066,
381
- "r":0.7373134328,
382
- "f":0.7767295597
383
  },
384
  "parataxis":{
385
- "p":0.6121883657,
386
- "r":0.4793926247,
387
- "f":0.5377128954
388
  },
389
  "meta":{
390
- "p":0.7407407407,
391
- "r":0.3846153846,
392
- "f":0.5063291139
393
  },
394
  "csubjpass":{
395
- "p":0.7142857143,
396
- "r":0.8333333333,
397
- "f":0.7692307692
398
  }
399
  },
400
- "ents_p":0.8508041869,
401
- "ents_r":0.8344851763,
402
- "ents_f":0.8425656714,
403
  "ents_per_type":{
404
  "DATE":{
405
- "p":0.8732394366,
406
- "r":0.866031746,
407
- "f":0.8696206567
408
  },
409
  "GPE":{
410
- "p":0.9154443486,
411
- "r":0.8878661088,
412
- "f":0.90144435
413
  },
414
  "ORDINAL":{
415
- "p":0.7927927928,
416
- "r":0.8198757764,
417
- "f":0.8061068702
418
- },
419
- "FAC":{
420
- "p":0.4049586777,
421
- "r":0.3769230769,
422
- "f":0.390438247
423
  },
424
  "ORG":{
425
- "p":0.8038601982,
426
- "r":0.8170731707,
427
- "f":0.810412832
428
  },
429
- "CARDINAL":{
430
- "p":0.8222477064,
431
- "r":0.8525564804,
432
- "f":0.8371278459
433
  },
434
- "LOC":{
435
- "p":0.714801444,
436
- "r":0.6305732484,
437
- "f":0.6700507614
438
  },
439
  "PERSON":{
440
- "p":0.8572793883,
441
- "r":0.8782637076,
442
- "f":0.8676446881
443
  },
444
  "NORP":{
445
- "p":0.918652424,
446
- "r":0.8944,
447
- "f":0.9063640049
448
  },
449
  "TIME":{
450
- "p":0.7436708861,
451
- "r":0.6871345029,
452
- "f":0.7142857143
 
 
 
 
 
453
  },
454
  "QUANTITY":{
455
- "p":0.8308823529,
456
- "r":0.6208791209,
457
- "f":0.7106918239
458
  },
459
  "EVENT":{
460
- "p":0.5533980583,
461
- "r":0.3275862069,
462
- "f":0.4115523466
463
  },
464
  "WORK_OF_ART":{
465
- "p":0.4926470588,
466
- "r":0.3453608247,
467
- "f":0.4060606061
468
  },
469
  "LAW":{
470
- "p":0.58,
471
  "r":0.453125,
472
- "f":0.5087719298
473
  },
474
  "MONEY":{
475
- "p":0.9198564593,
476
- "r":0.9079102715,
477
- "f":0.9138443256
478
  },
479
  "PERCENT":{
480
- "p":0.9153354633,
481
- "r":0.8774885145,
482
- "f":0.8960125098
483
- },
484
- "LANGUAGE":{
485
- "p":0.7857142857,
486
- "r":0.6875,
487
- "f":0.7333333333
488
  },
489
  "PRODUCT":{
490
- "p":0.5795454545,
491
- "r":0.2417061611,
492
- "f":0.3411371237
 
 
 
 
 
493
  }
494
  },
495
- "speed":9738.3022066337
496
  },
497
  "sources":[
498
  {
1
  {
2
  "lang":"en",
3
  "name":"core_web_sm",
4
+ "version":"3.4.0",
5
  "description":"English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"MIT",
10
+ "spacy_version":">=3.4.0,<3.5.0",
11
+ "spacy_git_version":"dd038b536",
12
  "vectors":{
13
  "width":0,
14
  "vectors":0,
68
  "WP$",
69
  "WRB",
70
  "XX",
71
+ "_SP",
72
  "``"
73
  ],
74
  "parser":[
170
  "token_p":0.9956819193,
171
  "token_r":0.9957659295,
172
  "token_f":0.9957239226,
173
+ "tag_acc":0.9726250474,
174
+ "sents_p":0.9191788296,
175
+ "sents_r":0.8890428129,
176
+ "sents_f":0.9038596962,
177
+ "dep_uas":0.9165718428,
178
+ "dep_las":0.8978441095,
179
  "dep_las_per_type":{
180
  "prep":{
181
+ "p":0.8546376267,
182
+ "r":0.8635553026,
183
+ "f":0.8590733226
184
  },
185
  "det":{
186
+ "p":0.9768701389,
187
+ "r":0.9781048683,
188
+ "f":0.9774871137
189
  },
190
  "pobj":{
191
+ "p":0.9605592002,
192
+ "r":0.9659532692,
193
+ "f":0.9632486833
194
  },
195
  "nsubj":{
196
+ "p":0.9545715675,
197
+ "r":0.9463745893,
198
+ "f":0.9504554055
199
  },
200
  "aux":{
201
+ "p":0.9798401421,
202
+ "r":0.9821953174,
203
+ "f":0.9810163162
204
  },
205
  "advmod":{
206
+ "p":0.8545653823,
207
+ "r":0.8527679623,
208
+ "f":0.8536657262
209
  },
210
  "relcl":{
211
+ "p":0.7656695157,
212
+ "r":0.7801161103,
213
+ "f":0.7728253055
214
  },
215
  "root":{
216
+ "p":0.91776518,
217
+ "r":0.8864041164,
218
+ "f":0.9018120805
219
  },
220
  "xcomp":{
221
+ "p":0.882290562,
222
+ "r":0.895908112,
223
+ "f":0.889047195
224
  },
225
  "amod":{
226
+ "p":0.9154102213,
227
+ "r":0.908649174,
228
+ "f":0.9120171674
229
  },
230
  "compound":{
231
+ "p":0.9121588361,
232
+ "r":0.9287703275,
233
+ "f":0.9203896355
234
  },
235
  "poss":{
236
+ "p":0.9735258724,
237
+ "r":0.9770531401,
238
+ "f":0.9752863171
239
  },
240
  "ccomp":{
241
+ "p":0.7657557167,
242
+ "r":0.8389002037,
243
+ "f":0.8006609
244
  },
245
  "attr":{
246
+ "p":0.9032126881,
247
+ "r":0.9339781329,
248
+ "f":0.9183378127
249
  },
250
  "case":{
251
+ "p":0.9773063641,
252
+ "r":0.9914914915,
253
+ "f":0.9843478261
254
  },
255
  "mark":{
256
+ "p":0.9002893975,
257
+ "r":0.9067302597,
258
+ "f":0.9034983498
259
  },
260
  "intj":{
261
+ "p":0.6514555468,
262
+ "r":0.6065934066,
263
+ "f":0.6282245827
264
  },
265
  "advcl":{
266
+ "p":0.6653050804,
267
+ "r":0.6562578696,
268
+ "f":0.6607505071
269
  },
270
  "cc":{
271
+ "p":0.8285611165,
272
+ "r":0.8237052984,
273
+ "f":0.8261260721
274
  },
275
  "neg":{
276
+ "p":0.9452191235,
277
+ "r":0.9523331661,
278
+ "f":0.9487628093
279
  },
280
  "conj":{
281
+ "p":0.7577601192,
282
+ "r":0.7682527694,
283
+ "f":0.7629703713
284
  },
285
  "nsubjpass":{
286
+ "p":0.9216589862,
287
+ "r":0.9230769231,
288
+ "f":0.9223674097
289
  },
290
  "auxpass":{
291
+ "p":0.946875,
292
+ "r":0.9662870159,
293
+ "f":0.9564825254
294
  },
295
  "dobj":{
296
+ "p":0.919205298,
297
+ "r":0.940154594,
298
+ "f":0.9295619288
299
  },
300
  "nummod":{
301
+ "p":0.9373886485,
302
+ "r":0.9300505051,
303
+ "f":0.9337051591
304
  },
305
  "npadvmod":{
306
+ "p":0.7748549323,
307
+ "r":0.7115452931,
308
+ "f":0.7418518519
309
  },
310
  "prt":{
311
+ "p":0.8097199341,
312
+ "r":0.8808243728,
313
+ "f":0.843776824
314
  },
315
  "pcomp":{
316
+ "p":0.8756183746,
317
+ "r":0.8676470588,
318
+ "f":0.8716144917
319
  },
320
  "expl":{
321
+ "p":0.9809322034,
322
  "r":0.9914346895,
323
+ "f":0.9861554846
324
  },
325
  "acl":{
326
+ "p":0.7327887981,
327
+ "r":0.6852154937,
328
+ "f":0.7082041162
329
  },
330
  "agent":{
331
+ "p":0.8959044369,
332
+ "r":0.9408602151,
333
+ "f":0.9178321678
334
  },
335
  "dative":{
336
+ "p":0.7846153846,
337
+ "r":0.7018348624,
338
+ "f":0.7409200969
339
  },
340
  "acomp":{
341
+ "p":0.9127423823,
342
+ "r":0.8965986395,
343
+ "f":0.90459849
344
  },
345
  "dep":{
346
+ "p":0.3786764706,
347
+ "r":0.1672077922,
348
+ "f":0.231981982
349
  },
350
  "csubj":{
351
+ "p":0.7393939394,
352
+ "r":0.7218934911,
353
+ "f":0.7305389222
354
  },
355
  "quantmod":{
356
+ "p":0.8694493783,
357
+ "r":0.7952883834,
358
+ "f":0.8307170132
359
  },
360
  "nmod":{
361
+ "p":0.7147169811,
362
+ "r":0.577087142,
363
+ "f":0.6385704653
364
  },
365
  "appos":{
366
+ "p":0.7100509495,
367
+ "r":0.6650759219,
368
+ "f":0.686827957
369
  },
370
  "predet":{
371
+ "p":0.8636363636,
372
+ "r":0.8969957082,
373
+ "f":0.88
374
  },
375
  "preconj":{
376
+ "p":0.5769230769,
377
+ "r":0.6976744186,
378
+ "f":0.6315789474
379
  },
380
  "oprd":{
381
+ "p":0.8160535117,
382
+ "r":0.728358209,
383
+ "f":0.7697160883
384
  },
385
  "parataxis":{
386
+ "p":0.5835694051,
387
+ "r":0.4468546638,
388
+ "f":0.5061425061
389
  },
390
  "meta":{
391
+ "p":0.8666666667,
392
+ "r":0.5,
393
+ "f":0.6341463415
394
  },
395
  "csubjpass":{
396
+ "p":0.5,
397
+ "r":0.6666666667,
398
+ "f":0.5714285714
399
  }
400
  },
401
+ "ents_p":0.8565043157,
402
+ "ents_r":0.8348858173,
403
+ "ents_f":0.8455569081,
404
  "ents_per_type":{
405
  "DATE":{
406
+ "p":0.8804768041,
407
+ "r":0.8676190476,
408
+ "f":0.8740006396
409
  },
410
  "GPE":{
411
+ "p":0.9239884393,
412
+ "r":0.8917712692,
413
+ "f":0.9075940383
414
  },
415
  "ORDINAL":{
416
+ "p":0.7910447761,
417
+ "r":0.8229813665,
418
+ "f":0.8066971081
 
 
 
 
 
419
  },
420
  "ORG":{
421
+ "p":0.8107606679,
422
+ "r":0.8109756098,
423
+ "f":0.8108681246
424
  },
425
+ "FAC":{
426
+ "p":0.3902439024,
427
+ "r":0.3692307692,
428
+ "f":0.3794466403
429
  },
430
+ "CARDINAL":{
431
+ "p":0.8266978923,
432
+ "r":0.8394768133,
433
+ "f":0.8330383481
434
  },
435
  "PERSON":{
436
+ "p":0.8648820905,
437
+ "r":0.885770235,
438
+ "f":0.8752015479
439
  },
440
  "NORP":{
441
+ "p":0.9130787977,
442
+ "r":0.8992,
443
+ "f":0.9060862555
444
  },
445
  "TIME":{
446
+ "p":0.7492163009,
447
+ "r":0.6988304094,
448
+ "f":0.7231467474
449
+ },
450
+ "LOC":{
451
+ "p":0.7158273381,
452
+ "r":0.6337579618,
453
+ "f":0.6722972973
454
  },
455
  "QUANTITY":{
456
+ "p":0.7971014493,
457
+ "r":0.6043956044,
458
+ "f":0.6875
459
  },
460
  "EVENT":{
461
+ "p":0.6373626374,
462
+ "r":0.3333333333,
463
+ "f":0.4377358491
464
  },
465
  "WORK_OF_ART":{
466
+ "p":0.5230769231,
467
+ "r":0.3505154639,
468
+ "f":0.4197530864
469
  },
470
  "LAW":{
471
+ "p":0.6304347826,
472
  "r":0.453125,
473
+ "f":0.5272727273
474
  },
475
  "MONEY":{
476
+ "p":0.9179548157,
477
+ "r":0.9114521842,
478
+ "f":0.9146919431
479
  },
480
  "PERCENT":{
481
+ "p":0.9171974522,
482
+ "r":0.8820826953,
483
+ "f":0.8992974239
 
 
 
 
 
484
  },
485
  "PRODUCT":{
486
+ "p":0.5,
487
+ "r":0.2274881517,
488
+ "f":0.3127035831
489
+ },
490
+ "LANGUAGE":{
491
+ "p":0.8,
492
+ "r":0.625,
493
+ "f":0.701754386
494
  }
495
  },
496
+ "speed":9012.0225085527
497
  },
498
  "sources":[
499
  {
ner/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b150d51c7d759de938bcd86aa40744117371278ddae98c9d909939c38af0000f
3
  size 6284763
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9501662e93036b834cac45ad7014f9376ee2709606d82d79aa29e4243cdaca42
3
  size 6284763
ner/moves CHANGED
@@ -1 +1 @@
1
- ��moves�{"0":{},"1":{"ORG":56356,"DATE":40381,"PERSON":36475,"GPE":26716,"MONEY":15121,"CARDINAL":14096,"NORP":9638,"PERCENT":9182,"WORK_OF_ART":4475,"LOC":4047,"TIME":3670,"QUANTITY":3114,"FAC":3042,"EVENT":3015,"ORDINAL":2142,"PRODUCT":1782,"LAW":1620,"LANGUAGE":355},"2":{"ORG":56356,"DATE":40381,"PERSON":36475,"GPE":26716,"MONEY":15121,"CARDINAL":14096,"NORP":9638,"PERCENT":9182,"WORK_OF_ART":4475,"LOC":4047,"TIME":3670,"QUANTITY":3114,"FAC":3042,"EVENT":3015,"ORDINAL":2142,"PRODUCT":1782,"LAW":1620,"LANGUAGE":355},"3":{"ORG":56356,"DATE":40381,"PERSON":36475,"GPE":26716,"MONEY":15121,"CARDINAL":14096,"NORP":9638,"PERCENT":9182,"WORK_OF_ART":4475,"LOC":4047,"TIME":3670,"QUANTITY":3114,"FAC":3042,"EVENT":3015,"ORDINAL":2142,"PRODUCT":1782,"LAW":1620,"LANGUAGE":355},"4":{"ORG":56356,"DATE":40381,"PERSON":36475,"GPE":26716,"MONEY":15121,"CARDINAL":14096,"NORP":9638,"PERCENT":9182,"WORK_OF_ART":4475,"LOC":4047,"TIME":3670,"QUANTITY":3114,"FAC":3042,"EVENT":3015,"ORDINAL":2142,"PRODUCT":1782,"LAW":1620,"LANGUAGE":355,"":1},"5":{"":1}}�cfg��neg_key�
1
+ ��moves�{"0":{},"1":{"ORG":56516,"DATE":40493,"PERSON":36534,"GPE":26745,"MONEY":15158,"CARDINAL":14109,"NORP":9641,"PERCENT":9199,"WORK_OF_ART":4488,"LOC":4055,"TIME":3678,"QUANTITY":3123,"FAC":3046,"EVENT":3021,"ORDINAL":2142,"PRODUCT":1787,"LAW":1624,"LANGUAGE":355},"2":{"ORG":56516,"DATE":40493,"PERSON":36534,"GPE":26745,"MONEY":15158,"CARDINAL":14109,"NORP":9641,"PERCENT":9199,"WORK_OF_ART":4488,"LOC":4055,"TIME":3678,"QUANTITY":3123,"FAC":3046,"EVENT":3021,"ORDINAL":2142,"PRODUCT":1787,"LAW":1624,"LANGUAGE":355},"3":{"ORG":56516,"DATE":40493,"PERSON":36534,"GPE":26745,"MONEY":15158,"CARDINAL":14109,"NORP":9641,"PERCENT":9199,"WORK_OF_ART":4488,"LOC":4055,"TIME":3678,"QUANTITY":3123,"FAC":3046,"EVENT":3021,"ORDINAL":2142,"PRODUCT":1787,"LAW":1624,"LANGUAGE":355},"4":{"ORG":56516,"DATE":40493,"PERSON":36534,"GPE":26745,"MONEY":15158,"CARDINAL":14109,"NORP":9641,"PERCENT":9199,"WORK_OF_ART":4488,"LOC":4055,"TIME":3678,"QUANTITY":3123,"FAC":3046,"EVENT":3021,"ORDINAL":2142,"PRODUCT":1787,"LAW":1624,"LANGUAGE":355,"":1},"5":{"":1}}�cfg��neg_key�
parser/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4dffb1da181e83f7742b1a1f8060b9cb039c2136f49df9d7a94e24a4529b428f
3
  size 319909
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e80971fd38f1f20f11dabe644a485c6ef0846064256c7b2e929148a8b3ce6b97
3
  size 319909
parser/moves CHANGED
@@ -1,2 +1 @@
1
- ��moves�
2
- {"0":{"":994267},"1":{"":990803},"2":{"det":172595,"nsubj":165748,"compound":116623,"amod":105184,"aux":86667,"punct":65478,"advmod":62763,"poss":36443,"mark":27941,"nummod":22598,"auxpass":15594,"prep":14001,"nsubjpass":13856,"neg":12357,"cc":10739,"nmod":9562,"advcl":9062,"npadvmod":8168,"quantmod":7101,"intj":6464,"ccomp":5896,"dobj":3427,"expl":3360,"dep":2806,"predet":1944,"parataxis":1837,"csubj":1428,"preconj":621,"pobj||prep":616,"attr":578,"meta":376,"advmod||conj":368,"dobj||xcomp":352,"acomp":284,"nsubj||ccomp":224,"dative":206,"advmod||xcomp":149,"dobj||ccomp":70,"csubjpass":64,"dobj||conj":62,"prep||conj":51,"acl":48,"prep||nsubj":41,"prep||dobj":36,"xcomp":34,"advmod||ccomp":32,"oprd":31},"3":{"punct":183790,"pobj":182191,"prep":174008,"dobj":89615,"conj":59687,"cc":51930,"ccomp":30385,"advmod":22861,"xcomp":21021,"relcl":20969,"advcl":19828,"attr":17741,"acomp":16922,"appos":15265,"case":13388,"acl":12085,"pcomp":10324,"npadvmod":9796,"prt":8179,"agent":3903,"dative":3866,"nsubj":3470,"neg":2906,"amod":2839,"intj":2819,"nummod":2732,"oprd":2301,"dep":1487,"parataxis":1261,"quantmod":319,"nmod":294,"acl||dobj":200,"prep||dobj":190,"prep||nsubj":162,"acl||nsubj":159,"appos||nsubj":145,"relcl||dobj":134,"relcl||nsubj":111,"aux":103,"expl":96,"meta":92,"appos||dobj":86,"preconj":71,"csubj":65,"prep||nsubjpass":55,"prep||advmod":54,"prep||acomp":53,"det":51,"nsubjpass":45,"relcl||pobj":42,"acl||nsubjpass":42,"mark":40,"auxpass":39,"prep||pobj":36,"relcl||nsubjpass":32,"appos||nsubjpass":31},"4":{"ROOT":111664}}�cfg��neg_key�
1
+ ��moves� {"0":{"":994332},"1":{"":999432},"2":{"det":172595,"nsubj":165748,"compound":116623,"amod":105184,"aux":86667,"punct":65478,"advmod":62763,"poss":36443,"mark":27941,"nummod":22598,"auxpass":15594,"prep":14001,"nsubjpass":13856,"neg":12357,"cc":10739,"nmod":9562,"advcl":9062,"npadvmod":8168,"quantmod":7101,"intj":6464,"ccomp":5896,"dobj":3427,"expl":3360,"dep":2871,"predet":1944,"parataxis":1837,"csubj":1428,"preconj":621,"pobj||prep":616,"attr":578,"meta":376,"advmod||conj":368,"dobj||xcomp":352,"acomp":284,"nsubj||ccomp":224,"dative":206,"advmod||xcomp":149,"dobj||ccomp":70,"csubjpass":64,"dobj||conj":62,"prep||conj":51,"acl":48,"prep||nsubj":41,"prep||dobj":36,"xcomp":34,"advmod||ccomp":32,"oprd":31},"3":{"punct":183790,"pobj":182191,"prep":174008,"dobj":89615,"conj":59687,"cc":51930,"ccomp":30385,"advmod":22861,"xcomp":21021,"relcl":20969,"advcl":19828,"attr":17741,"acomp":16922,"appos":15265,"case":13388,"acl":12085,"pcomp":10324,"dep":10116,"npadvmod":9796,"prt":8179,"agent":3903,"dative":3866,"nsubj":3470,"neg":2906,"amod":2839,"intj":2819,"nummod":2732,"oprd":2301,"parataxis":1261,"quantmod":319,"nmod":294,"acl||dobj":200,"prep||dobj":190,"prep||nsubj":162,"acl||nsubj":159,"appos||nsubj":145,"relcl||dobj":134,"relcl||nsubj":111,"aux":103,"expl":96,"meta":92,"appos||dobj":86,"preconj":71,"csubj":65,"prep||nsubjpass":55,"prep||advmod":54,"prep||acomp":53,"det":51,"nsubjpass":45,"relcl||pobj":42,"acl||nsubjpass":42,"mark":40,"auxpass":39,"prep||pobj":36,"relcl||nsubjpass":32,"appos||nsubjpass":31},"4":{"ROOT":111664}}�cfg��neg_key�
 
senter/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f0dcd17fe48150d6202c6e499a296f713f29506ffb0fdc4fe76631b9c84df0a6
3
  size 197089
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:907fcab12ab2be9ac2f0facf732009bb278dc81fb25065fcd27c4909fd761c68
3
  size 197089
tagger/cfg CHANGED
@@ -48,6 +48,7 @@
48
  "WP$",
49
  "WRB",
50
  "XX",
 
51
  "``"
52
  ],
53
  "neg_prefix":"!",
48
  "WP$",
49
  "WRB",
50
  "XX",
51
+ "_SP",
52
  "``"
53
  ],
54
  "neg_prefix":"!",
tagger/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a8decebdc61f212d8373cbd9255b6593461c29db529daa88ca1b222fb576f8a3
3
- size 19441
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d62054e74f89be08b720157a45ddf3a5a5a9e8c51f191cdea364e390c0032d7e
3
+ size 19829
tok2vec/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1948c4cf0d3b8bfc58386f9f62a3cfdbffc3ff29e13d960a5392896e06d60a2f
3
  size 6139229
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6967e88ec7b0680d94a75500c46fe19a1b1e01ef5f608a58826077e45af5010d
3
  size 6139229
tokenizer CHANGED
The diff for this file is too large to render. See raw diff
vocab/strings.json CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9fbd4af89668f1aa5e362c98949666f92186f169e0941afc4f00172d4f25a2cd
3
- size 1089443
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b1966d1f07a05b68576df07f43f40203b4e11124ad82cc839e8312ad8d7fdae7
3
+ size 1103983