adrianeboyd committed on
Commit
15db25f
1 Parent(s): f8cf61b

Update spaCy pipeline

LICENSES_SOURCES CHANGED
@@ -64,11 +64,11 @@ Princeton University and LICENSEE agrees to preserve same.```
 
 
 
-# GloVe Common Crawl
 
-* Author: Jeffrey Pennington, Richard Socher, and Christopher D. Manning
-* URL: https://nlp.stanford.edu/projects/glove/
-* License: Public Domain Dedication and License v1.0
 
 ```
 The laws of most jurisdictions throughout the world automatically confer exclusive Copyright and Related Rights (defined below) upon the creator and subsequent owner(s) (each and all, an "owner") of an original work of authorship and/or a database (each, a "Work").
 
 
 
+# Explosion Vectors (OSCAR 2109 + Wikipedia + OpenSubtitles + WMT News Crawl)
 
+* Author: Explosion
+* URL: https://github.com/explosion/spacy-vectors-builder
+* License: CC0
 
 ```
 The laws of most jurisdictions throughout the world automatically confer exclusive Copyright and Related Rights (defined below) upon the creator and subsequent owner(s) (each and all, an "owner") of an original work of authorship and/or a database (each, a "Work").
README.md CHANGED
@@ -14,41 +14,41 @@ model-index:
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
- value: 0.8511198946
18
  - name: NER Recall
19
  type: recall
20
- value: 0.8411458333
21
  - name: NER F Score
22
  type: f_score
23
- value: 0.8461034709
24
  - task:
25
  name: TAG
26
  type: token-classification
27
  metrics:
28
  - name: TAG (XPOS) Accuracy
29
  type: accuracy
30
- value: 0.9730543186
31
  - task:
32
  name: UNLABELED_DEPENDENCIES
33
  type: token-classification
34
  metrics:
35
  - name: Unlabeled Attachment Score (UAS)
36
  type: f_score
37
- value: 0.9190946961
38
  - task:
39
  name: LABELED_DEPENDENCIES
40
  type: token-classification
41
  metrics:
42
  - name: Labeled Attachment Score (LAS)
43
  type: f_score
44
- value: 0.9007569337
45
  - task:
46
  name: SENTS
47
  type: token-classification
48
  metrics:
49
  - name: Sentences F-Score
50
  type: f_score
51
- value: 0.9055959278
52
  ---
53
  ### Details: https://spacy.io/models/en#en_core_web_md
54
 
@@ -57,12 +57,12 @@ English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter,
57
  | Feature | Description |
58
  | --- | --- |
59
  | **Name** | `en_core_web_md` |
60
- | **Version** | `3.3.0` |
61
- | **spaCy** | `>=3.3.0.dev0,<3.4.0` |
62
  | **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
63
  | **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
64
- | **Vectors** | 684830 keys, 20000 unique vectors (300 dimensions) |
65
- | **Sources** | [OntoNotes 5](https://catalog.ldc.upenn.edu/LDC2013T19) (Ralph Weischedel, Martha Palmer, Mitchell Marcus, Eduard Hovy, Sameer Pradhan, Lance Ramshaw, Nianwen Xue, Ann Taylor, Jeff Kaufman, Michelle Franchini, Mohammed El-Bachouti, Robert Belvin, Ann Houston)<br />[ClearNLP Constituent-to-Dependency Conversion](https://github.com/clir/clearnlp-guidelines/blob/master/md/components/dependency_conversion.md) (Emory University)<br />[WordNet 3.0](https://wordnet.princeton.edu/) (Princeton University)<br />[GloVe Common Crawl](https://nlp.stanford.edu/projects/glove/) (Jeffrey Pennington, Richard Socher, and Christopher D. Manning) |
66
  | **License** | `MIT` |
67
  | **Author** | [Explosion](https://explosion.ai) |
68
 
@@ -70,11 +70,11 @@ English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter,
70
 
71
  <details>
72
 
73
- <summary>View label scheme (112 labels for 3 components)</summary>
74
 
75
  | Component | Labels |
76
  | --- | --- |
77
- | **`tagger`** | `$`, `''`, `,`, `-LRB-`, `-RRB-`, `.`, `:`, `ADD`, `AFX`, `CC`, `CD`, `DT`, `EX`, `FW`, `HYPH`, `IN`, `JJ`, `JJR`, `JJS`, `LS`, `MD`, `NFP`, `NN`, `NNP`, `NNPS`, `NNS`, `PDT`, `POS`, `PRP`, `PRP$`, `RB`, `RBR`, `RBS`, `RP`, `SYM`, `TO`, `UH`, `VB`, `VBD`, `VBG`, `VBN`, `VBP`, `VBZ`, `WDT`, `WP`, `WP$`, `WRB`, `XX`, ```` |
78
  | **`parser`** | `ROOT`, `acl`, `acomp`, `advcl`, `advmod`, `agent`, `amod`, `appos`, `attr`, `aux`, `auxpass`, `case`, `cc`, `ccomp`, `compound`, `conj`, `csubj`, `csubjpass`, `dative`, `dep`, `det`, `dobj`, `expl`, `intj`, `mark`, `meta`, `neg`, `nmod`, `npadvmod`, `nsubj`, `nsubjpass`, `nummod`, `oprd`, `parataxis`, `pcomp`, `pobj`, `poss`, `preconj`, `predet`, `prep`, `prt`, `punct`, `quantmod`, `relcl`, `xcomp` |
79
  | **`ner`** | `CARDINAL`, `DATE`, `EVENT`, `FAC`, `GPE`, `LANGUAGE`, `LAW`, `LOC`, `MONEY`, `NORP`, `ORDINAL`, `ORG`, `PERCENT`, `PERSON`, `PRODUCT`, `QUANTITY`, `TIME`, `WORK_OF_ART` |
80
 
@@ -88,12 +88,12 @@ English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter,
88
  | `TOKEN_P` | 99.57 |
89
  | `TOKEN_R` | 99.58 |
90
  | `TOKEN_F` | 99.57 |
91
- | `TAG_ACC` | 97.31 |
92
- | `SENTS_P` | 91.97 |
93
- | `SENTS_R` | 89.19 |
94
- | `SENTS_F` | 90.56 |
95
- | `DEP_UAS` | 91.91 |
96
- | `DEP_LAS` | 90.08 |
97
- | `ENTS_P` | 85.11 |
98
- | `ENTS_R` | 84.11 |
99
- | `ENTS_F` | 84.61 |
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
+ value: 0.8644910088
18
  - name: NER Recall
19
  type: recall
20
+ value: 0.8450520833
21
  - name: NER F Score
22
  type: f_score
23
+ value: 0.8546610277
24
  - task:
25
  name: TAG
26
  type: token-classification
27
  metrics:
28
  - name: TAG (XPOS) Accuracy
29
  type: accuracy
30
+ value: 0.9727809676
31
  - task:
32
  name: UNLABELED_DEPENDENCIES
33
  type: token-classification
34
  metrics:
35
  - name: Unlabeled Attachment Score (UAS)
36
  type: f_score
37
+ value: 0.9208996725
38
  - task:
39
  name: LABELED_DEPENDENCIES
40
  type: token-classification
41
  metrics:
42
  - name: Labeled Attachment Score (LAS)
43
  type: f_score
44
+ value: 0.9025794107
45
  - task:
46
  name: SENTS
47
  type: token-classification
48
  metrics:
49
  - name: Sentences F-Score
50
  type: f_score
51
+ value: 0.9090848229
52
  ---
53
  ### Details: https://spacy.io/models/en#en_core_web_md
54
 
57
  | Feature | Description |
58
  | --- | --- |
59
  | **Name** | `en_core_web_md` |
60
+ | **Version** | `3.4.0` |
61
+ | **spaCy** | `>=3.4.0,<3.5.0` |
62
  | **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
63
  | **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
64
+ | **Vectors** | 514157 keys, 20000 unique vectors (300 dimensions) |
65
+ | **Sources** | [OntoNotes 5](https://catalog.ldc.upenn.edu/LDC2013T19) (Ralph Weischedel, Martha Palmer, Mitchell Marcus, Eduard Hovy, Sameer Pradhan, Lance Ramshaw, Nianwen Xue, Ann Taylor, Jeff Kaufman, Michelle Franchini, Mohammed El-Bachouti, Robert Belvin, Ann Houston)<br />[ClearNLP Constituent-to-Dependency Conversion](https://github.com/clir/clearnlp-guidelines/blob/master/md/components/dependency_conversion.md) (Emory University)<br />[WordNet 3.0](https://wordnet.princeton.edu/) (Princeton University)<br />[Explosion Vectors (OSCAR 2109 + Wikipedia + OpenSubtitles + WMT News Crawl)](https://github.com/explosion/spacy-vectors-builder) (Explosion) |
66
  | **License** | `MIT` |
67
  | **Author** | [Explosion](https://explosion.ai) |
68
 
70
 
71
  <details>
72
 
73
+ <summary>View label scheme (113 labels for 3 components)</summary>
74
 
75
  | Component | Labels |
76
  | --- | --- |
77
+ | **`tagger`** | `$`, `''`, `,`, `-LRB-`, `-RRB-`, `.`, `:`, `ADD`, `AFX`, `CC`, `CD`, `DT`, `EX`, `FW`, `HYPH`, `IN`, `JJ`, `JJR`, `JJS`, `LS`, `MD`, `NFP`, `NN`, `NNP`, `NNPS`, `NNS`, `PDT`, `POS`, `PRP`, `PRP$`, `RB`, `RBR`, `RBS`, `RP`, `SYM`, `TO`, `UH`, `VB`, `VBD`, `VBG`, `VBN`, `VBP`, `VBZ`, `WDT`, `WP`, `WP$`, `WRB`, `XX`, `_SP`, ```` |
78
  | **`parser`** | `ROOT`, `acl`, `acomp`, `advcl`, `advmod`, `agent`, `amod`, `appos`, `attr`, `aux`, `auxpass`, `case`, `cc`, `ccomp`, `compound`, `conj`, `csubj`, `csubjpass`, `dative`, `dep`, `det`, `dobj`, `expl`, `intj`, `mark`, `meta`, `neg`, `nmod`, `npadvmod`, `nsubj`, `nsubjpass`, `nummod`, `oprd`, `parataxis`, `pcomp`, `pobj`, `poss`, `preconj`, `predet`, `prep`, `prt`, `punct`, `quantmod`, `relcl`, `xcomp` |
79
  | **`ner`** | `CARDINAL`, `DATE`, `EVENT`, `FAC`, `GPE`, `LANGUAGE`, `LAW`, `LOC`, `MONEY`, `NORP`, `ORDINAL`, `ORG`, `PERCENT`, `PERSON`, `PRODUCT`, `QUANTITY`, `TIME`, `WORK_OF_ART` |
80
 
88
  | `TOKEN_P` | 99.57 |
89
  | `TOKEN_R` | 99.58 |
90
  | `TOKEN_F` | 99.57 |
91
+ | `TAG_ACC` | 97.28 |
92
+ | `SENTS_P` | 92.28 |
93
+ | `SENTS_R` | 89.58 |
94
+ | `SENTS_F` | 90.91 |
95
+ | `DEP_UAS` | 92.09 |
96
+ | `DEP_LAS` | 90.26 |
97
+ | `ENTS_P` | 86.45 |
98
+ | `ENTS_R` | 84.51 |
99
+ | `ENTS_F` | 85.47 |
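The F-scores reported in this commit appear consistent with the harmonic mean of the precision and recall listed next to them. The following is a minimal sketch that recomputes the new NER and sentence F-scores from the unrounded precision and recall values in this commit (the NER values appear in the README metadata, the sentence values in accuracy.json). The helper name `f_score` is an assumption made for illustration and is not part of spaCy's API.

```python
def f_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 when both are zero)."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


# New (3.4.0) precision/recall values copied from this commit's diff.
ner_f = f_score(0.8644910088, 0.8450520833)
sents_f = f_score(0.9227998641, 0.8957714889)

# Print as percentages, the way the README accuracy table rounds them.
print(f"ENTS_F  {ner_f * 100:.2f}")
print(f"SENTS_F {sents_f * 100:.2f}")
```

Comparing the printed values with the `ENTS_F` and `SENTS_F` rows is a quick sanity check that the table and the JSON metadata were generated from the same evaluation run.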
accuracy.json CHANGED
@@ -3,328 +3,328 @@
3
  "token_p": 0.9956819193,
4
  "token_r": 0.9957659295,
5
  "token_f": 0.9957239226,
6
- "tag_acc": 0.9730543186,
7
- "sents_p": 0.9196707931,
8
- "sents_r": 0.891945379,
9
- "sents_f": 0.9055959278,
10
- "dep_uas": 0.9190946961,
11
- "dep_las": 0.9007569337,
12
  "dep_las_per_type": {
13
  "prep": {
14
- "p": 0.8560125989,
15
- "r": 0.8662113451,
16
- "f": 0.8610817743
17
  },
18
  "det": {
19
- "p": 0.9772801303,
20
- "r": 0.978634918,
21
- "f": 0.977957055
22
  },
23
  "pobj": {
24
- "p": 0.962675204,
25
- "r": 0.9682701747,
26
- "f": 0.9654645836
27
  },
28
  "nsubj": {
29
- "p": 0.9572204318,
30
- "r": 0.9499233297,
31
- "f": 0.9535579207
32
  },
33
  "aux": {
34
- "p": 0.9785175322,
35
- "r": 0.9813050832,
36
- "f": 0.9799093253
37
  },
38
  "advmod": {
39
- "p": 0.8565884933,
40
- "r": 0.854282349,
41
- "f": 0.8554338669
42
  },
43
  "relcl": {
44
- "p": 0.767698328,
45
- "r": 0.7830188679,
46
- "f": 0.7752829172
47
  },
48
  "root": {
49
- "p": 0.919640425,
50
- "r": 0.890823933,
51
- "f": 0.9050028482
52
  },
53
  "xcomp": {
54
- "p": 0.8803810868,
55
- "r": 0.8955491744,
56
- "f": 0.8879003559
57
  },
58
  "amod": {
59
- "p": 0.9190016943,
60
- "r": 0.9137026239,
61
- "f": 0.9163444982
62
  },
63
  "compound": {
64
- "p": 0.9179876706,
65
- "r": 0.9288260192,
66
- "f": 0.9233750415
67
  },
68
  "poss": {
69
- "p": 0.96996997,
70
- "r": 0.9752415459,
71
- "f": 0.9725986149
72
  },
73
  "ccomp": {
74
- "p": 0.7815332326,
75
- "r": 0.8429735234,
76
- "f": 0.8110915148
77
  },
78
  "attr": {
79
- "p": 0.9055374593,
80
- "r": 0.9352396972,
81
- "f": 0.920148945
82
  },
83
  "case": {
84
- "p": 0.9772502473,
85
- "r": 0.988988989,
86
- "f": 0.9830845771
87
  },
88
  "mark": {
89
- "p": 0.9047619048,
90
- "r": 0.9112347642,
91
- "f": 0.9079867987
92
  },
93
  "intj": {
94
- "p": 0.671630094,
95
- "r": 0.6278388278,
96
- "f": 0.6489965922
97
  },
98
  "advcl": {
99
- "p": 0.6692111959,
100
- "r": 0.6623016872,
101
- "f": 0.6657385141
102
  },
103
  "cc": {
104
- "p": 0.8336738373,
105
- "r": 0.8296854443,
106
- "f": 0.8316748591
107
  },
108
  "neg": {
109
- "p": 0.944027986,
110
- "r": 0.9478173608,
111
- "f": 0.9459188783
112
  },
113
  "conj": {
114
- "p": 0.7673786887,
115
- "r": 0.7823514602,
116
- "f": 0.7747927445
117
  },
118
  "nsubjpass": {
119
- "p": 0.9214175655,
120
- "r": 0.92,
121
- "f": 0.9207082371
122
  },
123
  "auxpass": {
124
- "p": 0.9504242966,
125
  "r": 0.969476082,
126
- "f": 0.9598556608
127
  },
128
  "dobj": {
129
- "p": 0.9276569005,
130
- "r": 0.9411108455,
131
- "f": 0.934335443
132
  },
133
  "nummod": {
134
- "p": 0.9344345616,
135
- "r": 0.9285353535,
136
- "f": 0.9314756175
137
  },
138
  "npadvmod": {
139
- "p": 0.7719101124,
140
- "r": 0.7321492007,
141
- "f": 0.7515041021
142
  },
143
  "prt": {
144
- "p": 0.8105436573,
145
- "r": 0.8817204301,
146
- "f": 0.8446351931
147
  },
148
  "pcomp": {
149
- "p": 0.8834399431,
150
- "r": 0.8704481793,
151
- "f": 0.8768959436
152
  },
153
  "expl": {
154
- "p": 0.978858351,
155
  "r": 0.9914346895,
156
- "f": 0.985106383
157
  },
158
  "acl": {
159
- "p": 0.7338709677,
160
- "r": 0.695035461,
161
- "f": 0.7139254693
162
  },
163
  "agent": {
164
- "p": 0.8931034483,
165
- "r": 0.9283154122,
166
- "f": 0.9103690685
167
  },
168
  "dative": {
169
- "p": 0.7809278351,
170
- "r": 0.6949541284,
171
- "f": 0.7354368932
172
  },
173
  "acomp": {
174
- "p": 0.9010440309,
175
- "r": 0.9002267574,
176
- "f": 0.9006352087
177
  },
178
  "dep": {
179
- "p": 0.4375,
180
- "r": 0.1818181818,
181
- "f": 0.2568807339
182
  },
183
  "csubj": {
184
- "p": 0.6994535519,
185
- "r": 0.7573964497,
186
- "f": 0.7272727273
187
  },
188
  "quantmod": {
189
- "p": 0.8572710952,
190
- "r": 0.775792039,
191
- "f": 0.8144989339
192
  },
193
  "nmod": {
194
- "p": 0.7576923077,
195
- "r": 0.6002437538,
196
- "f": 0.6698401904
197
  },
198
  "appos": {
199
- "p": 0.7131675875,
200
  "r": 0.6720173536,
201
- "f": 0.6919812374
202
  },
203
  "predet": {
204
- "p": 0.8259109312,
205
- "r": 0.8755364807,
206
- "f": 0.85
207
  },
208
  "preconj": {
209
- "p": 0.5376344086,
210
- "r": 0.5813953488,
211
- "f": 0.5586592179
212
  },
213
  "oprd": {
214
- "p": 0.8384879725,
215
- "r": 0.728358209,
216
- "f": 0.7795527157
217
  },
218
  "parataxis": {
219
- "p": 0.627027027,
220
- "r": 0.5032537961,
221
- "f": 0.5583634176
222
  },
223
  "meta": {
224
- "p": 0.9047619048,
225
- "r": 0.3653846154,
226
- "f": 0.5205479452
227
  },
228
  "csubjpass": {
229
- "p": 0.625,
230
  "r": 0.8333333333,
231
- "f": 0.7142857143
232
  }
233
  },
234
- "ents_p": 0.8511198946,
235
- "ents_r": 0.8411458333,
236
- "ents_f": 0.8461034709,
237
  "ents_per_type": {
238
  "DATE": {
239
- "p": 0.8734459675,
240
- "r": 0.8698412698,
241
- "f": 0.8716398918
242
  },
243
  "GPE": {
244
- "p": 0.9166902805,
245
- "r": 0.9023709902,
246
- "f": 0.9094742761
247
  },
248
  "ORDINAL": {
249
- "p": 0.7703081232,
250
- "r": 0.8540372671,
251
- "f": 0.8100147275
252
  },
253
  "ORG": {
254
- "p": 0.8110611273,
255
- "r": 0.8125662778,
256
- "f": 0.8118130049
257
  },
258
  "CARDINAL": {
259
- "p": 0.8257619321,
260
- "r": 0.853745541,
261
- "f": 0.839520608
262
  },
263
  "PERSON": {
264
- "p": 0.8546404425,
265
- "r": 0.9076370757,
266
- "f": 0.8803418803
267
  },
268
  "NORP": {
269
- "p": 0.9006410256,
270
- "r": 0.8992,
271
- "f": 0.8999199359
272
  },
273
  "LOC": {
274
- "p": 0.7007575758,
275
- "r": 0.5891719745,
276
- "f": 0.6401384083
277
- },
278
- "LAW": {
279
- "p": 0.5,
280
- "r": 0.4375,
281
- "f": 0.4666666667
282
- },
283
- "FAC": {
284
- "p": 0.4519230769,
285
- "r": 0.3615384615,
286
- "f": 0.4017094017
287
  },
288
  "TIME": {
289
- "p": 0.752293578,
290
- "r": 0.7192982456,
291
- "f": 0.735426009
292
- },
293
- "QUANTITY": {
294
- "p": 0.7867647059,
295
- "r": 0.5879120879,
296
- "f": 0.6729559748
297
  },
298
  "WORK_OF_ART": {
299
- "p": 0.5416666667,
300
- "r": 0.3350515464,
301
- "f": 0.4140127389
302
- },
303
- "MONEY": {
304
- "p": 0.9107142857,
305
- "r": 0.9031877214,
306
- "f": 0.9069353883
307
  },
308
  "EVENT": {
309
- "p": 0.5578947368,
310
- "r": 0.3045977011,
311
- "f": 0.3940520446
312
  },
313
  "PERCENT": {
314
- "p": 0.9216,
315
- "r": 0.8820826953,
316
- "f": 0.9014084507
317
  },
318
  "PRODUCT": {
319
- "p": 0.48,
320
- "r": 0.2274881517,
321
- "f": 0.308681672
322
  },
323
  "LANGUAGE": {
324
- "p": 0.8333333333,
325
- "r": 0.625,
326
- "f": 0.7142857143
327
  }
328
  },
329
- "speed": 8543.7326288502
330
  }
3
  "token_p": 0.9956819193,
4
  "token_r": 0.9957659295,
5
  "token_f": 0.9957239226,
6
+ "tag_acc": 0.9727809676,
7
+ "sents_p": 0.9227998641,
8
+ "sents_r": 0.8957714889,
9
+ "sents_f": 0.9090848229,
10
+ "dep_uas": 0.9208996725,
11
+ "dep_las": 0.9025794107,
12
  "dep_las_per_type": {
13
  "prep": {
14
+ "p": 0.8615940393,
15
+ "r": 0.8687535572,
16
+ "f": 0.8651589866
17
  },
18
  "det": {
19
+ "p": 0.9788101059,
20
+ "r": 0.9793688331,
21
+ "f": 0.9790893898
22
  },
23
  "pobj": {
24
+ "p": 0.9638667292,
25
+ "r": 0.9679167485,
26
+ "f": 0.9658874934
27
  },
28
  "nsubj": {
29
+ "p": 0.9573857364,
30
+ "r": 0.9498357065,
31
+ "f": 0.9535957774
32
  },
33
  "aux": {
34
+ "p": 0.9798669623,
35
+ "r": 0.9835306686,
36
+ "f": 0.9816953972
37
  },
38
  "advmod": {
39
+ "p": 0.8569501812,
40
+ "r": 0.8552919401,
41
+ "f": 0.8561202577
42
  },
43
  "relcl": {
44
+ "p": 0.776330076,
45
+ "r": 0.7783018868,
46
+ "f": 0.7773147309
47
  },
48
  "root": {
49
+ "p": 0.9218431426,
50
+ "r": 0.8947819777,
51
+ "f": 0.9081110032
52
  },
53
  "xcomp": {
54
+ "p": 0.8827247191,
55
+ "r": 0.9023689878,
56
+ "f": 0.8924387646
57
  },
58
  "amod": {
59
+ "p": 0.9168349072,
60
+ "r": 0.9120829284,
61
+ "f": 0.9144527444
62
  },
63
  "compound": {
64
+ "p": 0.9189753738,
65
+ "r": 0.9310536868,
66
+ "f": 0.9249751024
67
  },
68
  "poss": {
69
+ "p": 0.972745491,
70
+ "r": 0.9770531401,
71
+ "f": 0.9748945571
72
  },
73
  "ccomp": {
74
+ "p": 0.7827560241,
75
+ "r": 0.8468431772,
76
+ "f": 0.8135394248
77
  },
78
  "attr": {
79
+ "p": 0.9095744681,
80
+ "r": 0.9348191758,
81
+ "f": 0.9220240564
82
  },
83
  "case": {
84
+ "p": 0.9782501236,
85
+ "r": 0.9904904905,
86
+ "f": 0.9843322557
87
  },
88
  "mark": {
89
+ "p": 0.9012023001,
90
+ "r": 0.9136195019,
91
+ "f": 0.9073684211
92
  },
93
  "intj": {
94
+ "p": 0.6692975533,
95
+ "r": 0.6212454212,
96
+ "f": 0.6443768997
97
  },
98
  "advcl": {
99
+ "p": 0.6774029926,
100
+ "r": 0.6726265424,
101
+ "f": 0.6750063179
102
  },
103
  "cc": {
104
+ "p": 0.8407015858,
105
+ "r": 0.8369812223,
106
+ "f": 0.838837279
107
  },
108
  "neg": {
109
+ "p": 0.9451097804,
110
+ "r": 0.9503261415,
111
+ "f": 0.9477107831
112
  },
113
  "conj": {
114
+ "p": 0.7748971706,
115
+ "r": 0.7826032226,
116
+ "f": 0.778731133
117
  },
118
  "nsubjpass": {
119
+ "p": 0.9196108551,
120
+ "r": 0.921025641,
121
+ "f": 0.9203177043
122
  },
123
  "auxpass": {
124
+ "p": 0.9491525424,
125
  "r": 0.969476082,
126
+ "f": 0.9592066712
127
  },
128
  "dobj": {
129
+ "p": 0.9284929356,
130
+ "r": 0.9426249104,
131
+ "f": 0.9355055558
132
  },
133
  "nummod": {
134
+ "p": 0.9416857652,
135
+ "r": 0.9338383838,
136
+ "f": 0.9377456574
137
  },
138
  "npadvmod": {
139
+ "p": 0.7796163971,
140
+ "r": 0.7364120782,
141
+ "f": 0.7573986116
142
  },
143
  "prt": {
144
+ "p": 0.8156606852,
145
+ "r": 0.8960573477,
146
+ "f": 0.853970965
147
  },
148
  "pcomp": {
149
+ "p": 0.8794926004,
150
+ "r": 0.8739495798,
151
+ "f": 0.8767123288
152
  },
153
  "expl": {
154
+ "p": 0.9809322034,
155
  "r": 0.9914346895,
156
+ "f": 0.9861554846
157
  },
158
  "acl": {
159
+ "p": 0.7488505747,
160
+ "r": 0.7108565194,
161
+ "f": 0.729359082
162
  },
163
  "agent": {
164
+ "p": 0.889632107,
165
+ "r": 0.9534050179,
166
+ "f": 0.9204152249
167
  },
168
  "dative": {
169
+ "p": 0.7918918919,
170
+ "r": 0.6720183486,
171
+ "f": 0.7270471464
172
  },
173
  "acomp": {
174
+ "p": 0.9041970803,
175
+ "r": 0.8988662132,
176
+ "f": 0.9015237662
177
  },
178
  "dep": {
179
+ "p": 0.3385093168,
180
+ "r": 0.1769480519,
181
+ "f": 0.2324093817
182
  },
183
  "csubj": {
184
+ "p": 0.6983240223,
185
+ "r": 0.7396449704,
186
+ "f": 0.7183908046
187
  },
188
  "quantmod": {
189
+ "p": 0.8521434821,
190
+ "r": 0.791226645,
191
+ "f": 0.8205560236
192
  },
193
  "nmod": {
194
+ "p": 0.7608359133,
195
+ "r": 0.5990249848,
196
+ "f": 0.6703034436
197
  },
198
  "appos": {
199
+ "p": 0.7089244851,
200
  "r": 0.6720173536,
201
+ "f": 0.6899777283
202
  },
203
  "predet": {
204
+ "p": 0.8380566802,
205
+ "r": 0.8884120172,
206
+ "f": 0.8625
207
  },
208
  "preconj": {
209
+ "p": 0.5463917526,
210
+ "r": 0.6162790698,
211
+ "f": 0.5792349727
212
  },
213
  "oprd": {
214
+ "p": 0.8697183099,
215
+ "r": 0.7373134328,
216
+ "f": 0.7980613893
217
  },
218
  "parataxis": {
219
+ "p": 0.5855614973,
220
+ "r": 0.4750542299,
221
+ "f": 0.5245508982
222
  },
223
  "meta": {
224
+ "p": 0.7714285714,
225
+ "r": 0.5192307692,
226
+ "f": 0.6206896552
227
  },
228
  "csubjpass": {
229
+ "p": 0.4545454545,
230
  "r": 0.8333333333,
231
+ "f": 0.5882352941
232
  }
233
  },
234
+ "ents_p": 0.8644910088,
235
+ "ents_r": 0.8450520833,
236
+ "ents_f": 0.8546610277,
237
  "ents_per_type": {
238
  "DATE": {
239
+ "p": 0.8751191611,
240
+ "r": 0.8742857143,
241
+ "f": 0.8747022392
242
  },
243
  "GPE": {
244
+ "p": 0.9322571346,
245
+ "r": 0.9020920502,
246
+ "f": 0.9169265665
247
  },
248
  "ORDINAL": {
249
+ "p": 0.7808988764,
250
+ "r": 0.8633540373,
251
+ "f": 0.8200589971
252
+ },
253
+ "FAC": {
254
+ "p": 0.390625,
255
+ "r": 0.3846153846,
256
+ "f": 0.3875968992
257
  },
258
  "ORG": {
259
+ "p": 0.8222987288,
260
+ "r": 0.8231707317,
261
+ "f": 0.8227344992
262
+ },
263
+ "QUANTITY": {
264
+ "p": 0.8310810811,
265
+ "r": 0.6758241758,
266
+ "f": 0.7454545455
267
  },
268
  "CARDINAL": {
269
+ "p": 0.8402818555,
270
+ "r": 0.8507728894,
271
+ "f": 0.8454948301
272
  },
273
  "PERSON": {
274
+ "p": 0.8898495037,
275
+ "r": 0.9069843342,
276
+ "f": 0.898335219
277
  },
278
  "NORP": {
279
+ "p": 0.902676399,
280
+ "r": 0.8904,
281
+ "f": 0.896496174
282
  },
283
  "LOC": {
284
+ "p": 0.7042253521,
285
+ "r": 0.6369426752,
286
+ "f": 0.6688963211
287
  },
288
  "TIME": {
289
+ "p": 0.7459807074,
290
+ "r": 0.6783625731,
291
+ "f": 0.7105666156
292
  },
293
  "WORK_OF_ART": {
294
+ "p": 0.5303030303,
295
+ "r": 0.3608247423,
296
+ "f": 0.4294478528
297
  },
298
  "EVENT": {
299
+ "p": 0.6593406593,
300
+ "r": 0.3448275862,
301
+ "f": 0.4528301887
302
+ },
303
+ "LAW": {
304
+ "p": 0.6481481481,
305
+ "r": 0.546875,
306
+ "f": 0.593220339
307
+ },
308
+ "MONEY": {
309
+ "p": 0.9265944645,
310
+ "r": 0.9090909091,
311
+ "f": 0.9177592372
312
  },
313
  "PERCENT": {
314
+ "p": 0.9072,
315
+ "r": 0.8683001531,
316
+ "f": 0.8873239437
317
  },
318
  "PRODUCT": {
319
+ "p": 0.6506024096,
320
+ "r": 0.2559241706,
321
+ "f": 0.3673469388
322
  },
323
  "LANGUAGE": {
324
+ "p": 0.76,
325
+ "r": 0.59375,
326
+ "f": 0.6666666667
327
  }
328
  },
329
+ "speed": 9753.3917239012
330
  }
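To read the accuracy.json change at a glance, the headline metrics can be compared directly. This is a small sketch under stated assumptions: the `OLD` and `NEW` dictionaries are hand-copied from the removed and added lines of the diff, and `metric_deltas` is a hypothetical helper name, not something shipped with the pipeline.

```python
# Headline metrics copied by hand from the accuracy.json diff (3.3.0 -> 3.4.0).
OLD = {
    "tag_acc": 0.9730543186,
    "sents_f": 0.9055959278,
    "dep_uas": 0.9190946961,
    "dep_las": 0.9007569337,
    "ents_f": 0.8461034709,
    "speed": 8543.7326288502,
}
NEW = {
    "tag_acc": 0.9727809676,
    "sents_f": 0.9090848229,
    "dep_uas": 0.9208996725,
    "dep_las": 0.9025794107,
    "ents_f": 0.8546610277,
    "speed": 9753.3917239012,
}


def metric_deltas(old: dict, new: dict) -> dict:
    """Return {metric: new - old} for every metric present in both mappings."""
    return {name: new[name] - old[name] for name in old if name in new}


deltas = metric_deltas(OLD, NEW)
for name, delta in sorted(deltas.items()):
    # A leading '+' marks an increase, '-' a decrease.
    print(f"{name:8s} {delta:+.6f}")
```

In this comparison every headline metric increases except `tag_acc`, which drops slightly.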
en_core_web_md-any-py3-none-any.whl CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:473336f57d587ffdacc16d62af025d5cf67b629a6146f6f476ebcd9f3471a913
-size 33453509
+oid sha256:962cde66dac9c03196c84834d478397527cc614cd515cb1442141c78e89dc17f
+size 42781333
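The wheel is stored through Git LFS, so the diff only shows the pointer file: a spec version line, an `oid` (hash algorithm and digest), and a `size` in bytes. As a hedged sketch, a downloaded wheel could be checked against the new pointer roughly as follows. The function names `parse_lfs_pointer` and `matches_pointer` are assumptions for illustration and are not part of Git LFS or spaCy.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {"algo": algo, "digest": digest, "size": int(fields["size"])}


def matches_pointer(path: Path, pointer: dict) -> bool:
    """Check a local file's size and hash against a parsed pointer."""
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as fh:
        # Read in 1 MiB chunks so large wheels are not loaded into memory at once.
        for chunk in iter(lambda: fh.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# New pointer contents copied from the diff above.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:962cde66dac9c03196c84834d478397527cc614cd515cb1442141c78e89dc17f
size 42781333
"""

pointer = parse_lfs_pointer(NEW_POINTER)
print(pointer["algo"], pointer["size"])
```

Assuming the wheel has been downloaded to the working directory, `matches_pointer(Path("en_core_web_md-any-py3-none-any.whl"), pointer)` would report whether the local file corresponds to this commit.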
meta.json CHANGED
@@ -1,18 +1,18 @@
1
  {
2
  "lang":"en",
3
  "name":"core_web_md",
4
- "version":"3.3.0",
5
  "description":"English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"MIT",
10
- "spacy_version":">=3.3.0.dev0,<3.4.0",
11
- "spacy_git_version":"849bef2de",
12
  "vectors":{
13
  "width":300,
14
  "vectors":20000,
15
- "keys":684830,
16
  "name":"en_vectors"
17
  },
18
  "labels":{
@@ -68,6 +68,7 @@
68
  "WP$",
69
  "WRB",
70
  "XX",
71
  "``"
72
  ],
73
  "parser":[
@@ -169,330 +170,330 @@
169
  "token_p":0.9956819193,
170
  "token_r":0.9957659295,
171
  "token_f":0.9957239226,
172
- "tag_acc":0.9730543186,
173
- "sents_p":0.9196707931,
174
- "sents_r":0.891945379,
175
- "sents_f":0.9055959278,
176
- "dep_uas":0.9190946961,
177
- "dep_las":0.9007569337,
178
  "dep_las_per_type":{
179
  "prep":{
180
- "p":0.8560125989,
181
- "r":0.8662113451,
182
- "f":0.8610817743
183
  },
184
  "det":{
185
- "p":0.9772801303,
186
- "r":0.978634918,
187
- "f":0.977957055
188
  },
189
  "pobj":{
190
- "p":0.962675204,
191
- "r":0.9682701747,
192
- "f":0.9654645836
193
  },
194
  "nsubj":{
195
- "p":0.9572204318,
196
- "r":0.9499233297,
197
- "f":0.9535579207
198
  },
199
  "aux":{
200
- "p":0.9785175322,
201
- "r":0.9813050832,
202
- "f":0.9799093253
203
  },
204
  "advmod":{
205
- "p":0.8565884933,
206
- "r":0.854282349,
207
- "f":0.8554338669
208
  },
209
  "relcl":{
210
- "p":0.767698328,
211
- "r":0.7830188679,
212
- "f":0.7752829172
213
  },
214
  "root":{
215
- "p":0.919640425,
216
- "r":0.890823933,
217
- "f":0.9050028482
218
  },
219
  "xcomp":{
220
- "p":0.8803810868,
221
- "r":0.8955491744,
222
- "f":0.8879003559
223
  },
224
  "amod":{
225
- "p":0.9190016943,
226
- "r":0.9137026239,
227
- "f":0.9163444982
228
  },
229
  "compound":{
230
- "p":0.9179876706,
231
- "r":0.9288260192,
232
- "f":0.9233750415
233
  },
234
  "poss":{
235
- "p":0.96996997,
236
- "r":0.9752415459,
237
- "f":0.9725986149
238
  },
239
  "ccomp":{
240
- "p":0.7815332326,
241
- "r":0.8429735234,
242
- "f":0.8110915148
243
  },
244
  "attr":{
245
- "p":0.9055374593,
246
- "r":0.9352396972,
247
- "f":0.920148945
248
  },
249
  "case":{
250
- "p":0.9772502473,
251
- "r":0.988988989,
252
- "f":0.9830845771
253
  },
254
  "mark":{
255
- "p":0.9047619048,
256
- "r":0.9112347642,
257
- "f":0.9079867987
258
  },
259
  "intj":{
260
- "p":0.671630094,
261
- "r":0.6278388278,
262
- "f":0.6489965922
263
  },
264
  "advcl":{
265
- "p":0.6692111959,
266
- "r":0.6623016872,
267
- "f":0.6657385141
268
  },
269
  "cc":{
270
- "p":0.8336738373,
271
- "r":0.8296854443,
272
- "f":0.8316748591
273
  },
274
  "neg":{
275
- "p":0.944027986,
276
- "r":0.9478173608,
277
- "f":0.9459188783
278
  },
279
  "conj":{
280
- "p":0.7673786887,
281
- "r":0.7823514602,
282
- "f":0.7747927445
283
  },
284
  "nsubjpass":{
285
- "p":0.9214175655,
286
- "r":0.92,
287
- "f":0.9207082371
288
  },
289
  "auxpass":{
290
- "p":0.9504242966,
291
  "r":0.969476082,
292
- "f":0.9598556608
293
  },
294
  "dobj":{
295
- "p":0.9276569005,
296
- "r":0.9411108455,
297
- "f":0.934335443
298
  },
299
  "nummod":{
300
- "p":0.9344345616,
301
- "r":0.9285353535,
302
- "f":0.9314756175
303
  },
304
  "npadvmod":{
305
- "p":0.7719101124,
306
- "r":0.7321492007,
307
- "f":0.7515041021
308
  },
309
  "prt":{
310
- "p":0.8105436573,
311
- "r":0.8817204301,
312
- "f":0.8446351931
313
  },
314
  "pcomp":{
315
- "p":0.8834399431,
316
- "r":0.8704481793,
317
- "f":0.8768959436
318
  },
319
  "expl":{
320
- "p":0.978858351,
321
  "r":0.9914346895,
322
- "f":0.985106383
323
  },
324
  "acl":{
325
- "p":0.7338709677,
326
- "r":0.695035461,
327
- "f":0.7139254693
328
  },
329
  "agent":{
330
- "p":0.8931034483,
331
- "r":0.9283154122,
332
- "f":0.9103690685
333
  },
334
  "dative":{
335
- "p":0.7809278351,
336
- "r":0.6949541284,
337
- "f":0.7354368932
338
  },
339
  "acomp":{
340
- "p":0.9010440309,
341
- "r":0.9002267574,
342
- "f":0.9006352087
343
  },
344
  "dep":{
345
- "p":0.4375,
346
- "r":0.1818181818,
347
- "f":0.2568807339
348
  },
349
  "csubj":{
350
- "p":0.6994535519,
351
- "r":0.7573964497,
352
- "f":0.7272727273
353
  },
354
  "quantmod":{
355
- "p":0.8572710952,
356
- "r":0.775792039,
357
- "f":0.8144989339
358
  },
359
  "nmod":{
360
- "p":0.7576923077,
361
- "r":0.6002437538,
362
- "f":0.6698401904
363
  },
364
  "appos":{
365
- "p":0.7131675875,
366
  "r":0.6720173536,
367
- "f":0.6919812374
368
  },
369
  "predet":{
370
- "p":0.8259109312,
371
- "r":0.8755364807,
372
- "f":0.85
373
  },
374
  "preconj":{
375
- "p":0.5376344086,
376
- "r":0.5813953488,
377
- "f":0.5586592179
378
  },
379
  "oprd":{
380
- "p":0.8384879725,
381
- "r":0.728358209,
382
- "f":0.7795527157
383
  },
384
  "parataxis":{
385
- "p":0.627027027,
386
- "r":0.5032537961,
387
- "f":0.5583634176
388
  },
389
  "meta":{
390
- "p":0.9047619048,
391
- "r":0.3653846154,
392
- "f":0.5205479452
393
  },
394
  "csubjpass":{
395
- "p":0.625,
396
  "r":0.8333333333,
397
- "f":0.7142857143
398
  }
399
  },
400
- "ents_p":0.8511198946,
401
- "ents_r":0.8411458333,
402
- "ents_f":0.8461034709,
403
  "ents_per_type":{
404
  "DATE":{
405
- "p":0.8734459675,
406
- "r":0.8698412698,
407
- "f":0.8716398918
408
  },
409
  "GPE":{
410
- "p":0.9166902805,
411
- "r":0.9023709902,
412
- "f":0.9094742761
413
  },
414
  "ORDINAL":{
415
- "p":0.7703081232,
416
- "r":0.8540372671,
417
- "f":0.8100147275
418
  },
419
  "ORG":{
420
- "p":0.8110611273,
421
- "r":0.8125662778,
422
- "f":0.8118130049
423
  },
424
  "CARDINAL":{
425
- "p":0.8257619321,
426
- "r":0.853745541,
427
- "f":0.839520608
428
  },
429
  "PERSON":{
430
- "p":0.8546404425,
431
- "r":0.9076370757,
432
- "f":0.8803418803
433
  },
434
  "NORP":{
435
- "p":0.9006410256,
436
- "r":0.8992,
437
- "f":0.8999199359
438
  },
439
  "LOC":{
440
- "p":0.7007575758,
441
- "r":0.5891719745,
442
- "f":0.6401384083
443
- },
444
- "LAW":{
445
- "p":0.5,
446
- "r":0.4375,
447
- "f":0.4666666667
448
- },
449
- "FAC":{
450
- "p":0.4519230769,
451
- "r":0.3615384615,
452
- "f":0.4017094017
453
  },
454
  "TIME":{
455
- "p":0.752293578,
456
- "r":0.7192982456,
457
- "f":0.735426009
458
- },
459
- "QUANTITY":{
460
- "p":0.7867647059,
461
- "r":0.5879120879,
462
- "f":0.6729559748
463
  },
464
  "WORK_OF_ART":{
465
- "p":0.5416666667,
466
- "r":0.3350515464,
467
- "f":0.4140127389
468
- },
469
- "MONEY":{
470
- "p":0.9107142857,
471
- "r":0.9031877214,
472
- "f":0.9069353883
473
  },
474
  "EVENT":{
475
- "p":0.5578947368,
476
- "r":0.3045977011,
477
- "f":0.3940520446
478
  },
479
  "PERCENT":{
480
- "p":0.9216,
481
- "r":0.8820826953,
482
- "f":0.9014084507
483
  },
484
  "PRODUCT":{
485
- "p":0.48,
486
- "r":0.2274881517,
487
- "f":0.308681672
488
  },
489
  "LANGUAGE":{
490
- "p":0.8333333333,
491
- "r":0.625,
492
- "f":0.7142857143
493
  }
494
  },
495
- "speed":8543.7326288502
496
  },
497
  "sources":[
498
  {
@@ -514,10 +515,10 @@
514
  "license":"WordNet 3.0 License"
515
  },
516
  {
517
- "name":"GloVe Common Crawl",
518
- "url":"https://nlp.stanford.edu/projects/glove/",
519
- "license":"Public Domain Dedication and License v1.0",
520
- "author":"Jeffrey Pennington, Richard Socher, and Christopher D. Manning"
521
  }
522
  ],
523
  "requirements":[
1
  {
2
  "lang":"en",
3
  "name":"core_web_md",
4
+ "version":"3.4.0",
5
  "description":"English pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"MIT",
10
+ "spacy_version":">=3.4.0,<3.5.0",
11
+ "spacy_git_version":"dd038b536",
12
  "vectors":{
13
  "width":300,
14
  "vectors":20000,
15
+ "keys":514157,
16
  "name":"en_vectors"
17
  },
18
  "labels":{
68
  "WP$",
69
  "WRB",
70
  "XX",
71
+ "_SP",
72
  "``"
73
  ],
74
  "parser":[
170
  "token_p":0.9956819193,
171
  "token_r":0.9957659295,
172
  "token_f":0.9957239226,
173
+ "tag_acc":0.9727809676,
174
+ "sents_p":0.9227998641,
175
+ "sents_r":0.8957714889,
176
+ "sents_f":0.9090848229,
177
+ "dep_uas":0.9208996725,
178
+ "dep_las":0.9025794107,
179
  "dep_las_per_type":{
180
  "prep":{
181
+ "p":0.8615940393,
182
+ "r":0.8687535572,
183
+ "f":0.8651589866
184
  },
185
  "det":{
186
+ "p":0.9788101059,
187
+ "r":0.9793688331,
188
+ "f":0.9790893898
189
  },
190
  "pobj":{
191
+ "p":0.9638667292,
192
+ "r":0.9679167485,
193
+ "f":0.9658874934
194
  },
195
  "nsubj":{
196
+ "p":0.9573857364,
197
+ "r":0.9498357065,
198
+ "f":0.9535957774
199
  },
200
  "aux":{
201
+ "p":0.9798669623,
202
+ "r":0.9835306686,
203
+ "f":0.9816953972
204
  },
205
  "advmod":{
206
+ "p":0.8569501812,
207
+ "r":0.8552919401,
208
+ "f":0.8561202577
209
  },
210
  "relcl":{
211
+ "p":0.776330076,
212
+ "r":0.7783018868,
213
+ "f":0.7773147309
214
  },
215
  "root":{
216
+ "p":0.9218431426,
217
+ "r":0.8947819777,
218
+ "f":0.9081110032
219
  },
220
  "xcomp":{
221
+ "p":0.8827247191,
222
+ "r":0.9023689878,
223
+ "f":0.8924387646
224
  },
225
  "amod":{
226
+ "p":0.9168349072,
227
+ "r":0.9120829284,
228
+ "f":0.9144527444
229
  },
230
  "compound":{
231
+ "p":0.9189753738,
232
+ "r":0.9310536868,
233
+ "f":0.9249751024
234
  },
235
  "poss":{
236
+ "p":0.972745491,
237
+ "r":0.9770531401,
238
+ "f":0.9748945571
239
  },
240
  "ccomp":{
241
+ "p":0.7827560241,
242
+ "r":0.8468431772,
243
+ "f":0.8135394248
244
  },
245
  "attr":{
246
+ "p":0.9095744681,
247
+ "r":0.9348191758,
248
+ "f":0.9220240564
249
  },
250
  "case":{
251
+ "p":0.9782501236,
252
+ "r":0.9904904905,
253
+ "f":0.9843322557
254
  },
255
  "mark":{
256
+ "p":0.9012023001,
257
+ "r":0.9136195019,
258
+ "f":0.9073684211
259
  },
260
  "intj":{
261
+ "p":0.6692975533,
262
+ "r":0.6212454212,
263
+ "f":0.6443768997
264
  },
265
  "advcl":{
266
+ "p":0.6774029926,
267
+ "r":0.6726265424,
268
+ "f":0.6750063179
269
  },
270
  "cc":{
271
+ "p":0.8407015858,
272
+ "r":0.8369812223,
273
+ "f":0.838837279
274
  },
275
  "neg":{
276
+ "p":0.9451097804,
277
+ "r":0.9503261415,
278
+ "f":0.9477107831
279
  },
280
  "conj":{
281
+ "p":0.7748971706,
282
+ "r":0.7826032226,
283
+ "f":0.778731133
284
  },
285
  "nsubjpass":{
286
+ "p":0.9196108551,
287
+ "r":0.921025641,
288
+ "f":0.9203177043
289
  },
290
  "auxpass":{
291
+ "p":0.9491525424,
292
  "r":0.969476082,
293
+ "f":0.9592066712
294
  },
295
  "dobj":{
296
+ "p":0.9284929356,
297
+ "r":0.9426249104,
298
+ "f":0.9355055558
299
  },
300
  "nummod":{
301
+ "p":0.9416857652,
302
+ "r":0.9338383838,
303
+ "f":0.9377456574
304
  },
305
  "npadvmod":{
306
+ "p":0.7796163971,
307
+ "r":0.7364120782,
308
+ "f":0.7573986116
309
  },
310
  "prt":{
311
+ "p":0.8156606852,
312
+ "r":0.8960573477,
313
+ "f":0.853970965
314
  },
315
  "pcomp":{
316
+ "p":0.8794926004,
317
+ "r":0.8739495798,
318
+ "f":0.8767123288
319
  },
320
  "expl":{
321
+ "p":0.9809322034,
322
  "r":0.9914346895,
323
+ "f":0.9861554846
324
  },
325
  "acl":{
326
+ "p":0.7488505747,
327
+ "r":0.7108565194,
328
+ "f":0.729359082
329
  },
330
  "agent":{
331
+ "p":0.889632107,
332
+ "r":0.9534050179,
333
+ "f":0.9204152249
334
  },
335
  "dative":{
336
+ "p":0.7918918919,
337
+ "r":0.6720183486,
338
+ "f":0.7270471464
339
  },
340
  "acomp":{
341
+ "p":0.9041970803,
342
+ "r":0.8988662132,
343
+ "f":0.9015237662
344
  },
345
  "dep":{
346
+ "p":0.3385093168,
347
+ "r":0.1769480519,
348
+ "f":0.2324093817
349
  },
350
  "csubj":{
351
+ "p":0.6983240223,
352
+ "r":0.7396449704,
353
+ "f":0.7183908046
354
  },
355
  "quantmod":{
356
+ "p":0.8521434821,
357
+ "r":0.791226645,
358
+ "f":0.8205560236
359
  },
360
  "nmod":{
361
+ "p":0.7608359133,
362
+ "r":0.5990249848,
363
+ "f":0.6703034436
364
  },
365
  "appos":{
366
+ "p":0.7089244851,
367
  "r":0.6720173536,
368
+ "f":0.6899777283
369
  },
370
  "predet":{
371
+ "p":0.8380566802,
372
+ "r":0.8884120172,
373
+ "f":0.8625
374
  },
375
  "preconj":{
376
+ "p":0.5463917526,
377
+ "r":0.6162790698,
378
+ "f":0.5792349727
379
  },
380
  "oprd":{
381
+ "p":0.8697183099,
382
+ "r":0.7373134328,
383
+ "f":0.7980613893
384
  },
385
  "parataxis":{
386
+ "p":0.5855614973,
387
+ "r":0.4750542299,
388
+ "f":0.5245508982
389
  },
390
  "meta":{
391
+ "p":0.7714285714,
392
+ "r":0.5192307692,
393
+ "f":0.6206896552
394
  },
395
  "csubjpass":{
396
+ "p":0.4545454545,
397
  "r":0.8333333333,
398
+ "f":0.5882352941
399
  }
400
  },
401
+ "ents_p":0.8644910088,
402
+ "ents_r":0.8450520833,
403
+ "ents_f":0.8546610277,
404
  "ents_per_type":{
405
  "DATE":{
406
+ "p":0.8751191611,
407
+ "r":0.8742857143,
408
+ "f":0.8747022392
409
  },
410
  "GPE":{
411
+ "p":0.9322571346,
412
+ "r":0.9020920502,
413
+ "f":0.9169265665
414
  },
415
  "ORDINAL":{
416
+ "p":0.7808988764,
417
+ "r":0.8633540373,
418
+ "f":0.8200589971
419
+ },
420
+ "FAC":{
421
+ "p":0.390625,
422
+ "r":0.3846153846,
423
+ "f":0.3875968992
424
  },
425
  "ORG":{
426
+ "p":0.8222987288,
427
+ "r":0.8231707317,
428
+ "f":0.8227344992
429
+ },
430
+ "QUANTITY":{
431
+ "p":0.8310810811,
432
+ "r":0.6758241758,
433
+ "f":0.7454545455
434
  },
435
  "CARDINAL":{
436
+ "p":0.8402818555,
437
+ "r":0.8507728894,
438
+ "f":0.8454948301
439
  },
440
  "PERSON":{
441
+ "p":0.8898495037,
442
+ "r":0.9069843342,
443
+ "f":0.898335219
444
  },
445
  "NORP":{
446
+ "p":0.902676399,
447
+ "r":0.8904,
448
+ "f":0.896496174
449
  },
450
  "LOC":{
451
+ "p":0.7042253521,
452
+ "r":0.6369426752,
453
+ "f":0.6688963211
454
  },
455
  "TIME":{
456
+ "p":0.7459807074,
457
+ "r":0.6783625731,
458
+ "f":0.7105666156
459
  },
460
  "WORK_OF_ART":{
461
+ "p":0.5303030303,
462
+ "r":0.3608247423,
463
+ "f":0.4294478528
464
  },
465
  "EVENT":{
466
+ "p":0.6593406593,
467
+ "r":0.3448275862,
468
+ "f":0.4528301887
469
+ },
470
+ "LAW":{
471
+ "p":0.6481481481,
472
+ "r":0.546875,
473
+ "f":0.593220339
474
+ },
475
+ "MONEY":{
476
+ "p":0.9265944645,
477
+ "r":0.9090909091,
478
+ "f":0.9177592372
479
  },
480
  "PERCENT":{
481
+ "p":0.9072,
482
+ "r":0.8683001531,
483
+ "f":0.8873239437
484
  },
485
  "PRODUCT":{
486
+ "p":0.6506024096,
487
+ "r":0.2559241706,
488
+ "f":0.3673469388
489
  },
490
  "LANGUAGE":{
491
+ "p":0.76,
492
+ "r":0.59375,
493
+ "f":0.6666666667
494
  }
495
  },
496
+ "speed":9753.3917239012
497
  },
498
  "sources":[
499
  {
515
  "license":"WordNet 3.0 License"
516
  },
517
  {
518
+ "name":"Explosion Vectors (OSCAR 2109 + Wikipedia + OpenSubtitles + WMT News Crawl)",
519
+ "url":"https://github.com/explosion/spacy-vectors-builder",
520
+ "license":"CC0",
521
+ "author":"Explosion"
522
  }
523
  ],
524
  "requirements":[
ner/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b41efca4b580a40c0b25cc1fb6a67bac8e130aaec912a693daf7ea929f45888e
3
  size 6511153
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4aa77dde97264deb19826a34229265638053037f2873b889e1e016dcbba614c1
3
  size 6511153
ner/moves CHANGED
@@ -1 +1 @@
1
- ��moves�{"0":{},"1":{"ORG":56356,"DATE":40381,"PERSON":36475,"GPE":26716,"MONEY":15121,"CARDINAL":14096,"NORP":9638,"PERCENT":9182,"WORK_OF_ART":4475,"LOC":4047,"TIME":3670,"QUANTITY":3114,"FAC":3042,"EVENT":3015,"ORDINAL":2142,"PRODUCT":1782,"LAW":1620,"LANGUAGE":355},"2":{"ORG":56356,"DATE":40381,"PERSON":36475,"GPE":26716,"MONEY":15121,"CARDINAL":14096,"NORP":9638,"PERCENT":9182,"WORK_OF_ART":4475,"LOC":4047,"TIME":3670,"QUANTITY":3114,"FAC":3042,"EVENT":3015,"ORDINAL":2142,"PRODUCT":1782,"LAW":1620,"LANGUAGE":355},"3":{"ORG":56356,"DATE":40381,"PERSON":36475,"GPE":26716,"MONEY":15121,"CARDINAL":14096,"NORP":9638,"PERCENT":9182,"WORK_OF_ART":4475,"LOC":4047,"TIME":3670,"QUANTITY":3114,"FAC":3042,"EVENT":3015,"ORDINAL":2142,"PRODUCT":1782,"LAW":1620,"LANGUAGE":355},"4":{"ORG":56356,"DATE":40381,"PERSON":36475,"GPE":26716,"MONEY":15121,"CARDINAL":14096,"NORP":9638,"PERCENT":9182,"WORK_OF_ART":4475,"LOC":4047,"TIME":3670,"QUANTITY":3114,"FAC":3042,"EVENT":3015,"ORDINAL":2142,"PRODUCT":1782,"LAW":1620,"LANGUAGE":355,"":1},"5":{"":1}}�cfg��neg_key�
1
+ ��moves�{"0":{},"1":{"ORG":56516,"DATE":40493,"PERSON":36534,"GPE":26745,"MONEY":15158,"CARDINAL":14109,"NORP":9641,"PERCENT":9199,"WORK_OF_ART":4488,"LOC":4055,"TIME":3678,"QUANTITY":3123,"FAC":3046,"EVENT":3021,"ORDINAL":2142,"PRODUCT":1787,"LAW":1624,"LANGUAGE":355},"2":{"ORG":56516,"DATE":40493,"PERSON":36534,"GPE":26745,"MONEY":15158,"CARDINAL":14109,"NORP":9641,"PERCENT":9199,"WORK_OF_ART":4488,"LOC":4055,"TIME":3678,"QUANTITY":3123,"FAC":3046,"EVENT":3021,"ORDINAL":2142,"PRODUCT":1787,"LAW":1624,"LANGUAGE":355},"3":{"ORG":56516,"DATE":40493,"PERSON":36534,"GPE":26745,"MONEY":15158,"CARDINAL":14109,"NORP":9641,"PERCENT":9199,"WORK_OF_ART":4488,"LOC":4055,"TIME":3678,"QUANTITY":3123,"FAC":3046,"EVENT":3021,"ORDINAL":2142,"PRODUCT":1787,"LAW":1624,"LANGUAGE":355},"4":{"ORG":56516,"DATE":40493,"PERSON":36534,"GPE":26745,"MONEY":15158,"CARDINAL":14109,"NORP":9641,"PERCENT":9199,"WORK_OF_ART":4488,"LOC":4055,"TIME":3678,"QUANTITY":3123,"FAC":3046,"EVENT":3021,"ORDINAL":2142,"PRODUCT":1787,"LAW":1624,"LANGUAGE":355,"":1},"5":{"":1}}�cfg��neg_key�
parser/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:76af6521395477a0c496585f2645d72951011e17f7460e3c88f44331e6537ae1
3
  size 319909
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3cea80069aa441a381c5cf33c1d593f4f3e75d2f9d016231c0ae3dc7863527ea
3
  size 319909
parser/moves CHANGED
@@ -1,2 +1 @@
1
- ��moves�
2
- {"0":{"":994267},"1":{"":990803},"2":{"det":172595,"nsubj":165748,"compound":116623,"amod":105184,"aux":86667,"punct":65478,"advmod":62763,"poss":36443,"mark":27941,"nummod":22598,"auxpass":15594,"prep":14001,"nsubjpass":13856,"neg":12357,"cc":10739,"nmod":9562,"advcl":9062,"npadvmod":8168,"quantmod":7101,"intj":6464,"ccomp":5896,"dobj":3427,"expl":3360,"dep":2806,"predet":1944,"parataxis":1837,"csubj":1428,"preconj":621,"pobj||prep":616,"attr":578,"meta":376,"advmod||conj":368,"dobj||xcomp":352,"acomp":284,"nsubj||ccomp":224,"dative":206,"advmod||xcomp":149,"dobj||ccomp":70,"csubjpass":64,"dobj||conj":62,"prep||conj":51,"acl":48,"prep||nsubj":41,"prep||dobj":36,"xcomp":34,"advmod||ccomp":32,"oprd":31},"3":{"punct":183790,"pobj":182191,"prep":174008,"dobj":89615,"conj":59687,"cc":51930,"ccomp":30385,"advmod":22861,"xcomp":21021,"relcl":20969,"advcl":19828,"attr":17741,"acomp":16922,"appos":15265,"case":13388,"acl":12085,"pcomp":10324,"npadvmod":9796,"prt":8179,"agent":3903,"dative":3866,"nsubj":3470,"neg":2906,"amod":2839,"intj":2819,"nummod":2732,"oprd":2301,"dep":1487,"parataxis":1261,"quantmod":319,"nmod":294,"acl||dobj":200,"prep||dobj":190,"prep||nsubj":162,"acl||nsubj":159,"appos||nsubj":145,"relcl||dobj":134,"relcl||nsubj":111,"aux":103,"expl":96,"meta":92,"appos||dobj":86,"preconj":71,"csubj":65,"prep||nsubjpass":55,"prep||advmod":54,"prep||acomp":53,"det":51,"nsubjpass":45,"relcl||pobj":42,"acl||nsubjpass":42,"mark":40,"auxpass":39,"prep||pobj":36,"relcl||nsubjpass":32,"appos||nsubjpass":31},"4":{"ROOT":111664}}�cfg��neg_key�
1
+ ��moves� {"0":{"":994332},"1":{"":999432},"2":{"det":172595,"nsubj":165748,"compound":116623,"amod":105184,"aux":86667,"punct":65478,"advmod":62763,"poss":36443,"mark":27941,"nummod":22598,"auxpass":15594,"prep":14001,"nsubjpass":13856,"neg":12357,"cc":10739,"nmod":9562,"advcl":9062,"npadvmod":8168,"quantmod":7101,"intj":6464,"ccomp":5896,"dobj":3427,"expl":3360,"dep":2871,"predet":1944,"parataxis":1837,"csubj":1428,"preconj":621,"pobj||prep":616,"attr":578,"meta":376,"advmod||conj":368,"dobj||xcomp":352,"acomp":284,"nsubj||ccomp":224,"dative":206,"advmod||xcomp":149,"dobj||ccomp":70,"csubjpass":64,"dobj||conj":62,"prep||conj":51,"acl":48,"prep||nsubj":41,"prep||dobj":36,"xcomp":34,"advmod||ccomp":32,"oprd":31},"3":{"punct":183790,"pobj":182191,"prep":174008,"dobj":89615,"conj":59687,"cc":51930,"ccomp":30385,"advmod":22861,"xcomp":21021,"relcl":20969,"advcl":19828,"attr":17741,"acomp":16922,"appos":15265,"case":13388,"acl":12085,"pcomp":10324,"dep":10116,"npadvmod":9796,"prt":8179,"agent":3903,"dative":3866,"nsubj":3470,"neg":2906,"amod":2839,"intj":2819,"nummod":2732,"oprd":2301,"parataxis":1261,"quantmod":319,"nmod":294,"acl||dobj":200,"prep||dobj":190,"prep||nsubj":162,"acl||nsubj":159,"appos||nsubj":145,"relcl||dobj":134,"relcl||nsubj":111,"aux":103,"expl":96,"meta":92,"appos||dobj":86,"preconj":71,"csubj":65,"prep||nsubjpass":55,"prep||advmod":54,"prep||acomp":53,"det":51,"nsubjpass":45,"relcl||pobj":42,"acl||nsubjpass":42,"mark":40,"auxpass":39,"prep||pobj":36,"relcl||nsubjpass":32,"appos||nsubjpass":31},"4":{"ROOT":111664}}�cfg��neg_key�
senter/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:cf22e914a7dfd4affbd3101e76c3c492b4be5bcbd5f6990693b7dec2e600da7d
3
  size 219953
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c7ebeec81aedef1f421d0814f78f6a847b7b7b3734754224f4e782b42288edd5
3
  size 219953
tagger/cfg CHANGED
@@ -48,6 +48,7 @@
48
  "WP$",
49
  "WRB",
50
  "XX",
51
  "``"
52
  ],
53
  "neg_prefix":"!",
48
  "WP$",
49
  "WRB",
50
  "XX",
51
+ "_SP",
52
  "``"
53
  ],
54
  "neg_prefix":"!",
tagger/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e1939d0de1aa5fe41a810a23f1c356267d927a5a7380602a2b8a42b19cfb08c3
3
- size 19441
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3cb683ea4bc5a74108e97a55419f3f32dbbf871034995af1a7f10d49b992e0fe
3
+ size 19829
tok2vec/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:739e3624f453c1a0117a2c9c55ec0a76308e9e07c71a58ffd5f18bcde3400256
3
  size 6365604
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f5df1f1a278e76fd260a1800c086dc4d48dc98483e3441bdcf225db5aa165282
3
  size 6365604
tokenizer CHANGED
The diff for this file is too large to render.
vocab/key2row CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0ab2632eead29429129edb1fae6bb17a84fdc8247b9632fe081bf1075a0395fe
3
- size 8214448
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:389912f67e81a52fbabb7edf8e36a0c3700b0b20d6dc6ef71bd56eb91ba08a0a
3
+ size 6165224
vocab/strings.json CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:649ca580aed1f07d3b761fa73308bc96f72b78e8bd4d51140a3a920b3429ba10
3
- size 9694998
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:06aeff5e8687bc142b8d3b54846e034ad18b8d9e98650d4a27273e483ed57f45
3
+ size 10369007
vocab/vectors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:22d102290ba09bb681c5a2e6063e9b4cea4c41dec9517565181a907b6ba6a830
3
  size 24000128
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a7e833278b382d25fed830bb17531356fe739fc6d3a223bdea62c21e5a6d220a
3
  size 24000128