EC2 Default User commited on
Commit
c0daea5
1 Parent(s): 7bfda68

Update spaCy pipeline

Browse files
README.md CHANGED
@@ -14,47 +14,41 @@ model-index:
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
- value: 0.7224990884
18
  - name: NER Recall
19
  type: recall
20
- value: 0.6531868132
21
  - name: NER F Score
22
  type: f_score
23
- value: 0.6860968431
24
  - task:
25
- name: POS
26
  type: token-classification
27
  metrics:
28
- - name: POS Accuracy
29
  type: accuracy
30
- value: 0.8957464158
31
  - task:
32
- name: SENTER
33
  type: token-classification
34
  metrics:
35
- - name: SENTER Precision
36
- type: precision
37
- value: 0.7817728729
38
- - name: SENTER Recall
39
- type: recall
40
- value: 0.7311469952
41
- - name: SENTER F Score
42
  type: f_score
43
- value: 0.7556129032
44
  - task:
45
- name: UNLABELED_DEPENDENCIES
46
  type: token-classification
47
  metrics:
48
- - name: Unlabeled Dependencies Accuracy
49
- type: accuracy
50
- value: 0.6965379684
51
  - task:
52
- name: LABELED_DEPENDENCIES
53
  type: token-classification
54
  metrics:
55
- - name: Labeled Dependencies Accuracy
56
- type: accuracy
57
- value: 0.6965379684
58
  ---
59
  ### Details: https://spacy.io/models/zh#zh_core_web_sm
60
 
@@ -63,8 +57,8 @@ Chinese pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter,
63
  | Feature | Description |
64
  | --- | --- |
65
  | **Name** | `zh_core_web_sm` |
66
- | **Version** | `3.2.0` |
67
- | **spaCy** | `>=3.2.0,<3.3.0` |
68
  | **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `ner` |
69
  | **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `ner` |
70
  | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
@@ -76,13 +70,12 @@ Chinese pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter,
76
 
77
  <details>
78
 
79
- <summary>View label scheme (101 labels for 4 components)</summary>
80
 
81
  | Component | Labels |
82
  | --- | --- |
83
  | **`tagger`** | `AD`, `AS`, `BA`, `CC`, `CD`, `CS`, `DEC`, `DEG`, `DER`, `DEV`, `DT`, `ETC`, `FW`, `IJ`, `INF`, `JJ`, `LB`, `LC`, `M`, `MSP`, `NN`, `NR`, `NT`, `OD`, `ON`, `P`, `PN`, `PU`, `SB`, `SP`, `URL`, `VA`, `VC`, `VE`, `VV`, `X` |
84
  | **`parser`** | `ROOT`, `acl`, `advcl:loc`, `advmod`, `advmod:dvp`, `advmod:loc`, `advmod:rcomp`, `amod`, `amod:ordmod`, `appos`, `aux:asp`, `aux:ba`, `aux:modal`, `aux:prtmod`, `auxpass`, `case`, `cc`, `ccomp`, `compound:nn`, `compound:vc`, `conj`, `cop`, `dep`, `det`, `discourse`, `dobj`, `etc`, `mark`, `mark:clf`, `name`, `neg`, `nmod`, `nmod:assmod`, `nmod:poss`, `nmod:prep`, `nmod:range`, `nmod:tmod`, `nmod:topic`, `nsubj`, `nsubj:xsubj`, `nsubjpass`, `nummod`, `parataxis:prnmod`, `punct`, `xcomp` |
85
- | **`senter`** | `I`, `S` |
86
  | **`ner`** | `CARDINAL`, `DATE`, `EVENT`, `FAC`, `GPE`, `LANGUAGE`, `LAW`, `LOC`, `MONEY`, `NORP`, `ORDINAL`, `ORG`, `PERCENT`, `PERSON`, `PRODUCT`, `QUANTITY`, `TIME`, `WORK_OF_ART` |
87
 
88
  </details>
@@ -95,12 +88,12 @@ Chinese pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter,
95
  | `TOKEN_P` | 94.58 |
96
  | `TOKEN_R` | 91.36 |
97
  | `TOKEN_F` | 92.94 |
98
- | `TAG_ACC` | 89.57 |
99
- | `SENTS_P` | 78.18 |
100
- | `SENTS_R` | 73.11 |
101
- | `SENTS_F` | 75.56 |
102
- | `DEP_UAS` | 69.65 |
103
- | `DEP_LAS` | 64.26 |
104
- | `ENTS_P` | 72.25 |
105
- | `ENTS_R` | 65.32 |
106
- | `ENTS_F` | 68.61 |
 
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
+ value: 0.7248870987
18
  - name: NER Recall
19
  type: recall
20
+ value: 0.6526373626
21
  - name: NER F Score
22
  type: f_score
23
+ value: 0.6868675186
24
  - task:
25
+ name: TAG
26
  type: token-classification
27
  metrics:
28
+ - name: TAG (XPOS) Accuracy
29
  type: accuracy
30
+ value: 0.8926983284
31
  - task:
32
+ name: UNLABELED_DEPENDENCIES
33
  type: token-classification
34
  metrics:
35
+ - name: Unlabeled Attachment Score (UAS)
 
 
 
 
 
 
36
  type: f_score
37
+ value: 0.6937939046
38
  - task:
39
+ name: LABELED_DEPENDENCIES
40
  type: token-classification
41
  metrics:
42
+ - name: Labeled Attachment Score (LAS)
43
+ type: f_score
44
+ value: 0.6399631978
45
  - task:
46
+ name: SENTS
47
  type: token-classification
48
  metrics:
49
+ - name: Sentences F-Score
50
+ type: f_score
51
+ value: 0.7540813682
52
  ---
53
  ### Details: https://spacy.io/models/zh#zh_core_web_sm
54
 
 
57
  | Feature | Description |
58
  | --- | --- |
59
  | **Name** | `zh_core_web_sm` |
60
+ | **Version** | `3.3.0` |
61
+ | **spaCy** | `>=3.3.0.dev0,<3.4.0` |
62
  | **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `ner` |
63
  | **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `ner` |
64
  | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
 
70
 
71
  <details>
72
 
73
+ <summary>View label scheme (99 labels for 3 components)</summary>
74
 
75
  | Component | Labels |
76
  | --- | --- |
77
  | **`tagger`** | `AD`, `AS`, `BA`, `CC`, `CD`, `CS`, `DEC`, `DEG`, `DER`, `DEV`, `DT`, `ETC`, `FW`, `IJ`, `INF`, `JJ`, `LB`, `LC`, `M`, `MSP`, `NN`, `NR`, `NT`, `OD`, `ON`, `P`, `PN`, `PU`, `SB`, `SP`, `URL`, `VA`, `VC`, `VE`, `VV`, `X` |
78
  | **`parser`** | `ROOT`, `acl`, `advcl:loc`, `advmod`, `advmod:dvp`, `advmod:loc`, `advmod:rcomp`, `amod`, `amod:ordmod`, `appos`, `aux:asp`, `aux:ba`, `aux:modal`, `aux:prtmod`, `auxpass`, `case`, `cc`, `ccomp`, `compound:nn`, `compound:vc`, `conj`, `cop`, `dep`, `det`, `discourse`, `dobj`, `etc`, `mark`, `mark:clf`, `name`, `neg`, `nmod`, `nmod:assmod`, `nmod:poss`, `nmod:prep`, `nmod:range`, `nmod:tmod`, `nmod:topic`, `nsubj`, `nsubj:xsubj`, `nsubjpass`, `nummod`, `parataxis:prnmod`, `punct`, `xcomp` |
 
79
  | **`ner`** | `CARDINAL`, `DATE`, `EVENT`, `FAC`, `GPE`, `LANGUAGE`, `LAW`, `LOC`, `MONEY`, `NORP`, `ORDINAL`, `ORG`, `PERCENT`, `PERSON`, `PRODUCT`, `QUANTITY`, `TIME`, `WORK_OF_ART` |
80
 
81
  </details>
 
88
  | `TOKEN_P` | 94.58 |
89
  | `TOKEN_R` | 91.36 |
90
  | `TOKEN_F` | 92.94 |
91
+ | `TAG_ACC` | 89.27 |
92
+ | `SENTS_P` | 78.37 |
93
+ | `SENTS_R` | 72.67 |
94
+ | `SENTS_F` | 75.41 |
95
+ | `DEP_UAS` | 69.38 |
96
+ | `DEP_LAS` | 64.00 |
97
+ | `ENTS_P` | 72.49 |
98
+ | `ENTS_R` | 65.26 |
99
+ | `ENTS_F` | 68.69 |
accuracy.json CHANGED
@@ -3,227 +3,227 @@
3
  "token_p": 0.9458325855,
4
  "token_r": 0.9136060443,
5
  "token_f": 0.9294400505,
6
- "tag_acc": 0.8957464158,
7
- "sents_p": 0.7817728729,
8
- "sents_r": 0.7311469952,
9
- "sents_f": 0.7556129032,
10
- "dep_uas": 0.6965379684,
11
- "dep_las": 0.6426392548,
12
  "dep_las_per_type": {
13
  "dep": {
14
- "p": 0.4702473498,
15
- "r": 0.3361624735,
16
- "f": 0.3920575065
17
  },
18
  "case": {
19
- "p": 0.8028549383,
20
- "r": 0.7569107662,
21
- "f": 0.7792061907
22
  },
23
  "nmod:tmod": {
24
- "p": 0.7231788079,
25
  "r": 0.7428571429,
26
- "f": 0.732885906
27
  },
28
  "nummod": {
29
- "p": 0.8233471074,
30
- "r": 0.5309793471,
31
- "f": 0.6456055083
32
  },
33
  "mark:clf": {
34
- "p": 0.9301898347,
35
- "r": 0.5665796345,
36
- "f": 0.7042188224
37
  },
38
  "auxpass": {
39
- "p": 0.8756756757,
40
- "r": 0.8756756757,
41
- "f": 0.8756756757
42
  },
43
  "nsubj": {
44
- "p": 0.771189813,
45
- "r": 0.7141628793,
46
- "f": 0.7415816327
47
  },
48
  "acl": {
49
- "p": 0.6791758646,
50
- "r": 0.5119245702,
51
- "f": 0.5838077166
52
  },
53
  "advmod": {
54
- "p": 0.8065869786,
55
- "r": 0.7189979596,
56
- "f": 0.7602780774
57
  },
58
  "mark": {
59
- "p": 0.7065868263,
60
- "r": 0.6722173532,
61
- "f": 0.6889737256
62
  },
63
  "xcomp": {
64
- "p": 0.7559198543,
65
- "r": 0.6758957655,
66
- "f": 0.7136715391
67
  },
68
  "nmod:assmod": {
69
- "p": 0.7642786398,
70
- "r": 0.7205104264,
71
- "f": 0.7417494393
72
  },
73
  "det": {
74
- "p": 0.8394160584,
75
- "r": 0.6063268893,
76
- "f": 0.7040816327
77
  },
78
  "amod": {
79
- "p": 0.7544338336,
80
- "r": 0.6516103692,
81
- "f": 0.6992623815
82
  },
83
  "nmod:prep": {
84
- "p": 0.7013125222,
85
- "r": 0.5980036298,
86
- "f": 0.6455510204
87
  },
88
  "root": {
89
- "p": 0.7283996995,
90
- "r": 0.6455801565,
91
- "f": 0.6844938664
92
  },
93
  "aux:prtmod": {
94
- "p": 0.890625,
95
- "r": 0.8142857143,
96
- "f": 0.8507462687
97
  },
98
  "compound:nn": {
99
- "p": 0.7243023667,
100
- "r": 0.6939086294,
101
- "f": 0.7087798133
102
  },
103
  "dobj": {
104
- "p": 0.780507386,
105
- "r": 0.7200414753,
106
- "f": 0.7490561677
107
  },
108
  "ccomp": {
109
- "p": 0.6268199234,
110
- "r": 0.6360808709,
111
- "f": 0.6314164415
112
  },
113
  "advmod:rcomp": {
114
- "p": 0.8096774194,
115
- "r": 0.6952908587,
116
- "f": 0.7481371088
117
  },
118
  "nmod:topic": {
119
- "p": 0.3686868687,
120
- "r": 0.237012987,
121
- "f": 0.2885375494
122
  },
123
  "cop": {
124
- "p": 0.7385620915,
125
- "r": 0.5817245817,
126
- "f": 0.6508279338
127
  },
128
  "discourse": {
129
- "p": 0.5540037244,
130
- "r": 0.4909240924,
131
- "f": 0.52055993
132
  },
133
  "neg": {
134
- "p": 0.823880597,
135
- "r": 0.6563614744,
136
- "f": 0.730641959
137
  },
138
  "aux:modal": {
139
- "p": 0.8563772776,
140
- "r": 0.8262668046,
141
- "f": 0.8410526316
142
  },
143
  "nmod": {
144
- "p": 0.7135761589,
145
- "r": 0.5848032564,
146
- "f": 0.6428038777
147
  },
148
  "aux:ba": {
149
- "p": 0.8087431694,
150
- "r": 0.7872340426,
151
- "f": 0.7978436658
152
  },
153
  "advmod:loc": {
154
- "p": 0.58203125,
155
- "r": 0.4421364985,
156
- "f": 0.502529511
157
  },
158
  "aux:asp": {
159
- "p": 0.9053941909,
160
- "r": 0.870015949,
161
- "f": 0.8873525824
162
  },
163
  "conj": {
164
- "p": 0.4784786642,
165
- "r": 0.4875236295,
166
- "f": 0.4829588015
167
  },
168
  "nsubjpass": {
169
- "p": 0.8292682927,
170
- "r": 0.68,
171
- "f": 0.7472527473
172
  },
173
  "compound:vc": {
174
- "p": 0.3876404494,
175
- "r": 0.3575129534,
176
- "f": 0.371967655
177
  },
178
  "advcl:loc": {
179
- "p": 0.5304347826,
180
- "r": 0.4357142857,
181
- "f": 0.4784313725
182
  },
183
  "cc": {
184
- "p": 0.6937618147,
185
- "r": 0.6512866016,
186
- "f": 0.6718535469
187
  },
188
  "advmod:dvp": {
189
- "p": 0.8114754098,
190
- "r": 0.6149068323,
191
- "f": 0.6996466431
192
  },
193
  "appos": {
194
- "p": 0.8778054863,
195
- "r": 0.8091954023,
196
- "f": 0.8421052632
197
- },
198
- "nmod:range": {
199
- "p": 0.6897810219,
200
- "r": 0.6342281879,
201
- "f": 0.6608391608
202
  },
203
  "nmod:poss": {
204
- "p": 0.6989247312,
205
- "r": 0.4814814815,
206
- "f": 0.5701754386
207
  },
208
  "name": {
209
- "p": 0.6391752577,
210
- "r": 0.4592592593,
211
- "f": 0.5344827586
212
  },
213
  "nsubj:xsubj": {
214
  "p": 0.0,
215
  "r": 0.0,
216
  "f": 0.0
217
  },
 
 
 
 
 
218
  "parataxis:prnmod": {
219
- "p": 0.4516129032,
220
- "r": 0.1052631579,
221
- "f": 0.1707317073
222
  },
223
  "amod:ordmod": {
224
- "p": 0.6274509804,
225
- "r": 0.5,
226
- "f": 0.5565217391
227
  },
228
  "erased": {
229
  "p": 0.0,
@@ -231,99 +231,99 @@
231
  "f": 0.0
232
  },
233
  "etc": {
234
- "p": 0.8837209302,
235
  "r": 0.9047619048,
236
- "f": 0.8941176471
237
  }
238
  },
239
- "ents_p": 0.7224990884,
240
- "ents_r": 0.6531868132,
241
- "ents_f": 0.6860968431,
242
  "ents_per_type": {
243
  "DATE": {
244
- "p": 0.75,
245
- "r": 0.7849355798,
246
- "f": 0.7670702179
247
  },
248
  "GPE": {
249
- "p": 0.7579383341,
250
- "r": 0.8049853372,
251
- "f": 0.7807537331
252
  },
253
  "ORDINAL": {
254
- "p": 0.8603351955,
255
- "r": 0.8105263158,
256
- "f": 0.8346883469
257
  },
258
  "FAC": {
259
- "p": 0.4482758621,
260
- "r": 0.2795698925,
261
- "f": 0.3443708609
262
  },
263
  "ORG": {
264
- "p": 0.6875,
265
- "r": 0.602739726,
266
- "f": 0.6423357664
267
  },
268
  "QUANTITY": {
269
- "p": 0.7777777778,
270
- "r": 0.6222222222,
271
- "f": 0.6913580247
272
  },
273
  "PERSON": {
274
- "p": 0.8103932584,
275
- "r": 0.743556701,
276
- "f": 0.7755376344
277
  },
278
  "CARDINAL": {
279
- "p": 0.5814220183,
280
- "r": 0.5110887097,
281
- "f": 0.5439914163
282
  },
283
- "NORP": {
284
- "p": 0.6774193548,
285
- "r": 0.4411764706,
286
- "f": 0.534351145
287
  },
288
  "LOC": {
289
- "p": 0.5319148936,
290
  "r": 0.3360215054,
291
- "f": 0.4118616145
 
 
 
 
 
292
  },
293
  "TIME": {
294
- "p": 0.7438423645,
295
  "r": 0.7330097087,
296
- "f": 0.7383863081
297
- },
298
- "WORK_OF_ART": {
299
- "p": 0.4520547945,
300
- "r": 0.22,
301
- "f": 0.2959641256
302
  },
303
  "MONEY": {
304
- "p": 0.9292035398,
305
- "r": 0.7777777778,
306
- "f": 0.8467741935
307
  },
308
  "PERCENT": {
309
- "p": 0.8395061728,
310
  "r": 0.8192771084,
311
- "f": 0.8292682927
312
  },
313
  "EVENT": {
314
- "p": 0.6170212766,
315
  "r": 0.4264705882,
316
- "f": 0.5043478261
317
  },
318
  "PRODUCT": {
319
- "p": 0.0,
320
- "r": 0.0,
321
- "f": 0.0
322
  },
323
  "LAW": {
324
- "p": 0.3043478261,
325
- "r": 0.1166666667,
326
- "f": 0.1686746988
327
  },
328
  "LANGUAGE": {
329
  "p": 0.5,
@@ -331,5 +331,5 @@
331
  "f": 0.5263157895
332
  }
333
  },
334
- "speed": 6703.9223469178
335
  }
 
3
  "token_p": 0.9458325855,
4
  "token_r": 0.9136060443,
5
  "token_f": 0.9294400505,
6
+ "tag_acc": 0.8926983284,
7
+ "sents_p": 0.7836624776,
8
+ "sents_r": 0.7266522391,
9
+ "sents_f": 0.7540813682,
10
+ "dep_uas": 0.6937939046,
11
+ "dep_las": 0.6399631978,
12
  "dep_las_per_type": {
13
  "dep": {
14
+ "p": 0.4724552573,
15
+ "r": 0.3414165909,
16
+ "f": 0.3963868849
17
  },
18
  "case": {
19
+ "p": 0.8002814739,
20
+ "r": 0.7583656644,
21
+ "f": 0.7787599602
22
  },
23
  "nmod:tmod": {
24
+ "p": 0.734858681,
25
  "r": 0.7428571429,
26
+ "f": 0.7388362652
27
  },
28
  "nummod": {
29
+ "p": 0.8180873181,
30
+ "r": 0.5243171219,
31
+ "f": 0.6390580593
32
  },
33
  "mark:clf": {
34
+ "p": 0.9367710252,
35
+ "r": 0.5691906005,
36
+ "f": 0.7081206497
37
  },
38
  "auxpass": {
39
+ "p": 0.8655913978,
40
+ "r": 0.8702702703,
41
+ "f": 0.8679245283
42
  },
43
  "nsubj": {
44
+ "p": 0.7633487146,
45
+ "r": 0.7112148385,
46
+ "f": 0.7363601679
47
  },
48
  "acl": {
49
+ "p": 0.645030426,
50
+ "r": 0.5291181364,
51
+ "f": 0.5813528336
52
  },
53
  "advmod": {
54
+ "p": 0.8115056637,
55
+ "r": 0.722738608,
56
+ "f": 0.7645542299
57
  },
58
  "mark": {
59
+ "p": 0.7109853843,
60
+ "r": 0.6608238387,
61
+ "f": 0.6849875085
62
  },
63
  "xcomp": {
64
+ "p": 0.7726432532,
65
+ "r": 0.680781759,
66
+ "f": 0.7238095238
67
  },
68
  "nmod:assmod": {
69
+ "p": 0.7476453394,
70
+ "r": 0.7164643635,
71
+ "f": 0.7317228226
72
  },
73
  "det": {
74
+ "p": 0.8333333333,
75
+ "r": 0.6033977739,
76
+ "f": 0.6999660211
77
  },
78
  "amod": {
79
+ "p": 0.7617312073,
80
+ "r": 0.6567164179,
81
+ "f": 0.7053364269
82
  },
83
  "nmod:prep": {
84
+ "p": 0.6905606813,
85
+ "r": 0.5886267393,
86
+ "f": 0.6355323318
87
  },
88
  "root": {
89
+ "p": 0.7277735562,
90
+ "r": 0.6377559514,
91
+ "f": 0.6797977109
92
  },
93
  "aux:prtmod": {
94
+ "p": 0.9058823529,
95
+ "r": 0.825,
96
+ "f": 0.8635514019
97
  },
98
  "compound:nn": {
99
+ "p": 0.7184210526,
100
+ "r": 0.692893401,
101
+ "f": 0.7054263566
102
  },
103
  "dobj": {
104
+ "p": 0.7797667926,
105
+ "r": 0.7033032143,
106
+ "f": 0.7395638629
107
  },
108
  "ccomp": {
109
+ "p": 0.6302021403,
110
+ "r": 0.6181959565,
111
+ "f": 0.624141315
112
  },
113
  "advmod:rcomp": {
114
+ "p": 0.7897897898,
115
+ "r": 0.728531856,
116
+ "f": 0.757925072
117
  },
118
  "nmod:topic": {
119
+ "p": 0.336492891,
120
+ "r": 0.2305194805,
121
+ "f": 0.2736030829
122
  },
123
  "cop": {
124
+ "p": 0.7432321575,
125
+ "r": 0.583011583,
126
+ "f": 0.6534439235
127
  },
128
  "discourse": {
129
+ "p": 0.5467889908,
130
+ "r": 0.4917491749,
131
+ "f": 0.5178105995
132
  },
133
  "neg": {
134
+ "p": 0.8290854573,
135
+ "r": 0.6575505351,
136
+ "f": 0.7334217507
137
  },
138
  "aux:modal": {
139
+ "p": 0.8595927117,
140
+ "r": 0.829369183,
141
+ "f": 0.8442105263
142
  },
143
  "nmod": {
144
+ "p": 0.6822580645,
145
+ "r": 0.5739484396,
146
+ "f": 0.6234340457
147
  },
148
  "aux:ba": {
149
+ "p": 0.777173913,
150
+ "r": 0.7606382979,
151
+ "f": 0.7688172043
152
  },
153
  "advmod:loc": {
154
+ "p": 0.5581395349,
155
+ "r": 0.4272997033,
156
+ "f": 0.4840336134
157
  },
158
  "aux:asp": {
159
+ "p": 0.9087866109,
160
+ "r": 0.8660287081,
161
+ "f": 0.8868926092
162
  },
163
  "conj": {
164
+ "p": 0.4782029317,
165
+ "r": 0.4748582231,
166
+ "f": 0.4765247083
167
  },
168
  "nsubjpass": {
169
+ "p": 0.7857142857,
170
+ "r": 0.66,
171
+ "f": 0.7173913043
172
  },
173
  "compound:vc": {
174
+ "p": 0.3863636364,
175
+ "r": 0.3523316062,
176
+ "f": 0.3685636856
177
  },
178
  "advcl:loc": {
179
+ "p": 0.4453125,
180
+ "r": 0.4071428571,
181
+ "f": 0.4253731343
182
  },
183
  "cc": {
184
+ "p": 0.6863849765,
185
+ "r": 0.6486246673,
186
+ "f": 0.6669708029
187
  },
188
  "advmod:dvp": {
189
+ "p": 0.8487394958,
190
+ "r": 0.6273291925,
191
+ "f": 0.7214285714
192
  },
193
  "appos": {
194
+ "p": 0.8692307692,
195
+ "r": 0.7793103448,
196
+ "f": 0.8218181818
 
 
 
 
 
197
  },
198
  "nmod:poss": {
199
+ "p": 0.6956521739,
200
+ "r": 0.4740740741,
201
+ "f": 0.563876652
202
  },
203
  "name": {
204
+ "p": 0.6213592233,
205
+ "r": 0.4740740741,
206
+ "f": 0.5378151261
207
  },
208
  "nsubj:xsubj": {
209
  "p": 0.0,
210
  "r": 0.0,
211
  "f": 0.0
212
  },
213
+ "nmod:range": {
214
+ "p": 0.7725490196,
215
+ "r": 0.6610738255,
216
+ "f": 0.712477396
217
+ },
218
  "parataxis:prnmod": {
219
+ "p": 0.53125,
220
+ "r": 0.1278195489,
221
+ "f": 0.2060606061
222
  },
223
  "amod:ordmod": {
224
+ "p": 0.606557377,
225
+ "r": 0.578125,
226
+ "f": 0.592
227
  },
228
  "erased": {
229
  "p": 0.0,
 
231
  "f": 0.0
232
  },
233
  "etc": {
234
+ "p": 0.8941176471,
235
  "r": 0.9047619048,
236
+ "f": 0.899408284
237
  }
238
  },
239
+ "ents_p": 0.7248870987,
240
+ "ents_r": 0.6526373626,
241
+ "ents_f": 0.6868675186,
242
  "ents_per_type": {
243
  "DATE": {
244
+ "p": 0.7625830959,
245
+ "r": 0.7958374628,
246
+ "f": 0.7788554801
247
  },
248
  "GPE": {
249
+ "p": 0.7793560606,
250
+ "r": 0.8044965787,
251
+ "f": 0.7917267917
252
  },
253
  "ORDINAL": {
254
+ "p": 0.8693181818,
255
+ "r": 0.8052631579,
256
+ "f": 0.8360655738
257
  },
258
  "FAC": {
259
+ "p": 0.456,
260
+ "r": 0.3064516129,
261
+ "f": 0.3665594855
262
  },
263
  "ORG": {
264
+ "p": 0.6830092984,
265
+ "r": 0.6149162861,
266
+ "f": 0.6471766119
267
  },
268
  "QUANTITY": {
269
+ "p": 0.7735849057,
270
+ "r": 0.6074074074,
271
+ "f": 0.6804979253
272
  },
273
  "PERSON": {
274
+ "p": 0.8024513338,
275
+ "r": 0.7171391753,
276
+ "f": 0.7574004764
277
  },
278
  "CARDINAL": {
279
+ "p": 0.5778032037,
280
+ "r": 0.5090725806,
281
+ "f": 0.5412647374
282
  },
283
+ "WORK_OF_ART": {
284
+ "p": 0.45,
285
+ "r": 0.24,
286
+ "f": 0.3130434783
287
  },
288
  "LOC": {
289
+ "p": 0.5020080321,
290
  "r": 0.3360215054,
291
+ "f": 0.4025764895
292
+ },
293
+ "NORP": {
294
+ "p": 0.66875,
295
+ "r": 0.4495798319,
296
+ "f": 0.5376884422
297
  },
298
  "TIME": {
299
+ "p": 0.7512437811,
300
  "r": 0.7330097087,
301
+ "f": 0.742014742
 
 
 
 
 
302
  },
303
  "MONEY": {
304
+ "p": 0.9224137931,
305
+ "r": 0.7925925926,
306
+ "f": 0.8525896414
307
  },
308
  "PERCENT": {
309
+ "p": 0.85,
310
  "r": 0.8192771084,
311
+ "f": 0.8343558282
312
  },
313
  "EVENT": {
314
+ "p": 0.5742574257,
315
  "r": 0.4264705882,
316
+ "f": 0.4894514768
317
  },
318
  "PRODUCT": {
319
+ "p": 0.25,
320
+ "r": 0.0408163265,
321
+ "f": 0.0701754386
322
  },
323
  "LAW": {
324
+ "p": 0.5,
325
+ "r": 0.1,
326
+ "f": 0.1666666667
327
  },
328
  "LANGUAGE": {
329
  "p": 0.5,
 
331
  "f": 0.5263157895
332
  }
333
  },
334
+ "speed": 6465.8027619703
335
  }
attribute_ruler/patterns CHANGED
Binary files a/attribute_ruler/patterns and b/attribute_ruler/patterns differ
 
config.cfg CHANGED
@@ -51,7 +51,7 @@ nO = null
51
  @architectures = "spacy.MultiHashEmbed.v2"
52
  width = 96
53
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
54
- rows = [5000,2500,2500,2500]
55
  include_static_vectors = false
56
 
57
  [components.ner.model.tok2vec.encode]
@@ -89,8 +89,9 @@ overwrite = false
89
  scorer = {"@scorers":"spacy.senter_scorer.v1"}
90
 
91
  [components.senter.model]
92
- @architectures = "spacy.Tagger.v1"
93
  nO = null
 
94
 
95
  [components.senter.model.tok2vec]
96
  @architectures = "spacy.Tok2Vec.v2"
@@ -111,12 +112,14 @@ maxout_pieces = 2
111
 
112
  [components.tagger]
113
  factory = "tagger"
 
114
  overwrite = false
115
  scorer = {"@scorers":"spacy.tagger_scorer.v1"}
116
 
117
  [components.tagger.model]
118
- @architectures = "spacy.Tagger.v1"
119
  nO = null
 
120
 
121
  [components.tagger.model.tok2vec]
122
  @architectures = "spacy.Tok2VecListener.v1"
@@ -133,7 +136,7 @@ factory = "tok2vec"
133
  @architectures = "spacy.MultiHashEmbed.v2"
134
  width = ${components.tok2vec.model.encode:width}
135
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
136
- rows = [5000,2500,2500,2500]
137
  include_static_vectors = false
138
 
139
  [components.tok2vec.model.encode]
@@ -170,7 +173,7 @@ dropout = 0.1
170
  accumulate_gradient = 1
171
  patience = 5000
172
  max_epochs = 0
173
- max_steps = 0
174
  eval_frequency = 1000
175
  frozen_components = []
176
  before_to_disk = null
 
51
  @architectures = "spacy.MultiHashEmbed.v2"
52
  width = 96
53
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
54
+ rows = [5000,1000,2500,2500]
55
  include_static_vectors = false
56
 
57
  [components.ner.model.tok2vec.encode]
 
89
  scorer = {"@scorers":"spacy.senter_scorer.v1"}
90
 
91
  [components.senter.model]
92
+ @architectures = "spacy.Tagger.v2"
93
  nO = null
94
+ normalize = false
95
 
96
  [components.senter.model.tok2vec]
97
  @architectures = "spacy.Tok2Vec.v2"
 
112
 
113
  [components.tagger]
114
  factory = "tagger"
115
+ neg_prefix = "!"
116
  overwrite = false
117
  scorer = {"@scorers":"spacy.tagger_scorer.v1"}
118
 
119
  [components.tagger.model]
120
+ @architectures = "spacy.Tagger.v2"
121
  nO = null
122
+ normalize = false
123
 
124
  [components.tagger.model.tok2vec]
125
  @architectures = "spacy.Tok2VecListener.v1"
 
136
  @architectures = "spacy.MultiHashEmbed.v2"
137
  width = ${components.tok2vec.model.encode:width}
138
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
139
+ rows = [5000,1000,2500,2500]
140
  include_static_vectors = false
141
 
142
  [components.tok2vec.model.encode]
 
173
  accumulate_gradient = 1
174
  patience = 5000
175
  max_epochs = 0
176
+ max_steps = 100000
177
  eval_frequency = 1000
178
  frozen_components = []
179
  before_to_disk = null
meta.json CHANGED
@@ -1,14 +1,14 @@
1
  {
2
  "lang":"zh",
3
  "name":"core_web_sm",
4
- "version":"3.2.0",
5
  "description":"Chinese pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"MIT",
10
- "spacy_version":">=3.2.0,<3.3.0",
11
- "spacy_git_version":"bb26550e2",
12
  "vectors":{
13
  "width":0,
14
  "vectors":0,
@@ -104,10 +104,6 @@
104
  "punct",
105
  "xcomp"
106
  ],
107
- "senter":[
108
- "I",
109
- "S"
110
- ],
111
  "attribute_ruler":[
112
 
113
  ],
@@ -155,227 +151,227 @@
155
  "token_p":0.9458325855,
156
  "token_r":0.9136060443,
157
  "token_f":0.9294400505,
158
- "tag_acc":0.8957464158,
159
- "sents_p":0.7817728729,
160
- "sents_r":0.7311469952,
161
- "sents_f":0.7556129032,
162
- "dep_uas":0.6965379684,
163
- "dep_las":0.6426392548,
164
  "dep_las_per_type":{
165
  "dep":{
166
- "p":0.4702473498,
167
- "r":0.3361624735,
168
- "f":0.3920575065
169
  },
170
  "case":{
171
- "p":0.8028549383,
172
- "r":0.7569107662,
173
- "f":0.7792061907
174
  },
175
  "nmod:tmod":{
176
- "p":0.7231788079,
177
  "r":0.7428571429,
178
- "f":0.732885906
179
  },
180
  "nummod":{
181
- "p":0.8233471074,
182
- "r":0.5309793471,
183
- "f":0.6456055083
184
  },
185
  "mark:clf":{
186
- "p":0.9301898347,
187
- "r":0.5665796345,
188
- "f":0.7042188224
189
  },
190
  "auxpass":{
191
- "p":0.8756756757,
192
- "r":0.8756756757,
193
- "f":0.8756756757
194
  },
195
  "nsubj":{
196
- "p":0.771189813,
197
- "r":0.7141628793,
198
- "f":0.7415816327
199
  },
200
  "acl":{
201
- "p":0.6791758646,
202
- "r":0.5119245702,
203
- "f":0.5838077166
204
  },
205
  "advmod":{
206
- "p":0.8065869786,
207
- "r":0.7189979596,
208
- "f":0.7602780774
209
  },
210
  "mark":{
211
- "p":0.7065868263,
212
- "r":0.6722173532,
213
- "f":0.6889737256
214
  },
215
  "xcomp":{
216
- "p":0.7559198543,
217
- "r":0.6758957655,
218
- "f":0.7136715391
219
  },
220
  "nmod:assmod":{
221
- "p":0.7642786398,
222
- "r":0.7205104264,
223
- "f":0.7417494393
224
  },
225
  "det":{
226
- "p":0.8394160584,
227
- "r":0.6063268893,
228
- "f":0.7040816327
229
  },
230
  "amod":{
231
- "p":0.7544338336,
232
- "r":0.6516103692,
233
- "f":0.6992623815
234
  },
235
  "nmod:prep":{
236
- "p":0.7013125222,
237
- "r":0.5980036298,
238
- "f":0.6455510204
239
  },
240
  "root":{
241
- "p":0.7283996995,
242
- "r":0.6455801565,
243
- "f":0.6844938664
244
  },
245
  "aux:prtmod":{
246
- "p":0.890625,
247
- "r":0.8142857143,
248
- "f":0.8507462687
249
  },
250
  "compound:nn":{
251
- "p":0.7243023667,
252
- "r":0.6939086294,
253
- "f":0.7087798133
254
  },
255
  "dobj":{
256
- "p":0.780507386,
257
- "r":0.7200414753,
258
- "f":0.7490561677
259
  },
260
  "ccomp":{
261
- "p":0.6268199234,
262
- "r":0.6360808709,
263
- "f":0.6314164415
264
  },
265
  "advmod:rcomp":{
266
- "p":0.8096774194,
267
- "r":0.6952908587,
268
- "f":0.7481371088
269
  },
270
  "nmod:topic":{
271
- "p":0.3686868687,
272
- "r":0.237012987,
273
- "f":0.2885375494
274
  },
275
  "cop":{
276
- "p":0.7385620915,
277
- "r":0.5817245817,
278
- "f":0.6508279338
279
  },
280
  "discourse":{
281
- "p":0.5540037244,
282
- "r":0.4909240924,
283
- "f":0.52055993
284
  },
285
  "neg":{
286
- "p":0.823880597,
287
- "r":0.6563614744,
288
- "f":0.730641959
289
  },
290
  "aux:modal":{
291
- "p":0.8563772776,
292
- "r":0.8262668046,
293
- "f":0.8410526316
294
  },
295
  "nmod":{
296
- "p":0.7135761589,
297
- "r":0.5848032564,
298
- "f":0.6428038777
299
  },
300
  "aux:ba":{
301
- "p":0.8087431694,
302
- "r":0.7872340426,
303
- "f":0.7978436658
304
  },
305
  "advmod:loc":{
306
- "p":0.58203125,
307
- "r":0.4421364985,
308
- "f":0.502529511
309
  },
310
  "aux:asp":{
311
- "p":0.9053941909,
312
- "r":0.870015949,
313
- "f":0.8873525824
314
  },
315
  "conj":{
316
- "p":0.4784786642,
317
- "r":0.4875236295,
318
- "f":0.4829588015
319
  },
320
  "nsubjpass":{
321
- "p":0.8292682927,
322
- "r":0.68,
323
- "f":0.7472527473
324
  },
325
  "compound:vc":{
326
- "p":0.3876404494,
327
- "r":0.3575129534,
328
- "f":0.371967655
329
  },
330
  "advcl:loc":{
331
- "p":0.5304347826,
332
- "r":0.4357142857,
333
- "f":0.4784313725
334
  },
335
  "cc":{
336
- "p":0.6937618147,
337
- "r":0.6512866016,
338
- "f":0.6718535469
339
  },
340
  "advmod:dvp":{
341
- "p":0.8114754098,
342
- "r":0.6149068323,
343
- "f":0.6996466431
344
  },
345
  "appos":{
346
- "p":0.8778054863,
347
- "r":0.8091954023,
348
- "f":0.8421052632
349
- },
350
- "nmod:range":{
351
- "p":0.6897810219,
352
- "r":0.6342281879,
353
- "f":0.6608391608
354
  },
355
  "nmod:poss":{
356
- "p":0.6989247312,
357
- "r":0.4814814815,
358
- "f":0.5701754386
359
  },
360
  "name":{
361
- "p":0.6391752577,
362
- "r":0.4592592593,
363
- "f":0.5344827586
364
  },
365
  "nsubj:xsubj":{
366
  "p":0.0,
367
  "r":0.0,
368
  "f":0.0
369
  },
 
 
 
 
 
370
  "parataxis:prnmod":{
371
- "p":0.4516129032,
372
- "r":0.1052631579,
373
- "f":0.1707317073
374
  },
375
  "amod:ordmod":{
376
- "p":0.6274509804,
377
- "r":0.5,
378
- "f":0.5565217391
379
  },
380
  "erased":{
381
  "p":0.0,
@@ -383,99 +379,99 @@
383
  "f":0.0
384
  },
385
  "etc":{
386
- "p":0.8837209302,
387
  "r":0.9047619048,
388
- "f":0.8941176471
389
  }
390
  },
391
- "ents_p":0.7224990884,
392
- "ents_r":0.6531868132,
393
- "ents_f":0.6860968431,
394
  "ents_per_type":{
395
  "DATE":{
396
- "p":0.75,
397
- "r":0.7849355798,
398
- "f":0.7670702179
399
  },
400
  "GPE":{
401
- "p":0.7579383341,
402
- "r":0.8049853372,
403
- "f":0.7807537331
404
  },
405
  "ORDINAL":{
406
- "p":0.8603351955,
407
- "r":0.8105263158,
408
- "f":0.8346883469
409
  },
410
  "FAC":{
411
- "p":0.4482758621,
412
- "r":0.2795698925,
413
- "f":0.3443708609
414
  },
415
  "ORG":{
416
- "p":0.6875,
417
- "r":0.602739726,
418
- "f":0.6423357664
419
  },
420
  "QUANTITY":{
421
- "p":0.7777777778,
422
- "r":0.6222222222,
423
- "f":0.6913580247
424
  },
425
  "PERSON":{
426
- "p":0.8103932584,
427
- "r":0.743556701,
428
- "f":0.7755376344
429
  },
430
  "CARDINAL":{
431
- "p":0.5814220183,
432
- "r":0.5110887097,
433
- "f":0.5439914163
434
  },
435
- "NORP":{
436
- "p":0.6774193548,
437
- "r":0.4411764706,
438
- "f":0.534351145
439
  },
440
  "LOC":{
441
- "p":0.5319148936,
442
  "r":0.3360215054,
443
- "f":0.4118616145
 
 
 
 
 
444
  },
445
  "TIME":{
446
- "p":0.7438423645,
447
  "r":0.7330097087,
448
- "f":0.7383863081
449
- },
450
- "WORK_OF_ART":{
451
- "p":0.4520547945,
452
- "r":0.22,
453
- "f":0.2959641256
454
  },
455
  "MONEY":{
456
- "p":0.9292035398,
457
- "r":0.7777777778,
458
- "f":0.8467741935
459
  },
460
  "PERCENT":{
461
- "p":0.8395061728,
462
  "r":0.8192771084,
463
- "f":0.8292682927
464
  },
465
  "EVENT":{
466
- "p":0.6170212766,
467
  "r":0.4264705882,
468
- "f":0.5043478261
469
  },
470
  "PRODUCT":{
471
- "p":0.0,
472
- "r":0.0,
473
- "f":0.0
474
  },
475
  "LAW":{
476
- "p":0.3043478261,
477
- "r":0.1166666667,
478
- "f":0.1686746988
479
  },
480
  "LANGUAGE":{
481
  "p":0.5,
@@ -483,7 +479,7 @@
483
  "f":0.5263157895
484
  }
485
  },
486
- "speed":6703.9223469178
487
  },
488
  "sources":[
489
  {
 
1
  {
2
  "lang":"zh",
3
  "name":"core_web_sm",
4
+ "version":"3.3.0",
5
  "description":"Chinese pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"MIT",
10
+ "spacy_version":">=3.3.0.dev0,<3.4.0",
11
+ "spacy_git_version":"849bef2de",
12
  "vectors":{
13
  "width":0,
14
  "vectors":0,
 
104
  "punct",
105
  "xcomp"
106
  ],
 
 
 
 
107
  "attribute_ruler":[
108
 
109
  ],
 
151
  "token_p":0.9458325855,
152
  "token_r":0.9136060443,
153
  "token_f":0.9294400505,
154
+ "tag_acc":0.8926983284,
155
+ "sents_p":0.7836624776,
156
+ "sents_r":0.7266522391,
157
+ "sents_f":0.7540813682,
158
+ "dep_uas":0.6937939046,
159
+ "dep_las":0.6399631978,
160
  "dep_las_per_type":{
161
  "dep":{
162
+ "p":0.4724552573,
163
+ "r":0.3414165909,
164
+ "f":0.3963868849
165
  },
166
  "case":{
167
+ "p":0.8002814739,
168
+ "r":0.7583656644,
169
+ "f":0.7787599602
170
  },
171
  "nmod:tmod":{
172
+ "p":0.734858681,
173
  "r":0.7428571429,
174
+ "f":0.7388362652
175
  },
176
  "nummod":{
177
+ "p":0.8180873181,
178
+ "r":0.5243171219,
179
+ "f":0.6390580593
180
  },
181
  "mark:clf":{
182
+ "p":0.9367710252,
183
+ "r":0.5691906005,
184
+ "f":0.7081206497
185
  },
186
  "auxpass":{
187
+ "p":0.8655913978,
188
+ "r":0.8702702703,
189
+ "f":0.8679245283
190
  },
191
  "nsubj":{
192
+ "p":0.7633487146,
193
+ "r":0.7112148385,
194
+ "f":0.7363601679
195
  },
196
  "acl":{
197
+ "p":0.645030426,
198
+ "r":0.5291181364,
199
+ "f":0.5813528336
200
  },
201
  "advmod":{
202
+ "p":0.8115056637,
203
+ "r":0.722738608,
204
+ "f":0.7645542299
205
  },
206
  "mark":{
207
+ "p":0.7109853843,
208
+ "r":0.6608238387,
209
+ "f":0.6849875085
210
  },
211
  "xcomp":{
212
+ "p":0.7726432532,
213
+ "r":0.680781759,
214
+ "f":0.7238095238
215
  },
216
  "nmod:assmod":{
217
+ "p":0.7476453394,
218
+ "r":0.7164643635,
219
+ "f":0.7317228226
220
  },
221
  "det":{
222
+ "p":0.8333333333,
223
+ "r":0.6033977739,
224
+ "f":0.6999660211
225
  },
226
  "amod":{
227
+ "p":0.7617312073,
228
+ "r":0.6567164179,
229
+ "f":0.7053364269
230
  },
231
  "nmod:prep":{
232
+ "p":0.6905606813,
233
+ "r":0.5886267393,
234
+ "f":0.6355323318
235
  },
236
  "root":{
237
+ "p":0.7277735562,
238
+ "r":0.6377559514,
239
+ "f":0.6797977109
240
  },
241
  "aux:prtmod":{
242
+ "p":0.9058823529,
243
+ "r":0.825,
244
+ "f":0.8635514019
245
  },
246
  "compound:nn":{
247
+ "p":0.7184210526,
248
+ "r":0.692893401,
249
+ "f":0.7054263566
250
  },
251
  "dobj":{
252
+ "p":0.7797667926,
253
+ "r":0.7033032143,
254
+ "f":0.7395638629
255
  },
256
  "ccomp":{
257
+ "p":0.6302021403,
258
+ "r":0.6181959565,
259
+ "f":0.624141315
260
  },
261
  "advmod:rcomp":{
262
+ "p":0.7897897898,
263
+ "r":0.728531856,
264
+ "f":0.757925072
265
  },
266
  "nmod:topic":{
267
+ "p":0.336492891,
268
+ "r":0.2305194805,
269
+ "f":0.2736030829
270
  },
271
  "cop":{
272
+ "p":0.7432321575,
273
+ "r":0.583011583,
274
+ "f":0.6534439235
275
  },
276
  "discourse":{
277
+ "p":0.5467889908,
278
+ "r":0.4917491749,
279
+ "f":0.5178105995
280
  },
281
  "neg":{
282
+ "p":0.8290854573,
283
+ "r":0.6575505351,
284
+ "f":0.7334217507
285
  },
286
  "aux:modal":{
287
+ "p":0.8595927117,
288
+ "r":0.829369183,
289
+ "f":0.8442105263
290
  },
291
  "nmod":{
292
+ "p":0.6822580645,
293
+ "r":0.5739484396,
294
+ "f":0.6234340457
295
  },
296
  "aux:ba":{
297
+ "p":0.777173913,
298
+ "r":0.7606382979,
299
+ "f":0.7688172043
300
  },
301
  "advmod:loc":{
302
+ "p":0.5581395349,
303
+ "r":0.4272997033,
304
+ "f":0.4840336134
305
  },
306
  "aux:asp":{
307
+ "p":0.9087866109,
308
+ "r":0.8660287081,
309
+ "f":0.8868926092
310
  },
311
  "conj":{
312
+ "p":0.4782029317,
313
+ "r":0.4748582231,
314
+ "f":0.4765247083
315
  },
316
  "nsubjpass":{
317
+ "p":0.7857142857,
318
+ "r":0.66,
319
+ "f":0.7173913043
320
  },
321
  "compound:vc":{
322
+ "p":0.3863636364,
323
+ "r":0.3523316062,
324
+ "f":0.3685636856
325
  },
326
  "advcl:loc":{
327
+ "p":0.4453125,
328
+ "r":0.4071428571,
329
+ "f":0.4253731343
330
  },
331
  "cc":{
332
+ "p":0.6863849765,
333
+ "r":0.6486246673,
334
+ "f":0.6669708029
335
  },
336
  "advmod:dvp":{
337
+ "p":0.8487394958,
338
+ "r":0.6273291925,
339
+ "f":0.7214285714
340
  },
341
  "appos":{
342
+ "p":0.8692307692,
343
+ "r":0.7793103448,
344
+ "f":0.8218181818
 
 
 
 
 
345
  },
346
  "nmod:poss":{
347
+ "p":0.6956521739,
348
+ "r":0.4740740741,
349
+ "f":0.563876652
350
  },
351
  "name":{
352
+ "p":0.6213592233,
353
+ "r":0.4740740741,
354
+ "f":0.5378151261
355
  },
356
  "nsubj:xsubj":{
357
  "p":0.0,
358
  "r":0.0,
359
  "f":0.0
360
  },
361
+ "nmod:range":{
362
+ "p":0.7725490196,
363
+ "r":0.6610738255,
364
+ "f":0.712477396
365
+ },
366
  "parataxis:prnmod":{
367
+ "p":0.53125,
368
+ "r":0.1278195489,
369
+ "f":0.2060606061
370
  },
371
  "amod:ordmod":{
372
+ "p":0.606557377,
373
+ "r":0.578125,
374
+ "f":0.592
375
  },
376
  "erased":{
377
  "p":0.0,
 
379
  "f":0.0
380
  },
381
  "etc":{
382
+ "p":0.8941176471,
383
  "r":0.9047619048,
384
+ "f":0.899408284
385
  }
386
  },
387
+ "ents_p":0.7248870987,
388
+ "ents_r":0.6526373626,
389
+ "ents_f":0.6868675186,
390
  "ents_per_type":{
391
  "DATE":{
392
+ "p":0.7625830959,
393
+ "r":0.7958374628,
394
+ "f":0.7788554801
395
  },
396
  "GPE":{
397
+ "p":0.7793560606,
398
+ "r":0.8044965787,
399
+ "f":0.7917267917
400
  },
401
  "ORDINAL":{
402
+ "p":0.8693181818,
403
+ "r":0.8052631579,
404
+ "f":0.8360655738
405
  },
406
  "FAC":{
407
+ "p":0.456,
408
+ "r":0.3064516129,
409
+ "f":0.3665594855
410
  },
411
  "ORG":{
412
+ "p":0.6830092984,
413
+ "r":0.6149162861,
414
+ "f":0.6471766119
415
  },
416
  "QUANTITY":{
417
+ "p":0.7735849057,
418
+ "r":0.6074074074,
419
+ "f":0.6804979253
420
  },
421
  "PERSON":{
422
+ "p":0.8024513338,
423
+ "r":0.7171391753,
424
+ "f":0.7574004764
425
  },
426
  "CARDINAL":{
427
+ "p":0.5778032037,
428
+ "r":0.5090725806,
429
+ "f":0.5412647374
430
  },
431
+ "WORK_OF_ART":{
432
+ "p":0.45,
433
+ "r":0.24,
434
+ "f":0.3130434783
435
  },
436
  "LOC":{
437
+ "p":0.5020080321,
438
  "r":0.3360215054,
439
+ "f":0.4025764895
440
+ },
441
+ "NORP":{
442
+ "p":0.66875,
443
+ "r":0.4495798319,
444
+ "f":0.5376884422
445
  },
446
  "TIME":{
447
+ "p":0.7512437811,
448
  "r":0.7330097087,
449
+ "f":0.742014742
 
 
 
 
 
450
  },
451
  "MONEY":{
452
+ "p":0.9224137931,
453
+ "r":0.7925925926,
454
+ "f":0.8525896414
455
  },
456
  "PERCENT":{
457
+ "p":0.85,
458
  "r":0.8192771084,
459
+ "f":0.8343558282
460
  },
461
  "EVENT":{
462
+ "p":0.5742574257,
463
  "r":0.4264705882,
464
+ "f":0.4894514768
465
  },
466
  "PRODUCT":{
467
+ "p":0.25,
468
+ "r":0.0408163265,
469
+ "f":0.0701754386
470
  },
471
  "LAW":{
472
+ "p":0.5,
473
+ "r":0.1,
474
+ "f":0.1666666667
475
  },
476
  "LANGUAGE":{
477
  "p":0.5,
 
479
  "f":0.5263157895
480
  }
481
  },
482
+ "speed":6465.8027619703
483
  },
484
  "sources":[
485
  {
ner/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:652f95d2d1ea30c3e684b3b31ac97de4d3e011ac4fc4e84bdecc5c97921d4e83
3
- size 6730601
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1937cf1b18662e3faa59fc7b9e4baf1ba1d5dc5602ad52842e47c0cd001fbdc6
3
+ size 6154601
parser/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f3b5bbce1c7f668252591b4bf557e5c5d7c121affb5dfd60aa98e4d3611abacf
3
  size 308728
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9eefad84f9bfd4048209b5a401df359d753859afae1ecd76bf0afc22932dcd8b
3
  size 308728
parser/moves CHANGED
@@ -1 +1 @@
1
- ��moves��{"0":{"":406716},"1":{"":267231},"2":{"advmod":56960,"nsubj":53520,"compound:nn":43919,"dep":40111,"punct":36035,"case":23986,"nmod:assmod":21599,"nmod:prep":20098,"amod":16922,"acl":11979,"conj":10687,"cop":7238,"det":7210,"nummod":6994,"cc":6235,"aux:modal":5566,"nmod:tmod":5335,"nmod":4915,"neg":4363,"xcomp":3881,"appos":2955,"nmod:topic":2410,"discourse":2163,"advmod:loc":1591,"aux:prtmod":1539,"aux:ba":1311,"auxpass":1220,"advmod:dvp":1142,"advcl:loc":1046,"name":1032,"compound:vc":830,"nmod:poss":560,"amod:ordmod":511,"dobj":406,"nsubjpass":263,"nsubj:xsubj||ccomp":62,"parataxis:prnmod":34,"nsubj:xsubj":32},"3":{"punct":74006,"dobj":45383,"conj":30040,"case":30024,"dep":18660,"ccomp":17216,"mark":16600,"mark:clf":11551,"aux:asp":7896,"discourse":3998,"advmod:rcomp":2387,"nmod:range":1885,"cc":1675,"nmod:prep":1595,"advmod":1116,"etc":941,"compound:vc":790,"parataxis:prnmod":693,"advmod:loc":522,"neg":69,"advcl:loc":39,"acl":39},"4":{"ROOT":34525}}�cfg��neg_key�
 
1
+ ��moves��{"0":{"":436297},"1":{"":282750},"2":{"advmod":61142,"nsubj":55539,"compound:nn":45994,"dep":43937,"punct":36396,"case":24751,"nmod:assmod":22308,"nmod:prep":21037,"amod":18609,"acl":12438,"conj":10993,"det":10371,"nummod":9922,"cop":9515,"cc":6289,"aux:modal":6003,"neg":5955,"nmod:tmod":5338,"nmod":5049,"xcomp":4333,"appos":2988,"nmod:topic":2532,"discourse":2283,"advmod:loc":1902,"aux:prtmod":1724,"aux:ba":1323,"auxpass":1240,"advmod:dvp":1193,"name":1117,"advcl:loc":1072,"compound:vc":834,"nmod:poss":657,"amod:ordmod":601,"dobj":441,"nsubjpass":276,"nsubj:xsubj||ccomp":64,"parataxis:prnmod":36,"nsubj:xsubj":32},"3":{"punct":74587,"dobj":46958,"conj":31352,"case":31222,"dep":20953,"mark:clf":18377,"ccomp":17748,"mark":16793,"aux:asp":8130,"discourse":4187,"advmod:rcomp":2519,"nmod:range":2021,"cc":1715,"nmod:prep":1690,"advmod":1162,"etc":943,"compound:vc":828,"parataxis:prnmod":724,"advmod:loc":571,"neg":70,"acl":43,"advcl:loc":42},"4":{"ROOT":36097}}�cfg��neg_key�
senter/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8ed8b52593e8404168f51b59c7a39e7899657f57599ca4a6f7fed66b08cb887f
3
- size 190395
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:33967ce4b474fcf8d22611caeb57c33cc7a2f4a10d73ed91a2252f5ae2064ce8
3
+ size 190447
tagger/cfg CHANGED
@@ -37,5 +37,6 @@
37
  "VV",
38
  "X"
39
  ],
 
40
  "overwrite":false
41
  }
 
37
  "VV",
38
  "X"
39
  ],
40
+ "neg_prefix":"!",
41
  "overwrite":false
42
  }
tagger/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:359a1ebf77180c2f81edee2f49fcc949ad1f91a943656147b51ee93caf4e4021
3
- size 14345
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:72bbfc75315c8ce0abcc0726656b860957e3355275fcb6b9c073c5ba8e242a49
3
+ size 14397
tok2vec/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7747c3c866decac9d95922554e9edf1084d8dd3d22a9c6e207a453183ebdc551
3
- size 6585091
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:faab328dede50366999622b9a79a843f2dab5567752b256bd673152795be3ccd
3
+ size 6009091
vocab/strings.json CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1ac3189551d59dc58dc9f4e3c525da1ec4ad3890362a1efd3ad904d2c698d077
3
- size 1217934
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:99efd18f8a9c6fd07ece91efd83ba997cfb7b755cad66dace7d5856aea0c0d78
3
+ size 1218820
zh_core_web_sm-any-py3-none-any.whl CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0629b5fe5fc8979fa895be4956363e7721e040908b0dd4c9ed469ad7309dd5cf
3
- size 49466491
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ea854aad175da45e65024136316fbffb9583e49133d1ab042107217d5f809998
3
+ size 48395517