osanseviero HF staff commited on
Commit
ce93cd3
1 Parent(s): 322de72

Update spaCy pipeline

Browse files
.gitattributes CHANGED
@@ -14,3 +14,7 @@
14
  *.pb filter=lfs diff=lfs merge=lfs -text
15
  *.pt filter=lfs diff=lfs merge=lfs -text
16
  *.pth filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
14
  *.pb filter=lfs diff=lfs merge=lfs -text
15
  *.pt filter=lfs diff=lfs merge=lfs -text
16
  *.pth filter=lfs diff=lfs merge=lfs -text
17
+ *.whl filter=lfs diff=lfs merge=lfs -text
18
+ *.npz filter=lfs diff=lfs merge=lfs -text
19
+ *strings.json filter=lfs diff=lfs merge=lfs -text
20
+ vectors filter=lfs diff=lfs merge=lfs -text
LICENSE ADDED
@@ -0,0 +1,428 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Attribution-ShareAlike 4.0 International
2
+
3
+ =======================================================================
4
+
5
+ Creative Commons Corporation ("Creative Commons") is not a law firm and
6
+ does not provide legal services or legal advice. Distribution of
7
+ Creative Commons public licenses does not create a lawyer-client or
8
+ other relationship. Creative Commons makes its licenses and related
9
+ information available on an "as-is" basis. Creative Commons gives no
10
+ warranties regarding its licenses, any material licensed under their
11
+ terms and conditions, or any related information. Creative Commons
12
+ disclaims all liability for damages resulting from their use to the
13
+ fullest extent possible.
14
+
15
+ Using Creative Commons Public Licenses
16
+
17
+ Creative Commons public licenses provide a standard set of terms and
18
+ conditions that creators and other rights holders may use to share
19
+ original works of authorship and other material subject to copyright
20
+ and certain other rights specified in the public license below. The
21
+ following considerations are for informational purposes only, are not
22
+ exhaustive, and do not form part of our licenses.
23
+
24
+ Considerations for licensors: Our public licenses are
25
+ intended for use by those authorized to give the public
26
+ permission to use material in ways otherwise restricted by
27
+ copyright and certain other rights. Our licenses are
28
+ irrevocable. Licensors should read and understand the terms
29
+ and conditions of the license they choose before applying it.
30
+ Licensors should also secure all rights necessary before
31
+ applying our licenses so that the public can reuse the
32
+ material as expected. Licensors should clearly mark any
33
+ material not subject to the license. This includes other CC-
34
+ licensed material, or material used under an exception or
35
+ limitation to copyright. More considerations for licensors:
36
+ wiki.creativecommons.org/Considerations_for_licensors
37
+
38
+ Considerations for the public: By using one of our public
39
+ licenses, a licensor grants the public permission to use the
40
+ licensed material under specified terms and conditions. If
41
+ the licensor's permission is not necessary for any reason--for
42
+ example, because of any applicable exception or limitation to
43
+ copyright--then that use is not regulated by the license. Our
44
+ licenses grant only permissions under copyright and certain
45
+ other rights that a licensor has authority to grant. Use of
46
+ the licensed material may still be restricted for other
47
+ reasons, including because others have copyright or other
48
+ rights in the material. A licensor may make special requests,
49
+ such as asking that all changes be marked or described.
50
+ Although not required by our licenses, you are encouraged to
51
+ respect those requests where reasonable. More considerations
52
+ for the public:
53
+ wiki.creativecommons.org/Considerations_for_licensees
54
+
55
+ =======================================================================
56
+
57
+ Creative Commons Attribution-ShareAlike 4.0 International Public
58
+ License
59
+
60
+ By exercising the Licensed Rights (defined below), You accept and agree
61
+ to be bound by the terms and conditions of this Creative Commons
62
+ Attribution-ShareAlike 4.0 International Public License ("Public
63
+ License"). To the extent this Public License may be interpreted as a
64
+ contract, You are granted the Licensed Rights in consideration of Your
65
+ acceptance of these terms and conditions, and the Licensor grants You
66
+ such rights in consideration of benefits the Licensor receives from
67
+ making the Licensed Material available under these terms and
68
+ conditions.
69
+
70
+
71
+ Section 1 -- Definitions.
72
+
73
+ a. Adapted Material means material subject to Copyright and Similar
74
+ Rights that is derived from or based upon the Licensed Material
75
+ and in which the Licensed Material is translated, altered,
76
+ arranged, transformed, or otherwise modified in a manner requiring
77
+ permission under the Copyright and Similar Rights held by the
78
+ Licensor. For purposes of this Public License, where the Licensed
79
+ Material is a musical work, performance, or sound recording,
80
+ Adapted Material is always produced where the Licensed Material is
81
+ synched in timed relation with a moving image.
82
+
83
+ b. Adapter's License means the license You apply to Your Copyright
84
+ and Similar Rights in Your contributions to Adapted Material in
85
+ accordance with the terms and conditions of this Public License.
86
+
87
+ c. BY-SA Compatible License means a license listed at
88
+ creativecommons.org/compatiblelicenses, approved by Creative
89
+ Commons as essentially the equivalent of this Public License.
90
+
91
+ d. Copyright and Similar Rights means copyright and/or similar rights
92
+ closely related to copyright including, without limitation,
93
+ performance, broadcast, sound recording, and Sui Generis Database
94
+ Rights, without regard to how the rights are labeled or
95
+ categorized. For purposes of this Public License, the rights
96
+ specified in Section 2(b)(1)-(2) are not Copyright and Similar
97
+ Rights.
98
+
99
+ e. Effective Technological Measures means those measures that, in the
100
+ absence of proper authority, may not be circumvented under laws
101
+ fulfilling obligations under Article 11 of the WIPO Copyright
102
+ Treaty adopted on December 20, 1996, and/or similar international
103
+ agreements.
104
+
105
+ f. Exceptions and Limitations means fair use, fair dealing, and/or
106
+ any other exception or limitation to Copyright and Similar Rights
107
+ that applies to Your use of the Licensed Material.
108
+
109
+ g. License Elements means the license attributes listed in the name
110
+ of a Creative Commons Public License. The License Elements of this
111
+ Public License are Attribution and ShareAlike.
112
+
113
+ h. Licensed Material means the artistic or literary work, database,
114
+ or other material to which the Licensor applied this Public
115
+ License.
116
+
117
+ i. Licensed Rights means the rights granted to You subject to the
118
+ terms and conditions of this Public License, which are limited to
119
+ all Copyright and Similar Rights that apply to Your use of the
120
+ Licensed Material and that the Licensor has authority to license.
121
+
122
+ j. Licensor means the individual(s) or entity(ies) granting rights
123
+ under this Public License.
124
+
125
+ k. Share means to provide material to the public by any means or
126
+ process that requires permission under the Licensed Rights, such
127
+ as reproduction, public display, public performance, distribution,
128
+ dissemination, communication, or importation, and to make material
129
+ available to the public including in ways that members of the
130
+ public may access the material from a place and at a time
131
+ individually chosen by them.
132
+
133
+ l. Sui Generis Database Rights means rights other than copyright
134
+ resulting from Directive 96/9/EC of the European Parliament and of
135
+ the Council of 11 March 1996 on the legal protection of databases,
136
+ as amended and/or succeeded, as well as other essentially
137
+ equivalent rights anywhere in the world.
138
+
139
+ m. You means the individual or entity exercising the Licensed Rights
140
+ under this Public License. Your has a corresponding meaning.
141
+
142
+
143
+ Section 2 -- Scope.
144
+
145
+ a. License grant.
146
+
147
+ 1. Subject to the terms and conditions of this Public License,
148
+ the Licensor hereby grants You a worldwide, royalty-free,
149
+ non-sublicensable, non-exclusive, irrevocable license to
150
+ exercise the Licensed Rights in the Licensed Material to:
151
+
152
+ a. reproduce and Share the Licensed Material, in whole or
153
+ in part; and
154
+
155
+ b. produce, reproduce, and Share Adapted Material.
156
+
157
+ 2. Exceptions and Limitations. For the avoidance of doubt, where
158
+ Exceptions and Limitations apply to Your use, this Public
159
+ License does not apply, and You do not need to comply with
160
+ its terms and conditions.
161
+
162
+ 3. Term. The term of this Public License is specified in Section
163
+ 6(a).
164
+
165
+ 4. Media and formats; technical modifications allowed. The
166
+ Licensor authorizes You to exercise the Licensed Rights in
167
+ all media and formats whether now known or hereafter created,
168
+ and to make technical modifications necessary to do so. The
169
+ Licensor waives and/or agrees not to assert any right or
170
+ authority to forbid You from making technical modifications
171
+ necessary to exercise the Licensed Rights, including
172
+ technical modifications necessary to circumvent Effective
173
+ Technological Measures. For purposes of this Public License,
174
+ simply making modifications authorized by this Section 2(a)
175
+ (4) never produces Adapted Material.
176
+
177
+ 5. Downstream recipients.
178
+
179
+ a. Offer from the Licensor -- Licensed Material. Every
180
+ recipient of the Licensed Material automatically
181
+ receives an offer from the Licensor to exercise the
182
+ Licensed Rights under the terms and conditions of this
183
+ Public License.
184
+
185
+ b. Additional offer from the Licensor -- Adapted Material.
186
+ Every recipient of Adapted Material from You
187
+ automatically receives an offer from the Licensor to
188
+ exercise the Licensed Rights in the Adapted Material
189
+ under the conditions of the Adapter's License You apply.
190
+
191
+ c. No downstream restrictions. You may not offer or impose
192
+ any additional or different terms or conditions on, or
193
+ apply any Effective Technological Measures to, the
194
+ Licensed Material if doing so restricts exercise of the
195
+ Licensed Rights by any recipient of the Licensed
196
+ Material.
197
+
198
+ 6. No endorsement. Nothing in this Public License constitutes or
199
+ may be construed as permission to assert or imply that You
200
+ are, or that Your use of the Licensed Material is, connected
201
+ with, or sponsored, endorsed, or granted official status by,
202
+ the Licensor or others designated to receive attribution as
203
+ provided in Section 3(a)(1)(A)(i).
204
+
205
+ b. Other rights.
206
+
207
+ 1. Moral rights, such as the right of integrity, are not
208
+ licensed under this Public License, nor are publicity,
209
+ privacy, and/or other similar personality rights; however, to
210
+ the extent possible, the Licensor waives and/or agrees not to
211
+ assert any such rights held by the Licensor to the limited
212
+ extent necessary to allow You to exercise the Licensed
213
+ Rights, but not otherwise.
214
+
215
+ 2. Patent and trademark rights are not licensed under this
216
+ Public License.
217
+
218
+ 3. To the extent possible, the Licensor waives any right to
219
+ collect royalties from You for the exercise of the Licensed
220
+ Rights, whether directly or through a collecting society
221
+ under any voluntary or waivable statutory or compulsory
222
+ licensing scheme. In all other cases the Licensor expressly
223
+ reserves any right to collect such royalties.
224
+
225
+
226
+ Section 3 -- License Conditions.
227
+
228
+ Your exercise of the Licensed Rights is expressly made subject to the
229
+ following conditions.
230
+
231
+ a. Attribution.
232
+
233
+ 1. If You Share the Licensed Material (including in modified
234
+ form), You must:
235
+
236
+ a. retain the following if it is supplied by the Licensor
237
+ with the Licensed Material:
238
+
239
+ i. identification of the creator(s) of the Licensed
240
+ Material and any others designated to receive
241
+ attribution, in any reasonable manner requested by
242
+ the Licensor (including by pseudonym if
243
+ designated);
244
+
245
+ ii. a copyright notice;
246
+
247
+ iii. a notice that refers to this Public License;
248
+
249
+ iv. a notice that refers to the disclaimer of
250
+ warranties;
251
+
252
+ v. a URI or hyperlink to the Licensed Material to the
253
+ extent reasonably practicable;
254
+
255
+ b. indicate if You modified the Licensed Material and
256
+ retain an indication of any previous modifications; and
257
+
258
+ c. indicate the Licensed Material is licensed under this
259
+ Public License, and include the text of, or the URI or
260
+ hyperlink to, this Public License.
261
+
262
+ 2. You may satisfy the conditions in Section 3(a)(1) in any
263
+ reasonable manner based on the medium, means, and context in
264
+ which You Share the Licensed Material. For example, it may be
265
+ reasonable to satisfy the conditions by providing a URI or
266
+ hyperlink to a resource that includes the required
267
+ information.
268
+
269
+ 3. If requested by the Licensor, You must remove any of the
270
+ information required by Section 3(a)(1)(A) to the extent
271
+ reasonably practicable.
272
+
273
+ b. ShareAlike.
274
+
275
+ In addition to the conditions in Section 3(a), if You Share
276
+ Adapted Material You produce, the following conditions also apply.
277
+
278
+ 1. The Adapter's License You apply must be a Creative Commons
279
+ license with the same License Elements, this version or
280
+ later, or a BY-SA Compatible License.
281
+
282
+ 2. You must include the text of, or the URI or hyperlink to, the
283
+ Adapter's License You apply. You may satisfy this condition
284
+ in any reasonable manner based on the medium, means, and
285
+ context in which You Share Adapted Material.
286
+
287
+ 3. You may not offer or impose any additional or different terms
288
+ or conditions on, or apply any Effective Technological
289
+ Measures to, Adapted Material that restrict exercise of the
290
+ rights granted under the Adapter's License You apply.
291
+
292
+
293
+ Section 4 -- Sui Generis Database Rights.
294
+
295
+ Where the Licensed Rights include Sui Generis Database Rights that
296
+ apply to Your use of the Licensed Material:
297
+
298
+ a. for the avoidance of doubt, Section 2(a)(1) grants You the right
299
+ to extract, reuse, reproduce, and Share all or a substantial
300
+ portion of the contents of the database;
301
+
302
+ b. if You include all or a substantial portion of the database
303
+ contents in a database in which You have Sui Generis Database
304
+ Rights, then the database in which You have Sui Generis Database
305
+ Rights (but not its individual contents) is Adapted Material,
306
+
307
+ including for purposes of Section 3(b); and
308
+ c. You must comply with the conditions in Section 3(a) if You Share
309
+ all or a substantial portion of the contents of the database.
310
+
311
+ For the avoidance of doubt, this Section 4 supplements and does not
312
+ replace Your obligations under this Public License where the Licensed
313
+ Rights include other Copyright and Similar Rights.
314
+
315
+
316
+ Section 5 -- Disclaimer of Warranties and Limitation of Liability.
317
+
318
+ a. UNLESS OTHERWISE SEPARATELY UNDERTAKEN BY THE LICENSOR, TO THE
319
+ EXTENT POSSIBLE, THE LICENSOR OFFERS THE LICENSED MATERIAL AS-IS
320
+ AND AS-AVAILABLE, AND MAKES NO REPRESENTATIONS OR WARRANTIES OF
321
+ ANY KIND CONCERNING THE LICENSED MATERIAL, WHETHER EXPRESS,
322
+ IMPLIED, STATUTORY, OR OTHER. THIS INCLUDES, WITHOUT LIMITATION,
323
+ WARRANTIES OF TITLE, MERCHANTABILITY, FITNESS FOR A PARTICULAR
324
+ PURPOSE, NON-INFRINGEMENT, ABSENCE OF LATENT OR OTHER DEFECTS,
325
+ ACCURACY, OR THE PRESENCE OR ABSENCE OF ERRORS, WHETHER OR NOT
326
+ KNOWN OR DISCOVERABLE. WHERE DISCLAIMERS OF WARRANTIES ARE NOT
327
+ ALLOWED IN FULL OR IN PART, THIS DISCLAIMER MAY NOT APPLY TO YOU.
328
+
329
+ b. TO THE EXTENT POSSIBLE, IN NO EVENT WILL THE LICENSOR BE LIABLE
330
+ TO YOU ON ANY LEGAL THEORY (INCLUDING, WITHOUT LIMITATION,
331
+ NEGLIGENCE) OR OTHERWISE FOR ANY DIRECT, SPECIAL, INDIRECT,
332
+ INCIDENTAL, CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES,
333
+ COSTS, EXPENSES, OR DAMAGES ARISING OUT OF THIS PUBLIC LICENSE OR
334
+ USE OF THE LICENSED MATERIAL, EVEN IF THE LICENSOR HAS BEEN
335
+ ADVISED OF THE POSSIBILITY OF SUCH LOSSES, COSTS, EXPENSES, OR
336
+ DAMAGES. WHERE A LIMITATION OF LIABILITY IS NOT ALLOWED IN FULL OR
337
+ IN PART, THIS LIMITATION MAY NOT APPLY TO YOU.
338
+
339
+ c. The disclaimer of warranties and limitation of liability provided
340
+ above shall be interpreted in a manner that, to the extent
341
+ possible, most closely approximates an absolute disclaimer and
342
+ waiver of all liability.
343
+
344
+
345
+ Section 6 -- Term and Termination.
346
+
347
+ a. This Public License applies for the term of the Copyright and
348
+ Similar Rights licensed here. However, if You fail to comply with
349
+ this Public License, then Your rights under this Public License
350
+ terminate automatically.
351
+
352
+ b. Where Your right to use the Licensed Material has terminated under
353
+ Section 6(a), it reinstates:
354
+
355
+ 1. automatically as of the date the violation is cured, provided
356
+ it is cured within 30 days of Your discovery of the
357
+ violation; or
358
+
359
+ 2. upon express reinstatement by the Licensor.
360
+
361
+ For the avoidance of doubt, this Section 6(b) does not affect any
362
+ right the Licensor may have to seek remedies for Your violations
363
+ of this Public License.
364
+
365
+ c. For the avoidance of doubt, the Licensor may also offer the
366
+ Licensed Material under separate terms or conditions or stop
367
+ distributing the Licensed Material at any time; however, doing so
368
+ will not terminate this Public License.
369
+
370
+ d. Sections 1, 5, 6, 7, and 8 survive termination of this Public
371
+ License.
372
+
373
+
374
+ Section 7 -- Other Terms and Conditions.
375
+
376
+ a. The Licensor shall not be bound by any additional or different
377
+ terms or conditions communicated by You unless expressly agreed.
378
+
379
+ b. Any arrangements, understandings, or agreements regarding the
380
+ Licensed Material not stated herein are separate from and
381
+ independent of the terms and conditions of this Public License.
382
+
383
+
384
+ Section 8 -- Interpretation.
385
+
386
+ a. For the avoidance of doubt, this Public License does not, and
387
+ shall not be interpreted to, reduce, limit, restrict, or impose
388
+ conditions on any use of the Licensed Material that could lawfully
389
+ be made without permission under this Public License.
390
+
391
+ b. To the extent possible, if any provision of this Public License is
392
+ deemed unenforceable, it shall be automatically reformed to the
393
+ minimum extent necessary to make it enforceable. If the provision
394
+ cannot be reformed, it shall be severed from this Public License
395
+ without affecting the enforceability of the remaining terms and
396
+ conditions.
397
+
398
+ c. No term or condition of this Public License will be waived and no
399
+ failure to comply consented to unless expressly agreed to by the
400
+ Licensor.
401
+
402
+ d. Nothing in this Public License constitutes or may be interpreted
403
+ as a limitation upon, or waiver of, any privileges and immunities
404
+ that apply to the Licensor or You, including from the legal
405
+ processes of any jurisdiction or authority.
406
+
407
+
408
+ =======================================================================
409
+
410
+ Creative Commons is not a party to its public
411
+ licenses. Notwithstanding, Creative Commons may elect to apply one of
412
+ its public licenses to material it publishes and in those instances
413
+ will be considered the “Licensor.” The text of the Creative Commons
414
+ public licenses is dedicated to the public domain under the CC0 Public
415
+ Domain Dedication. Except for the limited purpose of indicating that
416
+ material is shared under a Creative Commons public license or as
417
+ otherwise permitted by the Creative Commons policies published at
418
+ creativecommons.org/policies, Creative Commons does not authorize the
419
+ use of the trademark "Creative Commons" or any other trademark or logo
420
+ of Creative Commons without its prior written consent including,
421
+ without limitation, in connection with any unauthorized modifications
422
+ to any of its public licenses or any other arrangements,
423
+ understandings, or agreements concerning use of licensed material. For
424
+ the avoidance of doubt, this paragraph does not form part of the
425
+ public licenses.
426
+
427
+ Creative Commons may be contacted at creativecommons.org.
428
+
LICENSES_SOURCES ADDED
@@ -0,0 +1,880 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # UD Japanese GSD v2.6
2
+
3
+ * Author: Omura, Mai; Miyao, Yusuke; Kanayama, Hiroshi; Matsuda, Hiroshi; Wakasa, Aya; Yamashita, Kayo; Asahara, Masayuki; Tanaka, Takaaki; Murawaki, Yugo; Matsumoto, Yuji; Mori, Shinsuke; Uematsu, Sumire; McDonald, Ryan; Nivre, Joakim; Zeman, Daniel
4
+ * URL: https://github.com/UniversalDependencies/UD_Japanese-GSD
5
+ * License: CC BY-SA 4.0
6
+
7
+ ```
8
+ Attribution-ShareAlike 4.0 International
9
+
10
+ =======================================================================
11
+
12
+ Creative Commons Corporation ("Creative Commons") is not a law firm and
13
+ does not provide legal services or legal advice. Distribution of
14
+ Creative Commons public licenses does not create a lawyer-client or
15
+ other relationship. Creative Commons makes its licenses and related
16
+ information available on an "as-is" basis. Creative Commons gives no
17
+ warranties regarding its licenses, any material licensed under their
18
+ terms and conditions, or any related information. Creative Commons
19
+ disclaims all liability for damages resulting from their use to the
20
+ fullest extent possible.
21
+
22
+ Using Creative Commons Public Licenses
23
+
24
+ Creative Commons public licenses provide a standard set of terms and
25
+ conditions that creators and other rights holders may use to share
26
+ original works of authorship and other material subject to copyright
27
+ and certain other rights specified in the public license below. The
28
+ following considerations are for informational purposes only, are not
29
+ exhaustive, and do not form part of our licenses.
30
+
31
+ Considerations for licensors: Our public licenses are
32
+ intended for use by those authorized to give the public
33
+ permission to use material in ways otherwise restricted by
34
+ copyright and certain other rights. Our licenses are
35
+ irrevocable. Licensors should read and understand the terms
36
+ and conditions of the license they choose before applying it.
37
+ Licensors should also secure all rights necessary before
38
+ applying our licenses so that the public can reuse the
39
+ material as expected. Licensors should clearly mark any
40
+ material not subject to the license. This includes other CC-
41
+ licensed material, or material used under an exception or
42
+ limitation to copyright. More considerations for licensors:
43
+ wiki.creativecommons.org/Considerations_for_licensors
44
+
45
+ Considerations for the public: By using one of our public
46
+ licenses, a licensor grants the public permission to use the
47
+ licensed material under specified terms and conditions. If
48
+ the licensor's permission is not necessary for any reason--for
49
+ example, because of any applicable exception or limitation to
50
+ copyright--then that use is not regulated by the license. Our
51
+ licenses grant only permissions under copyright and certain
52
+ other rights that a licensor has authority to grant. Use of
53
+ the licensed material may still be restricted for other
54
+ reasons, including because others have copyright or other
55
+ rights in the material. A licensor may make special requests,
56
+ such as asking that all changes be marked or described.
57
+ Although not required by our licenses, you are encouraged to
58
+ respect those requests where reasonable. More considerations
59
+ for the public:
60
+ wiki.creativecommons.org/Considerations_for_licensees
61
+
62
+ =======================================================================
63
+
64
+ Creative Commons Attribution-ShareAlike 4.0 International Public
65
+ License
66
+
67
+ By exercising the Licensed Rights (defined below), You accept and agree
68
+ to be bound by the terms and conditions of this Creative Commons
69
+ Attribution-ShareAlike 4.0 International Public License ("Public
70
+ License"). To the extent this Public License may be interpreted as a
71
+ contract, You are granted the Licensed Rights in consideration of Your
72
+ acceptance of these terms and conditions, and the Licensor grants You
73
+ such rights in consideration of benefits the Licensor receives from
74
+ making the Licensed Material available under these terms and
75
+ conditions.
76
+
77
+
78
+ Section 1 -- Definitions.
79
+
80
+ a. Adapted Material means material subject to Copyright and Similar
81
+ Rights that is derived from or based upon the Licensed Material
82
+ and in which the Licensed Material is translated, altered,
83
+ arranged, transformed, or otherwise modified in a manner requiring
84
+ permission under the Copyright and Similar Rights held by the
85
+ Licensor. For purposes of this Public License, where the Licensed
86
+ Material is a musical work, performance, or sound recording,
87
+ Adapted Material is always produced where the Licensed Material is
88
+ synched in timed relation with a moving image.
89
+
90
+ b. Adapter's License means the license You apply to Your Copyright
91
+ and Similar Rights in Your contributions to Adapted Material in
92
+ accordance with the terms and conditions of this Public License.
93
+
94
+ c. BY-SA Compatible License means a license listed at
95
+ creativecommons.org/compatiblelicenses, approved by Creative
96
+ Commons as essentially the equivalent of this Public License.
97
+
98
+ d. Copyright and Similar Rights means copyright and/or similar rights
99
+ closely related to copyright including, without limitation,
100
+ performance, broadcast, sound recording, and Sui Generis Database
101
+ Rights, without regard to how the rights are labeled or
102
+ categorized. For purposes of this Public License, the rights
103
+ specified in Section 2(b)(1)-(2) are not Copyright and Similar
104
+ Rights.
105
+
106
+ e. Effective Technological Measures means those measures that, in the
107
+ absence of proper authority, may not be circumvented under laws
108
+ fulfilling obligations under Article 11 of the WIPO Copyright
109
+ Treaty adopted on December 20, 1996, and/or similar international
110
+ agreements.
111
+
112
+ f. Exceptions and Limitations means fair use, fair dealing, and/or
113
+ any other exception or limitation to Copyright and Similar Rights
114
+ that applies to Your use of the Licensed Material.
115
+
116
+ g. License Elements means the license attributes listed in the name
117
+ of a Creative Commons Public License. The License Elements of this
118
+ Public License are Attribution and ShareAlike.
119
+
120
+ h. Licensed Material means the artistic or literary work, database,
121
+ or other material to which the Licensor applied this Public
122
+ License.
123
+
124
+ i. Licensed Rights means the rights granted to You subject to the
125
+ terms and conditions of this Public License, which are limited to
126
+ all Copyright and Similar Rights that apply to Your use of the
127
+ Licensed Material and that the Licensor has authority to license.
128
+
129
+ j. Licensor means the individual(s) or entity(ies) granting rights
130
+ under this Public License.
131
+
132
+ k. Share means to provide material to the public by any means or
133
+ process that requires permission under the Licensed Rights, such
134
+ as reproduction, public display, public performance, distribution,
135
+ dissemination, communication, or importation, and to make material
136
+ available to the public including in ways that members of the
137
+ public may access the material from a place and at a time
138
+ individually chosen by them.
139
+
140
+ l. Sui Generis Database Rights means rights other than copyright
141
+ resulting from Directive 96/9/EC of the European Parliament and of
142
+ the Council of 11 March 1996 on the legal protection of databases,
143
+ as amended and/or succeeded, as well as other essentially
144
+ equivalent rights anywhere in the world.
145
+
146
+ m. You means the individual or entity exercising the Licensed Rights
147
+ under this Public License. Your has a corresponding meaning.
148
+
149
+
150
+ Section 2 -- Scope.
151
+
152
+ a. License grant.
153
+
154
+ 1. Subject to the terms and conditions of this Public License,
155
+ the Licensor hereby grants You a worldwide, royalty-free,
156
+ non-sublicensable, non-exclusive, irrevocable license to
157
+ exercise the Licensed Rights in the Licensed Material to:
158
+
159
+ a. reproduce and Share the Licensed Material, in whole or
160
+ in part; and
161
+
162
+ b. produce, reproduce, and Share Adapted Material.
163
+
164
+ 2. Exceptions and Limitations. For the avoidance of doubt, where
165
+ Exceptions and Limitations apply to Your use, this Public
166
+ License does not apply, and You do not need to comply with
167
+ its terms and conditions.
168
+
169
+ 3. Term. The term of this Public License is specified in Section
170
+ 6(a).
171
+
172
+ 4. Media and formats; technical modifications allowed. The
173
+ Licensor authorizes You to exercise the Licensed Rights in
174
+ all media and formats whether now known or hereafter created,
175
+ and to make technical modifications necessary to do so. The
176
+ Licensor waives and/or agrees not to assert any right or
177
+ authority to forbid You from making technical modifications
178
+ necessary to exercise the Licensed Rights, including
179
+ technical modifications necessary to circumvent Effective
180
+ Technological Measures. For purposes of this Public License,
181
+ simply making modifications authorized by this Section 2(a)
182
+ (4) never produces Adapted Material.
183
+
184
+ 5. Downstream recipients.
185
+
186
+ a. Offer from the Licensor -- Licensed Material. Every
187
+ recipient of the Licensed Material automatically
188
+ receives an offer from the Licensor to exercise the
189
+ Licensed Rights under the terms and conditions of this
190
+ Public License.
191
+
192
+ b. Additional offer from the Licensor -- Adapted Material.
193
+ Every recipient of Adapted Material from You
194
+ automatically receives an offer from the Licensor to
195
+ exercise the Licensed Rights in the Adapted Material
196
+ under the conditions of the Adapter's License You apply.
197
+
198
+ c. No downstream restrictions. You may not offer or impose
199
+ any additional or different terms or conditions on, or
200
+ apply any Effective Technological Measures to, the
201
+ Licensed Material if doing so restricts exercise of the
202
+ Licensed Rights by any recipient of the Licensed
203
+ Material.
204
+
205
+ 6. No endorsement. Nothing in this Public License constitutes or
206
+ may be construed as permission to assert or imply that You
207
+ are, or that Your use of the Licensed Material is, connected
208
+ with, or sponsored, endorsed, or granted official status by,
209
+ the Licensor or others designated to receive attribution as
210
+ provided in Section 3(a)(1)(A)(i).
211
+
212
+ b. Other rights.
213
+
214
+ 1. Moral rights, such as the right of integrity, are not
215
+ licensed under this Public License, nor are publicity,
216
+ privacy, and/or other similar personality rights; however, to
217
+ the extent possible, the Licensor waives and/or agrees not to
218
+ assert any such rights held by the Licensor to the limited
219
+ extent necessary to allow You to exercise the Licensed
220
+ Rights, but not otherwise.
221
+
222
+ 2. Patent and trademark rights are not licensed under this
223
+ Public License.
224
+
225
+ 3. To the extent possible, the Licensor waives any right to
226
+ collect royalties from You for the exercise of the Licensed
227
+ Rights, whether directly or through a collecting society
228
+ under any voluntary or waivable statutory or compulsory
229
+ licensing scheme. In all other cases the Licensor expressly
230
+ reserves any right to collect such royalties.
231
+
232
+
233
+ Section 3 -- License Conditions.
234
+
235
+ Your exercise of the Licensed Rights is expressly made subject to the
236
+ following conditions.
237
+
238
+ a. Attribution.
239
+
240
+ 1. If You Share the Licensed Material (including in modified
241
+ form), You must:
242
+
243
+ a. retain the following if it is supplied by the Licensor
244
+ with the Licensed Material:
245
+
246
+ i. identification of the creator(s) of the Licensed
247
+ Material and any others designated to receive
248
+ attribution, in any reasonable manner requested by
249
+ the Licensor (including by pseudonym if
250
+ designated);
251
+
252
+ ii. a copyright notice;
253
+
254
+ iii. a notice that refers to this Public License;
255
+
256
+ iv. a notice that refers to the disclaimer of
257
+ warranties;
258
+
259
+ v. a URI or hyperlink to the Licensed Material to the
260
+ extent reasonably practicable;
261
+
262
+ b. indicate if You modified the Licensed Material and
263
+ retain an indication of any previous modifications; and
264
+
265
+ c. indicate the Licensed Material is licensed under this
266
+ Public License, and include the text of, or the URI or
267
+ hyperlink to, this Public License.
268
+
269
+ 2. You may satisfy the conditions in Section 3(a)(1) in any
270
+ reasonable manner based on the medium, means, and context in
271
+ which You Share the Licensed Material. For example, it may be
272
+ reasonable to satisfy the conditions by providing a URI or
273
+ hyperlink to a resource that includes the required
274
+ information.
275
+
276
+ 3. If requested by the Licensor, You must remove any of the
277
+ information required by Section 3(a)(1)(A) to the extent
278
+ reasonably practicable.
279
+
280
+ b. ShareAlike.
281
+
282
+ In addition to the conditions in Section 3(a), if You Share
283
+ Adapted Material You produce, the following conditions also apply.
284
+
285
+ 1. The Adapter's License You apply must be a Creative Commons
286
+ license with the same License Elements, this version or
287
+ later, or a BY-SA Compatible License.
288
+
289
+ 2. You must include the text of, or the URI or hyperlink to, the
290
+ Adapter's License You apply. You may satisfy this condition
291
+ in any reasonable manner based on the medium, means, and
292
+ context in which You Share Adapted Material.
293
+
294
+ 3. You may not offer or impose any additional or different terms
295
+ or conditions on, or apply any Effective Technological
296
+ Measures to, Adapted Material that restrict exercise of the
297
+ rights granted under the Adapter's License You apply.
298
+
299
+
300
+ Section 4 -- Sui Generis Database Rights.
301
+
302
+ Where the Licensed Rights include Sui Generis Database Rights that
303
+ apply to Your use of the Licensed Material:
304
+
305
+ a. for the avoidance of doubt, Section 2(a)(1) grants You the right
306
+ to extract, reuse, reproduce, and Share all or a substantial
307
+ portion of the contents of the database;
308
+
309
+ b. if You include all or a substantial portion of the database
310
+ contents in a database in which You have Sui Generis Database
311
+ Rights, then the database in which You have Sui Generis Database
312
+ Rights (but not its individual contents) is Adapted Material,
313
+
314
+ including for purposes of Section 3(b); and
315
+ c. You must comply with the conditions in Section 3(a) if You Share
316
+ all or a substantial portion of the contents of the database.
317
+
318
+ For the avoidance of doubt, this Section 4 supplements and does not
319
+ replace Your obligations under this Public License where the Licensed
320
+ Rights include other Copyright and Similar Rights.
321
+
322
+
323
+ Section 5 -- Disclaimer of Warranties and Limitation of Liability.
324
+
325
+ a. UNLESS OTHERWISE SEPARATELY UNDERTAKEN BY THE LICENSOR, TO THE
326
+ EXTENT POSSIBLE, THE LICENSOR OFFERS THE LICENSED MATERIAL AS-IS
327
+ AND AS-AVAILABLE, AND MAKES NO REPRESENTATIONS OR WARRANTIES OF
328
+ ANY KIND CONCERNING THE LICENSED MATERIAL, WHETHER EXPRESS,
329
+ IMPLIED, STATUTORY, OR OTHER. THIS INCLUDES, WITHOUT LIMITATION,
330
+ WARRANTIES OF TITLE, MERCHANTABILITY, FITNESS FOR A PARTICULAR
331
+ PURPOSE, NON-INFRINGEMENT, ABSENCE OF LATENT OR OTHER DEFECTS,
332
+ ACCURACY, OR THE PRESENCE OR ABSENCE OF ERRORS, WHETHER OR NOT
333
+ KNOWN OR DISCOVERABLE. WHERE DISCLAIMERS OF WARRANTIES ARE NOT
334
+ ALLOWED IN FULL OR IN PART, THIS DISCLAIMER MAY NOT APPLY TO YOU.
335
+
336
+ b. TO THE EXTENT POSSIBLE, IN NO EVENT WILL THE LICENSOR BE LIABLE
337
+ TO YOU ON ANY LEGAL THEORY (INCLUDING, WITHOUT LIMITATION,
338
+ NEGLIGENCE) OR OTHERWISE FOR ANY DIRECT, SPECIAL, INDIRECT,
339
+ INCIDENTAL, CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES,
340
+ COSTS, EXPENSES, OR DAMAGES ARISING OUT OF THIS PUBLIC LICENSE OR
341
+ USE OF THE LICENSED MATERIAL, EVEN IF THE LICENSOR HAS BEEN
342
+ ADVISED OF THE POSSIBILITY OF SUCH LOSSES, COSTS, EXPENSES, OR
343
+ DAMAGES. WHERE A LIMITATION OF LIABILITY IS NOT ALLOWED IN FULL OR
344
+ IN PART, THIS LIMITATION MAY NOT APPLY TO YOU.
345
+
346
+ c. The disclaimer of warranties and limitation of liability provided
347
+ above shall be interpreted in a manner that, to the extent
348
+ possible, most closely approximates an absolute disclaimer and
349
+ waiver of all liability.
350
+
351
+
352
+ Section 6 -- Term and Termination.
353
+
354
+ a. This Public License applies for the term of the Copyright and
355
+ Similar Rights licensed here. However, if You fail to comply with
356
+ this Public License, then Your rights under this Public License
357
+ terminate automatically.
358
+
359
+ b. Where Your right to use the Licensed Material has terminated under
360
+ Section 6(a), it reinstates:
361
+
362
+ 1. automatically as of the date the violation is cured, provided
363
+ it is cured within 30 days of Your discovery of the
364
+ violation; or
365
+
366
+ 2. upon express reinstatement by the Licensor.
367
+
368
+ For the avoidance of doubt, this Section 6(b) does not affect any
369
+ right the Licensor may have to seek remedies for Your violations
370
+ of this Public License.
371
+
372
+ c. For the avoidance of doubt, the Licensor may also offer the
373
+ Licensed Material under separate terms or conditions or stop
374
+ distributing the Licensed Material at any time; however, doing so
375
+ will not terminate this Public License.
376
+
377
+ d. Sections 1, 5, 6, 7, and 8 survive termination of this Public
378
+ License.
379
+
380
+
381
+ Section 7 -- Other Terms and Conditions.
382
+
383
+ a. The Licensor shall not be bound by any additional or different
384
+ terms or conditions communicated by You unless expressly agreed.
385
+
386
+ b. Any arrangements, understandings, or agreements regarding the
387
+ Licensed Material not stated herein are separate from and
388
+ independent of the terms and conditions of this Public License.
389
+
390
+
391
+ Section 8 -- Interpretation.
392
+
393
+ a. For the avoidance of doubt, this Public License does not, and
394
+ shall not be interpreted to, reduce, limit, restrict, or impose
395
+ conditions on any use of the Licensed Material that could lawfully
396
+ be made without permission under this Public License.
397
+
398
+ b. To the extent possible, if any provision of this Public License is
399
+ deemed unenforceable, it shall be automatically reformed to the
400
+ minimum extent necessary to make it enforceable. If the provision
401
+ cannot be reformed, it shall be severed from this Public License
402
+ without affecting the enforceability of the remaining terms and
403
+ conditions.
404
+
405
+ c. No term or condition of this Public License will be waived and no
406
+ failure to comply consented to unless expressly agreed to by the
407
+ Licensor.
408
+
409
+ d. Nothing in this Public License constitutes or may be interpreted
410
+ as a limitation upon, or waiver of, any privileges and immunities
411
+ that apply to the Licensor or You, including from the legal
412
+ processes of any jurisdiction or authority.
413
+
414
+
415
+ =======================================================================
416
+
417
+ Creative Commons is not a party to its public
418
+ licenses. Notwithstanding, Creative Commons may elect to apply one of
419
+ its public licenses to material it publishes and in those instances
420
+ will be considered the “Licensor.” The text of the Creative Commons
421
+ public licenses is dedicated to the public domain under the CC0 Public
422
+ Domain Dedication. Except for the limited purpose of indicating that
423
+ material is shared under a Creative Commons public license or as
424
+ otherwise permitted by the Creative Commons policies published at
425
+ creativecommons.org/policies, Creative Commons does not authorize the
426
+ use of the trademark "Creative Commons" or any other trademark or logo
427
+ of Creative Commons without its prior written consent including,
428
+ without limitation, in connection with any unauthorized modifications
429
+ to any of its public licenses or any other arrangements,
430
+ understandings, or agreements concerning use of licensed material. For
431
+ the avoidance of doubt, this paragraph does not form part of the
432
+ public licenses.
433
+
434
+ Creative Commons may be contacted at creativecommons.org.
435
+
436
+ ```
437
+
438
+
439
+
440
+
441
+ # UD Japanese GSD v2.6 NER
442
+
443
+ * Author: Megagon Labs Tokyo
444
+ * URL: https://github.com/megagonlabs/UD_Japanese-GSD
445
+ * License: CC BY-SA 4.0
446
+
447
+ ```
448
+ Attribution-ShareAlike 4.0 International
449
+
450
+ =======================================================================
451
+
452
+ Creative Commons Corporation ("Creative Commons") is not a law firm and
453
+ does not provide legal services or legal advice. Distribution of
454
+ Creative Commons public licenses does not create a lawyer-client or
455
+ other relationship. Creative Commons makes its licenses and related
456
+ information available on an "as-is" basis. Creative Commons gives no
457
+ warranties regarding its licenses, any material licensed under their
458
+ terms and conditions, or any related information. Creative Commons
459
+ disclaims all liability for damages resulting from their use to the
460
+ fullest extent possible.
461
+
462
+ Using Creative Commons Public Licenses
463
+
464
+ Creative Commons public licenses provide a standard set of terms and
465
+ conditions that creators and other rights holders may use to share
466
+ original works of authorship and other material subject to copyright
467
+ and certain other rights specified in the public license below. The
468
+ following considerations are for informational purposes only, are not
469
+ exhaustive, and do not form part of our licenses.
470
+
471
+ Considerations for licensors: Our public licenses are
472
+ intended for use by those authorized to give the public
473
+ permission to use material in ways otherwise restricted by
474
+ copyright and certain other rights. Our licenses are
475
+ irrevocable. Licensors should read and understand the terms
476
+ and conditions of the license they choose before applying it.
477
+ Licensors should also secure all rights necessary before
478
+ applying our licenses so that the public can reuse the
479
+ material as expected. Licensors should clearly mark any
480
+ material not subject to the license. This includes other CC-
481
+ licensed material, or material used under an exception or
482
+ limitation to copyright. More considerations for licensors:
483
+ wiki.creativecommons.org/Considerations_for_licensors
484
+
485
+ Considerations for the public: By using one of our public
486
+ licenses, a licensor grants the public permission to use the
487
+ licensed material under specified terms and conditions. If
488
+ the licensor's permission is not necessary for any reason--for
489
+ example, because of any applicable exception or limitation to
490
+ copyright--then that use is not regulated by the license. Our
491
+ licenses grant only permissions under copyright and certain
492
+ other rights that a licensor has authority to grant. Use of
493
+ the licensed material may still be restricted for other
494
+ reasons, including because others have copyright or other
495
+ rights in the material. A licensor may make special requests,
496
+ such as asking that all changes be marked or described.
497
+ Although not required by our licenses, you are encouraged to
498
+ respect those requests where reasonable. More considerations
499
+ for the public:
500
+ wiki.creativecommons.org/Considerations_for_licensees
501
+
502
+ =======================================================================
503
+
504
+ Creative Commons Attribution-ShareAlike 4.0 International Public
505
+ License
506
+
507
+ By exercising the Licensed Rights (defined below), You accept and agree
508
+ to be bound by the terms and conditions of this Creative Commons
509
+ Attribution-ShareAlike 4.0 International Public License ("Public
510
+ License"). To the extent this Public License may be interpreted as a
511
+ contract, You are granted the Licensed Rights in consideration of Your
512
+ acceptance of these terms and conditions, and the Licensor grants You
513
+ such rights in consideration of benefits the Licensor receives from
514
+ making the Licensed Material available under these terms and
515
+ conditions.
516
+
517
+
518
+ Section 1 -- Definitions.
519
+
520
+ a. Adapted Material means material subject to Copyright and Similar
521
+ Rights that is derived from or based upon the Licensed Material
522
+ and in which the Licensed Material is translated, altered,
523
+ arranged, transformed, or otherwise modified in a manner requiring
524
+ permission under the Copyright and Similar Rights held by the
525
+ Licensor. For purposes of this Public License, where the Licensed
526
+ Material is a musical work, performance, or sound recording,
527
+ Adapted Material is always produced where the Licensed Material is
528
+ synched in timed relation with a moving image.
529
+
530
+ b. Adapter's License means the license You apply to Your Copyright
531
+ and Similar Rights in Your contributions to Adapted Material in
532
+ accordance with the terms and conditions of this Public License.
533
+
534
+ c. BY-SA Compatible License means a license listed at
535
+ creativecommons.org/compatiblelicenses, approved by Creative
536
+ Commons as essentially the equivalent of this Public License.
537
+
538
+ d. Copyright and Similar Rights means copyright and/or similar rights
539
+ closely related to copyright including, without limitation,
540
+ performance, broadcast, sound recording, and Sui Generis Database
541
+ Rights, without regard to how the rights are labeled or
542
+ categorized. For purposes of this Public License, the rights
543
+ specified in Section 2(b)(1)-(2) are not Copyright and Similar
544
+ Rights.
545
+
546
+ e. Effective Technological Measures means those measures that, in the
547
+ absence of proper authority, may not be circumvented under laws
548
+ fulfilling obligations under Article 11 of the WIPO Copyright
549
+ Treaty adopted on December 20, 1996, and/or similar international
550
+ agreements.
551
+
552
+ f. Exceptions and Limitations means fair use, fair dealing, and/or
553
+ any other exception or limitation to Copyright and Similar Rights
554
+ that applies to Your use of the Licensed Material.
555
+
556
+ g. License Elements means the license attributes listed in the name
557
+ of a Creative Commons Public License. The License Elements of this
558
+ Public License are Attribution and ShareAlike.
559
+
560
+ h. Licensed Material means the artistic or literary work, database,
561
+ or other material to which the Licensor applied this Public
562
+ License.
563
+
564
+ i. Licensed Rights means the rights granted to You subject to the
565
+ terms and conditions of this Public License, which are limited to
566
+ all Copyright and Similar Rights that apply to Your use of the
567
+ Licensed Material and that the Licensor has authority to license.
568
+
569
+ j. Licensor means the individual(s) or entity(ies) granting rights
570
+ under this Public License.
571
+
572
+ k. Share means to provide material to the public by any means or
573
+ process that requires permission under the Licensed Rights, such
574
+ as reproduction, public display, public performance, distribution,
575
+ dissemination, communication, or importation, and to make material
576
+ available to the public including in ways that members of the
577
+ public may access the material from a place and at a time
578
+ individually chosen by them.
579
+
580
+ l. Sui Generis Database Rights means rights other than copyright
581
+ resulting from Directive 96/9/EC of the European Parliament and of
582
+ the Council of 11 March 1996 on the legal protection of databases,
583
+ as amended and/or succeeded, as well as other essentially
584
+ equivalent rights anywhere in the world.
585
+
586
+ m. You means the individual or entity exercising the Licensed Rights
587
+ under this Public License. Your has a corresponding meaning.
588
+
589
+
590
+ Section 2 -- Scope.
591
+
592
+ a. License grant.
593
+
594
+ 1. Subject to the terms and conditions of this Public License,
595
+ the Licensor hereby grants You a worldwide, royalty-free,
596
+ non-sublicensable, non-exclusive, irrevocable license to
597
+ exercise the Licensed Rights in the Licensed Material to:
598
+
599
+ a. reproduce and Share the Licensed Material, in whole or
600
+ in part; and
601
+
602
+ b. produce, reproduce, and Share Adapted Material.
603
+
604
+ 2. Exceptions and Limitations. For the avoidance of doubt, where
605
+ Exceptions and Limitations apply to Your use, this Public
606
+ License does not apply, and You do not need to comply with
607
+ its terms and conditions.
608
+
609
+ 3. Term. The term of this Public License is specified in Section
610
+ 6(a).
611
+
612
+ 4. Media and formats; technical modifications allowed. The
613
+ Licensor authorizes You to exercise the Licensed Rights in
614
+ all media and formats whether now known or hereafter created,
615
+ and to make technical modifications necessary to do so. The
616
+ Licensor waives and/or agrees not to assert any right or
617
+ authority to forbid You from making technical modifications
618
+ necessary to exercise the Licensed Rights, including
619
+ technical modifications necessary to circumvent Effective
620
+ Technological Measures. For purposes of this Public License,
621
+ simply making modifications authorized by this Section 2(a)
622
+ (4) never produces Adapted Material.
623
+
624
+ 5. Downstream recipients.
625
+
626
+ a. Offer from the Licensor -- Licensed Material. Every
627
+ recipient of the Licensed Material automatically
628
+ receives an offer from the Licensor to exercise the
629
+ Licensed Rights under the terms and conditions of this
630
+ Public License.
631
+
632
+ b. Additional offer from the Licensor -- Adapted Material.
633
+ Every recipient of Adapted Material from You
634
+ automatically receives an offer from the Licensor to
635
+ exercise the Licensed Rights in the Adapted Material
636
+ under the conditions of the Adapter's License You apply.
637
+
638
+ c. No downstream restrictions. You may not offer or impose
639
+ any additional or different terms or conditions on, or
640
+ apply any Effective Technological Measures to, the
641
+ Licensed Material if doing so restricts exercise of the
642
+ Licensed Rights by any recipient of the Licensed
643
+ Material.
644
+
645
+ 6. No endorsement. Nothing in this Public License constitutes or
646
+ may be construed as permission to assert or imply that You
647
+ are, or that Your use of the Licensed Material is, connected
648
+ with, or sponsored, endorsed, or granted official status by,
649
+ the Licensor or others designated to receive attribution as
650
+ provided in Section 3(a)(1)(A)(i).
651
+
652
+ b. Other rights.
653
+
654
+ 1. Moral rights, such as the right of integrity, are not
655
+ licensed under this Public License, nor are publicity,
656
+ privacy, and/or other similar personality rights; however, to
657
+ the extent possible, the Licensor waives and/or agrees not to
658
+ assert any such rights held by the Licensor to the limited
659
+ extent necessary to allow You to exercise the Licensed
660
+ Rights, but not otherwise.
661
+
662
+ 2. Patent and trademark rights are not licensed under this
663
+ Public License.
664
+
665
+ 3. To the extent possible, the Licensor waives any right to
666
+ collect royalties from You for the exercise of the Licensed
667
+ Rights, whether directly or through a collecting society
668
+ under any voluntary or waivable statutory or compulsory
669
+ licensing scheme. In all other cases the Licensor expressly
670
+ reserves any right to collect such royalties.
671
+
672
+
673
+ Section 3 -- License Conditions.
674
+
675
+ Your exercise of the Licensed Rights is expressly made subject to the
676
+ following conditions.
677
+
678
+ a. Attribution.
679
+
680
+ 1. If You Share the Licensed Material (including in modified
681
+ form), You must:
682
+
683
+ a. retain the following if it is supplied by the Licensor
684
+ with the Licensed Material:
685
+
686
+ i. identification of the creator(s) of the Licensed
687
+ Material and any others designated to receive
688
+ attribution, in any reasonable manner requested by
689
+ the Licensor (including by pseudonym if
690
+ designated);
691
+
692
+ ii. a copyright notice;
693
+
694
+ iii. a notice that refers to this Public License;
695
+
696
+ iv. a notice that refers to the disclaimer of
697
+ warranties;
698
+
699
+ v. a URI or hyperlink to the Licensed Material to the
700
+ extent reasonably practicable;
701
+
702
+ b. indicate if You modified the Licensed Material and
703
+ retain an indication of any previous modifications; and
704
+
705
+ c. indicate the Licensed Material is licensed under this
706
+ Public License, and include the text of, or the URI or
707
+ hyperlink to, this Public License.
708
+
709
+ 2. You may satisfy the conditions in Section 3(a)(1) in any
710
+ reasonable manner based on the medium, means, and context in
711
+ which You Share the Licensed Material. For example, it may be
712
+ reasonable to satisfy the conditions by providing a URI or
713
+ hyperlink to a resource that includes the required
714
+ information.
715
+
716
+ 3. If requested by the Licensor, You must remove any of the
717
+ information required by Section 3(a)(1)(A) to the extent
718
+ reasonably practicable.
719
+
720
+ b. ShareAlike.
721
+
722
+ In addition to the conditions in Section 3(a), if You Share
723
+ Adapted Material You produce, the following conditions also apply.
724
+
725
+ 1. The Adapter's License You apply must be a Creative Commons
726
+ license with the same License Elements, this version or
727
+ later, or a BY-SA Compatible License.
728
+
729
+ 2. You must include the text of, or the URI or hyperlink to, the
730
+ Adapter's License You apply. You may satisfy this condition
731
+ in any reasonable manner based on the medium, means, and
732
+ context in which You Share Adapted Material.
733
+
734
+ 3. You may not offer or impose any additional or different terms
735
+ or conditions on, or apply any Effective Technological
736
+ Measures to, Adapted Material that restrict exercise of the
737
+ rights granted under the Adapter's License You apply.
738
+
739
+
740
+ Section 4 -- Sui Generis Database Rights.
741
+
742
+ Where the Licensed Rights include Sui Generis Database Rights that
743
+ apply to Your use of the Licensed Material:
744
+
745
+ a. for the avoidance of doubt, Section 2(a)(1) grants You the right
746
+ to extract, reuse, reproduce, and Share all or a substantial
747
+ portion of the contents of the database;
748
+
749
+ b. if You include all or a substantial portion of the database
750
+ contents in a database in which You have Sui Generis Database
751
+ Rights, then the database in which You have Sui Generis Database
752
+ Rights (but not its individual contents) is Adapted Material,
753
+
754
+ including for purposes of Section 3(b); and
755
+ c. You must comply with the conditions in Section 3(a) if You Share
756
+ all or a substantial portion of the contents of the database.
757
+
758
+ For the avoidance of doubt, this Section 4 supplements and does not
759
+ replace Your obligations under this Public License where the Licensed
760
+ Rights include other Copyright and Similar Rights.
761
+
762
+
763
+ Section 5 -- Disclaimer of Warranties and Limitation of Liability.
764
+
765
+ a. UNLESS OTHERWISE SEPARATELY UNDERTAKEN BY THE LICENSOR, TO THE
766
+ EXTENT POSSIBLE, THE LICENSOR OFFERS THE LICENSED MATERIAL AS-IS
767
+ AND AS-AVAILABLE, AND MAKES NO REPRESENTATIONS OR WARRANTIES OF
768
+ ANY KIND CONCERNING THE LICENSED MATERIAL, WHETHER EXPRESS,
769
+ IMPLIED, STATUTORY, OR OTHER. THIS INCLUDES, WITHOUT LIMITATION,
770
+ WARRANTIES OF TITLE, MERCHANTABILITY, FITNESS FOR A PARTICULAR
771
+ PURPOSE, NON-INFRINGEMENT, ABSENCE OF LATENT OR OTHER DEFECTS,
772
+ ACCURACY, OR THE PRESENCE OR ABSENCE OF ERRORS, WHETHER OR NOT
773
+ KNOWN OR DISCOVERABLE. WHERE DISCLAIMERS OF WARRANTIES ARE NOT
774
+ ALLOWED IN FULL OR IN PART, THIS DISCLAIMER MAY NOT APPLY TO YOU.
775
+
776
+ b. TO THE EXTENT POSSIBLE, IN NO EVENT WILL THE LICENSOR BE LIABLE
777
+ TO YOU ON ANY LEGAL THEORY (INCLUDING, WITHOUT LIMITATION,
778
+ NEGLIGENCE) OR OTHERWISE FOR ANY DIRECT, SPECIAL, INDIRECT,
779
+ INCIDENTAL, CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES,
780
+ COSTS, EXPENSES, OR DAMAGES ARISING OUT OF THIS PUBLIC LICENSE OR
781
+ USE OF THE LICENSED MATERIAL, EVEN IF THE LICENSOR HAS BEEN
782
+ ADVISED OF THE POSSIBILITY OF SUCH LOSSES, COSTS, EXPENSES, OR
783
+ DAMAGES. WHERE A LIMITATION OF LIABILITY IS NOT ALLOWED IN FULL OR
784
+ IN PART, THIS LIMITATION MAY NOT APPLY TO YOU.
785
+
786
+ c. The disclaimer of warranties and limitation of liability provided
787
+ above shall be interpreted in a manner that, to the extent
788
+ possible, most closely approximates an absolute disclaimer and
789
+ waiver of all liability.
790
+
791
+
792
+ Section 6 -- Term and Termination.
793
+
794
+ a. This Public License applies for the term of the Copyright and
795
+ Similar Rights licensed here. However, if You fail to comply with
796
+ this Public License, then Your rights under this Public License
797
+ terminate automatically.
798
+
799
+ b. Where Your right to use the Licensed Material has terminated under
800
+ Section 6(a), it reinstates:
801
+
802
+ 1. automatically as of the date the violation is cured, provided
803
+ it is cured within 30 days of Your discovery of the
804
+ violation; or
805
+
806
+ 2. upon express reinstatement by the Licensor.
807
+
808
+ For the avoidance of doubt, this Section 6(b) does not affect any
809
+ right the Licensor may have to seek remedies for Your violations
810
+ of this Public License.
811
+
812
+ c. For the avoidance of doubt, the Licensor may also offer the
813
+ Licensed Material under separate terms or conditions or stop
814
+ distributing the Licensed Material at any time; however, doing so
815
+ will not terminate this Public License.
816
+
817
+ d. Sections 1, 5, 6, 7, and 8 survive termination of this Public
818
+ License.
819
+
820
+
821
+ Section 7 -- Other Terms and Conditions.
822
+
823
+ a. The Licensor shall not be bound by any additional or different
824
+ terms or conditions communicated by You unless expressly agreed.
825
+
826
+ b. Any arrangements, understandings, or agreements regarding the
827
+ Licensed Material not stated herein are separate from and
828
+ independent of the terms and conditions of this Public License.
829
+
830
+
831
+ Section 8 -- Interpretation.
832
+
833
+ a. For the avoidance of doubt, this Public License does not, and
834
+ shall not be interpreted to, reduce, limit, restrict, or impose
835
+ conditions on any use of the Licensed Material that could lawfully
836
+ be made without permission under this Public License.
837
+
838
+ b. To the extent possible, if any provision of this Public License is
839
+ deemed unenforceable, it shall be automatically reformed to the
840
+ minimum extent necessary to make it enforceable. If the provision
841
+ cannot be reformed, it shall be severed from this Public License
842
+ without affecting the enforceability of the remaining terms and
843
+ conditions.
844
+
845
+ c. No term or condition of this Public License will be waived and no
846
+ failure to comply consented to unless expressly agreed to by the
847
+ Licensor.
848
+
849
+ d. Nothing in this Public License constitutes or may be interpreted
850
+ as a limitation upon, or waiver of, any privileges and immunities
851
+ that apply to the Licensor or You, including from the legal
852
+ processes of any jurisdiction or authority.
853
+
854
+
855
+ =======================================================================
856
+
857
+ Creative Commons is not a party to its public
858
+ licenses. Notwithstanding, Creative Commons may elect to apply one of
859
+ its public licenses to material it publishes and in those instances
860
+ will be considered the “Licensor.” The text of the Creative Commons
861
+ public licenses is dedicated to the public domain under the CC0 Public
862
+ Domain Dedication. Except for the limited purpose of indicating that
863
+ material is shared under a Creative Commons public license or as
864
+ otherwise permitted by the Creative Commons policies published at
865
+ creativecommons.org/policies, Creative Commons does not authorize the
866
+ use of the trademark "Creative Commons" or any other trademark or logo
867
+ of Creative Commons without its prior written consent including,
868
+ without limitation, in connection with any unauthorized modifications
869
+ to any of its public licenses or any other arrangements,
870
+ understandings, or agreements concerning use of licensed material. For
871
+ the avoidance of doubt, this paragraph does not form part of the
872
+ public licenses.
873
+
874
+ Creative Commons may be contacted at creativecommons.org.
875
+
876
+ ```
877
+
878
+
879
+
880
+
README.md ADDED
@@ -0,0 +1,104 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - spacy
4
+ - token-classification
5
+ language:
6
+ - ja
7
+ license: CC-BY-SA-4.0
8
+ model-index:
9
+ - name: ja_core_news_sm
10
+ results:
11
+ - tasks:
12
+ name: NER
13
+ type: token-classification
14
+ metrics:
15
+ - name: Precision
16
+ type: precision
17
+ value: 0.7163232964
18
+ - name: Recall
19
+ type: recall
20
+ value: 0.5772669221
21
+ - name: F Score
22
+ type: f_score
23
+ value: 0.639321075
24
+ - tasks:
25
+ name: POS
26
+ type: token-classification
27
+ metrics:
28
+ - name: Accuracy
29
+ type: accuracy
30
+ value: 0.9721899386
31
+ - tasks:
32
+ name: SENTER
33
+ type: token-classification
34
+ metrics:
35
+ - name: Precision
36
+ type: precision
37
+ value: 0.9860557769
38
+ - name: Recall
39
+ type: recall
40
+ value: 0.9880239521
41
+ - name: F Score
42
+ type: f_score
43
+ value: 0.9870388833
44
+ - tasks:
45
+ name: UNLABELED_DEPENDENCIES
46
+ type: token-classification
47
+ metrics:
48
+ - name: Accuracy
49
+ type: accuracy
50
+ value: 0.916212877
51
+ - tasks:
52
+ name: LABELED_DEPENDENCIES
53
+ type: token-classification
54
+ metrics:
55
+ - name: Accuracy
56
+ type: accuracy
57
+ value: 0.916212877
58
+ ---
59
+ ### Details: https://spacy.io/models/ja#ja_core_news_sm
60
+
61
+ Japanese pipeline optimized for CPU. Components: tok2vec, parser, senter, ner, attribute_ruler.
62
+
63
+ | Feature | Description |
64
+ | --- | --- |
65
+ | **Name** | `ja_core_news_sm` |
66
+ | **Version** | `3.1.0` |
67
+ | **spaCy** | `>=3.1.0,<3.2.0` |
68
+ | **Default Pipeline** | `tok2vec`, `parser`, `attribute_ruler`, `ner` |
69
+ | **Components** | `tok2vec`, `parser`, `senter`, `attribute_ruler`, `ner` |
70
+ | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
71
+ | **Sources** | [UD Japanese GSD v2.6](https://github.com/UniversalDependencies/UD_Japanese-GSD) (Omura, Mai; Miyao, Yusuke; Kanayama, Hiroshi; Matsuda, Hiroshi; Wakasa, Aya; Yamashita, Kayo; Asahara, Masayuki; Tanaka, Takaaki; Murawaki, Yugo; Matsumoto, Yuji; Mori, Shinsuke; Uematsu, Sumire; McDonald, Ryan; Nivre, Joakim; Zeman, Daniel)<br />[UD Japanese GSD v2.6 NER](https://github.com/megagonlabs/UD_Japanese-GSD) (Megagon Labs Tokyo) |
72
+ | **License** | `CC BY-SA 4.0` |
73
+ | **Author** | [Explosion](https://explosion.ai) |
74
+
75
+ ### Label Scheme
76
+
77
+ <details>
78
+
79
+ <summary>View label scheme (47 labels for 3 components)</summary>
80
+
81
+ | Component | Labels |
82
+ | --- | --- |
83
+ | **`parser`** | `ROOT`, `acl`, `advcl`, `advmod`, `amod`, `aux`, `case`, `cc`, `ccomp`, `compound`, `cop`, `csubj`, `dep`, `det`, `dislocated`, `fixed`, `mark`, `nmod`, `nsubj`, `nummod`, `obj`, `obl`, `punct` |
84
+ | **`senter`** | `I`, `S` |
85
+ | **`ner`** | `CARDINAL`, `DATE`, `EVENT`, `FAC`, `GPE`, `LANGUAGE`, `LAW`, `LOC`, `MONEY`, `MOVEMENT`, `NORP`, `ORDINAL`, `ORG`, `PERCENT`, `PERSON`, `PET_NAME`, `PHONE`, `PRODUCT`, `QUANTITY`, `TIME`, `TITLE_AFFIX`, `WORK_OF_ART` |
86
+
87
+ </details>
88
+
89
+ ### Accuracy
90
+
91
+ | Type | Score |
92
+ | --- | --- |
93
+ | `TOKEN_ACC` | 99.69 |
94
+ | `TAG_ACC` | 97.22 |
95
+ | `POS_ACC` | 96.40 |
96
+ | `MORPH_ACC` | 0.00 |
97
+ | `DEP_UAS` | 91.62 |
98
+ | `DEP_LAS` | 89.41 |
99
+ | `ENTS_P` | 71.63 |
100
+ | `ENTS_R` | 57.73 |
101
+ | `ENTS_F` | 63.93 |
102
+ | `SENTS_P` | 98.61 |
103
+ | `SENTS_R` | 98.80 |
104
+ | `SENTS_F` | 98.70 |
accuracy.json ADDED
@@ -0,0 +1,236 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "token_acc": 0.9968965945,
3
+ "tag_acc": 0.9721899386,
4
+ "pos_acc": 0.9639755682,
5
+ "morph_acc": 0.0,
6
+ "dep_uas": 0.916212877,
7
+ "dep_las": 0.894130554,
8
+ "ents_p": 0.7163232964,
9
+ "ents_r": 0.5772669221,
10
+ "ents_f": 0.639321075,
11
+ "sents_p": 0.9860557769,
12
+ "sents_r": 0.9880239521,
13
+ "sents_f": 0.9870388833,
14
+ "speed": 14745.6510174231,
15
+ "morph_per_feat": {
16
+ "Polarity": {
17
+ "p": 0.0,
18
+ "r": 0.0,
19
+ "f": 0.0
20
+ }
21
+ },
22
+ "dep_las_per_type": {
23
+ "cc": {
24
+ "p": 0.7659574468,
25
+ "r": 0.7826086957,
26
+ "f": 0.7741935484
27
+ },
28
+ "nummod": {
29
+ "p": 0.9730769231,
30
+ "r": 0.8694158076,
31
+ "f": 0.9183303085
32
+ },
33
+ "compound": {
34
+ "p": 0.9299242424,
35
+ "r": 0.9143389199,
36
+ "f": 0.9220657277
37
+ },
38
+ "obl": {
39
+ "p": 0.7782857143,
40
+ "r": 0.8185096154,
41
+ "f": 0.7978910369
42
+ },
43
+ "case": {
44
+ "p": 0.9808429119,
45
+ "r": 0.9774723177,
46
+ "f": 0.9791547141
47
+ },
48
+ "dislocated": {
49
+ "p": 0.4285714286,
50
+ "r": 0.3157894737,
51
+ "f": 0.3636363636
52
+ },
53
+ "nmod": {
54
+ "p": 0.8792650919,
55
+ "r": 0.8072289157,
56
+ "f": 0.8417085427
57
+ },
58
+ "nsubj": {
59
+ "p": 0.8020833333,
60
+ "r": 0.8020833333,
61
+ "f": 0.8020833333
62
+ },
63
+ "root": {
64
+ "p": 0.967611336,
65
+ "r": 0.9540918164,
66
+ "f": 0.9608040201
67
+ },
68
+ "aux": {
69
+ "p": 0.9545454545,
70
+ "r": 0.9586956522,
71
+ "f": 0.9566160521
72
+ },
73
+ "advcl": {
74
+ "p": 0.6402877698,
75
+ "r": 0.6223776224,
76
+ "f": 0.6312056738
77
+ },
78
+ "mark": {
79
+ "p": 0.953815261,
80
+ "r": 0.9350393701,
81
+ "f": 0.944333996
82
+ },
83
+ "acl": {
84
+ "p": 0.8161434978,
85
+ "r": 0.8017621145,
86
+ "f": 0.8088888889
87
+ },
88
+ "obj": {
89
+ "p": 0.9254658385,
90
+ "r": 0.9085365854,
91
+ "f": 0.9169230769
92
+ },
93
+ "fixed": {
94
+ "p": 0.9572192513,
95
+ "r": 0.9835164835,
96
+ "f": 0.9701897019
97
+ },
98
+ "advmod": {
99
+ "p": 0.724137931,
100
+ "r": 0.5,
101
+ "f": 0.5915492958
102
+ },
103
+ "amod": {
104
+ "p": 0.7941176471,
105
+ "r": 0.675,
106
+ "f": 0.7297297297
107
+ },
108
+ "cop": {
109
+ "p": 0.9482758621,
110
+ "r": 0.9065934066,
111
+ "f": 0.9269662921
112
+ },
113
+ "ccomp": {
114
+ "p": 0.9473684211,
115
+ "r": 0.8181818182,
116
+ "f": 0.8780487805
117
+ },
118
+ "det": {
119
+ "p": 0.9607843137,
120
+ "r": 0.9607843137,
121
+ "f": 0.9607843137
122
+ },
123
+ "csubj": {
124
+ "p": 0.8333333333,
125
+ "r": 0.7692307692,
126
+ "f": 0.8
127
+ },
128
+ "dep": {
129
+ "p": 0.0,
130
+ "r": 0.0,
131
+ "f": 0.0
132
+ }
133
+ },
134
+ "ents_per_type": {
135
+ "DATE": {
136
+ "p": 0.9272727273,
137
+ "r": 0.9444444444,
138
+ "f": 0.9357798165
139
+ },
140
+ "PERSON": {
141
+ "p": 0.6464646465,
142
+ "r": 0.4604316547,
143
+ "f": 0.5378151261
144
+ },
145
+ "GPE": {
146
+ "p": 0.6455696203,
147
+ "r": 0.5425531915,
148
+ "f": 0.5895953757
149
+ },
150
+ "PRODUCT": {
151
+ "p": 0.55,
152
+ "r": 0.2682926829,
153
+ "f": 0.3606557377
154
+ },
155
+ "TIME": {
156
+ "p": 0.6666666667,
157
+ "r": 1.0,
158
+ "f": 0.8
159
+ },
160
+ "QUANTITY": {
161
+ "p": 0.9090909091,
162
+ "r": 0.9090909091,
163
+ "f": 0.9090909091
164
+ },
165
+ "NORP": {
166
+ "p": 0.7826086957,
167
+ "r": 0.5625,
168
+ "f": 0.6545454545
169
+ },
170
+ "ORDINAL": {
171
+ "p": 0.5909090909,
172
+ "r": 0.6842105263,
173
+ "f": 0.6341463415
174
+ },
175
+ "TITLE_AFFIX": {
176
+ "p": 0.7647058824,
177
+ "r": 0.4333333333,
178
+ "f": 0.5531914894
179
+ },
180
+ "ORG": {
181
+ "p": 0.504950495,
182
+ "r": 0.3893129771,
183
+ "f": 0.4396551724
184
+ },
185
+ "WORK_OF_ART": {
186
+ "p": 0.7222222222,
187
+ "r": 0.7647058824,
188
+ "f": 0.7428571429
189
+ },
190
+ "PERCENT": {
191
+ "p": 1.0,
192
+ "r": 0.4285714286,
193
+ "f": 0.6
194
+ },
195
+ "EVENT": {
196
+ "p": 0.7619047619,
197
+ "r": 0.6153846154,
198
+ "f": 0.6808510638
199
+ },
200
+ "LOC": {
201
+ "p": 0.6363636364,
202
+ "r": 0.7,
203
+ "f": 0.6666666667
204
+ },
205
+ "FAC": {
206
+ "p": 0.6315789474,
207
+ "r": 0.3243243243,
208
+ "f": 0.4285714286
209
+ },
210
+ "MOVEMENT": {
211
+ "p": 0.0,
212
+ "r": 0.0,
213
+ "f": 0.0
214
+ },
215
+ "LAW": {
216
+ "p": 1.0,
217
+ "r": 0.3333333333,
218
+ "f": 0.5
219
+ },
220
+ "MONEY": {
221
+ "p": 1.0,
222
+ "r": 1.0,
223
+ "f": 1.0
224
+ },
225
+ "LANGUAGE": {
226
+ "p": 1.0,
227
+ "r": 1.0,
228
+ "f": 1.0
229
+ },
230
+ "CARDINAL": {
231
+ "p": 0.0,
232
+ "r": 0.0,
233
+ "f": 0.0
234
+ }
235
+ }
236
+ }
attribute_ruler/patterns ADDED
Binary file (64 Bytes). View file
 
config.cfg ADDED
@@ -0,0 +1,233 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [paths]
2
+ train = "corpus/ja-core-news/train.spacy"
3
+ dev = "corpus/ja-core-news/dev.spacy"
4
+ vectors = null
5
+ raw = null
6
+ init_tok2vec = null
7
+ vocab_data = null
8
+
9
+ [system]
10
+ gpu_allocator = null
11
+ seed = 0
12
+
13
+ [nlp]
14
+ lang = "ja"
15
+ pipeline = ["tok2vec","parser","senter","attribute_ruler","ner"]
16
+ disabled = ["senter"]
17
+ before_creation = null
18
+ after_creation = null
19
+ after_pipeline_creation = null
20
+ batch_size = 256
21
+
22
+ [nlp.tokenizer]
23
+ @tokenizers = "spacy.ja.JapaneseTokenizer"
24
+ split_mode = null
25
+
26
+ [components]
27
+
28
+ [components.attribute_ruler]
29
+ factory = "attribute_ruler"
30
+ validate = false
31
+
32
+ [components.ner]
33
+ factory = "ner"
34
+ incorrect_spans_key = null
35
+ moves = null
36
+ update_with_oracle_cut_size = 100
37
+
38
+ [components.ner.model]
39
+ @architectures = "spacy.TransitionBasedParser.v2"
40
+ state_type = "ner"
41
+ extra_state_tokens = false
42
+ hidden_width = 64
43
+ maxout_pieces = 2
44
+ use_upper = true
45
+ nO = null
46
+
47
+ [components.ner.model.tok2vec]
48
+ @architectures = "spacy.Tok2Vec.v2"
49
+
50
+ [components.ner.model.tok2vec.embed]
51
+ @architectures = "spacy.MultiHashEmbed.v2"
52
+ width = 96
53
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
54
+ rows = [5000,2500,2500,2500]
55
+ include_static_vectors = false
56
+
57
+ [components.ner.model.tok2vec.encode]
58
+ @architectures = "spacy.MaxoutWindowEncoder.v2"
59
+ width = 96
60
+ depth = 4
61
+ window_size = 1
62
+ maxout_pieces = 3
63
+
64
+ [components.parser]
65
+ factory = "parser"
66
+ learn_tokens = false
67
+ min_action_freq = 30
68
+ moves = null
69
+ update_with_oracle_cut_size = 100
70
+
71
+ [components.parser.model]
72
+ @architectures = "spacy.TransitionBasedParser.v2"
73
+ state_type = "parser"
74
+ extra_state_tokens = false
75
+ hidden_width = 64
76
+ maxout_pieces = 2
77
+ use_upper = true
78
+ nO = null
79
+
80
+ [components.parser.model.tok2vec]
81
+ @architectures = "spacy.Tok2VecListener.v1"
82
+ width = ${components.tok2vec.model.encode:width}
83
+ upstream = "tok2vec"
84
+
85
+ [components.senter]
86
+ factory = "senter"
87
+
88
+ [components.senter.model]
89
+ @architectures = "spacy.Tagger.v1"
90
+ nO = null
91
+
92
+ [components.senter.model.tok2vec]
93
+ @architectures = "spacy.Tok2Vec.v2"
94
+
95
+ [components.senter.model.tok2vec.embed]
96
+ @architectures = "spacy.MultiHashEmbed.v2"
97
+ width = 16
98
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
99
+ rows = [1000,500,500,500]
100
+ include_static_vectors = false
101
+
102
+ [components.senter.model.tok2vec.encode]
103
+ @architectures = "spacy.MaxoutWindowEncoder.v2"
104
+ width = 16
105
+ depth = 2
106
+ window_size = 1
107
+ maxout_pieces = 2
108
+
109
+ [components.tok2vec]
110
+ factory = "tok2vec"
111
+
112
+ [components.tok2vec.model]
113
+ @architectures = "spacy.Tok2Vec.v2"
114
+
115
+ [components.tok2vec.model.embed]
116
+ @architectures = "spacy.MultiHashEmbed.v2"
117
+ width = ${components.tok2vec.model.encode:width}
118
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
119
+ rows = [5000,2500,2500,2500]
120
+ include_static_vectors = false
121
+
122
+ [components.tok2vec.model.encode]
123
+ @architectures = "spacy.MaxoutWindowEncoder.v2"
124
+ width = 96
125
+ depth = 4
126
+ window_size = 1
127
+ maxout_pieces = 3
128
+
129
+ [corpora]
130
+
131
+ [corpora.dev]
132
+ @readers = "spacy.Corpus.v1"
133
+ limit = 0
134
+ max_length = 0
135
+ path = ${paths:dev}
136
+ gold_preproc = false
137
+ augmenter = null
138
+
139
+ [corpora.train]
140
+ @readers = "spacy.Corpus.v1"
141
+ path = ${paths:train}
142
+ max_length = 5000
143
+ gold_preproc = false
144
+ limit = 0
145
+ augmenter = null
146
+
147
+ [training]
148
+ train_corpus = "corpora.train"
149
+ dev_corpus = "corpora.dev"
150
+ seed = ${system:seed}
151
+ gpu_allocator = ${system:gpu_allocator}
152
+ dropout = 0.1
153
+ accumulate_gradient = 1
154
+ patience = 5000
155
+ max_epochs = 0
156
+ max_steps = 0
157
+ eval_frequency = 1000
158
+ frozen_components = []
159
+ before_to_disk = null
160
+ annotating_components = []
161
+
162
+ [training.batcher]
163
+ @batchers = "spacy.batch_by_words.v1"
164
+ discard_oversize = false
165
+ tolerance = 0.2
166
+ get_length = null
167
+
168
+ [training.batcher.size]
169
+ @schedules = "compounding.v1"
170
+ start = 100
171
+ stop = 1000
172
+ compound = 1.001
173
+ t = 0.0
174
+
175
+ [training.logger]
176
+ @loggers = "spacy.WandbLogger.v1"
177
+ project_name = "spacy-v3.0.0a2"
178
+ remove_config_values = []
179
+
180
+ [training.optimizer]
181
+ @optimizers = "Adam.v1"
182
+ beta1 = 0.9
183
+ beta2 = 0.999
184
+ L2_is_weight_decay = true
185
+ L2 = 0.01
186
+ grad_clip = 1.0
187
+ use_averages = true
188
+ eps = 0.00000001
189
+ learn_rate = 0.001
190
+
191
+ [training.score_weights]
192
+ dep_uas = 0.0
193
+ dep_las = 0.45
194
+ dep_las_per_type = null
195
+ sents_p = null
196
+ sents_r = null
197
+ sents_f = 0.06
198
+ ents_f = 0.5
199
+ ents_p = 0.0
200
+ ents_r = 0.0
201
+ ents_per_type = null
202
+
203
+ [pretraining]
204
+
205
+ [initialize]
206
+ vocab_data = ${paths.vocab_data}
207
+ vectors = ${paths.vectors}
208
+ init_tok2vec = ${paths.init_tok2vec}
209
+ before_init = null
210
+ after_init = null
211
+
212
+ [initialize.components]
213
+
214
+ [initialize.components.ner]
215
+
216
+ [initialize.components.ner.labels]
217
+ @readers = "spacy.read_labels.v1"
218
+ path = "corpus/labels/ner.json"
219
+ require = false
220
+
221
+ [initialize.components.parser]
222
+
223
+ [initialize.components.parser.labels]
224
+ @readers = "spacy.read_labels.v1"
225
+ path = "corpus/labels/parser.json"
226
+ require = false
227
+
228
+ [initialize.lookups]
229
+ @misc = "spacy.LookupsDataLoader.v1"
230
+ lang = ${nlp.lang}
231
+ tables = []
232
+
233
+ [initialize.tokenizer]
ja_core_news_sm-any-py3-none-any.whl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b45400d19cd9682fb40946402532000ad81164abbaff6828abb012929ea918c8
3
+ size 12941362
meta.json ADDED
@@ -0,0 +1,349 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "lang":"ja",
3
+ "name":"core_news_sm",
4
+ "version":"3.1.0",
5
+ "description":"Japanese pipeline optimized for CPU. Components: tok2vec, parser, senter, ner, attribute_ruler.",
6
+ "author":"Explosion",
7
+ "email":"contact@explosion.ai",
8
+ "url":"https://explosion.ai",
9
+ "license":"CC BY-SA 4.0",
10
+ "spacy_version":">=3.1.0,<3.2.0",
11
+ "spacy_git_version":"caba63b74",
12
+ "vectors":{
13
+ "width":0,
14
+ "vectors":0,
15
+ "keys":0,
16
+ "name":null
17
+ },
18
+ "labels":{
19
+ "tok2vec":[
20
+
21
+ ],
22
+ "parser":[
23
+ "ROOT",
24
+ "acl",
25
+ "advcl",
26
+ "advmod",
27
+ "amod",
28
+ "aux",
29
+ "case",
30
+ "cc",
31
+ "ccomp",
32
+ "compound",
33
+ "cop",
34
+ "csubj",
35
+ "dep",
36
+ "det",
37
+ "dislocated",
38
+ "fixed",
39
+ "mark",
40
+ "nmod",
41
+ "nsubj",
42
+ "nummod",
43
+ "obj",
44
+ "obl",
45
+ "punct"
46
+ ],
47
+ "senter":[
48
+ "I",
49
+ "S"
50
+ ],
51
+ "attribute_ruler":[
52
+
53
+ ],
54
+ "ner":[
55
+ "CARDINAL",
56
+ "DATE",
57
+ "EVENT",
58
+ "FAC",
59
+ "GPE",
60
+ "LANGUAGE",
61
+ "LAW",
62
+ "LOC",
63
+ "MONEY",
64
+ "MOVEMENT",
65
+ "NORP",
66
+ "ORDINAL",
67
+ "ORG",
68
+ "PERCENT",
69
+ "PERSON",
70
+ "PET_NAME",
71
+ "PHONE",
72
+ "PRODUCT",
73
+ "QUANTITY",
74
+ "TIME",
75
+ "TITLE_AFFIX",
76
+ "WORK_OF_ART"
77
+ ]
78
+ },
79
+ "pipeline":[
80
+ "tok2vec",
81
+ "parser",
82
+ "attribute_ruler",
83
+ "ner"
84
+ ],
85
+ "components":[
86
+ "tok2vec",
87
+ "parser",
88
+ "senter",
89
+ "attribute_ruler",
90
+ "ner"
91
+ ],
92
+ "disabled":[
93
+ "senter"
94
+ ],
95
+ "performance":{
96
+ "token_acc":0.9968965945,
97
+ "tag_acc":0.9721899386,
98
+ "pos_acc":0.9639755682,
99
+ "morph_acc":0.0,
100
+ "dep_uas":0.916212877,
101
+ "dep_las":0.894130554,
102
+ "ents_p":0.7163232964,
103
+ "ents_r":0.5772669221,
104
+ "ents_f":0.639321075,
105
+ "sents_p":0.9860557769,
106
+ "sents_r":0.9880239521,
107
+ "sents_f":0.9870388833,
108
+ "speed":14745.6510174231,
109
+ "morph_per_feat":{
110
+ "Polarity":{
111
+ "p":0.0,
112
+ "r":0.0,
113
+ "f":0.0
114
+ }
115
+ },
116
+ "dep_las_per_type":{
117
+ "cc":{
118
+ "p":0.7659574468,
119
+ "r":0.7826086957,
120
+ "f":0.7741935484
121
+ },
122
+ "nummod":{
123
+ "p":0.9730769231,
124
+ "r":0.8694158076,
125
+ "f":0.9183303085
126
+ },
127
+ "compound":{
128
+ "p":0.9299242424,
129
+ "r":0.9143389199,
130
+ "f":0.9220657277
131
+ },
132
+ "obl":{
133
+ "p":0.7782857143,
134
+ "r":0.8185096154,
135
+ "f":0.7978910369
136
+ },
137
+ "case":{
138
+ "p":0.9808429119,
139
+ "r":0.9774723177,
140
+ "f":0.9791547141
141
+ },
142
+ "dislocated":{
143
+ "p":0.4285714286,
144
+ "r":0.3157894737,
145
+ "f":0.3636363636
146
+ },
147
+ "nmod":{
148
+ "p":0.8792650919,
149
+ "r":0.8072289157,
150
+ "f":0.8417085427
151
+ },
152
+ "nsubj":{
153
+ "p":0.8020833333,
154
+ "r":0.8020833333,
155
+ "f":0.8020833333
156
+ },
157
+ "root":{
158
+ "p":0.967611336,
159
+ "r":0.9540918164,
160
+ "f":0.9608040201
161
+ },
162
+ "aux":{
163
+ "p":0.9545454545,
164
+ "r":0.9586956522,
165
+ "f":0.9566160521
166
+ },
167
+ "advcl":{
168
+ "p":0.6402877698,
169
+ "r":0.6223776224,
170
+ "f":0.6312056738
171
+ },
172
+ "mark":{
173
+ "p":0.953815261,
174
+ "r":0.9350393701,
175
+ "f":0.944333996
176
+ },
177
+ "acl":{
178
+ "p":0.8161434978,
179
+ "r":0.8017621145,
180
+ "f":0.8088888889
181
+ },
182
+ "obj":{
183
+ "p":0.9254658385,
184
+ "r":0.9085365854,
185
+ "f":0.9169230769
186
+ },
187
+ "fixed":{
188
+ "p":0.9572192513,
189
+ "r":0.9835164835,
190
+ "f":0.9701897019
191
+ },
192
+ "advmod":{
193
+ "p":0.724137931,
194
+ "r":0.5,
195
+ "f":0.5915492958
196
+ },
197
+ "amod":{
198
+ "p":0.7941176471,
199
+ "r":0.675,
200
+ "f":0.7297297297
201
+ },
202
+ "cop":{
203
+ "p":0.9482758621,
204
+ "r":0.9065934066,
205
+ "f":0.9269662921
206
+ },
207
+ "ccomp":{
208
+ "p":0.9473684211,
209
+ "r":0.8181818182,
210
+ "f":0.8780487805
211
+ },
212
+ "det":{
213
+ "p":0.9607843137,
214
+ "r":0.9607843137,
215
+ "f":0.9607843137
216
+ },
217
+ "csubj":{
218
+ "p":0.8333333333,
219
+ "r":0.7692307692,
220
+ "f":0.8
221
+ },
222
+ "dep":{
223
+ "p":0.0,
224
+ "r":0.0,
225
+ "f":0.0
226
+ }
227
+ },
228
+ "ents_per_type":{
229
+ "DATE":{
230
+ "p":0.9272727273,
231
+ "r":0.9444444444,
232
+ "f":0.9357798165
233
+ },
234
+ "PERSON":{
235
+ "p":0.6464646465,
236
+ "r":0.4604316547,
237
+ "f":0.5378151261
238
+ },
239
+ "GPE":{
240
+ "p":0.6455696203,
241
+ "r":0.5425531915,
242
+ "f":0.5895953757
243
+ },
244
+ "PRODUCT":{
245
+ "p":0.55,
246
+ "r":0.2682926829,
247
+ "f":0.3606557377
248
+ },
249
+ "TIME":{
250
+ "p":0.6666666667,
251
+ "r":1.0,
252
+ "f":0.8
253
+ },
254
+ "QUANTITY":{
255
+ "p":0.9090909091,
256
+ "r":0.9090909091,
257
+ "f":0.9090909091
258
+ },
259
+ "NORP":{
260
+ "p":0.7826086957,
261
+ "r":0.5625,
262
+ "f":0.6545454545
263
+ },
264
+ "ORDINAL":{
265
+ "p":0.5909090909,
266
+ "r":0.6842105263,
267
+ "f":0.6341463415
268
+ },
269
+ "TITLE_AFFIX":{
270
+ "p":0.7647058824,
271
+ "r":0.4333333333,
272
+ "f":0.5531914894
273
+ },
274
+ "ORG":{
275
+ "p":0.504950495,
276
+ "r":0.3893129771,
277
+ "f":0.4396551724
278
+ },
279
+ "WORK_OF_ART":{
280
+ "p":0.7222222222,
281
+ "r":0.7647058824,
282
+ "f":0.7428571429
283
+ },
284
+ "PERCENT":{
285
+ "p":1.0,
286
+ "r":0.4285714286,
287
+ "f":0.6
288
+ },
289
+ "EVENT":{
290
+ "p":0.7619047619,
291
+ "r":0.6153846154,
292
+ "f":0.6808510638
293
+ },
294
+ "LOC":{
295
+ "p":0.6363636364,
296
+ "r":0.7,
297
+ "f":0.6666666667
298
+ },
299
+ "FAC":{
300
+ "p":0.6315789474,
301
+ "r":0.3243243243,
302
+ "f":0.4285714286
303
+ },
304
+ "MOVEMENT":{
305
+ "p":0.0,
306
+ "r":0.0,
307
+ "f":0.0
308
+ },
309
+ "LAW":{
310
+ "p":1.0,
311
+ "r":0.3333333333,
312
+ "f":0.5
313
+ },
314
+ "MONEY":{
315
+ "p":1.0,
316
+ "r":1.0,
317
+ "f":1.0
318
+ },
319
+ "LANGUAGE":{
320
+ "p":1.0,
321
+ "r":1.0,
322
+ "f":1.0
323
+ },
324
+ "CARDINAL":{
325
+ "p":0.0,
326
+ "r":0.0,
327
+ "f":0.0
328
+ }
329
+ }
330
+ },
331
+ "sources":[
332
+ {
333
+ "name":"UD Japanese GSD v2.6",
334
+ "url":"https://github.com/UniversalDependencies/UD_Japanese-GSD",
335
+ "license":"CC BY-SA 4.0",
336
+ "author":"Omura, Mai; Miyao, Yusuke; Kanayama, Hiroshi; Matsuda, Hiroshi; Wakasa, Aya; Yamashita, Kayo; Asahara, Masayuki; Tanaka, Takaaki; Murawaki, Yugo; Matsumoto, Yuji; Mori, Shinsuke; Uematsu, Sumire; McDonald, Ryan; Nivre, Joakim; Zeman, Daniel"
337
+ },
338
+ {
339
+ "name":"UD Japanese GSD v2.6 NER",
340
+ "url":"https://github.com/megagonlabs/UD_Japanese-GSD",
341
+ "license":"CC BY-SA 4.0",
342
+ "author":"Megagon Labs Tokyo"
343
+ }
344
+ ],
345
+ "requirements":[
346
+ "sudachipy>=0.4.9",
347
+ "sudachidict-core>=20200330"
348
+ ]
349
+ }
ner/cfg ADDED
@@ -0,0 +1,13 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "moves":null,
3
+ "update_with_oracle_cut_size":100,
4
+ "multitasks":[
5
+
6
+ ],
7
+ "min_action_freq":1,
8
+ "learn_tokens":false,
9
+ "beam_width":1,
10
+ "beam_density":0.0,
11
+ "beam_update_prob":0.0,
12
+ "incorrect_spans_key":null
13
+ }
ner/model ADDED
Binary file (6.73 MB). View file
 
ner/moves ADDED
@@ -0,0 +1 @@
 
 
1
+ ��moves��{"0":{},"1":{"DATE":4112,"ORG":3465,"PERSON":2992,"QUANTITY":2502,"GPE":1927,"PRODUCT":1317,"FAC":1230,"ORDINAL":1095,"WORK_OF_ART":1022,"EVENT":865,"NORP":732,"LOC":557,"MONEY":400,"TITLE_AFFIX":343,"TIME":294,"PERCENT":272,"MOVEMENT":148,"LAW":94,"LANGUAGE":78,"CARDINAL":27,"PET_NAME":19,"PHONE":4},"2":{"DATE":4112,"ORG":3465,"PERSON":2992,"QUANTITY":2502,"GPE":1927,"PRODUCT":1317,"FAC":1230,"ORDINAL":1095,"WORK_OF_ART":1022,"EVENT":865,"NORP":732,"LOC":557,"MONEY":400,"TITLE_AFFIX":343,"TIME":294,"PERCENT":272,"MOVEMENT":148,"LAW":94,"LANGUAGE":78,"CARDINAL":27,"PET_NAME":19,"PHONE":4},"3":{"DATE":4112,"ORG":3465,"PERSON":2992,"QUANTITY":2502,"GPE":1927,"PRODUCT":1317,"FAC":1230,"ORDINAL":1095,"WORK_OF_ART":1022,"EVENT":865,"NORP":732,"LOC":557,"MONEY":400,"TITLE_AFFIX":343,"TIME":294,"PERCENT":272,"MOVEMENT":148,"LAW":94,"LANGUAGE":78,"CARDINAL":27,"PET_NAME":19,"PHONE":4},"4":{"DATE":4112,"ORG":3465,"PERSON":2992,"QUANTITY":2502,"GPE":1927,"PRODUCT":1317,"FAC":1230,"ORDINAL":1095,"WORK_OF_ART":1022,"EVENT":865,"NORP":732,"LOC":557,"MONEY":400,"TITLE_AFFIX":343,"TIME":294,"PERCENT":272,"MOVEMENT":148,"LAW":94,"LANGUAGE":78,"CARDINAL":27,"PET_NAME":19,"PHONE":4,"":1},"5":{"":1}}�cfg��neg_key�
parser/cfg ADDED
@@ -0,0 +1,13 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "moves":null,
3
+ "update_with_oracle_cut_size":100,
4
+ "multitasks":[
5
+
6
+ ],
7
+ "min_action_freq":30,
8
+ "learn_tokens":false,
9
+ "beam_width":1,
10
+ "beam_density":0.0,
11
+ "beam_update_prob":0.0,
12
+ "incorrect_spans_key":null
13
+ }
parser/model ADDED
Binary file (300 kB). View file
 
parser/moves ADDED
@@ -0,0 +1 @@
 
 
1
+ ��moves�q{"0":{"":75008},"1":{"":80671},"2":{"compound":20642,"obl":11201,"nmod":11139,"nsubj":6348,"acl":6215,"advcl":6023,"obj":4334,"nummod":3800,"advmod":1393,"punct":1249,"det":813,"cc":695,"amod":366,"ccomp":327,"dislocated":266,"csubj":159,"dep":0},"3":{"case":35563,"aux":18454,"punct":14888,"mark":6577,"fixed":2698,"cop":2198,"compound":248,"dep":0},"4":{"ROOT":6787}}�cfg��neg_key�
senter/cfg ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ {
2
+
3
+ }
senter/model ADDED
Binary file (190 kB). View file
 
tok2vec/cfg ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ {
2
+
3
+ }
tok2vec/model ADDED
Binary file (6.59 MB). View file
 
tokenizer/cfg ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ {
2
+ "split_mode":null
3
+ }
vocab/key2row ADDED
@@ -0,0 +1 @@
 
 
1
+
vocab/lookups.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:76be8b528d0075f7aae98d6fa57a6d3c83ae480a8469e668d7b0af968995ac71
3
+ size 1
vocab/strings.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cf36b06704e078e9c477ce1b341ece677cf7cf1a1c059de6d37cc9e50bb987c5
3
+ size 610901
vocab/vectors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:14772b683e726436d5948ad3fff2b43d036ef2ebbe3458aafed6004e05a40706
3
+ size 128