osanseviero HF staff commited on
Commit
822433e
1 Parent(s): f5c2579

Update spaCy pipeline

Browse files
.gitattributes CHANGED
@@ -14,3 +14,7 @@
14
  *.pb filter=lfs diff=lfs merge=lfs -text
15
  *.pt filter=lfs diff=lfs merge=lfs -text
16
  *.pth filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
14
  *.pb filter=lfs diff=lfs merge=lfs -text
15
  *.pt filter=lfs diff=lfs merge=lfs -text
16
  *.pth filter=lfs diff=lfs merge=lfs -text
17
+ *.whl filter=lfs diff=lfs merge=lfs -text
18
+ *.npz filter=lfs diff=lfs merge=lfs -text
19
+ *strings.json filter=lfs diff=lfs merge=lfs -text
20
+ vectors filter=lfs diff=lfs merge=lfs -text
LICENSE ADDED
@@ -0,0 +1,428 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Attribution-ShareAlike 4.0 International
2
+
3
+ =======================================================================
4
+
5
+ Creative Commons Corporation ("Creative Commons") is not a law firm and
6
+ does not provide legal services or legal advice. Distribution of
7
+ Creative Commons public licenses does not create a lawyer-client or
8
+ other relationship. Creative Commons makes its licenses and related
9
+ information available on an "as-is" basis. Creative Commons gives no
10
+ warranties regarding its licenses, any material licensed under their
11
+ terms and conditions, or any related information. Creative Commons
12
+ disclaims all liability for damages resulting from their use to the
13
+ fullest extent possible.
14
+
15
+ Using Creative Commons Public Licenses
16
+
17
+ Creative Commons public licenses provide a standard set of terms and
18
+ conditions that creators and other rights holders may use to share
19
+ original works of authorship and other material subject to copyright
20
+ and certain other rights specified in the public license below. The
21
+ following considerations are for informational purposes only, are not
22
+ exhaustive, and do not form part of our licenses.
23
+
24
+ Considerations for licensors: Our public licenses are
25
+ intended for use by those authorized to give the public
26
+ permission to use material in ways otherwise restricted by
27
+ copyright and certain other rights. Our licenses are
28
+ irrevocable. Licensors should read and understand the terms
29
+ and conditions of the license they choose before applying it.
30
+ Licensors should also secure all rights necessary before
31
+ applying our licenses so that the public can reuse the
32
+ material as expected. Licensors should clearly mark any
33
+ material not subject to the license. This includes other CC-
34
+ licensed material, or material used under an exception or
35
+ limitation to copyright. More considerations for licensors:
36
+ wiki.creativecommons.org/Considerations_for_licensors
37
+
38
+ Considerations for the public: By using one of our public
39
+ licenses, a licensor grants the public permission to use the
40
+ licensed material under specified terms and conditions. If
41
+ the licensor's permission is not necessary for any reason--for
42
+ example, because of any applicable exception or limitation to
43
+ copyright--then that use is not regulated by the license. Our
44
+ licenses grant only permissions under copyright and certain
45
+ other rights that a licensor has authority to grant. Use of
46
+ the licensed material may still be restricted for other
47
+ reasons, including because others have copyright or other
48
+ rights in the material. A licensor may make special requests,
49
+ such as asking that all changes be marked or described.
50
+ Although not required by our licenses, you are encouraged to
51
+ respect those requests where reasonable. More considerations
52
+ for the public:
53
+ wiki.creativecommons.org/Considerations_for_licensees
54
+
55
+ =======================================================================
56
+
57
+ Creative Commons Attribution-ShareAlike 4.0 International Public
58
+ License
59
+
60
+ By exercising the Licensed Rights (defined below), You accept and agree
61
+ to be bound by the terms and conditions of this Creative Commons
62
+ Attribution-ShareAlike 4.0 International Public License ("Public
63
+ License"). To the extent this Public License may be interpreted as a
64
+ contract, You are granted the Licensed Rights in consideration of Your
65
+ acceptance of these terms and conditions, and the Licensor grants You
66
+ such rights in consideration of benefits the Licensor receives from
67
+ making the Licensed Material available under these terms and
68
+ conditions.
69
+
70
+
71
+ Section 1 -- Definitions.
72
+
73
+ a. Adapted Material means material subject to Copyright and Similar
74
+ Rights that is derived from or based upon the Licensed Material
75
+ and in which the Licensed Material is translated, altered,
76
+ arranged, transformed, or otherwise modified in a manner requiring
77
+ permission under the Copyright and Similar Rights held by the
78
+ Licensor. For purposes of this Public License, where the Licensed
79
+ Material is a musical work, performance, or sound recording,
80
+ Adapted Material is always produced where the Licensed Material is
81
+ synched in timed relation with a moving image.
82
+
83
+ b. Adapter's License means the license You apply to Your Copyright
84
+ and Similar Rights in Your contributions to Adapted Material in
85
+ accordance with the terms and conditions of this Public License.
86
+
87
+ c. BY-SA Compatible License means a license listed at
88
+ creativecommons.org/compatiblelicenses, approved by Creative
89
+ Commons as essentially the equivalent of this Public License.
90
+
91
+ d. Copyright and Similar Rights means copyright and/or similar rights
92
+ closely related to copyright including, without limitation,
93
+ performance, broadcast, sound recording, and Sui Generis Database
94
+ Rights, without regard to how the rights are labeled or
95
+ categorized. For purposes of this Public License, the rights
96
+ specified in Section 2(b)(1)-(2) are not Copyright and Similar
97
+ Rights.
98
+
99
+ e. Effective Technological Measures means those measures that, in the
100
+ absence of proper authority, may not be circumvented under laws
101
+ fulfilling obligations under Article 11 of the WIPO Copyright
102
+ Treaty adopted on December 20, 1996, and/or similar international
103
+ agreements.
104
+
105
+ f. Exceptions and Limitations means fair use, fair dealing, and/or
106
+ any other exception or limitation to Copyright and Similar Rights
107
+ that applies to Your use of the Licensed Material.
108
+
109
+ g. License Elements means the license attributes listed in the name
110
+ of a Creative Commons Public License. The License Elements of this
111
+ Public License are Attribution and ShareAlike.
112
+
113
+ h. Licensed Material means the artistic or literary work, database,
114
+ or other material to which the Licensor applied this Public
115
+ License.
116
+
117
+ i. Licensed Rights means the rights granted to You subject to the
118
+ terms and conditions of this Public License, which are limited to
119
+ all Copyright and Similar Rights that apply to Your use of the
120
+ Licensed Material and that the Licensor has authority to license.
121
+
122
+ j. Licensor means the individual(s) or entity(ies) granting rights
123
+ under this Public License.
124
+
125
+ k. Share means to provide material to the public by any means or
126
+ process that requires permission under the Licensed Rights, such
127
+ as reproduction, public display, public performance, distribution,
128
+ dissemination, communication, or importation, and to make material
129
+ available to the public including in ways that members of the
130
+ public may access the material from a place and at a time
131
+ individually chosen by them.
132
+
133
+ l. Sui Generis Database Rights means rights other than copyright
134
+ resulting from Directive 96/9/EC of the European Parliament and of
135
+ the Council of 11 March 1996 on the legal protection of databases,
136
+ as amended and/or succeeded, as well as other essentially
137
+ equivalent rights anywhere in the world.
138
+
139
+ m. You means the individual or entity exercising the Licensed Rights
140
+ under this Public License. Your has a corresponding meaning.
141
+
142
+
143
+ Section 2 -- Scope.
144
+
145
+ a. License grant.
146
+
147
+ 1. Subject to the terms and conditions of this Public License,
148
+ the Licensor hereby grants You a worldwide, royalty-free,
149
+ non-sublicensable, non-exclusive, irrevocable license to
150
+ exercise the Licensed Rights in the Licensed Material to:
151
+
152
+ a. reproduce and Share the Licensed Material, in whole or
153
+ in part; and
154
+
155
+ b. produce, reproduce, and Share Adapted Material.
156
+
157
+ 2. Exceptions and Limitations. For the avoidance of doubt, where
158
+ Exceptions and Limitations apply to Your use, this Public
159
+ License does not apply, and You do not need to comply with
160
+ its terms and conditions.
161
+
162
+ 3. Term. The term of this Public License is specified in Section
163
+ 6(a).
164
+
165
+ 4. Media and formats; technical modifications allowed. The
166
+ Licensor authorizes You to exercise the Licensed Rights in
167
+ all media and formats whether now known or hereafter created,
168
+ and to make technical modifications necessary to do so. The
169
+ Licensor waives and/or agrees not to assert any right or
170
+ authority to forbid You from making technical modifications
171
+ necessary to exercise the Licensed Rights, including
172
+ technical modifications necessary to circumvent Effective
173
+ Technological Measures. For purposes of this Public License,
174
+ simply making modifications authorized by this Section 2(a)
175
+ (4) never produces Adapted Material.
176
+
177
+ 5. Downstream recipients.
178
+
179
+ a. Offer from the Licensor -- Licensed Material. Every
180
+ recipient of the Licensed Material automatically
181
+ receives an offer from the Licensor to exercise the
182
+ Licensed Rights under the terms and conditions of this
183
+ Public License.
184
+
185
+ b. Additional offer from the Licensor -- Adapted Material.
186
+ Every recipient of Adapted Material from You
187
+ automatically receives an offer from the Licensor to
188
+ exercise the Licensed Rights in the Adapted Material
189
+ under the conditions of the Adapter's License You apply.
190
+
191
+ c. No downstream restrictions. You may not offer or impose
192
+ any additional or different terms or conditions on, or
193
+ apply any Effective Technological Measures to, the
194
+ Licensed Material if doing so restricts exercise of the
195
+ Licensed Rights by any recipient of the Licensed
196
+ Material.
197
+
198
+ 6. No endorsement. Nothing in this Public License constitutes or
199
+ may be construed as permission to assert or imply that You
200
+ are, or that Your use of the Licensed Material is, connected
201
+ with, or sponsored, endorsed, or granted official status by,
202
+ the Licensor or others designated to receive attribution as
203
+ provided in Section 3(a)(1)(A)(i).
204
+
205
+ b. Other rights.
206
+
207
+ 1. Moral rights, such as the right of integrity, are not
208
+ licensed under this Public License, nor are publicity,
209
+ privacy, and/or other similar personality rights; however, to
210
+ the extent possible, the Licensor waives and/or agrees not to
211
+ assert any such rights held by the Licensor to the limited
212
+ extent necessary to allow You to exercise the Licensed
213
+ Rights, but not otherwise.
214
+
215
+ 2. Patent and trademark rights are not licensed under this
216
+ Public License.
217
+
218
+ 3. To the extent possible, the Licensor waives any right to
219
+ collect royalties from You for the exercise of the Licensed
220
+ Rights, whether directly or through a collecting society
221
+ under any voluntary or waivable statutory or compulsory
222
+ licensing scheme. In all other cases the Licensor expressly
223
+ reserves any right to collect such royalties.
224
+
225
+
226
+ Section 3 -- License Conditions.
227
+
228
+ Your exercise of the Licensed Rights is expressly made subject to the
229
+ following conditions.
230
+
231
+ a. Attribution.
232
+
233
+ 1. If You Share the Licensed Material (including in modified
234
+ form), You must:
235
+
236
+ a. retain the following if it is supplied by the Licensor
237
+ with the Licensed Material:
238
+
239
+ i. identification of the creator(s) of the Licensed
240
+ Material and any others designated to receive
241
+ attribution, in any reasonable manner requested by
242
+ the Licensor (including by pseudonym if
243
+ designated);
244
+
245
+ ii. a copyright notice;
246
+
247
+ iii. a notice that refers to this Public License;
248
+
249
+ iv. a notice that refers to the disclaimer of
250
+ warranties;
251
+
252
+ v. a URI or hyperlink to the Licensed Material to the
253
+ extent reasonably practicable;
254
+
255
+ b. indicate if You modified the Licensed Material and
256
+ retain an indication of any previous modifications; and
257
+
258
+ c. indicate the Licensed Material is licensed under this
259
+ Public License, and include the text of, or the URI or
260
+ hyperlink to, this Public License.
261
+
262
+ 2. You may satisfy the conditions in Section 3(a)(1) in any
263
+ reasonable manner based on the medium, means, and context in
264
+ which You Share the Licensed Material. For example, it may be
265
+ reasonable to satisfy the conditions by providing a URI or
266
+ hyperlink to a resource that includes the required
267
+ information.
268
+
269
+ 3. If requested by the Licensor, You must remove any of the
270
+ information required by Section 3(a)(1)(A) to the extent
271
+ reasonably practicable.
272
+
273
+ b. ShareAlike.
274
+
275
+ In addition to the conditions in Section 3(a), if You Share
276
+ Adapted Material You produce, the following conditions also apply.
277
+
278
+ 1. The Adapter's License You apply must be a Creative Commons
279
+ license with the same License Elements, this version or
280
+ later, or a BY-SA Compatible License.
281
+
282
+ 2. You must include the text of, or the URI or hyperlink to, the
283
+ Adapter's License You apply. You may satisfy this condition
284
+ in any reasonable manner based on the medium, means, and
285
+ context in which You Share Adapted Material.
286
+
287
+ 3. You may not offer or impose any additional or different terms
288
+ or conditions on, or apply any Effective Technological
289
+ Measures to, Adapted Material that restrict exercise of the
290
+ rights granted under the Adapter's License You apply.
291
+
292
+
293
+ Section 4 -- Sui Generis Database Rights.
294
+
295
+ Where the Licensed Rights include Sui Generis Database Rights that
296
+ apply to Your use of the Licensed Material:
297
+
298
+ a. for the avoidance of doubt, Section 2(a)(1) grants You the right
299
+ to extract, reuse, reproduce, and Share all or a substantial
300
+ portion of the contents of the database;
301
+
302
+ b. if You include all or a substantial portion of the database
303
+ contents in a database in which You have Sui Generis Database
304
+ Rights, then the database in which You have Sui Generis Database
305
+ Rights (but not its individual contents) is Adapted Material,
306
+
307
+ including for purposes of Section 3(b); and
308
+ c. You must comply with the conditions in Section 3(a) if You Share
309
+ all or a substantial portion of the contents of the database.
310
+
311
+ For the avoidance of doubt, this Section 4 supplements and does not
312
+ replace Your obligations under this Public License where the Licensed
313
+ Rights include other Copyright and Similar Rights.
314
+
315
+
316
+ Section 5 -- Disclaimer of Warranties and Limitation of Liability.
317
+
318
+ a. UNLESS OTHERWISE SEPARATELY UNDERTAKEN BY THE LICENSOR, TO THE
319
+ EXTENT POSSIBLE, THE LICENSOR OFFERS THE LICENSED MATERIAL AS-IS
320
+ AND AS-AVAILABLE, AND MAKES NO REPRESENTATIONS OR WARRANTIES OF
321
+ ANY KIND CONCERNING THE LICENSED MATERIAL, WHETHER EXPRESS,
322
+ IMPLIED, STATUTORY, OR OTHER. THIS INCLUDES, WITHOUT LIMITATION,
323
+ WARRANTIES OF TITLE, MERCHANTABILITY, FITNESS FOR A PARTICULAR
324
+ PURPOSE, NON-INFRINGEMENT, ABSENCE OF LATENT OR OTHER DEFECTS,
325
+ ACCURACY, OR THE PRESENCE OR ABSENCE OF ERRORS, WHETHER OR NOT
326
+ KNOWN OR DISCOVERABLE. WHERE DISCLAIMERS OF WARRANTIES ARE NOT
327
+ ALLOWED IN FULL OR IN PART, THIS DISCLAIMER MAY NOT APPLY TO YOU.
328
+
329
+ b. TO THE EXTENT POSSIBLE, IN NO EVENT WILL THE LICENSOR BE LIABLE
330
+ TO YOU ON ANY LEGAL THEORY (INCLUDING, WITHOUT LIMITATION,
331
+ NEGLIGENCE) OR OTHERWISE FOR ANY DIRECT, SPECIAL, INDIRECT,
332
+ INCIDENTAL, CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES,
333
+ COSTS, EXPENSES, OR DAMAGES ARISING OUT OF THIS PUBLIC LICENSE OR
334
+ USE OF THE LICENSED MATERIAL, EVEN IF THE LICENSOR HAS BEEN
335
+ ADVISED OF THE POSSIBILITY OF SUCH LOSSES, COSTS, EXPENSES, OR
336
+ DAMAGES. WHERE A LIMITATION OF LIABILITY IS NOT ALLOWED IN FULL OR
337
+ IN PART, THIS LIMITATION MAY NOT APPLY TO YOU.
338
+
339
+ c. The disclaimer of warranties and limitation of liability provided
340
+ above shall be interpreted in a manner that, to the extent
341
+ possible, most closely approximates an absolute disclaimer and
342
+ waiver of all liability.
343
+
344
+
345
+ Section 6 -- Term and Termination.
346
+
347
+ a. This Public License applies for the term of the Copyright and
348
+ Similar Rights licensed here. However, if You fail to comply with
349
+ this Public License, then Your rights under this Public License
350
+ terminate automatically.
351
+
352
+ b. Where Your right to use the Licensed Material has terminated under
353
+ Section 6(a), it reinstates:
354
+
355
+ 1. automatically as of the date the violation is cured, provided
356
+ it is cured within 30 days of Your discovery of the
357
+ violation; or
358
+
359
+ 2. upon express reinstatement by the Licensor.
360
+
361
+ For the avoidance of doubt, this Section 6(b) does not affect any
362
+ right the Licensor may have to seek remedies for Your violations
363
+ of this Public License.
364
+
365
+ c. For the avoidance of doubt, the Licensor may also offer the
366
+ Licensed Material under separate terms or conditions or stop
367
+ distributing the Licensed Material at any time; however, doing so
368
+ will not terminate this Public License.
369
+
370
+ d. Sections 1, 5, 6, 7, and 8 survive termination of this Public
371
+ License.
372
+
373
+
374
+ Section 7 -- Other Terms and Conditions.
375
+
376
+ a. The Licensor shall not be bound by any additional or different
377
+ terms or conditions communicated by You unless expressly agreed.
378
+
379
+ b. Any arrangements, understandings, or agreements regarding the
380
+ Licensed Material not stated herein are separate from and
381
+ independent of the terms and conditions of this Public License.
382
+
383
+
384
+ Section 8 -- Interpretation.
385
+
386
+ a. For the avoidance of doubt, this Public License does not, and
387
+ shall not be interpreted to, reduce, limit, restrict, or impose
388
+ conditions on any use of the Licensed Material that could lawfully
389
+ be made without permission under this Public License.
390
+
391
+ b. To the extent possible, if any provision of this Public License is
392
+ deemed unenforceable, it shall be automatically reformed to the
393
+ minimum extent necessary to make it enforceable. If the provision
394
+ cannot be reformed, it shall be severed from this Public License
395
+ without affecting the enforceability of the remaining terms and
396
+ conditions.
397
+
398
+ c. No term or condition of this Public License will be waived and no
399
+ failure to comply consented to unless expressly agreed to by the
400
+ Licensor.
401
+
402
+ d. Nothing in this Public License constitutes or may be interpreted
403
+ as a limitation upon, or waiver of, any privileges and immunities
404
+ that apply to the Licensor or You, including from the legal
405
+ processes of any jurisdiction or authority.
406
+
407
+
408
+ =======================================================================
409
+
410
+ Creative Commons is not a party to its public
411
+ licenses. Notwithstanding, Creative Commons may elect to apply one of
412
+ its public licenses to material it publishes and in those instances
413
+ will be considered the “Licensor.” The text of the Creative Commons
414
+ public licenses is dedicated to the public domain under the CC0 Public
415
+ Domain Dedication. Except for the limited purpose of indicating that
416
+ material is shared under a Creative Commons public license or as
417
+ otherwise permitted by the Creative Commons policies published at
418
+ creativecommons.org/policies, Creative Commons does not authorize the
419
+ use of the trademark "Creative Commons" or any other trademark or logo
420
+ of Creative Commons without its prior written consent including,
421
+ without limitation, in connection with any unauthorized modifications
422
+ to any of its public licenses or any other arrangements,
423
+ understandings, or agreements concerning use of licensed material. For
424
+ the avoidance of doubt, this paragraph does not form part of the
425
+ public licenses.
426
+
427
+ Creative Commons may be contacted at creativecommons.org.
428
+
LICENSES_SOURCES ADDED
@@ -0,0 +1,1092 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # UD Japanese GSD v2.6
2
+
3
+ * Author: Omura, Mai; Miyao, Yusuke; Kanayama, Hiroshi; Matsuda, Hiroshi; Wakasa, Aya; Yamashita, Kayo; Asahara, Masayuki; Tanaka, Takaaki; Murawaki, Yugo; Matsumoto, Yuji; Mori, Shinsuke; Uematsu, Sumire; McDonald, Ryan; Nivre, Joakim; Zeman, Daniel
4
+ * URL: https://github.com/UniversalDependencies/UD_Japanese-GSD
5
+ * License: CC BY-SA 4.0
6
+
7
+ ```
8
+ Attribution-ShareAlike 4.0 International
9
+
10
+ =======================================================================
11
+
12
+ Creative Commons Corporation ("Creative Commons") is not a law firm and
13
+ does not provide legal services or legal advice. Distribution of
14
+ Creative Commons public licenses does not create a lawyer-client or
15
+ other relationship. Creative Commons makes its licenses and related
16
+ information available on an "as-is" basis. Creative Commons gives no
17
+ warranties regarding its licenses, any material licensed under their
18
+ terms and conditions, or any related information. Creative Commons
19
+ disclaims all liability for damages resulting from their use to the
20
+ fullest extent possible.
21
+
22
+ Using Creative Commons Public Licenses
23
+
24
+ Creative Commons public licenses provide a standard set of terms and
25
+ conditions that creators and other rights holders may use to share
26
+ original works of authorship and other material subject to copyright
27
+ and certain other rights specified in the public license below. The
28
+ following considerations are for informational purposes only, are not
29
+ exhaustive, and do not form part of our licenses.
30
+
31
+ Considerations for licensors: Our public licenses are
32
+ intended for use by those authorized to give the public
33
+ permission to use material in ways otherwise restricted by
34
+ copyright and certain other rights. Our licenses are
35
+ irrevocable. Licensors should read and understand the terms
36
+ and conditions of the license they choose before applying it.
37
+ Licensors should also secure all rights necessary before
38
+ applying our licenses so that the public can reuse the
39
+ material as expected. Licensors should clearly mark any
40
+ material not subject to the license. This includes other CC-
41
+ licensed material, or material used under an exception or
42
+ limitation to copyright. More considerations for licensors:
43
+ wiki.creativecommons.org/Considerations_for_licensors
44
+
45
+ Considerations for the public: By using one of our public
46
+ licenses, a licensor grants the public permission to use the
47
+ licensed material under specified terms and conditions. If
48
+ the licensor's permission is not necessary for any reason--for
49
+ example, because of any applicable exception or limitation to
50
+ copyright--then that use is not regulated by the license. Our
51
+ licenses grant only permissions under copyright and certain
52
+ other rights that a licensor has authority to grant. Use of
53
+ the licensed material may still be restricted for other
54
+ reasons, including because others have copyright or other
55
+ rights in the material. A licensor may make special requests,
56
+ such as asking that all changes be marked or described.
57
+ Although not required by our licenses, you are encouraged to
58
+ respect those requests where reasonable. More considerations
59
+ for the public:
60
+ wiki.creativecommons.org/Considerations_for_licensees
61
+
62
+ =======================================================================
63
+
64
+ Creative Commons Attribution-ShareAlike 4.0 International Public
65
+ License
66
+
67
+ By exercising the Licensed Rights (defined below), You accept and agree
68
+ to be bound by the terms and conditions of this Creative Commons
69
+ Attribution-ShareAlike 4.0 International Public License ("Public
70
+ License"). To the extent this Public License may be interpreted as a
71
+ contract, You are granted the Licensed Rights in consideration of Your
72
+ acceptance of these terms and conditions, and the Licensor grants You
73
+ such rights in consideration of benefits the Licensor receives from
74
+ making the Licensed Material available under these terms and
75
+ conditions.
76
+
77
+
78
+ Section 1 -- Definitions.
79
+
80
+ a. Adapted Material means material subject to Copyright and Similar
81
+ Rights that is derived from or based upon the Licensed Material
82
+ and in which the Licensed Material is translated, altered,
83
+ arranged, transformed, or otherwise modified in a manner requiring
84
+ permission under the Copyright and Similar Rights held by the
85
+ Licensor. For purposes of this Public License, where the Licensed
86
+ Material is a musical work, performance, or sound recording,
87
+ Adapted Material is always produced where the Licensed Material is
88
+ synched in timed relation with a moving image.
89
+
90
+ b. Adapter's License means the license You apply to Your Copyright
91
+ and Similar Rights in Your contributions to Adapted Material in
92
+ accordance with the terms and conditions of this Public License.
93
+
94
+ c. BY-SA Compatible License means a license listed at
95
+ creativecommons.org/compatiblelicenses, approved by Creative
96
+ Commons as essentially the equivalent of this Public License.
97
+
98
+ d. Copyright and Similar Rights means copyright and/or similar rights
99
+ closely related to copyright including, without limitation,
100
+ performance, broadcast, sound recording, and Sui Generis Database
101
+ Rights, without regard to how the rights are labeled or
102
+ categorized. For purposes of this Public License, the rights
103
+ specified in Section 2(b)(1)-(2) are not Copyright and Similar
104
+ Rights.
105
+
106
+ e. Effective Technological Measures means those measures that, in the
107
+ absence of proper authority, may not be circumvented under laws
108
+ fulfilling obligations under Article 11 of the WIPO Copyright
109
+ Treaty adopted on December 20, 1996, and/or similar international
110
+ agreements.
111
+
112
+ f. Exceptions and Limitations means fair use, fair dealing, and/or
113
+ any other exception or limitation to Copyright and Similar Rights
114
+ that applies to Your use of the Licensed Material.
115
+
116
+ g. License Elements means the license attributes listed in the name
117
+ of a Creative Commons Public License. The License Elements of this
118
+ Public License are Attribution and ShareAlike.
119
+
120
+ h. Licensed Material means the artistic or literary work, database,
121
+ or other material to which the Licensor applied this Public
122
+ License.
123
+
124
+ i. Licensed Rights means the rights granted to You subject to the
125
+ terms and conditions of this Public License, which are limited to
126
+ all Copyright and Similar Rights that apply to Your use of the
127
+ Licensed Material and that the Licensor has authority to license.
128
+
129
+ j. Licensor means the individual(s) or entity(ies) granting rights
130
+ under this Public License.
131
+
132
+ k. Share means to provide material to the public by any means or
133
+ process that requires permission under the Licensed Rights, such
134
+ as reproduction, public display, public performance, distribution,
135
+ dissemination, communication, or importation, and to make material
136
+ available to the public including in ways that members of the
137
+ public may access the material from a place and at a time
138
+ individually chosen by them.
139
+
140
+ l. Sui Generis Database Rights means rights other than copyright
141
+ resulting from Directive 96/9/EC of the European Parliament and of
142
+ the Council of 11 March 1996 on the legal protection of databases,
143
+ as amended and/or succeeded, as well as other essentially
144
+ equivalent rights anywhere in the world.
145
+
146
+ m. You means the individual or entity exercising the Licensed Rights
147
+ under this Public License. Your has a corresponding meaning.
148
+
149
+
150
+ Section 2 -- Scope.
151
+
152
+ a. License grant.
153
+
154
+ 1. Subject to the terms and conditions of this Public License,
155
+ the Licensor hereby grants You a worldwide, royalty-free,
156
+ non-sublicensable, non-exclusive, irrevocable license to
157
+ exercise the Licensed Rights in the Licensed Material to:
158
+
159
+ a. reproduce and Share the Licensed Material, in whole or
160
+ in part; and
161
+
162
+ b. produce, reproduce, and Share Adapted Material.
163
+
164
+ 2. Exceptions and Limitations. For the avoidance of doubt, where
165
+ Exceptions and Limitations apply to Your use, this Public
166
+ License does not apply, and You do not need to comply with
167
+ its terms and conditions.
168
+
169
+ 3. Term. The term of this Public License is specified in Section
170
+ 6(a).
171
+
172
+ 4. Media and formats; technical modifications allowed. The
173
+ Licensor authorizes You to exercise the Licensed Rights in
174
+ all media and formats whether now known or hereafter created,
175
+ and to make technical modifications necessary to do so. The
176
+ Licensor waives and/or agrees not to assert any right or
177
+ authority to forbid You from making technical modifications
178
+ necessary to exercise the Licensed Rights, including
179
+ technical modifications necessary to circumvent Effective
180
+ Technological Measures. For purposes of this Public License,
181
+ simply making modifications authorized by this Section 2(a)
182
+ (4) never produces Adapted Material.
183
+
184
+ 5. Downstream recipients.
185
+
186
+ a. Offer from the Licensor -- Licensed Material. Every
187
+ recipient of the Licensed Material automatically
188
+ receives an offer from the Licensor to exercise the
189
+ Licensed Rights under the terms and conditions of this
190
+ Public License.
191
+
192
+ b. Additional offer from the Licensor -- Adapted Material.
193
+ Every recipient of Adapted Material from You
194
+ automatically receives an offer from the Licensor to
195
+ exercise the Licensed Rights in the Adapted Material
196
+ under the conditions of the Adapter's License You apply.
197
+
198
+ c. No downstream restrictions. You may not offer or impose
199
+ any additional or different terms or conditions on, or
200
+ apply any Effective Technological Measures to, the
201
+ Licensed Material if doing so restricts exercise of the
202
+ Licensed Rights by any recipient of the Licensed
203
+ Material.
204
+
205
+ 6. No endorsement. Nothing in this Public License constitutes or
206
+ may be construed as permission to assert or imply that You
207
+ are, or that Your use of the Licensed Material is, connected
208
+ with, or sponsored, endorsed, or granted official status by,
209
+ the Licensor or others designated to receive attribution as
210
+ provided in Section 3(a)(1)(A)(i).
211
+
212
+ b. Other rights.
213
+
214
+ 1. Moral rights, such as the right of integrity, are not
215
+ licensed under this Public License, nor are publicity,
216
+ privacy, and/or other similar personality rights; however, to
217
+ the extent possible, the Licensor waives and/or agrees not to
218
+ assert any such rights held by the Licensor to the limited
219
+ extent necessary to allow You to exercise the Licensed
220
+ Rights, but not otherwise.
221
+
222
+ 2. Patent and trademark rights are not licensed under this
223
+ Public License.
224
+
225
+ 3. To the extent possible, the Licensor waives any right to
226
+ collect royalties from You for the exercise of the Licensed
227
+ Rights, whether directly or through a collecting society
228
+ under any voluntary or waivable statutory or compulsory
229
+ licensing scheme. In all other cases the Licensor expressly
230
+ reserves any right to collect such royalties.
231
+
232
+
233
+ Section 3 -- License Conditions.
234
+
235
+ Your exercise of the Licensed Rights is expressly made subject to the
236
+ following conditions.
237
+
238
+ a. Attribution.
239
+
240
+ 1. If You Share the Licensed Material (including in modified
241
+ form), You must:
242
+
243
+ a. retain the following if it is supplied by the Licensor
244
+ with the Licensed Material:
245
+
246
+ i. identification of the creator(s) of the Licensed
247
+ Material and any others designated to receive
248
+ attribution, in any reasonable manner requested by
249
+ the Licensor (including by pseudonym if
250
+ designated);
251
+
252
+ ii. a copyright notice;
253
+
254
+ iii. a notice that refers to this Public License;
255
+
256
+ iv. a notice that refers to the disclaimer of
257
+ warranties;
258
+
259
+ v. a URI or hyperlink to the Licensed Material to the
260
+ extent reasonably practicable;
261
+
262
+ b. indicate if You modified the Licensed Material and
263
+ retain an indication of any previous modifications; and
264
+
265
+ c. indicate the Licensed Material is licensed under this
266
+ Public License, and include the text of, or the URI or
267
+ hyperlink to, this Public License.
268
+
269
+ 2. You may satisfy the conditions in Section 3(a)(1) in any
270
+ reasonable manner based on the medium, means, and context in
271
+ which You Share the Licensed Material. For example, it may be
272
+ reasonable to satisfy the conditions by providing a URI or
273
+ hyperlink to a resource that includes the required
274
+ information.
275
+
276
+ 3. If requested by the Licensor, You must remove any of the
277
+ information required by Section 3(a)(1)(A) to the extent
278
+ reasonably practicable.
279
+
280
+ b. ShareAlike.
281
+
282
+ In addition to the conditions in Section 3(a), if You Share
283
+ Adapted Material You produce, the following conditions also apply.
284
+
285
+ 1. The Adapter's License You apply must be a Creative Commons
286
+ license with the same License Elements, this version or
287
+ later, or a BY-SA Compatible License.
288
+
289
+ 2. You must include the text of, or the URI or hyperlink to, the
290
+ Adapter's License You apply. You may satisfy this condition
291
+ in any reasonable manner based on the medium, means, and
292
+ context in which You Share Adapted Material.
293
+
294
+ 3. You may not offer or impose any additional or different terms
295
+ or conditions on, or apply any Effective Technological
296
+ Measures to, Adapted Material that restrict exercise of the
297
+ rights granted under the Adapter's License You apply.
298
+
299
+
300
+ Section 4 -- Sui Generis Database Rights.
301
+
302
+ Where the Licensed Rights include Sui Generis Database Rights that
303
+ apply to Your use of the Licensed Material:
304
+
305
+ a. for the avoidance of doubt, Section 2(a)(1) grants You the right
306
+ to extract, reuse, reproduce, and Share all or a substantial
307
+ portion of the contents of the database;
308
+
309
+ b. if You include all or a substantial portion of the database
310
+ contents in a database in which You have Sui Generis Database
311
+ Rights, then the database in which You have Sui Generis Database
312
+ Rights (but not its individual contents) is Adapted Material,
313
+
314
+ including for purposes of Section 3(b); and
315
+ c. You must comply with the conditions in Section 3(a) if You Share
316
+ all or a substantial portion of the contents of the database.
317
+
318
+ For the avoidance of doubt, this Section 4 supplements and does not
319
+ replace Your obligations under this Public License where the Licensed
320
+ Rights include other Copyright and Similar Rights.
321
+
322
+
323
+ Section 5 -- Disclaimer of Warranties and Limitation of Liability.
324
+
325
+ a. UNLESS OTHERWISE SEPARATELY UNDERTAKEN BY THE LICENSOR, TO THE
326
+ EXTENT POSSIBLE, THE LICENSOR OFFERS THE LICENSED MATERIAL AS-IS
327
+ AND AS-AVAILABLE, AND MAKES NO REPRESENTATIONS OR WARRANTIES OF
328
+ ANY KIND CONCERNING THE LICENSED MATERIAL, WHETHER EXPRESS,
329
+ IMPLIED, STATUTORY, OR OTHER. THIS INCLUDES, WITHOUT LIMITATION,
330
+ WARRANTIES OF TITLE, MERCHANTABILITY, FITNESS FOR A PARTICULAR
331
+ PURPOSE, NON-INFRINGEMENT, ABSENCE OF LATENT OR OTHER DEFECTS,
332
+ ACCURACY, OR THE PRESENCE OR ABSENCE OF ERRORS, WHETHER OR NOT
333
+ KNOWN OR DISCOVERABLE. WHERE DISCLAIMERS OF WARRANTIES ARE NOT
334
+ ALLOWED IN FULL OR IN PART, THIS DISCLAIMER MAY NOT APPLY TO YOU.
335
+
336
+ b. TO THE EXTENT POSSIBLE, IN NO EVENT WILL THE LICENSOR BE LIABLE
337
+ TO YOU ON ANY LEGAL THEORY (INCLUDING, WITHOUT LIMITATION,
338
+ NEGLIGENCE) OR OTHERWISE FOR ANY DIRECT, SPECIAL, INDIRECT,
339
+ INCIDENTAL, CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES,
340
+ COSTS, EXPENSES, OR DAMAGES ARISING OUT OF THIS PUBLIC LICENSE OR
341
+ USE OF THE LICENSED MATERIAL, EVEN IF THE LICENSOR HAS BEEN
342
+ ADVISED OF THE POSSIBILITY OF SUCH LOSSES, COSTS, EXPENSES, OR
343
+ DAMAGES. WHERE A LIMITATION OF LIABILITY IS NOT ALLOWED IN FULL OR
344
+ IN PART, THIS LIMITATION MAY NOT APPLY TO YOU.
345
+
346
+ c. The disclaimer of warranties and limitation of liability provided
347
+ above shall be interpreted in a manner that, to the extent
348
+ possible, most closely approximates an absolute disclaimer and
349
+ waiver of all liability.
350
+
351
+
352
+ Section 6 -- Term and Termination.
353
+
354
+ a. This Public License applies for the term of the Copyright and
355
+ Similar Rights licensed here. However, if You fail to comply with
356
+ this Public License, then Your rights under this Public License
357
+ terminate automatically.
358
+
359
+ b. Where Your right to use the Licensed Material has terminated under
360
+ Section 6(a), it reinstates:
361
+
362
+ 1. automatically as of the date the violation is cured, provided
363
+ it is cured within 30 days of Your discovery of the
364
+ violation; or
365
+
366
+ 2. upon express reinstatement by the Licensor.
367
+
368
+ For the avoidance of doubt, this Section 6(b) does not affect any
369
+ right the Licensor may have to seek remedies for Your violations
370
+ of this Public License.
371
+
372
+ c. For the avoidance of doubt, the Licensor may also offer the
373
+ Licensed Material under separate terms or conditions or stop
374
+ distributing the Licensed Material at any time; however, doing so
375
+ will not terminate this Public License.
376
+
377
+ d. Sections 1, 5, 6, 7, and 8 survive termination of this Public
378
+ License.
379
+
380
+
381
+ Section 7 -- Other Terms and Conditions.
382
+
383
+ a. The Licensor shall not be bound by any additional or different
384
+ terms or conditions communicated by You unless expressly agreed.
385
+
386
+ b. Any arrangements, understandings, or agreements regarding the
387
+ Licensed Material not stated herein are separate from and
388
+ independent of the terms and conditions of this Public License.
389
+
390
+
391
+ Section 8 -- Interpretation.
392
+
393
+ a. For the avoidance of doubt, this Public License does not, and
394
+ shall not be interpreted to, reduce, limit, restrict, or impose
395
+ conditions on any use of the Licensed Material that could lawfully
396
+ be made without permission under this Public License.
397
+
398
+ b. To the extent possible, if any provision of this Public License is
399
+ deemed unenforceable, it shall be automatically reformed to the
400
+ minimum extent necessary to make it enforceable. If the provision
401
+ cannot be reformed, it shall be severed from this Public License
402
+ without affecting the enforceability of the remaining terms and
403
+ conditions.
404
+
405
+ c. No term or condition of this Public License will be waived and no
406
+ failure to comply consented to unless expressly agreed to by the
407
+ Licensor.
408
+
409
+ d. Nothing in this Public License constitutes or may be interpreted
410
+ as a limitation upon, or waiver of, any privileges and immunities
411
+ that apply to the Licensor or You, including from the legal
412
+ processes of any jurisdiction or authority.
413
+
414
+
415
+ =======================================================================
416
+
417
+ Creative Commons is not a party to its public
418
+ licenses. Notwithstanding, Creative Commons may elect to apply one of
419
+ its public licenses to material it publishes and in those instances
420
+ will be considered the “Licensor.” The text of the Creative Commons
421
+ public licenses is dedicated to the public domain under the CC0 Public
422
+ Domain Dedication. Except for the limited purpose of indicating that
423
+ material is shared under a Creative Commons public license or as
424
+ otherwise permitted by the Creative Commons policies published at
425
+ creativecommons.org/policies, Creative Commons does not authorize the
426
+ use of the trademark "Creative Commons" or any other trademark or logo
427
+ of Creative Commons without its prior written consent including,
428
+ without limitation, in connection with any unauthorized modifications
429
+ to any of its public licenses or any other arrangements,
430
+ understandings, or agreements concerning use of licensed material. For
431
+ the avoidance of doubt, this paragraph does not form part of the
432
+ public licenses.
433
+
434
+ Creative Commons may be contacted at creativecommons.org.
435
+
436
+ ```
437
+
438
+
439
+
440
+
441
+ # UD Japanese GSD v2.6 NER
442
+
443
+ * Author: Megagon Labs Tokyo
444
+ * URL: https://github.com/megagonlabs/UD_Japanese-GSD
445
+ * License: CC BY-SA 4.0
446
+
447
+ ```
448
+ Attribution-ShareAlike 4.0 International
449
+
450
+ =======================================================================
451
+
452
+ Creative Commons Corporation ("Creative Commons") is not a law firm and
453
+ does not provide legal services or legal advice. Distribution of
454
+ Creative Commons public licenses does not create a lawyer-client or
455
+ other relationship. Creative Commons makes its licenses and related
456
+ information available on an "as-is" basis. Creative Commons gives no
457
+ warranties regarding its licenses, any material licensed under their
458
+ terms and conditions, or any related information. Creative Commons
459
+ disclaims all liability for damages resulting from their use to the
460
+ fullest extent possible.
461
+
462
+ Using Creative Commons Public Licenses
463
+
464
+ Creative Commons public licenses provide a standard set of terms and
465
+ conditions that creators and other rights holders may use to share
466
+ original works of authorship and other material subject to copyright
467
+ and certain other rights specified in the public license below. The
468
+ following considerations are for informational purposes only, are not
469
+ exhaustive, and do not form part of our licenses.
470
+
471
+ Considerations for licensors: Our public licenses are
472
+ intended for use by those authorized to give the public
473
+ permission to use material in ways otherwise restricted by
474
+ copyright and certain other rights. Our licenses are
475
+ irrevocable. Licensors should read and understand the terms
476
+ and conditions of the license they choose before applying it.
477
+ Licensors should also secure all rights necessary before
478
+ applying our licenses so that the public can reuse the
479
+ material as expected. Licensors should clearly mark any
480
+ material not subject to the license. This includes other CC-
481
+ licensed material, or material used under an exception or
482
+ limitation to copyright. More considerations for licensors:
483
+ wiki.creativecommons.org/Considerations_for_licensors
484
+
485
+ Considerations for the public: By using one of our public
486
+ licenses, a licensor grants the public permission to use the
487
+ licensed material under specified terms and conditions. If
488
+ the licensor's permission is not necessary for any reason--for
489
+ example, because of any applicable exception or limitation to
490
+ copyright--then that use is not regulated by the license. Our
491
+ licenses grant only permissions under copyright and certain
492
+ other rights that a licensor has authority to grant. Use of
493
+ the licensed material may still be restricted for other
494
+ reasons, including because others have copyright or other
495
+ rights in the material. A licensor may make special requests,
496
+ such as asking that all changes be marked or described.
497
+ Although not required by our licenses, you are encouraged to
498
+ respect those requests where reasonable. More considerations
499
+ for the public:
500
+ wiki.creativecommons.org/Considerations_for_licensees
501
+
502
+ =======================================================================
503
+
504
+ Creative Commons Attribution-ShareAlike 4.0 International Public
505
+ License
506
+
507
+ By exercising the Licensed Rights (defined below), You accept and agree
508
+ to be bound by the terms and conditions of this Creative Commons
509
+ Attribution-ShareAlike 4.0 International Public License ("Public
510
+ License"). To the extent this Public License may be interpreted as a
511
+ contract, You are granted the Licensed Rights in consideration of Your
512
+ acceptance of these terms and conditions, and the Licensor grants You
513
+ such rights in consideration of benefits the Licensor receives from
514
+ making the Licensed Material available under these terms and
515
+ conditions.
516
+
517
+
518
+ Section 1 -- Definitions.
519
+
520
+ a. Adapted Material means material subject to Copyright and Similar
521
+ Rights that is derived from or based upon the Licensed Material
522
+ and in which the Licensed Material is translated, altered,
523
+ arranged, transformed, or otherwise modified in a manner requiring
524
+ permission under the Copyright and Similar Rights held by the
525
+ Licensor. For purposes of this Public License, where the Licensed
526
+ Material is a musical work, performance, or sound recording,
527
+ Adapted Material is always produced where the Licensed Material is
528
+ synched in timed relation with a moving image.
529
+
530
+ b. Adapter's License means the license You apply to Your Copyright
531
+ and Similar Rights in Your contributions to Adapted Material in
532
+ accordance with the terms and conditions of this Public License.
533
+
534
+ c. BY-SA Compatible License means a license listed at
535
+ creativecommons.org/compatiblelicenses, approved by Creative
536
+ Commons as essentially the equivalent of this Public License.
537
+
538
+ d. Copyright and Similar Rights means copyright and/or similar rights
539
+ closely related to copyright including, without limitation,
540
+ performance, broadcast, sound recording, and Sui Generis Database
541
+ Rights, without regard to how the rights are labeled or
542
+ categorized. For purposes of this Public License, the rights
543
+ specified in Section 2(b)(1)-(2) are not Copyright and Similar
544
+ Rights.
545
+
546
+ e. Effective Technological Measures means those measures that, in the
547
+ absence of proper authority, may not be circumvented under laws
548
+ fulfilling obligations under Article 11 of the WIPO Copyright
549
+ Treaty adopted on December 20, 1996, and/or similar international
550
+ agreements.
551
+
552
+ f. Exceptions and Limitations means fair use, fair dealing, and/or
553
+ any other exception or limitation to Copyright and Similar Rights
554
+ that applies to Your use of the Licensed Material.
555
+
556
+ g. License Elements means the license attributes listed in the name
557
+ of a Creative Commons Public License. The License Elements of this
558
+ Public License are Attribution and ShareAlike.
559
+
560
+ h. Licensed Material means the artistic or literary work, database,
561
+ or other material to which the Licensor applied this Public
562
+ License.
563
+
564
+ i. Licensed Rights means the rights granted to You subject to the
565
+ terms and conditions of this Public License, which are limited to
566
+ all Copyright and Similar Rights that apply to Your use of the
567
+ Licensed Material and that the Licensor has authority to license.
568
+
569
+ j. Licensor means the individual(s) or entity(ies) granting rights
570
+ under this Public License.
571
+
572
+ k. Share means to provide material to the public by any means or
573
+ process that requires permission under the Licensed Rights, such
574
+ as reproduction, public display, public performance, distribution,
575
+ dissemination, communication, or importation, and to make material
576
+ available to the public including in ways that members of the
577
+ public may access the material from a place and at a time
578
+ individually chosen by them.
579
+
580
+ l. Sui Generis Database Rights means rights other than copyright
581
+ resulting from Directive 96/9/EC of the European Parliament and of
582
+ the Council of 11 March 1996 on the legal protection of databases,
583
+ as amended and/or succeeded, as well as other essentially
584
+ equivalent rights anywhere in the world.
585
+
586
+ m. You means the individual or entity exercising the Licensed Rights
587
+ under this Public License. Your has a corresponding meaning.
588
+
589
+
590
+ Section 2 -- Scope.
591
+
592
+ a. License grant.
593
+
594
+ 1. Subject to the terms and conditions of this Public License,
595
+ the Licensor hereby grants You a worldwide, royalty-free,
596
+ non-sublicensable, non-exclusive, irrevocable license to
597
+ exercise the Licensed Rights in the Licensed Material to:
598
+
599
+ a. reproduce and Share the Licensed Material, in whole or
600
+ in part; and
601
+
602
+ b. produce, reproduce, and Share Adapted Material.
603
+
604
+ 2. Exceptions and Limitations. For the avoidance of doubt, where
605
+ Exceptions and Limitations apply to Your use, this Public
606
+ License does not apply, and You do not need to comply with
607
+ its terms and conditions.
608
+
609
+ 3. Term. The term of this Public License is specified in Section
610
+ 6(a).
611
+
612
+ 4. Media and formats; technical modifications allowed. The
613
+ Licensor authorizes You to exercise the Licensed Rights in
614
+ all media and formats whether now known or hereafter created,
615
+ and to make technical modifications necessary to do so. The
616
+ Licensor waives and/or agrees not to assert any right or
617
+ authority to forbid You from making technical modifications
618
+ necessary to exercise the Licensed Rights, including
619
+ technical modifications necessary to circumvent Effective
620
+ Technological Measures. For purposes of this Public License,
621
+ simply making modifications authorized by this Section 2(a)
622
+ (4) never produces Adapted Material.
623
+
624
+ 5. Downstream recipients.
625
+
626
+ a. Offer from the Licensor -- Licensed Material. Every
627
+ recipient of the Licensed Material automatically
628
+ receives an offer from the Licensor to exercise the
629
+ Licensed Rights under the terms and conditions of this
630
+ Public License.
631
+
632
+ b. Additional offer from the Licensor -- Adapted Material.
633
+ Every recipient of Adapted Material from You
634
+ automatically receives an offer from the Licensor to
635
+ exercise the Licensed Rights in the Adapted Material
636
+ under the conditions of the Adapter's License You apply.
637
+
638
+ c. No downstream restrictions. You may not offer or impose
639
+ any additional or different terms or conditions on, or
640
+ apply any Effective Technological Measures to, the
641
+ Licensed Material if doing so restricts exercise of the
642
+ Licensed Rights by any recipient of the Licensed
643
+ Material.
644
+
645
+ 6. No endorsement. Nothing in this Public License constitutes or
646
+ may be construed as permission to assert or imply that You
647
+ are, or that Your use of the Licensed Material is, connected
648
+ with, or sponsored, endorsed, or granted official status by,
649
+ the Licensor or others designated to receive attribution as
650
+ provided in Section 3(a)(1)(A)(i).
651
+
652
+ b. Other rights.
653
+
654
+ 1. Moral rights, such as the right of integrity, are not
655
+ licensed under this Public License, nor are publicity,
656
+ privacy, and/or other similar personality rights; however, to
657
+ the extent possible, the Licensor waives and/or agrees not to
658
+ assert any such rights held by the Licensor to the limited
659
+ extent necessary to allow You to exercise the Licensed
660
+ Rights, but not otherwise.
661
+
662
+ 2. Patent and trademark rights are not licensed under this
663
+ Public License.
664
+
665
+ 3. To the extent possible, the Licensor waives any right to
666
+ collect royalties from You for the exercise of the Licensed
667
+ Rights, whether directly or through a collecting society
668
+ under any voluntary or waivable statutory or compulsory
669
+ licensing scheme. In all other cases the Licensor expressly
670
+ reserves any right to collect such royalties.
671
+
672
+
673
+ Section 3 -- License Conditions.
674
+
675
+ Your exercise of the Licensed Rights is expressly made subject to the
676
+ following conditions.
677
+
678
+ a. Attribution.
679
+
680
+ 1. If You Share the Licensed Material (including in modified
681
+ form), You must:
682
+
683
+ a. retain the following if it is supplied by the Licensor
684
+ with the Licensed Material:
685
+
686
+ i. identification of the creator(s) of the Licensed
687
+ Material and any others designated to receive
688
+ attribution, in any reasonable manner requested by
689
+ the Licensor (including by pseudonym if
690
+ designated);
691
+
692
+ ii. a copyright notice;
693
+
694
+ iii. a notice that refers to this Public License;
695
+
696
+ iv. a notice that refers to the disclaimer of
697
+ warranties;
698
+
699
+ v. a URI or hyperlink to the Licensed Material to the
700
+ extent reasonably practicable;
701
+
702
+ b. indicate if You modified the Licensed Material and
703
+ retain an indication of any previous modifications; and
704
+
705
+ c. indicate the Licensed Material is licensed under this
706
+ Public License, and include the text of, or the URI or
707
+ hyperlink to, this Public License.
708
+
709
+ 2. You may satisfy the conditions in Section 3(a)(1) in any
710
+ reasonable manner based on the medium, means, and context in
711
+ which You Share the Licensed Material. For example, it may be
712
+ reasonable to satisfy the conditions by providing a URI or
713
+ hyperlink to a resource that includes the required
714
+ information.
715
+
716
+ 3. If requested by the Licensor, You must remove any of the
717
+ information required by Section 3(a)(1)(A) to the extent
718
+ reasonably practicable.
719
+
720
+ b. ShareAlike.
721
+
722
+ In addition to the conditions in Section 3(a), if You Share
723
+ Adapted Material You produce, the following conditions also apply.
724
+
725
+ 1. The Adapter's License You apply must be a Creative Commons
726
+ license with the same License Elements, this version or
727
+ later, or a BY-SA Compatible License.
728
+
729
+ 2. You must include the text of, or the URI or hyperlink to, the
730
+ Adapter's License You apply. You may satisfy this condition
731
+ in any reasonable manner based on the medium, means, and
732
+ context in which You Share Adapted Material.
733
+
734
+ 3. You may not offer or impose any additional or different terms
735
+ or conditions on, or apply any Effective Technological
736
+ Measures to, Adapted Material that restrict exercise of the
737
+ rights granted under the Adapter's License You apply.
738
+
739
+
740
+ Section 4 -- Sui Generis Database Rights.
741
+
742
+ Where the Licensed Rights include Sui Generis Database Rights that
743
+ apply to Your use of the Licensed Material:
744
+
745
+ a. for the avoidance of doubt, Section 2(a)(1) grants You the right
746
+ to extract, reuse, reproduce, and Share all or a substantial
747
+ portion of the contents of the database;
748
+
749
+ b. if You include all or a substantial portion of the database
750
+ contents in a database in which You have Sui Generis Database
751
+ Rights, then the database in which You have Sui Generis Database
752
+ Rights (but not its individual contents) is Adapted Material,
753
+
754
+ including for purposes of Section 3(b); and
755
+ c. You must comply with the conditions in Section 3(a) if You Share
756
+ all or a substantial portion of the contents of the database.
757
+
758
+ For the avoidance of doubt, this Section 4 supplements and does not
759
+ replace Your obligations under this Public License where the Licensed
760
+ Rights include other Copyright and Similar Rights.
761
+
762
+
763
+ Section 5 -- Disclaimer of Warranties and Limitation of Liability.
764
+
765
+ a. UNLESS OTHERWISE SEPARATELY UNDERTAKEN BY THE LICENSOR, TO THE
766
+ EXTENT POSSIBLE, THE LICENSOR OFFERS THE LICENSED MATERIAL AS-IS
767
+ AND AS-AVAILABLE, AND MAKES NO REPRESENTATIONS OR WARRANTIES OF
768
+ ANY KIND CONCERNING THE LICENSED MATERIAL, WHETHER EXPRESS,
769
+ IMPLIED, STATUTORY, OR OTHER. THIS INCLUDES, WITHOUT LIMITATION,
770
+ WARRANTIES OF TITLE, MERCHANTABILITY, FITNESS FOR A PARTICULAR
771
+ PURPOSE, NON-INFRINGEMENT, ABSENCE OF LATENT OR OTHER DEFECTS,
772
+ ACCURACY, OR THE PRESENCE OR ABSENCE OF ERRORS, WHETHER OR NOT
773
+ KNOWN OR DISCOVERABLE. WHERE DISCLAIMERS OF WARRANTIES ARE NOT
774
+ ALLOWED IN FULL OR IN PART, THIS DISCLAIMER MAY NOT APPLY TO YOU.
775
+
776
+ b. TO THE EXTENT POSSIBLE, IN NO EVENT WILL THE LICENSOR BE LIABLE
777
+ TO YOU ON ANY LEGAL THEORY (INCLUDING, WITHOUT LIMITATION,
778
+ NEGLIGENCE) OR OTHERWISE FOR ANY DIRECT, SPECIAL, INDIRECT,
779
+ INCIDENTAL, CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES,
780
+ COSTS, EXPENSES, OR DAMAGES ARISING OUT OF THIS PUBLIC LICENSE OR
781
+ USE OF THE LICENSED MATERIAL, EVEN IF THE LICENSOR HAS BEEN
782
+ ADVISED OF THE POSSIBILITY OF SUCH LOSSES, COSTS, EXPENSES, OR
783
+ DAMAGES. WHERE A LIMITATION OF LIABILITY IS NOT ALLOWED IN FULL OR
784
+ IN PART, THIS LIMITATION MAY NOT APPLY TO YOU.
785
+
786
+ c. The disclaimer of warranties and limitation of liability provided
787
+ above shall be interpreted in a manner that, to the extent
788
+ possible, most closely approximates an absolute disclaimer and
789
+ waiver of all liability.
790
+
791
+
792
+ Section 6 -- Term and Termination.
793
+
794
+ a. This Public License applies for the term of the Copyright and
795
+ Similar Rights licensed here. However, if You fail to comply with
796
+ this Public License, then Your rights under this Public License
797
+ terminate automatically.
798
+
799
+ b. Where Your right to use the Licensed Material has terminated under
800
+ Section 6(a), it reinstates:
801
+
802
+ 1. automatically as of the date the violation is cured, provided
803
+ it is cured within 30 days of Your discovery of the
804
+ violation; or
805
+
806
+ 2. upon express reinstatement by the Licensor.
807
+
808
+ For the avoidance of doubt, this Section 6(b) does not affect any
809
+ right the Licensor may have to seek remedies for Your violations
810
+ of this Public License.
811
+
812
+ c. For the avoidance of doubt, the Licensor may also offer the
813
+ Licensed Material under separate terms or conditions or stop
814
+ distributing the Licensed Material at any time; however, doing so
815
+ will not terminate this Public License.
816
+
817
+ d. Sections 1, 5, 6, 7, and 8 survive termination of this Public
818
+ License.
819
+
820
+
821
+ Section 7 -- Other Terms and Conditions.
822
+
823
+ a. The Licensor shall not be bound by any additional or different
824
+ terms or conditions communicated by You unless expressly agreed.
825
+
826
+ b. Any arrangements, understandings, or agreements regarding the
827
+ Licensed Material not stated herein are separate from and
828
+ independent of the terms and conditions of this Public License.
829
+
830
+
831
+ Section 8 -- Interpretation.
832
+
833
+ a. For the avoidance of doubt, this Public License does not, and
834
+ shall not be interpreted to, reduce, limit, restrict, or impose
835
+ conditions on any use of the Licensed Material that could lawfully
836
+ be made without permission under this Public License.
837
+
838
+ b. To the extent possible, if any provision of this Public License is
839
+ deemed unenforceable, it shall be automatically reformed to the
840
+ minimum extent necessary to make it enforceable. If the provision
841
+ cannot be reformed, it shall be severed from this Public License
842
+ without affecting the enforceability of the remaining terms and
843
+ conditions.
844
+
845
+ c. No term or condition of this Public License will be waived and no
846
+ failure to comply consented to unless expressly agreed to by the
847
+ Licensor.
848
+
849
+ d. Nothing in this Public License constitutes or may be interpreted
850
+ as a limitation upon, or waiver of, any privileges and immunities
851
+ that apply to the Licensor or You, including from the legal
852
+ processes of any jurisdiction or authority.
853
+
854
+
855
+ =======================================================================
856
+
857
+ Creative Commons is not a party to its public
858
+ licenses. Notwithstanding, Creative Commons may elect to apply one of
859
+ its public licenses to material it publishes and in those instances
860
+ will be considered the “Licensor.” The text of the Creative Commons
861
+ public licenses is dedicated to the public domain under the CC0 Public
862
+ Domain Dedication. Except for the limited purpose of indicating that
863
+ material is shared under a Creative Commons public license or as
864
+ otherwise permitted by the Creative Commons policies published at
865
+ creativecommons.org/policies, Creative Commons does not authorize the
866
+ use of the trademark "Creative Commons" or any other trademark or logo
867
+ of Creative Commons without its prior written consent including,
868
+ without limitation, in connection with any unauthorized modifications
869
+ to any of its public licenses or any other arrangements,
870
+ understandings, or agreements concerning use of licensed material. For
871
+ the avoidance of doubt, this paragraph does not form part of the
872
+ public licenses.
873
+
874
+ Creative Commons may be contacted at creativecommons.org.
875
+
876
+ ```
877
+
878
+
879
+
880
+
881
+ # chiVe: Japanese Word Embedding with Sudachi & NWJC (chive-1.1-mc90-500k)
882
+
883
+ * Author: Works Applications
884
+ * URL: https://github.com/WorksApplications/chiVe
885
+ * License: Apache-2.0
886
+
887
+ ```
888
+ Apache License
889
+ Version 2.0, January 2004
890
+ http://www.apache.org/licenses/
891
+
892
+ TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION
893
+
894
+ 1. Definitions.
895
+
896
+ "License" shall mean the terms and conditions for use, reproduction,
897
+ and distribution as defined by Sections 1 through 9 of this document.
898
+
899
+ "Licensor" shall mean the copyright owner or entity authorized by
900
+ the copyright owner that is granting the License.
901
+
902
+ "Legal Entity" shall mean the union of the acting entity and all
903
+ other entities that control, are controlled by, or are under common
904
+ control with that entity. For the purposes of this definition,
905
+ "control" means (i) the power, direct or indirect, to cause the
906
+ direction or management of such entity, whether by contract or
907
+ otherwise, or (ii) ownership of fifty percent (50%) or more of the
908
+ outstanding shares, or (iii) beneficial ownership of such entity.
909
+
910
+ "You" (or "Your") shall mean an individual or Legal Entity
911
+ exercising permissions granted by this License.
912
+
913
+ "Source" form shall mean the preferred form for making modifications,
914
+ including but not limited to software source code, documentation
915
+ source, and configuration files.
916
+
917
+ "Object" form shall mean any form resulting from mechanical
918
+ transformation or translation of a Source form, including but
919
+ not limited to compiled object code, generated documentation,
920
+ and conversions to other media types.
921
+
922
+ "Work" shall mean the work of authorship, whether in Source or
923
+ Object form, made available under the License, as indicated by a
924
+ copyright notice that is included in or attached to the work
925
+ (an example is provided in the Appendix below).
926
+
927
+ "Derivative Works" shall mean any work, whether in Source or Object
928
+ form, that is based on (or derived from) the Work and for which the
929
+ editorial revisions, annotations, elaborations, or other modifications
930
+ represent, as a whole, an original work of authorship. For the purposes
931
+ of this License, Derivative Works shall not include works that remain
932
+ separable from, or merely link (or bind by name) to the interfaces of,
933
+ the Work and Derivative Works thereof.
934
+
935
+ "Contribution" shall mean any work of authorship, including
936
+ the original version of the Work and any modifications or additions
937
+ to that Work or Derivative Works thereof, that is intentionally
938
+ submitted to Licensor for inclusion in the Work by the copyright owner
939
+ or by an individual or Legal Entity authorized to submit on behalf of
940
+ the copyright owner. For the purposes of this definition, "submitted"
941
+ means any form of electronic, verbal, or written communication sent
942
+ to the Licensor or its representatives, including but not limited to
943
+ communication on electronic mailing lists, source code control systems,
944
+ and issue tracking systems that are managed by, or on behalf of, the
945
+ Licensor for the purpose of discussing and improving the Work, but
946
+ excluding communication that is conspicuously marked or otherwise
947
+ designated in writing by the copyright owner as "Not a Contribution."
948
+
949
+ "Contributor" shall mean Licensor and any individual or Legal Entity
950
+ on behalf of whom a Contribution has been received by Licensor and
951
+ subsequently incorporated within the Work.
952
+
953
+ 2. Grant of Copyright License. Subject to the terms and conditions of
954
+ this License, each Contributor hereby grants to You a perpetual,
955
+ worldwide, non-exclusive, no-charge, royalty-free, irrevocable
956
+ copyright license to reproduce, prepare Derivative Works of,
957
+ publicly display, publicly perform, sublicense, and distribute the
958
+ Work and such Derivative Works in Source or Object form.
959
+
960
+ 3. Grant of Patent License. Subject to the terms and conditions of
961
+ this License, each Contributor hereby grants to You a perpetual,
962
+ worldwide, non-exclusive, no-charge, royalty-free, irrevocable
963
+ (except as stated in this section) patent license to make, have made,
964
+ use, offer to sell, sell, import, and otherwise transfer the Work,
965
+ where such license applies only to those patent claims licensable
966
+ by such Contributor that are necessarily infringed by their
967
+ Contribution(s) alone or by combination of their Contribution(s)
968
+ with the Work to which such Contribution(s) was submitted. If You
969
+ institute patent litigation against any entity (including a
970
+ cross-claim or counterclaim in a lawsuit) alleging that the Work
971
+ or a Contribution incorporated within the Work constitutes direct
972
+ or contributory patent infringement, then any patent licenses
973
+ granted to You under this License for that Work shall terminate
974
+ as of the date such litigation is filed.
975
+
976
+ 4. Redistribution. You may reproduce and distribute copies of the
977
+ Work or Derivative Works thereof in any medium, with or without
978
+ modifications, and in Source or Object form, provided that You
979
+ meet the following conditions:
980
+
981
+ (a) You must give any other recipients of the Work or
982
+ Derivative Works a copy of this License; and
983
+
984
+ (b) You must cause any modified files to carry prominent notices
985
+ stating that You changed the files; and
986
+
987
+ (c) You must retain, in the Source form of any Derivative Works
988
+ that You distribute, all copyright, patent, trademark, and
989
+ attribution notices from the Source form of the Work,
990
+ excluding those notices that do not pertain to any part of
991
+ the Derivative Works; and
992
+
993
+ (d) If the Work includes a "NOTICE" text file as part of its
994
+ distribution, then any Derivative Works that You distribute must
995
+ include a readable copy of the attribution notices contained
996
+ within such NOTICE file, excluding those notices that do not
997
+ pertain to any part of the Derivative Works, in at least one
998
+ of the following places: within a NOTICE text file distributed
999
+ as part of the Derivative Works; within the Source form or
1000
+ documentation, if provided along with the Derivative Works; or,
1001
+ within a display generated by the Derivative Works, if and
1002
+ wherever such third-party notices normally appear. The contents
1003
+ of the NOTICE file are for informational purposes only and
1004
+ do not modify the License. You may add Your own attribution
1005
+ notices within Derivative Works that You distribute, alongside
1006
+ or as an addendum to the NOTICE text from the Work, provided
1007
+ that such additional attribution notices cannot be construed
1008
+ as modifying the License.
1009
+
1010
+ You may add Your own copyright statement to Your modifications and
1011
+ may provide additional or different license terms and conditions
1012
+ for use, reproduction, or distribution of Your modifications, or
1013
+ for any such Derivative Works as a whole, provided Your use,
1014
+ reproduction, and distribution of the Work otherwise complies with
1015
+ the conditions stated in this License.
1016
+
1017
+ 5. Submission of Contributions. Unless You explicitly state otherwise,
1018
+ any Contribution intentionally submitted for inclusion in the Work
1019
+ by You to the Licensor shall be under the terms and conditions of
1020
+ this License, without any additional terms or conditions.
1021
+ Notwithstanding the above, nothing herein shall supersede or modify
1022
+ the terms of any separate license agreement you may have executed
1023
+ with Licensor regarding such Contributions.
1024
+
1025
+ 6. Trademarks. This License does not grant permission to use the trade
1026
+ names, trademarks, service marks, or product names of the Licensor,
1027
+ except as required for reasonable and customary use in describing the
1028
+ origin of the Work and reproducing the content of the NOTICE file.
1029
+
1030
+ 7. Disclaimer of Warranty. Unless required by applicable law or
1031
+ agreed to in writing, Licensor provides the Work (and each
1032
+ Contributor provides its Contributions) on an "AS IS" BASIS,
1033
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
1034
+ implied, including, without limitation, any warranties or conditions
1035
+ of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A
1036
+ PARTICULAR PURPOSE. You are solely responsible for determining the
1037
+ appropriateness of using or redistributing the Work and assume any
1038
+ risks associated with Your exercise of permissions under this License.
1039
+
1040
+ 8. Limitation of Liability. In no event and under no legal theory,
1041
+ whether in tort (including negligence), contract, or otherwise,
1042
+ unless required by applicable law (such as deliberate and grossly
1043
+ negligent acts) or agreed to in writing, shall any Contributor be
1044
+ liable to You for damages, including any direct, indirect, special,
1045
+ incidental, or consequential damages of any character arising as a
1046
+ result of this License or out of the use or inability to use the
1047
+ Work (including but not limited to damages for loss of goodwill,
1048
+ work stoppage, computer failure or malfunction, or any and all
1049
+ other commercial damages or losses), even if such Contributor
1050
+ has been advised of the possibility of such damages.
1051
+
1052
+ 9. Accepting Warranty or Additional Liability. While redistributing
1053
+ the Work or Derivative Works thereof, You may choose to offer,
1054
+ and charge a fee for, acceptance of support, warranty, indemnity,
1055
+ or other liability obligations and/or rights consistent with this
1056
+ License. However, in accepting such obligations, You may act only
1057
+ on Your own behalf and on Your sole responsibility, not on behalf
1058
+ of any other Contributor, and only if You agree to indemnify,
1059
+ defend, and hold each Contributor harmless for any liability
1060
+ incurred by, or claims asserted against, such Contributor by reason
1061
+ of your accepting any such warranty or additional liability.
1062
+
1063
+ END OF TERMS AND CONDITIONS
1064
+
1065
+ APPENDIX: How to apply the Apache License to your work.
1066
+
1067
+ To apply the Apache License to your work, attach the following
1068
+ boilerplate notice, with the fields enclosed by brackets "[]"
1069
+ replaced with your own identifying information. (Don't include
1070
+ the brackets!) The text should be enclosed in the appropriate
1071
+ comment syntax for the file format. We also recommend that a
1072
+ file or class name and description of purpose be included on the
1073
+ same "printed page" as the copyright notice for easier
1074
+ identification within third-party archives.
1075
+
1076
+ Copyright [yyyy] [name of copyright owner]
1077
+
1078
+ Licensed under the Apache License, Version 2.0 (the "License");
1079
+ you may not use this file except in compliance with the License.
1080
+ You may obtain a copy of the License at
1081
+
1082
+ http://www.apache.org/licenses/LICENSE-2.0
1083
+
1084
+ Unless required by applicable law or agreed to in writing, software
1085
+ distributed under the License is distributed on an "AS IS" BASIS,
1086
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
1087
+ See the License for the specific language governing permissions and
1088
+ limitations under the License.```
1089
+
1090
+
1091
+
1092
+
README.md ADDED
@@ -0,0 +1,104 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - spacy
4
+ - token-classification
5
+ language:
6
+ - ja
7
+ license: CC-BY-SA-4.0
8
+ model-index:
9
+ - name: ja_core_news_lg
10
+ results:
11
+ - tasks:
12
+ name: NER
13
+ type: token-classification
14
+ metrics:
15
+ - name: Precision
16
+ type: precision
17
+ value: 0.760989011
18
+ - name: Recall
19
+ type: recall
20
+ value: 0.7075351213
21
+ - name: F Score
22
+ type: f_score
23
+ value: 0.7332892124
24
+ - tasks:
25
+ name: POS
26
+ type: token-classification
27
+ metrics:
28
+ - name: Accuracy
29
+ type: accuracy
30
+ value: 0.9721899386
31
+ - tasks:
32
+ name: SENTER
33
+ type: token-classification
34
+ metrics:
35
+ - name: Precision
36
+ type: precision
37
+ value: 0.9860557769
38
+ - name: Recall
39
+ type: recall
40
+ value: 0.9880239521
41
+ - name: F Score
42
+ type: f_score
43
+ value: 0.9870388833
44
+ - tasks:
45
+ name: UNLABELED_DEPENDENCIES
46
+ type: token-classification
47
+ metrics:
48
+ - name: Accuracy
49
+ type: accuracy
50
+ value: 0.9181002928
51
+ - tasks:
52
+ name: LABELED_DEPENDENCIES
53
+ type: token-classification
54
+ metrics:
55
+ - name: Accuracy
56
+ type: accuracy
57
+ value: 0.9181002928
58
+ ---
59
+ ### Details: https://spacy.io/models/ja#ja_core_news_lg
60
+
61
+ Japanese pipeline optimized for CPU. Components: tok2vec, parser, senter, ner, attribute_ruler.
62
+
63
+ | Feature | Description |
64
+ | --- | --- |
65
+ | **Name** | `ja_core_news_lg` |
66
+ | **Version** | `3.1.0` |
67
+ | **spaCy** | `>=3.1.0,<3.2.0` |
68
+ | **Default Pipeline** | `tok2vec`, `parser`, `attribute_ruler`, `ner` |
69
+ | **Components** | `tok2vec`, `parser`, `senter`, `attribute_ruler`, `ner` |
70
+ | **Vectors** | 480443 keys, 480443 unique vectors (300 dimensions) |
71
+ | **Sources** | [UD Japanese GSD v2.6](https://github.com/UniversalDependencies/UD_Japanese-GSD) (Omura, Mai; Miyao, Yusuke; Kanayama, Hiroshi; Matsuda, Hiroshi; Wakasa, Aya; Yamashita, Kayo; Asahara, Masayuki; Tanaka, Takaaki; Murawaki, Yugo; Matsumoto, Yuji; Mori, Shinsuke; Uematsu, Sumire; McDonald, Ryan; Nivre, Joakim; Zeman, Daniel)<br />[UD Japanese GSD v2.6 NER](https://github.com/megagonlabs/UD_Japanese-GSD) (Megagon Labs Tokyo)<br />[chiVe: Japanese Word Embedding with Sudachi & NWJC (chive-1.1-mc90-500k)](https://github.com/WorksApplications/chiVe) (Works Applications) |
72
+ | **License** | `CC BY-SA 4.0` |
73
+ | **Author** | [Explosion](https://explosion.ai) |
74
+
75
+ ### Label Scheme
76
+
77
+ <details>
78
+
79
+ <summary>View label scheme (47 labels for 3 components)</summary>
80
+
81
+ | Component | Labels |
82
+ | --- | --- |
83
+ | **`parser`** | `ROOT`, `acl`, `advcl`, `advmod`, `amod`, `aux`, `case`, `cc`, `ccomp`, `compound`, `cop`, `csubj`, `dep`, `det`, `dislocated`, `fixed`, `mark`, `nmod`, `nsubj`, `nummod`, `obj`, `obl`, `punct` |
84
+ | **`senter`** | `I`, `S` |
85
+ | **`ner`** | `CARDINAL`, `DATE`, `EVENT`, `FAC`, `GPE`, `LANGUAGE`, `LAW`, `LOC`, `MONEY`, `MOVEMENT`, `NORP`, `ORDINAL`, `ORG`, `PERCENT`, `PERSON`, `PET_NAME`, `PHONE`, `PRODUCT`, `QUANTITY`, `TIME`, `TITLE_AFFIX`, `WORK_OF_ART` |
86
+
87
+ </details>
88
+
89
+ ### Accuracy
90
+
91
+ | Type | Score |
92
+ | --- | --- |
93
+ | `TOKEN_ACC` | 99.69 |
94
+ | `TAG_ACC` | 97.22 |
95
+ | `POS_ACC` | 96.40 |
96
+ | `MORPH_ACC` | 0.00 |
97
+ | `DEP_UAS` | 91.81 |
98
+ | `DEP_LAS` | 89.98 |
99
+ | `ENTS_P` | 76.10 |
100
+ | `ENTS_R` | 70.75 |
101
+ | `ENTS_F` | 73.33 |
102
+ | `SENTS_P` | 98.61 |
103
+ | `SENTS_R` | 98.80 |
104
+ | `SENTS_F` | 98.70 |
accuracy.json ADDED
@@ -0,0 +1,236 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "token_acc": 0.9968965945,
3
+ "tag_acc": 0.9721899386,
4
+ "pos_acc": 0.9639755682,
5
+ "morph_acc": 0.0,
6
+ "dep_uas": 0.9181002928,
7
+ "dep_las": 0.8998080263,
8
+ "ents_p": 0.760989011,
9
+ "ents_r": 0.7075351213,
10
+ "ents_f": 0.7332892124,
11
+ "sents_p": 0.9860557769,
12
+ "sents_r": 0.9880239521,
13
+ "sents_f": 0.9870388833,
14
+ "speed": 11674.7452722222,
15
+ "morph_per_feat": {
16
+ "Polarity": {
17
+ "p": 0.0,
18
+ "r": 0.0,
19
+ "f": 0.0
20
+ }
21
+ },
22
+ "dep_las_per_type": {
23
+ "cc": {
24
+ "p": 0.7872340426,
25
+ "r": 0.8043478261,
26
+ "f": 0.7956989247
27
+ },
28
+ "nummod": {
29
+ "p": 0.9770114943,
30
+ "r": 0.8762886598,
31
+ "f": 0.9239130435
32
+ },
33
+ "compound": {
34
+ "p": 0.9397972117,
35
+ "r": 0.9205462446,
36
+ "f": 0.9300721229
37
+ },
38
+ "obl": {
39
+ "p": 0.7827102804,
40
+ "r": 0.8052884615,
41
+ "f": 0.7938388626
42
+ },
43
+ "case": {
44
+ "p": 0.986533282,
45
+ "r": 0.9789996182,
46
+ "f": 0.9827520123
47
+ },
48
+ "dislocated": {
49
+ "p": 0.5,
50
+ "r": 0.2105263158,
51
+ "f": 0.2962962963
52
+ },
53
+ "nmod": {
54
+ "p": 0.8676092545,
55
+ "r": 0.813253012,
56
+ "f": 0.8395522388
57
+ },
58
+ "nsubj": {
59
+ "p": 0.7950819672,
60
+ "r": 0.8083333333,
61
+ "f": 0.8016528926
62
+ },
63
+ "root": {
64
+ "p": 0.9717171717,
65
+ "r": 0.9600798403,
66
+ "f": 0.9658634538
67
+ },
68
+ "aux": {
69
+ "p": 0.9625090123,
70
+ "r": 0.9673913043,
71
+ "f": 0.9649439827
72
+ },
73
+ "advcl": {
74
+ "p": 0.6802884615,
75
+ "r": 0.6596736597,
76
+ "f": 0.6698224852
77
+ },
78
+ "mark": {
79
+ "p": 0.956,
80
+ "r": 0.9409448819,
81
+ "f": 0.9484126984
82
+ },
83
+ "acl": {
84
+ "p": 0.7887931034,
85
+ "r": 0.8061674009,
86
+ "f": 0.7973856209
87
+ },
88
+ "obj": {
89
+ "p": 0.950617284,
90
+ "r": 0.9390243902,
91
+ "f": 0.9447852761
92
+ },
93
+ "fixed": {
94
+ "p": 0.9421052632,
95
+ "r": 0.9835164835,
96
+ "f": 0.9623655914
97
+ },
98
+ "advmod": {
99
+ "p": 0.7045454545,
100
+ "r": 0.4920634921,
101
+ "f": 0.5794392523
102
+ },
103
+ "amod": {
104
+ "p": 0.8888888889,
105
+ "r": 0.6,
106
+ "f": 0.7164179104
107
+ },
108
+ "cop": {
109
+ "p": 0.9664804469,
110
+ "r": 0.9505494505,
111
+ "f": 0.9584487535
112
+ },
113
+ "ccomp": {
114
+ "p": 0.9,
115
+ "r": 0.8181818182,
116
+ "f": 0.8571428571
117
+ },
118
+ "det": {
119
+ "p": 0.9803921569,
120
+ "r": 0.9803921569,
121
+ "f": 0.9803921569
122
+ },
123
+ "dep": {
124
+ "p": 0.0,
125
+ "r": 0.0,
126
+ "f": 0.0
127
+ },
128
+ "csubj": {
129
+ "p": 0.8333333333,
130
+ "r": 0.7692307692,
131
+ "f": 0.8
132
+ }
133
+ },
134
+ "ents_per_type": {
135
+ "DATE": {
136
+ "p": 0.9626168224,
137
+ "r": 0.9537037037,
138
+ "f": 0.9581395349
139
+ },
140
+ "ORG": {
141
+ "p": 0.6637931034,
142
+ "r": 0.5877862595,
143
+ "f": 0.6234817814
144
+ },
145
+ "PERSON": {
146
+ "p": 0.780141844,
147
+ "r": 0.7913669065,
148
+ "f": 0.7857142857
149
+ },
150
+ "GPE": {
151
+ "p": 0.6956521739,
152
+ "r": 0.6808510638,
153
+ "f": 0.688172043
154
+ },
155
+ "EVENT": {
156
+ "p": 0.6666666667,
157
+ "r": 0.6153846154,
158
+ "f": 0.64
159
+ },
160
+ "PRODUCT": {
161
+ "p": 0.5666666667,
162
+ "r": 0.4146341463,
163
+ "f": 0.4788732394
164
+ },
165
+ "TIME": {
166
+ "p": 0.6666666667,
167
+ "r": 1.0,
168
+ "f": 0.8
169
+ },
170
+ "QUANTITY": {
171
+ "p": 0.8970588235,
172
+ "r": 0.9242424242,
173
+ "f": 0.9104477612
174
+ },
175
+ "NORP": {
176
+ "p": 0.7037037037,
177
+ "r": 0.59375,
178
+ "f": 0.6440677966
179
+ },
180
+ "TITLE_AFFIX": {
181
+ "p": 0.8571428571,
182
+ "r": 0.6,
183
+ "f": 0.7058823529
184
+ },
185
+ "ORDINAL": {
186
+ "p": 0.65,
187
+ "r": 0.6842105263,
188
+ "f": 0.6666666667
189
+ },
190
+ "WORK_OF_ART": {
191
+ "p": 0.6875,
192
+ "r": 0.6470588235,
193
+ "f": 0.6666666667
194
+ },
195
+ "FAC": {
196
+ "p": 0.5769230769,
197
+ "r": 0.4054054054,
198
+ "f": 0.4761904762
199
+ },
200
+ "PERCENT": {
201
+ "p": 1.0,
202
+ "r": 0.4285714286,
203
+ "f": 0.6
204
+ },
205
+ "LOC": {
206
+ "p": 0.6,
207
+ "r": 0.9,
208
+ "f": 0.72
209
+ },
210
+ "MOVEMENT": {
211
+ "p": 0.3333333333,
212
+ "r": 0.2,
213
+ "f": 0.25
214
+ },
215
+ "LAW": {
216
+ "p": 0.0,
217
+ "r": 0.0,
218
+ "f": 0.0
219
+ },
220
+ "MONEY": {
221
+ "p": 1.0,
222
+ "r": 1.0,
223
+ "f": 1.0
224
+ },
225
+ "LANGUAGE": {
226
+ "p": 1.0,
227
+ "r": 1.0,
228
+ "f": 1.0
229
+ },
230
+ "CARDINAL": {
231
+ "p": 0.0,
232
+ "r": 0.0,
233
+ "f": 0.0
234
+ }
235
+ }
236
+ }
attribute_ruler/patterns ADDED
Binary file (64 Bytes). View file
 
config.cfg ADDED
@@ -0,0 +1,233 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [paths]
2
+ train = "corpus/ja-core-news/train.spacy"
3
+ dev = "corpus/ja-core-news/dev.spacy"
4
+ vectors = "corpus/ja_vectors"
5
+ raw = null
6
+ init_tok2vec = null
7
+ vocab_data = null
8
+
9
+ [system]
10
+ gpu_allocator = null
11
+ seed = 0
12
+
13
+ [nlp]
14
+ lang = "ja"
15
+ pipeline = ["tok2vec","parser","senter","attribute_ruler","ner"]
16
+ disabled = ["senter"]
17
+ before_creation = null
18
+ after_creation = null
19
+ after_pipeline_creation = null
20
+ batch_size = 256
21
+
22
+ [nlp.tokenizer]
23
+ @tokenizers = "spacy.ja.JapaneseTokenizer"
24
+ split_mode = null
25
+
26
+ [components]
27
+
28
+ [components.attribute_ruler]
29
+ factory = "attribute_ruler"
30
+ validate = false
31
+
32
+ [components.ner]
33
+ factory = "ner"
34
+ incorrect_spans_key = null
35
+ moves = null
36
+ update_with_oracle_cut_size = 100
37
+
38
+ [components.ner.model]
39
+ @architectures = "spacy.TransitionBasedParser.v2"
40
+ state_type = "ner"
41
+ extra_state_tokens = false
42
+ hidden_width = 64
43
+ maxout_pieces = 2
44
+ use_upper = true
45
+ nO = null
46
+
47
+ [components.ner.model.tok2vec]
48
+ @architectures = "spacy.Tok2Vec.v2"
49
+
50
+ [components.ner.model.tok2vec.embed]
51
+ @architectures = "spacy.MultiHashEmbed.v2"
52
+ width = 96
53
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
54
+ rows = [5000,2500,2500,2500]
55
+ include_static_vectors = true
56
+
57
+ [components.ner.model.tok2vec.encode]
58
+ @architectures = "spacy.MaxoutWindowEncoder.v2"
59
+ width = 96
60
+ depth = 4
61
+ window_size = 1
62
+ maxout_pieces = 3
63
+
64
+ [components.parser]
65
+ factory = "parser"
66
+ learn_tokens = false
67
+ min_action_freq = 30
68
+ moves = null
69
+ update_with_oracle_cut_size = 100
70
+
71
+ [components.parser.model]
72
+ @architectures = "spacy.TransitionBasedParser.v2"
73
+ state_type = "parser"
74
+ extra_state_tokens = false
75
+ hidden_width = 64
76
+ maxout_pieces = 2
77
+ use_upper = true
78
+ nO = null
79
+
80
+ [components.parser.model.tok2vec]
81
+ @architectures = "spacy.Tok2VecListener.v1"
82
+ width = ${components.tok2vec.model.encode:width}
83
+ upstream = "tok2vec"
84
+
85
+ [components.senter]
86
+ factory = "senter"
87
+
88
+ [components.senter.model]
89
+ @architectures = "spacy.Tagger.v1"
90
+ nO = null
91
+
92
+ [components.senter.model.tok2vec]
93
+ @architectures = "spacy.Tok2Vec.v2"
94
+
95
+ [components.senter.model.tok2vec.embed]
96
+ @architectures = "spacy.MultiHashEmbed.v2"
97
+ width = 16
98
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
99
+ rows = [1000,500,500,500]
100
+ include_static_vectors = true
101
+
102
+ [components.senter.model.tok2vec.encode]
103
+ @architectures = "spacy.MaxoutWindowEncoder.v2"
104
+ width = 16
105
+ depth = 2
106
+ window_size = 1
107
+ maxout_pieces = 2
108
+
109
+ [components.tok2vec]
110
+ factory = "tok2vec"
111
+
112
+ [components.tok2vec.model]
113
+ @architectures = "spacy.Tok2Vec.v2"
114
+
115
+ [components.tok2vec.model.embed]
116
+ @architectures = "spacy.MultiHashEmbed.v2"
117
+ width = ${components.tok2vec.model.encode:width}
118
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
119
+ rows = [5000,2500,2500,2500]
120
+ include_static_vectors = true
121
+
122
+ [components.tok2vec.model.encode]
123
+ @architectures = "spacy.MaxoutWindowEncoder.v2"
124
+ width = 96
125
+ depth = 4
126
+ window_size = 1
127
+ maxout_pieces = 3
128
+
129
+ [corpora]
130
+
131
+ [corpora.dev]
132
+ @readers = "spacy.Corpus.v1"
133
+ limit = 0
134
+ max_length = 0
135
+ path = ${paths:dev}
136
+ gold_preproc = false
137
+ augmenter = null
138
+
139
+ [corpora.train]
140
+ @readers = "spacy.Corpus.v1"
141
+ path = ${paths:train}
142
+ max_length = 5000
143
+ gold_preproc = false
144
+ limit = 0
145
+ augmenter = null
146
+
147
+ [training]
148
+ train_corpus = "corpora.train"
149
+ dev_corpus = "corpora.dev"
150
+ seed = ${system:seed}
151
+ gpu_allocator = ${system:gpu_allocator}
152
+ dropout = 0.1
153
+ accumulate_gradient = 1
154
+ patience = 5000
155
+ max_epochs = 0
156
+ max_steps = 0
157
+ eval_frequency = 1000
158
+ frozen_components = []
159
+ before_to_disk = null
160
+ annotating_components = []
161
+
162
+ [training.batcher]
163
+ @batchers = "spacy.batch_by_words.v1"
164
+ discard_oversize = false
165
+ tolerance = 0.2
166
+ get_length = null
167
+
168
+ [training.batcher.size]
169
+ @schedules = "compounding.v1"
170
+ start = 100
171
+ stop = 1000
172
+ compound = 1.001
173
+ t = 0.0
174
+
175
+ [training.logger]
176
+ @loggers = "spacy.WandbLogger.v1"
177
+ project_name = "spacy-v3.0.0a2"
178
+ remove_config_values = []
179
+
180
+ [training.optimizer]
181
+ @optimizers = "Adam.v1"
182
+ beta1 = 0.9
183
+ beta2 = 0.999
184
+ L2_is_weight_decay = true
185
+ L2 = 0.01
186
+ grad_clip = 1.0
187
+ use_averages = true
188
+ eps = 0.00000001
189
+ learn_rate = 0.001
190
+
191
+ [training.score_weights]
192
+ dep_uas = 0.0
193
+ dep_las = 0.45
194
+ dep_las_per_type = null
195
+ sents_p = null
196
+ sents_r = null
197
+ sents_f = 0.06
198
+ ents_f = 0.5
199
+ ents_p = 0.0
200
+ ents_r = 0.0
201
+ ents_per_type = null
202
+
203
+ [pretraining]
204
+
205
+ [initialize]
206
+ vocab_data = ${paths.vocab_data}
207
+ vectors = ${paths.vectors}
208
+ init_tok2vec = ${paths.init_tok2vec}
209
+ before_init = null
210
+ after_init = null
211
+
212
+ [initialize.components]
213
+
214
+ [initialize.components.ner]
215
+
216
+ [initialize.components.ner.labels]
217
+ @readers = "spacy.read_labels.v1"
218
+ path = "corpus/labels/ner.json"
219
+ require = false
220
+
221
+ [initialize.components.parser]
222
+
223
+ [initialize.components.parser.labels]
224
+ @readers = "spacy.read_labels.v1"
225
+ path = "corpus/labels/parser.json"
226
+ require = false
227
+
228
+ [initialize.lookups]
229
+ @misc = "spacy.LookupsDataLoader.v1"
230
+ lang = ${nlp.lang}
231
+ tables = []
232
+
233
+ [initialize.tokenizer]
ja_core_news_lg-any-py3-none-any.whl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:21c62d122014d7cb087f595285c74023bb80e7f27d9c8f558c89e5593bf1bfb5
3
+ size 555963364
meta.json ADDED
@@ -0,0 +1,355 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "lang":"ja",
3
+ "name":"core_news_lg",
4
+ "version":"3.1.0",
5
+ "description":"Japanese pipeline optimized for CPU. Components: tok2vec, parser, senter, ner, attribute_ruler.",
6
+ "author":"Explosion",
7
+ "email":"contact@explosion.ai",
8
+ "url":"https://explosion.ai",
9
+ "license":"CC BY-SA 4.0",
10
+ "spacy_version":">=3.1.0,<3.2.0",
11
+ "spacy_git_version":"caba63b74",
12
+ "vectors":{
13
+ "width":300,
14
+ "vectors":480443,
15
+ "keys":480443,
16
+ "name":"ja_vectors"
17
+ },
18
+ "labels":{
19
+ "tok2vec":[
20
+
21
+ ],
22
+ "parser":[
23
+ "ROOT",
24
+ "acl",
25
+ "advcl",
26
+ "advmod",
27
+ "amod",
28
+ "aux",
29
+ "case",
30
+ "cc",
31
+ "ccomp",
32
+ "compound",
33
+ "cop",
34
+ "csubj",
35
+ "dep",
36
+ "det",
37
+ "dislocated",
38
+ "fixed",
39
+ "mark",
40
+ "nmod",
41
+ "nsubj",
42
+ "nummod",
43
+ "obj",
44
+ "obl",
45
+ "punct"
46
+ ],
47
+ "senter":[
48
+ "I",
49
+ "S"
50
+ ],
51
+ "attribute_ruler":[
52
+
53
+ ],
54
+ "ner":[
55
+ "CARDINAL",
56
+ "DATE",
57
+ "EVENT",
58
+ "FAC",
59
+ "GPE",
60
+ "LANGUAGE",
61
+ "LAW",
62
+ "LOC",
63
+ "MONEY",
64
+ "MOVEMENT",
65
+ "NORP",
66
+ "ORDINAL",
67
+ "ORG",
68
+ "PERCENT",
69
+ "PERSON",
70
+ "PET_NAME",
71
+ "PHONE",
72
+ "PRODUCT",
73
+ "QUANTITY",
74
+ "TIME",
75
+ "TITLE_AFFIX",
76
+ "WORK_OF_ART"
77
+ ]
78
+ },
79
+ "pipeline":[
80
+ "tok2vec",
81
+ "parser",
82
+ "attribute_ruler",
83
+ "ner"
84
+ ],
85
+ "components":[
86
+ "tok2vec",
87
+ "parser",
88
+ "senter",
89
+ "attribute_ruler",
90
+ "ner"
91
+ ],
92
+ "disabled":[
93
+ "senter"
94
+ ],
95
+ "performance":{
96
+ "token_acc":0.9968965945,
97
+ "tag_acc":0.9721899386,
98
+ "pos_acc":0.9639755682,
99
+ "morph_acc":0.0,
100
+ "dep_uas":0.9181002928,
101
+ "dep_las":0.8998080263,
102
+ "ents_p":0.760989011,
103
+ "ents_r":0.7075351213,
104
+ "ents_f":0.7332892124,
105
+ "sents_p":0.9860557769,
106
+ "sents_r":0.9880239521,
107
+ "sents_f":0.9870388833,
108
+ "speed":11674.7452722222,
109
+ "morph_per_feat":{
110
+ "Polarity":{
111
+ "p":0.0,
112
+ "r":0.0,
113
+ "f":0.0
114
+ }
115
+ },
116
+ "dep_las_per_type":{
117
+ "cc":{
118
+ "p":0.7872340426,
119
+ "r":0.8043478261,
120
+ "f":0.7956989247
121
+ },
122
+ "nummod":{
123
+ "p":0.9770114943,
124
+ "r":0.8762886598,
125
+ "f":0.9239130435
126
+ },
127
+ "compound":{
128
+ "p":0.9397972117,
129
+ "r":0.9205462446,
130
+ "f":0.9300721229
131
+ },
132
+ "obl":{
133
+ "p":0.7827102804,
134
+ "r":0.8052884615,
135
+ "f":0.7938388626
136
+ },
137
+ "case":{
138
+ "p":0.986533282,
139
+ "r":0.9789996182,
140
+ "f":0.9827520123
141
+ },
142
+ "dislocated":{
143
+ "p":0.5,
144
+ "r":0.2105263158,
145
+ "f":0.2962962963
146
+ },
147
+ "nmod":{
148
+ "p":0.8676092545,
149
+ "r":0.813253012,
150
+ "f":0.8395522388
151
+ },
152
+ "nsubj":{
153
+ "p":0.7950819672,
154
+ "r":0.8083333333,
155
+ "f":0.8016528926
156
+ },
157
+ "root":{
158
+ "p":0.9717171717,
159
+ "r":0.9600798403,
160
+ "f":0.9658634538
161
+ },
162
+ "aux":{
163
+ "p":0.9625090123,
164
+ "r":0.9673913043,
165
+ "f":0.9649439827
166
+ },
167
+ "advcl":{
168
+ "p":0.6802884615,
169
+ "r":0.6596736597,
170
+ "f":0.6698224852
171
+ },
172
+ "mark":{
173
+ "p":0.956,
174
+ "r":0.9409448819,
175
+ "f":0.9484126984
176
+ },
177
+ "acl":{
178
+ "p":0.7887931034,
179
+ "r":0.8061674009,
180
+ "f":0.7973856209
181
+ },
182
+ "obj":{
183
+ "p":0.950617284,
184
+ "r":0.9390243902,
185
+ "f":0.9447852761
186
+ },
187
+ "fixed":{
188
+ "p":0.9421052632,
189
+ "r":0.9835164835,
190
+ "f":0.9623655914
191
+ },
192
+ "advmod":{
193
+ "p":0.7045454545,
194
+ "r":0.4920634921,
195
+ "f":0.5794392523
196
+ },
197
+ "amod":{
198
+ "p":0.8888888889,
199
+ "r":0.6,
200
+ "f":0.7164179104
201
+ },
202
+ "cop":{
203
+ "p":0.9664804469,
204
+ "r":0.9505494505,
205
+ "f":0.9584487535
206
+ },
207
+ "ccomp":{
208
+ "p":0.9,
209
+ "r":0.8181818182,
210
+ "f":0.8571428571
211
+ },
212
+ "det":{
213
+ "p":0.9803921569,
214
+ "r":0.9803921569,
215
+ "f":0.9803921569
216
+ },
217
+ "dep":{
218
+ "p":0.0,
219
+ "r":0.0,
220
+ "f":0.0
221
+ },
222
+ "csubj":{
223
+ "p":0.8333333333,
224
+ "r":0.7692307692,
225
+ "f":0.8
226
+ }
227
+ },
228
+ "ents_per_type":{
229
+ "DATE":{
230
+ "p":0.9626168224,
231
+ "r":0.9537037037,
232
+ "f":0.9581395349
233
+ },
234
+ "ORG":{
235
+ "p":0.6637931034,
236
+ "r":0.5877862595,
237
+ "f":0.6234817814
238
+ },
239
+ "PERSON":{
240
+ "p":0.780141844,
241
+ "r":0.7913669065,
242
+ "f":0.7857142857
243
+ },
244
+ "GPE":{
245
+ "p":0.6956521739,
246
+ "r":0.6808510638,
247
+ "f":0.688172043
248
+ },
249
+ "EVENT":{
250
+ "p":0.6666666667,
251
+ "r":0.6153846154,
252
+ "f":0.64
253
+ },
254
+ "PRODUCT":{
255
+ "p":0.5666666667,
256
+ "r":0.4146341463,
257
+ "f":0.4788732394
258
+ },
259
+ "TIME":{
260
+ "p":0.6666666667,
261
+ "r":1.0,
262
+ "f":0.8
263
+ },
264
+ "QUANTITY":{
265
+ "p":0.8970588235,
266
+ "r":0.9242424242,
267
+ "f":0.9104477612
268
+ },
269
+ "NORP":{
270
+ "p":0.7037037037,
271
+ "r":0.59375,
272
+ "f":0.6440677966
273
+ },
274
+ "TITLE_AFFIX":{
275
+ "p":0.8571428571,
276
+ "r":0.6,
277
+ "f":0.7058823529
278
+ },
279
+ "ORDINAL":{
280
+ "p":0.65,
281
+ "r":0.6842105263,
282
+ "f":0.6666666667
283
+ },
284
+ "WORK_OF_ART":{
285
+ "p":0.6875,
286
+ "r":0.6470588235,
287
+ "f":0.6666666667
288
+ },
289
+ "FAC":{
290
+ "p":0.5769230769,
291
+ "r":0.4054054054,
292
+ "f":0.4761904762
293
+ },
294
+ "PERCENT":{
295
+ "p":1.0,
296
+ "r":0.4285714286,
297
+ "f":0.6
298
+ },
299
+ "LOC":{
300
+ "p":0.6,
301
+ "r":0.9,
302
+ "f":0.72
303
+ },
304
+ "MOVEMENT":{
305
+ "p":0.3333333333,
306
+ "r":0.2,
307
+ "f":0.25
308
+ },
309
+ "LAW":{
310
+ "p":0.0,
311
+ "r":0.0,
312
+ "f":0.0
313
+ },
314
+ "MONEY":{
315
+ "p":1.0,
316
+ "r":1.0,
317
+ "f":1.0
318
+ },
319
+ "LANGUAGE":{
320
+ "p":1.0,
321
+ "r":1.0,
322
+ "f":1.0
323
+ },
324
+ "CARDINAL":{
325
+ "p":0.0,
326
+ "r":0.0,
327
+ "f":0.0
328
+ }
329
+ }
330
+ },
331
+ "sources":[
332
+ {
333
+ "name":"UD Japanese GSD v2.6",
334
+ "url":"https://github.com/UniversalDependencies/UD_Japanese-GSD",
335
+ "license":"CC BY-SA 4.0",
336
+ "author":"Omura, Mai; Miyao, Yusuke; Kanayama, Hiroshi; Matsuda, Hiroshi; Wakasa, Aya; Yamashita, Kayo; Asahara, Masayuki; Tanaka, Takaaki; Murawaki, Yugo; Matsumoto, Yuji; Mori, Shinsuke; Uematsu, Sumire; McDonald, Ryan; Nivre, Joakim; Zeman, Daniel"
337
+ },
338
+ {
339
+ "name":"UD Japanese GSD v2.6 NER",
340
+ "url":"https://github.com/megagonlabs/UD_Japanese-GSD",
341
+ "license":"CC BY-SA 4.0",
342
+ "author":"Megagon Labs Tokyo"
343
+ },
344
+ {
345
+ "name":"chiVe: Japanese Word Embedding with Sudachi & NWJC (chive-1.1-mc90-500k)",
346
+ "url":"https://github.com/WorksApplications/chiVe",
347
+ "license":"Apache-2.0",
348
+ "author":"Works Applications"
349
+ }
350
+ ],
351
+ "requirements":[
352
+ "sudachipy>=0.4.9",
353
+ "sudachidict-core>=20200330"
354
+ ]
355
+ }
ner/cfg ADDED
@@ -0,0 +1,13 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "moves":null,
3
+ "update_with_oracle_cut_size":100,
4
+ "multitasks":[
5
+
6
+ ],
7
+ "min_action_freq":1,
8
+ "learn_tokens":false,
9
+ "beam_width":1,
10
+ "beam_density":0.0,
11
+ "beam_update_prob":0.0,
12
+ "incorrect_spans_key":null
13
+ }
ner/model ADDED
Binary file (6.96 MB). View file
 
ner/moves ADDED
@@ -0,0 +1 @@
 
 
1
+ ��moves��{"0":{},"1":{"DATE":4112,"ORG":3465,"PERSON":2992,"QUANTITY":2502,"GPE":1927,"PRODUCT":1317,"FAC":1230,"ORDINAL":1095,"WORK_OF_ART":1022,"EVENT":865,"NORP":732,"LOC":557,"MONEY":400,"TITLE_AFFIX":343,"TIME":294,"PERCENT":272,"MOVEMENT":148,"LAW":94,"LANGUAGE":78,"CARDINAL":27,"PET_NAME":19,"PHONE":4},"2":{"DATE":4112,"ORG":3465,"PERSON":2992,"QUANTITY":2502,"GPE":1927,"PRODUCT":1317,"FAC":1230,"ORDINAL":1095,"WORK_OF_ART":1022,"EVENT":865,"NORP":732,"LOC":557,"MONEY":400,"TITLE_AFFIX":343,"TIME":294,"PERCENT":272,"MOVEMENT":148,"LAW":94,"LANGUAGE":78,"CARDINAL":27,"PET_NAME":19,"PHONE":4},"3":{"DATE":4112,"ORG":3465,"PERSON":2992,"QUANTITY":2502,"GPE":1927,"PRODUCT":1317,"FAC":1230,"ORDINAL":1095,"WORK_OF_ART":1022,"EVENT":865,"NORP":732,"LOC":557,"MONEY":400,"TITLE_AFFIX":343,"TIME":294,"PERCENT":272,"MOVEMENT":148,"LAW":94,"LANGUAGE":78,"CARDINAL":27,"PET_NAME":19,"PHONE":4},"4":{"DATE":4112,"ORG":3465,"PERSON":2992,"QUANTITY":2502,"GPE":1927,"PRODUCT":1317,"FAC":1230,"ORDINAL":1095,"WORK_OF_ART":1022,"EVENT":865,"NORP":732,"LOC":557,"MONEY":400,"TITLE_AFFIX":343,"TIME":294,"PERCENT":272,"MOVEMENT":148,"LAW":94,"LANGUAGE":78,"CARDINAL":27,"PET_NAME":19,"PHONE":4,"":1},"5":{"":1}}�cfg��neg_key�
parser/cfg ADDED
@@ -0,0 +1,13 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "moves":null,
3
+ "update_with_oracle_cut_size":100,
4
+ "multitasks":[
5
+
6
+ ],
7
+ "min_action_freq":30,
8
+ "learn_tokens":false,
9
+ "beam_width":1,
10
+ "beam_density":0.0,
11
+ "beam_update_prob":0.0,
12
+ "incorrect_spans_key":null
13
+ }
parser/model ADDED
Binary file (300 kB). View file
 
parser/moves ADDED
@@ -0,0 +1 @@
 
 
1
+ ��moves�q{"0":{"":75008},"1":{"":80671},"2":{"compound":20642,"obl":11201,"nmod":11139,"nsubj":6348,"acl":6215,"advcl":6023,"obj":4334,"nummod":3800,"advmod":1393,"punct":1249,"det":813,"cc":695,"amod":366,"ccomp":327,"dislocated":266,"csubj":159,"dep":0},"3":{"case":35563,"aux":18454,"punct":14888,"mark":6577,"fixed":2698,"cop":2198,"compound":248,"dep":0},"4":{"ROOT":6787}}�cfg��neg_key�
senter/cfg ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ {
2
+
3
+ }
senter/model ADDED
Binary file (213 kB). View file
 
tok2vec/cfg ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ {
2
+
3
+ }
tok2vec/model ADDED
Binary file (6.81 MB). View file
 
tokenizer/cfg ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ {
2
+ "split_mode":null
3
+ }
vocab/key2row ADDED
Binary file (6.59 MB). View file
 
vocab/lookups.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:76be8b528d0075f7aae98d6fa57a6d3c83ae480a8469e668d7b0af968995ac71
3
+ size 1
vocab/strings.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:11826e1ac851be0655fd038aba542199014396e08ad6c396ee6b0e70f9f1e0e8
3
+ size 13755465
vocab/vectors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0ab049bd7ffc6fd440b070f4b29eecfa625363955b1b47b90b61ccc30fe55866
3
+ size 576531728