osanseviero HF staff commited on
Commit
5564313
1 Parent(s): 475b38a

Update spaCy pipeline

Browse files
.gitattributes CHANGED
@@ -14,3 +14,7 @@
14
  *.pb filter=lfs diff=lfs merge=lfs -text
15
  *.pt filter=lfs diff=lfs merge=lfs -text
16
  *.pth filter=lfs diff=lfs merge=lfs -text
 
 
 
 
14
  *.pb filter=lfs diff=lfs merge=lfs -text
15
  *.pt filter=lfs diff=lfs merge=lfs -text
16
  *.pth filter=lfs diff=lfs merge=lfs -text
17
+ *.whl filter=lfs diff=lfs merge=lfs -text
18
+ *.npz filter=lfs diff=lfs merge=lfs -text
19
+ *strings.json filter=lfs diff=lfs merge=lfs -text
20
+ vectors filter=lfs diff=lfs merge=lfs -text
LICENSE ADDED
@@ -0,0 +1,428 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Attribution-ShareAlike 4.0 International
2
+
3
+ =======================================================================
4
+
5
+ Creative Commons Corporation ("Creative Commons") is not a law firm and
6
+ does not provide legal services or legal advice. Distribution of
7
+ Creative Commons public licenses does not create a lawyer-client or
8
+ other relationship. Creative Commons makes its licenses and related
9
+ information available on an "as-is" basis. Creative Commons gives no
10
+ warranties regarding its licenses, any material licensed under their
11
+ terms and conditions, or any related information. Creative Commons
12
+ disclaims all liability for damages resulting from their use to the
13
+ fullest extent possible.
14
+
15
+ Using Creative Commons Public Licenses
16
+
17
+ Creative Commons public licenses provide a standard set of terms and
18
+ conditions that creators and other rights holders may use to share
19
+ original works of authorship and other material subject to copyright
20
+ and certain other rights specified in the public license below. The
21
+ following considerations are for informational purposes only, are not
22
+ exhaustive, and do not form part of our licenses.
23
+
24
+ Considerations for licensors: Our public licenses are
25
+ intended for use by those authorized to give the public
26
+ permission to use material in ways otherwise restricted by
27
+ copyright and certain other rights. Our licenses are
28
+ irrevocable. Licensors should read and understand the terms
29
+ and conditions of the license they choose before applying it.
30
+ Licensors should also secure all rights necessary before
31
+ applying our licenses so that the public can reuse the
32
+ material as expected. Licensors should clearly mark any
33
+ material not subject to the license. This includes other CC-
34
+ licensed material, or material used under an exception or
35
+ limitation to copyright. More considerations for licensors:
36
+ wiki.creativecommons.org/Considerations_for_licensors
37
+
38
+ Considerations for the public: By using one of our public
39
+ licenses, a licensor grants the public permission to use the
40
+ licensed material under specified terms and conditions. If
41
+ the licensor's permission is not necessary for any reason--for
42
+ example, because of any applicable exception or limitation to
43
+ copyright--then that use is not regulated by the license. Our
44
+ licenses grant only permissions under copyright and certain
45
+ other rights that a licensor has authority to grant. Use of
46
+ the licensed material may still be restricted for other
47
+ reasons, including because others have copyright or other
48
+ rights in the material. A licensor may make special requests,
49
+ such as asking that all changes be marked or described.
50
+ Although not required by our licenses, you are encouraged to
51
+ respect those requests where reasonable. More considerations
52
+ for the public:
53
+ wiki.creativecommons.org/Considerations_for_licensees
54
+
55
+ =======================================================================
56
+
57
+ Creative Commons Attribution-ShareAlike 4.0 International Public
58
+ License
59
+
60
+ By exercising the Licensed Rights (defined below), You accept and agree
61
+ to be bound by the terms and conditions of this Creative Commons
62
+ Attribution-ShareAlike 4.0 International Public License ("Public
63
+ License"). To the extent this Public License may be interpreted as a
64
+ contract, You are granted the Licensed Rights in consideration of Your
65
+ acceptance of these terms and conditions, and the Licensor grants You
66
+ such rights in consideration of benefits the Licensor receives from
67
+ making the Licensed Material available under these terms and
68
+ conditions.
69
+
70
+
71
+ Section 1 -- Definitions.
72
+
73
+ a. Adapted Material means material subject to Copyright and Similar
74
+ Rights that is derived from or based upon the Licensed Material
75
+ and in which the Licensed Material is translated, altered,
76
+ arranged, transformed, or otherwise modified in a manner requiring
77
+ permission under the Copyright and Similar Rights held by the
78
+ Licensor. For purposes of this Public License, where the Licensed
79
+ Material is a musical work, performance, or sound recording,
80
+ Adapted Material is always produced where the Licensed Material is
81
+ synched in timed relation with a moving image.
82
+
83
+ b. Adapter's License means the license You apply to Your Copyright
84
+ and Similar Rights in Your contributions to Adapted Material in
85
+ accordance with the terms and conditions of this Public License.
86
+
87
+ c. BY-SA Compatible License means a license listed at
88
+ creativecommons.org/compatiblelicenses, approved by Creative
89
+ Commons as essentially the equivalent of this Public License.
90
+
91
+ d. Copyright and Similar Rights means copyright and/or similar rights
92
+ closely related to copyright including, without limitation,
93
+ performance, broadcast, sound recording, and Sui Generis Database
94
+ Rights, without regard to how the rights are labeled or
95
+ categorized. For purposes of this Public License, the rights
96
+ specified in Section 2(b)(1)-(2) are not Copyright and Similar
97
+ Rights.
98
+
99
+ e. Effective Technological Measures means those measures that, in the
100
+ absence of proper authority, may not be circumvented under laws
101
+ fulfilling obligations under Article 11 of the WIPO Copyright
102
+ Treaty adopted on December 20, 1996, and/or similar international
103
+ agreements.
104
+
105
+ f. Exceptions and Limitations means fair use, fair dealing, and/or
106
+ any other exception or limitation to Copyright and Similar Rights
107
+ that applies to Your use of the Licensed Material.
108
+
109
+ g. License Elements means the license attributes listed in the name
110
+ of a Creative Commons Public License. The License Elements of this
111
+ Public License are Attribution and ShareAlike.
112
+
113
+ h. Licensed Material means the artistic or literary work, database,
114
+ or other material to which the Licensor applied this Public
115
+ License.
116
+
117
+ i. Licensed Rights means the rights granted to You subject to the
118
+ terms and conditions of this Public License, which are limited to
119
+ all Copyright and Similar Rights that apply to Your use of the
120
+ Licensed Material and that the Licensor has authority to license.
121
+
122
+ j. Licensor means the individual(s) or entity(ies) granting rights
123
+ under this Public License.
124
+
125
+ k. Share means to provide material to the public by any means or
126
+ process that requires permission under the Licensed Rights, such
127
+ as reproduction, public display, public performance, distribution,
128
+ dissemination, communication, or importation, and to make material
129
+ available to the public including in ways that members of the
130
+ public may access the material from a place and at a time
131
+ individually chosen by them.
132
+
133
+ l. Sui Generis Database Rights means rights other than copyright
134
+ resulting from Directive 96/9/EC of the European Parliament and of
135
+ the Council of 11 March 1996 on the legal protection of databases,
136
+ as amended and/or succeeded, as well as other essentially
137
+ equivalent rights anywhere in the world.
138
+
139
+ m. You means the individual or entity exercising the Licensed Rights
140
+ under this Public License. Your has a corresponding meaning.
141
+
142
+
143
+ Section 2 -- Scope.
144
+
145
+ a. License grant.
146
+
147
+ 1. Subject to the terms and conditions of this Public License,
148
+ the Licensor hereby grants You a worldwide, royalty-free,
149
+ non-sublicensable, non-exclusive, irrevocable license to
150
+ exercise the Licensed Rights in the Licensed Material to:
151
+
152
+ a. reproduce and Share the Licensed Material, in whole or
153
+ in part; and
154
+
155
+ b. produce, reproduce, and Share Adapted Material.
156
+
157
+ 2. Exceptions and Limitations. For the avoidance of doubt, where
158
+ Exceptions and Limitations apply to Your use, this Public
159
+ License does not apply, and You do not need to comply with
160
+ its terms and conditions.
161
+
162
+ 3. Term. The term of this Public License is specified in Section
163
+ 6(a).
164
+
165
+ 4. Media and formats; technical modifications allowed. The
166
+ Licensor authorizes You to exercise the Licensed Rights in
167
+ all media and formats whether now known or hereafter created,
168
+ and to make technical modifications necessary to do so. The
169
+ Licensor waives and/or agrees not to assert any right or
170
+ authority to forbid You from making technical modifications
171
+ necessary to exercise the Licensed Rights, including
172
+ technical modifications necessary to circumvent Effective
173
+ Technological Measures. For purposes of this Public License,
174
+ simply making modifications authorized by this Section 2(a)
175
+ (4) never produces Adapted Material.
176
+
177
+ 5. Downstream recipients.
178
+
179
+ a. Offer from the Licensor -- Licensed Material. Every
180
+ recipient of the Licensed Material automatically
181
+ receives an offer from the Licensor to exercise the
182
+ Licensed Rights under the terms and conditions of this
183
+ Public License.
184
+
185
+ b. Additional offer from the Licensor -- Adapted Material.
186
+ Every recipient of Adapted Material from You
187
+ automatically receives an offer from the Licensor to
188
+ exercise the Licensed Rights in the Adapted Material
189
+ under the conditions of the Adapter's License You apply.
190
+
191
+ c. No downstream restrictions. You may not offer or impose
192
+ any additional or different terms or conditions on, or
193
+ apply any Effective Technological Measures to, the
194
+ Licensed Material if doing so restricts exercise of the
195
+ Licensed Rights by any recipient of the Licensed
196
+ Material.
197
+
198
+ 6. No endorsement. Nothing in this Public License constitutes or
199
+ may be construed as permission to assert or imply that You
200
+ are, or that Your use of the Licensed Material is, connected
201
+ with, or sponsored, endorsed, or granted official status by,
202
+ the Licensor or others designated to receive attribution as
203
+ provided in Section 3(a)(1)(A)(i).
204
+
205
+ b. Other rights.
206
+
207
+ 1. Moral rights, such as the right of integrity, are not
208
+ licensed under this Public License, nor are publicity,
209
+ privacy, and/or other similar personality rights; however, to
210
+ the extent possible, the Licensor waives and/or agrees not to
211
+ assert any such rights held by the Licensor to the limited
212
+ extent necessary to allow You to exercise the Licensed
213
+ Rights, but not otherwise.
214
+
215
+ 2. Patent and trademark rights are not licensed under this
216
+ Public License.
217
+
218
+ 3. To the extent possible, the Licensor waives any right to
219
+ collect royalties from You for the exercise of the Licensed
220
+ Rights, whether directly or through a collecting society
221
+ under any voluntary or waivable statutory or compulsory
222
+ licensing scheme. In all other cases the Licensor expressly
223
+ reserves any right to collect such royalties.
224
+
225
+
226
+ Section 3 -- License Conditions.
227
+
228
+ Your exercise of the Licensed Rights is expressly made subject to the
229
+ following conditions.
230
+
231
+ a. Attribution.
232
+
233
+ 1. If You Share the Licensed Material (including in modified
234
+ form), You must:
235
+
236
+ a. retain the following if it is supplied by the Licensor
237
+ with the Licensed Material:
238
+
239
+ i. identification of the creator(s) of the Licensed
240
+ Material and any others designated to receive
241
+ attribution, in any reasonable manner requested by
242
+ the Licensor (including by pseudonym if
243
+ designated);
244
+
245
+ ii. a copyright notice;
246
+
247
+ iii. a notice that refers to this Public License;
248
+
249
+ iv. a notice that refers to the disclaimer of
250
+ warranties;
251
+
252
+ v. a URI or hyperlink to the Licensed Material to the
253
+ extent reasonably practicable;
254
+
255
+ b. indicate if You modified the Licensed Material and
256
+ retain an indication of any previous modifications; and
257
+
258
+ c. indicate the Licensed Material is licensed under this
259
+ Public License, and include the text of, or the URI or
260
+ hyperlink to, this Public License.
261
+
262
+ 2. You may satisfy the conditions in Section 3(a)(1) in any
263
+ reasonable manner based on the medium, means, and context in
264
+ which You Share the Licensed Material. For example, it may be
265
+ reasonable to satisfy the conditions by providing a URI or
266
+ hyperlink to a resource that includes the required
267
+ information.
268
+
269
+ 3. If requested by the Licensor, You must remove any of the
270
+ information required by Section 3(a)(1)(A) to the extent
271
+ reasonably practicable.
272
+
273
+ b. ShareAlike.
274
+
275
+ In addition to the conditions in Section 3(a), if You Share
276
+ Adapted Material You produce, the following conditions also apply.
277
+
278
+ 1. The Adapter's License You apply must be a Creative Commons
279
+ license with the same License Elements, this version or
280
+ later, or a BY-SA Compatible License.
281
+
282
+ 2. You must include the text of, or the URI or hyperlink to, the
283
+ Adapter's License You apply. You may satisfy this condition
284
+ in any reasonable manner based on the medium, means, and
285
+ context in which You Share Adapted Material.
286
+
287
+ 3. You may not offer or impose any additional or different terms
288
+ or conditions on, or apply any Effective Technological
289
+ Measures to, Adapted Material that restrict exercise of the
290
+ rights granted under the Adapter's License You apply.
291
+
292
+
293
+ Section 4 -- Sui Generis Database Rights.
294
+
295
+ Where the Licensed Rights include Sui Generis Database Rights that
296
+ apply to Your use of the Licensed Material:
297
+
298
+ a. for the avoidance of doubt, Section 2(a)(1) grants You the right
299
+ to extract, reuse, reproduce, and Share all or a substantial
300
+ portion of the contents of the database;
301
+
302
+ b. if You include all or a substantial portion of the database
303
+ contents in a database in which You have Sui Generis Database
304
+ Rights, then the database in which You have Sui Generis Database
305
+ Rights (but not its individual contents) is Adapted Material,
306
+
307
+ including for purposes of Section 3(b); and
308
+ c. You must comply with the conditions in Section 3(a) if You Share
309
+ all or a substantial portion of the contents of the database.
310
+
311
+ For the avoidance of doubt, this Section 4 supplements and does not
312
+ replace Your obligations under this Public License where the Licensed
313
+ Rights include other Copyright and Similar Rights.
314
+
315
+
316
+ Section 5 -- Disclaimer of Warranties and Limitation of Liability.
317
+
318
+ a. UNLESS OTHERWISE SEPARATELY UNDERTAKEN BY THE LICENSOR, TO THE
319
+ EXTENT POSSIBLE, THE LICENSOR OFFERS THE LICENSED MATERIAL AS-IS
320
+ AND AS-AVAILABLE, AND MAKES NO REPRESENTATIONS OR WARRANTIES OF
321
+ ANY KIND CONCERNING THE LICENSED MATERIAL, WHETHER EXPRESS,
322
+ IMPLIED, STATUTORY, OR OTHER. THIS INCLUDES, WITHOUT LIMITATION,
323
+ WARRANTIES OF TITLE, MERCHANTABILITY, FITNESS FOR A PARTICULAR
324
+ PURPOSE, NON-INFRINGEMENT, ABSENCE OF LATENT OR OTHER DEFECTS,
325
+ ACCURACY, OR THE PRESENCE OR ABSENCE OF ERRORS, WHETHER OR NOT
326
+ KNOWN OR DISCOVERABLE. WHERE DISCLAIMERS OF WARRANTIES ARE NOT
327
+ ALLOWED IN FULL OR IN PART, THIS DISCLAIMER MAY NOT APPLY TO YOU.
328
+
329
+ b. TO THE EXTENT POSSIBLE, IN NO EVENT WILL THE LICENSOR BE LIABLE
330
+ TO YOU ON ANY LEGAL THEORY (INCLUDING, WITHOUT LIMITATION,
331
+ NEGLIGENCE) OR OTHERWISE FOR ANY DIRECT, SPECIAL, INDIRECT,
332
+ INCIDENTAL, CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES,
333
+ COSTS, EXPENSES, OR DAMAGES ARISING OUT OF THIS PUBLIC LICENSE OR
334
+ USE OF THE LICENSED MATERIAL, EVEN IF THE LICENSOR HAS BEEN
335
+ ADVISED OF THE POSSIBILITY OF SUCH LOSSES, COSTS, EXPENSES, OR
336
+ DAMAGES. WHERE A LIMITATION OF LIABILITY IS NOT ALLOWED IN FULL OR
337
+ IN PART, THIS LIMITATION MAY NOT APPLY TO YOU.
338
+
339
+ c. The disclaimer of warranties and limitation of liability provided
340
+ above shall be interpreted in a manner that, to the extent
341
+ possible, most closely approximates an absolute disclaimer and
342
+ waiver of all liability.
343
+
344
+
345
+ Section 6 -- Term and Termination.
346
+
347
+ a. This Public License applies for the term of the Copyright and
348
+ Similar Rights licensed here. However, if You fail to comply with
349
+ this Public License, then Your rights under this Public License
350
+ terminate automatically.
351
+
352
+ b. Where Your right to use the Licensed Material has terminated under
353
+ Section 6(a), it reinstates:
354
+
355
+ 1. automatically as of the date the violation is cured, provided
356
+ it is cured within 30 days of Your discovery of the
357
+ violation; or
358
+
359
+ 2. upon express reinstatement by the Licensor.
360
+
361
+ For the avoidance of doubt, this Section 6(b) does not affect any
362
+ right the Licensor may have to seek remedies for Your violations
363
+ of this Public License.
364
+
365
+ c. For the avoidance of doubt, the Licensor may also offer the
366
+ Licensed Material under separate terms or conditions or stop
367
+ distributing the Licensed Material at any time; however, doing so
368
+ will not terminate this Public License.
369
+
370
+ d. Sections 1, 5, 6, 7, and 8 survive termination of this Public
371
+ License.
372
+
373
+
374
+ Section 7 -- Other Terms and Conditions.
375
+
376
+ a. The Licensor shall not be bound by any additional or different
377
+ terms or conditions communicated by You unless expressly agreed.
378
+
379
+ b. Any arrangements, understandings, or agreements regarding the
380
+ Licensed Material not stated herein are separate from and
381
+ independent of the terms and conditions of this Public License.
382
+
383
+
384
+ Section 8 -- Interpretation.
385
+
386
+ a. For the avoidance of doubt, this Public License does not, and
387
+ shall not be interpreted to, reduce, limit, restrict, or impose
388
+ conditions on any use of the Licensed Material that could lawfully
389
+ be made without permission under this Public License.
390
+
391
+ b. To the extent possible, if any provision of this Public License is
392
+ deemed unenforceable, it shall be automatically reformed to the
393
+ minimum extent necessary to make it enforceable. If the provision
394
+ cannot be reformed, it shall be severed from this Public License
395
+ without affecting the enforceability of the remaining terms and
396
+ conditions.
397
+
398
+ c. No term or condition of this Public License will be waived and no
399
+ failure to comply consented to unless expressly agreed to by the
400
+ Licensor.
401
+
402
+ d. Nothing in this Public License constitutes or may be interpreted
403
+ as a limitation upon, or waiver of, any privileges and immunities
404
+ that apply to the Licensor or You, including from the legal
405
+ processes of any jurisdiction or authority.
406
+
407
+
408
+ =======================================================================
409
+
410
+ Creative Commons is not a party to its public
411
+ licenses. Notwithstanding, Creative Commons may elect to apply one of
412
+ its public licenses to material it publishes and in those instances
413
+ will be considered the “Licensor.” The text of the Creative Commons
414
+ public licenses is dedicated to the public domain under the CC0 Public
415
+ Domain Dedication. Except for the limited purpose of indicating that
416
+ material is shared under a Creative Commons public license or as
417
+ otherwise permitted by the Creative Commons policies published at
418
+ creativecommons.org/policies, Creative Commons does not authorize the
419
+ use of the trademark "Creative Commons" or any other trademark or logo
420
+ of Creative Commons without its prior written consent including,
421
+ without limitation, in connection with any unauthorized modifications
422
+ to any of its public licenses or any other arrangements,
423
+ understandings, or agreements concerning use of licensed material. For
424
+ the avoidance of doubt, this paragraph does not form part of the
425
+ public licenses.
426
+
427
+ Creative Commons may be contacted at creativecommons.org.
428
+
LICENSES_SOURCES ADDED
@@ -0,0 +1,1351 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Macedonian Corpus
2
+
3
+ * Author: Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska
4
+ * URL: https://blog.netcetera.com/macedonian-spacy-f3c85484777f
5
+ * License: CC BY-SA 4.0
6
+
7
+ ```
8
+ Attribution-ShareAlike 4.0 International
9
+
10
+ =======================================================================
11
+
12
+ Creative Commons Corporation ("Creative Commons") is not a law firm and
13
+ does not provide legal services or legal advice. Distribution of
14
+ Creative Commons public licenses does not create a lawyer-client or
15
+ other relationship. Creative Commons makes its licenses and related
16
+ information available on an "as-is" basis. Creative Commons gives no
17
+ warranties regarding its licenses, any material licensed under their
18
+ terms and conditions, or any related information. Creative Commons
19
+ disclaims all liability for damages resulting from their use to the
20
+ fullest extent possible.
21
+
22
+ Using Creative Commons Public Licenses
23
+
24
+ Creative Commons public licenses provide a standard set of terms and
25
+ conditions that creators and other rights holders may use to share
26
+ original works of authorship and other material subject to copyright
27
+ and certain other rights specified in the public license below. The
28
+ following considerations are for informational purposes only, are not
29
+ exhaustive, and do not form part of our licenses.
30
+
31
+ Considerations for licensors: Our public licenses are
32
+ intended for use by those authorized to give the public
33
+ permission to use material in ways otherwise restricted by
34
+ copyright and certain other rights. Our licenses are
35
+ irrevocable. Licensors should read and understand the terms
36
+ and conditions of the license they choose before applying it.
37
+ Licensors should also secure all rights necessary before
38
+ applying our licenses so that the public can reuse the
39
+ material as expected. Licensors should clearly mark any
40
+ material not subject to the license. This includes other CC-
41
+ licensed material, or material used under an exception or
42
+ limitation to copyright. More considerations for licensors:
43
+ wiki.creativecommons.org/Considerations_for_licensors
44
+
45
+ Considerations for the public: By using one of our public
46
+ licenses, a licensor grants the public permission to use the
47
+ licensed material under specified terms and conditions. If
48
+ the licensor's permission is not necessary for any reason--for
49
+ example, because of any applicable exception or limitation to
50
+ copyright--then that use is not regulated by the license. Our
51
+ licenses grant only permissions under copyright and certain
52
+ other rights that a licensor has authority to grant. Use of
53
+ the licensed material may still be restricted for other
54
+ reasons, including because others have copyright or other
55
+ rights in the material. A licensor may make special requests,
56
+ such as asking that all changes be marked or described.
57
+ Although not required by our licenses, you are encouraged to
58
+ respect those requests where reasonable. More considerations
59
+ for the public:
60
+ wiki.creativecommons.org/Considerations_for_licensees
61
+
62
+ =======================================================================
63
+
64
+ Creative Commons Attribution-ShareAlike 4.0 International Public
65
+ License
66
+
67
+ By exercising the Licensed Rights (defined below), You accept and agree
68
+ to be bound by the terms and conditions of this Creative Commons
69
+ Attribution-ShareAlike 4.0 International Public License ("Public
70
+ License"). To the extent this Public License may be interpreted as a
71
+ contract, You are granted the Licensed Rights in consideration of Your
72
+ acceptance of these terms and conditions, and the Licensor grants You
73
+ such rights in consideration of benefits the Licensor receives from
74
+ making the Licensed Material available under these terms and
75
+ conditions.
76
+
77
+
78
+ Section 1 -- Definitions.
79
+
80
+ a. Adapted Material means material subject to Copyright and Similar
81
+ Rights that is derived from or based upon the Licensed Material
82
+ and in which the Licensed Material is translated, altered,
83
+ arranged, transformed, or otherwise modified in a manner requiring
84
+ permission under the Copyright and Similar Rights held by the
85
+ Licensor. For purposes of this Public License, where the Licensed
86
+ Material is a musical work, performance, or sound recording,
87
+ Adapted Material is always produced where the Licensed Material is
88
+ synched in timed relation with a moving image.
89
+
90
+ b. Adapter's License means the license You apply to Your Copyright
91
+ and Similar Rights in Your contributions to Adapted Material in
92
+ accordance with the terms and conditions of this Public License.
93
+
94
+ c. BY-SA Compatible License means a license listed at
95
+ creativecommons.org/compatiblelicenses, approved by Creative
96
+ Commons as essentially the equivalent of this Public License.
97
+
98
+ d. Copyright and Similar Rights means copyright and/or similar rights
99
+ closely related to copyright including, without limitation,
100
+ performance, broadcast, sound recording, and Sui Generis Database
101
+ Rights, without regard to how the rights are labeled or
102
+ categorized. For purposes of this Public License, the rights
103
+ specified in Section 2(b)(1)-(2) are not Copyright and Similar
104
+ Rights.
105
+
106
+ e. Effective Technological Measures means those measures that, in the
107
+ absence of proper authority, may not be circumvented under laws
108
+ fulfilling obligations under Article 11 of the WIPO Copyright
109
+ Treaty adopted on December 20, 1996, and/or similar international
110
+ agreements.
111
+
112
+ f. Exceptions and Limitations means fair use, fair dealing, and/or
113
+ any other exception or limitation to Copyright and Similar Rights
114
+ that applies to Your use of the Licensed Material.
115
+
116
+ g. License Elements means the license attributes listed in the name
117
+ of a Creative Commons Public License. The License Elements of this
118
+ Public License are Attribution and ShareAlike.
119
+
120
+ h. Licensed Material means the artistic or literary work, database,
121
+ or other material to which the Licensor applied this Public
122
+ License.
123
+
124
+ i. Licensed Rights means the rights granted to You subject to the
125
+ terms and conditions of this Public License, which are limited to
126
+ all Copyright and Similar Rights that apply to Your use of the
127
+ Licensed Material and that the Licensor has authority to license.
128
+
129
+ j. Licensor means the individual(s) or entity(ies) granting rights
130
+ under this Public License.
131
+
132
+ k. Share means to provide material to the public by any means or
133
+ process that requires permission under the Licensed Rights, such
134
+ as reproduction, public display, public performance, distribution,
135
+ dissemination, communication, or importation, and to make material
136
+ available to the public including in ways that members of the
137
+ public may access the material from a place and at a time
138
+ individually chosen by them.
139
+
140
+ l. Sui Generis Database Rights means rights other than copyright
141
+ resulting from Directive 96/9/EC of the European Parliament and of
142
+ the Council of 11 March 1996 on the legal protection of databases,
143
+ as amended and/or succeeded, as well as other essentially
144
+ equivalent rights anywhere in the world.
145
+
146
+ m. You means the individual or entity exercising the Licensed Rights
147
+ under this Public License. Your has a corresponding meaning.
148
+
149
+
150
+ Section 2 -- Scope.
151
+
152
+ a. License grant.
153
+
154
+ 1. Subject to the terms and conditions of this Public License,
155
+ the Licensor hereby grants You a worldwide, royalty-free,
156
+ non-sublicensable, non-exclusive, irrevocable license to
157
+ exercise the Licensed Rights in the Licensed Material to:
158
+
159
+ a. reproduce and Share the Licensed Material, in whole or
160
+ in part; and
161
+
162
+ b. produce, reproduce, and Share Adapted Material.
163
+
164
+ 2. Exceptions and Limitations. For the avoidance of doubt, where
165
+ Exceptions and Limitations apply to Your use, this Public
166
+ License does not apply, and You do not need to comply with
167
+ its terms and conditions.
168
+
169
+ 3. Term. The term of this Public License is specified in Section
170
+ 6(a).
171
+
172
+ 4. Media and formats; technical modifications allowed. The
173
+ Licensor authorizes You to exercise the Licensed Rights in
174
+ all media and formats whether now known or hereafter created,
175
+ and to make technical modifications necessary to do so. The
176
+ Licensor waives and/or agrees not to assert any right or
177
+ authority to forbid You from making technical modifications
178
+ necessary to exercise the Licensed Rights, including
179
+ technical modifications necessary to circumvent Effective
180
+ Technological Measures. For purposes of this Public License,
181
+ simply making modifications authorized by this Section 2(a)
182
+ (4) never produces Adapted Material.
183
+
184
+ 5. Downstream recipients.
185
+
186
+ a. Offer from the Licensor -- Licensed Material. Every
187
+ recipient of the Licensed Material automatically
188
+ receives an offer from the Licensor to exercise the
189
+ Licensed Rights under the terms and conditions of this
190
+ Public License.
191
+
192
+ b. Additional offer from the Licensor -- Adapted Material.
193
+ Every recipient of Adapted Material from You
194
+ automatically receives an offer from the Licensor to
195
+ exercise the Licensed Rights in the Adapted Material
196
+ under the conditions of the Adapter's License You apply.
197
+
198
+ c. No downstream restrictions. You may not offer or impose
199
+ any additional or different terms or conditions on, or
200
+ apply any Effective Technological Measures to, the
201
+ Licensed Material if doing so restricts exercise of the
202
+ Licensed Rights by any recipient of the Licensed
203
+ Material.
204
+
205
+ 6. No endorsement. Nothing in this Public License constitutes or
206
+ may be construed as permission to assert or imply that You
207
+ are, or that Your use of the Licensed Material is, connected
208
+ with, or sponsored, endorsed, or granted official status by,
209
+ the Licensor or others designated to receive attribution as
210
+ provided in Section 3(a)(1)(A)(i).
211
+
212
+ b. Other rights.
213
+
214
+ 1. Moral rights, such as the right of integrity, are not
215
+ licensed under this Public License, nor are publicity,
216
+ privacy, and/or other similar personality rights; however, to
217
+ the extent possible, the Licensor waives and/or agrees not to
218
+ assert any such rights held by the Licensor to the limited
219
+ extent necessary to allow You to exercise the Licensed
220
+ Rights, but not otherwise.
221
+
222
+ 2. Patent and trademark rights are not licensed under this
223
+ Public License.
224
+
225
+ 3. To the extent possible, the Licensor waives any right to
226
+ collect royalties from You for the exercise of the Licensed
227
+ Rights, whether directly or through a collecting society
228
+ under any voluntary or waivable statutory or compulsory
229
+ licensing scheme. In all other cases the Licensor expressly
230
+ reserves any right to collect such royalties.
231
+
232
+
233
+ Section 3 -- License Conditions.
234
+
235
+ Your exercise of the Licensed Rights is expressly made subject to the
236
+ following conditions.
237
+
238
+ a. Attribution.
239
+
240
+ 1. If You Share the Licensed Material (including in modified
241
+ form), You must:
242
+
243
+ a. retain the following if it is supplied by the Licensor
244
+ with the Licensed Material:
245
+
246
+ i. identification of the creator(s) of the Licensed
247
+ Material and any others designated to receive
248
+ attribution, in any reasonable manner requested by
249
+ the Licensor (including by pseudonym if
250
+ designated);
251
+
252
+ ii. a copyright notice;
253
+
254
+ iii. a notice that refers to this Public License;
255
+
256
+ iv. a notice that refers to the disclaimer of
257
+ warranties;
258
+
259
+ v. a URI or hyperlink to the Licensed Material to the
260
+ extent reasonably practicable;
261
+
262
+ b. indicate if You modified the Licensed Material and
263
+ retain an indication of any previous modifications; and
264
+
265
+ c. indicate the Licensed Material is licensed under this
266
+ Public License, and include the text of, or the URI or
267
+ hyperlink to, this Public License.
268
+
269
+ 2. You may satisfy the conditions in Section 3(a)(1) in any
270
+ reasonable manner based on the medium, means, and context in
271
+ which You Share the Licensed Material. For example, it may be
272
+ reasonable to satisfy the conditions by providing a URI or
273
+ hyperlink to a resource that includes the required
274
+ information.
275
+
276
+ 3. If requested by the Licensor, You must remove any of the
277
+ information required by Section 3(a)(1)(A) to the extent
278
+ reasonably practicable.
279
+
280
+ b. ShareAlike.
281
+
282
+ In addition to the conditions in Section 3(a), if You Share
283
+ Adapted Material You produce, the following conditions also apply.
284
+
285
+ 1. The Adapter's License You apply must be a Creative Commons
286
+ license with the same License Elements, this version or
287
+ later, or a BY-SA Compatible License.
288
+
289
+ 2. You must include the text of, or the URI or hyperlink to, the
290
+ Adapter's License You apply. You may satisfy this condition
291
+ in any reasonable manner based on the medium, means, and
292
+ context in which You Share Adapted Material.
293
+
294
+ 3. You may not offer or impose any additional or different terms
295
+ or conditions on, or apply any Effective Technological
296
+ Measures to, Adapted Material that restrict exercise of the
297
+ rights granted under the Adapter's License You apply.
298
+
299
+
300
+ Section 4 -- Sui Generis Database Rights.
301
+
302
+ Where the Licensed Rights include Sui Generis Database Rights that
303
+ apply to Your use of the Licensed Material:
304
+
305
+ a. for the avoidance of doubt, Section 2(a)(1) grants You the right
306
+ to extract, reuse, reproduce, and Share all or a substantial
307
+ portion of the contents of the database;
308
+
309
+ b. if You include all or a substantial portion of the database
310
+ contents in a database in which You have Sui Generis Database
311
+ Rights, then the database in which You have Sui Generis Database
312
+ Rights (but not its individual contents) is Adapted Material,
313
+
314
+ including for purposes of Section 3(b); and
315
+ c. You must comply with the conditions in Section 3(a) if You Share
316
+ all or a substantial portion of the contents of the database.
317
+
318
+ For the avoidance of doubt, this Section 4 supplements and does not
319
+ replace Your obligations under this Public License where the Licensed
320
+ Rights include other Copyright and Similar Rights.
321
+
322
+
323
+ Section 5 -- Disclaimer of Warranties and Limitation of Liability.
324
+
325
+ a. UNLESS OTHERWISE SEPARATELY UNDERTAKEN BY THE LICENSOR, TO THE
326
+ EXTENT POSSIBLE, THE LICENSOR OFFERS THE LICENSED MATERIAL AS-IS
327
+ AND AS-AVAILABLE, AND MAKES NO REPRESENTATIONS OR WARRANTIES OF
328
+ ANY KIND CONCERNING THE LICENSED MATERIAL, WHETHER EXPRESS,
329
+ IMPLIED, STATUTORY, OR OTHER. THIS INCLUDES, WITHOUT LIMITATION,
330
+ WARRANTIES OF TITLE, MERCHANTABILITY, FITNESS FOR A PARTICULAR
331
+ PURPOSE, NON-INFRINGEMENT, ABSENCE OF LATENT OR OTHER DEFECTS,
332
+ ACCURACY, OR THE PRESENCE OR ABSENCE OF ERRORS, WHETHER OR NOT
333
+ KNOWN OR DISCOVERABLE. WHERE DISCLAIMERS OF WARRANTIES ARE NOT
334
+ ALLOWED IN FULL OR IN PART, THIS DISCLAIMER MAY NOT APPLY TO YOU.
335
+
336
+ b. TO THE EXTENT POSSIBLE, IN NO EVENT WILL THE LICENSOR BE LIABLE
337
+ TO YOU ON ANY LEGAL THEORY (INCLUDING, WITHOUT LIMITATION,
338
+ NEGLIGENCE) OR OTHERWISE FOR ANY DIRECT, SPECIAL, INDIRECT,
339
+ INCIDENTAL, CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES,
340
+ COSTS, EXPENSES, OR DAMAGES ARISING OUT OF THIS PUBLIC LICENSE OR
341
+ USE OF THE LICENSED MATERIAL, EVEN IF THE LICENSOR HAS BEEN
342
+ ADVISED OF THE POSSIBILITY OF SUCH LOSSES, COSTS, EXPENSES, OR
343
+ DAMAGES. WHERE A LIMITATION OF LIABILITY IS NOT ALLOWED IN FULL OR
344
+ IN PART, THIS LIMITATION MAY NOT APPLY TO YOU.
345
+
346
+ c. The disclaimer of warranties and limitation of liability provided
347
+ above shall be interpreted in a manner that, to the extent
348
+ possible, most closely approximates an absolute disclaimer and
349
+ waiver of all liability.
350
+
351
+
352
+ Section 6 -- Term and Termination.
353
+
354
+ a. This Public License applies for the term of the Copyright and
355
+ Similar Rights licensed here. However, if You fail to comply with
356
+ this Public License, then Your rights under this Public License
357
+ terminate automatically.
358
+
359
+ b. Where Your right to use the Licensed Material has terminated under
360
+ Section 6(a), it reinstates:
361
+
362
+ 1. automatically as of the date the violation is cured, provided
363
+ it is cured within 30 days of Your discovery of the
364
+ violation; or
365
+
366
+ 2. upon express reinstatement by the Licensor.
367
+
368
+ For the avoidance of doubt, this Section 6(b) does not affect any
369
+ right the Licensor may have to seek remedies for Your violations
370
+ of this Public License.
371
+
372
+ c. For the avoidance of doubt, the Licensor may also offer the
373
+ Licensed Material under separate terms or conditions or stop
374
+ distributing the Licensed Material at any time; however, doing so
375
+ will not terminate this Public License.
376
+
377
+ d. Sections 1, 5, 6, 7, and 8 survive termination of this Public
378
+ License.
379
+
380
+
381
+ Section 7 -- Other Terms and Conditions.
382
+
383
+ a. The Licensor shall not be bound by any additional or different
384
+ terms or conditions communicated by You unless expressly agreed.
385
+
386
+ b. Any arrangements, understandings, or agreements regarding the
387
+ Licensed Material not stated herein are separate from and
388
+ independent of the terms and conditions of this Public License.
389
+
390
+
391
+ Section 8 -- Interpretation.
392
+
393
+ a. For the avoidance of doubt, this Public License does not, and
394
+ shall not be interpreted to, reduce, limit, restrict, or impose
395
+ conditions on any use of the Licensed Material that could lawfully
396
+ be made without permission under this Public License.
397
+
398
+ b. To the extent possible, if any provision of this Public License is
399
+ deemed unenforceable, it shall be automatically reformed to the
400
+ minimum extent necessary to make it enforceable. If the provision
401
+ cannot be reformed, it shall be severed from this Public License
402
+ without affecting the enforceability of the remaining terms and
403
+ conditions.
404
+
405
+ c. No term or condition of this Public License will be waived and no
406
+ failure to comply consented to unless expressly agreed to by the
407
+ Licensor.
408
+
409
+ d. Nothing in this Public License constitutes or may be interpreted
410
+ as a limitation upon, or waiver of, any privileges and immunities
411
+ that apply to the Licensor or You, including from the legal
412
+ processes of any jurisdiction or authority.
413
+
414
+
415
+ =======================================================================
416
+
417
+ Creative Commons is not a party to its public
418
+ licenses. Notwithstanding, Creative Commons may elect to apply one of
419
+ its public licenses to material it publishes and in those instances
420
+ will be considered the “Licensor.” The text of the Creative Commons
421
+ public licenses is dedicated to the public domain under the CC0 Public
422
+ Domain Dedication. Except for the limited purpose of indicating that
423
+ material is shared under a Creative Commons public license or as
424
+ otherwise permitted by the Creative Commons policies published at
425
+ creativecommons.org/policies, Creative Commons does not authorize the
426
+ use of the trademark "Creative Commons" or any other trademark or logo
427
+ of Creative Commons without its prior written consent including,
428
+ without limitation, in connection with any unauthorized modifications
429
+ to any of its public licenses or any other arrangements,
430
+ understandings, or agreements concerning use of licensed material. For
431
+ the avoidance of doubt, this paragraph does not form part of the
432
+ public licenses.
433
+
434
+ Creative Commons may be contacted at creativecommons.org.
435
+
436
+ ```
437
+
438
+
439
+
440
+
441
+ # Macedonian Corpus
442
+
443
+ * Author: Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska
444
+ * URL: https://blog.netcetera.com/macedonian-spacy-f3c85484777f
445
+ * License: CC BY-SA 4.0
446
+
447
+ ```
448
+ Attribution-ShareAlike 4.0 International
449
+
450
+ =======================================================================
451
+
452
+ Creative Commons Corporation ("Creative Commons") is not a law firm and
453
+ does not provide legal services or legal advice. Distribution of
454
+ Creative Commons public licenses does not create a lawyer-client or
455
+ other relationship. Creative Commons makes its licenses and related
456
+ information available on an "as-is" basis. Creative Commons gives no
457
+ warranties regarding its licenses, any material licensed under their
458
+ terms and conditions, or any related information. Creative Commons
459
+ disclaims all liability for damages resulting from their use to the
460
+ fullest extent possible.
461
+
462
+ Using Creative Commons Public Licenses
463
+
464
+ Creative Commons public licenses provide a standard set of terms and
465
+ conditions that creators and other rights holders may use to share
466
+ original works of authorship and other material subject to copyright
467
+ and certain other rights specified in the public license below. The
468
+ following considerations are for informational purposes only, are not
469
+ exhaustive, and do not form part of our licenses.
470
+
471
+ Considerations for licensors: Our public licenses are
472
+ intended for use by those authorized to give the public
473
+ permission to use material in ways otherwise restricted by
474
+ copyright and certain other rights. Our licenses are
475
+ irrevocable. Licensors should read and understand the terms
476
+ and conditions of the license they choose before applying it.
477
+ Licensors should also secure all rights necessary before
478
+ applying our licenses so that the public can reuse the
479
+ material as expected. Licensors should clearly mark any
480
+ material not subject to the license. This includes other CC-
481
+ licensed material, or material used under an exception or
482
+ limitation to copyright. More considerations for licensors:
483
+ wiki.creativecommons.org/Considerations_for_licensors
484
+
485
+ Considerations for the public: By using one of our public
486
+ licenses, a licensor grants the public permission to use the
487
+ licensed material under specified terms and conditions. If
488
+ the licensor's permission is not necessary for any reason--for
489
+ example, because of any applicable exception or limitation to
490
+ copyright--then that use is not regulated by the license. Our
491
+ licenses grant only permissions under copyright and certain
492
+ other rights that a licensor has authority to grant. Use of
493
+ the licensed material may still be restricted for other
494
+ reasons, including because others have copyright or other
495
+ rights in the material. A licensor may make special requests,
496
+ such as asking that all changes be marked or described.
497
+ Although not required by our licenses, you are encouraged to
498
+ respect those requests where reasonable. More considerations
499
+ for the public:
500
+ wiki.creativecommons.org/Considerations_for_licensees
501
+
502
+ =======================================================================
503
+
504
+ Creative Commons Attribution-ShareAlike 4.0 International Public
505
+ License
506
+
507
+ By exercising the Licensed Rights (defined below), You accept and agree
508
+ to be bound by the terms and conditions of this Creative Commons
509
+ Attribution-ShareAlike 4.0 International Public License ("Public
510
+ License"). To the extent this Public License may be interpreted as a
511
+ contract, You are granted the Licensed Rights in consideration of Your
512
+ acceptance of these terms and conditions, and the Licensor grants You
513
+ such rights in consideration of benefits the Licensor receives from
514
+ making the Licensed Material available under these terms and
515
+ conditions.
516
+
517
+
518
+ Section 1 -- Definitions.
519
+
520
+ a. Adapted Material means material subject to Copyright and Similar
521
+ Rights that is derived from or based upon the Licensed Material
522
+ and in which the Licensed Material is translated, altered,
523
+ arranged, transformed, or otherwise modified in a manner requiring
524
+ permission under the Copyright and Similar Rights held by the
525
+ Licensor. For purposes of this Public License, where the Licensed
526
+ Material is a musical work, performance, or sound recording,
527
+ Adapted Material is always produced where the Licensed Material is
528
+ synched in timed relation with a moving image.
529
+
530
+ b. Adapter's License means the license You apply to Your Copyright
531
+ and Similar Rights in Your contributions to Adapted Material in
532
+ accordance with the terms and conditions of this Public License.
533
+
534
+ c. BY-SA Compatible License means a license listed at
535
+ creativecommons.org/compatiblelicenses, approved by Creative
536
+ Commons as essentially the equivalent of this Public License.
537
+
538
+ d. Copyright and Similar Rights means copyright and/or similar rights
539
+ closely related to copyright including, without limitation,
540
+ performance, broadcast, sound recording, and Sui Generis Database
541
+ Rights, without regard to how the rights are labeled or
542
+ categorized. For purposes of this Public License, the rights
543
+ specified in Section 2(b)(1)-(2) are not Copyright and Similar
544
+ Rights.
545
+
546
+ e. Effective Technological Measures means those measures that, in the
547
+ absence of proper authority, may not be circumvented under laws
548
+ fulfilling obligations under Article 11 of the WIPO Copyright
549
+ Treaty adopted on December 20, 1996, and/or similar international
550
+ agreements.
551
+
552
+ f. Exceptions and Limitations means fair use, fair dealing, and/or
553
+ any other exception or limitation to Copyright and Similar Rights
554
+ that applies to Your use of the Licensed Material.
555
+
556
+ g. License Elements means the license attributes listed in the name
557
+ of a Creative Commons Public License. The License Elements of this
558
+ Public License are Attribution and ShareAlike.
559
+
560
+ h. Licensed Material means the artistic or literary work, database,
561
+ or other material to which the Licensor applied this Public
562
+ License.
563
+
564
+ i. Licensed Rights means the rights granted to You subject to the
565
+ terms and conditions of this Public License, which are limited to
566
+ all Copyright and Similar Rights that apply to Your use of the
567
+ Licensed Material and that the Licensor has authority to license.
568
+
569
+ j. Licensor means the individual(s) or entity(ies) granting rights
570
+ under this Public License.
571
+
572
+ k. Share means to provide material to the public by any means or
573
+ process that requires permission under the Licensed Rights, such
574
+ as reproduction, public display, public performance, distribution,
575
+ dissemination, communication, or importation, and to make material
576
+ available to the public including in ways that members of the
577
+ public may access the material from a place and at a time
578
+ individually chosen by them.
579
+
580
+ l. Sui Generis Database Rights means rights other than copyright
581
+ resulting from Directive 96/9/EC of the European Parliament and of
582
+ the Council of 11 March 1996 on the legal protection of databases,
583
+ as amended and/or succeeded, as well as other essentially
584
+ equivalent rights anywhere in the world.
585
+
586
+ m. You means the individual or entity exercising the Licensed Rights
587
+ under this Public License. Your has a corresponding meaning.
588
+
589
+
590
+ Section 2 -- Scope.
591
+
592
+ a. License grant.
593
+
594
+ 1. Subject to the terms and conditions of this Public License,
595
+ the Licensor hereby grants You a worldwide, royalty-free,
596
+ non-sublicensable, non-exclusive, irrevocable license to
597
+ exercise the Licensed Rights in the Licensed Material to:
598
+
599
+ a. reproduce and Share the Licensed Material, in whole or
600
+ in part; and
601
+
602
+ b. produce, reproduce, and Share Adapted Material.
603
+
604
+ 2. Exceptions and Limitations. For the avoidance of doubt, where
605
+ Exceptions and Limitations apply to Your use, this Public
606
+ License does not apply, and You do not need to comply with
607
+ its terms and conditions.
608
+
609
+ 3. Term. The term of this Public License is specified in Section
610
+ 6(a).
611
+
612
+ 4. Media and formats; technical modifications allowed. The
613
+ Licensor authorizes You to exercise the Licensed Rights in
614
+ all media and formats whether now known or hereafter created,
615
+ and to make technical modifications necessary to do so. The
616
+ Licensor waives and/or agrees not to assert any right or
617
+ authority to forbid You from making technical modifications
618
+ necessary to exercise the Licensed Rights, including
619
+ technical modifications necessary to circumvent Effective
620
+ Technological Measures. For purposes of this Public License,
621
+ simply making modifications authorized by this Section 2(a)
622
+ (4) never produces Adapted Material.
623
+
624
+ 5. Downstream recipients.
625
+
626
+ a. Offer from the Licensor -- Licensed Material. Every
627
+ recipient of the Licensed Material automatically
628
+ receives an offer from the Licensor to exercise the
629
+ Licensed Rights under the terms and conditions of this
630
+ Public License.
631
+
632
+ b. Additional offer from the Licensor -- Adapted Material.
633
+ Every recipient of Adapted Material from You
634
+ automatically receives an offer from the Licensor to
635
+ exercise the Licensed Rights in the Adapted Material
636
+ under the conditions of the Adapter's License You apply.
637
+
638
+ c. No downstream restrictions. You may not offer or impose
639
+ any additional or different terms or conditions on, or
640
+ apply any Effective Technological Measures to, the
641
+ Licensed Material if doing so restricts exercise of the
642
+ Licensed Rights by any recipient of the Licensed
643
+ Material.
644
+
645
+ 6. No endorsement. Nothing in this Public License constitutes or
646
+ may be construed as permission to assert or imply that You
647
+ are, or that Your use of the Licensed Material is, connected
648
+ with, or sponsored, endorsed, or granted official status by,
649
+ the Licensor or others designated to receive attribution as
650
+ provided in Section 3(a)(1)(A)(i).
651
+
652
+ b. Other rights.
653
+
654
+ 1. Moral rights, such as the right of integrity, are not
655
+ licensed under this Public License, nor are publicity,
656
+ privacy, and/or other similar personality rights; however, to
657
+ the extent possible, the Licensor waives and/or agrees not to
658
+ assert any such rights held by the Licensor to the limited
659
+ extent necessary to allow You to exercise the Licensed
660
+ Rights, but not otherwise.
661
+
662
+ 2. Patent and trademark rights are not licensed under this
663
+ Public License.
664
+
665
+ 3. To the extent possible, the Licensor waives any right to
666
+ collect royalties from You for the exercise of the Licensed
667
+ Rights, whether directly or through a collecting society
668
+ under any voluntary or waivable statutory or compulsory
669
+ licensing scheme. In all other cases the Licensor expressly
670
+ reserves any right to collect such royalties.
671
+
672
+
673
+ Section 3 -- License Conditions.
674
+
675
+ Your exercise of the Licensed Rights is expressly made subject to the
676
+ following conditions.
677
+
678
+ a. Attribution.
679
+
680
+ 1. If You Share the Licensed Material (including in modified
681
+ form), You must:
682
+
683
+ a. retain the following if it is supplied by the Licensor
684
+ with the Licensed Material:
685
+
686
+ i. identification of the creator(s) of the Licensed
687
+ Material and any others designated to receive
688
+ attribution, in any reasonable manner requested by
689
+ the Licensor (including by pseudonym if
690
+ designated);
691
+
692
+ ii. a copyright notice;
693
+
694
+ iii. a notice that refers to this Public License;
695
+
696
+ iv. a notice that refers to the disclaimer of
697
+ warranties;
698
+
699
+ v. a URI or hyperlink to the Licensed Material to the
700
+ extent reasonably practicable;
701
+
702
+ b. indicate if You modified the Licensed Material and
703
+ retain an indication of any previous modifications; and
704
+
705
+ c. indicate the Licensed Material is licensed under this
706
+ Public License, and include the text of, or the URI or
707
+ hyperlink to, this Public License.
708
+
709
+ 2. You may satisfy the conditions in Section 3(a)(1) in any
710
+ reasonable manner based on the medium, means, and context in
711
+ which You Share the Licensed Material. For example, it may be
712
+ reasonable to satisfy the conditions by providing a URI or
713
+ hyperlink to a resource that includes the required
714
+ information.
715
+
716
+ 3. If requested by the Licensor, You must remove any of the
717
+ information required by Section 3(a)(1)(A) to the extent
718
+ reasonably practicable.
719
+
720
+ b. ShareAlike.
721
+
722
+ In addition to the conditions in Section 3(a), if You Share
723
+ Adapted Material You produce, the following conditions also apply.
724
+
725
+ 1. The Adapter's License You apply must be a Creative Commons
726
+ license with the same License Elements, this version or
727
+ later, or a BY-SA Compatible License.
728
+
729
+ 2. You must include the text of, or the URI or hyperlink to, the
730
+ Adapter's License You apply. You may satisfy this condition
731
+ in any reasonable manner based on the medium, means, and
732
+ context in which You Share Adapted Material.
733
+
734
+ 3. You may not offer or impose any additional or different terms
735
+ or conditions on, or apply any Effective Technological
736
+ Measures to, Adapted Material that restrict exercise of the
737
+ rights granted under the Adapter's License You apply.
738
+
739
+
740
+ Section 4 -- Sui Generis Database Rights.
741
+
742
+ Where the Licensed Rights include Sui Generis Database Rights that
743
+ apply to Your use of the Licensed Material:
744
+
745
+ a. for the avoidance of doubt, Section 2(a)(1) grants You the right
746
+ to extract, reuse, reproduce, and Share all or a substantial
747
+ portion of the contents of the database;
748
+
749
+ b. if You include all or a substantial portion of the database
750
+ contents in a database in which You have Sui Generis Database
751
+ Rights, then the database in which You have Sui Generis Database
752
+ Rights (but not its individual contents) is Adapted Material,
753
+
754
+ including for purposes of Section 3(b); and
755
+ c. You must comply with the conditions in Section 3(a) if You Share
756
+ all or a substantial portion of the contents of the database.
757
+
758
+ For the avoidance of doubt, this Section 4 supplements and does not
759
+ replace Your obligations under this Public License where the Licensed
760
+ Rights include other Copyright and Similar Rights.
761
+
762
+
763
+ Section 5 -- Disclaimer of Warranties and Limitation of Liability.
764
+
765
+ a. UNLESS OTHERWISE SEPARATELY UNDERTAKEN BY THE LICENSOR, TO THE
766
+ EXTENT POSSIBLE, THE LICENSOR OFFERS THE LICENSED MATERIAL AS-IS
767
+ AND AS-AVAILABLE, AND MAKES NO REPRESENTATIONS OR WARRANTIES OF
768
+ ANY KIND CONCERNING THE LICENSED MATERIAL, WHETHER EXPRESS,
769
+ IMPLIED, STATUTORY, OR OTHER. THIS INCLUDES, WITHOUT LIMITATION,
770
+ WARRANTIES OF TITLE, MERCHANTABILITY, FITNESS FOR A PARTICULAR
771
+ PURPOSE, NON-INFRINGEMENT, ABSENCE OF LATENT OR OTHER DEFECTS,
772
+ ACCURACY, OR THE PRESENCE OR ABSENCE OF ERRORS, WHETHER OR NOT
773
+ KNOWN OR DISCOVERABLE. WHERE DISCLAIMERS OF WARRANTIES ARE NOT
774
+ ALLOWED IN FULL OR IN PART, THIS DISCLAIMER MAY NOT APPLY TO YOU.
775
+
776
+ b. TO THE EXTENT POSSIBLE, IN NO EVENT WILL THE LICENSOR BE LIABLE
777
+ TO YOU ON ANY LEGAL THEORY (INCLUDING, WITHOUT LIMITATION,
778
+ NEGLIGENCE) OR OTHERWISE FOR ANY DIRECT, SPECIAL, INDIRECT,
779
+ INCIDENTAL, CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES,
780
+ COSTS, EXPENSES, OR DAMAGES ARISING OUT OF THIS PUBLIC LICENSE OR
781
+ USE OF THE LICENSED MATERIAL, EVEN IF THE LICENSOR HAS BEEN
782
+ ADVISED OF THE POSSIBILITY OF SUCH LOSSES, COSTS, EXPENSES, OR
783
+ DAMAGES. WHERE A LIMITATION OF LIABILITY IS NOT ALLOWED IN FULL OR
784
+ IN PART, THIS LIMITATION MAY NOT APPLY TO YOU.
785
+
786
+ c. The disclaimer of warranties and limitation of liability provided
787
+ above shall be interpreted in a manner that, to the extent
788
+ possible, most closely approximates an absolute disclaimer and
789
+ waiver of all liability.
790
+
791
+
792
+ Section 6 -- Term and Termination.
793
+
794
+ a. This Public License applies for the term of the Copyright and
795
+ Similar Rights licensed here. However, if You fail to comply with
796
+ this Public License, then Your rights under this Public License
797
+ terminate automatically.
798
+
799
+ b. Where Your right to use the Licensed Material has terminated under
800
+ Section 6(a), it reinstates:
801
+
802
+ 1. automatically as of the date the violation is cured, provided
803
+ it is cured within 30 days of Your discovery of the
804
+ violation; or
805
+
806
+ 2. upon express reinstatement by the Licensor.
807
+
808
+ For the avoidance of doubt, this Section 6(b) does not affect any
809
+ right the Licensor may have to seek remedies for Your violations
810
+ of this Public License.
811
+
812
+ c. For the avoidance of doubt, the Licensor may also offer the
813
+ Licensed Material under separate terms or conditions or stop
814
+ distributing the Licensed Material at any time; however, doing so
815
+ will not terminate this Public License.
816
+
817
+ d. Sections 1, 5, 6, 7, and 8 survive termination of this Public
818
+ License.
819
+
820
+
821
+ Section 7 -- Other Terms and Conditions.
822
+
823
+ a. The Licensor shall not be bound by any additional or different
824
+ terms or conditions communicated by You unless expressly agreed.
825
+
826
+ b. Any arrangements, understandings, or agreements regarding the
827
+ Licensed Material not stated herein are separate from and
828
+ independent of the terms and conditions of this Public License.
829
+
830
+
831
+ Section 8 -- Interpretation.
832
+
833
+ a. For the avoidance of doubt, this Public License does not, and
834
+ shall not be interpreted to, reduce, limit, restrict, or impose
835
+ conditions on any use of the Licensed Material that could lawfully
836
+ be made without permission under this Public License.
837
+
838
+ b. To the extent possible, if any provision of this Public License is
839
+ deemed unenforceable, it shall be automatically reformed to the
840
+ minimum extent necessary to make it enforceable. If the provision
841
+ cannot be reformed, it shall be severed from this Public License
842
+ without affecting the enforceability of the remaining terms and
843
+ conditions.
844
+
845
+ c. No term or condition of this Public License will be waived and no
846
+ failure to comply consented to unless expressly agreed to by the
847
+ Licensor.
848
+
849
+ d. Nothing in this Public License constitutes or may be interpreted
850
+ as a limitation upon, or waiver of, any privileges and immunities
851
+ that apply to the Licensor or You, including from the legal
852
+ processes of any jurisdiction or authority.
853
+
854
+
855
+ =======================================================================
856
+
857
+ Creative Commons is not a party to its public
858
+ licenses. Notwithstanding, Creative Commons may elect to apply one of
859
+ its public licenses to material it publishes and in those instances
860
+ will be considered the “Licensor.” The text of the Creative Commons
861
+ public licenses is dedicated to the public domain under the CC0 Public
862
+ Domain Dedication. Except for the limited purpose of indicating that
863
+ material is shared under a Creative Commons public license or as
864
+ otherwise permitted by the Creative Commons policies published at
865
+ creativecommons.org/policies, Creative Commons does not authorize the
866
+ use of the trademark "Creative Commons" or any other trademark or logo
867
+ of Creative Commons without its prior written consent including,
868
+ without limitation, in connection with any unauthorized modifications
869
+ to any of its public licenses or any other arrangements,
870
+ understandings, or agreements concerning use of licensed material. For
871
+ the avoidance of doubt, this paragraph does not form part of the
872
+ public licenses.
873
+
874
+ Creative Commons may be contacted at creativecommons.org.
875
+
876
+ ```
877
+
878
+
879
+
880
+
881
+ # Macedonian Corpus
882
+
883
+ * Author: Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska
884
+ * URL: https://blog.netcetera.com/macedonian-spacy-f3c85484777f
885
+ * License: CC BY-SA 4.0
886
+
887
+ ```
888
+ Attribution-ShareAlike 4.0 International
889
+
890
+ =======================================================================
891
+
892
+ Creative Commons Corporation ("Creative Commons") is not a law firm and
893
+ does not provide legal services or legal advice. Distribution of
894
+ Creative Commons public licenses does not create a lawyer-client or
895
+ other relationship. Creative Commons makes its licenses and related
896
+ information available on an "as-is" basis. Creative Commons gives no
897
+ warranties regarding its licenses, any material licensed under their
898
+ terms and conditions, or any related information. Creative Commons
899
+ disclaims all liability for damages resulting from their use to the
900
+ fullest extent possible.
901
+
902
+ Using Creative Commons Public Licenses
903
+
904
+ Creative Commons public licenses provide a standard set of terms and
905
+ conditions that creators and other rights holders may use to share
906
+ original works of authorship and other material subject to copyright
907
+ and certain other rights specified in the public license below. The
908
+ following considerations are for informational purposes only, are not
909
+ exhaustive, and do not form part of our licenses.
910
+
911
+ Considerations for licensors: Our public licenses are
912
+ intended for use by those authorized to give the public
913
+ permission to use material in ways otherwise restricted by
914
+ copyright and certain other rights. Our licenses are
915
+ irrevocable. Licensors should read and understand the terms
916
+ and conditions of the license they choose before applying it.
917
+ Licensors should also secure all rights necessary before
918
+ applying our licenses so that the public can reuse the
919
+ material as expected. Licensors should clearly mark any
920
+ material not subject to the license. This includes other CC-
921
+ licensed material, or material used under an exception or
922
+ limitation to copyright. More considerations for licensors:
923
+ wiki.creativecommons.org/Considerations_for_licensors
924
+
925
+ Considerations for the public: By using one of our public
926
+ licenses, a licensor grants the public permission to use the
927
+ licensed material under specified terms and conditions. If
928
+ the licensor's permission is not necessary for any reason--for
929
+ example, because of any applicable exception or limitation to
930
+ copyright--then that use is not regulated by the license. Our
931
+ licenses grant only permissions under copyright and certain
932
+ other rights that a licensor has authority to grant. Use of
933
+ the licensed material may still be restricted for other
934
+ reasons, including because others have copyright or other
935
+ rights in the material. A licensor may make special requests,
936
+ such as asking that all changes be marked or described.
937
+ Although not required by our licenses, you are encouraged to
938
+ respect those requests where reasonable. More considerations
939
+ for the public:
940
+ wiki.creativecommons.org/Considerations_for_licensees
941
+
942
+ =======================================================================
943
+
944
+ Creative Commons Attribution-ShareAlike 4.0 International Public
945
+ License
946
+
947
+ By exercising the Licensed Rights (defined below), You accept and agree
948
+ to be bound by the terms and conditions of this Creative Commons
949
+ Attribution-ShareAlike 4.0 International Public License ("Public
950
+ License"). To the extent this Public License may be interpreted as a
951
+ contract, You are granted the Licensed Rights in consideration of Your
952
+ acceptance of these terms and conditions, and the Licensor grants You
953
+ such rights in consideration of benefits the Licensor receives from
954
+ making the Licensed Material available under these terms and
955
+ conditions.
956
+
957
+
958
+ Section 1 -- Definitions.
959
+
960
+ a. Adapted Material means material subject to Copyright and Similar
961
+ Rights that is derived from or based upon the Licensed Material
962
+ and in which the Licensed Material is translated, altered,
963
+ arranged, transformed, or otherwise modified in a manner requiring
964
+ permission under the Copyright and Similar Rights held by the
965
+ Licensor. For purposes of this Public License, where the Licensed
966
+ Material is a musical work, performance, or sound recording,
967
+ Adapted Material is always produced where the Licensed Material is
968
+ synched in timed relation with a moving image.
969
+
970
+ b. Adapter's License means the license You apply to Your Copyright
971
+ and Similar Rights in Your contributions to Adapted Material in
972
+ accordance with the terms and conditions of this Public License.
973
+
974
+ c. BY-SA Compatible License means a license listed at
975
+ creativecommons.org/compatiblelicenses, approved by Creative
976
+ Commons as essentially the equivalent of this Public License.
977
+
978
+ d. Copyright and Similar Rights means copyright and/or similar rights
979
+ closely related to copyright including, without limitation,
980
+ performance, broadcast, sound recording, and Sui Generis Database
981
+ Rights, without regard to how the rights are labeled or
982
+ categorized. For purposes of this Public License, the rights
983
+ specified in Section 2(b)(1)-(2) are not Copyright and Similar
984
+ Rights.
985
+
986
+ e. Effective Technological Measures means those measures that, in the
987
+ absence of proper authority, may not be circumvented under laws
988
+ fulfilling obligations under Article 11 of the WIPO Copyright
989
+ Treaty adopted on December 20, 1996, and/or similar international
990
+ agreements.
991
+
992
+ f. Exceptions and Limitations means fair use, fair dealing, and/or
993
+ any other exception or limitation to Copyright and Similar Rights
994
+ that applies to Your use of the Licensed Material.
995
+
996
+ g. License Elements means the license attributes listed in the name
997
+ of a Creative Commons Public License. The License Elements of this
998
+ Public License are Attribution and ShareAlike.
999
+
1000
+ h. Licensed Material means the artistic or literary work, database,
1001
+ or other material to which the Licensor applied this Public
1002
+ License.
1003
+
1004
+ i. Licensed Rights means the rights granted to You subject to the
1005
+ terms and conditions of this Public License, which are limited to
1006
+ all Copyright and Similar Rights that apply to Your use of the
1007
+ Licensed Material and that the Licensor has authority to license.
1008
+
1009
+ j. Licensor means the individual(s) or entity(ies) granting rights
1010
+ under this Public License.
1011
+
1012
+ k. Share means to provide material to the public by any means or
1013
+ process that requires permission under the Licensed Rights, such
1014
+ as reproduction, public display, public performance, distribution,
1015
+ dissemination, communication, or importation, and to make material
1016
+ available to the public including in ways that members of the
1017
+ public may access the material from a place and at a time
1018
+ individually chosen by them.
1019
+
1020
+ l. Sui Generis Database Rights means rights other than copyright
1021
+ resulting from Directive 96/9/EC of the European Parliament and of
1022
+ the Council of 11 March 1996 on the legal protection of databases,
1023
+ as amended and/or succeeded, as well as other essentially
1024
+ equivalent rights anywhere in the world.
1025
+
1026
+ m. You means the individual or entity exercising the Licensed Rights
1027
+ under this Public License. Your has a corresponding meaning.
1028
+
1029
+
1030
+ Section 2 -- Scope.
1031
+
1032
+ a. License grant.
1033
+
1034
+ 1. Subject to the terms and conditions of this Public License,
1035
+ the Licensor hereby grants You a worldwide, royalty-free,
1036
+ non-sublicensable, non-exclusive, irrevocable license to
1037
+ exercise the Licensed Rights in the Licensed Material to:
1038
+
1039
+ a. reproduce and Share the Licensed Material, in whole or
1040
+ in part; and
1041
+
1042
+ b. produce, reproduce, and Share Adapted Material.
1043
+
1044
+ 2. Exceptions and Limitations. For the avoidance of doubt, where
1045
+ Exceptions and Limitations apply to Your use, this Public
1046
+ License does not apply, and You do not need to comply with
1047
+ its terms and conditions.
1048
+
1049
+ 3. Term. The term of this Public License is specified in Section
1050
+ 6(a).
1051
+
1052
+ 4. Media and formats; technical modifications allowed. The
1053
+ Licensor authorizes You to exercise the Licensed Rights in
1054
+ all media and formats whether now known or hereafter created,
1055
+ and to make technical modifications necessary to do so. The
1056
+ Licensor waives and/or agrees not to assert any right or
1057
+ authority to forbid You from making technical modifications
1058
+ necessary to exercise the Licensed Rights, including
1059
+ technical modifications necessary to circumvent Effective
1060
+ Technological Measures. For purposes of this Public License,
1061
+ simply making modifications authorized by this Section 2(a)
1062
+ (4) never produces Adapted Material.
1063
+
1064
+ 5. Downstream recipients.
1065
+
1066
+ a. Offer from the Licensor -- Licensed Material. Every
1067
+ recipient of the Licensed Material automatically
1068
+ receives an offer from the Licensor to exercise the
1069
+ Licensed Rights under the terms and conditions of this
1070
+ Public License.
1071
+
1072
+ b. Additional offer from the Licensor -- Adapted Material.
1073
+ Every recipient of Adapted Material from You
1074
+ automatically receives an offer from the Licensor to
1075
+ exercise the Licensed Rights in the Adapted Material
1076
+ under the conditions of the Adapter's License You apply.
1077
+
1078
+ c. No downstream restrictions. You may not offer or impose
1079
+ any additional or different terms or conditions on, or
1080
+ apply any Effective Technological Measures to, the
1081
+ Licensed Material if doing so restricts exercise of the
1082
+ Licensed Rights by any recipient of the Licensed
1083
+ Material.
1084
+
1085
+ 6. No endorsement. Nothing in this Public License constitutes or
1086
+ may be construed as permission to assert or imply that You
1087
+ are, or that Your use of the Licensed Material is, connected
1088
+ with, or sponsored, endorsed, or granted official status by,
1089
+ the Licensor or others designated to receive attribution as
1090
+ provided in Section 3(a)(1)(A)(i).
1091
+
1092
+ b. Other rights.
1093
+
1094
+ 1. Moral rights, such as the right of integrity, are not
1095
+ licensed under this Public License, nor are publicity,
1096
+ privacy, and/or other similar personality rights; however, to
1097
+ the extent possible, the Licensor waives and/or agrees not to
1098
+ assert any such rights held by the Licensor to the limited
1099
+ extent necessary to allow You to exercise the Licensed
1100
+ Rights, but not otherwise.
1101
+
1102
+ 2. Patent and trademark rights are not licensed under this
1103
+ Public License.
1104
+
1105
+ 3. To the extent possible, the Licensor waives any right to
1106
+ collect royalties from You for the exercise of the Licensed
1107
+ Rights, whether directly or through a collecting society
1108
+ under any voluntary or waivable statutory or compulsory
1109
+ licensing scheme. In all other cases the Licensor expressly
1110
+ reserves any right to collect such royalties.
1111
+
1112
+
1113
+ Section 3 -- License Conditions.
1114
+
1115
+ Your exercise of the Licensed Rights is expressly made subject to the
1116
+ following conditions.
1117
+
1118
+ a. Attribution.
1119
+
1120
+ 1. If You Share the Licensed Material (including in modified
1121
+ form), You must:
1122
+
1123
+ a. retain the following if it is supplied by the Licensor
1124
+ with the Licensed Material:
1125
+
1126
+ i. identification of the creator(s) of the Licensed
1127
+ Material and any others designated to receive
1128
+ attribution, in any reasonable manner requested by
1129
+ the Licensor (including by pseudonym if
1130
+ designated);
1131
+
1132
+ ii. a copyright notice;
1133
+
1134
+ iii. a notice that refers to this Public License;
1135
+
1136
+ iv. a notice that refers to the disclaimer of
1137
+ warranties;
1138
+
1139
+ v. a URI or hyperlink to the Licensed Material to the
1140
+ extent reasonably practicable;
1141
+
1142
+ b. indicate if You modified the Licensed Material and
1143
+ retain an indication of any previous modifications; and
1144
+
1145
+ c. indicate the Licensed Material is licensed under this
1146
+ Public License, and include the text of, or the URI or
1147
+ hyperlink to, this Public License.
1148
+
1149
+ 2. You may satisfy the conditions in Section 3(a)(1) in any
1150
+ reasonable manner based on the medium, means, and context in
1151
+ which You Share the Licensed Material. For example, it may be
1152
+ reasonable to satisfy the conditions by providing a URI or
1153
+ hyperlink to a resource that includes the required
1154
+ information.
1155
+
1156
+ 3. If requested by the Licensor, You must remove any of the
1157
+ information required by Section 3(a)(1)(A) to the extent
1158
+ reasonably practicable.
1159
+
1160
+ b. ShareAlike.
1161
+
1162
+ In addition to the conditions in Section 3(a), if You Share
1163
+ Adapted Material You produce, the following conditions also apply.
1164
+
1165
+ 1. The Adapter's License You apply must be a Creative Commons
1166
+ license with the same License Elements, this version or
1167
+ later, or a BY-SA Compatible License.
1168
+
1169
+ 2. You must include the text of, or the URI or hyperlink to, the
1170
+ Adapter's License You apply. You may satisfy this condition
1171
+ in any reasonable manner based on the medium, means, and
1172
+ context in which You Share Adapted Material.
1173
+
1174
+ 3. You may not offer or impose any additional or different terms
1175
+ or conditions on, or apply any Effective Technological
1176
+ Measures to, Adapted Material that restrict exercise of the
1177
+ rights granted under the Adapter's License You apply.
1178
+
1179
+
1180
+ Section 4 -- Sui Generis Database Rights.
1181
+
1182
+ Where the Licensed Rights include Sui Generis Database Rights that
1183
+ apply to Your use of the Licensed Material:
1184
+
1185
+ a. for the avoidance of doubt, Section 2(a)(1) grants You the right
1186
+ to extract, reuse, reproduce, and Share all or a substantial
1187
+ portion of the contents of the database;
1188
+
1189
+ b. if You include all or a substantial portion of the database
1190
+ contents in a database in which You have Sui Generis Database
1191
+ Rights, then the database in which You have Sui Generis Database
1192
+ Rights (but not its individual contents) is Adapted Material,
1193
+
1194
+ including for purposes of Section 3(b); and
1195
+ c. You must comply with the conditions in Section 3(a) if You Share
1196
+ all or a substantial portion of the contents of the database.
1197
+
1198
+ For the avoidance of doubt, this Section 4 supplements and does not
1199
+ replace Your obligations under this Public License where the Licensed
1200
+ Rights include other Copyright and Similar Rights.
1201
+
1202
+
1203
+ Section 5 -- Disclaimer of Warranties and Limitation of Liability.
1204
+
1205
+ a. UNLESS OTHERWISE SEPARATELY UNDERTAKEN BY THE LICENSOR, TO THE
1206
+ EXTENT POSSIBLE, THE LICENSOR OFFERS THE LICENSED MATERIAL AS-IS
1207
+ AND AS-AVAILABLE, AND MAKES NO REPRESENTATIONS OR WARRANTIES OF
1208
+ ANY KIND CONCERNING THE LICENSED MATERIAL, WHETHER EXPRESS,
1209
+ IMPLIED, STATUTORY, OR OTHER. THIS INCLUDES, WITHOUT LIMITATION,
1210
+ WARRANTIES OF TITLE, MERCHANTABILITY, FITNESS FOR A PARTICULAR
1211
+ PURPOSE, NON-INFRINGEMENT, ABSENCE OF LATENT OR OTHER DEFECTS,
1212
+ ACCURACY, OR THE PRESENCE OR ABSENCE OF ERRORS, WHETHER OR NOT
1213
+ KNOWN OR DISCOVERABLE. WHERE DISCLAIMERS OF WARRANTIES ARE NOT
1214
+ ALLOWED IN FULL OR IN PART, THIS DISCLAIMER MAY NOT APPLY TO YOU.
1215
+
1216
+ b. TO THE EXTENT POSSIBLE, IN NO EVENT WILL THE LICENSOR BE LIABLE
1217
+ TO YOU ON ANY LEGAL THEORY (INCLUDING, WITHOUT LIMITATION,
1218
+ NEGLIGENCE) OR OTHERWISE FOR ANY DIRECT, SPECIAL, INDIRECT,
1219
+ INCIDENTAL, CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES,
1220
+ COSTS, EXPENSES, OR DAMAGES ARISING OUT OF THIS PUBLIC LICENSE OR
1221
+ USE OF THE LICENSED MATERIAL, EVEN IF THE LICENSOR HAS BEEN
1222
+ ADVISED OF THE POSSIBILITY OF SUCH LOSSES, COSTS, EXPENSES, OR
1223
+ DAMAGES. WHERE A LIMITATION OF LIABILITY IS NOT ALLOWED IN FULL OR
1224
+ IN PART, THIS LIMITATION MAY NOT APPLY TO YOU.
1225
+
1226
+ c. The disclaimer of warranties and limitation of liability provided
1227
+ above shall be interpreted in a manner that, to the extent
1228
+ possible, most closely approximates an absolute disclaimer and
1229
+ waiver of all liability.
1230
+
1231
+
1232
+ Section 6 -- Term and Termination.
1233
+
1234
+ a. This Public License applies for the term of the Copyright and
1235
+ Similar Rights licensed here. However, if You fail to comply with
1236
+ this Public License, then Your rights under this Public License
1237
+ terminate automatically.
1238
+
1239
+ b. Where Your right to use the Licensed Material has terminated under
1240
+ Section 6(a), it reinstates:
1241
+
1242
+ 1. automatically as of the date the violation is cured, provided
1243
+ it is cured within 30 days of Your discovery of the
1244
+ violation; or
1245
+
1246
+ 2. upon express reinstatement by the Licensor.
1247
+
1248
+ For the avoidance of doubt, this Section 6(b) does not affect any
1249
+ right the Licensor may have to seek remedies for Your violations
1250
+ of this Public License.
1251
+
1252
+ c. For the avoidance of doubt, the Licensor may also offer the
1253
+ Licensed Material under separate terms or conditions or stop
1254
+ distributing the Licensed Material at any time; however, doing so
1255
+ will not terminate this Public License.
1256
+
1257
+ d. Sections 1, 5, 6, 7, and 8 survive termination of this Public
1258
+ License.
1259
+
1260
+
1261
+ Section 7 -- Other Terms and Conditions.
1262
+
1263
+ a. The Licensor shall not be bound by any additional or different
1264
+ terms or conditions communicated by You unless expressly agreed.
1265
+
1266
+ b. Any arrangements, understandings, or agreements regarding the
1267
+ Licensed Material not stated herein are separate from and
1268
+ independent of the terms and conditions of this Public License.
1269
+
1270
+
1271
+ Section 8 -- Interpretation.
1272
+
1273
+ a. For the avoidance of doubt, this Public License does not, and
1274
+ shall not be interpreted to, reduce, limit, restrict, or impose
1275
+ conditions on any use of the Licensed Material that could lawfully
1276
+ be made without permission under this Public License.
1277
+
1278
+ b. To the extent possible, if any provision of this Public License is
1279
+ deemed unenforceable, it shall be automatically reformed to the
1280
+ minimum extent necessary to make it enforceable. If the provision
1281
+ cannot be reformed, it shall be severed from this Public License
1282
+ without affecting the enforceability of the remaining terms and
1283
+ conditions.
1284
+
1285
+ c. No term or condition of this Public License will be waived and no
1286
+ failure to comply consented to unless expressly agreed to by the
1287
+ Licensor.
1288
+
1289
+ d. Nothing in this Public License constitutes or may be interpreted
1290
+ as a limitation upon, or waiver of, any privileges and immunities
1291
+ that apply to the Licensor or You, including from the legal
1292
+ processes of any jurisdiction or authority.
1293
+
1294
+
1295
+ =======================================================================
1296
+
1297
+ Creative Commons is not a party to its public
1298
+ licenses. Notwithstanding, Creative Commons may elect to apply one of
1299
+ its public licenses to material it publishes and in those instances
1300
+ will be considered the “Licensor.” The text of the Creative Commons
1301
+ public licenses is dedicated to the public domain under the CC0 Public
1302
+ Domain Dedication. Except for the limited purpose of indicating that
1303
+ material is shared under a Creative Commons public license or as
1304
+ otherwise permitted by the Creative Commons policies published at
1305
+ creativecommons.org/policies, Creative Commons does not authorize the
1306
+ use of the trademark "Creative Commons" or any other trademark or logo
1307
+ of Creative Commons without its prior written consent including,
1308
+ without limitation, in connection with any unauthorized modifications
1309
+ to any of its public licenses or any other arrangements,
1310
+ understandings, or agreements concerning use of licensed material. For
1311
+ the avoidance of doubt, this paragraph does not form part of the
1312
+ public licenses.
1313
+
1314
+ Creative Commons may be contacted at creativecommons.org.
1315
+
1316
+ ```
1317
+
1318
+
1319
+
1320
+
1321
+ # spaCy lookups data
1322
+
1323
+ * Author: Explosion
1324
+ * URL: https://github.com/explosion/spacy-lookups-data
1325
+ * License: MIT
1326
+
1327
+ ```
1328
+ Copyright 2019-2021 ExplosionAI GmbH
1329
+
1330
+ Permission is hereby granted, free of charge, to any person obtaining a copy of
1331
+ this software and associated documentation files (the "Software"), to deal in
1332
+ the Software without restriction, including without limitation the rights to
1333
+ use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies
1334
+ of the Software, and to permit persons to whom the Software is furnished to do
1335
+ so, subject to the following conditions:
1336
+
1337
+ The above copyright notice and this permission notice shall be included in all
1338
+ copies or substantial portions of the Software.
1339
+
1340
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
1341
+ IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
1342
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
1343
+ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
1344
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
1345
+ OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
1346
+ SOFTWARE.
1347
+ ```
1348
+
1349
+
1350
+
1351
+
README.md ADDED
@@ -0,0 +1,96 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - spacy
4
+ - token-classification
5
+ language:
6
+ - mk
7
+ license: CC-BY-SA-4.0
8
+ model-index:
9
+ - name: mk_core_news_sm
10
+ results:
11
+ - tasks:
12
+ name: NER
13
+ type: token-classification
14
+ metrics:
15
+ - name: Precision
16
+ type: precision
17
+ value: 0.7100802855
18
+ - name: Recall
19
+ type: recall
20
+ value: 0.6774468085
21
+ - name: F Score
22
+ type: f_score
23
+ value: 0.6933797909
24
+ - tasks:
25
+ name: SENTER
26
+ type: token-classification
27
+ metrics:
28
+ - name: Precision
29
+ type: precision
30
+ value: 0.768115942
31
+ - name: Recall
32
+ type: recall
33
+ value: 0.6883116883
34
+ - name: F Score
35
+ type: f_score
36
+ value: 0.7260273973
37
+ - tasks:
38
+ name: UNLABELED_DEPENDENCIES
39
+ type: token-classification
40
+ metrics:
41
+ - name: Accuracy
42
+ type: accuracy
43
+ value: 0.6457311089
44
+ - tasks:
45
+ name: LABELED_DEPENDENCIES
46
+ type: token-classification
47
+ metrics:
48
+ - name: Accuracy
49
+ type: accuracy
50
+ value: 0.6457311089
51
+ ---
52
+ ### Details: https://spacy.io/models/mk#mk_core_news_sm
53
+
54
+ Macedonian pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.
55
+
56
+ | Feature | Description |
57
+ | --- | --- |
58
+ | **Name** | `mk_core_news_sm` |
59
+ | **Version** | `3.1.0` |
60
+ | **spaCy** | `>=3.1.0,<3.2.0` |
61
+ | **Default Pipeline** | `morphologizer`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
62
+ | **Components** | `morphologizer`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
63
+ | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
64
+ | **Sources** | [Macedonian Corpus](https://blog.netcetera.com/macedonian-spacy-f3c85484777f) (Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska)<br />[Macedonian Corpus](https://blog.netcetera.com/macedonian-spacy-f3c85484777f) (Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska)<br />[Macedonian Corpus](https://blog.netcetera.com/macedonian-spacy-f3c85484777f) (Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska)<br />[spaCy lookups data](https://github.com/explosion/spacy-lookups-data) (Explosion) |
65
+ | **License** | `CC BY-SA 4.0` |
66
+ | **Author** | [Explosion](https://explosion.ai) |
67
+
68
+ ### Label Scheme
69
+
70
+ <details>
71
+
72
+ <summary>View label scheme (55 labels for 4 components)</summary>
73
+
74
+ | Component | Labels |
75
+ | --- | --- |
76
+ | **`morphologizer`** | `POS=PROPN`, `POS=AUX`, `POS=ADJ`, `POS=NOUN`, `POS=ADP`, `POS=PUNCT`, `POS=CONJ`, `POS=NUM`, `POS=VERB`, `POS=PRON`, `POS=ADV`, `POS=SCONJ`, `POS=PART`, `POS=SYM`, `POS=X`, `_`, `POS=INTJ` |
77
+ | **`parser`** | `ROOT`, `advmod`, `att`, `aux`, `cc`, `dep`, `det`, `dobj`, `iobj`, `neg`, `nsubj`, `pobj`, `poss`, `pozm`, `pozv`, `prep`, `punct`, `relcl` |
78
+ | **`senter`** | `I`, `S` |
79
+ | **`ner`** | `CARDINAL`, `DATE`, `EVENT`, `FAC`, `GPE`, `LANGUAGE`, `LAW`, `LOC`, `MONEY`, `NORP`, `ORDINAL`, `ORG`, `PERCENT`, `PERSON`, `PRODUCT`, `QUANTITY`, `TIME`, `WORK_OF_ART` |
80
+
81
+ </details>
82
+
83
+ ### Accuracy
84
+
85
+ | Type | Score |
86
+ | --- | --- |
87
+ | `TOKEN_ACC` | 100.00 |
88
+ | `POS_ACC` | 92.10 |
89
+ | `SENTS_P` | 76.81 |
90
+ | `SENTS_R` | 68.83 |
91
+ | `SENTS_F` | 72.60 |
92
+ | `DEP_UAS` | 64.57 |
93
+ | `DEP_LAS` | 47.50 |
94
+ | `ENTS_P` | 71.01 |
95
+ | `ENTS_R` | 67.74 |
96
+ | `ENTS_F` | 69.34 |
accuracy.json ADDED
@@ -0,0 +1,257 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "token_acc": 1.0,
3
+ "pos_acc": 0.9209603453,
4
+ "sents_p": 0.768115942,
5
+ "sents_r": 0.6883116883,
6
+ "sents_f": 0.7260273973,
7
+ "speed": 2753.0743975125,
8
+ "dep_uas": 0.6457311089,
9
+ "dep_las": 0.4749754661,
10
+ "dep_las_per_type": {
11
+ "nsubj": {
12
+ "p": 0.525,
13
+ "r": 0.5526315789,
14
+ "f": 0.5384615385
15
+ },
16
+ "root": {
17
+ "p": 0.6956521739,
18
+ "r": 0.6857142857,
19
+ "f": 0.690647482
20
+ },
21
+ "cc": {
22
+ "p": 0.7368421053,
23
+ "r": 0.5,
24
+ "f": 0.5957446809
25
+ },
26
+ "relcl": {
27
+ "p": 0.380952381,
28
+ "r": 0.3076923077,
29
+ "f": 0.3404255319
30
+ },
31
+ "pozm": {
32
+ "p": 0.6,
33
+ "r": 0.2727272727,
34
+ "f": 0.375
35
+ },
36
+ "poss": {
37
+ "p": 0.0,
38
+ "r": 0.0,
39
+ "f": 0.0
40
+ },
41
+ "aux": {
42
+ "p": 0.5454545455,
43
+ "r": 0.7272727273,
44
+ "f": 0.6233766234
45
+ },
46
+ "prep": {
47
+ "p": 0.703125,
48
+ "r": 0.75,
49
+ "f": 0.7258064516
50
+ },
51
+ "iobj": {
52
+ "p": 0.0,
53
+ "r": 0.0,
54
+ "f": 0.0
55
+ },
56
+ "pozv": {
57
+ "p": 0.125,
58
+ "r": 0.0666666667,
59
+ "f": 0.0869565217
60
+ },
61
+ "quantmod": {
62
+ "p": 0.0,
63
+ "r": 0.0,
64
+ "f": 0.0
65
+ },
66
+ "att": {
67
+ "p": 0.6222222222,
68
+ "r": 0.5384615385,
69
+ "f": 0.5773195876
70
+ },
71
+ "det": {
72
+ "p": 0.0,
73
+ "r": 0.0,
74
+ "f": 0.0
75
+ },
76
+ "num": {
77
+ "p": 0.0,
78
+ "r": 0.0,
79
+ "f": 0.0
80
+ },
81
+ "dep": {
82
+ "p": 0.0,
83
+ "r": 0.0,
84
+ "f": 0.0
85
+ },
86
+ "dobj": {
87
+ "p": 0.3521126761,
88
+ "r": 0.4166666667,
89
+ "f": 0.3816793893
90
+ },
91
+ "ppdo": {
92
+ "p": 0.6666666667,
93
+ "r": 0.4,
94
+ "f": 0.5
95
+ },
96
+ "neg": {
97
+ "p": 0.5555555556,
98
+ "r": 0.4545454545,
99
+ "f": 0.5
100
+ },
101
+ "pobj": {
102
+ "p": 0.3823529412,
103
+ "r": 0.40625,
104
+ "f": 0.3939393939
105
+ },
106
+ "mwe": {
107
+ "p": 0.0,
108
+ "r": 0.0,
109
+ "f": 0.0
110
+ },
111
+ "ppio": {
112
+ "p": 0.0,
113
+ "r": 0.0,
114
+ "f": 0.0
115
+ },
116
+ "appos": {
117
+ "p": 0.0,
118
+ "r": 0.0,
119
+ "f": 0.0
120
+ },
121
+ "advmod": {
122
+ "p": 0.3333333333,
123
+ "r": 0.5,
124
+ "f": 0.4
125
+ },
126
+ "advcl": {
127
+ "p": 0.0,
128
+ "r": 0.0,
129
+ "f": 0.0
130
+ },
131
+ "number": {
132
+ "p": 0.0,
133
+ "r": 0.0,
134
+ "f": 0.0
135
+ },
136
+ "amod": {
137
+ "p": 0.0,
138
+ "r": 0.0,
139
+ "f": 0.0
140
+ },
141
+ "_": {
142
+ "p": 0.0,
143
+ "r": 0.0,
144
+ "f": 0.0
145
+ },
146
+ "acl": {
147
+ "p": 0.0,
148
+ "r": 0.0,
149
+ "f": 0.0
150
+ },
151
+ "pozn": {
152
+ "p": 0.0,
153
+ "r": 0.0,
154
+ "f": 0.0
155
+ },
156
+ "pozk": {
157
+ "p": 0.0,
158
+ "r": 0.0,
159
+ "f": 0.0
160
+ }
161
+ },
162
+ "ents_p": 0.7100802855,
163
+ "ents_r": 0.6774468085,
164
+ "ents_f": 0.6933797909,
165
+ "ents_per_type": {
166
+ "NORP": {
167
+ "p": 0.4893617021,
168
+ "r": 0.3538461538,
169
+ "f": 0.4107142857
170
+ },
171
+ "GPE": {
172
+ "p": 0.8321995465,
173
+ "r": 0.8655660377,
174
+ "f": 0.8485549133
175
+ },
176
+ "LOC": {
177
+ "p": 0.649122807,
178
+ "r": 0.4252873563,
179
+ "f": 0.5138888889
180
+ },
181
+ "CARDINAL": {
182
+ "p": 0.6847826087,
183
+ "r": 0.670212766,
184
+ "f": 0.6774193548
185
+ },
186
+ "QUANTITY": {
187
+ "p": 0.6285714286,
188
+ "r": 0.5365853659,
189
+ "f": 0.5789473684
190
+ },
191
+ "FAC": {
192
+ "p": 0.2,
193
+ "r": 0.1,
194
+ "f": 0.1333333333
195
+ },
196
+ "ORG": {
197
+ "p": 0.4558823529,
198
+ "r": 0.6326530612,
199
+ "f": 0.5299145299
200
+ },
201
+ "PERSON": {
202
+ "p": 0.68,
203
+ "r": 0.68,
204
+ "f": 0.68
205
+ },
206
+ "ORDINAL": {
207
+ "p": 0.4545454545,
208
+ "r": 0.4545454545,
209
+ "f": 0.4545454545
210
+ },
211
+ "DATE": {
212
+ "p": 0.7183098592,
213
+ "r": 0.7183098592,
214
+ "f": 0.7183098592
215
+ },
216
+ "MONEY": {
217
+ "p": 0.5,
218
+ "r": 1.0,
219
+ "f": 0.6666666667
220
+ },
221
+ "PERCENT": {
222
+ "p": 1.0,
223
+ "r": 1.0,
224
+ "f": 1.0
225
+ },
226
+ "WORK_OF_ART": {
227
+ "p": 0.5238095238,
228
+ "r": 0.2682926829,
229
+ "f": 0.3548387097
230
+ },
231
+ "LANGUAGE": {
232
+ "p": 0.0,
233
+ "r": 0.0,
234
+ "f": 0.0
235
+ },
236
+ "TIME": {
237
+ "p": 1.0,
238
+ "r": 0.8333333333,
239
+ "f": 0.9090909091
240
+ },
241
+ "EVENT": {
242
+ "p": 0.4210526316,
243
+ "r": 0.4705882353,
244
+ "f": 0.4444444444
245
+ },
246
+ "LAW": {
247
+ "p": 0.0,
248
+ "r": 0.0,
249
+ "f": 0.0
250
+ },
251
+ "PRODUCT": {
252
+ "p": 0.0,
253
+ "r": 0.0,
254
+ "f": 0.0
255
+ }
256
+ }
257
+ }
attribute_ruler/patterns ADDED
Binary file (956 Bytes). View file
config.cfg ADDED
@@ -0,0 +1,266 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [paths]
2
+ train = "corpus/mk-tag-news/train.spacy"
3
+ dev = "corpus/mk-tag-news/dev.spacy"
4
+ vectors = null
5
+ raw = null
6
+ init_tok2vec = null
7
+ vocab_data = null
8
+
9
+ [system]
10
+ gpu_allocator = null
11
+ seed = 0
12
+
13
+ [nlp]
14
+ lang = "mk"
15
+ pipeline = ["morphologizer","parser","senter","attribute_ruler","lemmatizer","ner"]
16
+ disabled = ["senter"]
17
+ before_creation = null
18
+ after_creation = null
19
+ after_pipeline_creation = null
20
+ batch_size = 256
21
+ tokenizer = {"@tokenizers":"spacy.Tokenizer.v1"}
22
+
23
+ [components]
24
+
25
+ [components.attribute_ruler]
26
+ factory = "attribute_ruler"
27
+ validate = false
28
+
29
+ [components.lemmatizer]
30
+ factory = "lemmatizer"
31
+ mode = "rule"
32
+ model = null
33
+ overwrite = false
34
+
35
+ [components.morphologizer]
36
+ factory = "morphologizer"
37
+
38
+ [components.morphologizer.model]
39
+ @architectures = "spacy.Tagger.v1"
40
+ nO = null
41
+
42
+ [components.morphologizer.model.tok2vec]
43
+ @architectures = "spacy.Tok2Vec.v2"
44
+
45
+ [components.morphologizer.model.tok2vec.embed]
46
+ @architectures = "spacy.MultiHashEmbed.v2"
47
+ width = ${components.morphologizer.model.tok2vec.encode:width}
48
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
49
+ rows = [5000,2500,2500,2500]
50
+ include_static_vectors = false
51
+
52
+ [components.morphologizer.model.tok2vec.encode]
53
+ @architectures = "spacy.MaxoutWindowEncoder.v2"
54
+ width = 96
55
+ depth = 4
56
+ window_size = 1
57
+ maxout_pieces = 3
58
+
59
+ [components.ner]
60
+ factory = "ner"
61
+ incorrect_spans_key = null
62
+ moves = null
63
+ update_with_oracle_cut_size = 100
64
+
65
+ [components.ner.model]
66
+ @architectures = "spacy.TransitionBasedParser.v2"
67
+ state_type = "ner"
68
+ extra_state_tokens = false
69
+ hidden_width = 64
70
+ maxout_pieces = 2
71
+ use_upper = true
72
+ nO = null
73
+
74
+ [components.ner.model.tok2vec]
75
+ @architectures = "spacy.Tok2Vec.v2"
76
+
77
+ [components.ner.model.tok2vec.embed]
78
+ @architectures = "spacy.MultiHashEmbed.v2"
79
+ width = 96
80
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
81
+ rows = [5000,2500,2500,2500]
82
+ include_static_vectors = false
83
+
84
+ [components.ner.model.tok2vec.encode]
85
+ @architectures = "spacy.MaxoutWindowEncoder.v2"
86
+ width = 96
87
+ depth = 4
88
+ window_size = 1
89
+ maxout_pieces = 3
90
+
91
+ [components.parser]
92
+ factory = "parser"
93
+ learn_tokens = false
94
+ min_action_freq = 30
95
+ moves = null
96
+ update_with_oracle_cut_size = 100
97
+
98
+ [components.parser.model]
99
+ @architectures = "spacy.TransitionBasedParser.v2"
100
+ state_type = "parser"
101
+ extra_state_tokens = false
102
+ hidden_width = 64
103
+ maxout_pieces = 2
104
+ use_upper = true
105
+ nO = null
106
+
107
+ [components.parser.model.tok2vec]
108
+ @architectures = "spacy.Tok2Vec.v2"
109
+
110
+ [components.parser.model.tok2vec.embed]
111
+ @architectures = "spacy.MultiHashEmbed.v2"
112
+ width = 96
113
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
114
+ rows = [5000,2500,2500,2500]
115
+ include_static_vectors = false
116
+
117
+ [components.parser.model.tok2vec.encode]
118
+ @architectures = "spacy.MaxoutWindowEncoder.v2"
119
+ width = 96
120
+ depth = 4
121
+ window_size = 1
122
+ maxout_pieces = 3
123
+
124
+ [components.senter]
125
+ factory = "senter"
126
+
127
+ [components.senter.model]
128
+ @architectures = "spacy.Tagger.v1"
129
+ nO = null
130
+
131
+ [components.senter.model.tok2vec]
132
+ @architectures = "spacy.Tok2Vec.v2"
133
+
134
+ [components.senter.model.tok2vec.embed]
135
+ @architectures = "spacy.MultiHashEmbed.v2"
136
+ width = 16
137
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
138
+ rows = [1000,500,500,500]
139
+ include_static_vectors = false
140
+
141
+ [components.senter.model.tok2vec.encode]
142
+ @architectures = "spacy.MaxoutWindowEncoder.v2"
143
+ width = 16
144
+ depth = 2
145
+ window_size = 1
146
+ maxout_pieces = 2
147
+
148
+ [corpora]
149
+
150
+ [corpora.dev]
151
+ @readers = "spacy.Corpus.v1"
152
+ limit = 0
153
+ max_length = 0
154
+ path = ${paths:dev}
155
+ gold_preproc = false
156
+ augmenter = null
157
+
158
+ [corpora.train]
159
+ @readers = "spacy.Corpus.v1"
160
+ path = ${paths:train}
161
+ max_length = 5000
162
+ gold_preproc = false
163
+ limit = 0
164
+
165
+ [corpora.train.augmenter]
166
+ @augmenters = "spacy.lower_case.v1"
167
+ level = 0.1
168
+
169
+ [training]
170
+ train_corpus = "corpora.train"
171
+ dev_corpus = "corpora.dev"
172
+ seed = ${system:seed}
173
+ gpu_allocator = ${system:gpu_allocator}
174
+ dropout = 0.1
175
+ accumulate_gradient = 1
176
+ patience = 5000
177
+ max_epochs = 0
178
+ max_steps = 0
179
+ eval_frequency = 1000
180
+ frozen_components = []
181
+ before_to_disk = null
182
+ annotating_components = []
183
+
184
+ [training.batcher]
185
+ @batchers = "spacy.batch_by_words.v1"
186
+ discard_oversize = false
187
+ tolerance = 0.2
188
+ get_length = null
189
+
190
+ [training.batcher.size]
191
+ @schedules = "compounding.v1"
192
+ start = 100
193
+ stop = 1000
194
+ compound = 1.001
195
+ t = 0.0
196
+
197
+ [training.logger]
198
+ @loggers = "spacy.WandbLogger.v1"
199
+ project_name = "spacy-v3.0.0a2"
200
+ remove_config_values = []
201
+
202
+ [training.optimizer]
203
+ @optimizers = "Adam.v1"
204
+ beta1 = 0.9
205
+ beta2 = 0.999
206
+ L2_is_weight_decay = true
207
+ L2 = 0.01
208
+ grad_clip = 1.0
209
+ use_averages = true
210
+ eps = 0.00000001
211
+ learn_rate = 0.001
212
+
213
+ [training.score_weights]
214
+ pos_acc = 0.2
215
+ morph_acc = null
216
+ morph_per_feat = null
217
+ dep_uas = 0.1
218
+ dep_las = 0.1
219
+ dep_las_per_type = null
220
+ sents_p = 0.0
221
+ sents_r = 0.0
222
+ sents_f = 0.2
223
+ lemma_acc = 0.2
224
+ ents_f = 0.2
225
+ ents_p = 0.0
226
+ ents_r = 0.0
227
+ ents_per_type = null
228
+
229
+ [pretraining]
230
+
231
+ [initialize]
232
+ vocab_data = ${paths.vocab_data}
233
+ vectors = ${paths.vectors}
234
+ init_tok2vec = ${paths.init_tok2vec}
235
+ before_init = null
236
+ after_init = null
237
+
238
+ [initialize.components]
239
+
240
+ [initialize.components.morphologizer]
241
+
242
+ [initialize.components.morphologizer.labels]
243
+ @readers = "spacy.read_labels.v1"
244
+ path = "corpus/labels/morphologizer.json"
245
+ require = false
246
+
247
+ [initialize.components.ner]
248
+
249
+ [initialize.components.ner.labels]
250
+ @readers = "spacy.read_labels.v1"
251
+ path = "corpus/labels/ner.json"
252
+ require = false
253
+
254
+ [initialize.components.parser]
255
+
256
+ [initialize.components.parser.labels]
257
+ @readers = "spacy.read_labels.v1"
258
+ path = "corpus/labels/parser.json"
259
+ require = false
260
+
261
+ [initialize.lookups]
262
+ @misc = "spacy.LookupsDataLoader.v1"
263
+ lang = ${nlp.lang}
264
+ tables = ["lexeme_norm"]
265
+
266
+ [initialize.tokenizer]
lemmatizer/lookups/lookups.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:99add14a891bec130812b7f891e5ee103e764bcb24e10867280abe41ab8d474a
3
+ size 902279
meta.json ADDED
@@ -0,0 +1,393 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "lang":"mk",
3
+ "name":"core_news_sm",
4
+ "version":"3.1.0",
5
+ "description":"Macedonian pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.",
6
+ "author":"Explosion",
7
+ "email":"contact@explosion.ai",
8
+ "url":"https://explosion.ai",
9
+ "license":"CC BY-SA 4.0",
10
+ "spacy_version":">=3.1.0,<3.2.0",
11
+ "spacy_git_version":"caba63b74",
12
+ "vectors":{
13
+ "width":0,
14
+ "vectors":0,
15
+ "keys":0,
16
+ "name":null
17
+ },
18
+ "labels":{
19
+ "morphologizer":[
20
+ "POS=PROPN",
21
+ "POS=AUX",
22
+ "POS=ADJ",
23
+ "POS=NOUN",
24
+ "POS=ADP",
25
+ "POS=PUNCT",
26
+ "POS=CONJ",
27
+ "POS=NUM",
28
+ "POS=VERB",
29
+ "POS=PRON",
30
+ "POS=ADV",
31
+ "POS=SCONJ",
32
+ "POS=PART",
33
+ "POS=SYM",
34
+ "POS=X",
35
+ "_",
36
+ "POS=INTJ"
37
+ ],
38
+ "parser":[
39
+ "ROOT",
40
+ "advmod",
41
+ "att",
42
+ "aux",
43
+ "cc",
44
+ "dep",
45
+ "det",
46
+ "dobj",
47
+ "iobj",
48
+ "neg",
49
+ "nsubj",
50
+ "pobj",
51
+ "poss",
52
+ "pozm",
53
+ "pozv",
54
+ "prep",
55
+ "punct",
56
+ "relcl"
57
+ ],
58
+ "senter":[
59
+ "I",
60
+ "S"
61
+ ],
62
+ "attribute_ruler":[
63
+
64
+ ],
65
+ "lemmatizer":[
66
+
67
+ ],
68
+ "ner":[
69
+ "CARDINAL",
70
+ "DATE",
71
+ "EVENT",
72
+ "FAC",
73
+ "GPE",
74
+ "LANGUAGE",
75
+ "LAW",
76
+ "LOC",
77
+ "MONEY",
78
+ "NORP",
79
+ "ORDINAL",
80
+ "ORG",
81
+ "PERCENT",
82
+ "PERSON",
83
+ "PRODUCT",
84
+ "QUANTITY",
85
+ "TIME",
86
+ "WORK_OF_ART"
87
+ ]
88
+ },
89
+ "pipeline":[
90
+ "morphologizer",
91
+ "parser",
92
+ "attribute_ruler",
93
+ "lemmatizer",
94
+ "ner"
95
+ ],
96
+ "components":[
97
+ "morphologizer",
98
+ "parser",
99
+ "senter",
100
+ "attribute_ruler",
101
+ "lemmatizer",
102
+ "ner"
103
+ ],
104
+ "disabled":[
105
+ "senter"
106
+ ],
107
+ "performance":{
108
+ "token_acc":1.0,
109
+ "pos_acc":0.9209603453,
110
+ "sents_p":0.768115942,
111
+ "sents_r":0.6883116883,
112
+ "sents_f":0.7260273973,
113
+ "speed":2753.0743975125,
114
+ "dep_uas":0.6457311089,
115
+ "dep_las":0.4749754661,
116
+ "dep_las_per_type":{
117
+ "nsubj":{
118
+ "p":0.525,
119
+ "r":0.5526315789,
120
+ "f":0.5384615385
121
+ },
122
+ "root":{
123
+ "p":0.6956521739,
124
+ "r":0.6857142857,
125
+ "f":0.690647482
126
+ },
127
+ "cc":{
128
+ "p":0.7368421053,
129
+ "r":0.5,
130
+ "f":0.5957446809
131
+ },
132
+ "relcl":{
133
+ "p":0.380952381,
134
+ "r":0.3076923077,
135
+ "f":0.3404255319
136
+ },
137
+ "pozm":{
138
+ "p":0.6,
139
+ "r":0.2727272727,
140
+ "f":0.375
141
+ },
142
+ "poss":{
143
+ "p":0.0,
144
+ "r":0.0,
145
+ "f":0.0
146
+ },
147
+ "aux":{
148
+ "p":0.5454545455,
149
+ "r":0.7272727273,
150
+ "f":0.6233766234
151
+ },
152
+ "prep":{
153
+ "p":0.703125,
154
+ "r":0.75,
155
+ "f":0.7258064516
156
+ },
157
+ "iobj":{
158
+ "p":0.0,
159
+ "r":0.0,
160
+ "f":0.0
161
+ },
162
+ "pozv":{
163
+ "p":0.125,
164
+ "r":0.0666666667,
165
+ "f":0.0869565217
166
+ },
167
+ "quantmod":{
168
+ "p":0.0,
169
+ "r":0.0,
170
+ "f":0.0
171
+ },
172
+ "att":{
173
+ "p":0.6222222222,
174
+ "r":0.5384615385,
175
+ "f":0.5773195876
176
+ },
177
+ "det":{
178
+ "p":0.0,
179
+ "r":0.0,
180
+ "f":0.0
181
+ },
182
+ "num":{
183
+ "p":0.0,
184
+ "r":0.0,
185
+ "f":0.0
186
+ },
187
+ "dep":{
188
+ "p":0.0,
189
+ "r":0.0,
190
+ "f":0.0
191
+ },
192
+ "dobj":{
193
+ "p":0.3521126761,
194
+ "r":0.4166666667,
195
+ "f":0.3816793893
196
+ },
197
+ "ppdo":{
198
+ "p":0.6666666667,
199
+ "r":0.4,
200
+ "f":0.5
201
+ },
202
+ "neg":{
203
+ "p":0.5555555556,
204
+ "r":0.4545454545,
205
+ "f":0.5
206
+ },
207
+ "pobj":{
208
+ "p":0.3823529412,
209
+ "r":0.40625,
210
+ "f":0.3939393939
211
+ },
212
+ "mwe":{
213
+ "p":0.0,
214
+ "r":0.0,
215
+ "f":0.0
216
+ },
217
+ "ppio":{
218
+ "p":0.0,
219
+ "r":0.0,
220
+ "f":0.0
221
+ },
222
+ "appos":{
223
+ "p":0.0,
224
+ "r":0.0,
225
+ "f":0.0
226
+ },
227
+ "advmod":{
228
+ "p":0.3333333333,
229
+ "r":0.5,
230
+ "f":0.4
231
+ },
232
+ "advcl":{
233
+ "p":0.0,
234
+ "r":0.0,
235
+ "f":0.0
236
+ },
237
+ "number":{
238
+ "p":0.0,
239
+ "r":0.0,
240
+ "f":0.0
241
+ },
242
+ "amod":{
243
+ "p":0.0,
244
+ "r":0.0,
245
+ "f":0.0
246
+ },
247
+ "_":{
248
+ "p":0.0,
249
+ "r":0.0,
250
+ "f":0.0
251
+ },
252
+ "acl":{
253
+ "p":0.0,
254
+ "r":0.0,
255
+ "f":0.0
256
+ },
257
+ "pozn":{
258
+ "p":0.0,
259
+ "r":0.0,
260
+ "f":0.0
261
+ },
262
+ "pozk":{
263
+ "p":0.0,
264
+ "r":0.0,
265
+ "f":0.0
266
+ }
267
+ },
268
+ "ents_p":0.7100802855,
269
+ "ents_r":0.6774468085,
270
+ "ents_f":0.6933797909,
271
+ "ents_per_type":{
272
+ "NORP":{
273
+ "p":0.4893617021,
274
+ "r":0.3538461538,
275
+ "f":0.4107142857
276
+ },
277
+ "GPE":{
278
+ "p":0.8321995465,
279
+ "r":0.8655660377,
280
+ "f":0.8485549133
281
+ },
282
+ "LOC":{
283
+ "p":0.649122807,
284
+ "r":0.4252873563,
285
+ "f":0.5138888889
286
+ },
287
+ "CARDINAL":{
288
+ "p":0.6847826087,
289
+ "r":0.670212766,
290
+ "f":0.6774193548
291
+ },
292
+ "QUANTITY":{
293
+ "p":0.6285714286,
294
+ "r":0.5365853659,
295
+ "f":0.5789473684
296
+ },
297
+ "FAC":{
298
+ "p":0.2,
299
+ "r":0.1,
300
+ "f":0.1333333333
301
+ },
302
+ "ORG":{
303
+ "p":0.4558823529,
304
+ "r":0.6326530612,
305
+ "f":0.5299145299
306
+ },
307
+ "PERSON":{
308
+ "p":0.68,
309
+ "r":0.68,
310
+ "f":0.68
311
+ },
312
+ "ORDINAL":{
313
+ "p":0.4545454545,
314
+ "r":0.4545454545,
315
+ "f":0.4545454545
316
+ },
317
+ "DATE":{
318
+ "p":0.7183098592,
319
+ "r":0.7183098592,
320
+ "f":0.7183098592
321
+ },
322
+ "MONEY":{
323
+ "p":0.5,
324
+ "r":1.0,
325
+ "f":0.6666666667
326
+ },
327
+ "PERCENT":{
328
+ "p":1.0,
329
+ "r":1.0,
330
+ "f":1.0
331
+ },
332
+ "WORK_OF_ART":{
333
+ "p":0.5238095238,
334
+ "r":0.2682926829,
335
+ "f":0.3548387097
336
+ },
337
+ "LANGUAGE":{
338
+ "p":0.0,
339
+ "r":0.0,
340
+ "f":0.0
341
+ },
342
+ "TIME":{
343
+ "p":1.0,
344
+ "r":0.8333333333,
345
+ "f":0.9090909091
346
+ },
347
+ "EVENT":{
348
+ "p":0.4210526316,
349
+ "r":0.4705882353,
350
+ "f":0.4444444444
351
+ },
352
+ "LAW":{
353
+ "p":0.0,
354
+ "r":0.0,
355
+ "f":0.0
356
+ },
357
+ "PRODUCT":{
358
+ "p":0.0,
359
+ "r":0.0,
360
+ "f":0.0
361
+ }
362
+ }
363
+ },
364
+ "sources":[
365
+ {
366
+ "name":"Macedonian Corpus",
367
+ "url":"https://blog.netcetera.com/macedonian-spacy-f3c85484777f",
368
+ "license":"CC BY-SA 4.0",
369
+ "author":"Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska"
370
+ },
371
+ {
372
+ "name":"Macedonian Corpus",
373
+ "url":"https://blog.netcetera.com/macedonian-spacy-f3c85484777f",
374
+ "license":"CC BY-SA 4.0",
375
+ "author":"Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska"
376
+ },
377
+ {
378
+ "name":"Macedonian Corpus",
379
+ "url":"https://blog.netcetera.com/macedonian-spacy-f3c85484777f",
380
+ "license":"CC BY-SA 4.0",
381
+ "author":"Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska"
382
+ },
383
+ {
384
+ "name":"spaCy lookups data",
385
+ "author":"Explosion",
386
+ "url":"https://github.com/explosion/spacy-lookups-data",
387
+ "license":"MIT"
388
+ }
389
+ ],
390
+ "requirements":[
391
+
392
+ ]
393
+ }
mk_core_news_sm-any-py3-none-any.whl ADDED
@@ -0,0 +1,3 @@
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5069c5dfdcd487ab2d9c34c74d0ed57c3c64b504bac15a7243696c719971edfd
3
+ size 19221627
morphologizer/cfg ADDED
@@ -0,0 +1,40 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "labels_morph":{
3
+ "POS=PROPN":"",
4
+ "POS=AUX":"",
5
+ "POS=ADJ":"",
6
+ "POS=NOUN":"",
7
+ "POS=ADP":"",
8
+ "POS=PUNCT":"",
9
+ "POS=CONJ":"",
10
+ "POS=NUM":"",
11
+ "POS=VERB":"",
12
+ "POS=PRON":"",
13
+ "POS=ADV":"",
14
+ "POS=SCONJ":"",
15
+ "POS=PART":"",
16
+ "POS=SYM":"",
17
+ "POS=X":"",
18
+ "_":"",
19
+ "POS=INTJ":""
20
+ },
21
+ "labels_pos":{
22
+ "POS=PROPN":96,
23
+ "POS=AUX":87,
24
+ "POS=ADJ":84,
25
+ "POS=NOUN":92,
26
+ "POS=ADP":85,
27
+ "POS=PUNCT":97,
28
+ "POS=CONJ":88,
29
+ "POS=NUM":93,
30
+ "POS=VERB":100,
31
+ "POS=PRON":95,
32
+ "POS=ADV":86,
33
+ "POS=SCONJ":98,
34
+ "POS=PART":94,
35
+ "POS=SYM":99,
36
+ "POS=X":101,
37
+ "_":0,
38
+ "POS=INTJ":91
39
+ }
40
+ }
morphologizer/model ADDED
Binary file (6.59 MB). View file
ner/cfg ADDED
@@ -0,0 +1,13 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "moves":null,
3
+ "update_with_oracle_cut_size":100,
4
+ "multitasks":[
5
+
6
+ ],
7
+ "min_action_freq":1,
8
+ "learn_tokens":false,
9
+ "beam_width":1,
10
+ "beam_density":0.0,
11
+ "beam_update_prob":0.0,
12
+ "incorrect_spans_key":null
13
+ }
ner/model ADDED
Binary file (6.73 MB). View file
ner/moves ADDED
@@ -0,0 +1 @@
 
1
+ ��moves��{"0":{},"1":{"GPE":3855,"PERSON":2039,"DATE":1865,"ORG":1187,"NORP":1025,"WORK_OF_ART":983,"CARDINAL":641,"LOC":600,"EVENT":476,"FAC":418,"QUANTITY":284,"LAW":141,"PERCENT":136,"TIME":125,"ORDINAL":118,"LANGUAGE":109,"PRODUCT":57,"MONEY":43},"2":{"GPE":3855,"PERSON":2039,"DATE":1865,"ORG":1187,"NORP":1025,"WORK_OF_ART":983,"CARDINAL":641,"LOC":600,"EVENT":476,"FAC":418,"QUANTITY":284,"LAW":141,"PERCENT":136,"TIME":125,"ORDINAL":118,"LANGUAGE":109,"PRODUCT":57,"MONEY":43},"3":{"GPE":3855,"PERSON":2039,"DATE":1865,"ORG":1187,"NORP":1025,"WORK_OF_ART":983,"CARDINAL":641,"LOC":600,"EVENT":476,"FAC":418,"QUANTITY":284,"LAW":141,"PERCENT":136,"TIME":125,"ORDINAL":118,"LANGUAGE":109,"PRODUCT":57,"MONEY":43},"4":{"GPE":3855,"PERSON":2039,"DATE":1865,"ORG":1187,"NORP":1025,"WORK_OF_ART":983,"CARDINAL":641,"LOC":600,"EVENT":476,"FAC":418,"QUANTITY":284,"LAW":141,"PERCENT":136,"TIME":125,"ORDINAL":118,"LANGUAGE":109,"PRODUCT":57,"MONEY":43,"":1},"5":{"":1}}�cfg��neg_key�
parser/cfg ADDED
@@ -0,0 +1,13 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "moves":null,
3
+ "update_with_oracle_cut_size":100,
4
+ "multitasks":[
5
+
6
+ ],
7
+ "min_action_freq":30,
8
+ "learn_tokens":false,
9
+ "beam_width":1,
10
+ "beam_density":0.0,
11
+ "beam_update_prob":0.0,
12
+ "incorrect_spans_key":null
13
+ }
parser/model ADDED
Binary file (6.88 MB). View file
parser/moves ADDED
@@ -0,0 +1 @@
 
1
+ ��moves�2{"0":{"":1190},"1":{"":2140},"2":{"nsubj":278,"aux":187,"att":180,"neg":67,"prep":56,"poss":42,"pozv":37,"advmod":36,"ppdo||dobj":35,"dep":0},"3":{"punct":550,"dobj":316,"prep":291,"relcl":148,"pobj":141,"aux":87,"cc":70,"iobj":63,"att":56,"nsubj":51,"pozv":44,"pozm":40,"det":31,"dep":0},"4":{"ROOT":500}}�cfg��neg_key�
senter/cfg ADDED
@@ -0,0 +1,3 @@
 
 
 
1
+ {
2
+
3
+ }
senter/model ADDED
Binary file (190 kB). View file
tokenizer ADDED
@@ -0,0 +1,3 @@
 
 
 
1
+ ��prefix_search� ~^§|^%|^=|^—|^–|^\+(?![0-9])|^…|^……|^,|^:|^;|^\!|^\?|^¿|^؟|^¡|^\(|^\)|^\[|^\]|^\{|^\}|^<|^>|^_|^#|^\*|^&|^。|^?|^!|^,|^、|^;|^:|^~|^·|^।|^،|^۔|^؛|^٪|^\.\.+|^…|^\'|^"|^”|^“|^`|^‘|^´|^’|^‚|^,|^„|^»|^«|^「|^」|^『|^』|^(|^)|^〔|^〕|^【|^】|^《|^》|^〈|^〉|^\$|^£|^€|^¥|^฿|^US\$|^C\$|^A\$|^₽|^﷼|^₴|^₠|^₡|^₢|^₣|^₤|^₥|^₦|^₧|^₨|^₩|^₪|^₫|^€|^₭|^₮|^₯|^₰|^₱|^₲|^₳|^₴|^₵|^₶|^₷|^₸|^₹|^₺|^₻|^₼|^₽|^₾|^₿|^[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]�suffix_search�2"…$|……$|,$|:$|;$|\!$|\?$|¿$|؟$|¡$|\($|\)$|\[$|\]$|\{$|\}$|<$|>$|_$|#$|\*$|&$|。$|?$|!$|,$|、$|;$|:$|~$|·$|।$|،$|۔$|؛$|٪$|\.\.+$|…$|\'$|"$|”$|“$|`$|‘$|´$|’$|‚$|,$|„$|»$|«$|「$|」$|『$|』$|($|)$|〔$|〕$|【$|】$|《$|》$|〈$|〉$|[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]$|'s$|'S$|’s$|’S$|—$|–$|(?<=[0-9])\+$|(?<=°[FfCcKk])\.$|(?<=[0-9])(?:\$|£|€|¥|฿|US\$|C\$|A\$|₽|﷼|₴|₠|₡|₢|₣|₤|₥|₦|₧|₨|₩|₪|₫|€|₭|₮|₯|₰|₱|₲|₳|₴|₵|₶|₷|₸|₹|₺|₻|₼|₽|₾|₿)$|(?<=[0-9])(?:km|km²|km³|m|m²|m³|dm|dm²|dm³|cm|cm²|cm³|mm|mm²|mm³|ha|µm|nm|yd|in|ft|kg|g|mg|µg|t|lb|oz|m/s|km/h|kmh|mph|hPa|Pa|mbar|mb|MB|kb|KB|gb|GB|tb|TB|T|G|M|K|%|км|км²|км³|м|м²|м³|дм|дм²|дм³|см|см²|см³|мм|мм²|мм³|нм|кг|г|мг|м/с|км/ч|кПа|Па|мбар|Кб|КБ|кб|Мб|МБ|мб|Гб|ГБ|гб|Тб|ТБ|тбكم|كم²|كم³|م|م²|م³|سم|سم²|سم³|مم|مم²|مم³|كم|غرام|جرام|جم|كغ|ملغ|كوب|اكواب)$|(?<=[0-9a-z\uFF41-\uFF5A\u00DF-\u00F6\u00F8-\u00FF\u0101\u0103\u0105\u0107\u0109\u010B\u010D\u010F\u0111\u0113\u0115\u0117\u0119\u011B\u011D\u011F\u0121\u0123\u0125\u0127\u0129\u012B\u012D\u012F\u0131\u0133\u0135\u0137\u0138\u013A\u013C\u013E\u0140\u0142\u0144\u0146\u0148\u0149\u014B\u014D\u014F\u0151\u0153\u0155\u0157\u0159\u015B\u015D\u015F\u0161\u0163\u0165\u0167\u0169\u016B\u016D\u016F\u0171\u0173\u0175\u0177\u017A\u017C\u017E\u017F\u0180\u0183\u0185\u0188\u018C\u018D\u0192\u0195\u0199-\u019B\u019E\u01A1\u01A3\u01A5\u01A8\u01AA\u01AB\u01AD\u01B0\u01B4\u01B6\u01B9\u01BA\u01BD-\u01BF\u01C6\u01C9\u01CC\u01CE\u01D0\u01D2\u01D4\u01D6\u01D8\u01DA\u01DC\u01DD\u01DF\u01E1\u01E3\u01E5\u01E7\u01E9\u01EB\u01ED\u01EF\u01F0\u01F3\u01F5\u01F9\u01FB\u01FD\u01FF\u0201\u0203\u0205\u0207\u0209\u020B\u020D\u020F\u0211\u0213\u0215\u0217\u0219\u021B\u021D\u021F\u0221\u0223\u0225\u0227\u0229\u022B\u022D\u022F\u0231\u0233-\u0239\u023C\u023F\u0240\u0242\u0247\u0249\u024B\u024D\u024F\u2C61\u2C65\u2C66\u2C68\u2C6A\u2C6C\u2C71\u2C73\u2C74\u2C76-\u2C7B\uA723\uA725\uA727\uA729\uA72B\uA72D\uA72F-\uA731\uA733\uA735\uA737\uA739\uA73B\uA73D\uA73F\uA741\uA743\uA745\uA747\uA749\uA74B\uA74D\uA74F\uA751\uA753\uA755\uA757\uA759\uA75B\uA75D\uA75F\uA761\uA763\uA765\uA767\uA769\uA76B\uA76D\uA76F\uA771-\uA778\uA77A\uA77C\uA77F\uA781\uA783\uA785\uA787\uA78C\uA78E\uA791\uA793-\uA795\uA797\uA799\uA79B\uA79D\uA79F\uA7A1\uA7A3\uA7A5\uA7A7\uA7A9\uA7AF\uA7B5\uA7B7\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E01\u1E03\u1E05\u1E07\u1E09\u1E0B\u1E0D\u1E0F\u1E11\u1E13\u1E15\u1E17\u1E19\u1E1B\u1E1D\u1E1F\u1E21\u1E23\u1E25\u1E27\u1E29\u1E2B\u1E2D\u1E2F\u1E31\u1E33\u1E35\u1E37\u1E39\u1E3B\u1E3D\u1E3F\u1E41\u1E43\u1E45\u1E47\u1E49\u1E4B\u1E4D\u1E4F\u1E51\u1E53\u1E55\u1E57\u1E59\u1E5B\u1E5D\u1E5F\u1E61\u1E63\u1E65\u1E67\u1E69\u1E6B\u1E6D\u1E6F\u1E71\u1E73\u1E75\u1E77\u1E79\u1E7B\u1E7D\u1E7F\u1E81\u1E83\u1E85\u1E87\u1E89\u1E8B\u1E8D\u1E8F\u1E91\u1E93\u1E95-\u1E9D\u1E9F\u1EA1\u1EA3\u1EA5\u1EA7\u1EA9\u1EAB\u1EAD\u1EAF\u1EB1\u1EB3\u1EB5\u1EB7\u1EB9\u1EBB\u1EBD\u1EBF\u1EC1\u1EC3\u1EC5\u1EC7\u1EC9\u1ECB\u1ECD\u1ECF\u1ED1\u1ED3\u1ED5\u1ED7\u1ED9\u1EDB\u1EDD\u1EDF\u1EE1\u1EE3\u1EE5\u1EE7\u1EE9\u1EEB\u1EED\u1EEF\u1EF1\u1EF3\u1EF5\u1EF7\u1EF9\u1EFB\u1EFD\u1EFFёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґѓѕјљњќѐѝ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F%²\-\+…|……|,|:|;|\!|\?|¿|؟|¡|\(|\)|\[|\]|\{|\}|<|>|_|#|\*|&|。|?|!|,|、|;|:|~|·|।|،|۔|؛|٪(?:\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉)])\.$|(?<=[A-Z\uFF21-\uFF3A\u00C0-\u00D6\u00D8-\u00DE\u0100\u0102\u0104\u0106\u0108\u010A\u010C\u010E\u0110\u0112\u0114\u0116\u0118\u011A\u011C\u011E\u0120\u0122\u0124\u0126\u0128\u012A\u012C\u012E\u0130\u0132\u0134\u0136\u0139\u013B\u013D\u013F\u0141\u0143\u0145\u0147\u014A\u014C\u014E\u0150\u0152\u0154\u0156\u0158\u015A\u015C\u015E\u0160\u0162\u0164\u0166\u0168\u016A\u016C\u016E\u0170\u0172\u0174\u0176\u0178\u0179\u017B\u017D\u0181\u0182\u0184\u0186\u0187\u0189-\u018B\u018E-\u0191\u0193\u0194\u0196-\u0198\u019C\u019D\u019F\u01A0\u01A2\u01A4\u01A6\u01A7\u01A9\u01AC\u01AE\u01AF\u01B1-\u01B3\u01B5\u01B7\u01B8\u01BC\u01C4\u01C7\u01CA\u01CD\u01CF\u01D1\u01D3\u01D5\u01D7\u01D9\u01DB\u01DE\u01E0\u01E2\u01E4\u01E6\u01E8\u01EA\u01EC\u01EE\u01F1\u01F4\u01F6-\u01F8\u01FA\u01FC\u01FE\u0200\u0202\u0204\u0206\u0208\u020A\u020C\u020E\u0210\u0212\u0214\u0216\u0218\u021A\u021C\u021E\u0220\u0222\u0224\u0226\u0228\u022A\u022C\u022E\u0230\u0232\u023A\u023B\u023D\u023E\u0241\u0243-\u0246\u0248\u024A\u024C\u024E\u2C60\u2C62-\u2C64\u2C67\u2C69\u2C6B\u2C6D-\u2C70\u2C72\u2C75\u2C7E\u2C7F\uA722\uA724\uA726\uA728\uA72A\uA72C\uA72E\uA732\uA734\uA736\uA738\uA73A\uA73C\uA73E\uA740\uA742\uA744\uA746\uA748\uA74A\uA74C\uA74E\uA750\uA752\uA754\uA756\uA758\uA75A\uA75C\uA75E\uA760\uA762\uA764\uA766\uA768\uA76A\uA76C\uA76E\uA779\uA77B\uA77D\uA77E\uA780\uA782\uA784\uA786\uA78B\uA78D\uA790\uA792\uA796\uA798\uA79A\uA79C\uA79E\uA7A0\uA7A2\uA7A4\uA7A6\uA7A8\uA7AA-\uA7AE\uA7B0-\uA7B4\uA7B6\uA7B8\u1E00\u1E02\u1E04\u1E06\u1E08\u1E0A\u1E0C\u1E0E\u1E10\u1E12\u1E14\u1E16\u1E18\u1E1A\u1E1C\u1E1E\u1E20\u1E22\u1E24\u1E26\u1E28\u1E2A\u1E2C\u1E2E\u1E30\u1E32\u1E34\u1E36\u1E38\u1E3A\u1E3C\u1E3E\u1E40\u1E42\u1E44\u1E46\u1E48\u1E4A\u1E4C\u1E4E\u1E50\u1E52\u1E54\u1E56\u1E58\u1E5A\u1E5C\u1E5E\u1E60\u1E62\u1E64\u1E66\u1E68\u1E6A\u1E6C\u1E6E\u1E70\u1E72\u1E74\u1E76\u1E78\u1E7A\u1E7C\u1E7E\u1E80\u1E82\u1E84\u1E86\u1E88\u1E8A\u1E8C\u1E8E\u1E90\u1E92\u1E94\u1E9E\u1EA0\u1EA2\u1EA4\u1EA6\u1EA8\u1EAA\u1EAC\u1EAE\u1EB0\u1EB2\u1EB4\u1EB6\u1EB8\u1EBA\u1EBC\u1EBE\u1EC0\u1EC2\u1EC4\u1EC6\u1EC8\u1ECA\u1ECC\u1ECE\u1ED0\u1ED2\u1ED4\u1ED6\u1ED8\u1EDA\u1EDC\u1EDE\u1EE0\u1EE2\u1EE4\u1EE6\u1EE8\u1EEA\u1EEC\u1EEE\u1EF0\u1EF2\u1EF4\u1EF6\u1EF8\u1EFA\u1EFC\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F][A-Z\uFF21-\uFF3A\u00C0-\u00D6\u00D8-\u00DE\u0100\u0102\u0104\u0106\u0108\u010A\u010C\u010E\u0110\u0112\u0114\u0116\u0118\u011A\u011C\u011E\u0120\u0122\u0124\u0126\u0128\u012A\u012C\u012E\u0130\u0132\u0134\u0136\u0139\u013B\u013D\u013F\u0141\u0143\u0145\u0147\u014A\u014C\u014E\u0150\u0152\u0154\u0156\u0158\u015A\u015C\u015E\u0160\u0162\u0164\u0166\u0168\u016A\u016C\u016E\u0170\u0172\u0174\u0176\u0178\u0179\u017B\u017D\u0181\u0182\u0184\u0186\u0187\u0189-\u018B\u018E-\u0191\u0193\u0194\u0196-\u0198\u019C\u019D\u019F\u01A0\u01A2\u01A4\u01A6\u01A7\u01A9\u01AC\u01AE\u01AF\u01B1-\u01B3\u01B5\u01B7\u01B8\u01BC\u01C4\u01C7\u01CA\u01CD\u01CF\u01D1\u01D3\u01D5\u01D7\u01D9\u01DB\u01DE\u01E0\u01E2\u01E4\u01E6\u01E8\u01EA\u01EC\u01EE\u01F1\u01F4\u01F6-\u01F8\u01FA\u01FC\u01FE\u0200\u0202\u0204\u0206\u0208\u020A\u020C\u020E\u0210\u0212\u0214\u0216\u0218\u021A\u021C\u021E\u0220\u0222\u0224\u0226\u0228\u022A\u022C\u022E\u0230\u0232\u023A\u023B\u023D\u023E\u0241\u0243-\u0246\u0248\u024A\u024C\u024E\u2C60\u2C62-\u2C64\u2C67\u2C69\u2C6B\u2C6D-\u2C70\u2C72\u2C75\u2C7E\u2C7F\uA722\uA724\uA726\uA728\uA72A\uA72C\uA72E\uA732\uA734\uA736\uA738\uA73A\uA73C\uA73E\uA740\uA742\uA744\uA746\uA748\uA74A\uA74C\uA74E\uA750\uA752\uA754\uA756\uA758\uA75A\uA75C\uA75E\uA760\uA762\uA764\uA766\uA768\uA76A\uA76C\uA76E\uA779\uA77B\uA77D\uA77E\uA780\uA782\uA784\uA786\uA78B\uA78D\uA790\uA792\uA796\uA798\uA79A\uA79C\uA79E\uA7A0\uA7A2\uA7A4\uA7A6\uA7A8\uA7AA-\uA7AE\uA7B0-\uA7B4\uA7B6\uA7B8\u1E00\u1E02\u1E04\u1E06\u1E08\u1E0A\u1E0C\u1E0E\u1E10\u1E12\u1E14\u1E16\u1E18\u1E1A\u1E1C\u1E1E\u1E20\u1E22\u1E24\u1E26\u1E28\u1E2A\u1E2C\u1E2E\u1E30\u1E32\u1E34\u1E36\u1E38\u1E3A\u1E3C\u1E3E\u1E40\u1E42\u1E44\u1E46\u1E48\u1E4A\u1E4C\u1E4E\u1E50\u1E52\u1E54\u1E56\u1E58\u1E5A\u1E5C\u1E5E\u1E60\u1E62\u1E64\u1E66\u1E68\u1E6A\u1E6C\u1E6E\u1E70\u1E72\u1E74\u1E76\u1E78\u1E7A\u1E7C\u1E7E\u1E80\u1E82\u1E84\u1E86\u1E88\u1E8A\u1E8C\u1E8E\u1E90\u1E92\u1E94\u1E9E\u1EA0\u1EA2\u1EA4\u1EA6\u1EA8\u1EAA\u1EAC\u1EAE\u1EB0\u1EB2\u1EB4\u1EB6\u1EB8\u1EBA\u1EBC\u1EBE\u1EC0\u1EC2\u1EC4\u1EC6\u1EC8\u1ECA\u1ECC\u1ECE\u1ED0\u1ED2\u1ED4\u1ED6\u1ED8\u1EDA\u1EDC\u1EDE\u1EE0\u1EE2\u1EE4\u1EE6\u1EE8\u1EEA\u1EEC\u1EEE\u1EF0\u1EF2\u1EF4\u1EF6\u1EF8\u1EFA\u1EFC\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])\.$�infix_finditer�=�\.\.+|…|[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]|(?<=[0-9])[+\-\*^](?=[0-9-])|(?<=[a-z\uFF41-\uFF5A\u00DF-\u00F6\u00F8-\u00FF\u0101\u0103\u0105\u0107\u0109\u010B\u010D\u010F\u0111\u0113\u0115\u0117\u0119\u011B\u011D\u011F\u0121\u0123\u0125\u0127\u0129\u012B\u012D\u012F\u0131\u0133\u0135\u0137\u0138\u013A\u013C\u013E\u0140\u0142\u0144\u0146\u0148\u0149\u014B\u014D\u014F\u0151\u0153\u0155\u0157\u0159\u015B\u015D\u015F\u0161\u0163\u0165\u0167\u0169\u016B\u016D\u016F\u0171\u0173\u0175\u0177\u017A\u017C\u017E\u017F\u0180\u0183\u0185\u0188\u018C\u018D\u0192\u0195\u0199-\u019B\u019E\u01A1\u01A3\u01A5\u01A8\u01AA\u01AB\u01AD\u01B0\u01B4\u01B6\u01B9\u01BA\u01BD-\u01BF\u01C6\u01C9\u01CC\u01CE\u01D0\u01D2\u01D4\u01D6\u01D8\u01DA\u01DC\u01DD\u01DF\u01E1\u01E3\u01E5\u01E7\u01E9\u01EB\u01ED\u01EF\u01F0\u01F3\u01F5\u01F9\u01FB\u01FD\u01FF\u0201\u0203\u0205\u0207\u0209\u020B\u020D\u020F\u0211\u0213\u0215\u0217\u0219\u021B\u021D\u021F\u0221\u0223\u0225\u0227\u0229\u022B\u022D\u022F\u0231\u0233-\u0239\u023C\u023F\u0240\u0242\u0247\u0249\u024B\u024D\u024F\u2C61\u2C65\u2C66\u2C68\u2C6A\u2C6C\u2C71\u2C73\u2C74\u2C76-\u2C7B\uA723\uA725\uA727\uA729\uA72B\uA72D\uA72F-\uA731\uA733\uA735\uA737\uA739\uA73B\uA73D\uA73F\uA741\uA743\uA745\uA747\uA749\uA74B\uA74D\uA74F\uA751\uA753\uA755\uA757\uA759\uA75B\uA75D\uA75F\uA761\uA763\uA765\uA767\uA769\uA76B\uA76D\uA76F\uA771-\uA778\uA77A\uA77C\uA77F\uA781\uA783\uA785\uA787\uA78C\uA78E\uA791\uA793-\uA795\uA797\uA799\uA79B\uA79D\uA79F\uA7A1\uA7A3\uA7A5\uA7A7\uA7A9\uA7AF\uA7B5\uA7B7\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E01\u1E03\u1E05\u1E07\u1E09\u1E0B\u1E0D\u1E0F\u1E11\u1E13\u1E15\u1E17\u1E19\u1E1B\u1E1D\u1E1F\u1E21\u1E23\u1E25\u1E27\u1E29\u1E2B\u1E2D\u1E2F\u1E31\u1E33\u1E35\u1E37\u1E39\u1E3B\u1E3D\u1E3F\u1E41\u1E43\u1E45\u1E47\u1E49\u1E4B\u1E4D\u1E4F\u1E51\u1E53\u1E55\u1E57\u1E59\u1E5B\u1E5D\u1E5F\u1E61\u1E63\u1E65\u1E67\u1E69\u1E6B\u1E6D\u1E6F\u1E71\u1E73\u1E75\u1E77\u1E79\u1E7B\u1E7D\u1E7F\u1E81\u1E83\u1E85\u1E87\u1E89\u1E8B\u1E8D\u1E8F\u1E91\u1E93\u1E95-\u1E9D\u1E9F\u1EA1\u1EA3\u1EA5\u1EA7\u1EA9\u1EAB\u1EAD\u1EAF\u1EB1\u1EB3\u1EB5\u1EB7\u1EB9\u1EBB\u1EBD\u1EBF\u1EC1\u1EC3\u1EC5\u1EC7\u1EC9\u1ECB\u1ECD\u1ECF\u1ED1\u1ED3\u1ED5\u1ED7\u1ED9\u1EDB\u1EDD\u1EDF\u1EE1\u1EE3\u1EE5\u1EE7\u1EE9\u1EEB\u1EED\u1EEF\u1EF1\u1EF3\u1EF5\u1EF7\u1EF9\u1EFB\u1EFD\u1EFFёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґѓѕјљњќѐѝ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉])\.(?=[A-Z\uFF21-\uFF3A\u00C0-\u00D6\u00D8-\u00DE\u0100\u0102\u0104\u0106\u0108\u010A\u010C\u010E\u0110\u0112\u0114\u0116\u0118\u011A\u011C\u011E\u0120\u0122\u0124\u0126\u0128\u012A\u012C\u012E\u0130\u0132\u0134\u0136\u0139\u013B\u013D\u013F\u0141\u0143\u0145\u0147\u014A\u014C\u014E\u0150\u0152\u0154\u0156\u0158\u015A\u015C\u015E\u0160\u0162\u0164\u0166\u0168\u016A\u016C\u016E\u0170\u0172\u0174\u0176\u0178\u0179\u017B\u017D\u0181\u0182\u0184\u0186\u0187\u0189-\u018B\u018E-\u0191\u0193\u0194\u0196-\u0198\u019C\u019D\u019F\u01A0\u01A2\u01A4\u01A6\u01A7\u01A9\u01AC\u01AE\u01AF\u01B1-\u01B3\u01B5\u01B7\u01B8\u01BC\u01C4\u01C7\u01CA\u01CD\u01CF\u01D1\u01D3\u01D5\u01D7\u01D9\u01DB\u01DE\u01E0\u01E2\u01E4\u01E6\u01E8\u01EA\u01EC\u01EE\u01F1\u01F4\u01F6-\u01F8\u01FA\u01FC\u01FE\u0200\u0202\u0204\u0206\u0208\u020A\u020C\u020E\u0210\u0212\u0214\u0216\u0218\u021A\u021C\u021E\u0220\u0222\u0224\u0226\u0228\u022A\u022C\u022E\u0230\u0232\u023A\u023B\u023D\u023E\u0241\u0243-\u0246\u0248\u024A\u024C\u024E\u2C60\u2C62-\u2C64\u2C67\u2C69\u2C6B\u2C6D-\u2C70\u2C72\u2C75\u2C7E\u2C7F\uA722\uA724\uA726\uA728\uA72A\uA72C\uA72E\uA732\uA734\uA736\uA738\uA73A\uA73C\uA73E\uA740\uA742\uA744\uA746\uA748\uA74A\uA74C\uA74E\uA750\uA752\uA754\uA756\uA758\uA75A\uA75C\uA75E\uA760\uA762\uA764\uA766\uA768\uA76A\uA76C\uA76E\uA779\uA77B\uA77D\uA77E\uA780\uA782\uA784\uA786\uA78B\uA78D\uA790\uA792\uA796\uA798\uA79A\uA79C\uA79E\uA7A0\uA7A2\uA7A4\uA7A6\uA7A8\uA7AA-\uA7AE\uA7B0-\uA7B4\uA7B6\uA7B8\u1E00\u1E02\u1E04\u1E06\u1E08\u1E0A\u1E0C\u1E0E\u1E10\u1E12\u1E14\u1E16\u1E18\u1E1A\u1E1C\u1E1E\u1E20\u1E22\u1E24\u1E26\u1E28\u1E2A\u1E2C\u1E2E\u1E30\u1E32\u1E34\u1E36\u1E38\u1E3A\u1E3C\u1E3E\u1E40\u1E42\u1E44\u1E46\u1E48\u1E4A\u1E4C\u1E4E\u1E50\u1E52\u1E54\u1E56\u1E58\u1E5A\u1E5C\u1E5E\u1E60\u1E62\u1E64\u1E66\u1E68\u1E6A\u1E6C\u1E6E\u1E70\u1E72\u1E74\u1E76\u1E78\u1E7A\u1E7C\u1E7E\u1E80\u1E82\u1E84\u1E86\u1E88\u1E8A\u1E8C\u1E8E\u1E90\u1E92\u1E94\u1E9E\u1EA0\u1EA2\u1EA4\u1EA6\u1EA8\u1EAA\u1EAC\u1EAE\u1EB0\u1EB2\u1EB4\u1EB6\u1EB8\u1EBA\u1EBC\u1EBE\u1EC0\u1EC2\u1EC4\u1EC6\u1EC8\u1ECA\u1ECC\u1ECE\u1ED0\u1ED2\u1ED4\u1ED6\u1ED8\u1EDA\u1EDC\u1EDE\u1EE0\u1EE2\u1EE4\u1EE6\u1EE8\u1EEA\u1EEC\u1EEE\u1EF0\u1EF2\u1EF4\u1EF6\u1EF8\u1EFA\u1EFC\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F]),(?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])(?:-|–|—|--|---|——|~)(?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F0-9])[:<>=/](?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])�token_match��url_match�
2
+ ��A�
3
+ � ��A� �'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�._.��A�._.�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�8)��A�8)�8-)��A�8-)�8-D��A�8-D�8D��A�8D�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=[��A�=[�=]��A�=]�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o��A�>:o�><(((*>��A�><(((*>�@_@��A�@_@�C++��A�C++�O.O��A�O.O�O.o��A�O.o�O_O��A�O_O�O_o��A�O_o�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�[=��A�[=�\")��A�\")�\n��A�\n�\t��A�\t�]=��A�]=�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�a.��A�a.�b.��A�b.�c.��A�c.�d.��A�d.�e.��A�e.�f.��A�f.�g.��A�g.�h.��A�h.�i.��A�i.�j.��A�j.�k.��A�k.�l.��A�l.�m.��A�m.�n.��A�n.�o.��A�o.�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�p.��A�p.�q.��A�q.�r.��A�r.�s.��A�s.�t.��A�t.�u.��A�u.�v.��A�v.�v.v��A�v.v�v_v��A�v_v�w.��A�w.�x.��A�x.�xD��A�xD�xDD��A�xDD�y.��A�y.�z.��A�z.� ��A� C� �¯\(ツ)/¯��A�¯\(ツ)/¯�ä.��A�ä.�ö.��A�ö.�ü.��A�ü.�арх.��A�арх.C�архитект�бел.��A�бел.C�белешка�бот.��A�бот.C�ботаника�бр.��A�бр.C�број�в.��A�в.C�век�в.д.��A�в.д.C�$вршител на должност�г��A�гC�грам�г-дин��A�г-динC�господин�г-ца��A�г-цаC�госпоѓица�г-ѓа��A�г-ѓаC�госпоѓа�г.��A�г.C�година�г.г.��A�г.г.C�!господин господин�геогр.��A�геогр.C�географија�гимн.��A�гимн.C�гимназија�год.��A�год.C�женски род�гр.��A�гр.C�град�д-р��A�д-рC�доктор�дг��A�дгC�дециграм�ден.��A�ден.C�денар�дкг��A�дкгC�декаграм�дкл��A�дклC�декалитар�дл��A�длC�децилитар�дм��A�дмC�дециметар�др.��A�др.C�другар�и др.��A�и др.C�и друго�и сл.��A�и сл.C�и слично�инж.��A�инж.C�инженер�истор.��A�истор.C�историја�кг��A�кгC�килограм�кл��A�клC�килолитар�км��A�кмC�километар�кн.��A�кн.C�книга�л��A�лC�литар�литер.��A�литер.C�литература�м��A�мC�метар�м-р��A�м-рC�магистер�м.р.��A�м.р.C�машки род�мат.��A�мат.C�математика�мг��A�мгC�милиграм�мед.��A�мед.C�медицина�мм��A�ммC�милиметар�мн.��A�мн.C�множина�н.е.��A�н.е.C�наша ера�на пр.��A�на пр.C�на пример�о.г.��A�о.г.C�оваа година�о.м.��A�о.м.C�овој месец�пр. н.��A�пр. н.C�природни науки�прид.��A�прид.C�придавка�прил.��A�прил.C�прилог�проф.��A�проф.C�професор�с.��A�с.C�страница�с.р.��A�с.р.C�среден род�св.��A�св.C�свети�сврз.��A�сврз.C�сврзник�см��A�смC�сантиметар�сп.��A�сп.C�списание�стр.��A�стр.C�страница�студ.��A�студ.C�студент�т��A�тC�тон�т.��A�т.C�точка�т.е.��A�т.е.C�то ест�т.н.��A�т.н.C�таканаречен�ул.��A�ул.C�улица�физ.��A�физ.C�физика�хем.��A�хем.C�хемија�хл��A�хлC�хектолитар�цм��A�цмC�центиметар�чл.��A�чл.C�член�ಠ_ಠ��A�ಠ_ಠ�ಠ︵ಠ��A�ಠ︵ಠ�—��A�—�’��A�’�’’��A�’’
vocab/key2row ADDED
@@ -0,0 +1 @@
 
1
+
vocab/lookups.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6774f603b329c9d7be9d34311c95377d824a77b2cf498cfe2e609bca3adadcb4
3
+ size 2274
vocab/strings.json ADDED
@@ -0,0 +1,3 @@
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fbe1637b8b0430ce0596780a1871fe79bcf075051d0d4783777aa55c6cc97d89
3
+ size 1409283
vocab/vectors ADDED
@@ -0,0 +1,3 @@
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:14772b683e726436d5948ad3fff2b43d036ef2ebbe3458aafed6004e05a40706
3
+ size 128