osanseviero committed on
Commit
cfbc9d4
1 Parent(s): 761c1da

Update spaCy pipeline

.gitattributes CHANGED
@@ -14,3 +14,7 @@
  *.pb filter=lfs diff=lfs merge=lfs -text
  *.pt filter=lfs diff=lfs merge=lfs -text
  *.pth filter=lfs diff=lfs merge=lfs -text
+ *.whl filter=lfs diff=lfs merge=lfs -text
+ *.npz filter=lfs diff=lfs merge=lfs -text
+ *strings.json filter=lfs diff=lfs merge=lfs -text
+ vectors filter=lfs diff=lfs merge=lfs -text
LICENSE ADDED
@@ -0,0 +1,428 @@
+ Attribution-ShareAlike 4.0 International
+
+ =======================================================================
+
+ Creative Commons Corporation ("Creative Commons") is not a law firm and
+ does not provide legal services or legal advice. Distribution of
+ Creative Commons public licenses does not create a lawyer-client or
+ other relationship. Creative Commons makes its licenses and related
+ information available on an "as-is" basis. Creative Commons gives no
+ warranties regarding its licenses, any material licensed under their
+ terms and conditions, or any related information. Creative Commons
+ disclaims all liability for damages resulting from their use to the
+ fullest extent possible.
+
+ Using Creative Commons Public Licenses
+
+ Creative Commons public licenses provide a standard set of terms and
+ conditions that creators and other rights holders may use to share
+ original works of authorship and other material subject to copyright
+ and certain other rights specified in the public license below. The
+ following considerations are for informational purposes only, are not
+ exhaustive, and do not form part of our licenses.
+
+ Considerations for licensors: Our public licenses are
+ intended for use by those authorized to give the public
+ permission to use material in ways otherwise restricted by
+ copyright and certain other rights. Our licenses are
+ irrevocable. Licensors should read and understand the terms
+ and conditions of the license they choose before applying it.
+ Licensors should also secure all rights necessary before
+ applying our licenses so that the public can reuse the
+ material as expected. Licensors should clearly mark any
+ material not subject to the license. This includes other CC-
+ licensed material, or material used under an exception or
+ limitation to copyright. More considerations for licensors:
+ wiki.creativecommons.org/Considerations_for_licensors
+
+ Considerations for the public: By using one of our public
+ licenses, a licensor grants the public permission to use the
+ licensed material under specified terms and conditions. If
+ the licensor's permission is not necessary for any reason--for
+ example, because of any applicable exception or limitation to
+ copyright--then that use is not regulated by the license. Our
+ licenses grant only permissions under copyright and certain
+ other rights that a licensor has authority to grant. Use of
+ the licensed material may still be restricted for other
+ reasons, including because others have copyright or other
+ rights in the material. A licensor may make special requests,
+ such as asking that all changes be marked or described.
+ Although not required by our licenses, you are encouraged to
+ respect those requests where reasonable. More considerations
+ for the public:
+ wiki.creativecommons.org/Considerations_for_licensees
+
+ =======================================================================
+
+ Creative Commons Attribution-ShareAlike 4.0 International Public
+ License
+
+ By exercising the Licensed Rights (defined below), You accept and agree
+ to be bound by the terms and conditions of this Creative Commons
+ Attribution-ShareAlike 4.0 International Public License ("Public
+ License"). To the extent this Public License may be interpreted as a
+ contract, You are granted the Licensed Rights in consideration of Your
+ acceptance of these terms and conditions, and the Licensor grants You
+ such rights in consideration of benefits the Licensor receives from
+ making the Licensed Material available under these terms and
+ conditions.
+
+
+ Section 1 -- Definitions.
+
+ a. Adapted Material means material subject to Copyright and Similar
+ Rights that is derived from or based upon the Licensed Material
+ and in which the Licensed Material is translated, altered,
+ arranged, transformed, or otherwise modified in a manner requiring
+ permission under the Copyright and Similar Rights held by the
+ Licensor. For purposes of this Public License, where the Licensed
+ Material is a musical work, performance, or sound recording,
+ Adapted Material is always produced where the Licensed Material is
+ synched in timed relation with a moving image.
+
+ b. Adapter's License means the license You apply to Your Copyright
+ and Similar Rights in Your contributions to Adapted Material in
+ accordance with the terms and conditions of this Public License.
+
+ c. BY-SA Compatible License means a license listed at
+ creativecommons.org/compatiblelicenses, approved by Creative
+ Commons as essentially the equivalent of this Public License.
+
+ d. Copyright and Similar Rights means copyright and/or similar rights
+ closely related to copyright including, without limitation,
+ performance, broadcast, sound recording, and Sui Generis Database
+ Rights, without regard to how the rights are labeled or
+ categorized. For purposes of this Public License, the rights
+ specified in Section 2(b)(1)-(2) are not Copyright and Similar
+ Rights.
+
+ e. Effective Technological Measures means those measures that, in the
+ absence of proper authority, may not be circumvented under laws
+ fulfilling obligations under Article 11 of the WIPO Copyright
+ Treaty adopted on December 20, 1996, and/or similar international
+ agreements.
+
+ f. Exceptions and Limitations means fair use, fair dealing, and/or
+ any other exception or limitation to Copyright and Similar Rights
+ that applies to Your use of the Licensed Material.
+
+ g. License Elements means the license attributes listed in the name
+ of a Creative Commons Public License. The License Elements of this
+ Public License are Attribution and ShareAlike.
+
+ h. Licensed Material means the artistic or literary work, database,
+ or other material to which the Licensor applied this Public
+ License.
+
+ i. Licensed Rights means the rights granted to You subject to the
+ terms and conditions of this Public License, which are limited to
+ all Copyright and Similar Rights that apply to Your use of the
+ Licensed Material and that the Licensor has authority to license.
+
+ j. Licensor means the individual(s) or entity(ies) granting rights
+ under this Public License.
+
+ k. Share means to provide material to the public by any means or
+ process that requires permission under the Licensed Rights, such
+ as reproduction, public display, public performance, distribution,
+ dissemination, communication, or importation, and to make material
+ available to the public including in ways that members of the
+ public may access the material from a place and at a time
+ individually chosen by them.
+
+ l. Sui Generis Database Rights means rights other than copyright
+ resulting from Directive 96/9/EC of the European Parliament and of
+ the Council of 11 March 1996 on the legal protection of databases,
+ as amended and/or succeeded, as well as other essentially
+ equivalent rights anywhere in the world.
+
+ m. You means the individual or entity exercising the Licensed Rights
+ under this Public License. Your has a corresponding meaning.
+
+
+ Section 2 -- Scope.
+
+ a. License grant.
+
+ 1. Subject to the terms and conditions of this Public License,
+ the Licensor hereby grants You a worldwide, royalty-free,
+ non-sublicensable, non-exclusive, irrevocable license to
+ exercise the Licensed Rights in the Licensed Material to:
+
+ a. reproduce and Share the Licensed Material, in whole or
+ in part; and
+
+ b. produce, reproduce, and Share Adapted Material.
+
+ 2. Exceptions and Limitations. For the avoidance of doubt, where
+ Exceptions and Limitations apply to Your use, this Public
+ License does not apply, and You do not need to comply with
+ its terms and conditions.
+
+ 3. Term. The term of this Public License is specified in Section
+ 6(a).
+
+ 4. Media and formats; technical modifications allowed. The
+ Licensor authorizes You to exercise the Licensed Rights in
+ all media and formats whether now known or hereafter created,
+ and to make technical modifications necessary to do so. The
+ Licensor waives and/or agrees not to assert any right or
+ authority to forbid You from making technical modifications
+ necessary to exercise the Licensed Rights, including
+ technical modifications necessary to circumvent Effective
+ Technological Measures. For purposes of this Public License,
+ simply making modifications authorized by this Section 2(a)
+ (4) never produces Adapted Material.
+
+ 5. Downstream recipients.
+
+ a. Offer from the Licensor -- Licensed Material. Every
+ recipient of the Licensed Material automatically
+ receives an offer from the Licensor to exercise the
+ Licensed Rights under the terms and conditions of this
+ Public License.
+
+ b. Additional offer from the Licensor -- Adapted Material.
+ Every recipient of Adapted Material from You
+ automatically receives an offer from the Licensor to
+ exercise the Licensed Rights in the Adapted Material
+ under the conditions of the Adapter's License You apply.
+
+ c. No downstream restrictions. You may not offer or impose
+ any additional or different terms or conditions on, or
+ apply any Effective Technological Measures to, the
+ Licensed Material if doing so restricts exercise of the
+ Licensed Rights by any recipient of the Licensed
+ Material.
+
+ 6. No endorsement. Nothing in this Public License constitutes or
+ may be construed as permission to assert or imply that You
+ are, or that Your use of the Licensed Material is, connected
+ with, or sponsored, endorsed, or granted official status by,
+ the Licensor or others designated to receive attribution as
+ provided in Section 3(a)(1)(A)(i).
+
+ b. Other rights.
+
+ 1. Moral rights, such as the right of integrity, are not
+ licensed under this Public License, nor are publicity,
+ privacy, and/or other similar personality rights; however, to
+ the extent possible, the Licensor waives and/or agrees not to
+ assert any such rights held by the Licensor to the limited
+ extent necessary to allow You to exercise the Licensed
+ Rights, but not otherwise.
+
+ 2. Patent and trademark rights are not licensed under this
+ Public License.
+
+ 3. To the extent possible, the Licensor waives any right to
+ collect royalties from You for the exercise of the Licensed
+ Rights, whether directly or through a collecting society
+ under any voluntary or waivable statutory or compulsory
+ licensing scheme. In all other cases the Licensor expressly
+ reserves any right to collect such royalties.
+
+
+ Section 3 -- License Conditions.
+
+ Your exercise of the Licensed Rights is expressly made subject to the
+ following conditions.
+
+ a. Attribution.
+
+ 1. If You Share the Licensed Material (including in modified
+ form), You must:
+
+ a. retain the following if it is supplied by the Licensor
+ with the Licensed Material:
+
+ i. identification of the creator(s) of the Licensed
+ Material and any others designated to receive
+ attribution, in any reasonable manner requested by
+ the Licensor (including by pseudonym if
+ designated);
+
+ ii. a copyright notice;
+
+ iii. a notice that refers to this Public License;
+
+ iv. a notice that refers to the disclaimer of
+ warranties;
+
+ v. a URI or hyperlink to the Licensed Material to the
+ extent reasonably practicable;
+
+ b. indicate if You modified the Licensed Material and
+ retain an indication of any previous modifications; and
+
+ c. indicate the Licensed Material is licensed under this
+ Public License, and include the text of, or the URI or
+ hyperlink to, this Public License.
+
+ 2. You may satisfy the conditions in Section 3(a)(1) in any
+ reasonable manner based on the medium, means, and context in
+ which You Share the Licensed Material. For example, it may be
+ reasonable to satisfy the conditions by providing a URI or
+ hyperlink to a resource that includes the required
+ information.
+
+ 3. If requested by the Licensor, You must remove any of the
+ information required by Section 3(a)(1)(A) to the extent
+ reasonably practicable.
+
+ b. ShareAlike.
+
+ In addition to the conditions in Section 3(a), if You Share
+ Adapted Material You produce, the following conditions also apply.
+
+ 1. The Adapter's License You apply must be a Creative Commons
+ license with the same License Elements, this version or
+ later, or a BY-SA Compatible License.
+
+ 2. You must include the text of, or the URI or hyperlink to, the
+ Adapter's License You apply. You may satisfy this condition
+ in any reasonable manner based on the medium, means, and
+ context in which You Share Adapted Material.
+
+ 3. You may not offer or impose any additional or different terms
+ or conditions on, or apply any Effective Technological
+ Measures to, Adapted Material that restrict exercise of the
+ rights granted under the Adapter's License You apply.
+
+
+ Section 4 -- Sui Generis Database Rights.
+
+ Where the Licensed Rights include Sui Generis Database Rights that
+ apply to Your use of the Licensed Material:
+
+ a. for the avoidance of doubt, Section 2(a)(1) grants You the right
+ to extract, reuse, reproduce, and Share all or a substantial
+ portion of the contents of the database;
+
+ b. if You include all or a substantial portion of the database
+ contents in a database in which You have Sui Generis Database
+ Rights, then the database in which You have Sui Generis Database
+ Rights (but not its individual contents) is Adapted Material,
+
+ including for purposes of Section 3(b); and
+ c. You must comply with the conditions in Section 3(a) if You Share
+ all or a substantial portion of the contents of the database.
+
+ For the avoidance of doubt, this Section 4 supplements and does not
+ replace Your obligations under this Public License where the Licensed
+ Rights include other Copyright and Similar Rights.
+
+
+ Section 5 -- Disclaimer of Warranties and Limitation of Liability.
+
+ a. UNLESS OTHERWISE SEPARATELY UNDERTAKEN BY THE LICENSOR, TO THE
+ EXTENT POSSIBLE, THE LICENSOR OFFERS THE LICENSED MATERIAL AS-IS
+ AND AS-AVAILABLE, AND MAKES NO REPRESENTATIONS OR WARRANTIES OF
+ ANY KIND CONCERNING THE LICENSED MATERIAL, WHETHER EXPRESS,
+ IMPLIED, STATUTORY, OR OTHER. THIS INCLUDES, WITHOUT LIMITATION,
+ WARRANTIES OF TITLE, MERCHANTABILITY, FITNESS FOR A PARTICULAR
+ PURPOSE, NON-INFRINGEMENT, ABSENCE OF LATENT OR OTHER DEFECTS,
+ ACCURACY, OR THE PRESENCE OR ABSENCE OF ERRORS, WHETHER OR NOT
+ KNOWN OR DISCOVERABLE. WHERE DISCLAIMERS OF WARRANTIES ARE NOT
+ ALLOWED IN FULL OR IN PART, THIS DISCLAIMER MAY NOT APPLY TO YOU.
+
+ b. TO THE EXTENT POSSIBLE, IN NO EVENT WILL THE LICENSOR BE LIABLE
+ TO YOU ON ANY LEGAL THEORY (INCLUDING, WITHOUT LIMITATION,
+ NEGLIGENCE) OR OTHERWISE FOR ANY DIRECT, SPECIAL, INDIRECT,
+ INCIDENTAL, CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES,
+ COSTS, EXPENSES, OR DAMAGES ARISING OUT OF THIS PUBLIC LICENSE OR
+ USE OF THE LICENSED MATERIAL, EVEN IF THE LICENSOR HAS BEEN
+ ADVISED OF THE POSSIBILITY OF SUCH LOSSES, COSTS, EXPENSES, OR
+ DAMAGES. WHERE A LIMITATION OF LIABILITY IS NOT ALLOWED IN FULL OR
+ IN PART, THIS LIMITATION MAY NOT APPLY TO YOU.
+
+ c. The disclaimer of warranties and limitation of liability provided
+ above shall be interpreted in a manner that, to the extent
+ possible, most closely approximates an absolute disclaimer and
+ waiver of all liability.
+
+
+ Section 6 -- Term and Termination.
+
+ a. This Public License applies for the term of the Copyright and
+ Similar Rights licensed here. However, if You fail to comply with
+ this Public License, then Your rights under this Public License
+ terminate automatically.
+
+ b. Where Your right to use the Licensed Material has terminated under
+ Section 6(a), it reinstates:
+
+ 1. automatically as of the date the violation is cured, provided
+ it is cured within 30 days of Your discovery of the
+ violation; or
+
+ 2. upon express reinstatement by the Licensor.
+
+ For the avoidance of doubt, this Section 6(b) does not affect any
+ right the Licensor may have to seek remedies for Your violations
+ of this Public License.
+
+ c. For the avoidance of doubt, the Licensor may also offer the
+ Licensed Material under separate terms or conditions or stop
+ distributing the Licensed Material at any time; however, doing so
+ will not terminate this Public License.
+
+ d. Sections 1, 5, 6, 7, and 8 survive termination of this Public
+ License.
+
+
+ Section 7 -- Other Terms and Conditions.
+
+ a. The Licensor shall not be bound by any additional or different
+ terms or conditions communicated by You unless expressly agreed.
+
+ b. Any arrangements, understandings, or agreements regarding the
+ Licensed Material not stated herein are separate from and
+ independent of the terms and conditions of this Public License.
+
+
+ Section 8 -- Interpretation.
+
+ a. For the avoidance of doubt, this Public License does not, and
+ shall not be interpreted to, reduce, limit, restrict, or impose
+ conditions on any use of the Licensed Material that could lawfully
+ be made without permission under this Public License.
+
+ b. To the extent possible, if any provision of this Public License is
+ deemed unenforceable, it shall be automatically reformed to the
+ minimum extent necessary to make it enforceable. If the provision
+ cannot be reformed, it shall be severed from this Public License
+ without affecting the enforceability of the remaining terms and
+ conditions.
+
+ c. No term or condition of this Public License will be waived and no
+ failure to comply consented to unless expressly agreed to by the
+ Licensor.
+
+ d. Nothing in this Public License constitutes or may be interpreted
+ as a limitation upon, or waiver of, any privileges and immunities
+ that apply to the Licensor or You, including from the legal
+ processes of any jurisdiction or authority.
+
+
+ =======================================================================
+
+ Creative Commons is not a party to its public
+ licenses. Notwithstanding, Creative Commons may elect to apply one of
+ its public licenses to material it publishes and in those instances
+ will be considered the “Licensor.” The text of the Creative Commons
+ public licenses is dedicated to the public domain under the CC0 Public
+ Domain Dedication. Except for the limited purpose of indicating that
+ material is shared under a Creative Commons public license or as
+ otherwise permitted by the Creative Commons policies published at
+ creativecommons.org/policies, Creative Commons does not authorize the
+ use of the trademark "Creative Commons" or any other trademark or logo
+ of Creative Commons without its prior written consent including,
+ without limitation, in connection with any unauthorized modifications
+ to any of its public licenses or any other arrangements,
+ understandings, or agreements concerning use of licensed material. For
+ the avoidance of doubt, this paragraph does not form part of the
+ public licenses.
+
+ Creative Commons may be contacted at creativecommons.org.
+
LICENSES_SOURCES ADDED
@@ -0,0 +1,1060 @@
+ # Lemmatization Lists
+
+ * Author: Michal Měchura
+ * URL: https://github.com/michmech/lemmatization-lists/
+ * License: ODbL
+
+ ```
+ ## ODC Open Database License (ODbL)
+
+ ### Preamble
+
+ The Open Database License (ODbL) is a license agreement intended to
+ allow users to freely share, modify, and use this Database while
+ maintaining this same freedom for others. Many databases are covered by
+ copyright, and therefore this document licenses these rights. Some
+ jurisdictions, mainly in the European Union, have specific rights that
+ cover databases, and so the ODbL addresses these rights, too. Finally,
+ the ODbL is also an agreement in contract for users of this Database to
+ act in certain ways in return for accessing this Database.
+
+ Databases can contain a wide variety of types of content (images,
+ audiovisual material, and sounds all in the same database, for example),
+ and so the ODbL only governs the rights over the Database, and not the
+ contents of the Database individually. Licensors should use the ODbL
+ together with another license for the contents, if the contents have a
+ single set of rights that uniformly covers all of the contents. If the
+ contents have multiple sets of different rights, Licensors should
+ describe what rights govern what contents together in the individual
+ record or in some other way that clarifies what rights apply.
+
+ Sometimes the contents of a database, or the database itself, can be
+ covered by other rights not addressed here (such as private contracts,
+ trade mark over the name, or privacy rights / data protection rights
+ over information in the contents), and so you are advised that you may
+ have to consult other documents or clear other rights before doing
+ activities not covered by this License.
+
+ ------
+
+ The Licensor (as defined below)
+
+ and
+
+ You (as defined below)
+
+ agree as follows:
+
+ ### 1.0 Definitions of Capitalised Words
+
+ "Collective Database" – Means this Database in unmodified form as part
+ of a collection of independent databases in themselves that together are
+ assembled into a collective whole. A work that constitutes a Collective
+ Database will not be considered a Derivative Database.
+
+ "Convey" – As a verb, means Using the Database, a Derivative Database,
+ or the Database as part of a Collective Database in any way that enables
+ a Person to make or receive copies of the Database or a Derivative
+ Database. Conveying does not include interaction with a user through a
+ computer network, or creating and Using a Produced Work, where no
+ transfer of a copy of the Database or a Derivative Database occurs.
+ "Contents" – The contents of this Database, which includes the
+ information, independent works, or other material collected into the
+ Database. For example, the contents of the Database could be factual
+ data or works such as images, audiovisual material, text, or sounds.
+
+ "Database" – A collection of material (the Contents) arranged in a
+ systematic or methodical way and individually accessible by electronic
+ or other means offered under the terms of this License.
+
+ "Database Directive" – Means Directive 96/9/EC of the European
+ Parliament and of the Council of 11 March 1996 on the legal protection
+ of databases, as amended or succeeded.
+
+ "Database Right" – Means rights resulting from the Chapter III ("sui
+ generis") rights in the Database Directive (as amended and as transposed
+ by member states), which includes the Extraction and Re-utilisation of
+ the whole or a Substantial part of the Contents, as well as any similar
+ rights available in the relevant jurisdiction under Section 10.4.
+
+ "Derivative Database" – Means a database based upon the Database, and
+ includes any translation, adaptation, arrangement, modification, or any
+ other alteration of the Database or of a Substantial part of the
+ Contents. This includes, but is not limited to, Extracting or
+ Re-utilising the whole or a Substantial part of the Contents in a new
+ Database.
+
+ "Extraction" – Means the permanent or temporary transfer of all or a
+ Substantial part of the Contents to another medium by any means or in
+ any form.
+
+ "License" – Means this license agreement and is both a license of rights
+ such as copyright and Database Rights and an agreement in contract.
+
+ "Licensor" – Means the Person that offers the Database under the terms
+ of this License.
+
+ "Person" – Means a natural or legal person or a body of persons
+ corporate or incorporate.
+
+ "Produced Work" – a work (such as an image, audiovisual material, text,
+ or sounds) resulting from using the whole or a Substantial part of the
+ Contents (via a search or other query) from this Database, a Derivative
+ Database, or this Database as part of a Collective Database.
+
+ "Publicly" – means to Persons other than You or under Your control by
+ either more than 50% ownership or by the power to direct their
+ activities (such as contracting with an independent consultant).
+
+ "Re-utilisation" – means any form of making available to the public all
+ or a Substantial part of the Contents by the distribution of copies, by
+ renting, by online or other forms of transmission.
+
+ "Substantial" – Means substantial in terms of quantity or quality or a
+ combination of both. The repeated and systematic Extraction or
+ Re-utilisation of insubstantial parts of the Contents may amount to the
+ Extraction or Re-utilisation of a Substantial part of the Contents.
+
+ "Use" – As a verb, means doing any act that is restricted by copyright
+ or Database Rights whether in the original medium or any other; and
+ includes without limitation distributing, copying, publicly performing,
+ publicly displaying, and preparing derivative works of the Database, as
+ well as modifying the Database as may be technically necessary to use it
+ in a different mode or format.
+
+ "You" – Means a Person exercising rights under this License who has not
+ previously violated the terms of this License with respect to the
+ Database, or who has received express permission from the Licensor to
+ exercise rights under this License despite a previous violation.
+
+ Words in the singular include the plural and vice versa.
+
+ ### 2.0 What this License covers
+
+ 2.1. Legal effect of this document. This License is:
+
+ a. A license of applicable copyright and neighbouring rights;
+
+ b. A license of the Database Right; and
+
+ c. An agreement in contract between You and the Licensor.
+
+ 2.2 Legal rights covered. This License covers the legal rights in the
+ Database, including:
+
+ a. Copyright. Any copyright or neighbouring rights in the Database.
+ The copyright licensed includes any individual elements of the
+ Database, but does not cover the copyright over the Contents
+ independent of this Database. See Section 2.4 for details. Copyright
+ law varies between jurisdictions, but is likely to cover: the Database
+ model or schema, which is the structure, arrangement, and organisation
+ of the Database, and can also include the Database tables and table
+ indexes; the data entry and output sheets; and the Field names of
+ Contents stored in the Database;
+
+ b. Database Rights. Database Rights only extend to the Extraction and
+ Re-utilisation of the whole or a Substantial part of the Contents.
+ Database Rights can apply even when there is no copyright over the
+ Database. Database Rights can also apply when the Contents are removed
+ from the Database and are selected and arranged in a way that would
+ not infringe any applicable copyright; and
+
+ c. Contract. This is an agreement between You and the Licensor for
+ access to the Database. In return you agree to certain conditions of
+ use on this access as outlined in this License.
+
+ 2.3 Rights not covered.
+
+ a. This License does not apply to computer programs used in the making
+ or operation of the Database;
+
+ b. This License does not cover any patents over the Contents or the
+ Database; and
+
+ c. This License does not cover any trademarks associated with the
+ Database.
+
+ 2.4 Relationship to Contents in the Database. The individual items of
+ the Contents contained in this Database may be covered by other rights,
+ including copyright, patent, data protection, privacy, or personality
+ rights, and this License does not cover any rights (other than Database
+ Rights or in contract) in individual Contents contained in the Database.
+ For example, if used on a Database of images (the Contents), this
+ License would not apply to copyright over individual images, which could
+ have their own separate licenses, or one single license covering all of
+ the rights over the images.
+
+ ### 3.0 Rights granted
+
+ 3.1 Subject to the terms and conditions of this License, the Licensor
+ grants to You a worldwide, royalty-free, non-exclusive, terminable (but
+ only under Section 9) license to Use the Database for the duration of
+ any applicable copyright and Database Rights. These rights explicitly
+ include commercial use, and do not exclude any field of endeavour. To
+ the extent possible in the relevant jurisdiction, these rights may be
195
+ exercised in all media and formats whether now known or created in the
196
+ future.
197
+
198
+ The rights granted cover, for example:
199
+
200
+ a. Extraction and Re-utilisation of the whole or a Substantial part of
201
+ the Contents;
202
+
203
+ b. Creation of Derivative Databases;
204
+
205
+ c. Creation of Collective Databases;
206
+
207
+ d. Creation of temporary or permanent reproductions by any means and
208
+ in any form, in whole or in part, including of any Derivative
209
+ Databases or as a part of Collective Databases; and
210
+
211
+ e. Distribution, communication, display, lending, making available, or
212
+ performance to the public by any means and in any form, in whole or in
213
+ part, including of any Derivative Database or as a part of Collective
214
+ Databases.
215
+
216
+ 3.2 Compulsory license schemes. For the avoidance of doubt:
217
+
218
+ a. Non-waivable compulsory license schemes. In those jurisdictions in
219
+ which the right to collect royalties through any statutory or
220
+ compulsory licensing scheme cannot be waived, the Licensor reserves
221
+ the exclusive right to collect such royalties for any exercise by You
222
+ of the rights granted under this License;
223
+
224
+ b. Waivable compulsory license schemes. In those jurisdictions in
225
+ which the right to collect royalties through any statutory or
226
+ compulsory licensing scheme can be waived, the Licensor waives the
227
+ exclusive right to collect such royalties for any exercise by You of
228
+ the rights granted under this License; and,
229
+
230
+ c. Voluntary license schemes. The Licensor waives the right to collect
231
+ royalties, whether individually or, in the event that the Licensor is
232
+ a member of a collecting society that administers voluntary licensing
233
+ schemes, via that society, from any exercise by You of the rights
234
+ granted under this License.
235
+
236
+ 3.3 The right to release the Database under different terms, or to stop
237
+ distributing or making available the Database, is reserved. Note that
238
+ this Database may be multiple-licensed, and so You may have the choice
239
+ of using alternative licenses for this Database. Subject to Section
240
+ 10.4, all other rights not expressly granted by Licensor are reserved.
241
+
242
+ ### 4.0 Conditions of Use
243
+
244
+ 4.1 The rights granted in Section 3 above are expressly made subject to
245
+ Your complying with the following conditions of use. These are important
246
+ conditions of this License, and if You fail to follow them, You will be
247
+ in material breach of its terms.
248
+
249
+ 4.2 Notices. If You Publicly Convey this Database, any Derivative
250
+ Database, or the Database as part of a Collective Database, then You
251
+ must:
252
+
253
+ a. Do so only under the terms of this License or another license
254
+ permitted under Section 4.4;
255
+
256
+ b. Include a copy of this License (or, as applicable, a license
257
+ permitted under Section 4.4) or its Uniform Resource Identifier (URI)
258
+ with the Database or Derivative Database, including both in the
259
+ Database or Derivative Database and in any relevant documentation; and
260
+
261
+ c. Keep intact any copyright or Database Right notices and notices
262
+ that refer to this License.
263
+
264
+ d. If it is not possible to put the required notices in a particular
265
+ file due to its structure, then You must include the notices in a
266
+ location (such as a relevant directory) where users would be likely to
267
+ look for it.
268
+
269
+ 4.3 Notice for using output (Contents). Creating and Using a Produced
270
+ Work does not require the notice in Section 4.2. However, if you
271
+ Publicly Use a Produced Work, You must include a notice associated with
272
+ the Produced Work reasonably calculated to make any Person that uses,
273
+ views, accesses, interacts with, or is otherwise exposed to the Produced
274
+ Work aware that Content was obtained from the Database, Derivative
275
+ Database, or the Database as part of a Collective Database, and that it
276
+ is available under this License.
277
+
278
+ a. Example notice. The following text will satisfy notice under
279
+ Section 4.3:
280
+
281
+ Contains information from DATABASE NAME, which is made available
282
+ here under the Open Database License (ODbL).
283
+
284
+ DATABASE NAME should be replaced with the name of the Database and a
285
+ hyperlink to the URI of the Database. "Open Database License" should
286
+ contain a hyperlink to the URI of the text of this License. If
287
+ hyperlinks are not possible, You should include the plain text of the
288
+ required URIs with the above notice.
289
+
290
+ 4.4 Share alike.
291
+
292
+ a. Any Derivative Database that You Publicly Use must be only under
293
+ the terms of:
294
+
295
+ i. This License;
296
+
297
+ ii. A later version of this License similar in spirit to this
298
+ License; or
299
+
300
+ iii. A compatible license.
301
+
302
+ If You license the Derivative Database under one of the licenses
303
+ mentioned in (iii), You must comply with the terms of that license.
304
+
305
+ b. For the avoidance of doubt, Extraction or Re-utilisation of the
306
+ whole or a Substantial part of the Contents into a new database is a
307
+ Derivative Database and must comply with Section 4.4.
308
+
309
+ c. Derivative Databases and Produced Works. A Derivative Database is
310
+ Publicly Used and so must comply with Section 4.4 if a Produced Work
311
+ created from the Derivative Database is Publicly Used.
312
+
313
+ d. Share Alike and additional Contents. For the avoidance of doubt,
314
+ You must not add Contents to Derivative Databases under Section 4.4 a
315
+ that are incompatible with the rights granted under this License.
316
+
317
+ e. Compatible licenses. Licensors may authorise a proxy to determine
318
+ compatible licenses under Section 4.4 a iii. If they do so, the
319
+ authorised proxy's public statement of acceptance of a compatible
320
+ license grants You permission to use the compatible license.
321
+
322
+
323
+ 4.5 Limits of Share Alike. The requirements of Section 4.4 do not apply
324
+ in the following:
325
+
326
+ a. For the avoidance of doubt, You are not required to license
327
+ Collective Databases under this License if You incorporate this
328
+ Database or a Derivative Database in the collection, but this License
329
+ still applies to this Database or a Derivative Database as a part of
330
+ the Collective Database;
331
+
332
+ b. Using this Database, a Derivative Database, or this Database as
333
+ part of a Collective Database to create a Produced Work does not
334
+ create a Derivative Database for purposes of Section 4.4; and
335
+
336
+ c. Use of a Derivative Database internally within an organisation is
337
+ not to the public and therefore does not fall under the requirements
338
+ of Section 4.4.
339
+
340
+ 4.6 Access to Derivative Databases. If You Publicly Use a Derivative
341
+ Database or a Produced Work from a Derivative Database, You must also
342
+ offer to recipients of the Derivative Database or Produced Work a copy
343
+ in a machine readable form of:
344
+
345
+ a. The entire Derivative Database; or
346
+
347
+ b. A file containing all of the alterations made to the Database or
348
+ the method of making the alterations to the Database (such as an
349
+ algorithm), including any additional Contents, that make up all the
350
+ differences between the Database and the Derivative Database.
351
+
352
+ The Derivative Database (under a.) or alteration file (under b.) must be
353
+ available at no more than a reasonable production cost for physical
354
+ distributions and free of charge if distributed over the internet.
355
+
356
+ 4.7 Technological measures and additional terms
357
+
358
+ a. This License does not allow You to impose (except subject to
359
+ Section 4.7 b.) any terms or any technological measures on the
360
+ Database, a Derivative Database, or the whole or a Substantial part of
361
+ the Contents that alter or restrict the terms of this License, or any
362
+ rights granted under it, or have the effect or intent of restricting
363
+ the ability of any person to exercise those rights.
364
+
365
+ b. Parallel distribution. You may impose terms or technological
366
+ measures on the Database, a Derivative Database, or the whole or a
367
+ Substantial part of the Contents (a "Restricted Database") in
368
+ contravention of Section 4.7 a. only if You also make a copy of the
369
+ Database or a Derivative Database available to the recipient of the
370
+ Restricted Database:
371
+
372
+ i. That is available without additional fee;
373
+
374
+ ii. That is available in a medium that does not alter or restrict
375
+ the terms of this License, or any rights granted under it, or have
376
+ the effect or intent of restricting the ability of any person to
377
+ exercise those rights (an "Unrestricted Database"); and
378
+
379
+ iii. The Unrestricted Database is at least as accessible to the
380
+ recipient as a practical matter as the Restricted Database.
381
+
382
+ c. For the avoidance of doubt, You may place this Database or a
383
+ Derivative Database in an authenticated environment, behind a
384
+ password, or within a similar access control scheme provided that You
385
+ do not alter or restrict the terms of this License or any rights
386
+ granted under it or have the effect or intent of restricting the
387
+ ability of any person to exercise those rights.
388
+
389
+ 4.8 Licensing of others. You may not sublicense the Database. Each time
390
+ You communicate the Database, the whole or Substantial part of the
391
+ Contents, or any Derivative Database to anyone else in any way, the
392
+ Licensor offers to the recipient a license to the Database on the same
393
+ terms and conditions as this License. You are not responsible for
394
+ enforcing compliance by third parties with this License, but You may
395
+ enforce any rights that You have over a Derivative Database. You are
396
+ solely responsible for any modifications of a Derivative Database made
397
+ by You or another Person at Your direction. You may not impose any
398
+ further restrictions on the exercise of the rights granted or affirmed
399
+ under this License.
400
+
401
+ ### 5.0 Moral rights
402
+
403
+ 5.1 Moral rights. This section covers moral rights, including any rights
404
+ to be identified as the author of the Database or to object to treatment
405
+ that would otherwise prejudice the author's honour and reputation, or
406
+ any other derogatory treatment:
407
+
408
+ a. For jurisdictions allowing waiver of moral rights, Licensor waives
409
+ all moral rights that Licensor may have in the Database to the fullest
410
+ extent possible by the law of the relevant jurisdiction under Section
411
+ 10.4;
412
+
413
+ b. If waiver of moral rights under Section 5.1 a in the relevant
414
+ jurisdiction is not possible, Licensor agrees not to assert any moral
415
+ rights over the Database and waives all claims in moral rights to the
416
+ fullest extent possible by the law of the relevant jurisdiction under
417
+ Section 10.4; and
418
+
419
+ c. For jurisdictions not allowing waiver or an agreement not to assert
420
+ moral rights under Section 5.1 a and b, the author may retain their
421
+ moral rights over certain aspects of the Database.
422
+
423
+ Please note that some jurisdictions do not allow for the waiver of moral
424
+ rights, and so moral rights may still subsist over the Database in some
425
+ jurisdictions.
426
+
427
+ ### 6.0 Fair dealing, Database exceptions, and other rights not affected
428
+
429
+ 6.1 This License does not affect any rights that You or anyone else may
430
+ independently have under any applicable law to make any use of this
431
+ Database, including without limitation:
432
+
433
+ a. Exceptions to the Database Right including: Extraction of Contents
434
+ from non-electronic Databases for private purposes, Extraction for
435
+ purposes of illustration for teaching or scientific research, and
436
+ Extraction or Re-utilisation for public security or an administrative
437
+ or judicial procedure.
438
+
439
+ b. Fair dealing, fair use, or any other legally recognised limitation
440
+ or exception to infringement of copyright or other applicable laws.
441
+
442
+ 6.2 This License does not affect any rights of lawful users to Extract
443
+ and Re-utilise insubstantial parts of the Contents, evaluated
444
+ quantitatively or qualitatively, for any purposes whatsoever, including
445
+ creating a Derivative Database (subject to other rights over the
446
+ Contents, see Section 2.4). The repeated and systematic Extraction or
447
+ Re-utilisation of insubstantial parts of the Contents may however amount
448
+ to the Extraction or Re-utilisation of a Substantial part of the
449
+ Contents.
450
+
451
+ ### 7.0 Warranties and Disclaimer
452
+
453
+ 7.1 The Database is licensed by the Licensor "as is" and without any
454
+ warranty of any kind, either express, implied, or arising by statute,
455
+ custom, course of dealing, or trade usage. Licensor specifically
456
+ disclaims any and all implied warranties or conditions of title,
457
+ non-infringement, accuracy or completeness, the presence or absence of
458
+ errors, fitness for a particular purpose, merchantability, or otherwise.
459
+ Some jurisdictions do not allow the exclusion of implied warranties, so
460
+ this exclusion may not apply to You.
461
+
462
+ ### 8.0 Limitation of liability
463
+
464
+ 8.1 Subject to any liability that may not be excluded or limited by law,
465
+ the Licensor is not liable for, and expressly excludes, all liability
466
+ for loss or damage however and whenever caused to anyone by any use
467
+ under this License, whether by You or by anyone else, and whether caused
468
+ by any fault on the part of the Licensor or not. This exclusion of
469
+ liability includes, but is not limited to, any special, incidental,
470
+ consequential, punitive, or exemplary damages such as loss of revenue,
471
+ data, anticipated profits, and lost business. This exclusion applies
472
+ even if the Licensor has been advised of the possibility of such
473
+ damages.
474
+
475
+ 8.2 If liability may not be excluded by law, it is limited to actual and
476
+ direct financial loss to the extent it is caused by proved negligence on
477
+ the part of the Licensor.
478
+
479
+ ### 9.0 Termination of Your rights under this License
480
+
481
+ 9.1 Any breach by You of the terms and conditions of this License
482
+ automatically terminates this License with immediate effect and without
483
+ notice to You. For the avoidance of doubt, Persons who have received the
484
+ Database, the whole or a Substantial part of the Contents, Derivative
485
+ Databases, or the Database as part of a Collective Database from You
486
+ under this License will not have their licenses terminated provided
487
+ their use is in full compliance with this License or a license granted
488
+ under Section 4.8 of this License. Sections 1, 2, 7, 8, 9 and 10 will
489
+ survive any termination of this License.
490
+
491
+ 9.2 If You are not in breach of the terms of this License, the Licensor
492
+ will not terminate Your rights under it.
493
+
494
+ 9.3 Unless terminated under Section 9.1, this License is granted to You
495
+ for the duration of applicable rights in the Database.
496
+
497
+ 9.4 Reinstatement of rights. If you cease any breach of the terms and
498
+ conditions of this License, then your full rights under this License
499
+ will be reinstated:
500
+
501
+ a. Provisionally and subject to permanent termination until the 60th
502
+ day after cessation of breach;
503
+
504
+ b. Permanently on the 60th day after cessation of breach unless
505
+ otherwise reasonably notified by the Licensor; or
506
+
507
+ c. Permanently if reasonably notified by the Licensor of the
508
+ violation, this is the first time You have received notice of
509
+ violation of this License from the Licensor, and You cure the
510
+ violation prior to 30 days after your receipt of the notice.
511
+
512
+ Persons subject to permanent termination of rights are not eligible to
513
+ be a recipient and receive a license under Section 4.8.
514
+
515
+ 9.5 Notwithstanding the above, Licensor reserves the right to release
516
+ the Database under different license terms or to stop distributing or
517
+ making available the Database. Releasing the Database under different
518
+ license terms or stopping the distribution of the Database will not
519
+ withdraw this License (or any other license that has been, or is
520
+ required to be, granted under the terms of this License), and this
521
+ License will continue in full force and effect unless terminated as
522
+ stated above.
523
+
524
+ ### 10.0 General
525
+
526
+ 10.1 If any provision of this License is held to be invalid or
527
+ unenforceable, that must not affect the validity or enforceability of
528
+ the remainder of the terms and conditions of this License and each
529
+ remaining provision of this License shall be valid and enforced to the
530
+ fullest extent permitted by law.
531
+
532
+ 10.2 This License is the entire agreement between the parties with
533
+ respect to the rights granted here over the Database. It replaces any
534
+ earlier understandings, agreements or representations with respect to
535
+ the Database.
536
+
537
+ 10.3 If You are in breach of the terms of this License, You will not be
538
+ entitled to rely on the terms of this License or to complain of any
539
+ breach by the Licensor.
540
+
541
+ 10.4 Choice of law. This License takes effect in and will be governed by
542
+ the laws of the relevant jurisdiction in which the License terms are
543
+ sought to be enforced. If the standard suite of rights granted under
544
+ applicable copyright law and Database Rights in the relevant
545
+ jurisdiction includes additional rights not granted under this License,
546
+ these additional rights are granted in this License in order to meet the
547
+ terms of this License.
+ ```
548
+
549
+
550
+
551
+
552
+ # UD Romanian RRT v2.5
553
+
554
+ * Author: Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin
555
+ * URL: https://github.com/UniversalDependencies/UD_Romanian-RRT
556
+ * License: CC BY-SA 4.0
557
+
558
+ ```
559
+ Attribution-ShareAlike 4.0 International
560
+
561
+ =======================================================================
562
+
563
+ Creative Commons Corporation ("Creative Commons") is not a law firm and
564
+ does not provide legal services or legal advice. Distribution of
565
+ Creative Commons public licenses does not create a lawyer-client or
566
+ other relationship. Creative Commons makes its licenses and related
567
+ information available on an "as-is" basis. Creative Commons gives no
568
+ warranties regarding its licenses, any material licensed under their
569
+ terms and conditions, or any related information. Creative Commons
570
+ disclaims all liability for damages resulting from their use to the
571
+ fullest extent possible.
572
+
573
+ Using Creative Commons Public Licenses
574
+
575
+ Creative Commons public licenses provide a standard set of terms and
576
+ conditions that creators and other rights holders may use to share
577
+ original works of authorship and other material subject to copyright
578
+ and certain other rights specified in the public license below. The
579
+ following considerations are for informational purposes only, are not
580
+ exhaustive, and do not form part of our licenses.
581
+
582
+ Considerations for licensors: Our public licenses are
583
+ intended for use by those authorized to give the public
584
+ permission to use material in ways otherwise restricted by
585
+ copyright and certain other rights. Our licenses are
586
+ irrevocable. Licensors should read and understand the terms
587
+ and conditions of the license they choose before applying it.
588
+ Licensors should also secure all rights necessary before
589
+ applying our licenses so that the public can reuse the
590
+ material as expected. Licensors should clearly mark any
591
+ material not subject to the license. This includes other CC-
592
+ licensed material, or material used under an exception or
593
+ limitation to copyright. More considerations for licensors:
594
+ wiki.creativecommons.org/Considerations_for_licensors
595
+
596
+ Considerations for the public: By using one of our public
597
+ licenses, a licensor grants the public permission to use the
598
+ licensed material under specified terms and conditions. If
599
+ the licensor's permission is not necessary for any reason--for
600
+ example, because of any applicable exception or limitation to
601
+ copyright--then that use is not regulated by the license. Our
602
+ licenses grant only permissions under copyright and certain
603
+ other rights that a licensor has authority to grant. Use of
604
+ the licensed material may still be restricted for other
605
+ reasons, including because others have copyright or other
606
+ rights in the material. A licensor may make special requests,
607
+ such as asking that all changes be marked or described.
608
+ Although not required by our licenses, you are encouraged to
609
+ respect those requests where reasonable. More considerations
610
+ for the public:
611
+ wiki.creativecommons.org/Considerations_for_licensees
612
+
613
+ =======================================================================
614
+
615
+ Creative Commons Attribution-ShareAlike 4.0 International Public
616
+ License
617
+
618
+ By exercising the Licensed Rights (defined below), You accept and agree
619
+ to be bound by the terms and conditions of this Creative Commons
620
+ Attribution-ShareAlike 4.0 International Public License ("Public
621
+ License"). To the extent this Public License may be interpreted as a
622
+ contract, You are granted the Licensed Rights in consideration of Your
623
+ acceptance of these terms and conditions, and the Licensor grants You
624
+ such rights in consideration of benefits the Licensor receives from
625
+ making the Licensed Material available under these terms and
626
+ conditions.
627
+
628
+
629
+ Section 1 -- Definitions.
630
+
631
+ a. Adapted Material means material subject to Copyright and Similar
632
+ Rights that is derived from or based upon the Licensed Material
633
+ and in which the Licensed Material is translated, altered,
634
+ arranged, transformed, or otherwise modified in a manner requiring
635
+ permission under the Copyright and Similar Rights held by the
636
+ Licensor. For purposes of this Public License, where the Licensed
637
+ Material is a musical work, performance, or sound recording,
638
+ Adapted Material is always produced where the Licensed Material is
639
+ synched in timed relation with a moving image.
640
+
641
+ b. Adapter's License means the license You apply to Your Copyright
642
+ and Similar Rights in Your contributions to Adapted Material in
643
+ accordance with the terms and conditions of this Public License.
644
+
645
+ c. BY-SA Compatible License means a license listed at
646
+ creativecommons.org/compatiblelicenses, approved by Creative
647
+ Commons as essentially the equivalent of this Public License.
648
+
649
+ d. Copyright and Similar Rights means copyright and/or similar rights
650
+ closely related to copyright including, without limitation,
651
+ performance, broadcast, sound recording, and Sui Generis Database
652
+ Rights, without regard to how the rights are labeled or
653
+ categorized. For purposes of this Public License, the rights
654
+ specified in Section 2(b)(1)-(2) are not Copyright and Similar
655
+ Rights.
656
+
657
+ e. Effective Technological Measures means those measures that, in the
658
+ absence of proper authority, may not be circumvented under laws
659
+ fulfilling obligations under Article 11 of the WIPO Copyright
660
+ Treaty adopted on December 20, 1996, and/or similar international
661
+ agreements.
662
+
663
+ f. Exceptions and Limitations means fair use, fair dealing, and/or
664
+ any other exception or limitation to Copyright and Similar Rights
665
+ that applies to Your use of the Licensed Material.
666
+
667
+ g. License Elements means the license attributes listed in the name
668
+ of a Creative Commons Public License. The License Elements of this
669
+ Public License are Attribution and ShareAlike.
670
+
671
+ h. Licensed Material means the artistic or literary work, database,
672
+ or other material to which the Licensor applied this Public
673
+ License.
674
+
675
+ i. Licensed Rights means the rights granted to You subject to the
676
+ terms and conditions of this Public License, which are limited to
677
+ all Copyright and Similar Rights that apply to Your use of the
678
+ Licensed Material and that the Licensor has authority to license.
679
+
680
+ j. Licensor means the individual(s) or entity(ies) granting rights
681
+ under this Public License.
682
+
683
+ k. Share means to provide material to the public by any means or
684
+ process that requires permission under the Licensed Rights, such
685
+ as reproduction, public display, public performance, distribution,
686
+ dissemination, communication, or importation, and to make material
687
+ available to the public including in ways that members of the
688
+ public may access the material from a place and at a time
689
+ individually chosen by them.
690
+
691
+ l. Sui Generis Database Rights means rights other than copyright
692
+ resulting from Directive 96/9/EC of the European Parliament and of
693
+ the Council of 11 March 1996 on the legal protection of databases,
694
+ as amended and/or succeeded, as well as other essentially
695
+ equivalent rights anywhere in the world.
696
+
697
+ m. You means the individual or entity exercising the Licensed Rights
698
+ under this Public License. Your has a corresponding meaning.
699
+
700
+
701
+ Section 2 -- Scope.
702
+
703
+ a. License grant.
704
+
705
+ 1. Subject to the terms and conditions of this Public License,
706
+ the Licensor hereby grants You a worldwide, royalty-free,
707
+ non-sublicensable, non-exclusive, irrevocable license to
708
+ exercise the Licensed Rights in the Licensed Material to:
709
+
710
+ a. reproduce and Share the Licensed Material, in whole or
711
+ in part; and
712
+
713
+ b. produce, reproduce, and Share Adapted Material.
714
+
715
+ 2. Exceptions and Limitations. For the avoidance of doubt, where
716
+ Exceptions and Limitations apply to Your use, this Public
717
+ License does not apply, and You do not need to comply with
718
+ its terms and conditions.
719
+
720
+ 3. Term. The term of this Public License is specified in Section
721
+ 6(a).
722
+
723
+ 4. Media and formats; technical modifications allowed. The
724
+ Licensor authorizes You to exercise the Licensed Rights in
725
+ all media and formats whether now known or hereafter created,
726
+ and to make technical modifications necessary to do so. The
727
+ Licensor waives and/or agrees not to assert any right or
728
+ authority to forbid You from making technical modifications
729
+ necessary to exercise the Licensed Rights, including
730
+ technical modifications necessary to circumvent Effective
731
+ Technological Measures. For purposes of this Public License,
732
+ simply making modifications authorized by this Section 2(a)
733
+ (4) never produces Adapted Material.
734
+
735
+ 5. Downstream recipients.
736
+
737
+ a. Offer from the Licensor -- Licensed Material. Every
738
+ recipient of the Licensed Material automatically
739
+ receives an offer from the Licensor to exercise the
740
+ Licensed Rights under the terms and conditions of this
741
+ Public License.
742
+
743
+ b. Additional offer from the Licensor -- Adapted Material.
744
+ Every recipient of Adapted Material from You
745
+ automatically receives an offer from the Licensor to
746
+ exercise the Licensed Rights in the Adapted Material
747
+ under the conditions of the Adapter's License You apply.
748
+
749
+ c. No downstream restrictions. You may not offer or impose
750
+ any additional or different terms or conditions on, or
751
+ apply any Effective Technological Measures to, the
752
+ Licensed Material if doing so restricts exercise of the
753
+ Licensed Rights by any recipient of the Licensed
754
+ Material.
755
+
756
+ 6. No endorsement. Nothing in this Public License constitutes or
757
+ may be construed as permission to assert or imply that You
758
+ are, or that Your use of the Licensed Material is, connected
759
+ with, or sponsored, endorsed, or granted official status by,
760
+ the Licensor or others designated to receive attribution as
761
+ provided in Section 3(a)(1)(A)(i).
762
+
763
+ b. Other rights.
764
+
765
+ 1. Moral rights, such as the right of integrity, are not
766
+ licensed under this Public License, nor are publicity,
767
+ privacy, and/or other similar personality rights; however, to
768
+ the extent possible, the Licensor waives and/or agrees not to
769
+ assert any such rights held by the Licensor to the limited
770
+ extent necessary to allow You to exercise the Licensed
771
+ Rights, but not otherwise.
772
+
773
+ 2. Patent and trademark rights are not licensed under this
774
+ Public License.
775
+
776
+ 3. To the extent possible, the Licensor waives any right to
777
+ collect royalties from You for the exercise of the Licensed
778
+ Rights, whether directly or through a collecting society
779
+ under any voluntary or waivable statutory or compulsory
780
+ licensing scheme. In all other cases the Licensor expressly
781
+ reserves any right to collect such royalties.
782
+
783
+
784
+ Section 3 -- License Conditions.
785
+
786
+ Your exercise of the Licensed Rights is expressly made subject to the
787
+ following conditions.
788
+
789
+ a. Attribution.
790
+
791
+ 1. If You Share the Licensed Material (including in modified
792
+ form), You must:
793
+
794
+ a. retain the following if it is supplied by the Licensor
795
+ with the Licensed Material:
796
+
797
+ i. identification of the creator(s) of the Licensed
798
+ Material and any others designated to receive
799
+ attribution, in any reasonable manner requested by
800
+ the Licensor (including by pseudonym if
801
+ designated);
802
+
803
+ ii. a copyright notice;
804
+
805
+ iii. a notice that refers to this Public License;
806
+
807
+ iv. a notice that refers to the disclaimer of
808
+ warranties;
809
+
810
+ v. a URI or hyperlink to the Licensed Material to the
811
+ extent reasonably practicable;
812
+
813
+ b. indicate if You modified the Licensed Material and
814
+ retain an indication of any previous modifications; and
815
+
816
+ c. indicate the Licensed Material is licensed under this
817
+ Public License, and include the text of, or the URI or
818
+ hyperlink to, this Public License.
819
+
820
+ 2. You may satisfy the conditions in Section 3(a)(1) in any
821
+ reasonable manner based on the medium, means, and context in
822
+ which You Share the Licensed Material. For example, it may be
823
+ reasonable to satisfy the conditions by providing a URI or
824
+ hyperlink to a resource that includes the required
825
+ information.
826
+
827
+ 3. If requested by the Licensor, You must remove any of the
828
+ information required by Section 3(a)(1)(A) to the extent
829
+ reasonably practicable.
830
+
831
+ b. ShareAlike.
832
+
833
+ In addition to the conditions in Section 3(a), if You Share
834
+ Adapted Material You produce, the following conditions also apply.
835
+
836
+ 1. The Adapter's License You apply must be a Creative Commons
837
+ license with the same License Elements, this version or
838
+ later, or a BY-SA Compatible License.
839
+
840
+ 2. You must include the text of, or the URI or hyperlink to, the
841
+ Adapter's License You apply. You may satisfy this condition
842
+ in any reasonable manner based on the medium, means, and
843
+ context in which You Share Adapted Material.
844
+
845
+ 3. You may not offer or impose any additional or different terms
846
+ or conditions on, or apply any Effective Technological
847
+ Measures to, Adapted Material that restrict exercise of the
848
+ rights granted under the Adapter's License You apply.
849
+
850
+
851
+ Section 4 -- Sui Generis Database Rights.
852
+
853
+ Where the Licensed Rights include Sui Generis Database Rights that
854
+ apply to Your use of the Licensed Material:
855
+
856
+ a. for the avoidance of doubt, Section 2(a)(1) grants You the right
857
+ to extract, reuse, reproduce, and Share all or a substantial
858
+ portion of the contents of the database;
859
+
860
+ b. if You include all or a substantial portion of the database
861
+ contents in a database in which You have Sui Generis Database
862
+ Rights, then the database in which You have Sui Generis Database
863
+ Rights (but not its individual contents) is Adapted Material,
864
+
865
+ including for purposes of Section 3(b); and
866
+ c. You must comply with the conditions in Section 3(a) if You Share
867
+ all or a substantial portion of the contents of the database.
868
+
869
+ For the avoidance of doubt, this Section 4 supplements and does not
870
+ replace Your obligations under this Public License where the Licensed
871
+ Rights include other Copyright and Similar Rights.
872
+
873
+
874
+ Section 5 -- Disclaimer of Warranties and Limitation of Liability.
875
+
876
+ a. UNLESS OTHERWISE SEPARATELY UNDERTAKEN BY THE LICENSOR, TO THE
877
+ EXTENT POSSIBLE, THE LICENSOR OFFERS THE LICENSED MATERIAL AS-IS
878
+ AND AS-AVAILABLE, AND MAKES NO REPRESENTATIONS OR WARRANTIES OF
879
+ ANY KIND CONCERNING THE LICENSED MATERIAL, WHETHER EXPRESS,
880
+ IMPLIED, STATUTORY, OR OTHER. THIS INCLUDES, WITHOUT LIMITATION,
881
+ WARRANTIES OF TITLE, MERCHANTABILITY, FITNESS FOR A PARTICULAR
882
+ PURPOSE, NON-INFRINGEMENT, ABSENCE OF LATENT OR OTHER DEFECTS,
883
+ ACCURACY, OR THE PRESENCE OR ABSENCE OF ERRORS, WHETHER OR NOT
884
+ KNOWN OR DISCOVERABLE. WHERE DISCLAIMERS OF WARRANTIES ARE NOT
885
+ ALLOWED IN FULL OR IN PART, THIS DISCLAIMER MAY NOT APPLY TO YOU.
886
+
887
+ b. TO THE EXTENT POSSIBLE, IN NO EVENT WILL THE LICENSOR BE LIABLE
888
+ TO YOU ON ANY LEGAL THEORY (INCLUDING, WITHOUT LIMITATION,
889
+ NEGLIGENCE) OR OTHERWISE FOR ANY DIRECT, SPECIAL, INDIRECT,
890
+ INCIDENTAL, CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES,
891
+ COSTS, EXPENSES, OR DAMAGES ARISING OUT OF THIS PUBLIC LICENSE OR
892
+ USE OF THE LICENSED MATERIAL, EVEN IF THE LICENSOR HAS BEEN
893
+ ADVISED OF THE POSSIBILITY OF SUCH LOSSES, COSTS, EXPENSES, OR
894
+ DAMAGES. WHERE A LIMITATION OF LIABILITY IS NOT ALLOWED IN FULL OR
895
+ IN PART, THIS LIMITATION MAY NOT APPLY TO YOU.
896
+
897
+ c. The disclaimer of warranties and limitation of liability provided
898
+ above shall be interpreted in a manner that, to the extent
899
+ possible, most closely approximates an absolute disclaimer and
900
+ waiver of all liability.
901
+
902
+
903
+ Section 6 -- Term and Termination.
904
+
905
+ a. This Public License applies for the term of the Copyright and
906
+ Similar Rights licensed here. However, if You fail to comply with
907
+ this Public License, then Your rights under this Public License
908
+ terminate automatically.
909
+
910
+ b. Where Your right to use the Licensed Material has terminated under
911
+ Section 6(a), it reinstates:
912
+
913
+ 1. automatically as of the date the violation is cured, provided
914
+ it is cured within 30 days of Your discovery of the
915
+ violation; or
916
+
917
+ 2. upon express reinstatement by the Licensor.
918
+
919
+ For the avoidance of doubt, this Section 6(b) does not affect any
920
+ right the Licensor may have to seek remedies for Your violations
921
+ of this Public License.
922
+
923
+ c. For the avoidance of doubt, the Licensor may also offer the
924
+ Licensed Material under separate terms or conditions or stop
925
+ distributing the Licensed Material at any time; however, doing so
926
+ will not terminate this Public License.
927
+
928
+ d. Sections 1, 5, 6, 7, and 8 survive termination of this Public
929
+ License.
930
+
931
+
932
+ Section 7 -- Other Terms and Conditions.
933
+
934
+ a. The Licensor shall not be bound by any additional or different
935
+ terms or conditions communicated by You unless expressly agreed.
936
+
937
+ b. Any arrangements, understandings, or agreements regarding the
938
+ Licensed Material not stated herein are separate from and
939
+ independent of the terms and conditions of this Public License.
940
+
941
+
942
+ Section 8 -- Interpretation.
943
+
944
+ a. For the avoidance of doubt, this Public License does not, and
945
+ shall not be interpreted to, reduce, limit, restrict, or impose
946
+ conditions on any use of the Licensed Material that could lawfully
947
+ be made without permission under this Public License.
948
+
949
+ b. To the extent possible, if any provision of this Public License is
950
+ deemed unenforceable, it shall be automatically reformed to the
951
+ minimum extent necessary to make it enforceable. If the provision
952
+ cannot be reformed, it shall be severed from this Public License
953
+ without affecting the enforceability of the remaining terms and
954
+ conditions.
955
+
956
+ c. No term or condition of this Public License will be waived and no
957
+ failure to comply consented to unless expressly agreed to by the
958
+ Licensor.
959
+
960
+ d. Nothing in this Public License constitutes or may be interpreted
961
+ as a limitation upon, or waiver of, any privileges and immunities
962
+ that apply to the Licensor or You, including from the legal
963
+ processes of any jurisdiction or authority.
964
+
965
+
966
+ =======================================================================
967
+
968
+ Creative Commons is not a party to its public
969
+ licenses. Notwithstanding, Creative Commons may elect to apply one of
970
+ its public licenses to material it publishes and in those instances
971
+ will be considered the “Licensor.” The text of the Creative Commons
972
+ public licenses is dedicated to the public domain under the CC0 Public
973
+ Domain Dedication. Except for the limited purpose of indicating that
974
+ material is shared under a Creative Commons public license or as
975
+ otherwise permitted by the Creative Commons policies published at
976
+ creativecommons.org/policies, Creative Commons does not authorize the
977
+ use of the trademark "Creative Commons" or any other trademark or logo
978
+ of Creative Commons without its prior written consent including,
979
+ without limitation, in connection with any unauthorized modifications
980
+ to any of its public licenses or any other arrangements,
981
+ understandings, or agreements concerning use of licensed material. For
982
+ the avoidance of doubt, this paragraph does not form part of the
983
+ public licenses.
984
+
985
+ Creative Commons may be contacted at creativecommons.org.
986
+
987
+ ```
988
+
989
+
990
+
991
+
992
+ # RONEC - the Romanian Named Entity Corpus (ca9ce460)
993
+
994
+ * Author: Dumitrescu, Stefan Daniel; Avram, Andrei-Marius; Morogan, Luciana; Toma, Stefan
995
+ * URL: https://github.com/dumitrescustefan/ronec
996
+ * License: MIT
997
+
998
+ ```
999
+
1000
+ MIT License
1001
+
1002
+ Copyright (c) 2018 Stefan Dumitrescu
1003
+
1004
+ Permission is hereby granted, free of charge, to any person obtaining a copy
1005
+ of this software and associated documentation files (the "Software"), to deal
1006
+ in the Software without restriction, including without limitation the rights
1007
+ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
1008
+ copies of the Software, and to permit persons to whom the Software is
1009
+ furnished to do so, subject to the following conditions:
1010
+
1011
+ The above copyright notice and this permission notice shall be included in all
1012
+ copies or substantial portions of the Software.
1013
+
1014
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
1015
+ IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
1016
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
1017
+ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
1018
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
1019
+ OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
1020
+ SOFTWARE.
+ ```
1021
+
1022
+
1023
+
1024
+
1025
+ # Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)
1026
+
1027
+ * Author: Explosion
1028
+ * URL: https://spacy.io
1029
+ * License: CC0
1030
+
1031
+ ```
1032
+ The laws of most jurisdictions throughout the world automatically confer exclusive Copyright and Related Rights (defined below) upon the creator and subsequent owner(s) (each and all, an "owner") of an original work of authorship and/or a database (each, a "Work").
1033
+
1034
+ Certain owners wish to permanently relinquish those rights to a Work for the purpose of contributing to a commons of creative, cultural and scientific works ("Commons") that the public can reliably and without fear of later claims of infringement build upon, modify, incorporate in other works, reuse and redistribute as freely as possible in any form whatsoever and for any purposes, including without limitation commercial purposes. These owners may contribute to the Commons to promote the ideal of a free culture and the further production of creative, cultural and scientific works, or to gain reputation or greater distribution for their Work in part through the use and efforts of others.
1035
+
1036
+ For these and/or other purposes and motivations, and without any expectation of additional consideration or compensation, the person associating CC0 with a Work (the "Affirmer"), to the extent that he or she is an owner of Copyright and Related Rights in the Work, voluntarily elects to apply CC0 to the Work and publicly distribute the Work under its terms, with knowledge of his or her Copyright and Related Rights in the Work and the meaning and intended legal effect of CC0 on those rights.
1037
+
1038
+ 1. Copyright and Related Rights. A Work made available under CC0 may be protected by copyright and related or neighboring rights ("Copyright and Related Rights"). Copyright and Related Rights include, but are not limited to, the following:
1039
+
1040
+ the right to reproduce, adapt, distribute, perform, display, communicate, and translate a Work;
1041
+ moral rights retained by the original author(s) and/or performer(s);
1042
+ publicity and privacy rights pertaining to a person's image or likeness depicted in a Work;
1043
+ rights protecting against unfair competition in regards to a Work, subject to the limitations in paragraph 4(a), below;
1044
+ rights protecting the extraction, dissemination, use and reuse of data in a Work;
1045
+ database rights (such as those arising under Directive 96/9/EC of the European Parliament and of the Council of 11 March 1996 on the legal protection of databases, and under any national implementation thereof, including any amended or successor version of such directive); and
1046
+ other similar, equivalent or corresponding rights throughout the world based on applicable law or treaty, and any national implementations thereof.
1047
+ 2. Waiver. To the greatest extent permitted by, but not in contravention of, applicable law, Affirmer hereby overtly, fully, permanently, irrevocably and unconditionally waives, abandons, and surrenders all of Affirmer's Copyright and Related Rights and associated claims and causes of action, whether now known or unknown (including existing as well as future claims and causes of action), in the Work (i) in all territories worldwide, (ii) for the maximum duration provided by applicable law or treaty (including future time extensions), (iii) in any current or future medium and for any number of copies, and (iv) for any purpose whatsoever, including without limitation commercial, advertising or promotional purposes (the "Waiver"). Affirmer makes the Waiver for the benefit of each member of the public at large and to the detriment of Affirmer's heirs and successors, fully intending that such Waiver shall not be subject to revocation, rescission, cancellation, termination, or any other legal or equitable action to disrupt the quiet enjoyment of the Work by the public as contemplated by Affirmer's express Statement of Purpose.
1048
+
1049
+ 3. Public License Fallback. Should any part of the Waiver for any reason be judged legally invalid or ineffective under applicable law, then the Waiver shall be preserved to the maximum extent permitted taking into account Affirmer's express Statement of Purpose. In addition, to the extent the Waiver is so judged Affirmer hereby grants to each affected person a royalty-free, non transferable, non sublicensable, non exclusive, irrevocable and unconditional license to exercise Affirmer's Copyright and Related Rights in the Work (i) in all territories worldwide, (ii) for the maximum duration provided by applicable law or treaty (including future time extensions), (iii) in any current or future medium and for any number of copies, and (iv) for any purpose whatsoever, including without limitation commercial, advertising or promotional purposes (the "License"). The License shall be deemed effective as of the date CC0 was applied by Affirmer to the Work. Should any part of the License for any reason be judged legally invalid or ineffective under applicable law, such partial invalidity or ineffectiveness shall not invalidate the remainder of the License, and in such case Affirmer hereby affirms that he or she will not (i) exercise any of his or her remaining Copyright and Related Rights in the Work or (ii) assert any associated claims and causes of action with respect to the Work, in either case contrary to Affirmer's express Statement of Purpose.
1050
+
1051
+ 4. Limitations and Disclaimers.
1052
+
1053
+ No trademark or patent rights held by Affirmer are waived, abandoned, surrendered, licensed or otherwise affected by this document.
1054
+ Affirmer offers the Work as-is and makes no representations or warranties of any kind concerning the Work, express, implied, statutory or otherwise, including without limitation warranties of title, merchantability, fitness for a particular purpose, non infringement, or the absence of latent or other defects, accuracy, or the present or absence of errors, whether or not discoverable, all to the greatest extent permissible under applicable law.
1055
+ Affirmer disclaims responsibility for clearing rights of other persons that may apply to the Work or any use thereof, including without limitation any person's Copyright and Related Rights in the Work. Further, Affirmer disclaims responsibility for obtaining any necessary consents, permissions or other rights required for any use of the Work.
1056
+ Affirmer understands and acknowledges that Creative Commons is not a party to this document and has no duty or obligation with respect to this CC0 or use of the Work.
+ ```
1057
+
1058
+
1059
+
1060
+
README.md ADDED
@@ -0,0 +1,106 @@
1
+ ---
2
+ tags:
3
+ - spacy
4
+ - token-classification
5
+ language:
6
+ - ro
7
+ license: CC-BY-SA-4.0
8
+ model-index:
9
+ - name: ro_core_news_lg
10
+ results:
11
+ - tasks:
12
+ name: NER
13
+ type: token-classification
14
+ metrics:
15
+ - name: Precision
16
+ type: precision
17
+ value: 0.7588147037
18
+ - name: Recall
19
+ type: recall
20
+ value: 0.7771801767
21
+ - name: F Score
22
+ type: f_score
23
+ value: 0.7678876447
24
+ - tasks:
25
+ name: POS
26
+ type: token-classification
27
+ metrics:
28
+ - name: Accuracy
29
+ type: accuracy
30
+ value: 0.975315026
31
+ - tasks:
32
+ name: SENTER
33
+ type: token-classification
34
+ metrics:
35
+ - name: Precision
36
+ type: precision
37
+ value: 0.9533954727
38
+ - name: Recall
39
+ type: recall
40
+ value: 0.9521276596
41
+ - name: F Score
42
+ type: f_score
43
+ value: 0.9527611444
44
+ - tasks:
45
+ name: UNLABELED_DEPENDENCIES
46
+ type: token-classification
47
+ metrics:
48
+ - name: Accuracy
49
+ type: accuracy
50
+ value: 0.8904573687
51
+ - tasks:
52
+ name: LABELED_DEPENDENCIES
53
+ type: token-classification
54
+ metrics:
55
+ - name: Accuracy
56
+ type: accuracy
57
+ value: 0.8467281511
58
+ ---
59
+ ### Details: https://spacy.io/models/ro#ro_core_news_lg
60
+
61
+ Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.
62
+
63
+ | Feature | Description |
64
+ | --- | --- |
65
+ | **Name** | `ro_core_news_lg` |
66
+ | **Version** | `3.1.0` |
67
+ | **spaCy** | `>=3.1.0,<3.2.0` |
68
+ | **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
69
+ | **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
70
+ | **Vectors** | 500000 keys, 500000 unique vectors (300 dimensions) |
71
+ | **Sources** | [Lemmatization Lists](https://github.com/michmech/lemmatization-lists/) (Michal Měchura)<br />[UD Romanian RRT v2.5](https://github.com/UniversalDependencies/UD_Romanian-RRT) (Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin)<br />[RONEC - the Romanian Named Entity Corpus (ca9ce460)](https://github.com/dumitrescustefan/ronec) (Dumitrescu, Stefan Daniel; Avram, Andrei-Marius; Morogan, Luciana; Toma, Stefan)<br />[Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)](https://spacy.io) (Explosion) |
72
+ | **License** | `CC BY-SA 4.0` |
73
+ | **Author** | [Explosion](https://explosion.ai) |
74
+
75
+ ### Label Scheme
76
+
77
+ <details>
78
+
79
+ <summary>View label scheme (534 labels for 4 components)</summary>
80
+
81
+ | Component | Labels |
82
+ | --- | --- |
83
+ | **`tagger`** | `ARROW`, `Af`, `Afcfp-n`, `Afcfson`, `Afcfsrn`, `Afcmpoy`, `Afcms-n`, `Afp`, `Afp-p-n`, `Afp-poy`, `Afpf--n`, `Afpfp-n`, `Afpfp-ny`, `Afpfpoy`, `Afpfpry`, `Afpfson`, `Afpfsoy`, `Afpfsrn`, `Afpfsry`, `Afpm--n`, `Afpmp-n`, `Afpmpoy`, `Afpmpry`, `Afpms-n`, `Afpmsoy`, `Afpmsry`, `Afsfp-n`, `Afsfsrn`, `BULLET`, `COLON`, `COMMA`, `Ccssp`, `Ccsspy`, `Crssp`, `Csssp`, `Cssspy`, `DASH`, `DBLQ`, `Dd3-po---e`, `Dd3-po---o`, `Dd3fpo`, `Dd3fpr`, `Dd3fpr---e`, `Dd3fpr---o`, `Dd3fpr--y`, `Dd3fso`, `Dd3fso---e`, `Dd3fsr`, `Dd3fsr---e`, `Dd3fsr---o`, `Dd3fsr--yo`, `Dd3mpo`, `Dd3mpr`, `Dd3mpr---e`, `Dd3mpr---o`, `Dd3mso---e`, `Dd3msr`, `Dd3msr---e`, `Dd3msr---o`, `Dh1ms`, `Dh3fp`, `Dh3fso`, `Dh3fsr`, `Dh3mp`, `Dh3ms`, `Di3`, `Di3-----y`, `Di3--r---e`, `Di3-po`, `Di3-po---e`, `Di3-sr`, `Di3-sr---e`, `Di3-sr--y`, `Di3fp`, `Di3fpr`, `Di3fpr---e`, `Di3fso`, `Di3fso---e`, `Di3fsr`, `Di3fsr---e`, `Di3mp`, `Di3mpr`, `Di3mpr---e`, `Di3ms`, `Di3ms----e`, `Di3mso---e`, `Di3msr`, `Di3msr---e`, `Ds1fp-p`, `Ds1fp-s`, `Ds1fsop`, `Ds1fsos`, `Ds1fsrp`, `Ds1fsrs`, `Ds1fsrs-y`, `Ds1mp-p`, `Ds1mp-s`, `Ds1ms-p`, `Ds1ms-s`, `Ds1msrs-y`, `Ds2---s`, `Ds2fp-p`, `Ds2fp-s`, `Ds2fsrp`, `Ds2fsrs`, `Ds2mp-p`, `Ds2mp-s`, `Ds2ms-p`, `Ds2ms-s`, `Ds3---p`, `Ds3---s`, `Ds3fp-s`, `Ds3fsos`, `Ds3fsrs`, `Ds3mp-s`, `Ds3ms-s`, `Dw3--r---e`, `Dw3-po---e`, `Dw3fpr`, `Dw3fso---e`, `Dw3fsr`, `Dw3mpr`, `Dw3mso---e`, `Dw3msr`, `Dz3fsr---e`, `Dz3mso---e`, `Dz3msr---e`, `EQUAL`, `EXCL`, `EXCLHELLIP`, `GE`, `GT`, `HELLIP`, `I`, `LCURL`, `LPAR`, `LSQR`, `LT`, `M`, `Mc`, `Mc-p-d`, `Mc-p-l`, `Mcfp-l`, `Mcfp-ln`, `Mcfprln`, `Mcfprly`, `Mcfsoln`, `Mcfsrln`, `Mcmp-l`, `Mcms-ln`, `Mcmsrl`, `Mcmsrly`, `Mffprln`, `Mffsrln`, `Mlfpo`, `Mlfpr`, `Mlmpr`, `Mo---l`, `Mo---ln`, `Mo-s-r`, `Mofp-ln`, `Mofpoly`, `Mofprly`, `Mofs-l`, `Mofsoln`, `Mofsoly`, `Mofsrln`, `Mofsrly`, `Mompoly`, `Momprly`, `Moms-l`, `Moms-ln`, `Momsoly`, `Momsrly`, `Nc`, `Nc---n`, `Ncf--n`, `Ncfp-n`, `Ncfpoy`, `Ncfpry`, `Ncfs-n`, `Ncfson`, `Ncfsoy`, 
`Ncfsrn`, `Ncfsry`, `Ncfsryy`, `Ncfsvy`, `Ncm--n`, `Ncmp-n`, `Ncmpoy`, `Ncmpry`, `Ncms-n`, `Ncms-ny`, `Ncms-y`, `Ncmsoy`, `Ncmsrn`, `Ncmsry`, `Ncmsryy`, `Ncmsvn`, `Ncmsvy`, `Np`, `Npfson`, `Npfsoy`, `Npfsrn`, `Npfsry`, `Npmpoy`, `Npmpry`, `Npms-n`, `Npmsoy`, `Npmsry`, `PERCENT`, `PERIOD`, `PLUS`, `PLUSMINUS`, `Pd3-po`, `Pd3fpr`, `Pd3fso`, `Pd3fsr`, `Pd3mpo`, `Pd3mpr`, `Pd3mpr--y`, `Pd3mso`, `Pd3msr`, `Pi3`, `Pi3--r`, `Pi3-po`, `Pi3-so`, `Pi3-sr`, `Pi3fpr`, `Pi3fso`, `Pi3fsr`, `Pi3mpr`, `Pi3mso`, `Pi3msr`, `Pi3msr--y`, `Pp1-pa--------w`, `Pp1-pa--y-----w`, `Pp1-pd--------s`, `Pp1-pd--------w`, `Pp1-pd--y-----w`, `Pp1-pr--------s`, `Pp1-sa--------s`, `Pp1-sa--------w`, `Pp1-sa--y-----w`, `Pp1-sd--------s`, `Pp1-sd--------w`, `Pp1-sd--y-----w`, `Pp1-sn--------s`, `Pp2-----------s`, `Pp2-pa--------w`, `Pp2-pa--y-----w`, `Pp2-pd--------w`, `Pp2-pd--y-----w`, `Pp2-pr--------s`, `Pp2-sa--------s`, `Pp2-sa--------w`, `Pp2-sa--y-----w`, `Pp2-sd--------s`, `Pp2-sd--------w`, `Pp2-sd--y-----w`, `Pp2-sn--------s`, `Pp2-so--------s`, `Pp2-sr--------s`, `Pp3-p---------s`, `Pp3-pd--------w`, `Pp3-pd--y-----w`, `Pp3-po--------s`, `Pp3-sd--------w`, `Pp3-sd--y-----w`, `Pp3fpa--------w`, `Pp3fpa--y-----w`, `Pp3fpr--------s`, `Pp3fs---------s`, `Pp3fsa--------w`, `Pp3fsa--y-----w`, `Pp3fso--------s`, `Pp3fsr--------s`, `Pp3fsr--y-----s`, `Pp3mpa--------w`, `Pp3mpa--y-----w`, `Pp3mpr--------s`, `Pp3ms---------s`, `Pp3msa--------w`, `Pp3msa--y-----w`, `Pp3mso--------s`, `Pp3msr--------s`, `Pp3msr--y-----s`, `Ps1fp-s`, `Ps1fsrp`, `Ps1fsrs`, `Ps1mp-p`, `Ps1ms-p`, `Ps2fp-s`, `Ps2fsrp`, `Ps2fsrs`, `Ps2ms-s`, `Ps3---p`, `Ps3---s`, `Ps3fp-s`, `Ps3fsrs`, `Ps3mp-s`, `Ps3ms-s`, `Pw3--r`, `Pw3-po`, `Pw3-so`, `Pw3fpr`, `Pw3fso`, `Pw3mpr`, `Pw3mso`, `Px3--a--------s`, `Px3--a--------w`, `Px3--a--y-----w`, `Px3--d--------w`, `Px3--d--y-----w`, `Pz3-sr`, `Pz3fsr`, `QUEST`, `QUOT`, `Qf`, `Qn`, `Qs`, `Qs-y`, `Qz`, `Qz-y`, `RCURL`, `RPAR`, `RSQR`, `Rc`, `Rgc`, `Rgp`, `Rgpy`, `Rgs`, `Rp`, `Rw`, `Rw-y`, 
`Rz`, `SCOLON`, `SLASH`, `STAR`, `Sp`, `Spsa`, `Spsay`, `Spsd`, `Spsg`, `Td-po`, `Tdfpr`, `Tdfso`, `Tdfsr`, `Tdmpr`, `Tdmso`, `Tdmsr`, `Tf-so`, `Tffpoy`, `Tffpry`, `Tffs-y`, `Tfmpoy`, `Tfms-y`, `Tfmsoy`, `Tfmsry`, `Ti-po`, `Tifp-y`, `Tifso`, `Tifsr`, `Timso`, `Timsr`, `Tsfp`, `Tsfs`, `Tsmp`, `Tsms`, `UNDERSC`, `Va--1`, `Va--1-----y`, `Va--1p`, `Va--1s`, `Va--1s----y`, `Va--2p`, `Va--2p----y`, `Va--2s`, `Va--2s----y`, `Va--3`, `Va--3-----y`, `Va--3p`, `Va--3p----y`, `Va--3s`, `Va--3s----y`, `Vag`, `Vaii1`, `Vaii2s`, `Vaii3p`, `Vaii3s`, `Vail3p`, `Vail3s`, `Vaip1p`, `Vaip1s`, `Vaip2p`, `Vaip2s`, `Vaip3p`, `Vaip3p----y`, `Vaip3s`, `Vaip3s----y`, `Vais3p`, `Vais3s`, `Vam-2s`, `Vanp`, `Vap--sm`, `Vasp1p`, `Vasp1s`, `Vasp2p`, `Vasp2s`, `Vasp3`, `Vmg`, `Vmg-------y`, `Vmii1`, `Vmii1-----y`, `Vmii2p`, `Vmii2s`, `Vmii3p`, `Vmii3p----y`, `Vmii3s`, `Vmii3s----y`, `Vmil1`, `Vmil1p`, `Vmil2s`, `Vmil3p`, `Vmil3p----y`, `Vmil3s`, `Vmil3s----y`, `Vmip1p`, `Vmip1p----y`, `Vmip1s`, `Vmip1s----y`, `Vmip2p`, `Vmip2s`, `Vmip2s----y`, `Vmip3`, `Vmip3-----y`, `Vmip3p`, `Vmip3s`, `Vmip3s----y`, `Vmis1p`, `Vmis1s`, `Vmis3p`, `Vmis3p----y`, `Vmis3s`, `Vmis3s----y`, `Vmm-2p`, `Vmm-2s`, `Vmnp`, `Vmnp------y`, `Vmp--pf`, `Vmp--pm`, `Vmp--sf`, `Vmp--sm`, `Vmp--sm---y`, `Vmsp1p`, `Vmsp1s`, `Vmsp2s`, `Vmsp3`, `Vmsp3-----y`, `X`, `Y`, `Ya`, `Yn`, `Ynfsoy`, `Ynfsry`, `Ynmsoy`, `Ynmsry`, `Yp`, `Yp-sr`, `Yr` |
84
+ | **`parser`** | `ROOT`, `acl`, `advcl`, `advcl:tcl`, `advmod`, `advmod:tmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `cc`, `cc:preconj`, `ccomp`, `ccomp:pmod`, `compound`, `conj`, `cop`, `csubj`, `csubj:pass`, `dep`, `det`, `expl`, `expl:impers`, `expl:pass`, `expl:poss`, `expl:pv`, `fixed`, `flat`, `goeswith`, `iobj`, `mark`, `nmod`, `nmod:agent`, `nmod:pmod`, `nmod:tmod`, `nsubj`, `nsubj:pass`, `nummod`, `obj`, `obl`, `orphan`, `parataxis`, `punct`, `vocative`, `xcomp` |
85
+ | **`senter`** | `I`, `S` |
86
+ | **`ner`** | `DATETIME`, `EVENT`, `FACILITY`, `GPE`, `LANGUAGE`, `LOC`, `MONEY`, `NAT_REL_POL`, `NUMERIC_VALUE`, `ORDINAL`, `ORGANIZATION`, `PERIOD`, `PERSON`, `PRODUCT`, `QUANTITY`, `WORK_OF_ART` |
87
+
88
+ </details>
89
+
90
+ ### Accuracy
91
+
92
+ | Type | Score |
93
+ | --- | --- |
94
+ | `TOKEN_ACC` | 99.90 |
95
+ | `TAG_ACC` | 97.53 |
96
+ | `POS_ACC` | 96.54 |
97
+ | `MORPH_ACC` | 97.61 |
98
+ | `LEMMA_ACC` | 81.87 |
99
+ | `DEP_UAS` | 89.05 |
100
+ | `DEP_LAS` | 84.67 |
101
+ | `ENTS_P` | 75.88 |
102
+ | `ENTS_R` | 77.72 |
103
+ | `ENTS_F` | 76.79 |
104
+ | `SENTS_P` | 95.34 |
105
+ | `SENTS_R` | 95.21 |
106
+ | `SENTS_F` | 95.28 |
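
The README table reports percentages, while the front matter and accuracy.json store the same scores as fractions. As a rough sanity check, and not part of the commit itself, the sketch below recomputes the F-scores from precision and recall and formats fractions the way the table does. The helper names `f_score` and `as_percent` are hypothetical; the numbers are copied from this commit.

```python
# Sketch: relate the fractional scores in accuracy.json to the README table.
# Helper names are hypothetical; the values are copied from this commit.

def f_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 when both are zero)."""
    total = precision + recall
    return 0.0 if total == 0 else 2 * precision * recall / total


def as_percent(score: float) -> str:
    """Format a 0-1 score as a two-decimal percentage string."""
    return f"{score * 100:.2f}"


scores = {
    "ents_p": 0.7588147037,
    "ents_r": 0.7771801767,
    "ents_f": 0.7678876447,
    "sents_p": 0.9533954727,
    "sents_r": 0.9521276596,
    "sents_f": 0.9527611444,
    "dep_uas": 0.8904573687,
    "dep_las": 0.8467281511,
}

# Recompute the F-scores from the stored precision and recall.
ents_f = f_score(scores["ents_p"], scores["ents_r"])
sents_f = f_score(scores["sents_p"], scores["sents_r"])

# Print each stored score next to its table-style percentage.
for name, value in scores.items():
    print(f"{name.upper():<8} {as_percent(value)}")
```

Under this formatting, `dep_uas` and `dep_las` give different percentages, which is a quick way to spot a labeled-attachment score that was accidentally copied from the unlabeled one.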
accuracy.json ADDED
@@ -0,0 +1,447 @@
1
+ {
2
+ "token_acc": 0.9990029326,
3
+ "tag_acc": 0.975315026,
4
+ "pos_acc": 0.965353945,
5
+ "morph_acc": 0.9760744713,
6
+ "lemma_acc": 0.8186589263,
7
+ "dep_uas": 0.8904573687,
8
+ "dep_las": 0.8467281511,
9
+ "ents_p": 0.7588147037,
10
+ "ents_r": 0.7771801767,
11
+ "ents_f": 0.7678876447,
12
+ "sents_p": 0.9533954727,
13
+ "sents_r": 0.9521276596,
14
+ "sents_f": 0.9527611444,
15
+ "speed": 10281.5880630945,
16
+ "morph_per_feat": {
17
+ "AdpType": {
18
+ "p": 0.9970784641,
19
+ "r": 0.9941739492,
20
+ "f": 0.9956240884
21
+ },
22
+ "Case": {
23
+ "p": 0.9896781203,
24
+ "r": 0.9840648211,
25
+ "f": 0.9868634886
26
+ },
27
+ "Variant": {
28
+ "p": 0.9845559846,
29
+ "r": 0.9205776173,
30
+ "f": 0.9514925373
31
+ },
32
+ "Gender": {
33
+ "p": 0.9840800225,
34
+ "r": 0.9782913165,
35
+ "f": 0.9811771316
36
+ },
37
+ "Number": {
38
+ "p": 0.9833276236,
39
+ "r": 0.9778560073,
40
+ "f": 0.9805841827
41
+ },
42
+ "PronType": {
43
+ "p": 0.9938366718,
44
+ "r": 0.987244898,
45
+ "f": 0.9905298183
46
+ },
47
+ "Definite": {
48
+ "p": 0.9784728611,
49
+ "r": 0.9725676664,
50
+ "f": 0.9755113272
51
+ },
52
+ "Degree": {
53
+ "p": 0.9530685921,
54
+ "r": 0.9428571429,
55
+ "f": 0.947935368
56
+ },
57
+ "Polarity": {
58
+ "p": 0.9884467266,
59
+ "r": 0.985915493,
60
+ "f": 0.9871794872
61
+ },
62
+ "Mood": {
63
+ "p": 0.9740072202,
64
+ "r": 0.9635714286,
65
+ "f": 0.9687612208
66
+ },
67
+ "Person": {
68
+ "p": 0.9837338262,
69
+ "r": 0.9718772827,
70
+ "f": 0.9777696123
71
+ },
72
+ "Tense": {
73
+ "p": 0.9730337079,
74
+ "r": 0.9572586588,
75
+ "f": 0.9650817236
76
+ },
77
+ "VerbForm": {
78
+ "p": 0.9698996656,
79
+ "r": 0.9593572779,
80
+ "f": 0.9645996674
81
+ },
82
+ "NumForm": {
83
+ "p": 0.9901960784,
84
+ "r": 0.9853658537,
85
+ "f": 0.9877750611
86
+ },
87
+ "NumType": {
88
+ "p": 0.9927536232,
89
+ "r": 0.9856115108,
90
+ "f": 0.9891696751
91
+ },
92
+ "PartType": {
93
+ "p": 0.9473684211,
94
+ "r": 0.9,
95
+ "f": 0.9230769231
96
+ },
97
+ "Strength": {
98
+ "p": 0.9897959184,
99
+ "r": 0.9797979798,
100
+ "f": 0.9847715736
101
+ },
102
+ "Reflex": {
103
+ "p": 0.9938461538,
104
+ "r": 0.990797546,
105
+ "f": 0.9923195084
106
+ },
107
+ "Poss": {
108
+ "p": 0.9792387543,
109
+ "r": 0.9895104895,
110
+ "f": 0.9843478261
111
+ },
112
+ "Position": {
113
+ "p": 0.9928057554,
114
+ "r": 0.9517241379,
115
+ "f": 0.9718309859
116
+ },
117
+ "Number[psor]": {
118
+ "p": 0.9295774648,
119
+ "r": 0.9565217391,
120
+ "f": 0.9428571429
121
+ },
122
+ "Abbr": {
123
+ "p": 0.9746835443,
124
+ "r": 0.9058823529,
125
+ "f": 0.9390243902
126
+ },
127
+ "Foreign": {
128
+ "p": 0.0,
129
+ "r": 0.0,
130
+ "f": 0.0
131
+ }
132
+ },
133
+ "dep_las_per_type": {
134
+ "case": {
135
+ "p": 0.9257307139,
136
+ "r": 0.9415204678,
137
+ "f": 0.9335588306
138
+ },
139
+ "det": {
140
+ "p": 0.9473684211,
141
+ "r": 0.9671052632,
142
+ "f": 0.9571351058
143
+ },
144
+ "nmod:tmod": {
145
+ "p": 0.4,
146
+ "r": 0.0465116279,
147
+ "f": 0.0833333333
148
+ },
149
+ "amod": {
150
+ "p": 0.8639212175,
151
+ "r": 0.8756805808,
152
+ "f": 0.8697611537
153
+ },
154
+ "cc": {
155
+ "p": 0.8669354839,
156
+ "r": 0.89958159,
157
+ "f": 0.8829568789
158
+ },
159
+ "conj": {
160
+ "p": 0.5984962406,
161
+ "r": 0.6012084592,
162
+ "f": 0.5998492841
163
+ },
164
+ "nmod": {
165
+ "p": 0.7883565797,
166
+ "r": 0.8217446271,
167
+ "f": 0.8047044259
168
+ },
169
+ "mark": {
170
+ "p": 0.8857142857,
171
+ "r": 0.9056179775,
172
+ "f": 0.8955555556
173
+ },
174
+ "fixed": {
175
+ "p": 0.8689217759,
176
+ "r": 0.7172774869,
177
+ "f": 0.7858508604
178
+ },
179
+ "nsubj": {
180
+ "p": 0.8333333333,
181
+ "r": 0.7814485388,
182
+ "f": 0.806557377
183
+ },
184
+ "advcl:tcl": {
185
+ "p": 0.0,
186
+ "r": 0.0,
187
+ "f": 0.0
188
+ },
189
+ "obj": {
190
+ "p": 0.7794117647,
191
+ "r": 0.8139931741,
192
+ "f": 0.796327212
193
+ },
194
+ "nummod": {
195
+ "p": 0.8703703704,
196
+ "r": 0.8676923077,
197
+ "f": 0.8690292758
198
+ },
199
+ "flat": {
200
+ "p": 0.7441860465,
201
+ "r": 0.6857142857,
202
+ "f": 0.7137546468
203
+ },
204
+ "obl": {
205
+ "p": 0.649068323,
206
+ "r": 0.7116912599,
207
+ "f": 0.6789388197
208
+ },
209
+ "nmod:pmod": {
210
+ "p": 0.44,
211
+ "r": 0.1692307692,
212
+ "f": 0.2444444444
213
+ },
214
+ "acl": {
215
+ "p": 0.7024793388,
216
+ "r": 0.7264957265,
217
+ "f": 0.7142857143
218
+ },
219
+ "advmod": {
220
+ "p": 0.7860962567,
221
+ "r": 0.7577319588,
222
+ "f": 0.7716535433
223
+ },
224
+ "expl:pv": {
225
+ "p": 0.7883597884,
226
+ "r": 0.7967914439,
227
+ "f": 0.7925531915
228
+ },
229
+ "root": {
230
+ "p": 0.917222964,
231
+ "r": 0.9135638298,
232
+ "f": 0.9153897402
233
+ },
234
+ "advcl": {
235
+ "p": 0.5625,
236
+ "r": 0.5853658537,
237
+ "f": 0.5737051793
238
+ },
239
+ "iobj": {
240
+ "p": 0.7591240876,
241
+ "r": 0.7027027027,
242
+ "f": 0.7298245614
243
+ },
244
+ "ccomp": {
245
+ "p": 0.7178217822,
246
+ "r": 0.8146067416,
247
+ "f": 0.7631578947
248
+ },
249
+ "goeswith": {
250
+ "p": 0.875,
251
+ "r": 0.5833333333,
252
+ "f": 0.7
253
+ },
254
+ "parataxis": {
255
+ "p": 0.7027027027,
256
+ "r": 0.5954198473,
257
+ "f": 0.6446280992
258
+ },
259
+ "expl:poss": {
260
+ "p": 0.5909090909,
261
+ "r": 0.6046511628,
262
+ "f": 0.5977011494
263
+ },
264
+ "cop": {
265
+ "p": 0.7647058824,
266
+ "r": 0.8024691358,
267
+ "f": 0.7831325301
268
+ },
269
+ "cc:preconj": {
270
+ "p": 0.0,
271
+ "r": 0.0,
272
+ "f": 0.0
273
+ },
274
+ "aux": {
275
+ "p": 0.9716713881,
276
+ "r": 0.9122340426,
277
+ "f": 0.9410150892
278
+ },
279
+ "expl": {
280
+ "p": 0.5294117647,
281
+ "r": 0.4186046512,
282
+ "f": 0.4675324675
283
+ },
284
+ "appos": {
285
+ "p": 0.4347826087,
286
+ "r": 0.396039604,
287
+ "f": 0.414507772
288
+ },
289
+ "xcomp": {
290
+ "p": 0.5441176471,
291
+ "r": 0.4512195122,
292
+ "f": 0.4933333333
293
+ },
294
+ "csubj": {
295
+ "p": 0.7966101695,
296
+ "r": 0.746031746,
297
+ "f": 0.7704918033
298
+ },
299
+ "nmod:agent": {
300
+ "p": 0.7285714286,
301
+ "r": 0.7846153846,
302
+ "f": 0.7555555556
303
+ },
304
+ "aux:pass": {
305
+ "p": 0.7769784173,
306
+ "r": 0.9,
307
+ "f": 0.833976834
308
+ },
309
+ "dep": {
310
+ "p": 0.0,
311
+ "r": 0.0,
312
+ "f": 0.0
313
+ },
314
+ "nsubj:pass": {
315
+ "p": 0.6111111111,
316
+ "r": 0.6644295302,
317
+ "f": 0.6366559486
318
+ },
319
+ "advmod:tmod": {
320
+ "p": 0.0,
321
+ "r": 0.0,
322
+ "f": 0.0
323
+ },
324
+ "expl:pass": {
325
+ "p": 0.6734693878,
326
+ "r": 0.7252747253,
327
+ "f": 0.6984126984
328
+ },
329
+ "ccomp:pmod": {
330
+ "p": 0.4,
331
+ "r": 0.2666666667,
332
+ "f": 0.32
333
+ },
334
+ "compound": {
335
+ "p": 0.25,
336
+ "r": 0.3333333333,
337
+ "f": 0.2857142857
338
+ },
339
+ "orphan": {
340
+ "p": 0.0,
341
+ "r": 0.0,
342
+ "f": 0.0
343
+ },
344
+ "expl:impers": {
345
+ "p": 0.3333333333,
346
+ "r": 0.1,
347
+ "f": 0.1538461538
348
+ },
349
+ "csubj:pass": {
350
+ "p": 0.25,
351
+ "r": 0.3333333333,
352
+ "f": 0.2857142857
353
+ },
354
+ "vocative": {
355
+ "p": 0.0,
356
+ "r": 0.0,
357
+ "f": 0.0
358
+ },
359
+ "discourse": {
360
+ "p": 0.0,
361
+ "r": 0.0,
362
+ "f": 0.0
363
+ }
364
+ },
365
+ "ents_per_type": {
366
+ "DATETIME": {
367
+ "p": 0.7852348993,
368
+ "r": 0.8153310105,
369
+ "f": 0.8
370
+ },
371
+ "ORGANIZATION": {
372
+ "p": 0.6873065015,
373
+ "r": 0.7070063694,
374
+ "f": 0.6970172684
375
+ },
376
+ "FACILITY": {
377
+ "p": 0.5317460317,
378
+ "r": 0.5114503817,
379
+ "f": 0.5214007782
380
+ },
381
+ "NUMERIC_VALUE": {
382
+ "p": 0.8978723404,
383
+ "r": 0.8940677966,
384
+ "f": 0.8959660297
385
+ },
386
+ "ORDINAL": {
387
+ "p": 0.7931034483,
388
+ "r": 0.8363636364,
389
+ "f": 0.814159292
390
+ },
391
+ "EVENT": {
392
+ "p": 0.5675675676,
393
+ "r": 0.5675675676,
394
+ "f": 0.5675675676
395
+ },
396
+ "GPE": {
397
+ "p": 0.8351409978,
398
+ "r": 0.8850574713,
399
+ "f": 0.859375
400
+ },
401
+ "PERSON": {
402
+ "p": 0.7360890302,
403
+ "r": 0.7768456376,
404
+ "f": 0.7559183673
405
+ },
406
+ "NAT_REL_POL": {
407
+ "p": 0.925170068,
408
+ "r": 0.9066666667,
409
+ "f": 0.9158249158
410
+ },
411
+ "MONEY": {
412
+ "p": 0.9411764706,
413
+ "r": 0.8275862069,
414
+ "f": 0.880733945
415
+ },
416
+ "PRODUCT": {
417
+ "p": 0.6260162602,
418
+ "r": 0.5620437956,
419
+ "f": 0.5923076923
420
+ },
421
+ "LOC": {
422
+ "p": 0.4886363636,
423
+ "r": 0.5657894737,
424
+ "f": 0.5243902439
425
+ },
426
+ "WORK_OF_ART": {
427
+ "p": 0.4285714286,
428
+ "r": 0.4736842105,
429
+ "f": 0.45
430
+ },
431
+ "QUANTITY": {
432
+ "p": 0.8620689655,
433
+ "r": 0.9615384615,
434
+ "f": 0.9090909091
435
+ },
436
+ "PERIOD": {
437
+ "p": 0.9428571429,
438
+ "r": 0.7857142857,
439
+ "f": 0.8571428571
440
+ },
441
+ "LANGUAGE": {
442
+ "p": 0.6,
443
+ "r": 0.75,
444
+ "f": 0.6666666667
445
+ }
446
+ }
447
+ }
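Each per-type entry in the metrics above reports precision `p`, recall `r` and `f`. Assuming `f` is the usual F1 score (the harmonic mean of `p` and `r`), which the listed numbers are consistent with, the values can be spot-checked with a few lines of Python. This is only a sketch: the helper name `f1` is illustrative and not part of spaCy.

```python
def f1(p: float, r: float) -> float:
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if p + r == 0:
        return 0.0
    return 2 * p * r / (p + r)


# Values copied from the "ents_per_type" block above.
language = {"p": 0.6, "r": 0.75, "f": 0.6666666667}
period = {"p": 0.9428571429, "r": 0.7857142857, "f": 0.8571428571}

for name, scores in (("LANGUAGE", language), ("PERIOD", period)):
    recomputed = f1(scores["p"], scores["r"])
    # Print the recomputed value and whether it matches the reported one.
    print(name, round(recomputed, 6), abs(recomputed - scores["f"]) < 1e-6)
```

Entries such as "Foreign" or "dep" with `p`, `r` and `f` all at 0.0 fall into the zero branch of the helper.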
attribute_ruler/patterns ADDED
Binary file (49.6 kB). View file
 
config.cfg ADDED
@@ -0,0 +1,260 @@
+ [paths]
+ train = "corpus/ro-dep-mixed/train.spacy"
+ dev = "corpus/ro-dep-mixed/dev.spacy"
+ vectors = "corpus/ro_vectors"
+ raw = null
+ init_tok2vec = null
+ vocab_data = null
+
+ [system]
+ gpu_allocator = null
+ seed = 0
+
+ [nlp]
+ lang = "ro"
+ pipeline = ["tok2vec","tagger","parser","senter","attribute_ruler","lemmatizer","ner"]
+ disabled = ["senter"]
+ before_creation = null
+ after_creation = null
+ after_pipeline_creation = null
+ batch_size = 256
+ tokenizer = {"@tokenizers":"spacy.Tokenizer.v1"}
+
+ [components]
+
+ [components.attribute_ruler]
+ factory = "attribute_ruler"
+ validate = false
+
+ [components.lemmatizer]
+ factory = "lemmatizer"
+ mode = "lookup"
+ model = null
+ overwrite = false
+
+ [components.ner]
+ factory = "ner"
+ incorrect_spans_key = null
+ moves = null
+ update_with_oracle_cut_size = 100
+
+ [components.ner.model]
+ @architectures = "spacy.TransitionBasedParser.v2"
+ state_type = "ner"
+ extra_state_tokens = false
+ hidden_width = 64
+ maxout_pieces = 2
+ use_upper = true
+ nO = null
+
+ [components.ner.model.tok2vec]
+ @architectures = "spacy.Tok2Vec.v2"
+
+ [components.ner.model.tok2vec.embed]
+ @architectures = "spacy.MultiHashEmbed.v2"
+ width = 96
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
+ rows = [5000,2500,2500,2500]
+ include_static_vectors = true
+
+ [components.ner.model.tok2vec.encode]
+ @architectures = "spacy.MaxoutWindowEncoder.v2"
+ width = 96
+ depth = 4
+ window_size = 1
+ maxout_pieces = 3
+
+ [components.parser]
+ factory = "parser"
+ learn_tokens = false
+ min_action_freq = 30
+ moves = null
+ update_with_oracle_cut_size = 100
+
+ [components.parser.model]
+ @architectures = "spacy.TransitionBasedParser.v2"
+ state_type = "parser"
+ extra_state_tokens = false
+ hidden_width = 64
+ maxout_pieces = 2
+ use_upper = true
+ nO = null
+
+ [components.parser.model.tok2vec]
+ @architectures = "spacy.Tok2VecListener.v1"
+ width = ${components.tok2vec.model.encode:width}
+ upstream = "tok2vec"
+
+ [components.senter]
+ factory = "senter"
+
+ [components.senter.model]
+ @architectures = "spacy.Tagger.v1"
+ nO = null
+
+ [components.senter.model.tok2vec]
+ @architectures = "spacy.Tok2Vec.v2"
+
+ [components.senter.model.tok2vec.embed]
+ @architectures = "spacy.MultiHashEmbed.v2"
+ width = 16
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
+ rows = [1000,500,500,500]
+ include_static_vectors = true
+
+ [components.senter.model.tok2vec.encode]
+ @architectures = "spacy.MaxoutWindowEncoder.v2"
+ width = 16
+ depth = 2
+ window_size = 1
+ maxout_pieces = 2
+
+ [components.tagger]
+ factory = "tagger"
+
+ [components.tagger.model]
+ @architectures = "spacy.Tagger.v1"
+ nO = null
+
+ [components.tagger.model.tok2vec]
+ @architectures = "spacy.Tok2VecListener.v1"
+ width = ${components.tok2vec.model.encode:width}
+ upstream = "tok2vec"
+
+ [components.tok2vec]
+ factory = "tok2vec"
+
+ [components.tok2vec.model]
+ @architectures = "spacy.Tok2Vec.v2"
+
+ [components.tok2vec.model.embed]
+ @architectures = "spacy.MultiHashEmbed.v2"
+ width = ${components.tok2vec.model.encode:width}
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
+ rows = [5000,2500,2500,2500]
+ include_static_vectors = true
+
+ [components.tok2vec.model.encode]
+ @architectures = "spacy.MaxoutWindowEncoder.v2"
+ width = 96
+ depth = 4
+ window_size = 1
+ maxout_pieces = 3
+
+ [corpora]
+
+ [corpora.dev]
+ @readers = "spacy.Corpus.v1"
+ limit = 0
+ max_length = 0
+ path = ${paths:dev}
+ gold_preproc = false
+ augmenter = null
+
+ [corpora.train]
+ @readers = "spacy.Corpus.v1"
+ path = ${paths:train}
+ max_length = 5000
+ gold_preproc = false
+ limit = 0
+
+ [corpora.train.augmenter]
+ @augmenters = "spacy.lower_case.v1"
+ level = 0.1
+
+ [training]
+ train_corpus = "corpora.train"
+ dev_corpus = "corpora.dev"
+ seed = ${system:seed}
+ gpu_allocator = ${system:gpu_allocator}
+ dropout = 0.1
+ accumulate_gradient = 1
+ patience = 5000
+ max_epochs = 0
+ max_steps = 0
+ eval_frequency = 1000
+ frozen_components = []
+ before_to_disk = null
+ annotating_components = []
+
+ [training.batcher]
+ @batchers = "spacy.batch_by_words.v1"
+ discard_oversize = false
+ tolerance = 0.2
+ get_length = null
+
+ [training.batcher.size]
+ @schedules = "compounding.v1"
+ start = 100
+ stop = 1000
+ compound = 1.001
+ t = 0.0
+
+ [training.logger]
+ @loggers = "spacy.WandbLogger.v1"
+ project_name = "spacy-v3.0.0a2"
+ remove_config_values = []
+
+ [training.optimizer]
+ @optimizers = "Adam.v1"
+ beta1 = 0.9
+ beta2 = 0.999
+ L2_is_weight_decay = true
+ L2 = 0.01
+ grad_clip = 1.0
+ use_averages = true
+ eps = 0.00000001
+ learn_rate = 0.001
+
+ [training.score_weights]
+ tag_acc = 0.16
+ dep_uas = 0.0
+ dep_las = 0.16
+ dep_las_per_type = null
+ sents_p = null
+ sents_r = null
+ sents_f = 0.02
+ lemma_acc = 0.33
+ ents_f = 0.33
+ ents_p = 0.0
+ ents_r = 0.0
+ ents_per_type = null
+
+ [pretraining]
+
+ [initialize]
+ vocab_data = ${paths.vocab_data}
+ vectors = ${paths.vectors}
+ init_tok2vec = ${paths.init_tok2vec}
+ before_init = null
+ after_init = null
+
+ [initialize.components]
+
+ [initialize.components.ner]
+
+ [initialize.components.ner.labels]
+ @readers = "spacy.read_labels.v1"
+ path = "corpus/labels/ner.json"
+ require = false
+
+ [initialize.components.parser]
+
+ [initialize.components.parser.labels]
+ @readers = "spacy.read_labels.v1"
+ path = "corpus/labels/parser.json"
+ require = false
+
+ [initialize.components.tagger]
+
+ [initialize.components.tagger.labels]
+ @readers = "spacy.read_labels.v1"
+ path = "corpus/labels/tagger.json"
+ require = false
+
+ [initialize.lookups]
+ @misc = "spacy.LookupsDataLoader.v1"
+ lang = ${nlp.lang}
+ tables = []
+
+ [initialize.tokenizer]
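The `[training.batcher.size]` block in config.cfg uses a compounding schedule with `start = 100`, `stop = 1000` and `compound = 1.001`. As a rough illustration of what such a schedule produces, the plain-Python sketch below multiplies the batch size by the compound factor at every step and caps it at `stop`. It is a re-implementation for illustration only, assuming a growing schedule (`start < stop`); the function name `compounding_sketch` is hypothetical and is not the Thinc API.

```python
from itertools import islice
from typing import Iterator


def compounding_sketch(start: float, stop: float, compound: float) -> Iterator[float]:
    """Yield start, start*compound, start*compound**2, ... capped at stop."""
    current = float(start)
    while True:
        yield min(current, stop)
        current *= compound


# Parameters copied from [training.batcher.size] in config.cfg.
sizes = list(islice(compounding_sketch(100, 1000, 1.001), 3000))

# First two sizes, and the size after 3000 steps (by then the cap applies).
print(sizes[0], sizes[1], sizes[-1])
```

With a factor this close to 1 the batch size grows slowly, so early updates use small batches and later updates settle at the configured maximum.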
lemmatizer/lookups/lookups.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:29d980bbecacfa6599448d2fc5a0e58900ecce80f8674ac1fb8fbdfd434fea11
+ size 5598187
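The three lines added for lemmatizer/lookups/lookups.bin are a Git LFS pointer rather than the binary itself: a spec version, an object id with its hash algorithm, and the size in bytes. A minimal sketch of reading such a pointer in Python follows; the function name `parse_lfs_pointer` is illustrative and not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer content copied from the diff above.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:29d980bbecacfa6599448d2fc5a0e58900ecce80f8674ac1fb8fbdfd434fea11
size 5598187
"""

info = parse_lfs_pointer(pointer)
print(info["hash_algo"], info["size"])
```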
meta.json ADDED
@@ -0,0 +1,1067 @@
1
+ {
2
+ "lang":"ro",
3
+ "name":"core_news_lg",
4
+ "version":"3.1.0",
5
+ "description":"Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
6
+ "author":"Explosion",
7
+ "email":"contact@explosion.ai",
8
+ "url":"https://explosion.ai",
9
+ "license":"CC BY-SA 4.0",
10
+ "spacy_version":">=3.1.0,<3.2.0",
11
+ "spacy_git_version":"caba63b74",
12
+ "vectors":{
13
+ "width":300,
14
+ "vectors":500000,
15
+ "keys":500000,
16
+ "name":"ro_vectors"
17
+ },
18
+ "labels":{
19
+ "tok2vec":[
20
+
21
+ ],
22
+ "tagger":[
23
+ "ARROW",
24
+ "Af",
25
+ "Afcfp-n",
26
+ "Afcfson",
27
+ "Afcfsrn",
28
+ "Afcmpoy",
29
+ "Afcms-n",
30
+ "Afp",
31
+ "Afp-p-n",
32
+ "Afp-poy",
33
+ "Afpf--n",
34
+ "Afpfp-n",
35
+ "Afpfp-ny",
36
+ "Afpfpoy",
37
+ "Afpfpry",
38
+ "Afpfson",
39
+ "Afpfsoy",
40
+ "Afpfsrn",
41
+ "Afpfsry",
42
+ "Afpm--n",
43
+ "Afpmp-n",
44
+ "Afpmpoy",
45
+ "Afpmpry",
46
+ "Afpms-n",
47
+ "Afpmsoy",
48
+ "Afpmsry",
49
+ "Afsfp-n",
50
+ "Afsfsrn",
51
+ "BULLET",
52
+ "COLON",
53
+ "COMMA",
54
+ "Ccssp",
55
+ "Ccsspy",
56
+ "Crssp",
57
+ "Csssp",
58
+ "Cssspy",
59
+ "DASH",
60
+ "DBLQ",
61
+ "Dd3-po---e",
62
+ "Dd3-po---o",
63
+ "Dd3fpo",
64
+ "Dd3fpr",
65
+ "Dd3fpr---e",
66
+ "Dd3fpr---o",
67
+ "Dd3fpr--y",
68
+ "Dd3fso",
69
+ "Dd3fso---e",
70
+ "Dd3fsr",
71
+ "Dd3fsr---e",
72
+ "Dd3fsr---o",
73
+ "Dd3fsr--yo",
74
+ "Dd3mpo",
75
+ "Dd3mpr",
76
+ "Dd3mpr---e",
77
+ "Dd3mpr---o",
78
+ "Dd3mso---e",
79
+ "Dd3msr",
80
+ "Dd3msr---e",
81
+ "Dd3msr---o",
82
+ "Dh1ms",
83
+ "Dh3fp",
84
+ "Dh3fso",
85
+ "Dh3fsr",
86
+ "Dh3mp",
87
+ "Dh3ms",
88
+ "Di3",
89
+ "Di3-----y",
90
+ "Di3--r---e",
91
+ "Di3-po",
92
+ "Di3-po---e",
93
+ "Di3-sr",
94
+ "Di3-sr---e",
95
+ "Di3-sr--y",
96
+ "Di3fp",
97
+ "Di3fpr",
98
+ "Di3fpr---e",
99
+ "Di3fso",
100
+ "Di3fso---e",
101
+ "Di3fsr",
102
+ "Di3fsr---e",
103
+ "Di3mp",
104
+ "Di3mpr",
105
+ "Di3mpr---e",
106
+ "Di3ms",
107
+ "Di3ms----e",
108
+ "Di3mso---e",
109
+ "Di3msr",
110
+ "Di3msr---e",
111
+ "Ds1fp-p",
112
+ "Ds1fp-s",
113
+ "Ds1fsop",
114
+ "Ds1fsos",
115
+ "Ds1fsrp",
116
+ "Ds1fsrs",
117
+ "Ds1fsrs-y",
118
+ "Ds1mp-p",
119
+ "Ds1mp-s",
120
+ "Ds1ms-p",
121
+ "Ds1ms-s",
122
+ "Ds1msrs-y",
123
+ "Ds2---s",
124
+ "Ds2fp-p",
125
+ "Ds2fp-s",
126
+ "Ds2fsrp",
127
+ "Ds2fsrs",
128
+ "Ds2mp-p",
129
+ "Ds2mp-s",
130
+ "Ds2ms-p",
131
+ "Ds2ms-s",
132
+ "Ds3---p",
133
+ "Ds3---s",
134
+ "Ds3fp-s",
135
+ "Ds3fsos",
136
+ "Ds3fsrs",
137
+ "Ds3mp-s",
138
+ "Ds3ms-s",
139
+ "Dw3--r---e",
140
+ "Dw3-po---e",
141
+ "Dw3fpr",
142
+ "Dw3fso---e",
143
+ "Dw3fsr",
144
+ "Dw3mpr",
145
+ "Dw3mso---e",
146
+ "Dw3msr",
147
+ "Dz3fsr---e",
148
+ "Dz3mso---e",
149
+ "Dz3msr---e",
150
+ "EQUAL",
151
+ "EXCL",
152
+ "EXCLHELLIP",
153
+ "GE",
154
+ "GT",
155
+ "HELLIP",
156
+ "I",
157
+ "LCURL",
158
+ "LPAR",
159
+ "LSQR",
160
+ "LT",
161
+ "M",
162
+ "Mc",
163
+ "Mc-p-d",
164
+ "Mc-p-l",
165
+ "Mcfp-l",
166
+ "Mcfp-ln",
167
+ "Mcfprln",
168
+ "Mcfprly",
169
+ "Mcfsoln",
170
+ "Mcfsrln",
171
+ "Mcmp-l",
172
+ "Mcms-ln",
173
+ "Mcmsrl",
174
+ "Mcmsrly",
175
+ "Mffprln",
176
+ "Mffsrln",
177
+ "Mlfpo",
178
+ "Mlfpr",
179
+ "Mlmpr",
180
+ "Mo---l",
181
+ "Mo---ln",
182
+ "Mo-s-r",
183
+ "Mofp-ln",
184
+ "Mofpoly",
185
+ "Mofprly",
186
+ "Mofs-l",
187
+ "Mofsoln",
188
+ "Mofsoly",
189
+ "Mofsrln",
190
+ "Mofsrly",
191
+ "Mompoly",
192
+ "Momprly",
193
+ "Moms-l",
194
+ "Moms-ln",
195
+ "Momsoly",
196
+ "Momsrly",
197
+ "Nc",
198
+ "Nc---n",
199
+ "Ncf--n",
200
+ "Ncfp-n",
201
+ "Ncfpoy",
202
+ "Ncfpry",
203
+ "Ncfs-n",
204
+ "Ncfson",
205
+ "Ncfsoy",
206
+ "Ncfsrn",
207
+ "Ncfsry",
208
+ "Ncfsryy",
209
+ "Ncfsvy",
210
+ "Ncm--n",
211
+ "Ncmp-n",
212
+ "Ncmpoy",
213
+ "Ncmpry",
214
+ "Ncms-n",
215
+ "Ncms-ny",
216
+ "Ncms-y",
217
+ "Ncmsoy",
218
+ "Ncmsrn",
219
+ "Ncmsry",
220
+ "Ncmsryy",
221
+ "Ncmsvn",
222
+ "Ncmsvy",
223
+ "Np",
224
+ "Npfson",
225
+ "Npfsoy",
226
+ "Npfsrn",
227
+ "Npfsry",
228
+ "Npmpoy",
229
+ "Npmpry",
230
+ "Npms-n",
231
+ "Npmsoy",
232
+ "Npmsry",
233
+ "PERCENT",
234
+ "PERIOD",
235
+ "PLUS",
236
+ "PLUSMINUS",
237
+ "Pd3-po",
238
+ "Pd3fpr",
239
+ "Pd3fso",
240
+ "Pd3fsr",
241
+ "Pd3mpo",
242
+ "Pd3mpr",
243
+ "Pd3mpr--y",
244
+ "Pd3mso",
245
+ "Pd3msr",
246
+ "Pi3",
247
+ "Pi3--r",
248
+ "Pi3-po",
249
+ "Pi3-so",
250
+ "Pi3-sr",
251
+ "Pi3fpr",
252
+ "Pi3fso",
253
+ "Pi3fsr",
254
+ "Pi3mpr",
255
+ "Pi3mso",
256
+ "Pi3msr",
257
+ "Pi3msr--y",
258
+ "Pp1-pa--------w",
259
+ "Pp1-pa--y-----w",
260
+ "Pp1-pd--------s",
261
+ "Pp1-pd--------w",
262
+ "Pp1-pd--y-----w",
263
+ "Pp1-pr--------s",
264
+ "Pp1-sa--------s",
265
+ "Pp1-sa--------w",
266
+ "Pp1-sa--y-----w",
267
+ "Pp1-sd--------s",
268
+ "Pp1-sd--------w",
269
+ "Pp1-sd--y-----w",
270
+ "Pp1-sn--------s",
271
+ "Pp2-----------s",
272
+ "Pp2-pa--------w",
273
+ "Pp2-pa--y-----w",
274
+ "Pp2-pd--------w",
275
+ "Pp2-pd--y-----w",
276
+ "Pp2-pr--------s",
277
+ "Pp2-sa--------s",
278
+ "Pp2-sa--------w",
279
+ "Pp2-sa--y-----w",
280
+ "Pp2-sd--------s",
281
+ "Pp2-sd--------w",
282
+ "Pp2-sd--y-----w",
283
+ "Pp2-sn--------s",
284
+ "Pp2-so--------s",
285
+ "Pp2-sr--------s",
286
+ "Pp3-p---------s",
287
+ "Pp3-pd--------w",
288
+ "Pp3-pd--y-----w",
289
+ "Pp3-po--------s",
290
+ "Pp3-sd--------w",
291
+ "Pp3-sd--y-----w",
292
+ "Pp3fpa--------w",
293
+ "Pp3fpa--y-----w",
294
+ "Pp3fpr--------s",
295
+ "Pp3fs---------s",
296
+ "Pp3fsa--------w",
297
+ "Pp3fsa--y-----w",
298
+ "Pp3fso--------s",
299
+ "Pp3fsr--------s",
300
+ "Pp3fsr--y-----s",
301
+ "Pp3mpa--------w",
302
+ "Pp3mpa--y-----w",
303
+ "Pp3mpr--------s",
304
+ "Pp3ms---------s",
305
+ "Pp3msa--------w",
306
+ "Pp3msa--y-----w",
307
+ "Pp3mso--------s",
308
+ "Pp3msr--------s",
309
+ "Pp3msr--y-----s",
310
+ "Ps1fp-s",
311
+ "Ps1fsrp",
312
+ "Ps1fsrs",
313
+ "Ps1mp-p",
314
+ "Ps1ms-p",
315
+ "Ps2fp-s",
316
+ "Ps2fsrp",
317
+ "Ps2fsrs",
318
+ "Ps2ms-s",
319
+ "Ps3---p",
320
+ "Ps3---s",
321
+ "Ps3fp-s",
322
+ "Ps3fsrs",
323
+ "Ps3mp-s",
324
+ "Ps3ms-s",
325
+ "Pw3--r",
326
+ "Pw3-po",
327
+ "Pw3-so",
328
+ "Pw3fpr",
329
+ "Pw3fso",
330
+ "Pw3mpr",
331
+ "Pw3mso",
332
+ "Px3--a--------s",
333
+ "Px3--a--------w",
334
+ "Px3--a--y-----w",
335
+ "Px3--d--------w",
336
+ "Px3--d--y-----w",
337
+ "Pz3-sr",
338
+ "Pz3fsr",
339
+ "QUEST",
340
+ "QUOT",
341
+ "Qf",
342
+ "Qn",
343
+ "Qs",
344
+ "Qs-y",
345
+ "Qz",
346
+ "Qz-y",
347
+ "RCURL",
348
+ "RPAR",
349
+ "RSQR",
350
+ "Rc",
351
+ "Rgc",
352
+ "Rgp",
353
+ "Rgpy",
354
+ "Rgs",
355
+ "Rp",
356
+ "Rw",
357
+ "Rw-y",
358
+ "Rz",
359
+ "SCOLON",
360
+ "SLASH",
361
+ "STAR",
362
+ "Sp",
363
+ "Spsa",
364
+ "Spsay",
365
+ "Spsd",
366
+ "Spsg",
367
+ "Td-po",
368
+ "Tdfpr",
369
+ "Tdfso",
370
+ "Tdfsr",
371
+ "Tdmpr",
372
+ "Tdmso",
373
+ "Tdmsr",
374
+ "Tf-so",
375
+ "Tffpoy",
376
+ "Tffpry",
377
+ "Tffs-y",
378
+ "Tfmpoy",
379
+ "Tfms-y",
380
+ "Tfmsoy",
381
+ "Tfmsry",
382
+ "Ti-po",
383
+ "Tifp-y",
384
+ "Tifso",
385
+ "Tifsr",
386
+ "Timso",
387
+ "Timsr",
388
+ "Tsfp",
389
+ "Tsfs",
390
+ "Tsmp",
391
+ "Tsms",
392
+ "UNDERSC",
393
+ "Va--1",
394
+ "Va--1-----y",
395
+ "Va--1p",
396
+ "Va--1s",
397
+ "Va--1s----y",
398
+ "Va--2p",
399
+ "Va--2p----y",
400
+ "Va--2s",
401
+ "Va--2s----y",
402
+ "Va--3",
403
+ "Va--3-----y",
404
+ "Va--3p",
405
+ "Va--3p----y",
406
+ "Va--3s",
407
+ "Va--3s----y",
408
+ "Vag",
409
+ "Vaii1",
410
+ "Vaii2s",
411
+ "Vaii3p",
412
+ "Vaii3s",
413
+ "Vail3p",
414
+ "Vail3s",
415
+ "Vaip1p",
416
+ "Vaip1s",
417
+ "Vaip2p",
418
+ "Vaip2s",
419
+ "Vaip3p",
420
+ "Vaip3p----y",
421
+ "Vaip3s",
422
+ "Vaip3s----y",
423
+ "Vais3p",
424
+ "Vais3s",
425
+ "Vam-2s",
426
+ "Vanp",
427
+ "Vap--sm",
428
+ "Vasp1p",
429
+ "Vasp1s",
430
+ "Vasp2p",
431
+ "Vasp2s",
432
+ "Vasp3",
433
+ "Vmg",
434
+ "Vmg-------y",
435
+ "Vmii1",
436
+ "Vmii1-----y",
437
+ "Vmii2p",
438
+ "Vmii2s",
439
+ "Vmii3p",
440
+ "Vmii3p----y",
441
+ "Vmii3s",
442
+ "Vmii3s----y",
443
+ "Vmil1",
444
+ "Vmil1p",
445
+ "Vmil2s",
446
+ "Vmil3p",
447
+ "Vmil3p----y",
448
+ "Vmil3s",
449
+ "Vmil3s----y",
450
+ "Vmip1p",
451
+ "Vmip1p----y",
452
+ "Vmip1s",
453
+ "Vmip1s----y",
454
+ "Vmip2p",
455
+ "Vmip2s",
456
+ "Vmip2s----y",
457
+ "Vmip3",
458
+ "Vmip3-----y",
459
+ "Vmip3p",
460
+ "Vmip3s",
461
+ "Vmip3s----y",
462
+ "Vmis1p",
463
+ "Vmis1s",
464
+ "Vmis3p",
465
+ "Vmis3p----y",
466
+ "Vmis3s",
467
+ "Vmis3s----y",
468
+ "Vmm-2p",
469
+ "Vmm-2s",
470
+ "Vmnp",
471
+ "Vmnp------y",
472
+ "Vmp--pf",
473
+ "Vmp--pm",
474
+ "Vmp--sf",
475
+ "Vmp--sm",
476
+ "Vmp--sm---y",
477
+ "Vmsp1p",
478
+ "Vmsp1s",
479
+ "Vmsp2s",
480
+ "Vmsp3",
481
+ "Vmsp3-----y",
482
+ "X",
483
+ "Y",
484
+ "Ya",
485
+ "Yn",
486
+ "Ynfsoy",
487
+ "Ynfsry",
488
+ "Ynmsoy",
489
+ "Ynmsry",
490
+ "Yp",
491
+ "Yp-sr",
492
+ "Yr"
493
+ ],
494
+ "parser":[
495
+ "ROOT",
496
+ "acl",
497
+ "advcl",
498
+ "advcl:tcl",
499
+ "advmod",
500
+ "advmod:tmod",
501
+ "amod",
502
+ "appos",
503
+ "aux",
504
+ "aux:pass",
505
+ "case",
506
+ "cc",
507
+ "cc:preconj",
508
+ "ccomp",
509
+ "ccomp:pmod",
510
+ "compound",
511
+ "conj",
512
+ "cop",
513
+ "csubj",
514
+ "csubj:pass",
515
+ "dep",
516
+ "det",
517
+ "expl",
518
+ "expl:impers",
519
+ "expl:pass",
520
+ "expl:poss",
521
+ "expl:pv",
522
+ "fixed",
523
+ "flat",
524
+ "goeswith",
525
+ "iobj",
526
+ "mark",
527
+ "nmod",
528
+ "nmod:agent",
529
+ "nmod:pmod",
530
+ "nmod:tmod",
531
+ "nsubj",
532
+ "nsubj:pass",
533
+ "nummod",
534
+ "obj",
535
+ "obl",
536
+ "orphan",
537
+ "parataxis",
538
+ "punct",
539
+ "vocative",
540
+ "xcomp"
541
+ ],
542
+ "senter":[
543
+ "I",
544
+ "S"
545
+ ],
546
+ "attribute_ruler":[
547
+
548
+ ],
549
+ "lemmatizer":[
550
+
551
+ ],
552
+ "ner":[
553
+ "DATETIME",
554
+ "EVENT",
555
+ "FACILITY",
556
+ "GPE",
557
+ "LANGUAGE",
558
+ "LOC",
559
+ "MONEY",
560
+ "NAT_REL_POL",
561
+ "NUMERIC_VALUE",
562
+ "ORDINAL",
563
+ "ORGANIZATION",
564
+ "PERIOD",
565
+ "PERSON",
566
+ "PRODUCT",
567
+ "QUANTITY",
568
+ "WORK_OF_ART"
569
+ ]
570
+ },
571
+ "pipeline":[
572
+ "tok2vec",
573
+ "tagger",
574
+ "parser",
575
+ "attribute_ruler",
576
+ "lemmatizer",
577
+ "ner"
578
+ ],
579
+ "components":[
580
+ "tok2vec",
581
+ "tagger",
582
+ "parser",
583
+ "senter",
584
+ "attribute_ruler",
585
+ "lemmatizer",
586
+ "ner"
587
+ ],
588
+ "disabled":[
589
+ "senter"
590
+ ],
591
+ "performance":{
592
+ "token_acc":0.9990029326,
593
+ "tag_acc":0.975315026,
594
+ "pos_acc":0.965353945,
595
+ "morph_acc":0.9760744713,
596
+ "lemma_acc":0.8186589263,
597
+ "dep_uas":0.8904573687,
598
+ "dep_las":0.8467281511,
599
+ "ents_p":0.7588147037,
600
+ "ents_r":0.7771801767,
601
+ "ents_f":0.7678876447,
602
+ "sents_p":0.9533954727,
603
+ "sents_r":0.9521276596,
604
+ "sents_f":0.9527611444,
605
+ "speed":10281.5880630945,
606
+ "morph_per_feat":{
607
+ "AdpType":{
608
+ "p":0.9970784641,
609
+ "r":0.9941739492,
610
+ "f":0.9956240884
611
+ },
612
+ "Case":{
613
+ "p":0.9896781203,
614
+ "r":0.9840648211,
615
+ "f":0.9868634886
616
+ },
617
+ "Variant":{
618
+ "p":0.9845559846,
619
+ "r":0.9205776173,
620
+ "f":0.9514925373
621
+ },
622
+ "Gender":{
623
+ "p":0.9840800225,
624
+ "r":0.9782913165,
625
+ "f":0.9811771316
626
+ },
627
+ "Number":{
628
+ "p":0.9833276236,
629
+ "r":0.9778560073,
630
+ "f":0.9805841827
631
+ },
632
+ "PronType":{
633
+ "p":0.9938366718,
634
+ "r":0.987244898,
635
+ "f":0.9905298183
636
+ },
637
+ "Definite":{
638
+ "p":0.9784728611,
639
+ "r":0.9725676664,
640
+ "f":0.9755113272
641
+ },
642
+ "Degree":{
643
+ "p":0.9530685921,
644
+ "r":0.9428571429,
645
+ "f":0.947935368
646
+ },
647
+ "Polarity":{
648
+ "p":0.9884467266,
649
+ "r":0.985915493,
650
+ "f":0.9871794872
651
+ },
652
+ "Mood":{
653
+ "p":0.9740072202,
654
+ "r":0.9635714286,
655
+ "f":0.9687612208
656
+ },
657
+ "Person":{
658
+ "p":0.9837338262,
659
+ "r":0.9718772827,
660
+ "f":0.9777696123
661
+ },
662
+ "Tense":{
663
+ "p":0.9730337079,
664
+ "r":0.9572586588,
665
+ "f":0.9650817236
666
+ },
667
+ "VerbForm":{
668
+ "p":0.9698996656,
669
+ "r":0.9593572779,
670
+ "f":0.9645996674
671
+ },
672
+ "NumForm":{
673
+ "p":0.9901960784,
674
+ "r":0.9853658537,
675
+ "f":0.9877750611
676
+ },
677
+ "NumType":{
678
+ "p":0.9927536232,
679
+ "r":0.9856115108,
680
+ "f":0.9891696751
681
+ },
682
+ "PartType":{
683
+ "p":0.9473684211,
684
+ "r":0.9,
685
+ "f":0.9230769231
686
+ },
687
+ "Strength":{
688
+ "p":0.9897959184,
689
+ "r":0.9797979798,
690
+ "f":0.9847715736
691
+ },
692
+ "Reflex":{
693
+ "p":0.9938461538,
694
+ "r":0.990797546,
695
+ "f":0.9923195084
696
+ },
697
+ "Poss":{
698
+ "p":0.9792387543,
699
+ "r":0.9895104895,
700
+ "f":0.9843478261
701
+ },
702
+ "Position":{
703
+ "p":0.9928057554,
704
+ "r":0.9517241379,
705
+ "f":0.9718309859
706
+ },
707
+ "Number[psor]":{
708
+ "p":0.9295774648,
709
+ "r":0.9565217391,
710
+ "f":0.9428571429
711
+ },
712
+ "Abbr":{
713
+ "p":0.9746835443,
714
+ "r":0.9058823529,
715
+ "f":0.9390243902
716
+ },
717
+ "Foreign":{
718
+ "p":0.0,
719
+ "r":0.0,
720
+ "f":0.0
721
+ }
722
+ },
723
+ "dep_las_per_type":{
724
+ "case":{
725
+ "p":0.9257307139,
726
+ "r":0.9415204678,
727
+ "f":0.9335588306
728
+ },
729
+ "det":{
730
+ "p":0.9473684211,
731
+ "r":0.9671052632,
732
+ "f":0.9571351058
733
+ },
734
+ "nmod:tmod":{
735
+ "p":0.4,
736
+ "r":0.0465116279,
737
+ "f":0.0833333333
738
+ },
739
+ "amod":{
740
+ "p":0.8639212175,
741
+ "r":0.8756805808,
742
+ "f":0.8697611537
743
+ },
744
+ "cc":{
745
+ "p":0.8669354839,
746
+ "r":0.89958159,
747
+ "f":0.8829568789
748
+ },
749
+ "conj":{
750
+ "p":0.5984962406,
751
+ "r":0.6012084592,
752
+ "f":0.5998492841
753
+ },
754
+ "nmod":{
755
+ "p":0.7883565797,
756
+ "r":0.8217446271,
757
+ "f":0.8047044259
758
+ },
759
+ "mark":{
760
+ "p":0.8857142857,
761
+ "r":0.9056179775,
762
+ "f":0.8955555556
763
+ },
764
+ "fixed":{
765
+ "p":0.8689217759,
766
+ "r":0.7172774869,
767
+ "f":0.7858508604
768
+ },
769
+ "nsubj":{
770
+ "p":0.8333333333,
771
+ "r":0.7814485388,
772
+ "f":0.806557377
773
+ },
774
+ "advcl:tcl":{
775
+ "p":0.0,
776
+ "r":0.0,
777
+ "f":0.0
778
+ },
779
+ "obj":{
780
+ "p":0.7794117647,
781
+ "r":0.8139931741,
782
+ "f":0.796327212
783
+ },
784
+ "nummod":{
785
+ "p":0.8703703704,
786
+ "r":0.8676923077,
787
+ "f":0.8690292758
788
+ },
789
+ "flat":{
790
+ "p":0.7441860465,
791
+ "r":0.6857142857,
792
+ "f":0.7137546468
793
+ },
794
+ "obl":{
795
+ "p":0.649068323,
796
+ "r":0.7116912599,
797
+ "f":0.6789388197
798
+ },
799
+ "nmod:pmod":{
800
+ "p":0.44,
801
+ "r":0.1692307692,
802
+ "f":0.2444444444
803
+ },
804
+ "acl":{
805
+ "p":0.7024793388,
806
+ "r":0.7264957265,
807
+ "f":0.7142857143
808
+ },
809
+ "advmod":{
810
+ "p":0.7860962567,
811
+ "r":0.7577319588,
812
+ "f":0.7716535433
813
+ },
814
+ "expl:pv":{
815
+ "p":0.7883597884,
816
+ "r":0.7967914439,
817
+ "f":0.7925531915
818
+ },
819
+ "root":{
820
+ "p":0.917222964,
821
+ "r":0.9135638298,
822
+ "f":0.9153897402
823
+ },
824
+ "advcl":{
825
+ "p":0.5625,
826
+ "r":0.5853658537,
827
+ "f":0.5737051793
828
+ },
829
+ "iobj":{
830
+ "p":0.7591240876,
831
+ "r":0.7027027027,
832
+ "f":0.7298245614
833
+ },
834
+ "ccomp":{
835
+ "p":0.7178217822,
836
+ "r":0.8146067416,
837
+ "f":0.7631578947
838
+ },
839
+ "goeswith":{
840
+ "p":0.875,
841
+ "r":0.5833333333,
842
+ "f":0.7
843
+ },
844
+ "parataxis":{
845
+ "p":0.7027027027,
846
+ "r":0.5954198473,
847
+ "f":0.6446280992
848
+ },
849
+ "expl:poss":{
850
+ "p":0.5909090909,
851
+ "r":0.6046511628,
852
+ "f":0.5977011494
853
+ },
854
+ "cop":{
855
+ "p":0.7647058824,
856
+ "r":0.8024691358,
857
+ "f":0.7831325301
858
+ },
859
+ "cc:preconj":{
860
+ "p":0.0,
861
+ "r":0.0,
862
+ "f":0.0
863
+ },
864
+ "aux":{
865
+ "p":0.9716713881,
866
+ "r":0.9122340426,
867
+ "f":0.9410150892
868
+ },
869
+ "expl":{
870
+ "p":0.5294117647,
871
+ "r":0.4186046512,
872
+ "f":0.4675324675
873
+ },
874
+ "appos":{
875
+ "p":0.4347826087,
876
+ "r":0.396039604,
877
+ "f":0.414507772
878
+ },
879
+ "xcomp":{
880
+ "p":0.5441176471,
881
+ "r":0.4512195122,
882
+ "f":0.4933333333
883
+ },
884
+ "csubj":{
885
+ "p":0.7966101695,
886
+ "r":0.746031746,
887
+ "f":0.7704918033
888
+ },
889
+ "nmod:agent":{
890
+ "p":0.7285714286,
891
+ "r":0.7846153846,
892
+ "f":0.7555555556
893
+ },
894
+ "aux:pass":{
895
+ "p":0.7769784173,
896
+ "r":0.9,
897
+ "f":0.833976834
898
+ },
899
+ "dep":{
900
+ "p":0.0,
901
+ "r":0.0,
902
+ "f":0.0
903
+ },
904
+ "nsubj:pass":{
905
+ "p":0.6111111111,
906
+ "r":0.6644295302,
907
+ "f":0.6366559486
908
+ },
909
+ "advmod:tmod":{
910
+ "p":0.0,
911
+ "r":0.0,
912
+ "f":0.0
913
+ },
914
+ "expl:pass":{
915
+ "p":0.6734693878,
916
+ "r":0.7252747253,
917
+ "f":0.6984126984
918
+ },
919
+ "ccomp:pmod":{
920
+ "p":0.4,
921
+ "r":0.2666666667,
922
+ "f":0.32
923
+ },
924
+ "compound":{
925
+ "p":0.25,
926
+ "r":0.3333333333,
927
+ "f":0.2857142857
928
+ },
929
+ "orphan":{
930
+ "p":0.0,
931
+ "r":0.0,
932
+ "f":0.0
933
+ },
934
+ "expl:impers":{
935
+ "p":0.3333333333,
936
+ "r":0.1,
937
+ "f":0.1538461538
938
+ },
939
+ "csubj:pass":{
940
+ "p":0.25,
941
+ "r":0.3333333333,
942
+ "f":0.2857142857
943
+ },
944
+ "vocative":{
945
+ "p":0.0,
946
+ "r":0.0,
947
+ "f":0.0
948
+ },
949
+ "discourse":{
950
+ "p":0.0,
951
+ "r":0.0,
952
+ "f":0.0
953
+ }
954
+ },
955
+ "ents_per_type":{
956
+ "DATETIME":{
957
+ "p":0.7852348993,
958
+ "r":0.8153310105,
959
+ "f":0.8
960
+ },
961
+ "ORGANIZATION":{
962
+ "p":0.6873065015,
963
+ "r":0.7070063694,
964
+ "f":0.6970172684
965
+ },
966
+ "FACILITY":{
967
+ "p":0.5317460317,
968
+ "r":0.5114503817,
969
+ "f":0.5214007782
970
+ },
971
+ "NUMERIC_VALUE":{
972
+ "p":0.8978723404,
973
+ "r":0.8940677966,
974
+ "f":0.8959660297
975
+ },
976
+ "ORDINAL":{
977
+ "p":0.7931034483,
978
+ "r":0.8363636364,
979
+ "f":0.814159292
980
+ },
981
+ "EVENT":{
982
+ "p":0.5675675676,
983
+ "r":0.5675675676,
984
+ "f":0.5675675676
985
+ },
986
+ "GPE":{
987
+ "p":0.8351409978,
988
+ "r":0.8850574713,
989
+ "f":0.859375
990
+ },
991
+ "PERSON":{
992
+ "p":0.7360890302,
993
+ "r":0.7768456376,
994
+ "f":0.7559183673
995
+ },
996
+ "NAT_REL_POL":{
997
+ "p":0.925170068,
998
+ "r":0.9066666667,
999
+ "f":0.9158249158
1000
+ },
1001
+ "MONEY":{
1002
+ "p":0.9411764706,
1003
+ "r":0.8275862069,
1004
+ "f":0.880733945
1005
+ },
1006
+ "PRODUCT":{
1007
+ "p":0.6260162602,
1008
+ "r":0.5620437956,
1009
+ "f":0.5923076923
1010
+ },
1011
+ "LOC":{
1012
+ "p":0.4886363636,
1013
+ "r":0.5657894737,
1014
+ "f":0.5243902439
1015
+ },
1016
+ "WORK_OF_ART":{
1017
+ "p":0.4285714286,
1018
+ "r":0.4736842105,
1019
+ "f":0.45
1020
+ },
1021
+ "QUANTITY":{
1022
+ "p":0.8620689655,
1023
+ "r":0.9615384615,
1024
+ "f":0.9090909091
1025
+ },
1026
+ "PERIOD":{
1027
+ "p":0.9428571429,
1028
+ "r":0.7857142857,
1029
+ "f":0.8571428571
1030
+ },
1031
+ "LANGUAGE":{
1032
+ "p":0.6,
1033
+ "r":0.75,
1034
+ "f":0.6666666667
1035
+ }
1036
+ }
1037
+ },
1038
+ "sources":[
1039
+ {
1040
+ "name":"Lemmatization Lists",
1041
+ "url":"https://github.com/michmech/lemmatization-lists/",
1042
+ "license":"ODbL",
1043
+ "author":"Michal M\u011bchura"
1044
+ },
1045
+ {
1046
+ "name":"UD Romanian RRT v2.5",
1047
+ "url":"https://github.com/UniversalDependencies/UD_Romanian-RRT",
1048
+ "license":"CC BY-SA 4.0",
1049
+ "author":"Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin"
1050
+ },
1051
+ {
1052
+ "name":"RONEC - the Romanian Named Entity Corpus (ca9ce460)",
1053
+ "url":"https://github.com/dumitrescustefan/ronec",
1054
+ "license":"MIT",
1055
+ "author":"Dumitrescu, Stefan Daniel; Avram, Andrei-Marius; Morogan, Luciana; Toma; Stefan"
1056
+ },
1057
+ {
1058
+ "name":"Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)",
1059
+ "url":"https://spacy.io",
1060
+ "license":"CC0",
1061
+ "author":"Explosion"
1062
+ }
1063
+ ],
1064
+ "requirements":[
1065
+
1066
+ ]
1067
+ }
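Note on the scores above: each `p`/`r`/`f` triple appears consistent with `f` being the harmonic mean of precision and recall. The following is a minimal sketch, not part of the commit, that recomputes one entry; the helper name `f_score` is hypothetical, and the zero guard is an assumption chosen to match the all-zero entries such as `dep`.

```python
def f_score(p: float, r: float) -> float:
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if p + r == 0.0:
        return 0.0
    return 2.0 * p * r / (p + r)


# Values copied from the "xcomp" entry in the diff above.
xcomp = {"p": 0.5441176471, "r": 0.4512195122, "f": 0.4933333333}

recomputed = f_score(xcomp["p"], xcomp["r"])

# The recomputed value should agree with the stored one up to rounding.
assert abs(recomputed - xcomp["f"]) < 1e-6
```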
ner/cfg ADDED
@@ -0,0 +1,13 @@
1
+ {
2
+ "moves":null,
3
+ "update_with_oracle_cut_size":100,
4
+ "multitasks":[
5
+
6
+ ],
7
+ "min_action_freq":1,
8
+ "learn_tokens":false,
9
+ "beam_width":1,
10
+ "beam_density":0.0,
11
+ "beam_update_prob":0.0,
12
+ "incorrect_spans_key":null
13
+ }
ner/model ADDED
Binary file (6.95 MB). View file
 
ner/moves ADDED
Binary file (1.05 kB). View file
 
parser/cfg ADDED
@@ -0,0 +1,13 @@
1
+ {
2
+ "moves":null,
3
+ "update_with_oracle_cut_size":100,
4
+ "multitasks":[
5
+
6
+ ],
7
+ "min_action_freq":30,
8
+ "learn_tokens":false,
9
+ "beam_width":1,
10
+ "beam_density":0.0,
11
+ "beam_update_prob":0.0,
12
+ "incorrect_spans_key":null
13
+ }
parser/model ADDED
Binary file (312 kB). View file
 
parser/moves ADDED
@@ -0,0 +1 @@
1
+ ��moves�{"0":{"":85972},"1":{"":90580},"2":{"case":22318,"punct":9077,"det":9009,"nsubj":7125,"advmod":6350,"cc":5364,"mark":5291,"aux":4018,"obl":2015,"nummod":1880,"expl:pv":1798,"cop":1706,"amod":1376,"aux:pass":1369,"nsubj:pass":963,"expl:pass":909,"parataxis":877,"obj":866,"advcl":710,"iobj":567,"expl:poss":464,"expl":390,"nmod":204,"nsubj||csubj":154,"nmod:tmod":152,"expl:impers":102,"xcomp":97,"advmod:tmod":85,"nmod:pmod":74,"cc:preconj":63,"csubj":58,"nsubj:pass||csubj":57,"obj||ccomp":44,"orphan":32,"advcl:tcl":30,"dep":0},"3":{"nmod":16696,"punct":14423,"amod":9673,"obl":7745,"conj":7281,"fixed":5595,"obj":5457,"acl":4102,"advmod":2145,"advcl":2043,"ccomp":1929,"nummod":1646,"nsubj":1278,"nmod:pmod":1208,"flat":1160,"det":1031,"appos":915,"xcomp":886,"iobj":804,"nmod:agent":718,"csubj":626,"nsubj:pass":546,"case":442,"parataxis":426,"nmod:tmod":286,"goeswith":245,"ccomp:pmod":174,"cc":124,"cop":100,"expl:pv":86,"expl":55,"advcl:tcl":52,"compound":50,"csubj:pass":49,"expl:poss":36,"vocative":31,"dep":0},"4":{"ROOT":8021}}�cfg��neg_key�
ro_core_news_lg-any-py3-none-any.whl ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:750b5b5b0dad8fb1b0afc41dff5e52640545d643bee77be5c16b40d364a049c7
3
+ size 571621040
senter/cfg ADDED
@@ -0,0 +1,3 @@
1
+ {
2
+
3
+ }
senter/model ADDED
Binary file (213 kB). View file
 
tagger/cfg ADDED
@@ -0,0 +1,474 @@
1
+ {
2
+ "labels":[
3
+ "ARROW",
4
+ "Af",
5
+ "Afcfp-n",
6
+ "Afcfson",
7
+ "Afcfsrn",
8
+ "Afcmpoy",
9
+ "Afcms-n",
10
+ "Afp",
11
+ "Afp-p-n",
12
+ "Afp-poy",
13
+ "Afpf--n",
14
+ "Afpfp-n",
15
+ "Afpfp-ny",
16
+ "Afpfpoy",
17
+ "Afpfpry",
18
+ "Afpfson",
19
+ "Afpfsoy",
20
+ "Afpfsrn",
21
+ "Afpfsry",
22
+ "Afpm--n",
23
+ "Afpmp-n",
24
+ "Afpmpoy",
25
+ "Afpmpry",
26
+ "Afpms-n",
27
+ "Afpmsoy",
28
+ "Afpmsry",
29
+ "Afsfp-n",
30
+ "Afsfsrn",
31
+ "BULLET",
32
+ "COLON",
33
+ "COMMA",
34
+ "Ccssp",
35
+ "Ccsspy",
36
+ "Crssp",
37
+ "Csssp",
38
+ "Cssspy",
39
+ "DASH",
40
+ "DBLQ",
41
+ "Dd3-po---e",
42
+ "Dd3-po---o",
43
+ "Dd3fpo",
44
+ "Dd3fpr",
45
+ "Dd3fpr---e",
46
+ "Dd3fpr---o",
47
+ "Dd3fpr--y",
48
+ "Dd3fso",
49
+ "Dd3fso---e",
50
+ "Dd3fsr",
51
+ "Dd3fsr---e",
52
+ "Dd3fsr---o",
53
+ "Dd3fsr--yo",
54
+ "Dd3mpo",
55
+ "Dd3mpr",
56
+ "Dd3mpr---e",
57
+ "Dd3mpr---o",
58
+ "Dd3mso---e",
59
+ "Dd3msr",
60
+ "Dd3msr---e",
61
+ "Dd3msr---o",
62
+ "Dh1ms",
63
+ "Dh3fp",
64
+ "Dh3fso",
65
+ "Dh3fsr",
66
+ "Dh3mp",
67
+ "Dh3ms",
68
+ "Di3",
69
+ "Di3-----y",
70
+ "Di3--r---e",
71
+ "Di3-po",
72
+ "Di3-po---e",
73
+ "Di3-sr",
74
+ "Di3-sr---e",
75
+ "Di3-sr--y",
76
+ "Di3fp",
77
+ "Di3fpr",
78
+ "Di3fpr---e",
79
+ "Di3fso",
80
+ "Di3fso---e",
81
+ "Di3fsr",
82
+ "Di3fsr---e",
83
+ "Di3mp",
84
+ "Di3mpr",
85
+ "Di3mpr---e",
86
+ "Di3ms",
87
+ "Di3ms----e",
88
+ "Di3mso---e",
89
+ "Di3msr",
90
+ "Di3msr---e",
91
+ "Ds1fp-p",
92
+ "Ds1fp-s",
93
+ "Ds1fsop",
94
+ "Ds1fsos",
95
+ "Ds1fsrp",
96
+ "Ds1fsrs",
97
+ "Ds1fsrs-y",
98
+ "Ds1mp-p",
99
+ "Ds1mp-s",
100
+ "Ds1ms-p",
101
+ "Ds1ms-s",
102
+ "Ds1msrs-y",
103
+ "Ds2---s",
104
+ "Ds2fp-p",
105
+ "Ds2fp-s",
106
+ "Ds2fsrp",
107
+ "Ds2fsrs",
108
+ "Ds2mp-p",
109
+ "Ds2mp-s",
110
+ "Ds2ms-p",
111
+ "Ds2ms-s",
112
+ "Ds3---p",
113
+ "Ds3---s",
114
+ "Ds3fp-s",
115
+ "Ds3fsos",
116
+ "Ds3fsrs",
117
+ "Ds3mp-s",
118
+ "Ds3ms-s",
119
+ "Dw3--r---e",
120
+ "Dw3-po---e",
121
+ "Dw3fpr",
122
+ "Dw3fso---e",
123
+ "Dw3fsr",
124
+ "Dw3mpr",
125
+ "Dw3mso---e",
126
+ "Dw3msr",
127
+ "Dz3fsr---e",
128
+ "Dz3mso---e",
129
+ "Dz3msr---e",
130
+ "EQUAL",
131
+ "EXCL",
132
+ "EXCLHELLIP",
133
+ "GE",
134
+ "GT",
135
+ "HELLIP",
136
+ "I",
137
+ "LCURL",
138
+ "LPAR",
139
+ "LSQR",
140
+ "LT",
141
+ "M",
142
+ "Mc",
143
+ "Mc-p-d",
144
+ "Mc-p-l",
145
+ "Mcfp-l",
146
+ "Mcfp-ln",
147
+ "Mcfprln",
148
+ "Mcfprly",
149
+ "Mcfsoln",
150
+ "Mcfsrln",
151
+ "Mcmp-l",
152
+ "Mcms-ln",
153
+ "Mcmsrl",
154
+ "Mcmsrly",
155
+ "Mffprln",
156
+ "Mffsrln",
157
+ "Mlfpo",
158
+ "Mlfpr",
159
+ "Mlmpr",
160
+ "Mo---l",
161
+ "Mo---ln",
162
+ "Mo-s-r",
163
+ "Mofp-ln",
164
+ "Mofpoly",
165
+ "Mofprly",
166
+ "Mofs-l",
167
+ "Mofsoln",
168
+ "Mofsoly",
169
+ "Mofsrln",
170
+ "Mofsrly",
171
+ "Mompoly",
172
+ "Momprly",
173
+ "Moms-l",
174
+ "Moms-ln",
175
+ "Momsoly",
176
+ "Momsrly",
177
+ "Nc",
178
+ "Nc---n",
179
+ "Ncf--n",
180
+ "Ncfp-n",
181
+ "Ncfpoy",
182
+ "Ncfpry",
183
+ "Ncfs-n",
184
+ "Ncfson",
185
+ "Ncfsoy",
186
+ "Ncfsrn",
187
+ "Ncfsry",
188
+ "Ncfsryy",
189
+ "Ncfsvy",
190
+ "Ncm--n",
191
+ "Ncmp-n",
192
+ "Ncmpoy",
193
+ "Ncmpry",
194
+ "Ncms-n",
195
+ "Ncms-ny",
196
+ "Ncms-y",
197
+ "Ncmsoy",
198
+ "Ncmsrn",
199
+ "Ncmsry",
200
+ "Ncmsryy",
201
+ "Ncmsvn",
202
+ "Ncmsvy",
203
+ "Np",
204
+ "Npfson",
205
+ "Npfsoy",
206
+ "Npfsrn",
207
+ "Npfsry",
208
+ "Npmpoy",
209
+ "Npmpry",
210
+ "Npms-n",
211
+ "Npmsoy",
212
+ "Npmsry",
213
+ "PERCENT",
214
+ "PERIOD",
215
+ "PLUS",
216
+ "PLUSMINUS",
217
+ "Pd3-po",
218
+ "Pd3fpr",
219
+ "Pd3fso",
220
+ "Pd3fsr",
221
+ "Pd3mpo",
222
+ "Pd3mpr",
223
+ "Pd3mpr--y",
224
+ "Pd3mso",
225
+ "Pd3msr",
226
+ "Pi3",
227
+ "Pi3--r",
228
+ "Pi3-po",
229
+ "Pi3-so",
230
+ "Pi3-sr",
231
+ "Pi3fpr",
232
+ "Pi3fso",
233
+ "Pi3fsr",
234
+ "Pi3mpr",
235
+ "Pi3mso",
236
+ "Pi3msr",
237
+ "Pi3msr--y",
238
+ "Pp1-pa--------w",
239
+ "Pp1-pa--y-----w",
240
+ "Pp1-pd--------s",
241
+ "Pp1-pd--------w",
242
+ "Pp1-pd--y-----w",
243
+ "Pp1-pr--------s",
244
+ "Pp1-sa--------s",
245
+ "Pp1-sa--------w",
246
+ "Pp1-sa--y-----w",
247
+ "Pp1-sd--------s",
248
+ "Pp1-sd--------w",
249
+ "Pp1-sd--y-----w",
250
+ "Pp1-sn--------s",
251
+ "Pp2-----------s",
252
+ "Pp2-pa--------w",
253
+ "Pp2-pa--y-----w",
254
+ "Pp2-pd--------w",
255
+ "Pp2-pd--y-----w",
256
+ "Pp2-pr--------s",
257
+ "Pp2-sa--------s",
258
+ "Pp2-sa--------w",
259
+ "Pp2-sa--y-----w",
260
+ "Pp2-sd--------s",
261
+ "Pp2-sd--------w",
262
+ "Pp2-sd--y-----w",
263
+ "Pp2-sn--------s",
264
+ "Pp2-so--------s",
265
+ "Pp2-sr--------s",
266
+ "Pp3-p---------s",
267
+ "Pp3-pd--------w",
268
+ "Pp3-pd--y-----w",
269
+ "Pp3-po--------s",
270
+ "Pp3-sd--------w",
271
+ "Pp3-sd--y-----w",
272
+ "Pp3fpa--------w",
273
+ "Pp3fpa--y-----w",
274
+ "Pp3fpr--------s",
275
+ "Pp3fs---------s",
276
+ "Pp3fsa--------w",
277
+ "Pp3fsa--y-----w",
278
+ "Pp3fso--------s",
279
+ "Pp3fsr--------s",
280
+ "Pp3fsr--y-----s",
281
+ "Pp3mpa--------w",
282
+ "Pp3mpa--y-----w",
283
+ "Pp3mpr--------s",
284
+ "Pp3ms---------s",
285
+ "Pp3msa--------w",
286
+ "Pp3msa--y-----w",
287
+ "Pp3mso--------s",
288
+ "Pp3msr--------s",
289
+ "Pp3msr--y-----s",
290
+ "Ps1fp-s",
291
+ "Ps1fsrp",
292
+ "Ps1fsrs",
293
+ "Ps1mp-p",
294
+ "Ps1ms-p",
295
+ "Ps2fp-s",
296
+ "Ps2fsrp",
297
+ "Ps2fsrs",
298
+ "Ps2ms-s",
299
+ "Ps3---p",
300
+ "Ps3---s",
301
+ "Ps3fp-s",
302
+ "Ps3fsrs",
303
+ "Ps3mp-s",
304
+ "Ps3ms-s",
305
+ "Pw3--r",
306
+ "Pw3-po",
307
+ "Pw3-so",
308
+ "Pw3fpr",
309
+ "Pw3fso",
310
+ "Pw3mpr",
311
+ "Pw3mso",
312
+ "Px3--a--------s",
313
+ "Px3--a--------w",
314
+ "Px3--a--y-----w",
315
+ "Px3--d--------w",
316
+ "Px3--d--y-----w",
317
+ "Pz3-sr",
318
+ "Pz3fsr",
319
+ "QUEST",
320
+ "QUOT",
321
+ "Qf",
322
+ "Qn",
323
+ "Qs",
324
+ "Qs-y",
325
+ "Qz",
326
+ "Qz-y",
327
+ "RCURL",
328
+ "RPAR",
329
+ "RSQR",
330
+ "Rc",
331
+ "Rgc",
332
+ "Rgp",
333
+ "Rgpy",
334
+ "Rgs",
335
+ "Rp",
336
+ "Rw",
337
+ "Rw-y",
338
+ "Rz",
339
+ "SCOLON",
340
+ "SLASH",
341
+ "STAR",
342
+ "Sp",
343
+ "Spsa",
344
+ "Spsay",
345
+ "Spsd",
346
+ "Spsg",
347
+ "Td-po",
348
+ "Tdfpr",
349
+ "Tdfso",
350
+ "Tdfsr",
351
+ "Tdmpr",
352
+ "Tdmso",
353
+ "Tdmsr",
354
+ "Tf-so",
355
+ "Tffpoy",
356
+ "Tffpry",
357
+ "Tffs-y",
358
+ "Tfmpoy",
359
+ "Tfms-y",
360
+ "Tfmsoy",
361
+ "Tfmsry",
362
+ "Ti-po",
363
+ "Tifp-y",
364
+ "Tifso",
365
+ "Tifsr",
366
+ "Timso",
367
+ "Timsr",
368
+ "Tsfp",
369
+ "Tsfs",
370
+ "Tsmp",
371
+ "Tsms",
372
+ "UNDERSC",
373
+ "Va--1",
374
+ "Va--1-----y",
375
+ "Va--1p",
376
+ "Va--1s",
377
+ "Va--1s----y",
378
+ "Va--2p",
379
+ "Va--2p----y",
380
+ "Va--2s",
381
+ "Va--2s----y",
382
+ "Va--3",
383
+ "Va--3-----y",
384
+ "Va--3p",
385
+ "Va--3p----y",
386
+ "Va--3s",
387
+ "Va--3s----y",
388
+ "Vag",
389
+ "Vaii1",
390
+ "Vaii2s",
391
+ "Vaii3p",
392
+ "Vaii3s",
393
+ "Vail3p",
394
+ "Vail3s",
395
+ "Vaip1p",
396
+ "Vaip1s",
397
+ "Vaip2p",
398
+ "Vaip2s",
399
+ "Vaip3p",
400
+ "Vaip3p----y",
401
+ "Vaip3s",
402
+ "Vaip3s----y",
403
+ "Vais3p",
404
+ "Vais3s",
405
+ "Vam-2s",
406
+ "Vanp",
407
+ "Vap--sm",
408
+ "Vasp1p",
409
+ "Vasp1s",
410
+ "Vasp2p",
411
+ "Vasp2s",
412
+ "Vasp3",
413
+ "Vmg",
414
+ "Vmg-------y",
415
+ "Vmii1",
416
+ "Vmii1-----y",
417
+ "Vmii2p",
418
+ "Vmii2s",
419
+ "Vmii3p",
420
+ "Vmii3p----y",
421
+ "Vmii3s",
422
+ "Vmii3s----y",
423
+ "Vmil1",
424
+ "Vmil1p",
425
+ "Vmil2s",
426
+ "Vmil3p",
427
+ "Vmil3p----y",
428
+ "Vmil3s",
429
+ "Vmil3s----y",
430
+ "Vmip1p",
431
+ "Vmip1p----y",
432
+ "Vmip1s",
433
+ "Vmip1s----y",
434
+ "Vmip2p",
435
+ "Vmip2s",
436
+ "Vmip2s----y",
437
+ "Vmip3",
438
+ "Vmip3-----y",
439
+ "Vmip3p",
440
+ "Vmip3s",
441
+ "Vmip3s----y",
442
+ "Vmis1p",
443
+ "Vmis1s",
444
+ "Vmis3p",
445
+ "Vmis3p----y",
446
+ "Vmis3s",
447
+ "Vmis3s----y",
448
+ "Vmm-2p",
449
+ "Vmm-2s",
450
+ "Vmnp",
451
+ "Vmnp------y",
452
+ "Vmp--pf",
453
+ "Vmp--pm",
454
+ "Vmp--sf",
455
+ "Vmp--sm",
456
+ "Vmp--sm---y",
457
+ "Vmsp1p",
458
+ "Vmsp1s",
459
+ "Vmsp2s",
460
+ "Vmsp3",
461
+ "Vmsp3-----y",
462
+ "X",
463
+ "Y",
464
+ "Ya",
465
+ "Yn",
466
+ "Ynfsoy",
467
+ "Ynfsry",
468
+ "Ynmsoy",
469
+ "Ynmsry",
470
+ "Yp",
471
+ "Yp-sr",
472
+ "Yr"
473
+ ]
474
+ }
tagger/model ADDED
Binary file (183 kB). View file
 
tok2vec/cfg ADDED
@@ -0,0 +1,3 @@
1
+ {
2
+
3
+ }
tok2vec/model ADDED
Binary file (6.81 MB). View file
 
tokenizer ADDED
@@ -0,0 +1,3 @@
1
+ ��prefix_search�
2
+ ��A�
3
+ � ��A� �'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�._.��A�._.�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�1-A��A�1-A�1-UL��A�1-UL�1-Ul��A�1-Ul�1-a��A�1-a�1-ul��A�1-ul�10-A��A�10-A�10-LEA��A�10-LEA�10-Lea��A�10-Lea�10-a��A�10-a�10-lea��A�10-lea�11-A��A�11-A�11-LEA��A�11-LEA�11-Lea��A�11-Lea�11-a��A�11-a�11-lea��A�11-lea�12-A��A�12-A�12-LEA��A�12-LEA�12-Lea��A�12-Lea�12-a��A�12-a�12-lea��A�12-lea�2-A��A�2-A�2-LEA��A�2-LEA�2-Lea��A�2-Lea�2-a��A�2-a�2-lea��A�2-lea�3-A��A�3-A�3-LEA��A�3-LEA�3-Lea��A�3-Lea�3-a��A�3-a�3-lea��A�3-lea�4-A��A�4-A�4-LEA��A�4-LEA�4-Lea��A�4-Lea�4-a��A�4-a�4-lea��A�4-lea�5-A��A�5-A�5-LEA��A�5-LEA�5-Lea��A�5-Lea�5-a��A�5-a�5-lea��A�5-lea�6-A��A�6-A�6-LEA��A�6-LEA�6-Lea��A�6-Lea�6-a��A�6-a�6-lea��A�6-lea�7-A��A�7-A�7-LEA��A�7-LEA�7-Lea��A�7-Lea�7-a��A�7-a�7-lea��A�7-lea�8)��A�8)�8-)��A�8-)�8-A��A�8-A�8-D��A�8-D�8-LEA��A�8-LEA�8-Lea��A�8-Lea�8-a��A�8-a�8-lea��A�8-lea�8D��A�8D�9-A��A�9-A�9-LEA��A�9-LEA�9-Lea��A�9-Lea�9-a��A�9-a�9-lea��A�9-lea�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=[��A�=[�=]��A�=]�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o
��A�>:o�><(((*>��A�><(((*>�@_@��A�@_@�A.C.��A�A.C.�A.F.��A�A.F.�A.M.��A�A.M.�A.R.��A�A.R.�AL.��A�AL.�ALIN.��A�ALIN.�ART.��A�ART.�AUG.��A�AUG.�Al.��A�Al.�Alin.��A�Alin.�Art.��A�Art.�Aug.��A�Aug.�BD.��A�BD.�Bd.��A�Bd.�C++��A�C++�D-L��A�D-L�D-LUI��A�D-LUI�D-Lui��A�D-Lui�D-NEI��A�D-NEI�D-Nei��A�D-Nei�D-VOASTRA��A�D-VOASTRA�D-VOASTRĂ��A�D-VOASTRĂ�D-Voastra��A�D-Voastra�D-Voastră��A�D-Voastră�D.P.D.V.��A�D.P.D.V.�DEM.��A�DEM.�DPDV��A�DPDV�DR.��A�DR.�DVS.��A�DVS.�Dem.��A�Dem.�Dpdv��A�Dpdv�Dr.��A�Dr.�Dvs.��A�Dvs.�ETC.��A�ETC.�EX.��A�EX.�Etc.��A�Etc.�Ex.��A�Ex.�FIG.��A�FIG.�FR.��A�FR.�Fig.��A�Fig.�Fr.��A�Fr.�GH.��A�GH.�GR.��A�GR.�Gh.��A�Gh.�Gr.��A�Gr.�IAN.��A�IAN.�ING.��A�ING.�INGR.��A�INGR.�INTR-ADEVAR��A�INTR-ADEVAR�INTR-ADEVĂR��A�INTR-ADEVĂR�Ian.��A�Ian.�Ing.��A�Ing.�Ingr.��A�Ingr.�Intr-Adevar��A�Intr-Adevar�Intr-Adevăr��A�Intr-Adevăr�LIT.��A�LIT.�LT.��A�LT.�Lit.��A�Lit.�Lt.��A�Lt.�NR.��A�NR.�Nr.��A�Nr.�O.O��A�O.O�O.o��A�O.o�OBS.��A�OBS.�O_O��A�O_O�O_o��A�O_o�Obs.��A�Obs.�P.A.��A�P.A.�P.M.��A�P.M.�PCT.��A�PCT.�PREP.��A�PREP.�PROF.��A�PROF.�Pct.��A�Pct.�Prep.��A�Prep.�Prof.��A�Prof.�ROM.��A�ROM.�Rom.��A�Rom.�S.A.��A�S.A.�S.A.M.D.��A�S.A.M.D.�SAMD.��A�SAMD.�SF.��A�SF.�ST.��A�ST.�STR.��A�STR.�Samd.��A�Samd.�Sf.��A�Sf.�St.��A�St.�Str.��A�Str.�TEL.��A�TEL.�Tel.��A�Tel.�UNIV.��A�UNIV.�Univ.��A�Univ.�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�[=��A�[=�\")��A�\")�\n��A�\n�\t��A�\t�]=��A�]=�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�a.��A�a.�a.c.��A�a.c.�a.f.��A�a.f.�a.m.��A�a.m.�a.r.��A�a.r.�al.��A�al.�alin.��A�alin.�art.��A�art.�aug.��A�aug.�b.��A�b.�bd.��A�bd.�c.��A�c.�d-l��A�d-l�d-lui��A�d-lui�d-nei��A�d-nei�d-voastra��A�d-voastra�d-voastră��A�d-voastră�d.��A�d.�d.p.d.v.��A�d.p.d.v.�dem.��A�dem.�dpdv��A�dpdv�dr.��A�dr.�dvs.��A�dvs.�e.��A�e.�etc.��A�etc.�ex.��A�ex.�f.��A�f.�fig.��A�fig.�fr.��A�fr.�g.��A�g.�gh.��A�gh.�gr.��A�gr.�h.��A�h.�i.��A�i.�ian.��A�ian.�ing.��A�ing.�ingr.��A�ingr.�intr-adevar��A�intr-adevar�intr-adevăr��A�intr-adevăr�j.��A�j.�k.��A�k.�l.��A�
l.�lit.��A�lit.�lt.��A�lt.�m.��A�m.�n.��A�n.�nr.��A�nr.�o.��A�o.�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�obs.��A�obs.�p.��A�p.�p.a.��A�p.a.�p.m.��A�p.m.�pct.��A�pct.�prep.��A�prep.�prof.��A�prof.�q.��A�q.�r.��A�r.�rom.��A�rom.�s.��A�s.�s.a.��A�s.a.�s.a.m.d.��A�s.a.m.d.�samd.��A�samd.�sf.��A�sf.�st.��A�st.�str.��A�str.�t.��A�t.�tel.��A�tel.�u.��A�u.�univ.��A�univ.�v.��A�v.�v.v��A�v.v�v_v��A�v_v�w.��A�w.�x.��A�x.�xD��A�xD�xDD��A�xDD�y.��A�y.�z.��A�z.� ��A� C� �¯\(ツ)/¯��A�¯\(ツ)/¯�ÎNGR.��A�ÎNGR.�ÎNTR-ADEVAR��A�ÎNTR-ADEVAR�ÎNTR-ADEVĂR��A�ÎNTR-ADEVĂR�Îngr.��A�Îngr.�Într-Adevar��A�Într-Adevar�Într-Adevăr��A�Într-Adevăr�ä.��A�ä.�îngr.��A�îngr.�într-adevar��A�într-adevar�într-adevăr��A�într-adevăr�ö.��A�ö.�ü.��A�ü.�Ş.A.��A�Ş.A.�Ş.A.M.D.��A�Ş.A.M.D.�ŞAMD.��A�ŞAMD.�ŞT.��A�ŞT.�Şamd.��A�Şamd.�Şt.��A�Şt.�ş.a.��A�ş.a.�ş.a.m.d.��A�ş.a.m.d.�şamd.��A�şamd.�şt.��A�şt.�Ș.A.��A�Ș.A.�Ș.A.M.D.��A�Ș.A.M.D.�ȘAMD.��A�ȘAMD.�ȘT.��A�ȘT.�Șamd.��A�Șamd.�Șt.��A�Șt.�ș.a.��A�ș.a.�ș.a.m.d.��A�ș.a.m.d.�șamd.��A�șamd.�șt.��A�șt.�ಠ_ಠ��A�ಠ_ಠ�ಠ︵ಠ��A�ಠ︵ಠ�—��A�—�’��A�’�’’��A�’’
vocab/key2row ADDED
Binary file (6.87 MB). View file
 
vocab/lookups.bin ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:76be8b528d0075f7aae98d6fa57a6d3c83ae480a8469e668d7b0af968995ac71
3
+ size 1
vocab/strings.json ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0442198e6d05377364bc6e0ce4f78c69ae3b1d2ee6feb4c1265384ca182a1dbb
3
+ size 8420995
vocab/vectors ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ce7cb76168909238362bdc6c2e408e130290d7bf6e5bdc62eaf2220d0671b2d7
3
+ size 600000128
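Note on the pointer files above: entries such as `vocab/vectors` are Git LFS pointers (a `version`, `oid`, and `size` line) rather than the actual payload. The following is a minimal sketch, not part of the commit, of reading those three fields; the function name `parse_lfs_pointer` and the returned dictionary keys are hypothetical, and the sketch assumes one `key value` pair per line as shown in the diff.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Each line is "<key> <value>"; split on the first space only.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid is written as "<hash-algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "oid_algo": algo,
        "oid": digest,
        "size": int(fields["size"]),
    }


# Pointer content copied from the vocab/vectors entry above.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:ce7cb76168909238362bdc6c2e408e130290d7bf6e5bdc62eaf2220d0671b2d7
size 600000128
"""

info = parse_lfs_pointer(pointer)
```

Here `info["size"]` holds the byte size recorded in the pointer, which is the size of the real file that LFS would fetch, not of the pointer itself.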