osanseviero commited on
Commit
5fe3577
1 Parent(s): f400481

Update spaCy pipeline

Browse files
.gitattributes CHANGED
@@ -14,3 +14,7 @@
14
  *.pb filter=lfs diff=lfs merge=lfs -text
15
  *.pt filter=lfs diff=lfs merge=lfs -text
16
  *.pth filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
14
  *.pb filter=lfs diff=lfs merge=lfs -text
15
  *.pt filter=lfs diff=lfs merge=lfs -text
16
  *.pth filter=lfs diff=lfs merge=lfs -text
17
+ *.whl filter=lfs diff=lfs merge=lfs -text
18
+ *.npz filter=lfs diff=lfs merge=lfs -text
19
+ *strings.json filter=lfs diff=lfs merge=lfs -text
20
+ vectors filter=lfs diff=lfs merge=lfs -text
LICENSE ADDED
@@ -0,0 +1,428 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Attribution-ShareAlike 4.0 International
2
+
3
+ =======================================================================
4
+
5
+ Creative Commons Corporation ("Creative Commons") is not a law firm and
6
+ does not provide legal services or legal advice. Distribution of
7
+ Creative Commons public licenses does not create a lawyer-client or
8
+ other relationship. Creative Commons makes its licenses and related
9
+ information available on an "as-is" basis. Creative Commons gives no
10
+ warranties regarding its licenses, any material licensed under their
11
+ terms and conditions, or any related information. Creative Commons
12
+ disclaims all liability for damages resulting from their use to the
13
+ fullest extent possible.
14
+
15
+ Using Creative Commons Public Licenses
16
+
17
+ Creative Commons public licenses provide a standard set of terms and
18
+ conditions that creators and other rights holders may use to share
19
+ original works of authorship and other material subject to copyright
20
+ and certain other rights specified in the public license below. The
21
+ following considerations are for informational purposes only, are not
22
+ exhaustive, and do not form part of our licenses.
23
+
24
+ Considerations for licensors: Our public licenses are
25
+ intended for use by those authorized to give the public
26
+ permission to use material in ways otherwise restricted by
27
+ copyright and certain other rights. Our licenses are
28
+ irrevocable. Licensors should read and understand the terms
29
+ and conditions of the license they choose before applying it.
30
+ Licensors should also secure all rights necessary before
31
+ applying our licenses so that the public can reuse the
32
+ material as expected. Licensors should clearly mark any
33
+ material not subject to the license. This includes other CC-
34
+ licensed material, or material used under an exception or
35
+ limitation to copyright. More considerations for licensors:
36
+ wiki.creativecommons.org/Considerations_for_licensors
37
+
38
+ Considerations for the public: By using one of our public
39
+ licenses, a licensor grants the public permission to use the
40
+ licensed material under specified terms and conditions. If
41
+ the licensor's permission is not necessary for any reason--for
42
+ example, because of any applicable exception or limitation to
43
+ copyright--then that use is not regulated by the license. Our
44
+ licenses grant only permissions under copyright and certain
45
+ other rights that a licensor has authority to grant. Use of
46
+ the licensed material may still be restricted for other
47
+ reasons, including because others have copyright or other
48
+ rights in the material. A licensor may make special requests,
49
+ such as asking that all changes be marked or described.
50
+ Although not required by our licenses, you are encouraged to
51
+ respect those requests where reasonable. More considerations
52
+ for the public:
53
+ wiki.creativecommons.org/Considerations_for_licensees
54
+
55
+ =======================================================================
56
+
57
+ Creative Commons Attribution-ShareAlike 4.0 International Public
58
+ License
59
+
60
+ By exercising the Licensed Rights (defined below), You accept and agree
61
+ to be bound by the terms and conditions of this Creative Commons
62
+ Attribution-ShareAlike 4.0 International Public License ("Public
63
+ License"). To the extent this Public License may be interpreted as a
64
+ contract, You are granted the Licensed Rights in consideration of Your
65
+ acceptance of these terms and conditions, and the Licensor grants You
66
+ such rights in consideration of benefits the Licensor receives from
67
+ making the Licensed Material available under these terms and
68
+ conditions.
69
+
70
+
71
+ Section 1 -- Definitions.
72
+
73
+ a. Adapted Material means material subject to Copyright and Similar
74
+ Rights that is derived from or based upon the Licensed Material
75
+ and in which the Licensed Material is translated, altered,
76
+ arranged, transformed, or otherwise modified in a manner requiring
77
+ permission under the Copyright and Similar Rights held by the
78
+ Licensor. For purposes of this Public License, where the Licensed
79
+ Material is a musical work, performance, or sound recording,
80
+ Adapted Material is always produced where the Licensed Material is
81
+ synched in timed relation with a moving image.
82
+
83
+ b. Adapter's License means the license You apply to Your Copyright
84
+ and Similar Rights in Your contributions to Adapted Material in
85
+ accordance with the terms and conditions of this Public License.
86
+
87
+ c. BY-SA Compatible License means a license listed at
88
+ creativecommons.org/compatiblelicenses, approved by Creative
89
+ Commons as essentially the equivalent of this Public License.
90
+
91
+ d. Copyright and Similar Rights means copyright and/or similar rights
92
+ closely related to copyright including, without limitation,
93
+ performance, broadcast, sound recording, and Sui Generis Database
94
+ Rights, without regard to how the rights are labeled or
95
+ categorized. For purposes of this Public License, the rights
96
+ specified in Section 2(b)(1)-(2) are not Copyright and Similar
97
+ Rights.
98
+
99
+ e. Effective Technological Measures means those measures that, in the
100
+ absence of proper authority, may not be circumvented under laws
101
+ fulfilling obligations under Article 11 of the WIPO Copyright
102
+ Treaty adopted on December 20, 1996, and/or similar international
103
+ agreements.
104
+
105
+ f. Exceptions and Limitations means fair use, fair dealing, and/or
106
+ any other exception or limitation to Copyright and Similar Rights
107
+ that applies to Your use of the Licensed Material.
108
+
109
+ g. License Elements means the license attributes listed in the name
110
+ of a Creative Commons Public License. The License Elements of this
111
+ Public License are Attribution and ShareAlike.
112
+
113
+ h. Licensed Material means the artistic or literary work, database,
114
+ or other material to which the Licensor applied this Public
115
+ License.
116
+
117
+ i. Licensed Rights means the rights granted to You subject to the
118
+ terms and conditions of this Public License, which are limited to
119
+ all Copyright and Similar Rights that apply to Your use of the
120
+ Licensed Material and that the Licensor has authority to license.
121
+
122
+ j. Licensor means the individual(s) or entity(ies) granting rights
123
+ under this Public License.
124
+
125
+ k. Share means to provide material to the public by any means or
126
+ process that requires permission under the Licensed Rights, such
127
+ as reproduction, public display, public performance, distribution,
128
+ dissemination, communication, or importation, and to make material
129
+ available to the public including in ways that members of the
130
+ public may access the material from a place and at a time
131
+ individually chosen by them.
132
+
133
+ l. Sui Generis Database Rights means rights other than copyright
134
+ resulting from Directive 96/9/EC of the European Parliament and of
135
+ the Council of 11 March 1996 on the legal protection of databases,
136
+ as amended and/or succeeded, as well as other essentially
137
+ equivalent rights anywhere in the world.
138
+
139
+ m. You means the individual or entity exercising the Licensed Rights
140
+ under this Public License. Your has a corresponding meaning.
141
+
142
+
143
+ Section 2 -- Scope.
144
+
145
+ a. License grant.
146
+
147
+ 1. Subject to the terms and conditions of this Public License,
148
+ the Licensor hereby grants You a worldwide, royalty-free,
149
+ non-sublicensable, non-exclusive, irrevocable license to
150
+ exercise the Licensed Rights in the Licensed Material to:
151
+
152
+ a. reproduce and Share the Licensed Material, in whole or
153
+ in part; and
154
+
155
+ b. produce, reproduce, and Share Adapted Material.
156
+
157
+ 2. Exceptions and Limitations. For the avoidance of doubt, where
158
+ Exceptions and Limitations apply to Your use, this Public
159
+ License does not apply, and You do not need to comply with
160
+ its terms and conditions.
161
+
162
+ 3. Term. The term of this Public License is specified in Section
163
+ 6(a).
164
+
165
+ 4. Media and formats; technical modifications allowed. The
166
+ Licensor authorizes You to exercise the Licensed Rights in
167
+ all media and formats whether now known or hereafter created,
168
+ and to make technical modifications necessary to do so. The
169
+ Licensor waives and/or agrees not to assert any right or
170
+ authority to forbid You from making technical modifications
171
+ necessary to exercise the Licensed Rights, including
172
+ technical modifications necessary to circumvent Effective
173
+ Technological Measures. For purposes of this Public License,
174
+ simply making modifications authorized by this Section 2(a)
175
+ (4) never produces Adapted Material.
176
+
177
+ 5. Downstream recipients.
178
+
179
+ a. Offer from the Licensor -- Licensed Material. Every
180
+ recipient of the Licensed Material automatically
181
+ receives an offer from the Licensor to exercise the
182
+ Licensed Rights under the terms and conditions of this
183
+ Public License.
184
+
185
+ b. Additional offer from the Licensor -- Adapted Material.
186
+ Every recipient of Adapted Material from You
187
+ automatically receives an offer from the Licensor to
188
+ exercise the Licensed Rights in the Adapted Material
189
+ under the conditions of the Adapter's License You apply.
190
+
191
+ c. No downstream restrictions. You may not offer or impose
192
+ any additional or different terms or conditions on, or
193
+ apply any Effective Technological Measures to, the
194
+ Licensed Material if doing so restricts exercise of the
195
+ Licensed Rights by any recipient of the Licensed
196
+ Material.
197
+
198
+ 6. No endorsement. Nothing in this Public License constitutes or
199
+ may be construed as permission to assert or imply that You
200
+ are, or that Your use of the Licensed Material is, connected
201
+ with, or sponsored, endorsed, or granted official status by,
202
+ the Licensor or others designated to receive attribution as
203
+ provided in Section 3(a)(1)(A)(i).
204
+
205
+ b. Other rights.
206
+
207
+ 1. Moral rights, such as the right of integrity, are not
208
+ licensed under this Public License, nor are publicity,
209
+ privacy, and/or other similar personality rights; however, to
210
+ the extent possible, the Licensor waives and/or agrees not to
211
+ assert any such rights held by the Licensor to the limited
212
+ extent necessary to allow You to exercise the Licensed
213
+ Rights, but not otherwise.
214
+
215
+ 2. Patent and trademark rights are not licensed under this
216
+ Public License.
217
+
218
+ 3. To the extent possible, the Licensor waives any right to
219
+ collect royalties from You for the exercise of the Licensed
220
+ Rights, whether directly or through a collecting society
221
+ under any voluntary or waivable statutory or compulsory
222
+ licensing scheme. In all other cases the Licensor expressly
223
+ reserves any right to collect such royalties.
224
+
225
+
226
+ Section 3 -- License Conditions.
227
+
228
+ Your exercise of the Licensed Rights is expressly made subject to the
229
+ following conditions.
230
+
231
+ a. Attribution.
232
+
233
+ 1. If You Share the Licensed Material (including in modified
234
+ form), You must:
235
+
236
+ a. retain the following if it is supplied by the Licensor
237
+ with the Licensed Material:
238
+
239
+ i. identification of the creator(s) of the Licensed
240
+ Material and any others designated to receive
241
+ attribution, in any reasonable manner requested by
242
+ the Licensor (including by pseudonym if
243
+ designated);
244
+
245
+ ii. a copyright notice;
246
+
247
+ iii. a notice that refers to this Public License;
248
+
249
+ iv. a notice that refers to the disclaimer of
250
+ warranties;
251
+
252
+ v. a URI or hyperlink to the Licensed Material to the
253
+ extent reasonably practicable;
254
+
255
+ b. indicate if You modified the Licensed Material and
256
+ retain an indication of any previous modifications; and
257
+
258
+ c. indicate the Licensed Material is licensed under this
259
+ Public License, and include the text of, or the URI or
260
+ hyperlink to, this Public License.
261
+
262
+ 2. You may satisfy the conditions in Section 3(a)(1) in any
263
+ reasonable manner based on the medium, means, and context in
264
+ which You Share the Licensed Material. For example, it may be
265
+ reasonable to satisfy the conditions by providing a URI or
266
+ hyperlink to a resource that includes the required
267
+ information.
268
+
269
+ 3. If requested by the Licensor, You must remove any of the
270
+ information required by Section 3(a)(1)(A) to the extent
271
+ reasonably practicable.
272
+
273
+ b. ShareAlike.
274
+
275
+ In addition to the conditions in Section 3(a), if You Share
276
+ Adapted Material You produce, the following conditions also apply.
277
+
278
+ 1. The Adapter's License You apply must be a Creative Commons
279
+ license with the same License Elements, this version or
280
+ later, or a BY-SA Compatible License.
281
+
282
+ 2. You must include the text of, or the URI or hyperlink to, the
283
+ Adapter's License You apply. You may satisfy this condition
284
+ in any reasonable manner based on the medium, means, and
285
+ context in which You Share Adapted Material.
286
+
287
+ 3. You may not offer or impose any additional or different terms
288
+ or conditions on, or apply any Effective Technological
289
+ Measures to, Adapted Material that restrict exercise of the
290
+ rights granted under the Adapter's License You apply.
291
+
292
+
293
+ Section 4 -- Sui Generis Database Rights.
294
+
295
+ Where the Licensed Rights include Sui Generis Database Rights that
296
+ apply to Your use of the Licensed Material:
297
+
298
+ a. for the avoidance of doubt, Section 2(a)(1) grants You the right
299
+ to extract, reuse, reproduce, and Share all or a substantial
300
+ portion of the contents of the database;
301
+
302
+ b. if You include all or a substantial portion of the database
303
+ contents in a database in which You have Sui Generis Database
304
+ Rights, then the database in which You have Sui Generis Database
305
+ Rights (but not its individual contents) is Adapted Material,
306
+
307
+ including for purposes of Section 3(b); and
308
+ c. You must comply with the conditions in Section 3(a) if You Share
309
+ all or a substantial portion of the contents of the database.
310
+
311
+ For the avoidance of doubt, this Section 4 supplements and does not
312
+ replace Your obligations under this Public License where the Licensed
313
+ Rights include other Copyright and Similar Rights.
314
+
315
+
316
+ Section 5 -- Disclaimer of Warranties and Limitation of Liability.
317
+
318
+ a. UNLESS OTHERWISE SEPARATELY UNDERTAKEN BY THE LICENSOR, TO THE
319
+ EXTENT POSSIBLE, THE LICENSOR OFFERS THE LICENSED MATERIAL AS-IS
320
+ AND AS-AVAILABLE, AND MAKES NO REPRESENTATIONS OR WARRANTIES OF
321
+ ANY KIND CONCERNING THE LICENSED MATERIAL, WHETHER EXPRESS,
322
+ IMPLIED, STATUTORY, OR OTHER. THIS INCLUDES, WITHOUT LIMITATION,
323
+ WARRANTIES OF TITLE, MERCHANTABILITY, FITNESS FOR A PARTICULAR
324
+ PURPOSE, NON-INFRINGEMENT, ABSENCE OF LATENT OR OTHER DEFECTS,
325
+ ACCURACY, OR THE PRESENCE OR ABSENCE OF ERRORS, WHETHER OR NOT
326
+ KNOWN OR DISCOVERABLE. WHERE DISCLAIMERS OF WARRANTIES ARE NOT
327
+ ALLOWED IN FULL OR IN PART, THIS DISCLAIMER MAY NOT APPLY TO YOU.
328
+
329
+ b. TO THE EXTENT POSSIBLE, IN NO EVENT WILL THE LICENSOR BE LIABLE
330
+ TO YOU ON ANY LEGAL THEORY (INCLUDING, WITHOUT LIMITATION,
331
+ NEGLIGENCE) OR OTHERWISE FOR ANY DIRECT, SPECIAL, INDIRECT,
332
+ INCIDENTAL, CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES,
333
+ COSTS, EXPENSES, OR DAMAGES ARISING OUT OF THIS PUBLIC LICENSE OR
334
+ USE OF THE LICENSED MATERIAL, EVEN IF THE LICENSOR HAS BEEN
335
+ ADVISED OF THE POSSIBILITY OF SUCH LOSSES, COSTS, EXPENSES, OR
336
+ DAMAGES. WHERE A LIMITATION OF LIABILITY IS NOT ALLOWED IN FULL OR
337
+ IN PART, THIS LIMITATION MAY NOT APPLY TO YOU.
338
+
339
+ c. The disclaimer of warranties and limitation of liability provided
340
+ above shall be interpreted in a manner that, to the extent
341
+ possible, most closely approximates an absolute disclaimer and
342
+ waiver of all liability.
343
+
344
+
345
+ Section 6 -- Term and Termination.
346
+
347
+ a. This Public License applies for the term of the Copyright and
348
+ Similar Rights licensed here. However, if You fail to comply with
349
+ this Public License, then Your rights under this Public License
350
+ terminate automatically.
351
+
352
+ b. Where Your right to use the Licensed Material has terminated under
353
+ Section 6(a), it reinstates:
354
+
355
+ 1. automatically as of the date the violation is cured, provided
356
+ it is cured within 30 days of Your discovery of the
357
+ violation; or
358
+
359
+ 2. upon express reinstatement by the Licensor.
360
+
361
+ For the avoidance of doubt, this Section 6(b) does not affect any
362
+ right the Licensor may have to seek remedies for Your violations
363
+ of this Public License.
364
+
365
+ c. For the avoidance of doubt, the Licensor may also offer the
366
+ Licensed Material under separate terms or conditions or stop
367
+ distributing the Licensed Material at any time; however, doing so
368
+ will not terminate this Public License.
369
+
370
+ d. Sections 1, 5, 6, 7, and 8 survive termination of this Public
371
+ License.
372
+
373
+
374
+ Section 7 -- Other Terms and Conditions.
375
+
376
+ a. The Licensor shall not be bound by any additional or different
377
+ terms or conditions communicated by You unless expressly agreed.
378
+
379
+ b. Any arrangements, understandings, or agreements regarding the
380
+ Licensed Material not stated herein are separate from and
381
+ independent of the terms and conditions of this Public License.
382
+
383
+
384
+ Section 8 -- Interpretation.
385
+
386
+ a. For the avoidance of doubt, this Public License does not, and
387
+ shall not be interpreted to, reduce, limit, restrict, or impose
388
+ conditions on any use of the Licensed Material that could lawfully
389
+ be made without permission under this Public License.
390
+
391
+ b. To the extent possible, if any provision of this Public License is
392
+ deemed unenforceable, it shall be automatically reformed to the
393
+ minimum extent necessary to make it enforceable. If the provision
394
+ cannot be reformed, it shall be severed from this Public License
395
+ without affecting the enforceability of the remaining terms and
396
+ conditions.
397
+
398
+ c. No term or condition of this Public License will be waived and no
399
+ failure to comply consented to unless expressly agreed to by the
400
+ Licensor.
401
+
402
+ d. Nothing in this Public License constitutes or may be interpreted
403
+ as a limitation upon, or waiver of, any privileges and immunities
404
+ that apply to the Licensor or You, including from the legal
405
+ processes of any jurisdiction or authority.
406
+
407
+
408
+ =======================================================================
409
+
410
+ Creative Commons is not a party to its public
411
+ licenses. Notwithstanding, Creative Commons may elect to apply one of
412
+ its public licenses to material it publishes and in those instances
413
+ will be considered the “Licensor.” The text of the Creative Commons
414
+ public licenses is dedicated to the public domain under the CC0 Public
415
+ Domain Dedication. Except for the limited purpose of indicating that
416
+ material is shared under a Creative Commons public license or as
417
+ otherwise permitted by the Creative Commons policies published at
418
+ creativecommons.org/policies, Creative Commons does not authorize the
419
+ use of the trademark "Creative Commons" or any other trademark or logo
420
+ of Creative Commons without its prior written consent including,
421
+ without limitation, in connection with any unauthorized modifications
422
+ to any of its public licenses or any other arrangements,
423
+ understandings, or agreements concerning use of licensed material. For
424
+ the avoidance of doubt, this paragraph does not form part of the
425
+ public licenses.
426
+
427
+ Creative Commons may be contacted at creativecommons.org.
428
+
LICENSES_SOURCES ADDED
@@ -0,0 +1,1024 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Lemmatization Lists
2
+
3
+ * Author: Michal Měchura
4
+ * URL: https://github.com/michmech/lemmatization-lists/
5
+ * License: ODbL
6
+
7
+ ```
8
+ ## ODC Open Database License (ODbL)
9
+
10
+ ### Preamble
11
+
12
+ The Open Database License (ODbL) is a license agreement intended to
13
+ allow users to freely share, modify, and use this Database while
14
+ maintaining this same freedom for others. Many databases are covered by
15
+ copyright, and therefore this document licenses these rights. Some
16
+ jurisdictions, mainly in the European Union, have specific rights that
17
+ cover databases, and so the ODbL addresses these rights, too. Finally,
18
+ the ODbL is also an agreement in contract for users of this Database to
19
+ act in certain ways in return for accessing this Database.
20
+
21
+ Databases can contain a wide variety of types of content (images,
22
+ audiovisual material, and sounds all in the same database, for example),
23
+ and so the ODbL only governs the rights over the Database, and not the
24
+ contents of the Database individually. Licensors should use the ODbL
25
+ together with another license for the contents, if the contents have a
26
+ single set of rights that uniformly covers all of the contents. If the
27
+ contents have multiple sets of different rights, Licensors should
28
+ describe what rights govern what contents together in the individual
29
+ record or in some other way that clarifies what rights apply.
30
+
31
+ Sometimes the contents of a database, or the database itself, can be
32
+ covered by other rights not addressed here (such as private contracts,
33
+ trade mark over the name, or privacy rights / data protection rights
34
+ over information in the contents), and so you are advised that you may
35
+ have to consult other documents or clear other rights before doing
36
+ activities not covered by this License.
37
+
38
+ ------
39
+
40
+ The Licensor (as defined below)
41
+
42
+ and
43
+
44
+ You (as defined below)
45
+
46
+ agree as follows:
47
+
48
+ ### 1.0 Definitions of Capitalised Words
49
+
50
+ "Collective Database" – Means this Database in unmodified form as part
51
+ of a collection of independent databases in themselves that together are
52
+ assembled into a collective whole. A work that constitutes a Collective
53
+ Database will not be considered a Derivative Database.
54
+
55
+ "Convey" – As a verb, means Using the Database, a Derivative Database,
56
+ or the Database as part of a Collective Database in any way that enables
57
+ a Person to make or receive copies of the Database or a Derivative
58
+ Database. Conveying does not include interaction with a user through a
59
+ computer network, or creating and Using a Produced Work, where no
60
+ transfer of a copy of the Database or a Derivative Database occurs.
61
+ "Contents" – The contents of this Database, which includes the
62
+ information, independent works, or other material collected into the
63
+ Database. For example, the contents of the Database could be factual
64
+ data or works such as images, audiovisual material, text, or sounds.
65
+
66
+ "Database" – A collection of material (the Contents) arranged in a
67
+ systematic or methodical way and individually accessible by electronic
68
+ or other means offered under the terms of this License.
69
+
70
+ "Database Directive" – Means Directive 96/9/EC of the European
71
+ Parliament and of the Council of 11 March 1996 on the legal protection
72
+ of databases, as amended or succeeded.
73
+
74
+ "Database Right" – Means rights resulting from the Chapter III ("sui
75
+ generis") rights in the Database Directive (as amended and as transposed
76
+ by member states), which includes the Extraction and Re-utilisation of
77
+ the whole or a Substantial part of the Contents, as well as any similar
78
+ rights available in the relevant jurisdiction under Section 10.4.
79
+
80
+ "Derivative Database" – Means a database based upon the Database, and
81
+ includes any translation, adaptation, arrangement, modification, or any
82
+ other alteration of the Database or of a Substantial part of the
83
+ Contents. This includes, but is not limited to, Extracting or
84
+ Re-utilising the whole or a Substantial part of the Contents in a new
85
+ Database.
86
+
87
+ "Extraction" – Means the permanent or temporary transfer of all or a
88
+ Substantial part of the Contents to another medium by any means or in
89
+ any form.
90
+
91
+ "License" – Means this license agreement and is both a license of rights
92
+ such as copyright and Database Rights and an agreement in contract.
93
+
94
+ "Licensor" – Means the Person that offers the Database under the terms
95
+ of this License.
96
+
97
+ "Person" – Means a natural or legal person or a body of persons
98
+ corporate or incorporate.
99
+
100
+ "Produced Work" – a work (such as an image, audiovisual material, text,
101
+ or sounds) resulting from using the whole or a Substantial part of the
102
+ Contents (via a search or other query) from this Database, a Derivative
103
+ Database, or this Database as part of a Collective Database.
104
+
105
+ "Publicly" – means to Persons other than You or under Your control by
106
+ either more than 50% ownership or by the power to direct their
107
+ activities (such as contracting with an independent consultant).
108
+
109
+ "Re-utilisation" – means any form of making available to the public all
110
+ or a Substantial part of the Contents by the distribution of copies, by
111
+ renting, by online or other forms of transmission.
112
+
113
+ "Substantial" – Means substantial in terms of quantity or quality or a
114
+ combination of both. The repeated and systematic Extraction or
115
+ Re-utilisation of insubstantial parts of the Contents may amount to the
116
+ Extraction or Re-utilisation of a Substantial part of the Contents.
117
+
118
+ "Use" – As a verb, means doing any act that is restricted by copyright
119
+ or Database Rights whether in the original medium or any other; and
120
+ includes without limitation distributing, copying, publicly performing,
121
+ publicly displaying, and preparing derivative works of the Database, as
122
+ well as modifying the Database as may be technically necessary to use it
123
+ in a different mode or format.
124
+
125
+ "You" – Means a Person exercising rights under this License who has not
126
+ previously violated the terms of this License with respect to the
127
+ Database, or who has received express permission from the Licensor to
128
+ exercise rights under this License despite a previous violation.
129
+
130
+ Words in the singular include the plural and vice versa.
131
+
132
+ ### 2.0 What this License covers
133
+
134
+ 2.1. Legal effect of this document. This License is:
135
+
136
+ a. A license of applicable copyright and neighbouring rights;
137
+
138
+ b. A license of the Database Right; and
139
+
140
+ c. An agreement in contract between You and the Licensor.
141
+
142
+ 2.2 Legal rights covered. This License covers the legal rights in the
143
+ Database, including:
144
+
145
+ a. Copyright. Any copyright or neighbouring rights in the Database.
146
+ The copyright licensed includes any individual elements of the
147
+ Database, but does not cover the copyright over the Contents
148
+ independent of this Database. See Section 2.4 for details. Copyright
149
+ law varies between jurisdictions, but is likely to cover: the Database
150
+ model or schema, which is the structure, arrangement, and organisation
151
+ of the Database, and can also include the Database tables and table
152
+ indexes; the data entry and output sheets; and the Field names of
153
+ Contents stored in the Database;
154
+
155
+ b. Database Rights. Database Rights only extend to the Extraction and
156
+ Re-utilisation of the whole or a Substantial part of the Contents.
157
+ Database Rights can apply even when there is no copyright over the
158
+ Database. Database Rights can also apply when the Contents are removed
159
+ from the Database and are selected and arranged in a way that would
160
+ not infringe any applicable copyright; and
161
+
162
+ c. Contract. This is an agreement between You and the Licensor for
163
+ access to the Database. In return you agree to certain conditions of
164
+ use on this access as outlined in this License.
165
+
166
+ 2.3 Rights not covered.
167
+
168
+ a. This License does not apply to computer programs used in the making
169
+ or operation of the Database;
170
+
171
+ b. This License does not cover any patents over the Contents or the
172
+ Database; and
173
+
174
+ c. This License does not cover any trademarks associated with the
175
+ Database.
176
+
177
+ 2.4 Relationship to Contents in the Database. The individual items of
178
+ the Contents contained in this Database may be covered by other rights,
179
+ including copyright, patent, data protection, privacy, or personality
180
+ rights, and this License does not cover any rights (other than Database
181
+ Rights or in contract) in individual Contents contained in the Database.
182
+ For example, if used on a Database of images (the Contents), this
183
+ License would not apply to copyright over individual images, which could
184
+ have their own separate licenses, or one single license covering all of
185
+ the rights over the images.
186
+
187
+ ### 3.0 Rights granted
188
+
189
+ 3.1 Subject to the terms and conditions of this License, the Licensor
190
+ grants to You a worldwide, royalty-free, non-exclusive, terminable (but
191
+ only under Section 9) license to Use the Database for the duration of
192
+ any applicable copyright and Database Rights. These rights explicitly
193
+ include commercial use, and do not exclude any field of endeavour. To
194
+ the extent possible in the relevant jurisdiction, these rights may be
195
+ exercised in all media and formats whether now known or created in the
196
+ future.
197
+
198
+ The rights granted cover, for example:
199
+
200
+ a. Extraction and Re-utilisation of the whole or a Substantial part of
201
+ the Contents;
202
+
203
+ b. Creation of Derivative Databases;
204
+
205
+ c. Creation of Collective Databases;
206
+
207
+ d. Creation of temporary or permanent reproductions by any means and
208
+ in any form, in whole or in part, including of any Derivative
209
+ Databases or as a part of Collective Databases; and
210
+
211
+ e. Distribution, communication, display, lending, making available, or
212
+ performance to the public by any means and in any form, in whole or in
213
+ part, including of any Derivative Database or as a part of Collective
214
+ Databases.
215
+
216
+ 3.2 Compulsory license schemes. For the avoidance of doubt:
217
+
218
+ a. Non-waivable compulsory license schemes. In those jurisdictions in
219
+ which the right to collect royalties through any statutory or
220
+ compulsory licensing scheme cannot be waived, the Licensor reserves
221
+ the exclusive right to collect such royalties for any exercise by You
222
+ of the rights granted under this License;
223
+
224
+ b. Waivable compulsory license schemes. In those jurisdictions in
225
+ which the right to collect royalties through any statutory or
226
+ compulsory licensing scheme can be waived, the Licensor waives the
227
+ exclusive right to collect such royalties for any exercise by You of
228
+ the rights granted under this License; and,
229
+
230
+ c. Voluntary license schemes. The Licensor waives the right to collect
231
+ royalties, whether individually or, in the event that the Licensor is
232
+ a member of a collecting society that administers voluntary licensing
233
+ schemes, via that society, from any exercise by You of the rights
234
+ granted under this License.
235
+
236
+ 3.3 The right to release the Database under different terms, or to stop
237
+ distributing or making available the Database, is reserved. Note that
238
+ this Database may be multiple-licensed, and so You may have the choice
239
+ of using alternative licenses for this Database. Subject to Section
240
+ 10.4, all other rights not expressly granted by Licensor are reserved.
241
+
242
+ ### 4.0 Conditions of Use
243
+
244
+ 4.1 The rights granted in Section 3 above are expressly made subject to
245
+ Your complying with the following conditions of use. These are important
246
+ conditions of this License, and if You fail to follow them, You will be
247
+ in material breach of its terms.
248
+
249
+ 4.2 Notices. If You Publicly Convey this Database, any Derivative
250
+ Database, or the Database as part of a Collective Database, then You
251
+ must:
252
+
253
+ a. Do so only under the terms of this License or another license
254
+ permitted under Section 4.4;
255
+
256
+ b. Include a copy of this License (or, as applicable, a license
257
+ permitted under Section 4.4) or its Uniform Resource Identifier (URI)
258
+ with the Database or Derivative Database, including both in the
259
+ Database or Derivative Database and in any relevant documentation; and
260
+
261
+ c. Keep intact any copyright or Database Right notices and notices
262
+ that refer to this License.
263
+
264
+ d. If it is not possible to put the required notices in a particular
265
+ file due to its structure, then You must include the notices in a
266
+ location (such as a relevant directory) where users would be likely to
267
+ look for it.
268
+
269
+ 4.3 Notice for using output (Contents). Creating and Using a Produced
270
+ Work does not require the notice in Section 4.2. However, if you
271
+ Publicly Use a Produced Work, You must include a notice associated with
272
+ the Produced Work reasonably calculated to make any Person that uses,
273
+ views, accesses, interacts with, or is otherwise exposed to the Produced
274
+ Work aware that Content was obtained from the Database, Derivative
275
+ Database, or the Database as part of a Collective Database, and that it
276
+ is available under this License.
277
+
278
+ a. Example notice. The following text will satisfy notice under
279
+ Section 4.3:
280
+
281
+ Contains information from DATABASE NAME, which is made available
282
+ here under the Open Database License (ODbL).
283
+
284
+ DATABASE NAME should be replaced with the name of the Database and a
285
+ hyperlink to the URI of the Database. "Open Database License" should
286
+ contain a hyperlink to the URI of the text of this License. If
287
+ hyperlinks are not possible, You should include the plain text of the
288
+ required URI's with the above notice.
289
+
290
+ 4.4 Share alike.
291
+
292
+ a. Any Derivative Database that You Publicly Use must be only under
293
+ the terms of:
294
+
295
+ i. This License;
296
+
297
+ ii. A later version of this License similar in spirit to this
298
+ License; or
299
+
300
+ iii. A compatible license.
301
+
302
+ If You license the Derivative Database under one of the licenses
303
+ mentioned in (iii), You must comply with the terms of that license.
304
+
305
+ b. For the avoidance of doubt, Extraction or Re-utilisation of the
306
+ whole or a Substantial part of the Contents into a new database is a
307
+ Derivative Database and must comply with Section 4.4.
308
+
309
+ c. Derivative Databases and Produced Works. A Derivative Database is
310
+ Publicly Used and so must comply with Section 4.4. if a Produced Work
311
+ created from the Derivative Database is Publicly Used.
312
+
313
+ d. Share Alike and additional Contents. For the avoidance of doubt,
314
+ You must not add Contents to Derivative Databases under Section 4.4 a
315
+ that are incompatible with the rights granted under this License.
316
+
317
+ e. Compatible licenses. Licensors may authorise a proxy to determine
318
+ compatible licenses under Section 4.4 a iii. If they do so, the
319
+ authorised proxy's public statement of acceptance of a compatible
320
+ license grants You permission to use the compatible license.
321
+
322
+
323
+ 4.5 Limits of Share Alike. The requirements of Section 4.4 do not apply
324
+ in the following:
325
+
326
+ a. For the avoidance of doubt, You are not required to license
327
+ Collective Databases under this License if You incorporate this
328
+ Database or a Derivative Database in the collection, but this License
329
+ still applies to this Database or a Derivative Database as a part of
330
+ the Collective Database;
331
+
332
+ b. Using this Database, a Derivative Database, or this Database as
333
+ part of a Collective Database to create a Produced Work does not
334
+ create a Derivative Database for purposes of Section 4.4; and
335
+
336
+ c. Use of a Derivative Database internally within an organisation is
337
+ not to the public and therefore does not fall under the requirements
338
+ of Section 4.4.
339
+
340
+ 4.6 Access to Derivative Databases. If You Publicly Use a Derivative
341
+ Database or a Produced Work from a Derivative Database, You must also
342
+ offer to recipients of the Derivative Database or Produced Work a copy
343
+ in a machine readable form of:
344
+
345
+ a. The entire Derivative Database; or
346
+
347
+ b. A file containing all of the alterations made to the Database or
348
+ the method of making the alterations to the Database (such as an
349
+ algorithm), including any additional Contents, that make up all the
350
+ differences between the Database and the Derivative Database.
351
+
352
+ The Derivative Database (under a.) or alteration file (under b.) must be
353
+ available at no more than a reasonable production cost for physical
354
+ distributions and free of charge if distributed over the internet.
355
+
356
+ 4.7 Technological measures and additional terms
357
+
358
+ a. This License does not allow You to impose (except subject to
359
+ Section 4.7 b.) any terms or any technological measures on the
360
+ Database, a Derivative Database, or the whole or a Substantial part of
361
+ the Contents that alter or restrict the terms of this License, or any
362
+ rights granted under it, or have the effect or intent of restricting
363
+ the ability of any person to exercise those rights.
364
+
365
+ b. Parallel distribution. You may impose terms or technological
366
+ measures on the Database, a Derivative Database, or the whole or a
367
+ Substantial part of the Contents (a "Restricted Database") in
368
+ contravention of Section 4.74 a. only if You also make a copy of the
369
+ Database or a Derivative Database available to the recipient of the
370
+ Restricted Database:
371
+
372
+ i. That is available without additional fee;
373
+
374
+ ii. That is available in a medium that does not alter or restrict
375
+ the terms of this License, or any rights granted under it, or have
376
+ the effect or intent of restricting the ability of any person to
377
+ exercise those rights (an "Unrestricted Database"); and
378
+
379
+ iii. The Unrestricted Database is at least as accessible to the
380
+ recipient as a practical matter as the Restricted Database.
381
+
382
+ c. For the avoidance of doubt, You may place this Database or a
383
+ Derivative Database in an authenticated environment, behind a
384
+ password, or within a similar access control scheme provided that You
385
+ do not alter or restrict the terms of this License or any rights
386
+ granted under it or have the effect or intent of restricting the
387
+ ability of any person to exercise those rights.
388
+
389
+ 4.8 Licensing of others. You may not sublicense the Database. Each time
390
+ You communicate the Database, the whole or Substantial part of the
391
+ Contents, or any Derivative Database to anyone else in any way, the
392
+ Licensor offers to the recipient a license to the Database on the same
393
+ terms and conditions as this License. You are not responsible for
394
+ enforcing compliance by third parties with this License, but You may
395
+ enforce any rights that You have over a Derivative Database. You are
396
+ solely responsible for any modifications of a Derivative Database made
397
+ by You or another Person at Your direction. You may not impose any
398
+ further restrictions on the exercise of the rights granted or affirmed
399
+ under this License.
400
+
401
+ ### 5.0 Moral rights
402
+
403
+ 5.1 Moral rights. This section covers moral rights, including any rights
404
+ to be identified as the author of the Database or to object to treatment
405
+ that would otherwise prejudice the author's honour and reputation, or
406
+ any other derogatory treatment:
407
+
408
+ a. For jurisdictions allowing waiver of moral rights, Licensor waives
409
+ all moral rights that Licensor may have in the Database to the fullest
410
+ extent possible by the law of the relevant jurisdiction under Section
411
+ 10.4;
412
+
413
+ b. If waiver of moral rights under Section 5.1 a in the relevant
414
+ jurisdiction is not possible, Licensor agrees not to assert any moral
415
+ rights over the Database and waives all claims in moral rights to the
416
+ fullest extent possible by the law of the relevant jurisdiction under
417
+ Section 10.4; and
418
+
419
+ c. For jurisdictions not allowing waiver or an agreement not to assert
420
+ moral rights under Section 5.1 a and b, the author may retain their
421
+ moral rights over certain aspects of the Database.
422
+
423
+ Please note that some jurisdictions do not allow for the waiver of moral
424
+ rights, and so moral rights may still subsist over the Database in some
425
+ jurisdictions.
426
+
427
+ ### 6.0 Fair dealing, Database exceptions, and other rights not affected
428
+
429
+ 6.1 This License does not affect any rights that You or anyone else may
430
+ independently have under any applicable law to make any use of this
431
+ Database, including without limitation:
432
+
433
+ a. Exceptions to the Database Right including: Extraction of Contents
434
+ from non-electronic Databases for private purposes, Extraction for
435
+ purposes of illustration for teaching or scientific research, and
436
+ Extraction or Re-utilisation for public security or an administrative
437
+ or judicial procedure.
438
+
439
+ b. Fair dealing, fair use, or any other legally recognised limitation
440
+ or exception to infringement of copyright or other applicable laws.
441
+
442
+ 6.2 This License does not affect any rights of lawful users to Extract
443
+ and Re-utilise insubstantial parts of the Contents, evaluated
444
+ quantitatively or qualitatively, for any purposes whatsoever, including
445
+ creating a Derivative Database (subject to other rights over the
446
+ Contents, see Section 2.4). The repeated and systematic Extraction or
447
+ Re-utilisation of insubstantial parts of the Contents may however amount
448
+ to the Extraction or Re-utilisation of a Substantial part of the
449
+ Contents.
450
+
451
+ ### 7.0 Warranties and Disclaimer
452
+
453
+ 7.1 The Database is licensed by the Licensor "as is" and without any
454
+ warranty of any kind, either express, implied, or arising by statute,
455
+ custom, course of dealing, or trade usage. Licensor specifically
456
+ disclaims any and all implied warranties or conditions of title,
457
+ non-infringement, accuracy or completeness, the presence or absence of
458
+ errors, fitness for a particular purpose, merchantability, or otherwise.
459
+ Some jurisdictions do not allow the exclusion of implied warranties, so
460
+ this exclusion may not apply to You.
461
+
462
+ ### 8.0 Limitation of liability
463
+
464
+ 8.1 Subject to any liability that may not be excluded or limited by law,
465
+ the Licensor is not liable for, and expressly excludes, all liability
466
+ for loss or damage however and whenever caused to anyone by any use
467
+ under this License, whether by You or by anyone else, and whether caused
468
+ by any fault on the part of the Licensor or not. This exclusion of
469
+ liability includes, but is not limited to, any special, incidental,
470
+ consequential, punitive, or exemplary damages such as loss of revenue,
471
+ data, anticipated profits, and lost business. This exclusion applies
472
+ even if the Licensor has been advised of the possibility of such
473
+ damages.
474
+
475
+ 8.2 If liability may not be excluded by law, it is limited to actual and
476
+ direct financial loss to the extent it is caused by proved negligence on
477
+ the part of the Licensor.
478
+
479
+ ### 9.0 Termination of Your rights under this License
480
+
481
+ 9.1 Any breach by You of the terms and conditions of this License
482
+ automatically terminates this License with immediate effect and without
483
+ notice to You. For the avoidance of doubt, Persons who have received the
484
+ Database, the whole or a Substantial part of the Contents, Derivative
485
+ Databases, or the Database as part of a Collective Database from You
486
+ under this License will not have their licenses terminated provided
487
+ their use is in full compliance with this License or a license granted
488
+ under Section 4.8 of this License. Sections 1, 2, 7, 8, 9 and 10 will
489
+ survive any termination of this License.
490
+
491
+ 9.2 If You are not in breach of the terms of this License, the Licensor
492
+ will not terminate Your rights under it.
493
+
494
+ 9.3 Unless terminated under Section 9.1, this License is granted to You
495
+ for the duration of applicable rights in the Database.
496
+
497
+ 9.4 Reinstatement of rights. If you cease any breach of the terms and
498
+ conditions of this License, then your full rights under this License
499
+ will be reinstated:
500
+
501
+ a. Provisionally and subject to permanent termination until the 60th
502
+ day after cessation of breach;
503
+
504
+ b. Permanently on the 60th day after cessation of breach unless
505
+ otherwise reasonably notified by the Licensor; or
506
+
507
+ c. Permanently if reasonably notified by the Licensor of the
508
+ violation, this is the first time You have received notice of
509
+ violation of this License from the Licensor, and You cure the
510
+ violation prior to 30 days after your receipt of the notice.
511
+
512
+ Persons subject to permanent termination of rights are not eligible to
513
+ be a recipient and receive a license under Section 4.8.
514
+
515
+ 9.5 Notwithstanding the above, Licensor reserves the right to release
516
+ the Database under different license terms or to stop distributing or
517
+ making available the Database. Releasing the Database under different
518
+ license terms or stopping the distribution of the Database will not
519
+ withdraw this License (or any other license that has been, or is
520
+ required to be, granted under the terms of this License), and this
521
+ License will continue in full force and effect unless terminated as
522
+ stated above.
523
+
524
+ ### 10.0 General
525
+
526
+ 10.1 If any provision of this License is held to be invalid or
527
+ unenforceable, that must not affect the validity or enforceability of
528
+ the remainder of the terms and conditions of this License and each
529
+ remaining provision of this License shall be valid and enforced to the
530
+ fullest extent permitted by law.
531
+
532
+ 10.2 This License is the entire agreement between the parties with
533
+ respect to the rights granted here over the Database. It replaces any
534
+ earlier understandings, agreements or representations with respect to
535
+ the Database.
536
+
537
+ 10.3 If You are in breach of the terms of this License, You will not be
538
+ entitled to rely on the terms of this License or to complain of any
539
+ breach by the Licensor.
540
+
541
+ 10.4 Choice of law. This License takes effect in and will be governed by
542
+ the laws of the relevant jurisdiction in which the License terms are
543
+ sought to be enforced. If the standard suite of rights granted under
544
+ applicable copyright law and Database Rights in the relevant
545
+ jurisdiction includes additional rights not granted under this License,
546
+ these additional rights are granted in this License in order to meet the
547
+ terms of this License.```
548
+
549
+
550
+
551
+
552
+ # UD Romanian RRT v2.5
553
+
554
+ * Author: Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin
555
+ * URL: https://github.com/UniversalDependencies/UD_Romanian-RRT
556
+ * License: CC BY-SA 4.0
557
+
558
+ ```
559
+ Attribution-ShareAlike 4.0 International
560
+
561
+ =======================================================================
562
+
563
+ Creative Commons Corporation ("Creative Commons") is not a law firm and
564
+ does not provide legal services or legal advice. Distribution of
565
+ Creative Commons public licenses does not create a lawyer-client or
566
+ other relationship. Creative Commons makes its licenses and related
567
+ information available on an "as-is" basis. Creative Commons gives no
568
+ warranties regarding its licenses, any material licensed under their
569
+ terms and conditions, or any related information. Creative Commons
570
+ disclaims all liability for damages resulting from their use to the
571
+ fullest extent possible.
572
+
573
+ Using Creative Commons Public Licenses
574
+
575
+ Creative Commons public licenses provide a standard set of terms and
576
+ conditions that creators and other rights holders may use to share
577
+ original works of authorship and other material subject to copyright
578
+ and certain other rights specified in the public license below. The
579
+ following considerations are for informational purposes only, are not
580
+ exhaustive, and do not form part of our licenses.
581
+
582
+ Considerations for licensors: Our public licenses are
583
+ intended for use by those authorized to give the public
584
+ permission to use material in ways otherwise restricted by
585
+ copyright and certain other rights. Our licenses are
586
+ irrevocable. Licensors should read and understand the terms
587
+ and conditions of the license they choose before applying it.
588
+ Licensors should also secure all rights necessary before
589
+ applying our licenses so that the public can reuse the
590
+ material as expected. Licensors should clearly mark any
591
+ material not subject to the license. This includes other CC-
592
+ licensed material, or material used under an exception or
593
+ limitation to copyright. More considerations for licensors:
594
+ wiki.creativecommons.org/Considerations_for_licensors
595
+
596
+ Considerations for the public: By using one of our public
597
+ licenses, a licensor grants the public permission to use the
598
+ licensed material under specified terms and conditions. If
599
+ the licensor's permission is not necessary for any reason--for
600
+ example, because of any applicable exception or limitation to
601
+ copyright--then that use is not regulated by the license. Our
602
+ licenses grant only permissions under copyright and certain
603
+ other rights that a licensor has authority to grant. Use of
604
+ the licensed material may still be restricted for other
605
+ reasons, including because others have copyright or other
606
+ rights in the material. A licensor may make special requests,
607
+ such as asking that all changes be marked or described.
608
+ Although not required by our licenses, you are encouraged to
609
+ respect those requests where reasonable. More considerations
610
+ for the public:
611
+ wiki.creativecommons.org/Considerations_for_licensees
612
+
613
+ =======================================================================
614
+
615
+ Creative Commons Attribution-ShareAlike 4.0 International Public
616
+ License
617
+
618
+ By exercising the Licensed Rights (defined below), You accept and agree
619
+ to be bound by the terms and conditions of this Creative Commons
620
+ Attribution-ShareAlike 4.0 International Public License ("Public
621
+ License"). To the extent this Public License may be interpreted as a
622
+ contract, You are granted the Licensed Rights in consideration of Your
623
+ acceptance of these terms and conditions, and the Licensor grants You
624
+ such rights in consideration of benefits the Licensor receives from
625
+ making the Licensed Material available under these terms and
626
+ conditions.
627
+
628
+
629
+ Section 1 -- Definitions.
630
+
631
+ a. Adapted Material means material subject to Copyright and Similar
632
+ Rights that is derived from or based upon the Licensed Material
633
+ and in which the Licensed Material is translated, altered,
634
+ arranged, transformed, or otherwise modified in a manner requiring
635
+ permission under the Copyright and Similar Rights held by the
636
+ Licensor. For purposes of this Public License, where the Licensed
637
+ Material is a musical work, performance, or sound recording,
638
+ Adapted Material is always produced where the Licensed Material is
639
+ synched in timed relation with a moving image.
640
+
641
+ b. Adapter's License means the license You apply to Your Copyright
642
+ and Similar Rights in Your contributions to Adapted Material in
643
+ accordance with the terms and conditions of this Public License.
644
+
645
+ c. BY-SA Compatible License means a license listed at
646
+ creativecommons.org/compatiblelicenses, approved by Creative
647
+ Commons as essentially the equivalent of this Public License.
648
+
649
+ d. Copyright and Similar Rights means copyright and/or similar rights
650
+ closely related to copyright including, without limitation,
651
+ performance, broadcast, sound recording, and Sui Generis Database
652
+ Rights, without regard to how the rights are labeled or
653
+ categorized. For purposes of this Public License, the rights
654
+ specified in Section 2(b)(1)-(2) are not Copyright and Similar
655
+ Rights.
656
+
657
+ e. Effective Technological Measures means those measures that, in the
658
+ absence of proper authority, may not be circumvented under laws
659
+ fulfilling obligations under Article 11 of the WIPO Copyright
660
+ Treaty adopted on December 20, 1996, and/or similar international
661
+ agreements.
662
+
663
+ f. Exceptions and Limitations means fair use, fair dealing, and/or
664
+ any other exception or limitation to Copyright and Similar Rights
665
+ that applies to Your use of the Licensed Material.
666
+
667
+ g. License Elements means the license attributes listed in the name
668
+ of a Creative Commons Public License. The License Elements of this
669
+ Public License are Attribution and ShareAlike.
670
+
671
+ h. Licensed Material means the artistic or literary work, database,
672
+ or other material to which the Licensor applied this Public
673
+ License.
674
+
675
+ i. Licensed Rights means the rights granted to You subject to the
676
+ terms and conditions of this Public License, which are limited to
677
+ all Copyright and Similar Rights that apply to Your use of the
678
+ Licensed Material and that the Licensor has authority to license.
679
+
680
+ j. Licensor means the individual(s) or entity(ies) granting rights
681
+ under this Public License.
682
+
683
+ k. Share means to provide material to the public by any means or
684
+ process that requires permission under the Licensed Rights, such
685
+ as reproduction, public display, public performance, distribution,
686
+ dissemination, communication, or importation, and to make material
687
+ available to the public including in ways that members of the
688
+ public may access the material from a place and at a time
689
+ individually chosen by them.
690
+
691
+ l. Sui Generis Database Rights means rights other than copyright
692
+ resulting from Directive 96/9/EC of the European Parliament and of
693
+ the Council of 11 March 1996 on the legal protection of databases,
694
+ as amended and/or succeeded, as well as other essentially
695
+ equivalent rights anywhere in the world.
696
+
697
+ m. You means the individual or entity exercising the Licensed Rights
698
+ under this Public License. Your has a corresponding meaning.
699
+
700
+
701
+ Section 2 -- Scope.
702
+
703
+ a. License grant.
704
+
705
+ 1. Subject to the terms and conditions of this Public License,
706
+ the Licensor hereby grants You a worldwide, royalty-free,
707
+ non-sublicensable, non-exclusive, irrevocable license to
708
+ exercise the Licensed Rights in the Licensed Material to:
709
+
710
+ a. reproduce and Share the Licensed Material, in whole or
711
+ in part; and
712
+
713
+ b. produce, reproduce, and Share Adapted Material.
714
+
715
+ 2. Exceptions and Limitations. For the avoidance of doubt, where
716
+ Exceptions and Limitations apply to Your use, this Public
717
+ License does not apply, and You do not need to comply with
718
+ its terms and conditions.
719
+
720
+ 3. Term. The term of this Public License is specified in Section
721
+ 6(a).
722
+
723
+ 4. Media and formats; technical modifications allowed. The
724
+ Licensor authorizes You to exercise the Licensed Rights in
725
+ all media and formats whether now known or hereafter created,
726
+ and to make technical modifications necessary to do so. The
727
+ Licensor waives and/or agrees not to assert any right or
728
+ authority to forbid You from making technical modifications
729
+ necessary to exercise the Licensed Rights, including
730
+ technical modifications necessary to circumvent Effective
731
+ Technological Measures. For purposes of this Public License,
732
+ simply making modifications authorized by this Section 2(a)
733
+ (4) never produces Adapted Material.
734
+
735
+ 5. Downstream recipients.
736
+
737
+ a. Offer from the Licensor -- Licensed Material. Every
738
+ recipient of the Licensed Material automatically
739
+ receives an offer from the Licensor to exercise the
740
+ Licensed Rights under the terms and conditions of this
741
+ Public License.
742
+
743
+ b. Additional offer from the Licensor -- Adapted Material.
744
+ Every recipient of Adapted Material from You
745
+ automatically receives an offer from the Licensor to
746
+ exercise the Licensed Rights in the Adapted Material
747
+ under the conditions of the Adapter's License You apply.
748
+
749
+ c. No downstream restrictions. You may not offer or impose
750
+ any additional or different terms or conditions on, or
751
+ apply any Effective Technological Measures to, the
752
+ Licensed Material if doing so restricts exercise of the
753
+ Licensed Rights by any recipient of the Licensed
754
+ Material.
755
+
756
+ 6. No endorsement. Nothing in this Public License constitutes or
757
+ may be construed as permission to assert or imply that You
758
+ are, or that Your use of the Licensed Material is, connected
759
+ with, or sponsored, endorsed, or granted official status by,
760
+ the Licensor or others designated to receive attribution as
761
+ provided in Section 3(a)(1)(A)(i).
762
+
763
+ b. Other rights.
764
+
765
+ 1. Moral rights, such as the right of integrity, are not
766
+ licensed under this Public License, nor are publicity,
767
+ privacy, and/or other similar personality rights; however, to
768
+ the extent possible, the Licensor waives and/or agrees not to
769
+ assert any such rights held by the Licensor to the limited
770
+ extent necessary to allow You to exercise the Licensed
771
+ Rights, but not otherwise.
772
+
773
+ 2. Patent and trademark rights are not licensed under this
774
+ Public License.
775
+
776
+ 3. To the extent possible, the Licensor waives any right to
777
+ collect royalties from You for the exercise of the Licensed
778
+ Rights, whether directly or through a collecting society
779
+ under any voluntary or waivable statutory or compulsory
780
+ licensing scheme. In all other cases the Licensor expressly
781
+ reserves any right to collect such royalties.
782
+
783
+
784
+ Section 3 -- License Conditions.
785
+
786
+ Your exercise of the Licensed Rights is expressly made subject to the
787
+ following conditions.
788
+
789
+ a. Attribution.
790
+
791
+ 1. If You Share the Licensed Material (including in modified
792
+ form), You must:
793
+
794
+ a. retain the following if it is supplied by the Licensor
795
+ with the Licensed Material:
796
+
797
+ i. identification of the creator(s) of the Licensed
798
+ Material and any others designated to receive
799
+ attribution, in any reasonable manner requested by
800
+ the Licensor (including by pseudonym if
801
+ designated);
802
+
803
+ ii. a copyright notice;
804
+
805
+ iii. a notice that refers to this Public License;
806
+
807
+ iv. a notice that refers to the disclaimer of
808
+ warranties;
809
+
810
+ v. a URI or hyperlink to the Licensed Material to the
811
+ extent reasonably practicable;
812
+
813
+ b. indicate if You modified the Licensed Material and
814
+ retain an indication of any previous modifications; and
815
+
816
+ c. indicate the Licensed Material is licensed under this
817
+ Public License, and include the text of, or the URI or
818
+ hyperlink to, this Public License.
819
+
820
+ 2. You may satisfy the conditions in Section 3(a)(1) in any
821
+ reasonable manner based on the medium, means, and context in
822
+ which You Share the Licensed Material. For example, it may be
823
+ reasonable to satisfy the conditions by providing a URI or
824
+ hyperlink to a resource that includes the required
825
+ information.
826
+
827
+ 3. If requested by the Licensor, You must remove any of the
828
+ information required by Section 3(a)(1)(A) to the extent
829
+ reasonably practicable.
830
+
831
+ b. ShareAlike.
832
+
833
+ In addition to the conditions in Section 3(a), if You Share
834
+ Adapted Material You produce, the following conditions also apply.
835
+
836
+ 1. The Adapter's License You apply must be a Creative Commons
837
+ license with the same License Elements, this version or
838
+ later, or a BY-SA Compatible License.
839
+
840
+ 2. You must include the text of, or the URI or hyperlink to, the
841
+ Adapter's License You apply. You may satisfy this condition
842
+ in any reasonable manner based on the medium, means, and
843
+ context in which You Share Adapted Material.
844
+
845
+ 3. You may not offer or impose any additional or different terms
846
+ or conditions on, or apply any Effective Technological
847
+ Measures to, Adapted Material that restrict exercise of the
848
+ rights granted under the Adapter's License You apply.
849
+
850
+
851
+ Section 4 -- Sui Generis Database Rights.
852
+
853
+ Where the Licensed Rights include Sui Generis Database Rights that
854
+ apply to Your use of the Licensed Material:
855
+
856
+ a. for the avoidance of doubt, Section 2(a)(1) grants You the right
857
+ to extract, reuse, reproduce, and Share all or a substantial
858
+ portion of the contents of the database;
859
+
860
+ b. if You include all or a substantial portion of the database
861
+ contents in a database in which You have Sui Generis Database
862
+ Rights, then the database in which You have Sui Generis Database
863
+ Rights (but not its individual contents) is Adapted Material,
864
+
865
+ including for purposes of Section 3(b); and
866
+ c. You must comply with the conditions in Section 3(a) if You Share
867
+ all or a substantial portion of the contents of the database.
868
+
869
+ For the avoidance of doubt, this Section 4 supplements and does not
870
+ replace Your obligations under this Public License where the Licensed
871
+ Rights include other Copyright and Similar Rights.
872
+
873
+
874
+ Section 5 -- Disclaimer of Warranties and Limitation of Liability.
875
+
876
+ a. UNLESS OTHERWISE SEPARATELY UNDERTAKEN BY THE LICENSOR, TO THE
877
+ EXTENT POSSIBLE, THE LICENSOR OFFERS THE LICENSED MATERIAL AS-IS
878
+ AND AS-AVAILABLE, AND MAKES NO REPRESENTATIONS OR WARRANTIES OF
879
+ ANY KIND CONCERNING THE LICENSED MATERIAL, WHETHER EXPRESS,
880
+ IMPLIED, STATUTORY, OR OTHER. THIS INCLUDES, WITHOUT LIMITATION,
881
+ WARRANTIES OF TITLE, MERCHANTABILITY, FITNESS FOR A PARTICULAR
882
+ PURPOSE, NON-INFRINGEMENT, ABSENCE OF LATENT OR OTHER DEFECTS,
883
+ ACCURACY, OR THE PRESENCE OR ABSENCE OF ERRORS, WHETHER OR NOT
884
+ KNOWN OR DISCOVERABLE. WHERE DISCLAIMERS OF WARRANTIES ARE NOT
885
+ ALLOWED IN FULL OR IN PART, THIS DISCLAIMER MAY NOT APPLY TO YOU.
886
+
887
+ b. TO THE EXTENT POSSIBLE, IN NO EVENT WILL THE LICENSOR BE LIABLE
888
+ TO YOU ON ANY LEGAL THEORY (INCLUDING, WITHOUT LIMITATION,
889
+ NEGLIGENCE) OR OTHERWISE FOR ANY DIRECT, SPECIAL, INDIRECT,
890
+ INCIDENTAL, CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES,
891
+ COSTS, EXPENSES, OR DAMAGES ARISING OUT OF THIS PUBLIC LICENSE OR
892
+ USE OF THE LICENSED MATERIAL, EVEN IF THE LICENSOR HAS BEEN
893
+ ADVISED OF THE POSSIBILITY OF SUCH LOSSES, COSTS, EXPENSES, OR
894
+ DAMAGES. WHERE A LIMITATION OF LIABILITY IS NOT ALLOWED IN FULL OR
895
+ IN PART, THIS LIMITATION MAY NOT APPLY TO YOU.
896
+
897
+ c. The disclaimer of warranties and limitation of liability provided
898
+ above shall be interpreted in a manner that, to the extent
899
+ possible, most closely approximates an absolute disclaimer and
900
+ waiver of all liability.
901
+
902
+
903
+ Section 6 -- Term and Termination.
904
+
905
+ a. This Public License applies for the term of the Copyright and
906
+ Similar Rights licensed here. However, if You fail to comply with
907
+ this Public License, then Your rights under this Public License
908
+ terminate automatically.
909
+
910
+ b. Where Your right to use the Licensed Material has terminated under
911
+ Section 6(a), it reinstates:
912
+
913
+ 1. automatically as of the date the violation is cured, provided
914
+ it is cured within 30 days of Your discovery of the
915
+ violation; or
916
+
917
+ 2. upon express reinstatement by the Licensor.
918
+
919
+ For the avoidance of doubt, this Section 6(b) does not affect any
920
+ right the Licensor may have to seek remedies for Your violations
921
+ of this Public License.
922
+
923
+ c. For the avoidance of doubt, the Licensor may also offer the
924
+ Licensed Material under separate terms or conditions or stop
925
+ distributing the Licensed Material at any time; however, doing so
926
+ will not terminate this Public License.
927
+
928
+ d. Sections 1, 5, 6, 7, and 8 survive termination of this Public
929
+ License.
930
+
931
+
932
+ Section 7 -- Other Terms and Conditions.
933
+
934
+ a. The Licensor shall not be bound by any additional or different
935
+ terms or conditions communicated by You unless expressly agreed.
936
+
937
+ b. Any arrangements, understandings, or agreements regarding the
938
+ Licensed Material not stated herein are separate from and
939
+ independent of the terms and conditions of this Public License.
940
+
941
+
942
+ Section 8 -- Interpretation.
943
+
944
+ a. For the avoidance of doubt, this Public License does not, and
945
+ shall not be interpreted to, reduce, limit, restrict, or impose
946
+ conditions on any use of the Licensed Material that could lawfully
947
+ be made without permission under this Public License.
948
+
949
+ b. To the extent possible, if any provision of this Public License is
950
+ deemed unenforceable, it shall be automatically reformed to the
951
+ minimum extent necessary to make it enforceable. If the provision
952
+ cannot be reformed, it shall be severed from this Public License
953
+ without affecting the enforceability of the remaining terms and
954
+ conditions.
955
+
956
+ c. No term or condition of this Public License will be waived and no
957
+ failure to comply consented to unless expressly agreed to by the
958
+ Licensor.
959
+
960
+ d. Nothing in this Public License constitutes or may be interpreted
961
+ as a limitation upon, or waiver of, any privileges and immunities
962
+ that apply to the Licensor or You, including from the legal
963
+ processes of any jurisdiction or authority.
964
+
965
+
966
+ =======================================================================
967
+
968
+ Creative Commons is not a party to its public
969
+ licenses. Notwithstanding, Creative Commons may elect to apply one of
970
+ its public licenses to material it publishes and in those instances
971
+ will be considered the “Licensor.” The text of the Creative Commons
972
+ public licenses is dedicated to the public domain under the CC0 Public
973
+ Domain Dedication. Except for the limited purpose of indicating that
974
+ material is shared under a Creative Commons public license or as
975
+ otherwise permitted by the Creative Commons policies published at
976
+ creativecommons.org/policies, Creative Commons does not authorize the
977
+ use of the trademark "Creative Commons" or any other trademark or logo
978
+ of Creative Commons without its prior written consent including,
979
+ without limitation, in connection with any unauthorized modifications
980
+ to any of its public licenses or any other arrangements,
981
+ understandings, or agreements concerning use of licensed material. For
982
+ the avoidance of doubt, this paragraph does not form part of the
983
+ public licenses.
984
+
985
+ Creative Commons may be contacted at creativecommons.org.
986
+
987
+ ```
988
+
989
+
990
+
991
+
992
+ # RONEC - the Romanian Named Entity Corpus (ca9ce460)
993
+
994
+ * Author: Dumitrescu, Stefan Daniel; Avram, Andrei-Marius; Morogan, Luciana; Toma; Stefan
995
+ * URL: https://github.com/dumitrescustefan/ronec
996
+ * License: MIT
997
+
998
+ ```
999
+
1000
+ MIT License
1001
+
1002
+ Copyright (c) 2018 Stefan Dumitrescu
1003
+
1004
+ Permission is hereby granted, free of charge, to any person obtaining a copy
1005
+ of this software and associated documentation files (the "Software"), to deal
1006
+ in the Software without restriction, including without limitation the rights
1007
+ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
1008
+ copies of the Software, and to permit persons to whom the Software is
1009
+ furnished to do so, subject to the following conditions:
1010
+
1011
+ The above copyright notice and this permission notice shall be included in all
1012
+ copies or substantial portions of the Software.
1013
+
1014
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
1015
+ IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
1016
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
1017
+ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
1018
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
1019
+ OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
1020
+ SOFTWARE.```
1021
+
1022
+
1023
+
1024
+
README.md ADDED
@@ -0,0 +1,106 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - spacy
4
+ - token-classification
5
+ language:
6
+ - ro
7
+ license: CC-BY-SA-4.0
8
+ model-index:
9
+ - name: ro_core_news_sm
10
+ results:
11
+ - tasks:
12
+ name: NER
13
+ type: token-classification
14
+ metrics:
15
+ - name: Precision
16
+ type: precision
17
+ value: 0.7244897959
18
+ - name: Recall
19
+ type: recall
20
+ value: 0.7091817134
21
+ - name: F Score
22
+ type: f_score
23
+ value: 0.7167540283
24
+ - tasks:
25
+ name: POS
26
+ type: token-classification
27
+ metrics:
28
+ - name: Accuracy
29
+ type: accuracy
30
+ value: 0.9612171026
31
+ - tasks:
32
+ name: SENTER
33
+ type: token-classification
34
+ metrics:
35
+ - name: Precision
36
+ type: precision
37
+ value: 0.950928382
38
+ - name: Recall
39
+ type: recall
40
+ value: 0.9534574468
41
+ - name: F Score
42
+ type: f_score
43
+ value: 0.9521912351
44
+ - tasks:
45
+ name: UNLABELED_DEPENDENCIES
46
+ type: token-classification
47
+ metrics:
48
+ - name: Accuracy
49
+ type: accuracy
50
+ value: 0.8705660969
51
+ - tasks:
52
+ name: LABELED_DEPENDENCIES
53
+ type: token-classification
54
+ metrics:
55
+ - name: Accuracy
56
+ type: accuracy
57
+ value: 0.8705660969
58
+ ---
59
+ ### Details: https://spacy.io/models/ro#ro_core_news_sm
60
+
61
+ Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.
62
+
63
+ | Feature | Description |
64
+ | --- | --- |
65
+ | **Name** | `ro_core_news_sm` |
66
+ | **Version** | `3.1.0` |
67
+ | **spaCy** | `>=3.1.0,<3.2.0` |
68
+ | **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
69
+ | **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
70
+ | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
71
+ | **Sources** | [Lemmatization Lists](https://github.com/michmech/lemmatization-lists/) (Michal Měchura)<br />[UD Romanian RRT v2.5](https://github.com/UniversalDependencies/UD_Romanian-RRT) (Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin)<br />[RONEC - the Romanian Named Entity Corpus (ca9ce460)](https://github.com/dumitrescustefan/ronec) (Dumitrescu, Stefan Daniel; Avram, Andrei-Marius; Morogan, Luciana; Toma; Stefan) |
72
+ | **License** | `CC BY-SA 4.0` |
73
+ | **Author** | [Explosion](https://explosion.ai) |
74
+
75
+ ### Label Scheme
76
+
77
+ <details>
78
+
79
+ <summary>View label scheme (534 labels for 4 components)</summary>
80
+
81
+ | Component | Labels |
82
+ | --- | --- |
83
+ | **`tagger`** | `ARROW`, `Af`, `Afcfp-n`, `Afcfson`, `Afcfsrn`, `Afcmpoy`, `Afcms-n`, `Afp`, `Afp-p-n`, `Afp-poy`, `Afpf--n`, `Afpfp-n`, `Afpfp-ny`, `Afpfpoy`, `Afpfpry`, `Afpfson`, `Afpfsoy`, `Afpfsrn`, `Afpfsry`, `Afpm--n`, `Afpmp-n`, `Afpmpoy`, `Afpmpry`, `Afpms-n`, `Afpmsoy`, `Afpmsry`, `Afsfp-n`, `Afsfsrn`, `BULLET`, `COLON`, `COMMA`, `Ccssp`, `Ccsspy`, `Crssp`, `Csssp`, `Cssspy`, `DASH`, `DBLQ`, `Dd3-po---e`, `Dd3-po---o`, `Dd3fpo`, `Dd3fpr`, `Dd3fpr---e`, `Dd3fpr---o`, `Dd3fpr--y`, `Dd3fso`, `Dd3fso---e`, `Dd3fsr`, `Dd3fsr---e`, `Dd3fsr---o`, `Dd3fsr--yo`, `Dd3mpo`, `Dd3mpr`, `Dd3mpr---e`, `Dd3mpr---o`, `Dd3mso---e`, `Dd3msr`, `Dd3msr---e`, `Dd3msr---o`, `Dh1ms`, `Dh3fp`, `Dh3fso`, `Dh3fsr`, `Dh3mp`, `Dh3ms`, `Di3`, `Di3-----y`, `Di3--r---e`, `Di3-po`, `Di3-po---e`, `Di3-sr`, `Di3-sr---e`, `Di3-sr--y`, `Di3fp`, `Di3fpr`, `Di3fpr---e`, `Di3fso`, `Di3fso---e`, `Di3fsr`, `Di3fsr---e`, `Di3mp`, `Di3mpr`, `Di3mpr---e`, `Di3ms`, `Di3ms----e`, `Di3mso---e`, `Di3msr`, `Di3msr---e`, `Ds1fp-p`, `Ds1fp-s`, `Ds1fsop`, `Ds1fsos`, `Ds1fsrp`, `Ds1fsrs`, `Ds1fsrs-y`, `Ds1mp-p`, `Ds1mp-s`, `Ds1ms-p`, `Ds1ms-s`, `Ds1msrs-y`, `Ds2---s`, `Ds2fp-p`, `Ds2fp-s`, `Ds2fsrp`, `Ds2fsrs`, `Ds2mp-p`, `Ds2mp-s`, `Ds2ms-p`, `Ds2ms-s`, `Ds3---p`, `Ds3---s`, `Ds3fp-s`, `Ds3fsos`, `Ds3fsrs`, `Ds3mp-s`, `Ds3ms-s`, `Dw3--r---e`, `Dw3-po---e`, `Dw3fpr`, `Dw3fso---e`, `Dw3fsr`, `Dw3mpr`, `Dw3mso---e`, `Dw3msr`, `Dz3fsr---e`, `Dz3mso---e`, `Dz3msr---e`, `EQUAL`, `EXCL`, `EXCLHELLIP`, `GE`, `GT`, `HELLIP`, `I`, `LCURL`, `LPAR`, `LSQR`, `LT`, `M`, `Mc`, `Mc-p-d`, `Mc-p-l`, `Mcfp-l`, `Mcfp-ln`, `Mcfprln`, `Mcfprly`, `Mcfsoln`, `Mcfsrln`, `Mcmp-l`, `Mcms-ln`, `Mcmsrl`, `Mcmsrly`, `Mffprln`, `Mffsrln`, `Mlfpo`, `Mlfpr`, `Mlmpr`, `Mo---l`, `Mo---ln`, `Mo-s-r`, `Mofp-ln`, `Mofpoly`, `Mofprly`, `Mofs-l`, `Mofsoln`, `Mofsoly`, `Mofsrln`, `Mofsrly`, `Mompoly`, `Momprly`, `Moms-l`, `Moms-ln`, `Momsoly`, `Momsrly`, `Nc`, `Nc---n`, `Ncf--n`, `Ncfp-n`, `Ncfpoy`, `Ncfpry`, `Ncfs-n`, `Ncfson`, `Ncfsoy`, `Ncfsrn`, `Ncfsry`, `Ncfsryy`, `Ncfsvy`, `Ncm--n`, `Ncmp-n`, `Ncmpoy`, `Ncmpry`, `Ncms-n`, `Ncms-ny`, `Ncms-y`, `Ncmsoy`, `Ncmsrn`, `Ncmsry`, `Ncmsryy`, `Ncmsvn`, `Ncmsvy`, `Np`, `Npfson`, `Npfsoy`, `Npfsrn`, `Npfsry`, `Npmpoy`, `Npmpry`, `Npms-n`, `Npmsoy`, `Npmsry`, `PERCENT`, `PERIOD`, `PLUS`, `PLUSMINUS`, `Pd3-po`, `Pd3fpr`, `Pd3fso`, `Pd3fsr`, `Pd3mpo`, `Pd3mpr`, `Pd3mpr--y`, `Pd3mso`, `Pd3msr`, `Pi3`, `Pi3--r`, `Pi3-po`, `Pi3-so`, `Pi3-sr`, `Pi3fpr`, `Pi3fso`, `Pi3fsr`, `Pi3mpr`, `Pi3mso`, `Pi3msr`, `Pi3msr--y`, `Pp1-pa--------w`, `Pp1-pa--y-----w`, `Pp1-pd--------s`, `Pp1-pd--------w`, `Pp1-pd--y-----w`, `Pp1-pr--------s`, `Pp1-sa--------s`, `Pp1-sa--------w`, `Pp1-sa--y-----w`, `Pp1-sd--------s`, `Pp1-sd--------w`, `Pp1-sd--y-----w`, `Pp1-sn--------s`, `Pp2-----------s`, `Pp2-pa--------w`, `Pp2-pa--y-----w`, `Pp2-pd--------w`, `Pp2-pd--y-----w`, `Pp2-pr--------s`, `Pp2-sa--------s`, `Pp2-sa--------w`, `Pp2-sa--y-----w`, `Pp2-sd--------s`, `Pp2-sd--------w`, `Pp2-sd--y-----w`, `Pp2-sn--------s`, `Pp2-so--------s`, `Pp2-sr--------s`, `Pp3-p---------s`, `Pp3-pd--------w`, `Pp3-pd--y-----w`, `Pp3-po--------s`, `Pp3-sd--------w`, `Pp3-sd--y-----w`, `Pp3fpa--------w`, `Pp3fpa--y-----w`, `Pp3fpr--------s`, `Pp3fs---------s`, `Pp3fsa--------w`, `Pp3fsa--y-----w`, `Pp3fso--------s`, `Pp3fsr--------s`, `Pp3fsr--y-----s`, `Pp3mpa--------w`, `Pp3mpa--y-----w`, `Pp3mpr--------s`, `Pp3ms---------s`, `Pp3msa--------w`, `Pp3msa--y-----w`, `Pp3mso--------s`, `Pp3msr--------s`, `Pp3msr--y-----s`, `Ps1fp-s`, `Ps1fsrp`, `Ps1fsrs`, `Ps1mp-p`, `Ps1ms-p`, `Ps2fp-s`, `Ps2fsrp`, `Ps2fsrs`, `Ps2ms-s`, `Ps3---p`, `Ps3---s`, `Ps3fp-s`, `Ps3fsrs`, `Ps3mp-s`, `Ps3ms-s`, `Pw3--r`, `Pw3-po`, `Pw3-so`, `Pw3fpr`, `Pw3fso`, `Pw3mpr`, `Pw3mso`, `Px3--a--------s`, `Px3--a--------w`, `Px3--a--y-----w`, `Px3--d--------w`, `Px3--d--y-----w`, `Pz3-sr`, `Pz3fsr`, `QUEST`, `QUOT`, `Qf`, `Qn`, `Qs`, `Qs-y`, `Qz`, `Qz-y`, `RCURL`, `RPAR`, `RSQR`, `Rc`, `Rgc`, `Rgp`, `Rgpy`, `Rgs`, `Rp`, `Rw`, `Rw-y`, `Rz`, `SCOLON`, `SLASH`, `STAR`, `Sp`, `Spsa`, `Spsay`, `Spsd`, `Spsg`, `Td-po`, `Tdfpr`, `Tdfso`, `Tdfsr`, `Tdmpr`, `Tdmso`, `Tdmsr`, `Tf-so`, `Tffpoy`, `Tffpry`, `Tffs-y`, `Tfmpoy`, `Tfms-y`, `Tfmsoy`, `Tfmsry`, `Ti-po`, `Tifp-y`, `Tifso`, `Tifsr`, `Timso`, `Timsr`, `Tsfp`, `Tsfs`, `Tsmp`, `Tsms`, `UNDERSC`, `Va--1`, `Va--1-----y`, `Va--1p`, `Va--1s`, `Va--1s----y`, `Va--2p`, `Va--2p----y`, `Va--2s`, `Va--2s----y`, `Va--3`, `Va--3-----y`, `Va--3p`, `Va--3p----y`, `Va--3s`, `Va--3s----y`, `Vag`, `Vaii1`, `Vaii2s`, `Vaii3p`, `Vaii3s`, `Vail3p`, `Vail3s`, `Vaip1p`, `Vaip1s`, `Vaip2p`, `Vaip2s`, `Vaip3p`, `Vaip3p----y`, `Vaip3s`, `Vaip3s----y`, `Vais3p`, `Vais3s`, `Vam-2s`, `Vanp`, `Vap--sm`, `Vasp1p`, `Vasp1s`, `Vasp2p`, `Vasp2s`, `Vasp3`, `Vmg`, `Vmg-------y`, `Vmii1`, `Vmii1-----y`, `Vmii2p`, `Vmii2s`, `Vmii3p`, `Vmii3p----y`, `Vmii3s`, `Vmii3s----y`, `Vmil1`, `Vmil1p`, `Vmil2s`, `Vmil3p`, `Vmil3p----y`, `Vmil3s`, `Vmil3s----y`, `Vmip1p`, `Vmip1p----y`, `Vmip1s`, `Vmip1s----y`, `Vmip2p`, `Vmip2s`, `Vmip2s----y`, `Vmip3`, `Vmip3-----y`, `Vmip3p`, `Vmip3s`, `Vmip3s----y`, `Vmis1p`, `Vmis1s`, `Vmis3p`, `Vmis3p----y`, `Vmis3s`, `Vmis3s----y`, `Vmm-2p`, `Vmm-2s`, `Vmnp`, `Vmnp------y`, `Vmp--pf`, `Vmp--pm`, `Vmp--sf`, `Vmp--sm`, `Vmp--sm---y`, `Vmsp1p`, `Vmsp1s`, `Vmsp2s`, `Vmsp3`, `Vmsp3-----y`, `X`, `Y`, `Ya`, `Yn`, `Ynfsoy`, `Ynfsry`, `Ynmsoy`, `Ynmsry`, `Yp`, `Yp-sr`, `Yr` |
84
+ | **`parser`** | `ROOT`, `acl`, `advcl`, `advcl:tcl`, `advmod`, `advmod:tmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `cc`, `cc:preconj`, `ccomp`, `ccomp:pmod`, `compound`, `conj`, `cop`, `csubj`, `csubj:pass`, `dep`, `det`, `expl`, `expl:impers`, `expl:pass`, `expl:poss`, `expl:pv`, `fixed`, `flat`, `goeswith`, `iobj`, `mark`, `nmod`, `nmod:agent`, `nmod:pmod`, `nmod:tmod`, `nsubj`, `nsubj:pass`, `nummod`, `obj`, `obl`, `orphan`, `parataxis`, `punct`, `vocative`, `xcomp` |
85
+ | **`senter`** | `I`, `S` |
86
+ | **`ner`** | `DATETIME`, `EVENT`, `FACILITY`, `GPE`, `LANGUAGE`, `LOC`, `MONEY`, `NAT_REL_POL`, `NUMERIC_VALUE`, `ORDINAL`, `ORGANIZATION`, `PERIOD`, `PERSON`, `PRODUCT`, `QUANTITY`, `WORK_OF_ART` |
87
+
88
+ </details>
89
+
90
+ ### Accuracy
91
+
92
+ | Type | Score |
93
+ | --- | --- |
94
+ | `TOKEN_ACC` | 99.90 |
95
+ | `TAG_ACC` | 96.12 |
96
+ | `POS_ACC` | 95.56 |
97
+ | `MORPH_ACC` | 96.24 |
98
+ | `LEMMA_ACC` | 81.87 |
99
+ | `DEP_UAS` | 87.06 |
100
+ | `DEP_LAS` | 81.89 |
101
+ | `SENTS_P` | 95.09 |
102
+ | `SENTS_R` | 95.35 |
103
+ | `SENTS_F` | 95.22 |
104
+ | `ENTS_P` | 72.45 |
105
+ | `ENTS_R` | 70.92 |
106
+ | `ENTS_F` | 71.68 |
accuracy.json ADDED
@@ -0,0 +1,437 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "token_acc": 0.9990029326,
3
+ "tag_acc": 0.9612171026,
4
+ "pos_acc": 0.9556105925,
5
+ "morph_acc": 0.9623560794,
6
+ "lemma_acc": 0.8186589263,
7
+ "dep_uas": 0.8705660969,
8
+ "dep_las": 0.8189173932,
9
+ "sents_p": 0.950928382,
10
+ "sents_r": 0.9534574468,
11
+ "sents_f": 0.9521912351,
12
+ "speed": 9573.3292142848,
13
+ "morph_per_feat": {
14
+ "Case": {
15
+ "p": 0.985836417,
16
+ "r": 0.9825766247,
17
+ "f": 0.9842038217
18
+ },
19
+ "Gender": {
20
+ "p": 0.9825625474,
21
+ "r": 0.9721215152,
22
+ "f": 0.9773141457
23
+ },
24
+ "Number": {
25
+ "p": 0.9856352415,
26
+ "r": 0.9757716212,
27
+ "f": 0.98067863
28
+ },
29
+ "Person": {
30
+ "p": 0.9789227166,
31
+ "r": 0.9852681202,
32
+ "f": 0.9820851689
33
+ },
34
+ "PronType": {
35
+ "p": 0.9951523546,
36
+ "r": 0.99308915,
37
+ "f": 0.9941196818
38
+ },
39
+ "Polarity": {
40
+ "p": 0.9918166939,
41
+ "r": 0.993442623,
42
+ "f": 0.9926289926
43
+ },
44
+ "AdpType": {
45
+ "p": 0.9976311337,
46
+ "r": 0.9976311337,
47
+ "f": 0.9976311337
48
+ },
49
+ "Definite": {
50
+ "p": 0.9854908915,
51
+ "r": 0.9703174603,
52
+ "f": 0.9778453171
53
+ },
54
+ "Degree": {
55
+ "p": 0.9436038514,
56
+ "r": 0.9214237743,
57
+ "f": 0.9323819232
58
+ },
59
+ "VerbForm": {
60
+ "p": 0.9632545932,
61
+ "r": 0.9760638298,
62
+ "f": 0.9696169089
63
+ },
64
+ "Abbr": {
65
+ "p": 0.9804878049,
66
+ "r": 0.8973214286,
67
+ "f": 0.9370629371
68
+ },
69
+ "Poss": {
70
+ "p": 1.0,
71
+ "r": 0.9951807229,
72
+ "f": 0.9975845411
73
+ },
74
+ "NumForm": {
75
+ "p": 0.9944134078,
76
+ "r": 0.9807162534,
77
+ "f": 0.987517337
78
+ },
79
+ "NumType": {
80
+ "p": 0.995821727,
81
+ "r": 0.9821428571,
82
+ "f": 0.9889349931
83
+ },
84
+ "Reflex": {
85
+ "p": 1.0,
86
+ "r": 0.9935897436,
87
+ "f": 0.9967845659
88
+ },
89
+ "Strength": {
90
+ "p": 0.9841269841,
91
+ "r": 0.9841269841,
92
+ "f": 0.9841269841
93
+ },
94
+ "Mood": {
95
+ "p": 0.9446428571,
96
+ "r": 0.9706422018,
97
+ "f": 0.9574660633
98
+ },
99
+ "Tense": {
100
+ "p": 0.9470672389,
101
+ "r": 0.9622093023,
102
+ "f": 0.9545782264
103
+ },
104
+ "Variant": {
105
+ "p": 0.9867549669,
106
+ "r": 0.9612903226,
107
+ "f": 0.9738562092
108
+ },
109
+ "Position": {
110
+ "p": 1.0,
111
+ "r": 0.9910714286,
112
+ "f": 0.9955156951
113
+ },
114
+ "Number[psor]": {
115
+ "p": 1.0,
116
+ "r": 0.9666666667,
117
+ "f": 0.9830508475
118
+ },
119
+ "PartType": {
120
+ "p": 1.0,
121
+ "r": 0.9459459459,
122
+ "f": 0.9722222222
123
+ },
124
+ "Foreign": {
125
+ "p": 0.0,
126
+ "r": 0.0,
127
+ "f": 0.0
128
+ }
129
+ },
130
+ "dep_las_per_type": {
131
+ "root": {
132
+ "p": 0.8519855596,
133
+ "r": 0.8888888889,
134
+ "f": 0.8700460829
135
+ },
136
+ "mark": {
137
+ "p": 0.9138576779,
138
+ "r": 0.920754717,
139
+ "f": 0.9172932331
140
+ },
141
+ "case": {
142
+ "p": 0.9520392749,
143
+ "r": 0.9452568429,
144
+ "f": 0.948635936
145
+ },
146
+ "nmod:tmod": {
147
+ "p": 0.5151515152,
148
+ "r": 0.1428571429,
149
+ "f": 0.2236842105
150
+ },
151
+ "amod": {
152
+ "p": 0.8744625967,
153
+ "r": 0.8662691652,
154
+ "f": 0.8703465982
155
+ },
156
+ "nsubj": {
157
+ "p": 0.8336106489,
158
+ "r": 0.7914691943,
159
+ "f": 0.811993517
160
+ },
161
+ "nmod": {
162
+ "p": 0.8005068791,
163
+ "r": 0.7893609425,
164
+ "f": 0.7948948409
165
+ },
166
+ "aux": {
167
+ "p": 0.9757462687,
168
+ "r": 0.9561243144,
169
+ "f": 0.9658356417
170
+ },
171
+ "advcl": {
172
+ "p": 0.5620915033,
173
+ "r": 0.6466165414,
174
+ "f": 0.6013986014
175
+ },
176
+ "obj": {
177
+ "p": 0.7931769723,
178
+ "r": 0.8591224018,
179
+ "f": 0.8248337029
180
+ },
181
+ "det": {
182
+ "p": 0.9598623853,
183
+ "r": 0.9479048698,
184
+ "f": 0.9538461538
185
+ },
186
+ "cc": {
187
+ "p": 0.9182389937,
188
+ "r": 0.9144050104,
189
+ "f": 0.9163179916
190
+ },
191
+ "conj": {
192
+ "p": 0.5696969697,
193
+ "r": 0.5446118192,
194
+ "f": 0.5568720379
195
+ },
196
+ "nummod": {
197
+ "p": 0.8888888889,
198
+ "r": 0.8571428571,
199
+ "f": 0.8727272727
200
+ },
201
+ "acl": {
202
+ "p": 0.7819767442,
203
+ "r": 0.6950904393,
204
+ "f": 0.7359781122
205
+ },
206
+ "advmod": {
207
+ "p": 0.7928388747,
208
+ "r": 0.8179419525,
209
+ "f": 0.8051948052
210
+ },
211
+ "obl": {
212
+ "p": 0.6563500534,
213
+ "r": 0.7804568528,
214
+ "f": 0.7130434783
215
+ },
216
+ "expl:pass": {
217
+ "p": 0.8372093023,
218
+ "r": 0.6666666667,
219
+ "f": 0.7422680412
220
+ },
221
+ "nsubj:pass": {
222
+ "p": 0.8582089552,
223
+ "r": 0.7012195122,
224
+ "f": 0.7718120805
225
+ },
226
+ "fixed": {
227
+ "p": 0.8381742739,
228
+ "r": 0.8541226216,
229
+ "f": 0.8460732984
230
+ },
231
+ "appos": {
232
+ "p": 0.4291666667,
233
+ "r": 0.393129771,
234
+ "f": 0.4103585657
235
+ },
236
+ "parataxis": {
237
+ "p": 0.2045454545,
238
+ "r": 0.2571428571,
239
+ "f": 0.2278481013
240
+ },
241
+ "aux:pass": {
242
+ "p": 0.9256756757,
243
+ "r": 0.9133333333,
244
+ "f": 0.9194630872
245
+ },
246
+ "nmod:agent": {
247
+ "p": 0.9462365591,
248
+ "r": 0.7927927928,
249
+ "f": 0.862745098
250
+ },
251
+ "ccomp": {
252
+ "p": 0.8770491803,
253
+ "r": 0.8294573643,
254
+ "f": 0.8525896414
255
+ },
256
+ "nmod:pmod": {
257
+ "p": 0.3555555556,
258
+ "r": 0.2666666667,
259
+ "f": 0.3047619048
260
+ },
261
+ "iobj": {
262
+ "p": 0.7619047619,
263
+ "r": 0.7901234568,
264
+ "f": 0.7757575758
265
+ },
266
+ "flat": {
267
+ "p": 0.7560321716,
268
+ "r": 0.7421052632,
269
+ "f": 0.7490039841
270
+ },
271
+ "cop": {
272
+ "p": 0.7952755906,
273
+ "r": 0.814516129,
274
+ "f": 0.8047808765
275
+ },
276
+ "csubj": {
277
+ "p": 0.7619047619,
278
+ "r": 0.7619047619,
279
+ "f": 0.7619047619
280
+ },
281
+ "dep": {
282
+ "p": 0.0,
283
+ "r": 0.0,
284
+ "f": 0.0
285
+ },
286
+ "expl:pv": {
287
+ "p": 0.7532467532,
288
+ "r": 0.8405797101,
289
+ "f": 0.7945205479
290
+ },
291
+ "expl": {
292
+ "p": 0.7037037037,
293
+ "r": 0.7037037037,
294
+ "f": 0.7037037037
295
+ },
296
+ "vocative": {
297
+ "p": 0.0,
298
+ "r": 0.0,
299
+ "f": 0.0
300
+ },
301
+ "expl:poss": {
302
+ "p": 0.9310344828,
303
+ "r": 0.8709677419,
304
+ "f": 0.9
305
+ },
306
+ "goeswith": {
307
+ "p": 0.1,
308
+ "r": 0.3333333333,
309
+ "f": 0.1538461538
310
+ },
311
+ "xcomp": {
312
+ "p": 0.325,
313
+ "r": 0.4814814815,
314
+ "f": 0.3880597015
315
+ },
316
+ "compound": {
317
+ "p": 0.3333333333,
318
+ "r": 0.5714285714,
319
+ "f": 0.4210526316
320
+ },
321
+ "ccomp:pmod": {
322
+ "p": 0.0,
323
+ "r": 0.0,
324
+ "f": 0.0
325
+ },
326
+ "orphan": {
327
+ "p": 0.0,
328
+ "r": 0.0,
329
+ "f": 0.0
330
+ },
331
+ "expl:impers": {
332
+ "p": 0.5,
333
+ "r": 0.3333333333,
334
+ "f": 0.4
335
+ },
336
+ "list": {
337
+ "p": 0.0,
338
+ "r": 0.0,
339
+ "f": 0.0
340
+ },
341
+ "cc:preconj": {
342
+ "p": 0.0,
343
+ "r": 0.0,
344
+ "f": 0.0
345
+ },
346
+ "csubj:pass": {
347
+ "p": 0.0,
348
+ "r": 0.0,
349
+ "f": 0.0
350
+ }
351
+ },
352
+ "ents_p": 0.7244897959,
353
+ "ents_r": 0.7091817134,
354
+ "ents_f": 0.7167540283,
355
+ "ents_per_type": {
356
+ "DATETIME": {
357
+ "p": 0.7889273356,
358
+ "r": 0.7944250871,
359
+ "f": 0.7916666667
360
+ },
361
+ "ORGANIZATION": {
362
+ "p": 0.6656050955,
363
+ "r": 0.6656050955,
364
+ "f": 0.6656050955
365
+ },
366
+ "FACILITY": {
367
+ "p": 0.4953271028,
368
+ "r": 0.4045801527,
369
+ "f": 0.4453781513
370
+ },
371
+ "NUMERIC_VALUE": {
372
+ "p": 0.875,
373
+ "r": 0.9194915254,
374
+ "f": 0.8966942149
375
+ },
376
+ "ORDINAL": {
377
+ "p": 0.7962962963,
378
+ "r": 0.7818181818,
379
+ "f": 0.7889908257
380
+ },
381
+ "EVENT": {
382
+ "p": 0.6551724138,
383
+ "r": 0.5135135135,
384
+ "f": 0.5757575758
385
+ },
386
+ "GPE": {
387
+ "p": 0.8025477707,
388
+ "r": 0.8689655172,
389
+ "f": 0.8344370861
390
+ },
391
+ "PERSON": {
392
+ "p": 0.675862069,
393
+ "r": 0.6577181208,
394
+ "f": 0.6666666667
395
+ },
396
+ "NAT_REL_POL": {
397
+ "p": 0.8976377953,
398
+ "r": 0.76,
399
+ "f": 0.8231046931
400
+ },
401
+ "MONEY": {
402
+ "p": 0.8867924528,
403
+ "r": 0.8103448276,
404
+ "f": 0.8468468468
405
+ },
406
+ "PRODUCT": {
407
+ "p": 0.4424778761,
408
+ "r": 0.3649635036,
409
+ "f": 0.4
410
+ },
411
+ "LOC": {
412
+ "p": 0.4461538462,
413
+ "r": 0.3815789474,
414
+ "f": 0.4113475177
415
+ },
416
+ "WORK_OF_ART": {
417
+ "p": 0.24,
418
+ "r": 0.3157894737,
419
+ "f": 0.2727272727
420
+ },
421
+ "QUANTITY": {
422
+ "p": 0.8518518519,
423
+ "r": 0.8846153846,
424
+ "f": 0.8679245283
425
+ },
426
+ "LANGUAGE": {
427
+ "p": 0.5714285714,
428
+ "r": 1.0,
429
+ "f": 0.7272727273
430
+ },
431
+ "PERIOD": {
432
+ "p": 0.8717948718,
433
+ "r": 0.8095238095,
434
+ "f": 0.8395061728
435
+ }
436
+ }
437
+ }
attribute_ruler/patterns ADDED
Binary file (49.6 kB). View file
 
config.cfg ADDED
@@ -0,0 +1,260 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [paths]
2
+ train = "corpus/ro-dep-mixed/train.spacy"
3
+ dev = "corpus/ro-dep-mixed/dev.spacy"
4
+ vectors = null
5
+ raw = null
6
+ init_tok2vec = null
7
+ vocab_data = null
8
+
9
+ [system]
10
+ gpu_allocator = null
11
+ seed = 0
12
+
13
+ [nlp]
14
+ lang = "ro"
15
+ pipeline = ["tok2vec","tagger","parser","senter","attribute_ruler","lemmatizer","ner"]
16
+ disabled = ["senter"]
17
+ before_creation = null
18
+ after_creation = null
19
+ after_pipeline_creation = null
20
+ batch_size = 256
21
+ tokenizer = {"@tokenizers":"spacy.Tokenizer.v1"}
22
+
23
+ [components]
24
+
25
+ [components.attribute_ruler]
26
+ factory = "attribute_ruler"
27
+ validate = false
28
+
29
+ [components.lemmatizer]
30
+ factory = "lemmatizer"
31
+ mode = "lookup"
32
+ model = null
33
+ overwrite = false
34
+
35
+ [components.ner]
36
+ factory = "ner"
37
+ incorrect_spans_key = null
38
+ moves = null
39
+ update_with_oracle_cut_size = 100
40
+
41
+ [components.ner.model]
42
+ @architectures = "spacy.TransitionBasedParser.v2"
43
+ state_type = "ner"
44
+ extra_state_tokens = false
45
+ hidden_width = 64
46
+ maxout_pieces = 2
47
+ use_upper = true
48
+ nO = null
49
+
50
+ [components.ner.model.tok2vec]
51
+ @architectures = "spacy.Tok2Vec.v2"
52
+
53
+ [components.ner.model.tok2vec.embed]
54
+ @architectures = "spacy.MultiHashEmbed.v2"
55
+ width = 96
56
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
57
+ rows = [5000,2500,2500,2500]
58
+ include_static_vectors = false
59
+
60
+ [components.ner.model.tok2vec.encode]
61
+ @architectures = "spacy.MaxoutWindowEncoder.v2"
62
+ width = 96
63
+ depth = 4
64
+ window_size = 1
65
+ maxout_pieces = 3
66
+
67
+ [components.parser]
68
+ factory = "parser"
69
+ learn_tokens = false
70
+ min_action_freq = 30
71
+ moves = null
72
+ update_with_oracle_cut_size = 100
73
+
74
+ [components.parser.model]
75
+ @architectures = "spacy.TransitionBasedParser.v2"
76
+ state_type = "parser"
77
+ extra_state_tokens = false
78
+ hidden_width = 64
79
+ maxout_pieces = 2
80
+ use_upper = true
81
+ nO = null
82
+
83
+ [components.parser.model.tok2vec]
84
+ @architectures = "spacy.Tok2VecListener.v1"
85
+ width = ${components.tok2vec.model.encode:width}
86
+ upstream = "tok2vec"
87
+
88
+ [components.senter]
89
+ factory = "senter"
90
+
91
+ [components.senter.model]
92
+ @architectures = "spacy.Tagger.v1"
93
+ nO = null
94
+
95
+ [components.senter.model.tok2vec]
96
+ @architectures = "spacy.Tok2Vec.v2"
97
+
98
+ [components.senter.model.tok2vec.embed]
99
+ @architectures = "spacy.MultiHashEmbed.v2"
100
+ width = 16
101
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
102
+ rows = [1000,500,500,500]
103
+ include_static_vectors = false
104
+
105
+ [components.senter.model.tok2vec.encode]
106
+ @architectures = "spacy.MaxoutWindowEncoder.v2"
107
+ width = 16
108
+ depth = 2
109
+ window_size = 1
110
+ maxout_pieces = 2
111
+
112
+ [components.tagger]
113
+ factory = "tagger"
114
+
115
+ [components.tagger.model]
116
+ @architectures = "spacy.Tagger.v1"
117
+ nO = null
118
+
119
+ [components.tagger.model.tok2vec]
120
+ @architectures = "spacy.Tok2VecListener.v1"
121
+ width = ${components.tok2vec.model.encode:width}
122
+ upstream = "tok2vec"
123
+
124
+ [components.tok2vec]
125
+ factory = "tok2vec"
126
+
127
+ [components.tok2vec.model]
128
+ @architectures = "spacy.Tok2Vec.v2"
129
+
130
+ [components.tok2vec.model.embed]
131
+ @architectures = "spacy.MultiHashEmbed.v2"
132
+ width = ${components.tok2vec.model.encode:width}
133
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
134
+ rows = [5000,2500,2500,2500]
135
+ include_static_vectors = false
136
+
137
+ [components.tok2vec.model.encode]
138
+ @architectures = "spacy.MaxoutWindowEncoder.v2"
139
+ width = 96
140
+ depth = 4
141
+ window_size = 1
142
+ maxout_pieces = 3
143
+
144
+ [corpora]
145
+
146
+ [corpora.dev]
147
+ @readers = "spacy.Corpus.v1"
148
+ limit = 0
149
+ max_length = 0
150
+ path = ${paths:dev}
151
+ gold_preproc = false
152
+ augmenter = null
153
+
154
+ [corpora.train]
155
+ @readers = "spacy.Corpus.v1"
156
+ path = ${paths:train}
157
+ max_length = 5000
158
+ gold_preproc = false
159
+ limit = 0
160
+
161
+ [corpora.train.augmenter]
162
+ @augmenters = "spacy.lower_case.v1"
163
+ level = 0.1
164
+
165
+ [training]
166
+ train_corpus = "corpora.train"
167
+ dev_corpus = "corpora.dev"
168
+ seed = ${system:seed}
169
+ gpu_allocator = ${system:gpu_allocator}
170
+ dropout = 0.1
171
+ accumulate_gradient = 1
172
+ patience = 5000
173
+ max_epochs = 0
174
+ max_steps = 0
175
+ eval_frequency = 1000
176
+ frozen_components = []
177
+ before_to_disk = null
178
+ annotating_components = []
179
+
180
+ [training.batcher]
181
+ @batchers = "spacy.batch_by_words.v1"
182
+ discard_oversize = false
183
+ tolerance = 0.2
184
+ get_length = null
185
+
186
+ [training.batcher.size]
187
+ @schedules = "compounding.v1"
188
+ start = 100
189
+ stop = 1000
190
+ compound = 1.001
191
+ t = 0.0
192
+
193
+ [training.logger]
194
+ @loggers = "spacy.WandbLogger.v1"
195
+ project_name = "spacy-v3.0.0a2"
196
+ remove_config_values = []
197
+
198
+ [training.optimizer]
199
+ @optimizers = "Adam.v1"
200
+ beta1 = 0.9
201
+ beta2 = 0.999
202
+ L2_is_weight_decay = true
203
+ L2 = 0.01
204
+ grad_clip = 1.0
205
+ use_averages = true
206
+ eps = 0.00000001
207
+ learn_rate = 0.001
208
+
209
+ [training.score_weights]
210
+ tag_acc = 0.16
211
+ dep_uas = 0.0
212
+ dep_las = 0.16
213
+ dep_las_per_type = null
214
+ sents_p = null
215
+ sents_r = null
216
+ sents_f = 0.02
217
+ lemma_acc = 0.33
218
+ ents_f = 0.33
219
+ ents_p = 0.0
220
+ ents_r = 0.0
221
+ ents_per_type = null
222
+
223
+ [pretraining]
224
+
225
+ [initialize]
226
+ vocab_data = ${paths.vocab_data}
227
+ vectors = ${paths.vectors}
228
+ init_tok2vec = ${paths.init_tok2vec}
229
+ before_init = null
230
+ after_init = null
231
+
232
+ [initialize.components]
233
+
234
+ [initialize.components.ner]
235
+
236
+ [initialize.components.ner.labels]
237
+ @readers = "spacy.read_labels.v1"
238
+ path = "corpus/labels/ner.json"
239
+ require = false
240
+
241
+ [initialize.components.parser]
242
+
243
+ [initialize.components.parser.labels]
244
+ @readers = "spacy.read_labels.v1"
245
+ path = "corpus/labels/parser.json"
246
+ require = false
247
+
248
+ [initialize.components.tagger]
249
+
250
+ [initialize.components.tagger.labels]
251
+ @readers = "spacy.read_labels.v1"
252
+ path = "corpus/labels/tagger.json"
253
+ require = false
254
+
255
+ [initialize.lookups]
256
+ @misc = "spacy.LookupsDataLoader.v1"
257
+ lang = ${nlp.lang}
258
+ tables = []
259
+
260
+ [initialize.tokenizer]
lemmatizer/lookups/lookups.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:29d980bbecacfa6599448d2fc5a0e58900ecce80f8674ac1fb8fbdfd434fea11
3
+ size 5598187
meta.json ADDED
@@ -0,0 +1,1051 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "lang":"ro",
3
+ "name":"core_news_sm",
4
+ "version":"3.1.0",
5
+ "description":"Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
6
+ "author":"Explosion",
7
+ "email":"contact@explosion.ai",
8
+ "url":"https://explosion.ai",
9
+ "license":"CC BY-SA 4.0",
10
+ "spacy_version":">=3.1.0,<3.2.0",
11
+ "spacy_git_version":"caba63b74",
12
+ "vectors":{
13
+ "width":0,
14
+ "vectors":0,
15
+ "keys":0,
16
+ "name":null
17
+ },
18
+ "labels":{
19
+ "tok2vec":[
20
+
21
+ ],
22
+ "tagger":[
23
+ "ARROW",
24
+ "Af",
25
+ "Afcfp-n",
26
+ "Afcfson",
27
+ "Afcfsrn",
28
+ "Afcmpoy",
29
+ "Afcms-n",
30
+ "Afp",
31
+ "Afp-p-n",
32
+ "Afp-poy",
33
+ "Afpf--n",
34
+ "Afpfp-n",
35
+ "Afpfp-ny",
36
+ "Afpfpoy",
37
+ "Afpfpry",
38
+ "Afpfson",
39
+ "Afpfsoy",
40
+ "Afpfsrn",
41
+ "Afpfsry",
42
+ "Afpm--n",
43
+ "Afpmp-n",
44
+ "Afpmpoy",
45
+ "Afpmpry",
46
+ "Afpms-n",
47
+ "Afpmsoy",
48
+ "Afpmsry",
49
+ "Afsfp-n",
50
+ "Afsfsrn",
51
+ "BULLET",
52
+ "COLON",
53
+ "COMMA",
54
+ "Ccssp",
55
+ "Ccsspy",
56
+ "Crssp",
57
+ "Csssp",
58
+ "Cssspy",
59
+ "DASH",
60
+ "DBLQ",
61
+ "Dd3-po---e",
62
+ "Dd3-po---o",
63
+ "Dd3fpo",
64
+ "Dd3fpr",
65
+ "Dd3fpr---e",
66
+ "Dd3fpr---o",
67
+ "Dd3fpr--y",
68
+ "Dd3fso",
69
+ "Dd3fso---e",
70
+ "Dd3fsr",
71
+ "Dd3fsr---e",
72
+ "Dd3fsr---o",
73
+ "Dd3fsr--yo",
74
+ "Dd3mpo",
75
+ "Dd3mpr",
76
+ "Dd3mpr---e",
77
+ "Dd3mpr---o",
78
+ "Dd3mso---e",
79
+ "Dd3msr",
80
+ "Dd3msr---e",
81
+ "Dd3msr---o",
82
+ "Dh1ms",
83
+ "Dh3fp",
84
+ "Dh3fso",
85
+ "Dh3fsr",
86
+ "Dh3mp",
87
+ "Dh3ms",
88
+ "Di3",
89
+ "Di3-----y",
90
+ "Di3--r---e",
91
+ "Di3-po",
92
+ "Di3-po---e",
93
+ "Di3-sr",
94
+ "Di3-sr---e",
95
+ "Di3-sr--y",
96
+ "Di3fp",
97
+ "Di3fpr",
98
+ "Di3fpr---e",
99
+ "Di3fso",
100
+ "Di3fso---e",
101
+ "Di3fsr",
102
+ "Di3fsr---e",
103
+ "Di3mp",
104
+ "Di3mpr",
105
+ "Di3mpr---e",
106
+ "Di3ms",
107
+ "Di3ms----e",
108
+ "Di3mso---e",
109
+ "Di3msr",
110
+ "Di3msr---e",
111
+ "Ds1fp-p",
112
+ "Ds1fp-s",
113
+ "Ds1fsop",
114
+ "Ds1fsos",
115
+ "Ds1fsrp",
116
+ "Ds1fsrs",
117
+ "Ds1fsrs-y",
118
+ "Ds1mp-p",
119
+ "Ds1mp-s",
120
+ "Ds1ms-p",
121
+ "Ds1ms-s",
122
+ "Ds1msrs-y",
123
+ "Ds2---s",
124
+ "Ds2fp-p",
125
+ "Ds2fp-s",
126
+ "Ds2fsrp",
127
+ "Ds2fsrs",
128
+ "Ds2mp-p",
129
+ "Ds2mp-s",
130
+ "Ds2ms-p",
131
+ "Ds2ms-s",
132
+ "Ds3---p",
133
+ "Ds3---s",
134
+ "Ds3fp-s",
135
+ "Ds3fsos",
136
+ "Ds3fsrs",
137
+ "Ds3mp-s",
138
+ "Ds3ms-s",
139
+ "Dw3--r---e",
140
+ "Dw3-po---e",
141
+ "Dw3fpr",
142
+ "Dw3fso---e",
143
+ "Dw3fsr",
144
+ "Dw3mpr",
145
+ "Dw3mso---e",
146
+ "Dw3msr",
147
+ "Dz3fsr---e",
148
+ "Dz3mso---e",
149
+ "Dz3msr---e",
150
+ "EQUAL",
151
+ "EXCL",
152
+ "EXCLHELLIP",
153
+ "GE",
154
+ "GT",
155
+ "HELLIP",
156
+ "I",
157
+ "LCURL",
158
+ "LPAR",
159
+ "LSQR",
160
+ "LT",
161
+ "M",
162
+ "Mc",
163
+ "Mc-p-d",
164
+ "Mc-p-l",
165
+ "Mcfp-l",
166
+ "Mcfp-ln",
167
+ "Mcfprln",
168
+ "Mcfprly",
169
+ "Mcfsoln",
170
+ "Mcfsrln",
171
+ "Mcmp-l",
172
+ "Mcms-ln",
173
+ "Mcmsrl",
174
+ "Mcmsrly",
175
+ "Mffprln",
176
+ "Mffsrln",
177
+ "Mlfpo",
178
+ "Mlfpr",
179
+ "Mlmpr",
180
+ "Mo---l",
181
+ "Mo---ln",
182
+ "Mo-s-r",
183
+ "Mofp-ln",
184
+ "Mofpoly",
185
+ "Mofprly",
186
+ "Mofs-l",
187
+ "Mofsoln",
188
+ "Mofsoly",
189
+ "Mofsrln",
190
+ "Mofsrly",
191
+ "Mompoly",
192
+ "Momprly",
193
+ "Moms-l",
194
+ "Moms-ln",
195
+ "Momsoly",
196
+ "Momsrly",
197
+ "Nc",
198
+ "Nc---n",
199
+ "Ncf--n",
200
+ "Ncfp-n",
201
+ "Ncfpoy",
202
+ "Ncfpry",
203
+ "Ncfs-n",
204
+ "Ncfson",
205
+ "Ncfsoy",
206
+ "Ncfsrn",
207
+ "Ncfsry",
208
+ "Ncfsryy",
209
+ "Ncfsvy",
210
+ "Ncm--n",
211
+ "Ncmp-n",
212
+ "Ncmpoy",
213
+ "Ncmpry",
214
+ "Ncms-n",
215
+ "Ncms-ny",
216
+ "Ncms-y",
217
+ "Ncmsoy",
218
+ "Ncmsrn",
219
+ "Ncmsry",
220
+ "Ncmsryy",
221
+ "Ncmsvn",
222
+ "Ncmsvy",
223
+ "Np",
224
+ "Npfson",
225
+ "Npfsoy",
226
+ "Npfsrn",
227
+ "Npfsry",
228
+ "Npmpoy",
229
+ "Npmpry",
230
+ "Npms-n",
231
+ "Npmsoy",
232
+ "Npmsry",
233
+ "PERCENT",
234
+ "PERIOD",
235
+ "PLUS",
236
+ "PLUSMINUS",
237
+ "Pd3-po",
238
+ "Pd3fpr",
239
+ "Pd3fso",
240
+ "Pd3fsr",
241
+ "Pd3mpo",
242
+ "Pd3mpr",
243
+ "Pd3mpr--y",
244
+ "Pd3mso",
245
+ "Pd3msr",
246
+ "Pi3",
247
+ "Pi3--r",
248
+ "Pi3-po",
249
+ "Pi3-so",
250
+ "Pi3-sr",
251
+ "Pi3fpr",
252
+ "Pi3fso",
253
+ "Pi3fsr",
254
+ "Pi3mpr",
255
+ "Pi3mso",
256
+ "Pi3msr",
257
+ "Pi3msr--y",
258
+ "Pp1-pa--------w",
259
+ "Pp1-pa--y-----w",
260
+ "Pp1-pd--------s",
261
+ "Pp1-pd--------w",
262
+ "Pp1-pd--y-----w",
263
+ "Pp1-pr--------s",
264
+ "Pp1-sa--------s",
265
+ "Pp1-sa--------w",
266
+ "Pp1-sa--y-----w",
267
+ "Pp1-sd--------s",
268
+ "Pp1-sd--------w",
269
+ "Pp1-sd--y-----w",
270
+ "Pp1-sn--------s",
271
+ "Pp2-----------s",
272
+ "Pp2-pa--------w",
273
+ "Pp2-pa--y-----w",
274
+ "Pp2-pd--------w",
275
+ "Pp2-pd--y-----w",
276
+ "Pp2-pr--------s",
277
+ "Pp2-sa--------s",
278
+ "Pp2-sa--------w",
279
+ "Pp2-sa--y-----w",
280
+ "Pp2-sd--------s",
281
+ "Pp2-sd--------w",
282
+ "Pp2-sd--y-----w",
283
+ "Pp2-sn--------s",
284
+ "Pp2-so--------s",
285
+ "Pp2-sr--------s",
286
+ "Pp3-p---------s",
287
+ "Pp3-pd--------w",
288
+ "Pp3-pd--y-----w",
289
+ "Pp3-po--------s",
290
+ "Pp3-sd--------w",
291
+ "Pp3-sd--y-----w",
292
+ "Pp3fpa--------w",
293
+ "Pp3fpa--y-----w",
294
+ "Pp3fpr--------s",
295
+ "Pp3fs---------s",
296
+ "Pp3fsa--------w",
297
+ "Pp3fsa--y-----w",
298
+ "Pp3fso--------s",
299
+ "Pp3fsr--------s",
300
+ "Pp3fsr--y-----s",
301
+ "Pp3mpa--------w",
302
+ "Pp3mpa--y-----w",
303
+ "Pp3mpr--------s",
304
+ "Pp3ms---------s",
305
+ "Pp3msa--------w",
306
+ "Pp3msa--y-----w",
307
+ "Pp3mso--------s",
308
+ "Pp3msr--------s",
309
+ "Pp3msr--y-----s",
310
+ "Ps1fp-s",
311
+ "Ps1fsrp",
312
+ "Ps1fsrs",
313
+ "Ps1mp-p",
314
+ "Ps1ms-p",
315
+ "Ps2fp-s",
316
+ "Ps2fsrp",
317
+ "Ps2fsrs",
318
+ "Ps2ms-s",
319
+ "Ps3---p",
320
+ "Ps3---s",
321
+ "Ps3fp-s",
322
+ "Ps3fsrs",
323
+ "Ps3mp-s",
324
+ "Ps3ms-s",
325
+ "Pw3--r",
326
+ "Pw3-po",
327
+ "Pw3-so",
328
+ "Pw3fpr",
329
+ "Pw3fso",
330
+ "Pw3mpr",
331
+ "Pw3mso",
332
+ "Px3--a--------s",
333
+ "Px3--a--------w",
334
+ "Px3--a--y-----w",
335
+ "Px3--d--------w",
336
+ "Px3--d--y-----w",
337
+ "Pz3-sr",
338
+ "Pz3fsr",
339
+ "QUEST",
340
+ "QUOT",
341
+ "Qf",
342
+ "Qn",
343
+ "Qs",
344
+ "Qs-y",
345
+ "Qz",
346
+ "Qz-y",
347
+ "RCURL",
348
+ "RPAR",
349
+ "RSQR",
350
+ "Rc",
351
+ "Rgc",
352
+ "Rgp",
353
+ "Rgpy",
354
+ "Rgs",
355
+ "Rp",
356
+ "Rw",
357
+ "Rw-y",
358
+ "Rz",
359
+ "SCOLON",
360
+ "SLASH",
361
+ "STAR",
362
+ "Sp",
363
+ "Spsa",
364
+ "Spsay",
365
+ "Spsd",
366
+ "Spsg",
367
+ "Td-po",
368
+ "Tdfpr",
369
+ "Tdfso",
370
+ "Tdfsr",
371
+ "Tdmpr",
372
+ "Tdmso",
373
+ "Tdmsr",
374
+ "Tf-so",
375
+ "Tffpoy",
376
+ "Tffpry",
377
+ "Tffs-y",
378
+ "Tfmpoy",
379
+ "Tfms-y",
380
+ "Tfmsoy",
381
+ "Tfmsry",
382
+ "Ti-po",
383
+ "Tifp-y",
384
+ "Tifso",
385
+ "Tifsr",
386
+ "Timso",
387
+ "Timsr",
388
+ "Tsfp",
389
+ "Tsfs",
390
+ "Tsmp",
391
+ "Tsms",
392
+ "UNDERSC",
393
+ "Va--1",
394
+ "Va--1-----y",
395
+ "Va--1p",
396
+ "Va--1s",
397
+ "Va--1s----y",
398
+ "Va--2p",
399
+ "Va--2p----y",
400
+ "Va--2s",
401
+ "Va--2s----y",
402
+ "Va--3",
403
+ "Va--3-----y",
404
+ "Va--3p",
405
+ "Va--3p----y",
406
+ "Va--3s",
407
+ "Va--3s----y",
408
+ "Vag",
409
+ "Vaii1",
410
+ "Vaii2s",
411
+ "Vaii3p",
412
+ "Vaii3s",
413
+ "Vail3p",
414
+ "Vail3s",
415
+ "Vaip1p",
416
+ "Vaip1s",
417
+ "Vaip2p",
418
+ "Vaip2s",
419
+ "Vaip3p",
420
+ "Vaip3p----y",
421
+ "Vaip3s",
422
+ "Vaip3s----y",
423
+ "Vais3p",
424
+ "Vais3s",
425
+ "Vam-2s",
426
+ "Vanp",
427
+ "Vap--sm",
428
+ "Vasp1p",
429
+ "Vasp1s",
430
+ "Vasp2p",
431
+ "Vasp2s",
432
+ "Vasp3",
433
+ "Vmg",
434
+ "Vmg-------y",
435
+ "Vmii1",
436
+ "Vmii1-----y",
437
+ "Vmii2p",
438
+ "Vmii2s",
439
+ "Vmii3p",
440
+ "Vmii3p----y",
441
+ "Vmii3s",
442
+ "Vmii3s----y",
443
+ "Vmil1",
444
+ "Vmil1p",
445
+ "Vmil2s",
446
+ "Vmil3p",
447
+ "Vmil3p----y",
448
+ "Vmil3s",
449
+ "Vmil3s----y",
450
+ "Vmip1p",
451
+ "Vmip1p----y",
452
+ "Vmip1s",
453
+ "Vmip1s----y",
454
+ "Vmip2p",
455
+ "Vmip2s",
456
+ "Vmip2s----y",
457
+ "Vmip3",
458
+ "Vmip3-----y",
459
+ "Vmip3p",
460
+ "Vmip3s",
461
+ "Vmip3s----y",
462
+ "Vmis1p",
463
+ "Vmis1s",
464
+ "Vmis3p",
465
+ "Vmis3p----y",
466
+ "Vmis3s",
467
+ "Vmis3s----y",
468
+ "Vmm-2p",
469
+ "Vmm-2s",
470
+ "Vmnp",
471
+ "Vmnp------y",
472
+ "Vmp--pf",
473
+ "Vmp--pm",
474
+ "Vmp--sf",
475
+ "Vmp--sm",
476
+ "Vmp--sm---y",
477
+ "Vmsp1p",
478
+ "Vmsp1s",
479
+ "Vmsp2s",
480
+ "Vmsp3",
481
+ "Vmsp3-----y",
482
+ "X",
483
+ "Y",
484
+ "Ya",
485
+ "Yn",
486
+ "Ynfsoy",
487
+ "Ynfsry",
488
+ "Ynmsoy",
489
+ "Ynmsry",
490
+ "Yp",
491
+ "Yp-sr",
492
+ "Yr"
493
+ ],
494
+ "parser":[
495
+ "ROOT",
496
+ "acl",
497
+ "advcl",
498
+ "advcl:tcl",
499
+ "advmod",
500
+ "advmod:tmod",
501
+ "amod",
502
+ "appos",
503
+ "aux",
504
+ "aux:pass",
505
+ "case",
506
+ "cc",
507
+ "cc:preconj",
508
+ "ccomp",
509
+ "ccomp:pmod",
510
+ "compound",
511
+ "conj",
512
+ "cop",
513
+ "csubj",
514
+ "csubj:pass",
515
+ "dep",
516
+ "det",
517
+ "expl",
518
+ "expl:impers",
519
+ "expl:pass",
520
+ "expl:poss",
521
+ "expl:pv",
522
+ "fixed",
523
+ "flat",
524
+ "goeswith",
525
+ "iobj",
526
+ "mark",
527
+ "nmod",
528
+ "nmod:agent",
529
+ "nmod:pmod",
530
+ "nmod:tmod",
531
+ "nsubj",
532
+ "nsubj:pass",
533
+ "nummod",
534
+ "obj",
535
+ "obl",
536
+ "orphan",
537
+ "parataxis",
538
+ "punct",
539
+ "vocative",
540
+ "xcomp"
541
+ ],
542
+ "senter":[
543
+ "I",
544
+ "S"
545
+ ],
546
+ "attribute_ruler":[
547
+
548
+ ],
549
+ "lemmatizer":[
550
+
551
+ ],
552
+ "ner":[
553
+ "DATETIME",
554
+ "EVENT",
555
+ "FACILITY",
556
+ "GPE",
557
+ "LANGUAGE",
558
+ "LOC",
559
+ "MONEY",
560
+ "NAT_REL_POL",
561
+ "NUMERIC_VALUE",
562
+ "ORDINAL",
563
+ "ORGANIZATION",
564
+ "PERIOD",
565
+ "PERSON",
566
+ "PRODUCT",
567
+ "QUANTITY",
568
+ "WORK_OF_ART"
569
+ ]
570
+ },
571
+ "pipeline":[
572
+ "tok2vec",
573
+ "tagger",
574
+ "parser",
575
+ "attribute_ruler",
576
+ "lemmatizer",
577
+ "ner"
578
+ ],
579
+ "components":[
580
+ "tok2vec",
581
+ "tagger",
582
+ "parser",
583
+ "senter",
584
+ "attribute_ruler",
585
+ "lemmatizer",
586
+ "ner"
587
+ ],
588
+ "disabled":[
589
+ "senter"
590
+ ],
591
+ "performance":{
592
+ "token_acc":0.9990029326,
593
+ "tag_acc":0.9612171026,
594
+ "pos_acc":0.9556105925,
595
+ "morph_acc":0.9623560794,
596
+ "lemma_acc":0.8186589263,
597
+ "dep_uas":0.8705660969,
598
+ "dep_las":0.8189173932,
599
+ "sents_p":0.950928382,
600
+ "sents_r":0.9534574468,
601
+ "sents_f":0.9521912351,
602
+ "speed":9573.3292142848,
603
+ "morph_per_feat":{
604
+ "Case":{
605
+ "p":0.985836417,
606
+ "r":0.9825766247,
607
+ "f":0.9842038217
608
+ },
609
+ "Gender":{
610
+ "p":0.9825625474,
611
+ "r":0.9721215152,
612
+ "f":0.9773141457
613
+ },
614
+ "Number":{
615
+ "p":0.9856352415,
616
+ "r":0.9757716212,
617
+ "f":0.98067863
618
+ },
619
+ "Person":{
620
+ "p":0.9789227166,
621
+ "r":0.9852681202,
622
+ "f":0.9820851689
623
+ },
624
+ "PronType":{
625
+ "p":0.9951523546,
626
+ "r":0.99308915,
627
+ "f":0.9941196818
628
+ },
629
+ "Polarity":{
630
+ "p":0.9918166939,
631
+ "r":0.993442623,
632
+ "f":0.9926289926
633
+ },
634
+ "AdpType":{
635
+ "p":0.9976311337,
636
+ "r":0.9976311337,
637
+ "f":0.9976311337
638
+ },
639
+ "Definite":{
640
+ "p":0.9854908915,
641
+ "r":0.9703174603,
642
+ "f":0.9778453171
643
+ },
644
+ "Degree":{
645
+ "p":0.9436038514,
646
+ "r":0.9214237743,
647
+ "f":0.9323819232
648
+ },
649
+ "VerbForm":{
650
+ "p":0.9632545932,
651
+ "r":0.9760638298,
652
+ "f":0.9696169089
653
+ },
654
+ "Abbr":{
655
+ "p":0.9804878049,
656
+ "r":0.8973214286,
657
+ "f":0.9370629371
658
+ },
659
+ "Poss":{
660
+ "p":1.0,
661
+ "r":0.9951807229,
662
+ "f":0.9975845411
663
+ },
664
+ "NumForm":{
665
+ "p":0.9944134078,
666
+ "r":0.9807162534,
667
+ "f":0.987517337
668
+ },
669
+ "NumType":{
670
+ "p":0.995821727,
671
+ "r":0.9821428571,
672
+ "f":0.9889349931
673
+ },
674
+ "Reflex":{
675
+ "p":1.0,
676
+ "r":0.9935897436,
677
+ "f":0.9967845659
678
+ },
679
+ "Strength":{
680
+ "p":0.9841269841,
681
+ "r":0.9841269841,
682
+ "f":0.9841269841
683
+ },
684
+ "Mood":{
685
+ "p":0.9446428571,
686
+ "r":0.9706422018,
687
+ "f":0.9574660633
688
+ },
689
+ "Tense":{
690
+ "p":0.9470672389,
691
+ "r":0.9622093023,
692
+ "f":0.9545782264
693
+ },
694
+ "Variant":{
695
+ "p":0.9867549669,
696
+ "r":0.9612903226,
697
+ "f":0.9738562092
698
+ },
699
+ "Position":{
700
+ "p":1.0,
701
+ "r":0.9910714286,
702
+ "f":0.9955156951
703
+ },
704
+ "Number[psor]":{
705
+ "p":1.0,
706
+ "r":0.9666666667,
707
+ "f":0.9830508475
708
+ },
709
+ "PartType":{
710
+ "p":1.0,
711
+ "r":0.9459459459,
712
+ "f":0.9722222222
713
+ },
714
+ "Foreign":{
715
+ "p":0.0,
716
+ "r":0.0,
717
+ "f":0.0
718
+ }
719
+ },
720
+ "dep_las_per_type":{
721
+ "root":{
722
+ "p":0.8519855596,
723
+ "r":0.8888888889,
724
+ "f":0.8700460829
725
+ },
726
+ "mark":{
727
+ "p":0.9138576779,
728
+ "r":0.920754717,
729
+ "f":0.9172932331
730
+ },
731
+ "case":{
732
+ "p":0.9520392749,
733
+ "r":0.9452568429,
734
+ "f":0.948635936
735
+ },
736
+ "nmod:tmod":{
737
+ "p":0.5151515152,
738
+ "r":0.1428571429,
739
+ "f":0.2236842105
740
+ },
741
+ "amod":{
742
+ "p":0.8744625967,
743
+ "r":0.8662691652,
744
+ "f":0.8703465982
745
+ },
746
+ "nsubj":{
747
+ "p":0.8336106489,
748
+ "r":0.7914691943,
749
+ "f":0.811993517
750
+ },
751
+ "nmod":{
752
+ "p":0.8005068791,
753
+ "r":0.7893609425,
754
+ "f":0.7948948409
755
+ },
756
+ "aux":{
757
+ "p":0.9757462687,
758
+ "r":0.9561243144,
759
+ "f":0.9658356417
760
+ },
761
+ "advcl":{
762
+ "p":0.5620915033,
763
+ "r":0.6466165414,
764
+ "f":0.6013986014
765
+ },
766
+ "obj":{
767
+ "p":0.7931769723,
768
+ "r":0.8591224018,
769
+ "f":0.8248337029
770
+ },
771
+ "det":{
772
+ "p":0.9598623853,
773
+ "r":0.9479048698,
774
+ "f":0.9538461538
775
+ },
776
+ "cc":{
777
+ "p":0.9182389937,
778
+ "r":0.9144050104,
779
+ "f":0.9163179916
780
+ },
781
+ "conj":{
782
+ "p":0.5696969697,
783
+ "r":0.5446118192,
784
+ "f":0.5568720379
785
+ },
786
+ "nummod":{
787
+ "p":0.8888888889,
788
+ "r":0.8571428571,
789
+ "f":0.8727272727
790
+ },
791
+ "acl":{
792
+ "p":0.7819767442,
793
+ "r":0.6950904393,
794
+ "f":0.7359781122
795
+ },
796
+ "advmod":{
797
+ "p":0.7928388747,
798
+ "r":0.8179419525,
799
+ "f":0.8051948052
800
+ },
801
+ "obl":{
802
+ "p":0.6563500534,
803
+ "r":0.7804568528,
804
+ "f":0.7130434783
805
+ },
806
+ "expl:pass":{
807
+ "p":0.8372093023,
808
+ "r":0.6666666667,
809
+ "f":0.7422680412
810
+ },
811
+ "nsubj:pass":{
812
+ "p":0.8582089552,
813
+ "r":0.7012195122,
814
+ "f":0.7718120805
815
+ },
816
+ "fixed":{
817
+ "p":0.8381742739,
818
+ "r":0.8541226216,
819
+ "f":0.8460732984
820
+ },
821
+ "appos":{
822
+ "p":0.4291666667,
823
+ "r":0.393129771,
824
+ "f":0.4103585657
825
+ },
826
+ "parataxis":{
827
+ "p":0.2045454545,
828
+ "r":0.2571428571,
829
+ "f":0.2278481013
830
+ },
831
+ "aux:pass":{
832
+ "p":0.9256756757,
833
+ "r":0.9133333333,
834
+ "f":0.9194630872
835
+ },
836
+ "nmod:agent":{
837
+ "p":0.9462365591,
838
+ "r":0.7927927928,
839
+ "f":0.862745098
840
+ },
841
+ "ccomp":{
842
+ "p":0.8770491803,
843
+ "r":0.8294573643,
844
+ "f":0.8525896414
845
+ },
846
+ "nmod:pmod":{
847
+ "p":0.3555555556,
848
+ "r":0.2666666667,
849
+ "f":0.3047619048
850
+ },
851
+ "iobj":{
852
+ "p":0.7619047619,
853
+ "r":0.7901234568,
854
+ "f":0.7757575758
855
+ },
856
+ "flat":{
857
+ "p":0.7560321716,
858
+ "r":0.7421052632,
859
+ "f":0.7490039841
860
+ },
861
+ "cop":{
862
+ "p":0.7952755906,
863
+ "r":0.814516129,
864
+ "f":0.8047808765
865
+ },
866
+ "csubj":{
867
+ "p":0.7619047619,
868
+ "r":0.7619047619,
869
+ "f":0.7619047619
870
+ },
871
+ "dep":{
872
+ "p":0.0,
873
+ "r":0.0,
874
+ "f":0.0
875
+ },
876
+ "expl:pv":{
877
+ "p":0.7532467532,
878
+ "r":0.8405797101,
879
+ "f":0.7945205479
880
+ },
881
+ "expl":{
882
+ "p":0.7037037037,
883
+ "r":0.7037037037,
884
+ "f":0.7037037037
885
+ },
886
+ "vocative":{
887
+ "p":0.0,
888
+ "r":0.0,
889
+ "f":0.0
890
+ },
891
+ "expl:poss":{
892
+ "p":0.9310344828,
893
+ "r":0.8709677419,
894
+ "f":0.9
895
+ },
896
+ "goeswith":{
897
+ "p":0.1,
898
+ "r":0.3333333333,
899
+ "f":0.1538461538
900
+ },
901
+ "xcomp":{
902
+ "p":0.325,
903
+ "r":0.4814814815,
904
+ "f":0.3880597015
905
+ },
906
+ "compound":{
907
+ "p":0.3333333333,
908
+ "r":0.5714285714,
909
+ "f":0.4210526316
910
+ },
911
+ "ccomp:pmod":{
912
+ "p":0.0,
913
+ "r":0.0,
914
+ "f":0.0
915
+ },
916
+ "orphan":{
917
+ "p":0.0,
918
+ "r":0.0,
919
+ "f":0.0
920
+ },
921
+ "expl:impers":{
922
+ "p":0.5,
923
+ "r":0.3333333333,
924
+ "f":0.4
925
+ },
926
+ "list":{
927
+ "p":0.0,
928
+ "r":0.0,
929
+ "f":0.0
930
+ },
931
+ "cc:preconj":{
932
+ "p":0.0,
933
+ "r":0.0,
934
+ "f":0.0
935
+ },
936
+ "csubj:pass":{
937
+ "p":0.0,
938
+ "r":0.0,
939
+ "f":0.0
940
+ }
941
+ },
942
+ "ents_p":0.7244897959,
943
+ "ents_r":0.7091817134,
944
+ "ents_f":0.7167540283,
945
+ "ents_per_type":{
946
+ "DATETIME":{
947
+ "p":0.7889273356,
948
+ "r":0.7944250871,
949
+ "f":0.7916666667
950
+ },
951
+ "ORGANIZATION":{
952
+ "p":0.6656050955,
953
+ "r":0.6656050955,
954
+ "f":0.6656050955
955
+ },
956
+ "FACILITY":{
957
+ "p":0.4953271028,
958
+ "r":0.4045801527,
959
+ "f":0.4453781513
960
+ },
961
+ "NUMERIC_VALUE":{
962
+ "p":0.875,
963
+ "r":0.9194915254,
964
+ "f":0.8966942149
965
+ },
966
+ "ORDINAL":{
967
+ "p":0.7962962963,
968
+ "r":0.7818181818,
969
+ "f":0.7889908257
970
+ },
971
+ "EVENT":{
972
+ "p":0.6551724138,
973
+ "r":0.5135135135,
974
+ "f":0.5757575758
975
+ },
976
+ "GPE":{
977
+ "p":0.8025477707,
978
+ "r":0.8689655172,
979
+ "f":0.8344370861
980
+ },
981
+ "PERSON":{
982
+ "p":0.675862069,
983
+ "r":0.6577181208,
984
+ "f":0.6666666667
985
+ },
986
+ "NAT_REL_POL":{
987
+ "p":0.8976377953,
988
+ "r":0.76,
989
+ "f":0.8231046931
990
+ },
991
+ "MONEY":{
992
+ "p":0.8867924528,
993
+ "r":0.8103448276,
994
+ "f":0.8468468468
995
+ },
996
+ "PRODUCT":{
997
+ "p":0.4424778761,
998
+ "r":0.3649635036,
999
+ "f":0.4
1000
+ },
1001
+ "LOC":{
1002
+ "p":0.4461538462,
1003
+ "r":0.3815789474,
1004
+ "f":0.4113475177
1005
+ },
1006
+ "WORK_OF_ART":{
1007
+ "p":0.24,
1008
+ "r":0.3157894737,
1009
+ "f":0.2727272727
1010
+ },
1011
+ "QUANTITY":{
1012
+ "p":0.8518518519,
1013
+ "r":0.8846153846,
1014
+ "f":0.8679245283
1015
+ },
1016
+ "LANGUAGE":{
1017
+ "p":0.5714285714,
1018
+ "r":1.0,
1019
+ "f":0.7272727273
1020
+ },
1021
+ "PERIOD":{
1022
+ "p":0.8717948718,
1023
+ "r":0.8095238095,
1024
+ "f":0.8395061728
1025
+ }
1026
+ }
1027
+ },
1028
+ "sources":[
1029
+ {
1030
+ "name":"Lemmatization Lists",
1031
+ "url":"https://github.com/michmech/lemmatization-lists/",
1032
+ "license":"ODbL",
1033
+ "author":"Michal M\u011bchura"
1034
+ },
1035
+ {
1036
+ "name":"UD Romanian RRT v2.5",
1037
+ "url":"https://github.com/UniversalDependencies/UD_Romanian-RRT",
1038
+ "license":"CC BY-SA 4.0",
1039
+ "author":"Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin"
1040
+ },
1041
+ {
1042
+ "name":"RONEC - the Romanian Named Entity Corpus (ca9ce460)",
1043
+ "url":"https://github.com/dumitrescustefan/ronec",
1044
+ "license":"MIT",
1045
+ "author":"Dumitrescu, Stefan Daniel; Avram, Andrei-Marius; Morogan, Luciana; Toma; Stefan"
1046
+ }
1047
+ ],
1048
+ "requirements":[
1049
+
1050
+ ]
1051
+ }
ner/cfg ADDED
@@ -0,0 +1,13 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "moves":null,
3
+ "update_with_oracle_cut_size":100,
4
+ "multitasks":[
5
+
6
+ ],
7
+ "min_action_freq":1,
8
+ "learn_tokens":false,
9
+ "beam_width":1,
10
+ "beam_density":0.0,
11
+ "beam_update_prob":0.0,
12
+ "incorrect_spans_key":null
13
+ }
ner/model ADDED
Binary file (6.73 MB). View file
 
ner/moves ADDED
Binary file (1.05 kB). View file
 
parser/cfg ADDED
@@ -0,0 +1,13 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "moves":null,
3
+ "update_with_oracle_cut_size":100,
4
+ "multitasks":[
5
+
6
+ ],
7
+ "min_action_freq":30,
8
+ "learn_tokens":false,
9
+ "beam_width":1,
10
+ "beam_density":0.0,
11
+ "beam_update_prob":0.0,
12
+ "incorrect_spans_key":null
13
+ }
parser/model ADDED
Binary file (312 kB). View file
 
parser/moves ADDED
@@ -0,0 +1 @@
 
 
1
+ ��moves�{"0":{"":85972},"1":{"":90580},"2":{"case":22318,"punct":9077,"det":9009,"nsubj":7125,"advmod":6350,"cc":5364,"mark":5291,"aux":4018,"obl":2015,"nummod":1880,"expl:pv":1798,"cop":1706,"amod":1376,"aux:pass":1369,"nsubj:pass":963,"expl:pass":909,"parataxis":877,"obj":866,"advcl":710,"iobj":567,"expl:poss":464,"expl":390,"nmod":204,"nsubj||csubj":154,"nmod:tmod":152,"expl:impers":102,"xcomp":97,"advmod:tmod":85,"nmod:pmod":74,"cc:preconj":63,"csubj":58,"nsubj:pass||csubj":57,"obj||ccomp":44,"orphan":32,"advcl:tcl":30,"dep":0},"3":{"nmod":16696,"punct":14423,"amod":9673,"obl":7745,"conj":7281,"fixed":5595,"obj":5457,"acl":4102,"advmod":2145,"advcl":2043,"ccomp":1929,"nummod":1646,"nsubj":1278,"nmod:pmod":1208,"flat":1160,"det":1031,"appos":915,"xcomp":886,"iobj":804,"nmod:agent":718,"csubj":626,"nsubj:pass":546,"case":442,"parataxis":426,"nmod:tmod":286,"goeswith":245,"ccomp:pmod":174,"cc":124,"cop":100,"expl:pv":86,"expl":55,"advcl:tcl":52,"compound":50,"csubj:pass":49,"expl:poss":36,"vocative":31,"dep":0},"4":{"ROOT":8021}}�cfg��neg_key�
ro_core_news_sm-any-py3-none-any.whl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:29fcd7c11fa94cfdb875cd8f250c87cb7c720bdc94a02fcedd734ede1a2ee408
3
+ size 16363069
senter/cfg ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ {
2
+
3
+ }
senter/model ADDED
Binary file (190 kB). View file
 
tagger/cfg ADDED
@@ -0,0 +1,474 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "labels":[
3
+ "ARROW",
4
+ "Af",
5
+ "Afcfp-n",
6
+ "Afcfson",
7
+ "Afcfsrn",
8
+ "Afcmpoy",
9
+ "Afcms-n",
10
+ "Afp",
11
+ "Afp-p-n",
12
+ "Afp-poy",
13
+ "Afpf--n",
14
+ "Afpfp-n",
15
+ "Afpfp-ny",
16
+ "Afpfpoy",
17
+ "Afpfpry",
18
+ "Afpfson",
19
+ "Afpfsoy",
20
+ "Afpfsrn",
21
+ "Afpfsry",
22
+ "Afpm--n",
23
+ "Afpmp-n",
24
+ "Afpmpoy",
25
+ "Afpmpry",
26
+ "Afpms-n",
27
+ "Afpmsoy",
28
+ "Afpmsry",
29
+ "Afsfp-n",
30
+ "Afsfsrn",
31
+ "BULLET",
32
+ "COLON",
33
+ "COMMA",
34
+ "Ccssp",
35
+ "Ccsspy",
36
+ "Crssp",
37
+ "Csssp",
38
+ "Cssspy",
39
+ "DASH",
40
+ "DBLQ",
41
+ "Dd3-po---e",
42
+ "Dd3-po---o",
43
+ "Dd3fpo",
44
+ "Dd3fpr",
45
+ "Dd3fpr---e",
46
+ "Dd3fpr---o",
47
+ "Dd3fpr--y",
48
+ "Dd3fso",
49
+ "Dd3fso---e",
50
+ "Dd3fsr",
51
+ "Dd3fsr---e",
52
+ "Dd3fsr---o",
53
+ "Dd3fsr--yo",
54
+ "Dd3mpo",
55
+ "Dd3mpr",
56
+ "Dd3mpr---e",
57
+ "Dd3mpr---o",
58
+ "Dd3mso---e",
59
+ "Dd3msr",
60
+ "Dd3msr---e",
61
+ "Dd3msr---o",
62
+ "Dh1ms",
63
+ "Dh3fp",
64
+ "Dh3fso",
65
+ "Dh3fsr",
66
+ "Dh3mp",
67
+ "Dh3ms",
68
+ "Di3",
69
+ "Di3-----y",
70
+ "Di3--r---e",
71
+ "Di3-po",
72
+ "Di3-po---e",
73
+ "Di3-sr",
74
+ "Di3-sr---e",
75
+ "Di3-sr--y",
76
+ "Di3fp",
77
+ "Di3fpr",
78
+ "Di3fpr---e",
79
+ "Di3fso",
80
+ "Di3fso---e",
81
+ "Di3fsr",
82
+ "Di3fsr---e",
83
+ "Di3mp",
84
+ "Di3mpr",
85
+ "Di3mpr---e",
86
+ "Di3ms",
87
+ "Di3ms----e",
88
+ "Di3mso---e",
89
+ "Di3msr",
90
+ "Di3msr---e",
91
+ "Ds1fp-p",
92
+ "Ds1fp-s",
93
+ "Ds1fsop",
94
+ "Ds1fsos",
95
+ "Ds1fsrp",
96
+ "Ds1fsrs",
97
+ "Ds1fsrs-y",
98
+ "Ds1mp-p",
99
+ "Ds1mp-s",
100
+ "Ds1ms-p",
101
+ "Ds1ms-s",
102
+ "Ds1msrs-y",
103
+ "Ds2---s",
104
+ "Ds2fp-p",
105
+ "Ds2fp-s",
106
+ "Ds2fsrp",
107
+ "Ds2fsrs",
108
+ "Ds2mp-p",
109
+ "Ds2mp-s",
110
+ "Ds2ms-p",
111
+ "Ds2ms-s",
112
+ "Ds3---p",
113
+ "Ds3---s",
114
+ "Ds3fp-s",
115
+ "Ds3fsos",
116
+ "Ds3fsrs",
117
+ "Ds3mp-s",
118
+ "Ds3ms-s",
119
+ "Dw3--r---e",
120
+ "Dw3-po---e",
121
+ "Dw3fpr",
122
+ "Dw3fso---e",
123
+ "Dw3fsr",
124
+ "Dw3mpr",
125
+ "Dw3mso---e",
126
+ "Dw3msr",
127
+ "Dz3fsr---e",
128
+ "Dz3mso---e",
129
+ "Dz3msr---e",
130
+ "EQUAL",
131
+ "EXCL",
132
+ "EXCLHELLIP",
133
+ "GE",
134
+ "GT",
135
+ "HELLIP",
136
+ "I",
137
+ "LCURL",
138
+ "LPAR",
139
+ "LSQR",
140
+ "LT",
141
+ "M",
142
+ "Mc",
143
+ "Mc-p-d",
144
+ "Mc-p-l",
145
+ "Mcfp-l",
146
+ "Mcfp-ln",
147
+ "Mcfprln",
148
+ "Mcfprly",
149
+ "Mcfsoln",
150
+ "Mcfsrln",
151
+ "Mcmp-l",
152
+ "Mcms-ln",
153
+ "Mcmsrl",
154
+ "Mcmsrly",
155
+ "Mffprln",
156
+ "Mffsrln",
157
+ "Mlfpo",
158
+ "Mlfpr",
159
+ "Mlmpr",
160
+ "Mo---l",
161
+ "Mo---ln",
162
+ "Mo-s-r",
163
+ "Mofp-ln",
164
+ "Mofpoly",
165
+ "Mofprly",
166
+ "Mofs-l",
167
+ "Mofsoln",
168
+ "Mofsoly",
169
+ "Mofsrln",
170
+ "Mofsrly",
171
+ "Mompoly",
172
+ "Momprly",
173
+ "Moms-l",
174
+ "Moms-ln",
175
+ "Momsoly",
176
+ "Momsrly",
177
+ "Nc",
178
+ "Nc---n",
179
+ "Ncf--n",
180
+ "Ncfp-n",
181
+ "Ncfpoy",
182
+ "Ncfpry",
183
+ "Ncfs-n",
184
+ "Ncfson",
185
+ "Ncfsoy",
186
+ "Ncfsrn",
187
+ "Ncfsry",
188
+ "Ncfsryy",
189
+ "Ncfsvy",
190
+ "Ncm--n",
191
+ "Ncmp-n",
192
+ "Ncmpoy",
193
+ "Ncmpry",
194
+ "Ncms-n",
195
+ "Ncms-ny",
196
+ "Ncms-y",
197
+ "Ncmsoy",
198
+ "Ncmsrn",
199
+ "Ncmsry",
200
+ "Ncmsryy",
201
+ "Ncmsvn",
202
+ "Ncmsvy",
203
+ "Np",
204
+ "Npfson",
205
+ "Npfsoy",
206
+ "Npfsrn",
207
+ "Npfsry",
208
+ "Npmpoy",
209
+ "Npmpry",
210
+ "Npms-n",
211
+ "Npmsoy",
212
+ "Npmsry",
213
+ "PERCENT",
214
+ "PERIOD",
215
+ "PLUS",
216
+ "PLUSMINUS",
217
+ "Pd3-po",
218
+ "Pd3fpr",
219
+ "Pd3fso",
220
+ "Pd3fsr",
221
+ "Pd3mpo",
222
+ "Pd3mpr",
223
+ "Pd3mpr--y",
224
+ "Pd3mso",
225
+ "Pd3msr",
226
+ "Pi3",
227
+ "Pi3--r",
228
+ "Pi3-po",
229
+ "Pi3-so",
230
+ "Pi3-sr",
231
+ "Pi3fpr",
232
+ "Pi3fso",
233
+ "Pi3fsr",
234
+ "Pi3mpr",
235
+ "Pi3mso",
236
+ "Pi3msr",
237
+ "Pi3msr--y",
238
+ "Pp1-pa--------w",
239
+ "Pp1-pa--y-----w",
240
+ "Pp1-pd--------s",
241
+ "Pp1-pd--------w",
242
+ "Pp1-pd--y-----w",
243
+ "Pp1-pr--------s",
244
+ "Pp1-sa--------s",
245
+ "Pp1-sa--------w",
246
+ "Pp1-sa--y-----w",
247
+ "Pp1-sd--------s",
248
+ "Pp1-sd--------w",
249
+ "Pp1-sd--y-----w",
250
+ "Pp1-sn--------s",
251
+ "Pp2-----------s",
252
+ "Pp2-pa--------w",
253
+ "Pp2-pa--y-----w",
254
+ "Pp2-pd--------w",
255
+ "Pp2-pd--y-----w",
256
+ "Pp2-pr--------s",
257
+ "Pp2-sa--------s",
258
+ "Pp2-sa--------w",
259
+ "Pp2-sa--y-----w",
260
+ "Pp2-sd--------s",
261
+ "Pp2-sd--------w",
262
+ "Pp2-sd--y-----w",
263
+ "Pp2-sn--------s",
264
+ "Pp2-so--------s",
265
+ "Pp2-sr--------s",
266
+ "Pp3-p---------s",
267
+ "Pp3-pd--------w",
268
+ "Pp3-pd--y-----w",
269
+ "Pp3-po--------s",
270
+ "Pp3-sd--------w",
271
+ "Pp3-sd--y-----w",
272
+ "Pp3fpa--------w",
273
+ "Pp3fpa--y-----w",
274
+ "Pp3fpr--------s",
275
+ "Pp3fs---------s",
276
+ "Pp3fsa--------w",
277
+ "Pp3fsa--y-----w",
278
+ "Pp3fso--------s",
279
+ "Pp3fsr--------s",
280
+ "Pp3fsr--y-----s",
281
+ "Pp3mpa--------w",
282
+ "Pp3mpa--y-----w",
283
+ "Pp3mpr--------s",
284
+ "Pp3ms---------s",
285
+ "Pp3msa--------w",
286
+ "Pp3msa--y-----w",
287
+ "Pp3mso--------s",
288
+ "Pp3msr--------s",
289
+ "Pp3msr--y-----s",
290
+ "Ps1fp-s",
291
+ "Ps1fsrp",
292
+ "Ps1fsrs",
293
+ "Ps1mp-p",
294
+ "Ps1ms-p",
295
+ "Ps2fp-s",
296
+ "Ps2fsrp",
297
+ "Ps2fsrs",
298
+ "Ps2ms-s",
299
+ "Ps3---p",
300
+ "Ps3---s",
301
+ "Ps3fp-s",
302
+ "Ps3fsrs",
303
+ "Ps3mp-s",
304
+ "Ps3ms-s",
305
+ "Pw3--r",
306
+ "Pw3-po",
307
+ "Pw3-so",
308
+ "Pw3fpr",
309
+ "Pw3fso",
310
+ "Pw3mpr",
311
+ "Pw3mso",
312
+ "Px3--a--------s",
313
+ "Px3--a--------w",
314
+ "Px3--a--y-----w",
315
+ "Px3--d--------w",
316
+ "Px3--d--y-----w",
317
+ "Pz3-sr",
318
+ "Pz3fsr",
319
+ "QUEST",
320
+ "QUOT",
321
+ "Qf",
322
+ "Qn",
323
+ "Qs",
324
+ "Qs-y",
325
+ "Qz",
326
+ "Qz-y",
327
+ "RCURL",
328
+ "RPAR",
329
+ "RSQR",
330
+ "Rc",
331
+ "Rgc",
332
+ "Rgp",
333
+ "Rgpy",
334
+ "Rgs",
335
+ "Rp",
336
+ "Rw",
337
+ "Rw-y",
338
+ "Rz",
339
+ "SCOLON",
340
+ "SLASH",
341
+ "STAR",
342
+ "Sp",
343
+ "Spsa",
344
+ "Spsay",
345
+ "Spsd",
346
+ "Spsg",
347
+ "Td-po",
348
+ "Tdfpr",
349
+ "Tdfso",
350
+ "Tdfsr",
351
+ "Tdmpr",
352
+ "Tdmso",
353
+ "Tdmsr",
354
+ "Tf-so",
355
+ "Tffpoy",
356
+ "Tffpry",
357
+ "Tffs-y",
358
+ "Tfmpoy",
359
+ "Tfms-y",
360
+ "Tfmsoy",
361
+ "Tfmsry",
362
+ "Ti-po",
363
+ "Tifp-y",
364
+ "Tifso",
365
+ "Tifsr",
366
+ "Timso",
367
+ "Timsr",
368
+ "Tsfp",
369
+ "Tsfs",
370
+ "Tsmp",
371
+ "Tsms",
372
+ "UNDERSC",
373
+ "Va--1",
374
+ "Va--1-----y",
375
+ "Va--1p",
376
+ "Va--1s",
377
+ "Va--1s----y",
378
+ "Va--2p",
379
+ "Va--2p----y",
380
+ "Va--2s",
381
+ "Va--2s----y",
382
+ "Va--3",
383
+ "Va--3-----y",
384
+ "Va--3p",
385
+ "Va--3p----y",
386
+ "Va--3s",
387
+ "Va--3s----y",
388
+ "Vag",
389
+ "Vaii1",
390
+ "Vaii2s",
391
+ "Vaii3p",
392
+ "Vaii3s",
393
+ "Vail3p",
394
+ "Vail3s",
395
+ "Vaip1p",
396
+ "Vaip1s",
397
+ "Vaip2p",
398
+ "Vaip2s",
399
+ "Vaip3p",
400
+ "Vaip3p----y",
401
+ "Vaip3s",
402
+ "Vaip3s----y",
403
+ "Vais3p",
404
+ "Vais3s",
405
+ "Vam-2s",
406
+ "Vanp",
407
+ "Vap--sm",
408
+ "Vasp1p",
409
+ "Vasp1s",
410
+ "Vasp2p",
411
+ "Vasp2s",
412
+ "Vasp3",
413
+ "Vmg",
414
+ "Vmg-------y",
415
+ "Vmii1",
416
+ "Vmii1-----y",
417
+ "Vmii2p",
418
+ "Vmii2s",
419
+ "Vmii3p",
420
+ "Vmii3p----y",
421
+ "Vmii3s",
422
+ "Vmii3s----y",
423
+ "Vmil1",
424
+ "Vmil1p",
425
+ "Vmil2s",
426
+ "Vmil3p",
427
+ "Vmil3p----y",
428
+ "Vmil3s",
429
+ "Vmil3s----y",
430
+ "Vmip1p",
431
+ "Vmip1p----y",
432
+ "Vmip1s",
433
+ "Vmip1s----y",
434
+ "Vmip2p",
435
+ "Vmip2s",
436
+ "Vmip2s----y",
437
+ "Vmip3",
438
+ "Vmip3-----y",
439
+ "Vmip3p",
440
+ "Vmip3s",
441
+ "Vmip3s----y",
442
+ "Vmis1p",
443
+ "Vmis1s",
444
+ "Vmis3p",
445
+ "Vmis3p----y",
446
+ "Vmis3s",
447
+ "Vmis3s----y",
448
+ "Vmm-2p",
449
+ "Vmm-2s",
450
+ "Vmnp",
451
+ "Vmnp------y",
452
+ "Vmp--pf",
453
+ "Vmp--pm",
454
+ "Vmp--sf",
455
+ "Vmp--sm",
456
+ "Vmp--sm---y",
457
+ "Vmsp1p",
458
+ "Vmsp1s",
459
+ "Vmsp2s",
460
+ "Vmsp3",
461
+ "Vmsp3-----y",
462
+ "X",
463
+ "Y",
464
+ "Ya",
465
+ "Yn",
466
+ "Ynfsoy",
467
+ "Ynfsry",
468
+ "Ynmsoy",
469
+ "Ynmsry",
470
+ "Yp",
471
+ "Yp-sr",
472
+ "Yr"
473
+ ]
474
+ }
tagger/model ADDED
Binary file (183 kB). View file
 
tok2vec/cfg ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ {
2
+
3
+ }
tok2vec/model ADDED
Binary file (6.59 MB). View file
 
tokenizer ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ ��prefix_search�
2
+ ��A�
3
+ � ��A� �'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�._.��A�._.�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�1-A��A�1-A�1-UL��A�1-UL�1-Ul��A�1-Ul�1-a��A�1-a�1-ul��A�1-ul�10-A��A�10-A�10-LEA��A�10-LEA�10-Lea��A�10-Lea�10-a��A�10-a�10-lea��A�10-lea�11-A��A�11-A�11-LEA��A�11-LEA�11-Lea��A�11-Lea�11-a��A�11-a�11-lea��A�11-lea�12-A��A�12-A�12-LEA��A�12-LEA�12-Lea��A�12-Lea�12-a��A�12-a�12-lea��A�12-lea�2-A��A�2-A�2-LEA��A�2-LEA�2-Lea��A�2-Lea�2-a��A�2-a�2-lea��A�2-lea�3-A��A�3-A�3-LEA��A�3-LEA�3-Lea��A�3-Lea�3-a��A�3-a�3-lea��A�3-lea�4-A��A�4-A�4-LEA��A�4-LEA�4-Lea��A�4-Lea�4-a��A�4-a�4-lea��A�4-lea�5-A��A�5-A�5-LEA��A�5-LEA�5-Lea��A�5-Lea�5-a��A�5-a�5-lea��A�5-lea�6-A��A�6-A�6-LEA��A�6-LEA�6-Lea��A�6-Lea�6-a��A�6-a�6-lea��A�6-lea�7-A��A�7-A�7-LEA��A�7-LEA�7-Lea��A�7-Lea�7-a��A�7-a�7-lea��A�7-lea�8)��A�8)�8-)��A�8-)�8-A��A�8-A�8-D��A�8-D�8-LEA��A�8-LEA�8-Lea��A�8-Lea�8-a��A�8-a�8-lea��A�8-lea�8D��A�8D�9-A��A�9-A�9-LEA��A�9-LEA�9-Lea��A�9-Lea�9-a��A�9-a�9-lea��A�9-lea�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=[��A�=[�=]��A�=]�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o��A�>:o�><(((*>��A�><(((*>�@_@��A�@_@�A.C.��A�A.C.�A.F.��A�A.F.�A.M.��A�A.M.�A.R.��A�A.R.�AL.��A�AL.�ALIN.��A�ALIN.�ART.��A�ART.�AUG.��A�AUG.�Al.��A�Al.�Alin.��A�Alin.�Art.��A�Art.�Aug.��A�Aug.�BD.��A�BD.�Bd.��A�Bd.�C++��A�C++�D-L��A�D-L�D-LUI��A�D-LUI�D-Lui��A�D-Lui�D-NEI��A�D-NEI�D-Nei��A�D-Nei�D-VOASTRA��A�D-VOASTRA�D-VOASTRĂ��A�D-VOASTRĂ�D-Voastra��A�D-Voastra�D-Voastră��A�D-Voastră�D.P.D.V.��A�D.P.D.V.�DEM.��A�DEM.�DPDV��A�DPDV�DR.��A�DR.�DVS.��A�DVS.�Dem.��A�Dem.�Dpdv��A�Dpdv�Dr.��A�Dr.�Dvs.��A�Dvs.�ETC.��A�ETC.�EX.��A�EX.�Etc.��A�Etc.�Ex.��A�Ex.�FIG.��A�FIG.�FR.��A�FR.�Fig.��A�Fig.�Fr.��A�Fr.�GH.��A�GH.�GR.��A�GR.�Gh.��A�Gh.�Gr.��A�Gr.�IAN.��A�IAN.�ING.��A�ING.�INGR.��A�INGR.�INTR-ADEVAR��A�INTR-ADEVAR�INTR-ADEVĂR��A�INTR-ADEVĂR�Ian.��A�Ian.�Ing.��A�Ing.�Ingr.��A�Ingr.�Intr-Adevar��A�Intr-Adevar�Intr-Adevăr��A�Intr-Adevăr�LIT.��A�LIT.�LT.��A�LT.�Lit.��A�Lit.�Lt.��A�Lt.�NR.��A�NR.�Nr.��A�Nr.�O.O��A�O.O�O.o��A�O.o�OBS.��A�OBS.�O_O��A�O_O�O_o��A�O_o�Obs.��A�Obs.�P.A.��A�P.A.�P.M.��A�P.M.�PCT.��A�PCT.�PREP.��A�PREP.�PROF.��A�PROF.�Pct.��A�Pct.�Prep.��A�Prep.�Prof.��A�Prof.�ROM.��A�ROM.�Rom.��A�Rom.�S.A.��A�S.A.�S.A.M.D.��A�S.A.M.D.�SAMD.��A�SAMD.�SF.��A�SF.�ST.��A�ST.�STR.��A�STR.�Samd.��A�Samd.�Sf.��A�Sf.�St.��A�St.�Str.��A�Str.�TEL.��A�TEL.�Tel.��A�Tel.�UNIV.��A�UNIV.�Univ.��A�Univ.�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�[=��A�[=�\")��A�\")�\n��A�\n�\t��A�\t�]=��A�]=�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�a.��A�a.�a.c.��A�a.c.�a.f.��A�a.f.�a.m.��A�a.m.�a.r.��A�a.r.�al.��A�al.�alin.��A�alin.�art.��A�art.�aug.��A�aug.�b.��A�b.�bd.��A�bd.�c.��A�c.�d-l��A�d-l�d-lui��A�d-lui�d-nei��A�d-nei�d-voastra��A�d-voastra�d-voastră��A�d-voastră�d.��A�d.�d.p.d.v.��A�d.p.d.v.�dem.��A�dem.�dpdv��A�dpdv�dr.��A�dr.�dvs.��A�dvs.�e.��A�e.�etc.��A�etc.�ex.��A�ex.�f.��A�f.�fig.��A�fig.�fr.��A�fr.�g.��A�g.�gh.��A�gh.�gr.��A�gr.�h.��A�h.�i.��A�i.�ian.��A�ian.�ing.��A�ing.�ingr.��A�ingr.�intr-adevar��A�intr-adevar�intr-adevăr��A�intr-adevăr�j.��A�j.�k.��A�k.�l.��A�l.�lit.��A�lit.�lt.��A�lt.�m.��A�m.�n.��A�n.�nr.��A�nr.�o.��A�o.�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�obs.��A�obs.�p.��A�p.�p.a.��A�p.a.�p.m.��A�p.m.�pct.��A�pct.�prep.��A�prep.�prof.��A�prof.�q.��A�q.�r.��A�r.�rom.��A�rom.�s.��A�s.�s.a.��A�s.a.�s.a.m.d.��A�s.a.m.d.�samd.��A�samd.�sf.��A�sf.�st.��A�st.�str.��A�str.�t.��A�t.�tel.��A�tel.�u.��A�u.�univ.��A�univ.�v.��A�v.�v.v��A�v.v�v_v��A�v_v�w.��A�w.�x.��A�x.�xD��A�xD�xDD��A�xDD�y.��A�y.�z.��A�z.� ��A� C� �¯\(ツ)/¯��A�¯\(ツ)/¯�ÎNGR.��A�ÎNGR.�ÎNTR-ADEVAR��A�ÎNTR-ADEVAR�ÎNTR-ADEVĂR��A�ÎNTR-ADEVĂR�Îngr.��A�Îngr.�Într-Adevar��A�Într-Adevar�Într-Adevăr��A�Într-Adevăr�ä.��A�ä.�îngr.��A�îngr.�într-adevar��A�într-adevar�într-adevăr��A�într-adevăr�ö.��A�ö.�ü.��A�ü.�Ş.A.��A�Ş.A.�Ş.A.M.D.��A�Ş.A.M.D.�ŞAMD.��A�ŞAMD.�ŞT.��A�ŞT.�Şamd.��A�Şamd.�Şt.��A�Şt.�ş.a.��A�ş.a.�ş.a.m.d.��A�ş.a.m.d.�şamd.��A�şamd.�şt.��A�şt.�Ș.A.��A�Ș.A.�Ș.A.M.D.��A�Ș.A.M.D.�ȘAMD.��A�ȘAMD.�ȘT.��A�ȘT.�Șamd.��A�Șamd.�Șt.��A�Șt.�ș.a.��A�ș.a.�ș.a.m.d.��A�ș.a.m.d.�șamd.��A�șamd.�șt.��A�șt.�ಠ_ಠ��A�ಠ_ಠ�ಠ︵ಠ��A�ಠ︵ಠ�—��A�—�’��A�’�’’��A�’’
vocab/key2row ADDED
@@ -0,0 +1 @@
 
 
1
+
vocab/lookups.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:76be8b528d0075f7aae98d6fa57a6d3c83ae480a8469e668d7b0af968995ac71
3
+ size 1
vocab/strings.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:559be5fcdd8d099c39c785a46b22493570df3d7f7ced2f87e9b25f59292b77bd
3
+ size 1089368
vocab/vectors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:14772b683e726436d5948ad3fff2b43d036ef2ebbe3458aafed6004e05a40706
3
+ size 128