EC2 Default User commited on
Commit
5cc8dce
1 Parent(s): 5516623

Update spaCy pipeline

Browse files
LICENSES_SOURCES CHANGED
@@ -1,554 +1,3 @@
1
- # Lemmatization Lists
2
-
3
- * Author: Michal Měchura
4
- * URL: https://github.com/michmech/lemmatization-lists/
5
- * License: ODbL
6
-
7
- ```
8
- ## ODC Open Database License (ODbL)
9
-
10
- ### Preamble
11
-
12
- The Open Database License (ODbL) is a license agreement intended to
13
- allow users to freely share, modify, and use this Database while
14
- maintaining this same freedom for others. Many databases are covered by
15
- copyright, and therefore this document licenses these rights. Some
16
- jurisdictions, mainly in the European Union, have specific rights that
17
- cover databases, and so the ODbL addresses these rights, too. Finally,
18
- the ODbL is also an agreement in contract for users of this Database to
19
- act in certain ways in return for accessing this Database.
20
-
21
- Databases can contain a wide variety of types of content (images,
22
- audiovisual material, and sounds all in the same database, for example),
23
- and so the ODbL only governs the rights over the Database, and not the
24
- contents of the Database individually. Licensors should use the ODbL
25
- together with another license for the contents, if the contents have a
26
- single set of rights that uniformly covers all of the contents. If the
27
- contents have multiple sets of different rights, Licensors should
28
- describe what rights govern what contents together in the individual
29
- record or in some other way that clarifies what rights apply.
30
-
31
- Sometimes the contents of a database, or the database itself, can be
32
- covered by other rights not addressed here (such as private contracts,
33
- trade mark over the name, or privacy rights / data protection rights
34
- over information in the contents), and so you are advised that you may
35
- have to consult other documents or clear other rights before doing
36
- activities not covered by this License.
37
-
38
- ------
39
-
40
- The Licensor (as defined below)
41
-
42
- and
43
-
44
- You (as defined below)
45
-
46
- agree as follows:
47
-
48
- ### 1.0 Definitions of Capitalised Words
49
-
50
- "Collective Database" – Means this Database in unmodified form as part
51
- of a collection of independent databases in themselves that together are
52
- assembled into a collective whole. A work that constitutes a Collective
53
- Database will not be considered a Derivative Database.
54
-
55
- "Convey" – As a verb, means Using the Database, a Derivative Database,
56
- or the Database as part of a Collective Database in any way that enables
57
- a Person to make or receive copies of the Database or a Derivative
58
- Database. Conveying does not include interaction with a user through a
59
- computer network, or creating and Using a Produced Work, where no
60
- transfer of a copy of the Database or a Derivative Database occurs.
61
- "Contents" – The contents of this Database, which includes the
62
- information, independent works, or other material collected into the
63
- Database. For example, the contents of the Database could be factual
64
- data or works such as images, audiovisual material, text, or sounds.
65
-
66
- "Database" – A collection of material (the Contents) arranged in a
67
- systematic or methodical way and individually accessible by electronic
68
- or other means offered under the terms of this License.
69
-
70
- "Database Directive" – Means Directive 96/9/EC of the European
71
- Parliament and of the Council of 11 March 1996 on the legal protection
72
- of databases, as amended or succeeded.
73
-
74
- "Database Right" – Means rights resulting from the Chapter III ("sui
75
- generis") rights in the Database Directive (as amended and as transposed
76
- by member states), which includes the Extraction and Re-utilisation of
77
- the whole or a Substantial part of the Contents, as well as any similar
78
- rights available in the relevant jurisdiction under Section 10.4.
79
-
80
- "Derivative Database" – Means a database based upon the Database, and
81
- includes any translation, adaptation, arrangement, modification, or any
82
- other alteration of the Database or of a Substantial part of the
83
- Contents. This includes, but is not limited to, Extracting or
84
- Re-utilising the whole or a Substantial part of the Contents in a new
85
- Database.
86
-
87
- "Extraction" – Means the permanent or temporary transfer of all or a
88
- Substantial part of the Contents to another medium by any means or in
89
- any form.
90
-
91
- "License" – Means this license agreement and is both a license of rights
92
- such as copyright and Database Rights and an agreement in contract.
93
-
94
- "Licensor" – Means the Person that offers the Database under the terms
95
- of this License.
96
-
97
- "Person" – Means a natural or legal person or a body of persons
98
- corporate or incorporate.
99
-
100
- "Produced Work" – a work (such as an image, audiovisual material, text,
101
- or sounds) resulting from using the whole or a Substantial part of the
102
- Contents (via a search or other query) from this Database, a Derivative
103
- Database, or this Database as part of a Collective Database.
104
-
105
- "Publicly" – means to Persons other than You or under Your control by
106
- either more than 50% ownership or by the power to direct their
107
- activities (such as contracting with an independent consultant).
108
-
109
- "Re-utilisation" – means any form of making available to the public all
110
- or a Substantial part of the Contents by the distribution of copies, by
111
- renting, by online or other forms of transmission.
112
-
113
- "Substantial" – Means substantial in terms of quantity or quality or a
114
- combination of both. The repeated and systematic Extraction or
115
- Re-utilisation of insubstantial parts of the Contents may amount to the
116
- Extraction or Re-utilisation of a Substantial part of the Contents.
117
-
118
- "Use" – As a verb, means doing any act that is restricted by copyright
119
- or Database Rights whether in the original medium or any other; and
120
- includes without limitation distributing, copying, publicly performing,
121
- publicly displaying, and preparing derivative works of the Database, as
122
- well as modifying the Database as may be technically necessary to use it
123
- in a different mode or format.
124
-
125
- "You" – Means a Person exercising rights under this License who has not
126
- previously violated the terms of this License with respect to the
127
- Database, or who has received express permission from the Licensor to
128
- exercise rights under this License despite a previous violation.
129
-
130
- Words in the singular include the plural and vice versa.
131
-
132
- ### 2.0 What this License covers
133
-
134
- 2.1. Legal effect of this document. This License is:
135
-
136
- a. A license of applicable copyright and neighbouring rights;
137
-
138
- b. A license of the Database Right; and
139
-
140
- c. An agreement in contract between You and the Licensor.
141
-
142
- 2.2 Legal rights covered. This License covers the legal rights in the
143
- Database, including:
144
-
145
- a. Copyright. Any copyright or neighbouring rights in the Database.
146
- The copyright licensed includes any individual elements of the
147
- Database, but does not cover the copyright over the Contents
148
- independent of this Database. See Section 2.4 for details. Copyright
149
- law varies between jurisdictions, but is likely to cover: the Database
150
- model or schema, which is the structure, arrangement, and organisation
151
- of the Database, and can also include the Database tables and table
152
- indexes; the data entry and output sheets; and the Field names of
153
- Contents stored in the Database;
154
-
155
- b. Database Rights. Database Rights only extend to the Extraction and
156
- Re-utilisation of the whole or a Substantial part of the Contents.
157
- Database Rights can apply even when there is no copyright over the
158
- Database. Database Rights can also apply when the Contents are removed
159
- from the Database and are selected and arranged in a way that would
160
- not infringe any applicable copyright; and
161
-
162
- c. Contract. This is an agreement between You and the Licensor for
163
- access to the Database. In return you agree to certain conditions of
164
- use on this access as outlined in this License.
165
-
166
- 2.3 Rights not covered.
167
-
168
- a. This License does not apply to computer programs used in the making
169
- or operation of the Database;
170
-
171
- b. This License does not cover any patents over the Contents or the
172
- Database; and
173
-
174
- c. This License does not cover any trademarks associated with the
175
- Database.
176
-
177
- 2.4 Relationship to Contents in the Database. The individual items of
178
- the Contents contained in this Database may be covered by other rights,
179
- including copyright, patent, data protection, privacy, or personality
180
- rights, and this License does not cover any rights (other than Database
181
- Rights or in contract) in individual Contents contained in the Database.
182
- For example, if used on a Database of images (the Contents), this
183
- License would not apply to copyright over individual images, which could
184
- have their own separate licenses, or one single license covering all of
185
- the rights over the images.
186
-
187
- ### 3.0 Rights granted
188
-
189
- 3.1 Subject to the terms and conditions of this License, the Licensor
190
- grants to You a worldwide, royalty-free, non-exclusive, terminable (but
191
- only under Section 9) license to Use the Database for the duration of
192
- any applicable copyright and Database Rights. These rights explicitly
193
- include commercial use, and do not exclude any field of endeavour. To
194
- the extent possible in the relevant jurisdiction, these rights may be
195
- exercised in all media and formats whether now known or created in the
196
- future.
197
-
198
- The rights granted cover, for example:
199
-
200
- a. Extraction and Re-utilisation of the whole or a Substantial part of
201
- the Contents;
202
-
203
- b. Creation of Derivative Databases;
204
-
205
- c. Creation of Collective Databases;
206
-
207
- d. Creation of temporary or permanent reproductions by any means and
208
- in any form, in whole or in part, including of any Derivative
209
- Databases or as a part of Collective Databases; and
210
-
211
- e. Distribution, communication, display, lending, making available, or
212
- performance to the public by any means and in any form, in whole or in
213
- part, including of any Derivative Database or as a part of Collective
214
- Databases.
215
-
216
- 3.2 Compulsory license schemes. For the avoidance of doubt:
217
-
218
- a. Non-waivable compulsory license schemes. In those jurisdictions in
219
- which the right to collect royalties through any statutory or
220
- compulsory licensing scheme cannot be waived, the Licensor reserves
221
- the exclusive right to collect such royalties for any exercise by You
222
- of the rights granted under this License;
223
-
224
- b. Waivable compulsory license schemes. In those jurisdictions in
225
- which the right to collect royalties through any statutory or
226
- compulsory licensing scheme can be waived, the Licensor waives the
227
- exclusive right to collect such royalties for any exercise by You of
228
- the rights granted under this License; and,
229
-
230
- c. Voluntary license schemes. The Licensor waives the right to collect
231
- royalties, whether individually or, in the event that the Licensor is
232
- a member of a collecting society that administers voluntary licensing
233
- schemes, via that society, from any exercise by You of the rights
234
- granted under this License.
235
-
236
- 3.3 The right to release the Database under different terms, or to stop
237
- distributing or making available the Database, is reserved. Note that
238
- this Database may be multiple-licensed, and so You may have the choice
239
- of using alternative licenses for this Database. Subject to Section
240
- 10.4, all other rights not expressly granted by Licensor are reserved.
241
-
242
- ### 4.0 Conditions of Use
243
-
244
- 4.1 The rights granted in Section 3 above are expressly made subject to
245
- Your complying with the following conditions of use. These are important
246
- conditions of this License, and if You fail to follow them, You will be
247
- in material breach of its terms.
248
-
249
- 4.2 Notices. If You Publicly Convey this Database, any Derivative
250
- Database, or the Database as part of a Collective Database, then You
251
- must:
252
-
253
- a. Do so only under the terms of this License or another license
254
- permitted under Section 4.4;
255
-
256
- b. Include a copy of this License (or, as applicable, a license
257
- permitted under Section 4.4) or its Uniform Resource Identifier (URI)
258
- with the Database or Derivative Database, including both in the
259
- Database or Derivative Database and in any relevant documentation; and
260
-
261
- c. Keep intact any copyright or Database Right notices and notices
262
- that refer to this License.
263
-
264
- d. If it is not possible to put the required notices in a particular
265
- file due to its structure, then You must include the notices in a
266
- location (such as a relevant directory) where users would be likely to
267
- look for it.
268
-
269
- 4.3 Notice for using output (Contents). Creating and Using a Produced
270
- Work does not require the notice in Section 4.2. However, if you
271
- Publicly Use a Produced Work, You must include a notice associated with
272
- the Produced Work reasonably calculated to make any Person that uses,
273
- views, accesses, interacts with, or is otherwise exposed to the Produced
274
- Work aware that Content was obtained from the Database, Derivative
275
- Database, or the Database as part of a Collective Database, and that it
276
- is available under this License.
277
-
278
- a. Example notice. The following text will satisfy notice under
279
- Section 4.3:
280
-
281
- Contains information from DATABASE NAME, which is made available
282
- here under the Open Database License (ODbL).
283
-
284
- DATABASE NAME should be replaced with the name of the Database and a
285
- hyperlink to the URI of the Database. "Open Database License" should
286
- contain a hyperlink to the URI of the text of this License. If
287
- hyperlinks are not possible, You should include the plain text of the
288
- required URI's with the above notice.
289
-
290
- 4.4 Share alike.
291
-
292
- a. Any Derivative Database that You Publicly Use must be only under
293
- the terms of:
294
-
295
- i. This License;
296
-
297
- ii. A later version of this License similar in spirit to this
298
- License; or
299
-
300
- iii. A compatible license.
301
-
302
- If You license the Derivative Database under one of the licenses
303
- mentioned in (iii), You must comply with the terms of that license.
304
-
305
- b. For the avoidance of doubt, Extraction or Re-utilisation of the
306
- whole or a Substantial part of the Contents into a new database is a
307
- Derivative Database and must comply with Section 4.4.
308
-
309
- c. Derivative Databases and Produced Works. A Derivative Database is
310
- Publicly Used and so must comply with Section 4.4. if a Produced Work
311
- created from the Derivative Database is Publicly Used.
312
-
313
- d. Share Alike and additional Contents. For the avoidance of doubt,
314
- You must not add Contents to Derivative Databases under Section 4.4 a
315
- that are incompatible with the rights granted under this License.
316
-
317
- e. Compatible licenses. Licensors may authorise a proxy to determine
318
- compatible licenses under Section 4.4 a iii. If they do so, the
319
- authorised proxy's public statement of acceptance of a compatible
320
- license grants You permission to use the compatible license.
321
-
322
-
323
- 4.5 Limits of Share Alike. The requirements of Section 4.4 do not apply
324
- in the following:
325
-
326
- a. For the avoidance of doubt, You are not required to license
327
- Collective Databases under this License if You incorporate this
328
- Database or a Derivative Database in the collection, but this License
329
- still applies to this Database or a Derivative Database as a part of
330
- the Collective Database;
331
-
332
- b. Using this Database, a Derivative Database, or this Database as
333
- part of a Collective Database to create a Produced Work does not
334
- create a Derivative Database for purposes of Section 4.4; and
335
-
336
- c. Use of a Derivative Database internally within an organisation is
337
- not to the public and therefore does not fall under the requirements
338
- of Section 4.4.
339
-
340
- 4.6 Access to Derivative Databases. If You Publicly Use a Derivative
341
- Database or a Produced Work from a Derivative Database, You must also
342
- offer to recipients of the Derivative Database or Produced Work a copy
343
- in a machine readable form of:
344
-
345
- a. The entire Derivative Database; or
346
-
347
- b. A file containing all of the alterations made to the Database or
348
- the method of making the alterations to the Database (such as an
349
- algorithm), including any additional Contents, that make up all the
350
- differences between the Database and the Derivative Database.
351
-
352
- The Derivative Database (under a.) or alteration file (under b.) must be
353
- available at no more than a reasonable production cost for physical
354
- distributions and free of charge if distributed over the internet.
355
-
356
- 4.7 Technological measures and additional terms
357
-
358
- a. This License does not allow You to impose (except subject to
359
- Section 4.7 b.) any terms or any technological measures on the
360
- Database, a Derivative Database, or the whole or a Substantial part of
361
- the Contents that alter or restrict the terms of this License, or any
362
- rights granted under it, or have the effect or intent of restricting
363
- the ability of any person to exercise those rights.
364
-
365
- b. Parallel distribution. You may impose terms or technological
366
- measures on the Database, a Derivative Database, or the whole or a
367
- Substantial part of the Contents (a "Restricted Database") in
368
- contravention of Section 4.74 a. only if You also make a copy of the
369
- Database or a Derivative Database available to the recipient of the
370
- Restricted Database:
371
-
372
- i. That is available without additional fee;
373
-
374
- ii. That is available in a medium that does not alter or restrict
375
- the terms of this License, or any rights granted under it, or have
376
- the effect or intent of restricting the ability of any person to
377
- exercise those rights (an "Unrestricted Database"); and
378
-
379
- iii. The Unrestricted Database is at least as accessible to the
380
- recipient as a practical matter as the Restricted Database.
381
-
382
- c. For the avoidance of doubt, You may place this Database or a
383
- Derivative Database in an authenticated environment, behind a
384
- password, or within a similar access control scheme provided that You
385
- do not alter or restrict the terms of this License or any rights
386
- granted under it or have the effect or intent of restricting the
387
- ability of any person to exercise those rights.
388
-
389
- 4.8 Licensing of others. You may not sublicense the Database. Each time
390
- You communicate the Database, the whole or Substantial part of the
391
- Contents, or any Derivative Database to anyone else in any way, the
392
- Licensor offers to the recipient a license to the Database on the same
393
- terms and conditions as this License. You are not responsible for
394
- enforcing compliance by third parties with this License, but You may
395
- enforce any rights that You have over a Derivative Database. You are
396
- solely responsible for any modifications of a Derivative Database made
397
- by You or another Person at Your direction. You may not impose any
398
- further restrictions on the exercise of the rights granted or affirmed
399
- under this License.
400
-
401
- ### 5.0 Moral rights
402
-
403
- 5.1 Moral rights. This section covers moral rights, including any rights
404
- to be identified as the author of the Database or to object to treatment
405
- that would otherwise prejudice the author's honour and reputation, or
406
- any other derogatory treatment:
407
-
408
- a. For jurisdictions allowing waiver of moral rights, Licensor waives
409
- all moral rights that Licensor may have in the Database to the fullest
410
- extent possible by the law of the relevant jurisdiction under Section
411
- 10.4;
412
-
413
- b. If waiver of moral rights under Section 5.1 a in the relevant
414
- jurisdiction is not possible, Licensor agrees not to assert any moral
415
- rights over the Database and waives all claims in moral rights to the
416
- fullest extent possible by the law of the relevant jurisdiction under
417
- Section 10.4; and
418
-
419
- c. For jurisdictions not allowing waiver or an agreement not to assert
420
- moral rights under Section 5.1 a and b, the author may retain their
421
- moral rights over certain aspects of the Database.
422
-
423
- Please note that some jurisdictions do not allow for the waiver of moral
424
- rights, and so moral rights may still subsist over the Database in some
425
- jurisdictions.
426
-
427
- ### 6.0 Fair dealing, Database exceptions, and other rights not affected
428
-
429
- 6.1 This License does not affect any rights that You or anyone else may
430
- independently have under any applicable law to make any use of this
431
- Database, including without limitation:
432
-
433
- a. Exceptions to the Database Right including: Extraction of Contents
434
- from non-electronic Databases for private purposes, Extraction for
435
- purposes of illustration for teaching or scientific research, and
436
- Extraction or Re-utilisation for public security or an administrative
437
- or judicial procedure.
438
-
439
- b. Fair dealing, fair use, or any other legally recognised limitation
440
- or exception to infringement of copyright or other applicable laws.
441
-
442
- 6.2 This License does not affect any rights of lawful users to Extract
443
- and Re-utilise insubstantial parts of the Contents, evaluated
444
- quantitatively or qualitatively, for any purposes whatsoever, including
445
- creating a Derivative Database (subject to other rights over the
446
- Contents, see Section 2.4). The repeated and systematic Extraction or
447
- Re-utilisation of insubstantial parts of the Contents may however amount
448
- to the Extraction or Re-utilisation of a Substantial part of the
449
- Contents.
450
-
451
- ### 7.0 Warranties and Disclaimer
452
-
453
- 7.1 The Database is licensed by the Licensor "as is" and without any
454
- warranty of any kind, either express, implied, or arising by statute,
455
- custom, course of dealing, or trade usage. Licensor specifically
456
- disclaims any and all implied warranties or conditions of title,
457
- non-infringement, accuracy or completeness, the presence or absence of
458
- errors, fitness for a particular purpose, merchantability, or otherwise.
459
- Some jurisdictions do not allow the exclusion of implied warranties, so
460
- this exclusion may not apply to You.
461
-
462
- ### 8.0 Limitation of liability
463
-
464
- 8.1 Subject to any liability that may not be excluded or limited by law,
465
- the Licensor is not liable for, and expressly excludes, all liability
466
- for loss or damage however and whenever caused to anyone by any use
467
- under this License, whether by You or by anyone else, and whether caused
468
- by any fault on the part of the Licensor or not. This exclusion of
469
- liability includes, but is not limited to, any special, incidental,
470
- consequential, punitive, or exemplary damages such as loss of revenue,
471
- data, anticipated profits, and lost business. This exclusion applies
472
- even if the Licensor has been advised of the possibility of such
473
- damages.
474
-
475
- 8.2 If liability may not be excluded by law, it is limited to actual and
476
- direct financial loss to the extent it is caused by proved negligence on
477
- the part of the Licensor.
478
-
479
- ### 9.0 Termination of Your rights under this License
480
-
481
- 9.1 Any breach by You of the terms and conditions of this License
482
- automatically terminates this License with immediate effect and without
483
- notice to You. For the avoidance of doubt, Persons who have received the
484
- Database, the whole or a Substantial part of the Contents, Derivative
485
- Databases, or the Database as part of a Collective Database from You
486
- under this License will not have their licenses terminated provided
487
- their use is in full compliance with this License or a license granted
488
- under Section 4.8 of this License. Sections 1, 2, 7, 8, 9 and 10 will
489
- survive any termination of this License.
490
-
491
- 9.2 If You are not in breach of the terms of this License, the Licensor
492
- will not terminate Your rights under it.
493
-
494
- 9.3 Unless terminated under Section 9.1, this License is granted to You
495
- for the duration of applicable rights in the Database.
496
-
497
- 9.4 Reinstatement of rights. If you cease any breach of the terms and
498
- conditions of this License, then your full rights under this License
499
- will be reinstated:
500
-
501
- a. Provisionally and subject to permanent termination until the 60th
502
- day after cessation of breach;
503
-
504
- b. Permanently on the 60th day after cessation of breach unless
505
- otherwise reasonably notified by the Licensor; or
506
-
507
- c. Permanently if reasonably notified by the Licensor of the
508
- violation, this is the first time You have received notice of
509
- violation of this License from the Licensor, and You cure the
510
- violation prior to 30 days after your receipt of the notice.
511
-
512
- Persons subject to permanent termination of rights are not eligible to
513
- be a recipient and receive a license under Section 4.8.
514
-
515
- 9.5 Notwithstanding the above, Licensor reserves the right to release
516
- the Database under different license terms or to stop distributing or
517
- making available the Database. Releasing the Database under different
518
- license terms or stopping the distribution of the Database will not
519
- withdraw this License (or any other license that has been, or is
520
- required to be, granted under the terms of this License), and this
521
- License will continue in full force and effect unless terminated as
522
- stated above.
523
-
524
- ### 10.0 General
525
-
526
- 10.1 If any provision of this License is held to be invalid or
527
- unenforceable, that must not affect the validity or enforceability of
528
- the remainder of the terms and conditions of this License and each
529
- remaining provision of this License shall be valid and enforced to the
530
- fullest extent permitted by law.
531
-
532
- 10.2 This License is the entire agreement between the parties with
533
- respect to the rights granted here over the Database. It replaces any
534
- earlier understandings, agreements or representations with respect to
535
- the Database.
536
-
537
- 10.3 If You are in breach of the terms of this License, You will not be
538
- entitled to rely on the terms of this License or to complain of any
539
- breach by the Licensor.
540
-
541
- 10.4 Choice of law. This License takes effect in and will be governed by
542
- the laws of the relevant jurisdiction in which the License terms are
543
- sought to be enforced. If the standard suite of rights granted under
544
- applicable copyright law and Database Rights in the relevant
545
- jurisdiction includes additional rights not granted under this License,
546
- these additional rights are granted in this License in order to meet the
547
- terms of this License.```
548
-
549
-
550
-
551
-
552
  # UD Romanian RRT v2.8
553
 
554
  * Author: Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  # UD Romanian RRT v2.8
2
 
3
  * Author: Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin
README.md CHANGED
@@ -14,61 +14,76 @@ model-index:
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
- value: 0.7485865058
18
  - name: NER Recall
19
  type: recall
20
- value: 0.7629658087
21
  - name: NER F Score
22
  type: f_score
23
- value: 0.7557077626
 
 
 
 
 
 
 
24
  - task:
25
  name: POS
26
  type: token-classification
27
  metrics:
28
- - name: POS Accuracy
29
  type: accuracy
30
- value: 0.9619726156
31
  - task:
32
- name: SENTER
33
  type: token-classification
34
  metrics:
35
- - name: SENTER Precision
36
- type: precision
37
- value: 0.9626168224
38
- - name: SENTER Recall
39
- type: recall
40
- value: 0.9587765957
41
- - name: SENTER F Score
42
- type: f_score
43
- value: 0.9606928714
44
  - task:
45
- name: UNLABELED_DEPENDENCIES
46
  type: token-classification
47
  metrics:
48
- - name: Unlabeled Dependencies Accuracy
49
  type: accuracy
50
- value: 0.8893350063
 
 
 
 
 
 
 
51
  - task:
52
  name: LABELED_DEPENDENCIES
53
  type: token-classification
54
  metrics:
55
- - name: Labeled Dependencies Accuracy
56
- type: accuracy
57
- value: 0.8893350063
 
 
 
 
 
 
 
58
  ---
59
  ### Details: https://spacy.io/models/ro#ro_core_news_md
60
 
61
- Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.
62
 
63
  | Feature | Description |
64
  | --- | --- |
65
  | **Name** | `ro_core_news_md` |
66
- | **Version** | `3.2.0` |
67
- | **spaCy** | `>=3.2.0,<3.3.0` |
68
- | **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
69
- | **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
70
  | **Vectors** | 500000 keys, 20000 unique vectors (300 dimensions) |
71
- | **Sources** | [Lemmatization Lists](https://github.com/michmech/lemmatization-lists/) (Michal Měchura)<br />[UD Romanian RRT v2.8](https://github.com/UniversalDependencies/UD_Romanian-RRT) (Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin)<br />[RONEC - the Romanian Named Entity Corpus (ca9ce460)](https://github.com/dumitrescustefan/ronec) (Dumitrescu, Stefan Daniel; Avram, Andrei-Marius; Morogan, Luciana; Toma; Stefan)<br />[Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)](https://spacy.io) (Explosion) |
72
  | **License** | `CC BY-SA 4.0` |
73
  | **Author** | [Explosion](https://explosion.ai) |
74
 
@@ -76,13 +91,12 @@ Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter
76
 
77
  <details>
78
 
79
- <summary>View label scheme (541 labels for 4 components)</summary>
80
 
81
  | Component | Labels |
82
  | --- | --- |
83
  | **`tagger`** | `ARROW`, `Af`, `Afcfp-n`, `Afcfson`, `Afcfsrn`, `Afcmpoy`, `Afcms-n`, `Afp`, `Afp-p-n`, `Afp-poy`, `Afp-srn`, `Afpf--n`, `Afpfp-n`, `Afpfp-ny`, `Afpfpoy`, `Afpfpry`, `Afpfson`, `Afpfsoy`, `Afpfsrn`, `Afpfsry`, `Afpm--n`, `Afpmp-n`, `Afpmpoy`, `Afpmpry`, `Afpms-n`, `Afpmsoy`, `Afpmsry`, `Afsfp-n`, `Afsfsrn`, `BULLET`, `COLON`, `COMMA`, `Ccssp`, `Ccsspy`, `Crssp`, `Csssp`, `Cssspy`, `DASH`, `DBLQ`, `Dd3-po---e`, `Dd3-po---o`, `Dd3fpo`, `Dd3fpr`, `Dd3fpr---e`, `Dd3fpr---o`, `Dd3fpr--y`, `Dd3fso`, `Dd3fso---e`, `Dd3fsr`, `Dd3fsr---e`, `Dd3fsr---o`, `Dd3fsr--yo`, `Dd3mpo`, `Dd3mpr`, `Dd3mpr---e`, `Dd3mpr---o`, `Dd3mso---e`, `Dd3msr`, `Dd3msr---e`, `Dd3msr---o`, `Dh1ms`, `Dh3fp`, `Dh3fso`, `Dh3fsr`, `Dh3mp`, `Dh3ms`, `Di3`, `Di3-----y`, `Di3--r---e`, `Di3-po`, `Di3-po---e`, `Di3-sr`, `Di3-sr---e`, `Di3-sr--y`, `Di3fp`, `Di3fpr`, `Di3fpr---e`, `Di3fso`, `Di3fso---e`, `Di3fsr`, `Di3fsr---e`, `Di3mp`, `Di3mpr`, `Di3mpr---e`, `Di3ms`, `Di3ms----e`, `Di3mso---e`, `Di3msr`, `Di3msr---e`, `Ds1fp-p`, `Ds1fp-s`, `Ds1fsop`, `Ds1fsos`, `Ds1fsrp`, `Ds1fsrs`, `Ds1fsrs-y`, `Ds1mp-p`, `Ds1mp-s`, `Ds1ms-p`, `Ds1ms-s`, `Ds1msrs-y`, `Ds2---s`, `Ds2fp-p`, `Ds2fp-s`, `Ds2fsrp`, `Ds2fsrs`, `Ds2mp-p`, `Ds2mp-s`, `Ds2ms-p`, `Ds2ms-s`, `Ds3---p`, `Ds3---s`, `Ds3---sy`, `Ds3fp-s`, `Ds3fsos`, `Ds3fsrs`, `Ds3mp-s`, `Ds3ms-s`, `Dw3--r---e`, `Dw3-po---e`, `Dw3fpr`, `Dw3fso---e`, `Dw3fsr`, `Dw3mpr`, `Dw3mso---e`, `Dw3msr`, `Dz3fsr---e`, `Dz3mso---e`, `Dz3msr---e`, `EQUAL`, `EXCL`, `EXCLHELLIP`, `GE`, `GT`, `HELLIP`, `I`, `LCURL`, `LPAR`, `LSQR`, `LT`, `M`, `Mc-p-d`, `Mc-p-l`, `Mc-s-b`, `Mc-s-d`, `Mc-s-l`, `Mcfp-l`, `Mcfp-ln`, `Mcfprln`, `Mcfprly`, `Mcfsoln`, `Mcfsrl`, `Mcfsrln`, `Mcfsrly`, `Mcmp-l`, `Mcms-ln`, `Mcmsrl`, `Mcmsrln`, `Mcmsrly`, `Mffprln`, `Mffsrln`, `Mlfpo`, `Mlfpr`, `Mlmpr`, `Mo---l`, `Mo---ln`, `Mo-s-r`, `Mofp-ln`, `Mofpoly`, `Mofprly`, `Mofs-l`, `Mofsoln`, `Mofsoly`, `Mofsrln`, `Mofsrly`, `Mompoly`, `Momprly`, `Moms-l`, `Moms-ln`, `Momsoly`, `Momsrly`, `Nc`, `Nc---n`, `Ncf--n`, `Ncfp-n`, `Ncfpoy`, `Ncfpry`, `Ncfs-n`, `Ncfson`, `Ncfsoy`, `Ncfsrn`, `Ncfsry`, `Ncfsryy`, `Ncfsvy`, `Ncm--n`, `Ncmp-n`, `Ncmpoy`, `Ncmpry`, `Ncms-n`, `Ncms-ny`, `Ncms-y`, `Ncmsoy`, `Ncmsrn`, `Ncmsry`, `Ncmsryy`, `Ncmsvn`, `Ncmsvy`, `Np`, `Npfson`, `Npfsoy`, `Npfsrn`, `Npfsry`, `Npmpoy`, `Npmpry`, `Npms-n`, `Npmsoy`, `Npmsry`, `PERCENT`, `PERIOD`, `PLUS`, `PLUSMINUS`, `Pd3-po`, `Pd3fpr`, `Pd3fso`, `Pd3fsr`, `Pd3mpo`, `Pd3mpr`, `Pd3mpr--y`, `Pd3mso`, `Pd3msr`, `Pi3--r`, `Pi3-po`, `Pi3-so`, `Pi3-sr`, `Pi3fpr`, `Pi3fso`, `Pi3fsr`, `Pi3mpr`, `Pi3mso`, `Pi3msr`, `Pi3msr--y`, `Pp1-pa--------w`, `Pp1-pa--y-----w`, `Pp1-pd--------s`, `Pp1-pd--------w`, `Pp1-pd--y-----w`, `Pp1-pr--------s`, `Pp1-sa--------s`, `Pp1-sa--------w`, `Pp1-sa--y-----w`, `Pp1-sd--------s`, `Pp1-sd--------w`, `Pp1-sd--y-----w`, `Pp1-sn--------s`, `Pp2-----------s`, `Pp2-pa--------w`, `Pp2-pa--y-----w`, `Pp2-pd--------w`, `Pp2-pd--y-----w`, `Pp2-pr--------s`, `Pp2-sa--------s`, `Pp2-sa--------w`, `Pp2-sa--y-----w`, `Pp2-sd--------s`, `Pp2-sd--------w`, `Pp2-sd--y-----w`, `Pp2-sn--------s`, `Pp2-so--------s`, `Pp2-sr--------s`, `Pp3-p---------s`, `Pp3-pd--------w`, `Pp3-pd--y-----w`, `Pp3-po--------s`, `Pp3-sd--------w`, `Pp3-sd--y-----w`, `Pp3-so--------s`, `Pp3fpa--------w`, `Pp3fpa--y-----w`, `Pp3fpr--------s`, `Pp3fs---------s`, `Pp3fsa--------w`, `Pp3fsa--y-----w`, `Pp3fso--------s`, `Pp3fsr--------s`, `Pp3fsr--y-----s`, `Pp3mpa--------w`, `Pp3mpa--y-----w`, `Pp3mpr--------s`, `Pp3ms---------s`, `Pp3msa--------w`, `Pp3msa--y-----w`, `Pp3mso--------s`, `Pp3msr--------s`, `Pp3msr--y-----s`, `Ps1fp-s`, `Ps1fsrp`, `Ps1fsrs`, `Ps1mp-p`, `Ps1ms-p`, `Ps2fp-s`, `Ps2fsrp`, `Ps2fsrs`, `Ps3---p`, `Ps3---s`, `Ps3fp-s`, `Ps3fsrs`, `Ps3mp-s`, `Ps3ms-s`, `Pw3--r`, `Pw3-po`, `Pw3-so`, `Pw3fpr`, `Pw3fso`, `Pw3mpr`, `Pw3mso`, `Px3--a--------s`, `Px3--a--------w`, `Px3--a--y-----w`, `Px3--d--------w`, `Px3--d--y-----w`, `Pz3-sr`, `Pz3fsr`, `QUEST`, `QUOT`, `Qf`, `Qn`, `Qs`, `Qs-y`, `Qz`, `Qz-y`, `RCURL`, `RPAR`, `RSQR`, `Rc`, `Rgp`, `Rgpy`, `Rgs`, `Rp`, `Rw`, `Rw-y`, `Rz`, `SCOLON`, `SLASH`, `STAR`, `Sp`, `Spsa`, `Spsay`, `Spsd`, `Spsg`, `Td-po`, `Tdfpr`, `Tdfso`, `Tdfsr`, `Tdmpr`, `Tdmso`, `Tdmsr`, `Tf-so`, `Tffpoy`, `Tffpry`, `Tffs-y`, `Tfmpoy`, `Tfms-y`, `Tfmsoy`, `Tfmsry`, `Ti-po`, `Tifp-y`, `Tifso`, `Tifsr`, `Timso`, `Timsr`, `Tsfp`, `Tsfs`, `Tsmp`, `Tsms`, `UNDERSC`, `Va--1`, `Va--1-----y`, `Va--1p`, `Va--1s`, `Va--1s----y`, `Va--2p`, `Va--2p----y`, `Va--2s`, `Va--2s----y`, `Va--3`, `Va--3-----y`, `Va--3p`, `Va--3p----y`, `Va--3s`, `Va--3s----y`, `Vag`, `Vag-------y`, `Vaii1`, `Vaii2s`, `Vaii3p`, `Vaii3s`, `Vail3p`, `Vail3s`, `Vaip1p`, `Vaip1s`, `Vaip2p`, `Vaip2s`, `Vaip3p`, `Vaip3p----y`, `Vaip3s`, `Vaip3s----y`, `Vais3p`, `Vais3s`, `Vam-2s`, `Vanp`, `Vap--sm`, `Vasp1p`, `Vasp1s`, `Vasp2p`, `Vasp2s`, `Vasp3`, `Vmg`, `Vmg-------y`, `Vmii1`, `Vmii1-----y`, `Vmii2p`, `Vmii2s`, `Vmii3p`, `Vmii3p----y`, `Vmii3s`, `Vmii3s----y`, `Vmil1`, `Vmil1p`, `Vmil2s`, `Vmil3p`, `Vmil3p----y`, `Vmil3s`, `Vmil3s----y`, `Vmip1p`, `Vmip1p----y`, `Vmip1s`, `Vmip1s----y`, `Vmip2p`, `Vmip2s`, `Vmip2s----y`, `Vmip3`, `Vmip3-----y`, `Vmip3p`, `Vmip3s`, `Vmip3s----y`, `Vmis1p`, `Vmis1s`, `Vmis3p`, `Vmis3p----y`, `Vmis3s`, `Vmis3s----y`, `Vmm-2p`, `Vmm-2s`, `Vmnp`, `Vmnp------y`, `Vmp--pf`, `Vmp--pm`, `Vmp--sf`, `Vmp--sm`, `Vmp--sm---y`, `Vmsp1p`, `Vmsp2p`, `Vmsp2s`, `Vmsp3`, `Vmsp3-----y`, `X`, `Y`, `Ya`, `Yn`, `Ynfsoy`, `Ynfsry`, `Ynmsoy`, `Ynmsry`, `Yp`, `Yp,Yn`, `Yp-sr`, `Yr` |
84
  | **`parser`** | `ROOT`, `acl`, `advcl`, `advcl:tcl`, `advmod`, `advmod:tmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `cc`, `cc:preconj`, `ccomp`, `ccomp:pmod`, `compound`, `conj`, `cop`, `csubj`, `csubj:pass`, `dep`, `det`, `expl`, `expl:impers`, `expl:pass`, `expl:poss`, `expl:pv`, `fixed`, `flat`, `goeswith`, `iobj`, `mark`, `nmod`, `nmod:tmod`, `nsubj`, `nsubj:pass`, `nummod`, `obj`, `obl`, `obl:agent`, `obl:pmod`, `orphan`, `parataxis`, `punct`, `vocative`, `xcomp` |
85
- | **`senter`** | `I`, `S` |
86
  | **`ner`** | `DATETIME`, `EVENT`, `FACILITY`, `GPE`, `LANGUAGE`, `LOC`, `MONEY`, `NAT_REL_POL`, `NUMERIC_VALUE`, `ORDINAL`, `ORGANIZATION`, `PERIOD`, `PERSON`, `PRODUCT`, `QUANTITY`, `WORK_OF_ART` |
87
 
88
  </details>
@@ -95,18 +109,18 @@ Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter
95
  | `TOKEN_P` | 99.67 |
96
  | `TOKEN_R` | 99.57 |
97
  | `TOKEN_F` | 99.59 |
98
- | `TAG_ACC` | 96.20 |
99
- | `SENTS_P` | 96.26 |
100
- | `SENTS_R` | 95.88 |
101
- | `SENTS_F` | 96.07 |
102
- | `DEP_UAS` | 88.93 |
103
- | `DEP_LAS` | 83.88 |
104
- | `POS_ACC` | 93.82 |
105
- | `MORPH_ACC` | 94.69 |
 
106
  | `MORPH_MICRO_P` | 98.71 |
107
- | `MORPH_MICRO_R` | 95.58 |
108
- | `MORPH_MICRO_F` | 96.84 |
109
- | `LEMMA_ACC` | 81.83 |
110
- | `ENTS_P` | 74.86 |
111
- | `ENTS_R` | 76.30 |
112
- | `ENTS_F` | 75.57 |
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
+ value: 0.7497185741
18
  - name: NER Recall
19
  type: recall
20
+ value: 0.767575874
21
  - name: NER F Score
22
  type: f_score
23
+ value: 0.7585421412
24
+ - task:
25
+ name: TAG
26
+ type: token-classification
27
+ metrics:
28
+ - name: TAG (XPOS) Accuracy
29
+ type: accuracy
30
+ value: 0.9627631502
31
  - task:
32
  name: POS
33
  type: token-classification
34
  metrics:
35
+ - name: POS (UPOS) Accuracy
36
  type: accuracy
37
+ value: 0.938272277
38
  - task:
39
+ name: MORPH
40
  type: token-classification
41
  metrics:
42
+ - name: Morph (UFeats) Accuracy
43
+ type: accuracy
44
+ value: 0.9472820032
 
 
 
 
 
 
45
  - task:
46
+ name: LEMMA
47
  type: token-classification
48
  metrics:
49
+ - name: Lemma Accuracy
50
  type: accuracy
51
+ value: 0.9547600199
52
+ - task:
53
+ name: UNLABELED_DEPENDENCIES
54
+ type: token-classification
55
+ metrics:
56
+ - name: Unlabeled Attachment Score (UAS)
57
+ type: f_score
58
+ value: 0.8861775868
59
  - task:
60
  name: LABELED_DEPENDENCIES
61
  type: token-classification
62
  metrics:
63
+ - name: Labeled Attachment Score (LAS)
64
+ type: f_score
65
+ value: 0.8312266725
66
+ - task:
67
+ name: SENTS
68
+ type: token-classification
69
+ metrics:
70
+ - name: Sentences F-Score
71
+ type: f_score
72
+ value: 0.9620253165
73
  ---
74
  ### Details: https://spacy.io/models/ro#ro_core_news_md
75
 
76
+ Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, lemmatizer (trainable_lemmatizer), senter, ner, attribute_ruler.
77
 
78
  | Feature | Description |
79
  | --- | --- |
80
  | **Name** | `ro_core_news_md` |
81
+ | **Version** | `3.3.0` |
82
+ | **spaCy** | `>=3.3.0.dev0,<3.4.0` |
83
+ | **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `lemmatizer`, `attribute_ruler`, `ner` |
84
+ | **Components** | `tok2vec`, `tagger`, `parser`, `lemmatizer`, `senter`, `attribute_ruler`, `ner` |
85
  | **Vectors** | 500000 keys, 20000 unique vectors (300 dimensions) |
86
+ | **Sources** | [UD Romanian RRT v2.8](https://github.com/UniversalDependencies/UD_Romanian-RRT) (Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin)<br />[RONEC - the Romanian Named Entity Corpus (ca9ce460)](https://github.com/dumitrescustefan/ronec) (Dumitrescu, Stefan Daniel; Avram, Andrei-Marius; Morogan, Luciana; Toma; Stefan)<br />[Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)](https://spacy.io) (Explosion) |
87
  | **License** | `CC BY-SA 4.0` |
88
  | **Author** | [Explosion](https://explosion.ai) |
89
 
91
 
92
  <details>
93
 
94
+ <summary>View label scheme (539 labels for 3 components)</summary>
95
 
96
  | Component | Labels |
97
  | --- | --- |
98
  | **`tagger`** | `ARROW`, `Af`, `Afcfp-n`, `Afcfson`, `Afcfsrn`, `Afcmpoy`, `Afcms-n`, `Afp`, `Afp-p-n`, `Afp-poy`, `Afp-srn`, `Afpf--n`, `Afpfp-n`, `Afpfp-ny`, `Afpfpoy`, `Afpfpry`, `Afpfson`, `Afpfsoy`, `Afpfsrn`, `Afpfsry`, `Afpm--n`, `Afpmp-n`, `Afpmpoy`, `Afpmpry`, `Afpms-n`, `Afpmsoy`, `Afpmsry`, `Afsfp-n`, `Afsfsrn`, `BULLET`, `COLON`, `COMMA`, `Ccssp`, `Ccsspy`, `Crssp`, `Csssp`, `Cssspy`, `DASH`, `DBLQ`, `Dd3-po---e`, `Dd3-po---o`, `Dd3fpo`, `Dd3fpr`, `Dd3fpr---e`, `Dd3fpr---o`, `Dd3fpr--y`, `Dd3fso`, `Dd3fso---e`, `Dd3fsr`, `Dd3fsr---e`, `Dd3fsr---o`, `Dd3fsr--yo`, `Dd3mpo`, `Dd3mpr`, `Dd3mpr---e`, `Dd3mpr---o`, `Dd3mso---e`, `Dd3msr`, `Dd3msr---e`, `Dd3msr---o`, `Dh1ms`, `Dh3fp`, `Dh3fso`, `Dh3fsr`, `Dh3mp`, `Dh3ms`, `Di3`, `Di3-----y`, `Di3--r---e`, `Di3-po`, `Di3-po---e`, `Di3-sr`, `Di3-sr---e`, `Di3-sr--y`, `Di3fp`, `Di3fpr`, `Di3fpr---e`, `Di3fso`, `Di3fso---e`, `Di3fsr`, `Di3fsr---e`, `Di3mp`, `Di3mpr`, `Di3mpr---e`, `Di3ms`, `Di3ms----e`, `Di3mso---e`, `Di3msr`, `Di3msr---e`, `Ds1fp-p`, `Ds1fp-s`, `Ds1fsop`, `Ds1fsos`, `Ds1fsrp`, `Ds1fsrs`, `Ds1fsrs-y`, `Ds1mp-p`, `Ds1mp-s`, `Ds1ms-p`, `Ds1ms-s`, `Ds1msrs-y`, `Ds2---s`, `Ds2fp-p`, `Ds2fp-s`, `Ds2fsrp`, `Ds2fsrs`, `Ds2mp-p`, `Ds2mp-s`, `Ds2ms-p`, `Ds2ms-s`, `Ds3---p`, `Ds3---s`, `Ds3---sy`, `Ds3fp-s`, `Ds3fsos`, `Ds3fsrs`, `Ds3mp-s`, `Ds3ms-s`, `Dw3--r---e`, `Dw3-po---e`, `Dw3fpr`, `Dw3fso---e`, `Dw3fsr`, `Dw3mpr`, `Dw3mso---e`, `Dw3msr`, `Dz3fsr---e`, `Dz3mso---e`, `Dz3msr---e`, `EQUAL`, `EXCL`, `EXCLHELLIP`, `GE`, `GT`, `HELLIP`, `I`, `LCURL`, `LPAR`, `LSQR`, `LT`, `M`, `Mc-p-d`, `Mc-p-l`, `Mc-s-b`, `Mc-s-d`, `Mc-s-l`, `Mcfp-l`, `Mcfp-ln`, `Mcfprln`, `Mcfprly`, `Mcfsoln`, `Mcfsrl`, `Mcfsrln`, `Mcfsrly`, `Mcmp-l`, `Mcms-ln`, `Mcmsrl`, `Mcmsrln`, `Mcmsrly`, `Mffprln`, `Mffsrln`, `Mlfpo`, `Mlfpr`, `Mlmpr`, `Mo---l`, `Mo---ln`, `Mo-s-r`, `Mofp-ln`, `Mofpoly`, `Mofprly`, `Mofs-l`, `Mofsoln`, `Mofsoly`, `Mofsrln`, `Mofsrly`, `Mompoly`, `Momprly`, `Moms-l`, `Moms-ln`, `Momsoly`, `Momsrly`, `Nc`, `Nc---n`, `Ncf--n`, `Ncfp-n`, `Ncfpoy`, `Ncfpry`, `Ncfs-n`, `Ncfson`, `Ncfsoy`, `Ncfsrn`, `Ncfsry`, `Ncfsryy`, `Ncfsvy`, `Ncm--n`, `Ncmp-n`, `Ncmpoy`, `Ncmpry`, `Ncms-n`, `Ncms-ny`, `Ncms-y`, `Ncmsoy`, `Ncmsrn`, `Ncmsry`, `Ncmsryy`, `Ncmsvn`, `Ncmsvy`, `Np`, `Npfson`, `Npfsoy`, `Npfsrn`, `Npfsry`, `Npmpoy`, `Npmpry`, `Npms-n`, `Npmsoy`, `Npmsry`, `PERCENT`, `PERIOD`, `PLUS`, `PLUSMINUS`, `Pd3-po`, `Pd3fpr`, `Pd3fso`, `Pd3fsr`, `Pd3mpo`, `Pd3mpr`, `Pd3mpr--y`, `Pd3mso`, `Pd3msr`, `Pi3--r`, `Pi3-po`, `Pi3-so`, `Pi3-sr`, `Pi3fpr`, `Pi3fso`, `Pi3fsr`, `Pi3mpr`, `Pi3mso`, `Pi3msr`, `Pi3msr--y`, `Pp1-pa--------w`, `Pp1-pa--y-----w`, `Pp1-pd--------s`, `Pp1-pd--------w`, `Pp1-pd--y-----w`, `Pp1-pr--------s`, `Pp1-sa--------s`, `Pp1-sa--------w`, `Pp1-sa--y-----w`, `Pp1-sd--------s`, `Pp1-sd--------w`, `Pp1-sd--y-----w`, `Pp1-sn--------s`, `Pp2-----------s`, `Pp2-pa--------w`, `Pp2-pa--y-----w`, `Pp2-pd--------w`, `Pp2-pd--y-----w`, `Pp2-pr--------s`, `Pp2-sa--------s`, `Pp2-sa--------w`, `Pp2-sa--y-----w`, `Pp2-sd--------s`, `Pp2-sd--------w`, `Pp2-sd--y-----w`, `Pp2-sn--------s`, `Pp2-so--------s`, `Pp2-sr--------s`, `Pp3-p---------s`, `Pp3-pd--------w`, `Pp3-pd--y-----w`, `Pp3-po--------s`, `Pp3-sd--------w`, `Pp3-sd--y-----w`, `Pp3-so--------s`, `Pp3fpa--------w`, `Pp3fpa--y-----w`, `Pp3fpr--------s`, `Pp3fs---------s`, `Pp3fsa--------w`, `Pp3fsa--y-----w`, `Pp3fso--------s`, `Pp3fsr--------s`, `Pp3fsr--y-----s`, `Pp3mpa--------w`, `Pp3mpa--y-----w`, `Pp3mpr--------s`, `Pp3ms---------s`, `Pp3msa--------w`, `Pp3msa--y-----w`, `Pp3mso--------s`, `Pp3msr--------s`, `Pp3msr--y-----s`, `Ps1fp-s`, `Ps1fsrp`, `Ps1fsrs`, `Ps1mp-p`, `Ps1ms-p`, `Ps2fp-s`, `Ps2fsrp`, `Ps2fsrs`, `Ps3---p`, `Ps3---s`, `Ps3fp-s`, `Ps3fsrs`, `Ps3mp-s`, `Ps3ms-s`, `Pw3--r`, `Pw3-po`, `Pw3-so`, `Pw3fpr`, `Pw3fso`, `Pw3mpr`, `Pw3mso`, `Px3--a--------s`, `Px3--a--------w`, `Px3--a--y-----w`, `Px3--d--------w`, `Px3--d--y-----w`, `Pz3-sr`, `Pz3fsr`, `QUEST`, `QUOT`, `Qf`, `Qn`, `Qs`, `Qs-y`, `Qz`, `Qz-y`, `RCURL`, `RPAR`, `RSQR`, `Rc`, `Rgp`, `Rgpy`, `Rgs`, `Rp`, `Rw`, `Rw-y`, `Rz`, `SCOLON`, `SLASH`, `STAR`, `Sp`, `Spsa`, `Spsay`, `Spsd`, `Spsg`, `Td-po`, `Tdfpr`, `Tdfso`, `Tdfsr`, `Tdmpr`, `Tdmso`, `Tdmsr`, `Tf-so`, `Tffpoy`, `Tffpry`, `Tffs-y`, `Tfmpoy`, `Tfms-y`, `Tfmsoy`, `Tfmsry`, `Ti-po`, `Tifp-y`, `Tifso`, `Tifsr`, `Timso`, `Timsr`, `Tsfp`, `Tsfs`, `Tsmp`, `Tsms`, `UNDERSC`, `Va--1`, `Va--1-----y`, `Va--1p`, `Va--1s`, `Va--1s----y`, `Va--2p`, `Va--2p----y`, `Va--2s`, `Va--2s----y`, `Va--3`, `Va--3-----y`, `Va--3p`, `Va--3p----y`, `Va--3s`, `Va--3s----y`, `Vag`, `Vag-------y`, `Vaii1`, `Vaii2s`, `Vaii3p`, `Vaii3s`, `Vail3p`, `Vail3s`, `Vaip1p`, `Vaip1s`, `Vaip2p`, `Vaip2s`, `Vaip3p`, `Vaip3p----y`, `Vaip3s`, `Vaip3s----y`, `Vais3p`, `Vais3s`, `Vam-2s`, `Vanp`, `Vap--sm`, `Vasp1p`, `Vasp1s`, `Vasp2p`, `Vasp2s`, `Vasp3`, `Vmg`, `Vmg-------y`, `Vmii1`, `Vmii1-----y`, `Vmii2p`, `Vmii2s`, `Vmii3p`, `Vmii3p----y`, `Vmii3s`, `Vmii3s----y`, `Vmil1`, `Vmil1p`, `Vmil2s`, `Vmil3p`, `Vmil3p----y`, `Vmil3s`, `Vmil3s----y`, `Vmip1p`, `Vmip1p----y`, `Vmip1s`, `Vmip1s----y`, `Vmip2p`, `Vmip2s`, `Vmip2s----y`, `Vmip3`, `Vmip3-----y`, `Vmip3p`, `Vmip3s`, `Vmip3s----y`, `Vmis1p`, `Vmis1s`, `Vmis3p`, `Vmis3p----y`, `Vmis3s`, `Vmis3s----y`, `Vmm-2p`, `Vmm-2s`, `Vmnp`, `Vmnp------y`, `Vmp--pf`, `Vmp--pm`, `Vmp--sf`, `Vmp--sm`, `Vmp--sm---y`, `Vmsp1p`, `Vmsp2p`, `Vmsp2s`, `Vmsp3`, `Vmsp3-----y`, `X`, `Y`, `Ya`, `Yn`, `Ynfsoy`, `Ynfsry`, `Ynmsoy`, `Ynmsry`, `Yp`, `Yp,Yn`, `Yp-sr`, `Yr` |
99
  | **`parser`** | `ROOT`, `acl`, `advcl`, `advcl:tcl`, `advmod`, `advmod:tmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `cc`, `cc:preconj`, `ccomp`, `ccomp:pmod`, `compound`, `conj`, `cop`, `csubj`, `csubj:pass`, `dep`, `det`, `expl`, `expl:impers`, `expl:pass`, `expl:poss`, `expl:pv`, `fixed`, `flat`, `goeswith`, `iobj`, `mark`, `nmod`, `nmod:tmod`, `nsubj`, `nsubj:pass`, `nummod`, `obj`, `obl`, `obl:agent`, `obl:pmod`, `orphan`, `parataxis`, `punct`, `vocative`, `xcomp` |
 
100
  | **`ner`** | `DATETIME`, `EVENT`, `FACILITY`, `GPE`, `LANGUAGE`, `LOC`, `MONEY`, `NAT_REL_POL`, `NUMERIC_VALUE`, `ORDINAL`, `ORGANIZATION`, `PERIOD`, `PERSON`, `PRODUCT`, `QUANTITY`, `WORK_OF_ART` |
101
 
102
  </details>
109
  | `TOKEN_P` | 99.67 |
110
  | `TOKEN_R` | 99.57 |
111
  | `TOKEN_F` | 99.59 |
112
+ | `TAG_ACC` | 96.28 |
113
+ | `SENTS_P` | 96.40 |
114
+ | `SENTS_R` | 96.01 |
115
+ | `SENTS_F` | 96.20 |
116
+ | `DEP_UAS` | 88.62 |
117
+ | `DEP_LAS` | 83.12 |
118
+ | `LEMMA_ACC` | 95.48 |
119
+ | `POS_ACC` | 93.83 |
120
+ | `MORPH_ACC` | 94.73 |
121
  | `MORPH_MICRO_P` | 98.71 |
122
+ | `MORPH_MICRO_R` | 95.64 |
123
+ | `MORPH_MICRO_F` | 96.91 |
124
+ | `ENTS_P` | 74.97 |
125
+ | `ENTS_R` | 76.76 |
126
+ | `ENTS_F` | 75.85 |
 
accuracy.json CHANGED
@@ -3,217 +3,212 @@
3
  "token_p": 0.9967350492,
4
  "token_r": 0.9957244934,
5
  "token_f": 0.9959492157,
6
- "tag_acc": 0.9619726156,
7
- "sents_p": 0.9626168224,
8
- "sents_r": 0.9587765957,
9
- "sents_f": 0.9606928714,
10
- "dep_uas": 0.8893350063,
11
- "dep_las": 0.8388068128,
12
  "dep_las_per_type": {
13
- "case": {
14
- "p": 0.9337493999,
15
- "r": 0.9492435334,
16
- "f": 0.9414327202
17
  },
18
- "det": {
19
- "p": 0.9484425349,
20
- "r": 0.966083151,
21
- "f": 0.9571815718
 
 
 
 
 
22
  },
23
  "nmod:tmod": {
24
- "p": 0.6666666667,
25
- "r": 0.0930232558,
26
- "f": 0.1632653061
27
  },
28
  "amod": {
29
- "p": 0.8737690242,
30
- "r": 0.8864668483,
31
- "f": 0.8800721371
32
  },
33
- "cc": {
34
- "p": 0.877016129,
35
- "r": 0.910041841,
36
- "f": 0.8932238193
37
- },
38
- "conj": {
39
- "p": 0.5879699248,
40
- "r": 0.5915279879,
41
- "f": 0.5897435897
42
  },
43
  "nmod": {
44
- "p": 0.7885679164,
45
- "r": 0.8099747475,
46
- "f": 0.7991279975
47
- },
48
- "mark": {
49
- "p": 0.9161147903,
50
- "r": 0.9222222222,
51
- "f": 0.919158361
52
- },
53
- "fixed": {
54
- "p": 0.8559322034,
55
- "r": 0.7163120567,
56
- "f": 0.7799227799
57
  },
58
- "nsubj": {
59
- "p": 0.8134920635,
60
- "r": 0.7824427481,
61
- "f": 0.7976653696
62
  },
63
- "advcl:tcl": {
64
- "p": 0.0,
65
- "r": 0.0,
66
- "f": 0.0
67
  },
68
  "obj": {
69
- "p": 0.7793880837,
70
- "r": 0.8273504274,
71
- "f": 0.8026533997
72
  },
73
- "nummod": {
74
- "p": 0.8892405063,
75
- "r": 0.8619631902,
76
- "f": 0.8753894081
77
  },
78
- "flat": {
79
- "p": 0.7441860465,
80
- "r": 0.6857142857,
81
- "f": 0.7137546468
82
  },
83
- "obl": {
84
- "p": 0.6402378593,
85
- "r": 0.731596829,
86
- "f": 0.6828752643
87
  },
88
- "obl:pmod": {
89
- "p": 0.4375,
90
- "r": 0.1615384615,
91
- "f": 0.2359550562
92
  },
93
  "acl": {
94
- "p": 0.7222222222,
95
- "r": 0.7303370787,
96
- "f": 0.7262569832
97
  },
98
  "advmod": {
99
- "p": 0.8060686016,
100
- "r": 0.7823303457,
101
- "f": 0.7940220923
102
- },
103
- "expl:pv": {
104
- "p": 0.7777777778,
105
- "r": 0.8191489362,
106
- "f": 0.7979274611
107
  },
108
- "root": {
109
- "p": 0.9103078983,
110
- "r": 0.9042553191,
111
- "f": 0.9072715143
112
  },
113
- "advcl": {
114
- "p": 0.5579710145,
115
- "r": 0.6260162602,
116
- "f": 0.5900383142
117
  },
118
- "iobj": {
119
- "p": 0.7966101695,
120
- "r": 0.6394557823,
121
- "f": 0.7094339623
122
  },
123
- "ccomp": {
124
- "p": 0.6995073892,
125
- "r": 0.802259887,
126
- "f": 0.7473684211
127
  },
128
- "goeswith": {
129
- "p": 0.25,
130
- "r": 0.1428571429,
131
- "f": 0.1818181818
132
  },
133
  "parataxis": {
134
- "p": 0.8494623656,
135
- "r": 0.6030534351,
136
- "f": 0.7053571429
137
- },
138
- "expl:poss": {
139
- "p": 0.6086956522,
140
- "r": 0.6511627907,
141
- "f": 0.6292134831
142
  },
143
- "cop": {
144
- "p": 0.75,
145
- "r": 0.773006135,
146
- "f": 0.7613293051
147
  },
148
- "cc:preconj": {
149
  "p": 0.0,
150
  "r": 0.0,
151
  "f": 0.0
152
  },
153
- "aux": {
154
- "p": 0.9661971831,
155
- "r": 0.9122340426,
156
- "f": 0.9384404925
157
  },
158
- "expl": {
159
- "p": 0.5714285714,
160
- "r": 0.4761904762,
161
- "f": 0.5194805195
162
  },
163
- "appos": {
164
- "p": 0.4691358025,
165
- "r": 0.3762376238,
166
- "f": 0.4175824176
167
  },
168
- "xcomp": {
169
- "p": 0.5538461538,
170
- "r": 0.4337349398,
171
- "f": 0.4864864865
172
  },
173
- "nsubj:pass": {
174
- "p": 0.5878787879,
175
- "r": 0.6381578947,
176
- "f": 0.6119873817
177
  },
178
  "csubj": {
179
- "p": 0.8448275862,
180
- "r": 0.7777777778,
181
- "f": 0.8099173554
182
  },
183
  "obl:agent": {
184
- "p": 0.7538461538,
185
- "r": 0.7538461538,
186
- "f": 0.7538461538
187
- },
188
- "aux:pass": {
189
- "p": 0.7428571429,
190
- "r": 0.8666666667,
191
- "f": 0.8
192
- },
193
- "dep": {
194
  "p": 0.0,
195
  "r": 0.0,
196
  "f": 0.0
197
  },
198
- "advmod:tmod": {
199
  "p": 0.0,
200
  "r": 0.0,
201
  "f": 0.0
202
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
203
  "compound": {
204
- "p": 0.5,
205
- "r": 0.6666666667,
206
- "f": 0.5714285714
 
 
 
 
 
207
  },
208
  "ccomp:pmod": {
209
- "p": 0.5,
210
- "r": 0.1875,
211
- "f": 0.2727272727
212
  },
213
- "expl:pass": {
214
- "p": 0.6808510638,
215
- "r": 0.7032967033,
216
- "f": 0.6918918919
 
 
 
 
 
217
  },
218
  "orphan": {
219
  "p": 0.0,
@@ -222,232 +217,232 @@
222
  },
223
  "expl:impers": {
224
  "p": 0.5,
225
- "r": 0.1,
226
- "f": 0.1666666667
227
  },
228
- "csubj:pass": {
229
- "p": 0.6666666667,
230
- "r": 0.6666666667,
231
- "f": 0.6666666667
232
  },
233
- "vocative": {
234
  "p": 0.0,
235
  "r": 0.0,
236
  "f": 0.0
237
  },
238
- "discourse": {
239
  "p": 0.0,
240
  "r": 0.0,
241
  "f": 0.0
242
  }
243
  },
244
- "pos_acc": 0.9381923087,
245
- "morph_acc": 0.9469023954,
246
- "morph_micro_p": 0.9870716332,
247
- "morph_micro_r": 0.9558096483,
248
- "morph_micro_f": 0.9683797083,
 
249
  "morph_per_feat": {
250
- "AdpType": {
251
- "p": 0.9954051796,
252
- "r": 0.9941593659,
253
- "f": 0.9947818827
254
- },
255
  "Case": {
256
- "p": 0.9873727088,
257
- "r": 0.9820391627,
258
- "f": 0.9846987136
259
- },
260
- "Variant": {
261
- "p": 0.976744186,
262
- "r": 0.9130434783,
263
- "f": 0.9438202247
264
  },
265
  "Gender": {
266
- "p": 0.9821478774,
267
- "r": 0.9776129845,
268
- "f": 0.9798751841
269
  },
270
  "Number": {
271
- "p": 0.9810964083,
272
- "r": 0.9438508752,
273
- "f": 0.9621133125
274
  },
275
- "PronType": {
276
- "p": 0.9902862986,
277
- "r": 0.9872579001,
278
- "f": 0.9887697805
279
- },
280
- "Definite": {
281
- "p": 0.9788447388,
282
- "r": 0.9734723747,
283
- "f": 0.9761511649
284
  },
285
- "Degree": {
286
- "p": 0.9568913175,
287
- "r": 0.9347568209,
288
- "f": 0.9456945695
289
  },
290
  "Polarity": {
291
- "p": 0.9884318766,
292
- "r": 0.9858974359,
293
- "f": 0.9871630295
294
  },
295
- "Mood": {
296
- "p": 0.9740072202,
297
- "r": 0.9677187948,
298
- "f": 0.9708528248
299
  },
300
- "Person": {
301
- "p": 0.9764359352,
302
- "r": 0.9696526508,
303
- "f": 0.9730324711
304
  },
305
- "Tense": {
306
- "p": 0.9707207207,
307
- "r": 0.9563609467,
308
- "f": 0.9634873323
309
  },
310
  "VerbForm": {
311
- "p": 0.9714013346,
312
- "r": 0.9622285175,
313
- "f": 0.9667931689
 
 
 
 
 
 
 
 
 
 
314
  },
315
  "NumForm": {
316
- "p": 0.9758064516,
317
- "r": 0.2929782082,
318
- "f": 0.4506517691
319
  },
320
  "NumType": {
321
- "p": 0.9846153846,
322
- "r": 0.3054892601,
323
- "f": 0.4663023679
324
  },
325
- "PartType": {
326
- "p": 0.9473684211,
327
- "r": 0.9230769231,
328
- "f": 0.9350649351
329
  },
330
  "Strength": {
331
- "p": 0.9914675768,
332
- "r": 0.97319933,
333
- "f": 0.9822485207
334
  },
335
- "Reflex": {
336
- "p": 0.9938461538,
337
- "r": 0.9877675841,
338
- "f": 0.990797546
339
  },
340
- "Poss": {
341
- "p": 0.986013986,
342
- "r": 0.986013986,
343
- "f": 0.986013986
 
 
 
 
 
344
  },
345
  "Position": {
346
- "p": 0.986013986,
347
- "r": 0.9724137931,
348
- "f": 0.9791666667
349
  },
350
  "Number[psor]": {
351
- "p": 0.9420289855,
352
- "r": 0.9558823529,
353
- "f": 0.9489051095
 
 
 
 
 
354
  },
355
  "Foreign": {
356
  "p": 0.0,
357
  "r": 0.0,
358
  "f": 0.0
359
- },
360
- "Abbr": {
361
- "p": 0.9620253165,
362
- "r": 0.9156626506,
363
- "f": 0.9382716049
364
  }
365
  },
366
- "lemma_acc": 0.8183070924,
367
- "ents_p": 0.7485865058,
368
- "ents_r": 0.7629658087,
369
- "ents_f": 0.7557077626,
370
  "ents_per_type": {
371
  "DATETIME": {
372
- "p": 0.0,
373
- "r": 0.0,
374
- "f": 0.0
375
  },
376
- "PERSON": {
377
- "p": 0.0,
378
- "r": 0.0,
379
- "f": 0.0
380
- },
381
- "PRODUCT": {
382
- "p": 0.0,
383
- "r": 0.0,
384
- "f": 0.0
385
  },
386
- "LOC": {
387
- "p": 0.0,
388
- "r": 0.0,
389
- "f": 0.0
390
  },
391
- "GPE": {
392
- "p": 0.0,
393
- "r": 0.0,
394
- "f": 0.0
395
  },
396
  "ORDINAL": {
397
- "p": 0.0,
398
- "r": 0.0,
399
- "f": 0.0
400
  },
401
- "NUMERIC_VALUE": {
402
- "p": 0.0,
403
- "r": 0.0,
404
- "f": 0.0
405
  },
406
- "ORGANIZATION": {
407
- "p": 0.0,
408
- "r": 0.0,
409
- "f": 0.0
 
 
 
 
 
410
  },
411
  "NAT_REL_POL": {
412
- "p": 0.0,
413
- "r": 0.0,
414
- "f": 0.0
415
  },
416
- "WORK_OF_ART": {
417
- "p": 0.0,
418
- "r": 0.0,
419
- "f": 0.0
420
  },
421
- "EVENT": {
422
- "p": 0.0,
423
- "r": 0.0,
424
- "f": 0.0
425
  },
426
- "FACILITY": {
427
- "p": 0.0,
428
- "r": 0.0,
429
- "f": 0.0
 
 
 
 
 
430
  },
431
  "QUANTITY": {
432
- "p": 0.0,
433
- "r": 0.0,
434
- "f": 0.0
435
  },
436
- "MONEY": {
437
- "p": 0.0,
438
- "r": 0.0,
439
- "f": 0.0
440
  },
441
  "LANGUAGE": {
442
- "p": 0.0,
443
- "r": 0.0,
444
- "f": 0.0
445
- },
446
- "PERIOD": {
447
- "p": 0.0,
448
- "r": 0.0,
449
- "f": 0.0
450
  }
451
  },
452
- "speed": 8391.5537539766
453
  }
3
  "token_p": 0.9967350492,
4
  "token_r": 0.9957244934,
5
  "token_f": 0.9959492157,
6
+ "tag_acc": 0.9627631502,
7
+ "sents_p": 0.9639519359,
8
+ "sents_r": 0.960106383,
9
+ "sents_f": 0.9620253165,
10
+ "dep_uas": 0.8861775868,
11
+ "dep_las": 0.8312266725,
12
  "dep_las_per_type": {
13
+ "root": {
14
+ "p": 0.8707360862,
15
+ "r": 0.9133709981,
16
+ "f": 0.8915441176
17
  },
18
+ "mark": {
19
+ "p": 0.9283018868,
20
+ "r": 0.9283018868,
21
+ "f": 0.9283018868
22
+ },
23
+ "case": {
24
+ "p": 0.9663643235,
25
+ "r": 0.9587551556,
26
+ "f": 0.9625447017
27
  },
28
  "nmod:tmod": {
29
+ "p": 0.62,
30
+ "r": 0.2605042017,
31
+ "f": 0.3668639053
32
  },
33
  "amod": {
34
+ "p": 0.8989813243,
35
+ "r": 0.902044293,
36
+ "f": 0.9005102041
37
  },
38
+ "nsubj": {
39
+ "p": 0.8516129032,
40
+ "r": 0.8341232227,
41
+ "f": 0.8427773344
 
 
 
 
 
42
  },
43
  "nmod": {
44
+ "p": 0.826055313,
45
+ "r": 0.8104248483,
46
+ "f": 0.8181654352
 
 
 
 
 
 
 
 
 
 
47
  },
48
+ "aux": {
49
+ "p": 0.9703703704,
50
+ "r": 0.957952468,
51
+ "f": 0.9641214351
52
  },
53
+ "advcl": {
54
+ "p": 0.5436241611,
55
+ "r": 0.6090225564,
56
+ "f": 0.5744680851
57
  },
58
  "obj": {
59
+ "p": 0.8295454545,
60
+ "r": 0.8429561201,
61
+ "f": 0.8361970218
62
  },
63
+ "det": {
64
+ "p": 0.9611428571,
65
+ "r": 0.9524348811,
66
+ "f": 0.9567690557
67
  },
68
+ "cc": {
69
+ "p": 0.9329140461,
70
+ "r": 0.9290187891,
71
+ "f": 0.9309623431
72
  },
73
+ "conj": {
74
+ "p": 0.5798742138,
75
+ "r": 0.5341830823,
76
+ "f": 0.5560916767
77
  },
78
+ "nummod": {
79
+ "p": 0.884375,
80
+ "r": 0.8788819876,
81
+ "f": 0.8816199377
82
  },
83
  "acl": {
84
+ "p": 0.7793696275,
85
+ "r": 0.7028423773,
86
+ "f": 0.7391304348
87
  },
88
  "advmod": {
89
+ "p": 0.8041237113,
90
+ "r": 0.8232189974,
91
+ "f": 0.813559322
 
 
 
 
 
92
  },
93
+ "obl": {
94
+ "p": 0.6925566343,
95
+ "r": 0.8147208122,
96
+ "f": 0.7486880466
97
  },
98
+ "expl:pass": {
99
+ "p": 0.8367346939,
100
+ "r": 0.7592592593,
101
+ "f": 0.7961165049
102
  },
103
+ "nsubj:pass": {
104
+ "p": 0.8116883117,
105
+ "r": 0.762195122,
106
+ "f": 0.786163522
107
  },
108
+ "fixed": {
109
+ "p": 0.8574380165,
110
+ "r": 0.8773784355,
111
+ "f": 0.8672936259
112
  },
113
+ "appos": {
114
+ "p": 0.4957264957,
115
+ "r": 0.4427480916,
116
+ "f": 0.4677419355
117
  },
118
  "parataxis": {
119
+ "p": 0.2631578947,
120
+ "r": 0.2857142857,
121
+ "f": 0.2739726027
 
 
 
 
 
122
  },
123
+ "aux:pass": {
124
+ "p": 0.9391891892,
125
+ "r": 0.9266666667,
126
+ "f": 0.932885906
127
  },
128
+ "nmod:agent": {
129
  "p": 0.0,
130
  "r": 0.0,
131
  "f": 0.0
132
  },
133
+ "ccomp": {
134
+ "p": 0.8852459016,
135
+ "r": 0.8372093023,
136
+ "f": 0.8605577689
137
  },
138
+ "nmod:pmod": {
139
+ "p": 0.0,
140
+ "r": 0.0,
141
+ "f": 0.0
142
  },
143
+ "iobj": {
144
+ "p": 0.734939759,
145
+ "r": 0.7530864198,
146
+ "f": 0.743902439
147
  },
148
+ "flat": {
149
+ "p": 0.8031088083,
150
+ "r": 0.8157894737,
151
+ "f": 0.8093994778
152
  },
153
+ "cop": {
154
+ "p": 0.8225806452,
155
+ "r": 0.8225806452,
156
+ "f": 0.8225806452
157
  },
158
  "csubj": {
159
+ "p": 0.85,
160
+ "r": 0.8095238095,
161
+ "f": 0.8292682927
162
  },
163
  "obl:agent": {
 
 
 
 
 
 
 
 
 
 
164
  "p": 0.0,
165
  "r": 0.0,
166
  "f": 0.0
167
  },
168
+ "obl:pmod": {
169
  "p": 0.0,
170
  "r": 0.0,
171
  "f": 0.0
172
  },
173
+ "expl": {
174
+ "p": 0.6551724138,
175
+ "r": 0.7037037037,
176
+ "f": 0.6785714286
177
+ },
178
+ "xcomp": {
179
+ "p": 0.4137931034,
180
+ "r": 0.4444444444,
181
+ "f": 0.4285714286
182
+ },
183
+ "expl:pv": {
184
+ "p": 0.7631578947,
185
+ "r": 0.8405797101,
186
+ "f": 0.8
187
+ },
188
  "compound": {
189
+ "p": 0.4,
190
+ "r": 0.5714285714,
191
+ "f": 0.4705882353
192
+ },
193
+ "dep": {
194
+ "p": 0.0,
195
+ "r": 0.0,
196
+ "f": 0.0
197
  },
198
  "ccomp:pmod": {
199
+ "p": 0.2857142857,
200
+ "r": 0.6666666667,
201
+ "f": 0.4
202
  },
203
+ "expl:poss": {
204
+ "p": 1.0,
205
+ "r": 0.8387096774,
206
+ "f": 0.9122807018
207
+ },
208
+ "goeswith": {
209
+ "p": 0.0,
210
+ "r": 0.0,
211
+ "f": 0.0
212
  },
213
  "orphan": {
214
  "p": 0.0,
217
  },
218
  "expl:impers": {
219
  "p": 0.5,
220
+ "r": 0.3333333333,
221
+ "f": 0.4
222
  },
223
+ "cc:preconj": {
224
+ "p": 0.0,
225
+ "r": 0.0,
226
+ "f": 0.0
227
  },
228
+ "list": {
229
  "p": 0.0,
230
  "r": 0.0,
231
  "f": 0.0
232
  },
233
+ "csubj:pass": {
234
  "p": 0.0,
235
  "r": 0.0,
236
  "f": 0.0
237
  }
238
  },
239
+ "lemma_acc": 0.9547600199,
240
+ "pos_acc": 0.938272277,
241
+ "morph_acc": 0.9472820032,
242
+ "morph_micro_p": 0.9871439375,
243
+ "morph_micro_r": 0.9564194708,
244
+ "morph_micro_f": 0.9691151644,
245
  "morph_per_feat": {
 
 
 
 
 
246
  "Case": {
247
+ "p": 0.9916975348,
248
+ "r": 0.9874093857,
249
+ "f": 0.9895488147
 
 
 
 
 
250
  },
251
  "Gender": {
252
+ "p": 0.9891837505,
253
+ "r": 0.983247906,
254
+ "f": 0.9862068966
255
  },
256
  "Number": {
257
+ "p": 0.9870188509,
258
+ "r": 0.9210997577,
259
+ "f": 0.9529206626
260
  },
261
+ "Person": {
262
+ "p": 0.9846878681,
263
+ "r": 0.9852681202,
264
+ "f": 0.9849779087
 
 
 
 
 
265
  },
266
+ "PronType": {
267
+ "p": 0.9965301874,
268
+ "r": 0.992398065,
269
+ "f": 0.9944598338
270
  },
271
  "Polarity": {
272
+ "p": 0.9869918699,
273
+ "r": 0.9950819672,
274
+ "f": 0.9910204082
275
  },
276
+ "AdpType": {
277
+ "p": 0.9983039349,
278
+ "r": 0.9959390863,
279
+ "f": 0.9971201084
280
  },
281
+ "Definite": {
282
+ "p": 0.9855930847,
283
+ "r": 0.9773015873,
284
+ "f": 0.9814298239
285
  },
286
+ "Degree": {
287
+ "p": 0.9530931339,
288
+ "r": 0.9415715245,
289
+ "f": 0.9472972973
290
  },
291
  "VerbForm": {
292
+ "p": 0.9708994709,
293
+ "r": 0.9760638298,
294
+ "f": 0.9734748011
295
+ },
296
+ "Abbr": {
297
+ "p": 0.9797979798,
298
+ "r": 0.8660714286,
299
+ "f": 0.9194312796
300
+ },
301
+ "Poss": {
302
+ "p": 1.0,
303
+ "r": 0.9951807229,
304
+ "f": 0.9975845411
305
  },
306
  "NumForm": {
307
+ "p": 0.9958677686,
308
+ "r": 0.3319559229,
309
+ "f": 0.4979338843
310
  },
311
  "NumType": {
312
+ "p": 0.9959016393,
313
+ "r": 0.3337912088,
314
+ "f": 0.5
315
  },
316
+ "Reflex": {
317
+ "p": 1.0,
318
+ "r": 0.9935897436,
319
+ "f": 0.9967845659
320
  },
321
  "Strength": {
322
+ "p": 0.9919678715,
323
+ "r": 0.9801587302,
324
+ "f": 0.9860279441
325
  },
326
+ "Mood": {
327
+ "p": 0.9690909091,
328
+ "r": 0.9779816514,
329
+ "f": 0.9735159817
330
  },
331
+ "Tense": {
332
+ "p": 0.9682080925,
333
+ "r": 0.9738372093,
334
+ "f": 0.9710144928
335
+ },
336
+ "Variant": {
337
+ "p": 0.9932432432,
338
+ "r": 0.9483870968,
339
+ "f": 0.9702970297
340
  },
341
  "Position": {
342
+ "p": 1.0,
343
+ "r": 0.9910714286,
344
+ "f": 0.9955156951
345
  },
346
  "Number[psor]": {
347
+ "p": 1.0,
348
+ "r": 0.9666666667,
349
+ "f": 0.9830508475
350
+ },
351
+ "PartType": {
352
+ "p": 1.0,
353
+ "r": 0.9459459459,
354
+ "f": 0.9722222222
355
  },
356
  "Foreign": {
357
  "p": 0.0,
358
  "r": 0.0,
359
  "f": 0.0
 
 
 
 
 
360
  }
361
  },
362
+ "ents_p": 0.7497185741,
363
+ "ents_r": 0.767575874,
364
+ "ents_f": 0.7585421412,
 
365
  "ents_per_type": {
366
  "DATETIME": {
367
+ "p": 0.7823129252,
368
+ "r": 0.8013937282,
369
+ "f": 0.7917383821
370
  },
371
+ "ORGANIZATION": {
372
+ "p": 0.6882352941,
373
+ "r": 0.7452229299,
374
+ "f": 0.7155963303
 
 
 
 
 
375
  },
376
+ "FACILITY": {
377
+ "p": 0.5409836066,
378
+ "r": 0.5038167939,
379
+ "f": 0.5217391304
380
  },
381
+ "NUMERIC_VALUE": {
382
+ "p": 0.9110169492,
383
+ "r": 0.9110169492,
384
+ "f": 0.9110169492
385
  },
386
  "ORDINAL": {
387
+ "p": 0.7833333333,
388
+ "r": 0.8545454545,
389
+ "f": 0.8173913043
390
  },
391
+ "EVENT": {
392
+ "p": 0.6060606061,
393
+ "r": 0.5405405405,
394
+ "f": 0.5714285714
395
  },
396
+ "GPE": {
397
+ "p": 0.8362445415,
398
+ "r": 0.8804597701,
399
+ "f": 0.8577827548
400
+ },
401
+ "PERSON": {
402
+ "p": 0.7057010786,
403
+ "r": 0.7684563758,
404
+ "f": 0.7357429719
405
  },
406
  "NAT_REL_POL": {
407
+ "p": 0.9416058394,
408
+ "r": 0.86,
409
+ "f": 0.8989547038
410
  },
411
+ "MONEY": {
412
+ "p": 0.8888888889,
413
+ "r": 0.8275862069,
414
+ "f": 0.8571428571
415
  },
416
+ "PRODUCT": {
417
+ "p": 0.5338983051,
418
+ "r": 0.4598540146,
419
+ "f": 0.4941176471
420
  },
421
+ "LOC": {
422
+ "p": 0.5063291139,
423
+ "r": 0.5263157895,
424
+ "f": 0.5161290323
425
+ },
426
+ "WORK_OF_ART": {
427
+ "p": 0.3846153846,
428
+ "r": 0.2631578947,
429
+ "f": 0.3125
430
  },
431
  "QUANTITY": {
432
+ "p": 0.7878787879,
433
+ "r": 1.0,
434
+ "f": 0.8813559322
435
  },
436
+ "PERIOD": {
437
+ "p": 0.8823529412,
438
+ "r": 0.7142857143,
439
+ "f": 0.7894736842
440
  },
441
  "LANGUAGE": {
442
+ "p": 0.8,
443
+ "r": 1.0,
444
+ "f": 0.8888888889
 
 
 
 
 
445
  }
446
  },
447
+ "speed": 8965.3614636966
448
  }
attribute_ruler/patterns CHANGED
Binary files a/attribute_ruler/patterns and b/attribute_ruler/patterns differ
config.cfg CHANGED
@@ -10,7 +10,7 @@ seed = 0
10
 
11
  [nlp]
12
  lang = "ro"
13
- pipeline = ["tok2vec","tagger","parser","senter","attribute_ruler","lemmatizer","ner"]
14
  disabled = ["senter"]
15
  before_creation = null
16
  after_creation = null
@@ -26,11 +26,22 @@ scorer = {"@scorers":"spacy.attribute_ruler_scorer.v1"}
26
  validate = false
27
 
28
  [components.lemmatizer]
29
- factory = "lemmatizer"
30
- mode = "lookup"
31
- model = null
32
  overwrite = false
33
  scorer = {"@scorers":"spacy.lemmatizer_scorer.v1"}
 
 
 
 
 
 
 
 
 
 
 
34
 
35
  [components.ner]
36
  factory = "ner"
@@ -55,7 +66,7 @@ nO = null
55
  @architectures = "spacy.MultiHashEmbed.v2"
56
  width = 96
57
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
58
- rows = [5000,2500,2500,2500,100]
59
  include_static_vectors = true
60
 
61
  [components.ner.model.tok2vec.encode]
@@ -93,8 +104,9 @@ overwrite = false
93
  scorer = {"@scorers":"spacy.senter_scorer.v1"}
94
 
95
  [components.senter.model]
96
- @architectures = "spacy.Tagger.v1"
97
  nO = null
 
98
 
99
  [components.senter.model.tok2vec]
100
  @architectures = "spacy.Tok2Vec.v2"
@@ -115,12 +127,14 @@ maxout_pieces = 2
115
 
116
  [components.tagger]
117
  factory = "tagger"
 
118
  overwrite = false
119
  scorer = {"@scorers":"spacy.tagger_scorer.v1"}
120
 
121
  [components.tagger.model]
122
- @architectures = "spacy.Tagger.v1"
123
  nO = null
 
124
 
125
  [components.tagger.model.tok2vec]
126
  @architectures = "spacy.Tok2VecListener.v1"
@@ -137,7 +151,7 @@ factory = "tok2vec"
137
  @architectures = "spacy.MultiHashEmbed.v2"
138
  width = ${components.tok2vec.model.encode:width}
139
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
140
- rows = [5000,2500,2500,2500,100]
141
  include_static_vectors = true
142
 
143
  [components.tok2vec.model.encode]
@@ -174,7 +188,7 @@ dropout = 0.1
174
  accumulate_gradient = 1
175
  patience = 5000
176
  max_epochs = 0
177
- max_steps = 0
178
  eval_frequency = 1000
179
  frozen_components = []
180
  before_to_disk = null
@@ -209,15 +223,15 @@ eps = 0.00000001
209
  learn_rate = 0.001
210
 
211
  [training.score_weights]
212
- tag_acc = 0.16
213
  dep_uas = 0.0
214
- dep_las = 0.16
215
  dep_las_per_type = null
216
  sents_p = null
217
  sents_r = null
218
- sents_f = 0.02
219
- lemma_acc = 0.5
220
- ents_f = 0.16
221
  ents_p = 0.0
222
  ents_r = 0.0
223
  ents_per_type = null
10
 
11
  [nlp]
12
  lang = "ro"
13
+ pipeline = ["tok2vec","tagger","parser","lemmatizer","senter","attribute_ruler","ner"]
14
  disabled = ["senter"]
15
  before_creation = null
16
  after_creation = null
26
  validate = false
27
 
28
  [components.lemmatizer]
29
+ factory = "trainable_lemmatizer"
30
+ backoff = "orth"
31
+ min_tree_freq = 3
32
  overwrite = false
33
  scorer = {"@scorers":"spacy.lemmatizer_scorer.v1"}
34
+ top_k = 1
35
+
36
+ [components.lemmatizer.model]
37
+ @architectures = "spacy.Tagger.v2"
38
+ nO = null
39
+ normalize = false
40
+
41
+ [components.lemmatizer.model.tok2vec]
42
+ @architectures = "spacy.Tok2VecListener.v1"
43
+ width = ${components.tok2vec.model.encode:width}
44
+ upstream = "tok2vec"
45
 
46
  [components.ner]
47
  factory = "ner"
66
  @architectures = "spacy.MultiHashEmbed.v2"
67
  width = 96
68
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
69
+ rows = [5000,1000,2500,2500,50]
70
  include_static_vectors = true
71
 
72
  [components.ner.model.tok2vec.encode]
104
  scorer = {"@scorers":"spacy.senter_scorer.v1"}
105
 
106
  [components.senter.model]
107
+ @architectures = "spacy.Tagger.v2"
108
  nO = null
109
+ normalize = false
110
 
111
  [components.senter.model.tok2vec]
112
  @architectures = "spacy.Tok2Vec.v2"
127
 
128
  [components.tagger]
129
  factory = "tagger"
130
+ neg_prefix = "!"
131
  overwrite = false
132
  scorer = {"@scorers":"spacy.tagger_scorer.v1"}
133
 
134
  [components.tagger.model]
135
+ @architectures = "spacy.Tagger.v2"
136
  nO = null
137
+ normalize = false
138
 
139
  [components.tagger.model.tok2vec]
140
  @architectures = "spacy.Tok2VecListener.v1"
151
  @architectures = "spacy.MultiHashEmbed.v2"
152
  width = ${components.tok2vec.model.encode:width}
153
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
154
+ rows = [5000,1000,2500,2500,50]
155
  include_static_vectors = true
156
 
157
  [components.tok2vec.model.encode]
188
  accumulate_gradient = 1
189
  patience = 5000
190
  max_epochs = 0
191
+ max_steps = 100000
192
  eval_frequency = 1000
193
  frozen_components = []
194
  before_to_disk = null
223
  learn_rate = 0.001
224
 
225
  [training.score_weights]
226
+ tag_acc = 0.29
227
  dep_uas = 0.0
228
+ dep_las = 0.29
229
  dep_las_per_type = null
230
  sents_p = null
231
  sents_r = null
232
+ sents_f = 0.04
233
+ lemma_acc = 0.1
234
+ ents_f = 0.29
235
  ents_p = 0.0
236
  ents_r = 0.0
237
  ents_per_type = null
lemmatizer/cfg ADDED
@@ -0,0 +1,1141 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "labels":[
3
+ 1,
4
+ 2,
5
+ 3,
6
+ 7,
7
+ 9,
8
+ 12,
9
+ 14,
10
+ 15,
11
+ 19,
12
+ 21,
13
+ 23,
14
+ 27,
15
+ 29,
16
+ 31,
17
+ 33,
18
+ 35,
19
+ 37,
20
+ 39,
21
+ 42,
22
+ 44,
23
+ 46,
24
+ 48,
25
+ 50,
26
+ 52,
27
+ 58,
28
+ 59,
29
+ 63,
30
+ 64,
31
+ 65,
32
+ 68,
33
+ 70,
34
+ 73,
35
+ 75,
36
+ 77,
37
+ 80,
38
+ 82,
39
+ 83,
40
+ 87,
41
+ 89,
42
+ 90,
43
+ 91,
44
+ 93,
45
+ 95,
46
+ 96,
47
+ 97,
48
+ 100,
49
+ 102,
50
+ 104,
51
+ 106,
52
+ 107,
53
+ 109,
54
+ 105,
55
+ 111,
56
+ 113,
57
+ 114,
58
+ 117,
59
+ 119,
60
+ 122,
61
+ 123,
62
+ 124,
63
+ 126,
64
+ 129,
65
+ 133,
66
+ 136,
67
+ 138,
68
+ 140,
69
+ 141,
70
+ 144,
71
+ 146,
72
+ 148,
73
+ 149,
74
+ 152,
75
+ 155,
76
+ 156,
77
+ 157,
78
+ 159,
79
+ 161,
80
+ 163,
81
+ 165,
82
+ 168,
83
+ 170,
84
+ 171,
85
+ 173,
86
+ 175,
87
+ 176,
88
+ 178,
89
+ 180,
90
+ 182,
91
+ 184,
92
+ 185,
93
+ 186,
94
+ 188,
95
+ 190,
96
+ 192,
97
+ 194,
98
+ 195,
99
+ 197,
100
+ 199,
101
+ 201,
102
+ 202,
103
+ 205,
104
+ 207,
105
+ 209,
106
+ 210,
107
+ 212,
108
+ 213,
109
+ 214,
110
+ 217,
111
+ 219,
112
+ 220,
113
+ 222,
114
+ 225,
115
+ 227,
116
+ 229,
117
+ 230,
118
+ 232,
119
+ 233,
120
+ 235,
121
+ 238,
122
+ 240,
123
+ 242,
124
+ 244,
125
+ 247,
126
+ 249,
127
+ 251,
128
+ 253,
129
+ 254,
130
+ 256,
131
+ 258,
132
+ 260,
133
+ 262,
134
+ 263,
135
+ 265,
136
+ 267,
137
+ 269,
138
+ 273,
139
+ 274,
140
+ 276,
141
+ 278,
142
+ 280,
143
+ 281,
144
+ 283,
145
+ 285,
146
+ 287,
147
+ 289,
148
+ 291,
149
+ 293,
150
+ 295,
151
+ 297,
152
+ 298,
153
+ 300,
154
+ 302,
155
+ 304,
156
+ 306,
157
+ 308,
158
+ 310,
159
+ 311,
160
+ 313,
161
+ 314,
162
+ 316,
163
+ 317,
164
+ 319,
165
+ 322,
166
+ 323,
167
+ 325,
168
+ 328,
169
+ 329,
170
+ 332,
171
+ 336,
172
+ 338,
173
+ 340,
174
+ 341,
175
+ 343,
176
+ 344,
177
+ 346,
178
+ 347,
179
+ 349,
180
+ 350,
181
+ 352,
182
+ 354,
183
+ 356,
184
+ 358,
185
+ 359,
186
+ 360,
187
+ 361,
188
+ 363,
189
+ 364,
190
+ 366,
191
+ 368,
192
+ 369,
193
+ 371,
194
+ 71,
195
+ 373,
196
+ 375,
197
+ 377,
198
+ 379,
199
+ 381,
200
+ 384,
201
+ 386,
202
+ 387,
203
+ 390,
204
+ 391,
205
+ 393,
206
+ 394,
207
+ 395,
208
+ 396,
209
+ 397,
210
+ 399,
211
+ 400,
212
+ 402,
213
+ 403,
214
+ 405,
215
+ 406,
216
+ 410,
217
+ 414,
218
+ 416,
219
+ 418,
220
+ 419,
221
+ 422,
222
+ 424,
223
+ 426,
224
+ 427,
225
+ 428,
226
+ 430,
227
+ 432,
228
+ 434,
229
+ 436,
230
+ 437,
231
+ 438,
232
+ 440,
233
+ 443,
234
+ 446,
235
+ 447,
236
+ 448,
237
+ 450,
238
+ 452,
239
+ 454,
240
+ 455,
241
+ 456,
242
+ 459,
243
+ 461,
244
+ 463,
245
+ 466,
246
+ 467,
247
+ 468,
248
+ 472,
249
+ 475,
250
+ 476,
251
+ 477,
252
+ 478,
253
+ 481,
254
+ 482,
255
+ 483,
256
+ 485,
257
+ 486,
258
+ 487,
259
+ 488,
260
+ 490,
261
+ 492,
262
+ 494,
263
+ 497,
264
+ 501,
265
+ 503,
266
+ 506,
267
+ 508,
268
+ 509,
269
+ 510,
270
+ 512,
271
+ 514,
272
+ 515,
273
+ 517,
274
+ 518,
275
+ 519,
276
+ 522,
277
+ 524,
278
+ 527,
279
+ 529,
280
+ 530,
281
+ 532,
282
+ 534,
283
+ 536,
284
+ 538,
285
+ 541,
286
+ 544,
287
+ 546,
288
+ 548,
289
+ 550,
290
+ 551,
291
+ 553,
292
+ 554,
293
+ 556,
294
+ 557,
295
+ 559,
296
+ 560,
297
+ 561,
298
+ 563,
299
+ 567,
300
+ 569,
301
+ 38,
302
+ 571,
303
+ 572,
304
+ 574,
305
+ 576,
306
+ 579,
307
+ 581,
308
+ 583,
309
+ 586,
310
+ 589,
311
+ 590,
312
+ 591,
313
+ 593,
314
+ 594,
315
+ 597,
316
+ 600,
317
+ 601,
318
+ 603,
319
+ 605,
320
+ 606,
321
+ 608,
322
+ 609,
323
+ 611,
324
+ 612,
325
+ 615,
326
+ 617,
327
+ 619,
328
+ 621,
329
+ 623,
330
+ 625,
331
+ 627,
332
+ 629,
333
+ 631,
334
+ 633,
335
+ 635,
336
+ 638,
337
+ 640,
338
+ 643,
339
+ 645,
340
+ 647,
341
+ 648,
342
+ 649,
343
+ 651,
344
+ 653,
345
+ 655,
346
+ 657,
347
+ 659,
348
+ 660,
349
+ 661,
350
+ 663,
351
+ 665,
352
+ 667,
353
+ 668,
354
+ 670,
355
+ 671,
356
+ 672,
357
+ 674,
358
+ 675,
359
+ 676,
360
+ 677,
361
+ 678,
362
+ 680,
363
+ 681,
364
+ 682,
365
+ 685,
366
+ 687,
367
+ 690,
368
+ 692,
369
+ 694,
370
+ 696,
371
+ 697,
372
+ 699,
373
+ 700,
374
+ 702,
375
+ 703,
376
+ 705,
377
+ 706,
378
+ 709,
379
+ 712,
380
+ 714,
381
+ 716,
382
+ 718,
383
+ 720,
384
+ 722,
385
+ 724,
386
+ 726,
387
+ 727,
388
+ 728,
389
+ 730,
390
+ 732,
391
+ 734,
392
+ 735,
393
+ 736,
394
+ 738,
395
+ 741,
396
+ 742,
397
+ 744,
398
+ 745,
399
+ 746,
400
+ 747,
401
+ 748,
402
+ 750,
403
+ 751,
404
+ 753,
405
+ 756,
406
+ 757,
407
+ 758,
408
+ 760,
409
+ 762,
410
+ 766,
411
+ 768,
412
+ 769,
413
+ 771,
414
+ 773,
415
+ 774,
416
+ 776,
417
+ 778,
418
+ 780,
419
+ 781,
420
+ 784,
421
+ 786,
422
+ 788,
423
+ 790,
424
+ 791,
425
+ 792,
426
+ 795,
427
+ 796,
428
+ 798,
429
+ 799,
430
+ 801,
431
+ 803,
432
+ 804,
433
+ 806,
434
+ 808,
435
+ 809,
436
+ 812,
437
+ 814,
438
+ 816,
439
+ 818,
440
+ 820,
441
+ 821,
442
+ 822,
443
+ 824,
444
+ 825,
445
+ 827,
446
+ 828,
447
+ 829,
448
+ 830,
449
+ 832,
450
+ 834,
451
+ 835,
452
+ 837,
453
+ 838,
454
+ 841,
455
+ 842,
456
+ 843,
457
+ 844,
458
+ 845,
459
+ 846,
460
+ 847,
461
+ 848,
462
+ 850,
463
+ 852,
464
+ 854,
465
+ 855,
466
+ 856,
467
+ 858,
468
+ 861,
469
+ 862,
470
+ 863,
471
+ 864,
472
+ 865,
473
+ 867,
474
+ 869,
475
+ 871,
476
+ 873,
477
+ 875,
478
+ 877,
479
+ 880,
480
+ 882,
481
+ 885,
482
+ 887,
483
+ 889,
484
+ 891,
485
+ 893,
486
+ 895,
487
+ 898,
488
+ 899,
489
+ 900,
490
+ 901,
491
+ 903,
492
+ 906,
493
+ 908,
494
+ 909,
495
+ 910,
496
+ 912,
497
+ 913,
498
+ 914,
499
+ 916,
500
+ 918,
501
+ 919,
502
+ 921,
503
+ 922,
504
+ 924,
505
+ 925,
506
+ 928,
507
+ 929,
508
+ 930,
509
+ 932,
510
+ 935,
511
+ 937,
512
+ 939,
513
+ 940,
514
+ 941,
515
+ 943,
516
+ 945,
517
+ 946,
518
+ 947,
519
+ 948,
520
+ 949,
521
+ 950,
522
+ 951,
523
+ 953,
524
+ 954,
525
+ 955,
526
+ 959,
527
+ 960,
528
+ 962,
529
+ 963,
530
+ 965,
531
+ 966,
532
+ 968,
533
+ 970,
534
+ 971,
535
+ 972,
536
+ 973,
537
+ 974,
538
+ 976,
539
+ 978,
540
+ 980,
541
+ 982,
542
+ 983,
543
+ 984,
544
+ 987,
545
+ 988,
546
+ 989,
547
+ 991,
548
+ 992,
549
+ 995,
550
+ 996,
551
+ 997,
552
+ 998,
553
+ 999,
554
+ 1000,
555
+ 1001,
556
+ 1003,
557
+ 1004,
558
+ 1007,
559
+ 1008,
560
+ 1009,
561
+ 1010,
562
+ 1012,
563
+ 1013,
564
+ 1015,
565
+ 1021,
566
+ 1022,
567
+ 1023,
568
+ 1025,
569
+ 1026,
570
+ 1028,
571
+ 1029,
572
+ 1031,
573
+ 1032,
574
+ 1033,
575
+ 1034,
576
+ 1036,
577
+ 1037,
578
+ 1039,
579
+ 1041,
580
+ 1042,
581
+ 1043,
582
+ 1046,
583
+ 1048,
584
+ 1049,
585
+ 1050,
586
+ 1051,
587
+ 1052,
588
+ 1055,
589
+ 1056,
590
+ 1057,
591
+ 1058,
592
+ 1059,
593
+ 1060,
594
+ 1061,
595
+ 1062,
596
+ 1063,
597
+ 1069,
598
+ 1071,
599
+ 1072,
600
+ 1073,
601
+ 1074,
602
+ 1075,
603
+ 1076,
604
+ 1070,
605
+ 1077,
606
+ 1079,
607
+ 1080,
608
+ 1081,
609
+ 1082,
610
+ 1083,
611
+ 1084,
612
+ 1086,
613
+ 1087,
614
+ 1088,
615
+ 1092,
616
+ 1093,
617
+ 1095,
618
+ 1097,
619
+ 1099,
620
+ 1102,
621
+ 1104,
622
+ 1106,
623
+ 1108,
624
+ 1110,
625
+ 1112,
626
+ 1115,
627
+ 1117,
628
+ 1118,
629
+ 1119,
630
+ 1120,
631
+ 1121,
632
+ 1122,
633
+ 1126,
634
+ 1127,
635
+ 1129,
636
+ 1130,
637
+ 1131,
638
+ 1137,
639
+ 1145,
640
+ 1146,
641
+ 1150,
642
+ 1151,
643
+ 1152,
644
+ 1153,
645
+ 1154,
646
+ 1155,
647
+ 1156,
648
+ 1157,
649
+ 1161,
650
+ 1164,
651
+ 1165,
652
+ 1168,
653
+ 1169,
654
+ 1170,
655
+ 1172,
656
+ 1174,
657
+ 1176,
658
+ 1177,
659
+ 1178,
660
+ 1180,
661
+ 1183,
662
+ 1184,
663
+ 1185,
664
+ 1187,
665
+ 1190,
666
+ 1191,
667
+ 1193,
668
+ 1195,
669
+ 1196,
670
+ 1197,
671
+ 1200,
672
+ 1201,
673
+ 1202,
674
+ 1206,
675
+ 1207,
676
+ 1208,
677
+ 1210,
678
+ 1211,
679
+ 1212,
680
+ 1215,
681
+ 1216,
682
+ 1217,
683
+ 1218,
684
+ 1219,
685
+ 1221,
686
+ 1222,
687
+ 1223,
688
+ 1224,
689
+ 1225,
690
+ 1227,
691
+ 1228,
692
+ 1229,
693
+ 1231,
694
+ 1234,
695
+ 1236,
696
+ 1238,
697
+ 1239,
698
+ 1240,
699
+ 1241,
700
+ 1242,
701
+ 1244,
702
+ 1246,
703
+ 1249,
704
+ 1252,
705
+ 1254,
706
+ 1255,
707
+ 1256,
708
+ 1257,
709
+ 1259,
710
+ 1261,
711
+ 1262,
712
+ 1263,
713
+ 1264,
714
+ 1265,
715
+ 1267,
716
+ 1269,
717
+ 1270,
718
+ 1271,
719
+ 1272,
720
+ 1274,
721
+ 1275,
722
+ 1276,
723
+ 1277,
724
+ 1278,
725
+ 1279,
726
+ 1280,
727
+ 1282,
728
+ 1288,
729
+ 1289,
730
+ 1295,
731
+ 1297,
732
+ 1298,
733
+ 1301,
734
+ 1303,
735
+ 1304,
736
+ 1305,
737
+ 1306,
738
+ 1307,
739
+ 1309,
740
+ 1310,
741
+ 1312,
742
+ 1313,
743
+ 1315,
744
+ 1316,
745
+ 1317,
746
+ 1319,
747
+ 1320,
748
+ 1321,
749
+ 1323,
750
+ 1324,
751
+ 1326,
752
+ 1328,
753
+ 1330,
754
+ 1332,
755
+ 1334,
756
+ 1335,
757
+ 1336,
758
+ 1337,
759
+ 1339,
760
+ 1340,
761
+ 1342,
762
+ 1344,
763
+ 1345,
764
+ 1346,
765
+ 1347,
766
+ 1349,
767
+ 1350,
768
+ 1351,
769
+ 1355,
770
+ 1356,
771
+ 1357,
772
+ 1358,
773
+ 1360,
774
+ 1361,
775
+ 1362,
776
+ 1364,
777
+ 1366,
778
+ 1368,
779
+ 1369,
780
+ 1370,
781
+ 1371,
782
+ 1373,
783
+ 1374,
784
+ 1376,
785
+ 1377,
786
+ 1378,
787
+ 1380,
788
+ 1382,
789
+ 1383,
790
+ 1384,
791
+ 1388,
792
+ 1389,
793
+ 1390,
794
+ 1394,
795
+ 1395,
796
+ 1396,
797
+ 1398,
798
+ 1399,
799
+ 1400,
800
+ 1402,
801
+ 1403,
802
+ 1404,
803
+ 1406,
804
+ 1409,
805
+ 1410,
806
+ 1411,
807
+ 1413,
808
+ 1414,
809
+ 1415,
810
+ 1416,
811
+ 1417,
812
+ 1418,
813
+ 1420,
814
+ 1422,
815
+ 1425,
816
+ 1427,
817
+ 1429,
818
+ 1430,
819
+ 1431,
820
+ 1432,
821
+ 1434,
822
+ 1435,
823
+ 1437,
824
+ 1438,
825
+ 1439,
826
+ 1442,
827
+ 1443,
828
+ 1445,
829
+ 1446,
830
+ 1447,
831
+ 1448,
832
+ 1449,
833
+ 1450,
834
+ 704,
835
+ 1452,
836
+ 1454,
837
+ 1459,
838
+ 1462,
839
+ 1463,
840
+ 1465,
841
+ 1469,
842
+ 1470,
843
+ 1472,
844
+ 1474,
845
+ 1475,
846
+ 1476,
847
+ 1477,
848
+ 1481,
849
+ 1482,
850
+ 1484,
851
+ 1485,
852
+ 1486,
853
+ 1487,
854
+ 1489,
855
+ 1491,
856
+ 1493,
857
+ 1497,
858
+ 1498,
859
+ 1499,
860
+ 1502,
861
+ 1503,
862
+ 1504,
863
+ 1505,
864
+ 1507,
865
+ 1508,
866
+ 1510,
867
+ 1511,
868
+ 1513,
869
+ 1515,
870
+ 1517,
871
+ 1519,
872
+ 1520,
873
+ 1523,
874
+ 1525,
875
+ 1526,
876
+ 1528,
877
+ 1529,
878
+ 1531,
879
+ 1533,
880
+ 1535,
881
+ 1536,
882
+ 1538,
883
+ 1539,
884
+ 1541,
885
+ 1543,
886
+ 1544,
887
+ 1545,
888
+ 1546,
889
+ 1548,
890
+ 1550,
891
+ 1552,
892
+ 1553,
893
+ 1555,
894
+ 1556,
895
+ 1558,
896
+ 1559,
897
+ 1560,
898
+ 1561,
899
+ 1563,
900
+ 1564,
901
+ 1566,
902
+ 1568,
903
+ 1569,
904
+ 1570,
905
+ 1571,
906
+ 1573,
907
+ 1574,
908
+ 1576,
909
+ 1578,
910
+ 1580,
911
+ 1584,
912
+ 1587,
913
+ 1588,
914
+ 1589,
915
+ 1590,
916
+ 1592,
917
+ 1594,
918
+ 1595,
919
+ 1596,
920
+ 1598,
921
+ 1601,
922
+ 1603,
923
+ 1605,
924
+ 1607,
925
+ 1608,
926
+ 1609,
927
+ 1610,
928
+ 1611,
929
+ 1613,
930
+ 1614,
931
+ 1616,
932
+ 1618,
933
+ 1622,
934
+ 1623,
935
+ 1624,
936
+ 1626,
937
+ 1627,
938
+ 1628,
939
+ 1632,
940
+ 1633,
941
+ 1634,
942
+ 547,
943
+ 1636,
944
+ 1637,
945
+ 1640,
946
+ 1642,
947
+ 1643,
948
+ 1645,
949
+ 1646,
950
+ 1647,
951
+ 1649,
952
+ 1650,
953
+ 1651,
954
+ 1652,
955
+ 1654,
956
+ 1656,
957
+ 1657,
958
+ 1658,
959
+ 1662,
960
+ 1665,
961
+ 1666,
962
+ 1669,
963
+ 1670,
964
+ 1671,
965
+ 1672,
966
+ 1674,
967
+ 1676,
968
+ 1677,
969
+ 1679,
970
+ 1681,
971
+ 1683,
972
+ 1684,
973
+ 1686,
974
+ 1687,
975
+ 1688,
976
+ 1689,
977
+ 1690,
978
+ 1691,
979
+ 1692,
980
+ 1693,
981
+ 1694,
982
+ 1695,
983
+ 1696,
984
+ 1697,
985
+ 1699,
986
+ 1703,
987
+ 1704,
988
+ 1706,
989
+ 1707,
990
+ 1709,
991
+ 1711,
992
+ 1712,
993
+ 1713,
994
+ 1714,
995
+ 1716,
996
+ 1719,
997
+ 1720,
998
+ 1721,
999
+ 1723,
1000
+ 1724,
1001
+ 1725,
1002
+ 1726,
1003
+ 1728,
1004
+ 1730,
1005
+ 1732,
1006
+ 1734,
1007
+ 1735,
1008
+ 1736,
1009
+ 1737,
1010
+ 1739,
1011
+ 1740,
1012
+ 1741,
1013
+ 1742,
1014
+ 1746,
1015
+ 1747,
1016
+ 1748,
1017
+ 1750,
1018
+ 1753,
1019
+ 1754,
1020
+ 1755,
1021
+ 1757,
1022
+ 1759,
1023
+ 1760,
1024
+ 1762,
1025
+ 1765,
1026
+ 1767,
1027
+ 1768,
1028
+ 1770,
1029
+ 1772,
1030
+ 1773,
1031
+ 1774,
1032
+ 1775,
1033
+ 1778,
1034
+ 1780,
1035
+ 1781,
1036
+ 1783,
1037
+ 1785,
1038
+ 1787,
1039
+ 1790,
1040
+ 1791,
1041
+ 1793,
1042
+ 1796,
1043
+ 1797,
1044
+ 1800,
1045
+ 1801,
1046
+ 1803,
1047
+ 1807,
1048
+ 1810,
1049
+ 1811,
1050
+ 1812,
1051
+ 1813,
1052
+ 1819,
1053
+ 1820,
1054
+ 1821,
1055
+ 1822,
1056
+ 1823,
1057
+ 1824,
1058
+ 1826,
1059
+ 1827,
1060
+ 1828,
1061
+ 1830,
1062
+ 1831,
1063
+ 1833,
1064
+ 1834,
1065
+ 1836,
1066
+ 1837,
1067
+ 1838,
1068
+ 1839,
1069
+ 1841,
1070
+ 1845,
1071
+ 1847,
1072
+ 1848,
1073
+ 1850,
1074
+ 1852,
1075
+ 1853,
1076
+ 1855,
1077
+ 1856,
1078
+ 1858,
1079
+ 1860,
1080
+ 1861,
1081
+ 1862,
1082
+ 1864,
1083
+ 1865,
1084
+ 1866,
1085
+ 1868,
1086
+ 1872,
1087
+ 1874,
1088
+ 1876,
1089
+ 1878,
1090
+ 1880,
1091
+ 1882,
1092
+ 1883,
1093
+ 1885,
1094
+ 1887,
1095
+ 1888,
1096
+ 1892,
1097
+ 1893,
1098
+ 1894,
1099
+ 1895,
1100
+ 1896,
1101
+ 1897,
1102
+ 1898,
1103
+ 1899,
1104
+ 1905,
1105
+ 1906,
1106
+ 1908,
1107
+ 1910,
1108
+ 1912,
1109
+ 1913,
1110
+ 1914,
1111
+ 1915,
1112
+ 1918,
1113
+ 1919,
1114
+ 1920,
1115
+ 1922,
1116
+ 1923,
1117
+ 1925,
1118
+ 1926,
1119
+ 1928,
1120
+ 1929,
1121
+ 1930,
1122
+ 1932,
1123
+ 1934,
1124
+ 1935,
1125
+ 1936,
1126
+ 1937,
1127
+ 1939,
1128
+ 1940,
1129
+ 1941,
1130
+ 1947,
1131
+ 1948,
1132
+ 1949,
1133
+ 1950,
1134
+ 1951,
1135
+ 1952,
1136
+ 1953,
1137
+ 1955,
1138
+ 1956,
1139
+ 1958
1140
+ ]
1141
+ }
lemmatizer/{lookups/lookups.bin → model} RENAMED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:29d980bbecacfa6599448d2fc5a0e58900ecce80f8674ac1fb8fbdfd434fea11
3
- size 5598187
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4318f7949ea178e82db2e969ce2b6c6858c6e2350b0560c030bbe2c7f3b12ecf
3
+ size 441598
lemmatizer/trees ADDED
Binary file (186 kB). View file
meta.json CHANGED
@@ -1,14 +1,14 @@
1
  {
2
  "lang":"ro",
3
  "name":"core_news_md",
4
- "version":"3.2.0",
5
- "description":"Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"CC BY-SA 4.0",
10
- "spacy_version":">=3.2.0,<3.3.0",
11
- "spacy_git_version":"bb26550e2",
12
  "vectors":{
13
  "width":300,
14
  "vectors":20000,
@@ -546,15 +546,8 @@
546
  "vocative",
547
  "xcomp"
548
  ],
549
- "senter":[
550
- "I",
551
- "S"
552
- ],
553
  "attribute_ruler":[
554
 
555
- ],
556
- "lemmatizer":[
557
-
558
  ],
559
  "ner":[
560
  "DATETIME",
@@ -579,17 +572,17 @@
579
  "tok2vec",
580
  "tagger",
581
  "parser",
582
- "attribute_ruler",
583
  "lemmatizer",
 
584
  "ner"
585
  ],
586
  "components":[
587
  "tok2vec",
588
  "tagger",
589
  "parser",
 
590
  "senter",
591
  "attribute_ruler",
592
- "lemmatizer",
593
  "ner"
594
  ],
595
  "disabled":[
@@ -600,217 +593,212 @@
600
  "token_p":0.9967350492,
601
  "token_r":0.9957244934,
602
  "token_f":0.9959492157,
603
- "tag_acc":0.9619726156,
604
- "sents_p":0.9626168224,
605
- "sents_r":0.9587765957,
606
- "sents_f":0.9606928714,
607
- "dep_uas":0.8893350063,
608
- "dep_las":0.8388068128,
609
  "dep_las_per_type":{
610
- "case":{
611
- "p":0.9337493999,
612
- "r":0.9492435334,
613
- "f":0.9414327202
614
  },
615
- "det":{
616
- "p":0.9484425349,
617
- "r":0.966083151,
618
- "f":0.9571815718
 
 
 
 
 
619
  },
620
  "nmod:tmod":{
621
- "p":0.6666666667,
622
- "r":0.0930232558,
623
- "f":0.1632653061
624
  },
625
  "amod":{
626
- "p":0.8737690242,
627
- "r":0.8864668483,
628
- "f":0.8800721371
629
  },
630
- "cc":{
631
- "p":0.877016129,
632
- "r":0.910041841,
633
- "f":0.8932238193
634
- },
635
- "conj":{
636
- "p":0.5879699248,
637
- "r":0.5915279879,
638
- "f":0.5897435897
639
  },
640
  "nmod":{
641
- "p":0.7885679164,
642
- "r":0.8099747475,
643
- "f":0.7991279975
644
  },
645
- "mark":{
646
- "p":0.9161147903,
647
- "r":0.9222222222,
648
- "f":0.919158361
649
- },
650
- "fixed":{
651
- "p":0.8559322034,
652
- "r":0.7163120567,
653
- "f":0.7799227799
654
- },
655
- "nsubj":{
656
- "p":0.8134920635,
657
- "r":0.7824427481,
658
- "f":0.7976653696
659
  },
660
- "advcl:tcl":{
661
- "p":0.0,
662
- "r":0.0,
663
- "f":0.0
664
  },
665
  "obj":{
666
- "p":0.7793880837,
667
- "r":0.8273504274,
668
- "f":0.8026533997
669
  },
670
- "nummod":{
671
- "p":0.8892405063,
672
- "r":0.8619631902,
673
- "f":0.8753894081
674
  },
675
- "flat":{
676
- "p":0.7441860465,
677
- "r":0.6857142857,
678
- "f":0.7137546468
679
  },
680
- "obl":{
681
- "p":0.6402378593,
682
- "r":0.731596829,
683
- "f":0.6828752643
684
  },
685
- "obl:pmod":{
686
- "p":0.4375,
687
- "r":0.1615384615,
688
- "f":0.2359550562
689
  },
690
  "acl":{
691
- "p":0.7222222222,
692
- "r":0.7303370787,
693
- "f":0.7262569832
694
  },
695
  "advmod":{
696
- "p":0.8060686016,
697
- "r":0.7823303457,
698
- "f":0.7940220923
699
- },
700
- "expl:pv":{
701
- "p":0.7777777778,
702
- "r":0.8191489362,
703
- "f":0.7979274611
704
  },
705
- "root":{
706
- "p":0.9103078983,
707
- "r":0.9042553191,
708
- "f":0.9072715143
709
  },
710
- "advcl":{
711
- "p":0.5579710145,
712
- "r":0.6260162602,
713
- "f":0.5900383142
714
  },
715
- "iobj":{
716
- "p":0.7966101695,
717
- "r":0.6394557823,
718
- "f":0.7094339623
719
  },
720
- "ccomp":{
721
- "p":0.6995073892,
722
- "r":0.802259887,
723
- "f":0.7473684211
724
  },
725
- "goeswith":{
726
- "p":0.25,
727
- "r":0.1428571429,
728
- "f":0.1818181818
729
  },
730
  "parataxis":{
731
- "p":0.8494623656,
732
- "r":0.6030534351,
733
- "f":0.7053571429
734
- },
735
- "expl:poss":{
736
- "p":0.6086956522,
737
- "r":0.6511627907,
738
- "f":0.6292134831
739
  },
740
- "cop":{
741
- "p":0.75,
742
- "r":0.773006135,
743
- "f":0.7613293051
744
  },
745
- "cc:preconj":{
746
  "p":0.0,
747
  "r":0.0,
748
  "f":0.0
749
  },
750
- "aux":{
751
- "p":0.9661971831,
752
- "r":0.9122340426,
753
- "f":0.9384404925
754
  },
755
- "expl":{
756
- "p":0.5714285714,
757
- "r":0.4761904762,
758
- "f":0.5194805195
759
  },
760
- "appos":{
761
- "p":0.4691358025,
762
- "r":0.3762376238,
763
- "f":0.4175824176
764
  },
765
- "xcomp":{
766
- "p":0.5538461538,
767
- "r":0.4337349398,
768
- "f":0.4864864865
769
  },
770
- "nsubj:pass":{
771
- "p":0.5878787879,
772
- "r":0.6381578947,
773
- "f":0.6119873817
774
  },
775
  "csubj":{
776
- "p":0.8448275862,
777
- "r":0.7777777778,
778
- "f":0.8099173554
779
  },
780
  "obl:agent":{
781
- "p":0.7538461538,
782
- "r":0.7538461538,
783
- "f":0.7538461538
784
- },
785
- "aux:pass":{
786
- "p":0.7428571429,
787
- "r":0.8666666667,
788
- "f":0.8
789
- },
790
- "dep":{
791
  "p":0.0,
792
  "r":0.0,
793
  "f":0.0
794
  },
795
- "advmod:tmod":{
796
  "p":0.0,
797
  "r":0.0,
798
  "f":0.0
799
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
800
  "compound":{
801
- "p":0.5,
802
- "r":0.6666666667,
803
- "f":0.5714285714
 
 
 
 
 
804
  },
805
  "ccomp:pmod":{
806
- "p":0.5,
807
- "r":0.1875,
808
- "f":0.2727272727
809
  },
810
- "expl:pass":{
811
- "p":0.6808510638,
812
- "r":0.7032967033,
813
- "f":0.6918918919
 
 
 
 
 
814
  },
815
  "orphan":{
816
  "p":0.0,
@@ -819,242 +807,236 @@
819
  },
820
  "expl:impers":{
821
  "p":0.5,
822
- "r":0.1,
823
- "f":0.1666666667
824
  },
825
- "csubj:pass":{
826
- "p":0.6666666667,
827
- "r":0.6666666667,
828
- "f":0.6666666667
829
  },
830
- "vocative":{
831
  "p":0.0,
832
  "r":0.0,
833
  "f":0.0
834
  },
835
- "discourse":{
836
  "p":0.0,
837
  "r":0.0,
838
  "f":0.0
839
  }
840
  },
841
- "pos_acc":0.9381923087,
842
- "morph_acc":0.9469023954,
843
- "morph_micro_p":0.9870716332,
844
- "morph_micro_r":0.9558096483,
845
- "morph_micro_f":0.9683797083,
 
846
  "morph_per_feat":{
847
- "AdpType":{
848
- "p":0.9954051796,
849
- "r":0.9941593659,
850
- "f":0.9947818827
851
- },
852
  "Case":{
853
- "p":0.9873727088,
854
- "r":0.9820391627,
855
- "f":0.9846987136
856
- },
857
- "Variant":{
858
- "p":0.976744186,
859
- "r":0.9130434783,
860
- "f":0.9438202247
861
  },
862
  "Gender":{
863
- "p":0.9821478774,
864
- "r":0.9776129845,
865
- "f":0.9798751841
866
  },
867
  "Number":{
868
- "p":0.9810964083,
869
- "r":0.9438508752,
870
- "f":0.9621133125
871
  },
872
- "PronType":{
873
- "p":0.9902862986,
874
- "r":0.9872579001,
875
- "f":0.9887697805
876
- },
877
- "Definite":{
878
- "p":0.9788447388,
879
- "r":0.9734723747,
880
- "f":0.9761511649
881
  },
882
- "Degree":{
883
- "p":0.9568913175,
884
- "r":0.9347568209,
885
- "f":0.9456945695
886
  },
887
  "Polarity":{
888
- "p":0.9884318766,
889
- "r":0.9858974359,
890
- "f":0.9871630295
891
  },
892
- "Mood":{
893
- "p":0.9740072202,
894
- "r":0.9677187948,
895
- "f":0.9708528248
896
  },
897
- "Person":{
898
- "p":0.9764359352,
899
- "r":0.9696526508,
900
- "f":0.9730324711
901
  },
902
- "Tense":{
903
- "p":0.9707207207,
904
- "r":0.9563609467,
905
- "f":0.9634873323
906
  },
907
  "VerbForm":{
908
- "p":0.9714013346,
909
- "r":0.9622285175,
910
- "f":0.9667931689
 
 
 
 
 
 
 
 
 
 
911
  },
912
  "NumForm":{
913
- "p":0.9758064516,
914
- "r":0.2929782082,
915
- "f":0.4506517691
916
  },
917
  "NumType":{
918
- "p":0.9846153846,
919
- "r":0.3054892601,
920
- "f":0.4663023679
921
  },
922
- "PartType":{
923
- "p":0.9473684211,
924
- "r":0.9230769231,
925
- "f":0.9350649351
926
  },
927
  "Strength":{
928
- "p":0.9914675768,
929
- "r":0.97319933,
930
- "f":0.9822485207
931
  },
932
- "Reflex":{
933
- "p":0.9938461538,
934
- "r":0.9877675841,
935
- "f":0.990797546
936
  },
937
- "Poss":{
938
- "p":0.986013986,
939
- "r":0.986013986,
940
- "f":0.986013986
 
 
 
 
 
941
  },
942
  "Position":{
943
- "p":0.986013986,
944
- "r":0.9724137931,
945
- "f":0.9791666667
946
  },
947
  "Number[psor]":{
948
- "p":0.9420289855,
949
- "r":0.9558823529,
950
- "f":0.9489051095
 
 
 
 
 
951
  },
952
  "Foreign":{
953
  "p":0.0,
954
  "r":0.0,
955
  "f":0.0
956
- },
957
- "Abbr":{
958
- "p":0.9620253165,
959
- "r":0.9156626506,
960
- "f":0.9382716049
961
  }
962
  },
963
- "lemma_acc":0.8183070924,
964
- "ents_p":0.7485865058,
965
- "ents_r":0.7629658087,
966
- "ents_f":0.7557077626,
967
  "ents_per_type":{
968
  "DATETIME":{
969
- "p":0.0,
970
- "r":0.0,
971
- "f":0.0
972
  },
973
- "PERSON":{
974
- "p":0.0,
975
- "r":0.0,
976
- "f":0.0
977
- },
978
- "PRODUCT":{
979
- "p":0.0,
980
- "r":0.0,
981
- "f":0.0
982
  },
983
- "LOC":{
984
- "p":0.0,
985
- "r":0.0,
986
- "f":0.0
987
  },
988
- "GPE":{
989
- "p":0.0,
990
- "r":0.0,
991
- "f":0.0
992
  },
993
  "ORDINAL":{
994
- "p":0.0,
995
- "r":0.0,
996
- "f":0.0
997
  },
998
- "NUMERIC_VALUE":{
999
- "p":0.0,
1000
- "r":0.0,
1001
- "f":0.0
1002
  },
1003
- "ORGANIZATION":{
1004
- "p":0.0,
1005
- "r":0.0,
1006
- "f":0.0
 
 
 
 
 
1007
  },
1008
  "NAT_REL_POL":{
1009
- "p":0.0,
1010
- "r":0.0,
1011
- "f":0.0
1012
  },
1013
- "WORK_OF_ART":{
1014
- "p":0.0,
1015
- "r":0.0,
1016
- "f":0.0
1017
  },
1018
- "EVENT":{
1019
- "p":0.0,
1020
- "r":0.0,
1021
- "f":0.0
1022
  },
1023
- "FACILITY":{
1024
- "p":0.0,
1025
- "r":0.0,
1026
- "f":0.0
 
 
 
 
 
1027
  },
1028
  "QUANTITY":{
1029
- "p":0.0,
1030
- "r":0.0,
1031
- "f":0.0
1032
  },
1033
- "MONEY":{
1034
- "p":0.0,
1035
- "r":0.0,
1036
- "f":0.0
1037
  },
1038
  "LANGUAGE":{
1039
- "p":0.0,
1040
- "r":0.0,
1041
- "f":0.0
1042
- },
1043
- "PERIOD":{
1044
- "p":0.0,
1045
- "r":0.0,
1046
- "f":0.0
1047
  }
1048
  },
1049
- "speed":8391.5537539766
1050
  },
1051
  "sources":[
1052
- {
1053
- "name":"Lemmatization Lists",
1054
- "url":"https://github.com/michmech/lemmatization-lists/",
1055
- "license":"ODbL",
1056
- "author":"Michal M\u011bchura"
1057
- },
1058
  {
1059
  "name":"UD Romanian RRT v2.8",
1060
  "url":"https://github.com/UniversalDependencies/UD_Romanian-RRT",
1
  {
2
  "lang":"ro",
3
  "name":"core_news_md",
4
+ "version":"3.3.0",
5
+ "description":"Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, lemmatizer (trainable_lemmatizer), senter, ner, attribute_ruler.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"CC BY-SA 4.0",
10
+ "spacy_version":">=3.3.0.dev0,<3.4.0",
11
+ "spacy_git_version":"849bef2de",
12
  "vectors":{
13
  "width":300,
14
  "vectors":20000,
546
  "vocative",
547
  "xcomp"
548
  ],
 
 
 
 
549
  "attribute_ruler":[
550
 
 
 
 
551
  ],
552
  "ner":[
553
  "DATETIME",
572
  "tok2vec",
573
  "tagger",
574
  "parser",
 
575
  "lemmatizer",
576
+ "attribute_ruler",
577
  "ner"
578
  ],
579
  "components":[
580
  "tok2vec",
581
  "tagger",
582
  "parser",
583
+ "lemmatizer",
584
  "senter",
585
  "attribute_ruler",
 
586
  "ner"
587
  ],
588
  "disabled":[
593
  "token_p":0.9967350492,
594
  "token_r":0.9957244934,
595
  "token_f":0.9959492157,
596
+ "tag_acc":0.9627631502,
597
+ "sents_p":0.9639519359,
598
+ "sents_r":0.960106383,
599
+ "sents_f":0.9620253165,
600
+ "dep_uas":0.8861775868,
601
+ "dep_las":0.8312266725,
602
  "dep_las_per_type":{
603
+ "root":{
604
+ "p":0.8707360862,
605
+ "r":0.9133709981,
606
+ "f":0.8915441176
607
  },
608
+ "mark":{
609
+ "p":0.9283018868,
610
+ "r":0.9283018868,
611
+ "f":0.9283018868
612
+ },
613
+ "case":{
614
+ "p":0.9663643235,
615
+ "r":0.9587551556,
616
+ "f":0.9625447017
617
  },
618
  "nmod:tmod":{
619
+ "p":0.62,
620
+ "r":0.2605042017,
621
+ "f":0.3668639053
622
  },
623
  "amod":{
624
+ "p":0.8989813243,
625
+ "r":0.902044293,
626
+ "f":0.9005102041
627
  },
628
+ "nsubj":{
629
+ "p":0.8516129032,
630
+ "r":0.8341232227,
631
+ "f":0.8427773344
 
 
 
 
 
632
  },
633
  "nmod":{
634
+ "p":0.826055313,
635
+ "r":0.8104248483,
636
+ "f":0.8181654352
637
  },
638
+ "aux":{
639
+ "p":0.9703703704,
640
+ "r":0.957952468,
641
+ "f":0.9641214351
 
 
 
 
 
 
 
 
 
 
642
  },
643
+ "advcl":{
644
+ "p":0.5436241611,
645
+ "r":0.6090225564,
646
+ "f":0.5744680851
647
  },
648
  "obj":{
649
+ "p":0.8295454545,
650
+ "r":0.8429561201,
651
+ "f":0.8361970218
652
  },
653
+ "det":{
654
+ "p":0.9611428571,
655
+ "r":0.9524348811,
656
+ "f":0.9567690557
657
  },
658
+ "cc":{
659
+ "p":0.9329140461,
660
+ "r":0.9290187891,
661
+ "f":0.9309623431
662
  },
663
+ "conj":{
664
+ "p":0.5798742138,
665
+ "r":0.5341830823,
666
+ "f":0.5560916767
667
  },
668
+ "nummod":{
669
+ "p":0.884375,
670
+ "r":0.8788819876,
671
+ "f":0.8816199377
672
  },
673
  "acl":{
674
+ "p":0.7793696275,
675
+ "r":0.7028423773,
676
+ "f":0.7391304348
677
  },
678
  "advmod":{
679
+ "p":0.8041237113,
680
+ "r":0.8232189974,
681
+ "f":0.813559322
 
 
 
 
 
682
  },
683
+ "obl":{
684
+ "p":0.6925566343,
685
+ "r":0.8147208122,
686
+ "f":0.7486880466
687
  },
688
+ "expl:pass":{
689
+ "p":0.8367346939,
690
+ "r":0.7592592593,
691
+ "f":0.7961165049
692
  },
693
+ "nsubj:pass":{
694
+ "p":0.8116883117,
695
+ "r":0.762195122,
696
+ "f":0.786163522
697
  },
698
+ "fixed":{
699
+ "p":0.8574380165,
700
+ "r":0.8773784355,
701
+ "f":0.8672936259
702
  },
703
+ "appos":{
704
+ "p":0.4957264957,
705
+ "r":0.4427480916,
706
+ "f":0.4677419355
707
  },
708
  "parataxis":{
709
+ "p":0.2631578947,
710
+ "r":0.2857142857,
711
+ "f":0.2739726027
 
 
 
 
 
712
  },
713
+ "aux:pass":{
714
+ "p":0.9391891892,
715
+ "r":0.9266666667,
716
+ "f":0.932885906
717
  },
718
+ "nmod:agent":{
719
  "p":0.0,
720
  "r":0.0,
721
  "f":0.0
722
  },
723
+ "ccomp":{
724
+ "p":0.8852459016,
725
+ "r":0.8372093023,
726
+ "f":0.8605577689
727
  },
728
+ "nmod:pmod":{
729
+ "p":0.0,
730
+ "r":0.0,
731
+ "f":0.0
732
  },
733
+ "iobj":{
734
+ "p":0.734939759,
735
+ "r":0.7530864198,
736
+ "f":0.743902439
737
  },
738
+ "flat":{
739
+ "p":0.8031088083,
740
+ "r":0.8157894737,
741
+ "f":0.8093994778
742
  },
743
+ "cop":{
744
+ "p":0.8225806452,
745
+ "r":0.8225806452,
746
+ "f":0.8225806452
747
  },
748
  "csubj":{
749
+ "p":0.85,
750
+ "r":0.8095238095,
751
+ "f":0.8292682927
752
  },
753
  "obl:agent":{
 
 
 
 
 
 
 
 
 
 
754
  "p":0.0,
755
  "r":0.0,
756
  "f":0.0
757
  },
758
+ "obl:pmod":{
759
  "p":0.0,
760
  "r":0.0,
761
  "f":0.0
762
  },
763
+ "expl":{
764
+ "p":0.6551724138,
765
+ "r":0.7037037037,
766
+ "f":0.6785714286
767
+ },
768
+ "xcomp":{
769
+ "p":0.4137931034,
770
+ "r":0.4444444444,
771
+ "f":0.4285714286
772
+ },
773
+ "expl:pv":{
774
+ "p":0.7631578947,
775
+ "r":0.8405797101,
776
+ "f":0.8
777
+ },
778
  "compound":{
779
+ "p":0.4,
780
+ "r":0.5714285714,
781
+ "f":0.4705882353
782
+ },
783
+ "dep":{
784
+ "p":0.0,
785
+ "r":0.0,
786
+ "f":0.0
787
  },
788
  "ccomp:pmod":{
789
+ "p":0.2857142857,
790
+ "r":0.6666666667,
791
+ "f":0.4
792
  },
793
+ "expl:poss":{
794
+ "p":1.0,
795
+ "r":0.8387096774,
796
+ "f":0.9122807018
797
+ },
798
+ "goeswith":{
799
+ "p":0.0,
800
+ "r":0.0,
801
+ "f":0.0
802
  },
803
  "orphan":{
804
  "p":0.0,
807
  },
808
  "expl:impers":{
809
  "p":0.5,
810
+ "r":0.3333333333,
811
+ "f":0.4
812
  },
813
+ "cc:preconj":{
814
+ "p":0.0,
815
+ "r":0.0,
816
+ "f":0.0
817
  },
818
+ "list":{
819
  "p":0.0,
820
  "r":0.0,
821
  "f":0.0
822
  },
823
+ "csubj:pass":{
824
  "p":0.0,
825
  "r":0.0,
826
  "f":0.0
827
  }
828
  },
829
+ "lemma_acc":0.9547600199,
830
+ "pos_acc":0.938272277,
831
+ "morph_acc":0.9472820032,
832
+ "morph_micro_p":0.9871439375,
833
+ "morph_micro_r":0.9564194708,
834
+ "morph_micro_f":0.9691151644,
835
  "morph_per_feat":{
 
 
 
 
 
836
  "Case":{
837
+ "p":0.9916975348,
838
+ "r":0.9874093857,
839
+ "f":0.9895488147
 
 
 
 
 
840
  },
841
  "Gender":{
842
+ "p":0.9891837505,
843
+ "r":0.983247906,
844
+ "f":0.9862068966
845
  },
846
  "Number":{
847
+ "p":0.9870188509,
848
+ "r":0.9210997577,
849
+ "f":0.9529206626
850
  },
851
+ "Person":{
852
+ "p":0.9846878681,
853
+ "r":0.9852681202,
854
+ "f":0.9849779087
 
 
 
 
 
855
  },
856
+ "PronType":{
857
+ "p":0.9965301874,
858
+ "r":0.992398065,
859
+ "f":0.9944598338
860
  },
861
  "Polarity":{
862
+ "p":0.9869918699,
863
+ "r":0.9950819672,
864
+ "f":0.9910204082
865
  },
866
+ "AdpType":{
867
+ "p":0.9983039349,
868
+ "r":0.9959390863,
869
+ "f":0.9971201084
870
  },
871
+ "Definite":{
872
+ "p":0.9855930847,
873
+ "r":0.9773015873,
874
+ "f":0.9814298239
875
  },
876
+ "Degree":{
877
+ "p":0.9530931339,
878
+ "r":0.9415715245,
879
+ "f":0.9472972973
880
  },
881
  "VerbForm":{
882
+ "p":0.9708994709,
883
+ "r":0.9760638298,
884
+ "f":0.9734748011
885
+ },
886
+ "Abbr":{
887
+ "p":0.9797979798,
888
+ "r":0.8660714286,
889
+ "f":0.9194312796
890
+ },
891
+ "Poss":{
892
+ "p":1.0,
893
+ "r":0.9951807229,
894
+ "f":0.9975845411
895
  },
896
  "NumForm":{
897
+ "p":0.9958677686,
898
+ "r":0.3319559229,
899
+ "f":0.4979338843
900
  },
901
  "NumType":{
902
+ "p":0.9959016393,
903
+ "r":0.3337912088,
904
+ "f":0.5
905
  },
906
+ "Reflex":{
907
+ "p":1.0,
908
+ "r":0.9935897436,
909
+ "f":0.9967845659
910
  },
911
  "Strength":{
912
+ "p":0.9919678715,
913
+ "r":0.9801587302,
914
+ "f":0.9860279441
915
  },
916
+ "Mood":{
917
+ "p":0.9690909091,
918
+ "r":0.9779816514,
919
+ "f":0.9735159817
920
  },
921
+ "Tense":{
922
+ "p":0.9682080925,
923
+ "r":0.9738372093,
924
+ "f":0.9710144928
925
+ },
926
+ "Variant":{
927
+ "p":0.9932432432,
928
+ "r":0.9483870968,
929
+ "f":0.9702970297
930
  },
931
  "Position":{
932
+ "p":1.0,
933
+ "r":0.9910714286,
934
+ "f":0.9955156951
935
  },
936
  "Number[psor]":{
937
+ "p":1.0,
938
+ "r":0.9666666667,
939
+ "f":0.9830508475
940
+ },
941
+ "PartType":{
942
+ "p":1.0,
943
+ "r":0.9459459459,
944
+ "f":0.9722222222
945
  },
946
  "Foreign":{
947
  "p":0.0,
948
  "r":0.0,
949
  "f":0.0
 
 
 
 
 
950
  }
951
  },
952
+ "ents_p":0.7497185741,
953
+ "ents_r":0.767575874,
954
+ "ents_f":0.7585421412,
 
955
  "ents_per_type":{
956
  "DATETIME":{
957
+ "p":0.7823129252,
958
+ "r":0.8013937282,
959
+ "f":0.7917383821
960
  },
961
+ "ORGANIZATION":{
962
+ "p":0.6882352941,
963
+ "r":0.7452229299,
964
+ "f":0.7155963303
 
 
 
 
 
965
  },
966
+ "FACILITY":{
967
+ "p":0.5409836066,
968
+ "r":0.5038167939,
969
+ "f":0.5217391304
970
  },
971
+ "NUMERIC_VALUE":{
972
+ "p":0.9110169492,
973
+ "r":0.9110169492,
974
+ "f":0.9110169492
975
  },
976
  "ORDINAL":{
977
+ "p":0.7833333333,
978
+ "r":0.8545454545,
979
+ "f":0.8173913043
980
  },
981
+ "EVENT":{
982
+ "p":0.6060606061,
983
+ "r":0.5405405405,
984
+ "f":0.5714285714
985
  },
986
+ "GPE":{
987
+ "p":0.8362445415,
988
+ "r":0.8804597701,
989
+ "f":0.8577827548
990
+ },
991
+ "PERSON":{
992
+ "p":0.7057010786,
993
+ "r":0.7684563758,
994
+ "f":0.7357429719
995
  },
996
  "NAT_REL_POL":{
997
+ "p":0.9416058394,
998
+ "r":0.86,
999
+ "f":0.8989547038
1000
  },
1001
+ "MONEY":{
1002
+ "p":0.8888888889,
1003
+ "r":0.8275862069,
1004
+ "f":0.8571428571
1005
  },
1006
+ "PRODUCT":{
1007
+ "p":0.5338983051,
1008
+ "r":0.4598540146,
1009
+ "f":0.4941176471
1010
  },
1011
+ "LOC":{
1012
+ "p":0.5063291139,
1013
+ "r":0.5263157895,
1014
+ "f":0.5161290323
1015
+ },
1016
+ "WORK_OF_ART":{
1017
+ "p":0.3846153846,
1018
+ "r":0.2631578947,
1019
+ "f":0.3125
1020
  },
1021
  "QUANTITY":{
1022
+ "p":0.7878787879,
1023
+ "r":1.0,
1024
+ "f":0.8813559322
1025
  },
1026
+ "PERIOD":{
1027
+ "p":0.8823529412,
1028
+ "r":0.7142857143,
1029
+ "f":0.7894736842
1030
  },
1031
  "LANGUAGE":{
1032
+ "p":0.8,
1033
+ "r":1.0,
1034
+ "f":0.8888888889
 
 
 
 
 
1035
  }
1036
  },
1037
+ "speed":8965.3614636966
1038
  },
1039
  "sources":[
 
 
 
 
 
 
1040
  {
1041
  "name":"UD Romanian RRT v2.8",
1042
  "url":"https://github.com/UniversalDependencies/UD_Romanian-RRT",
ner/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:af1ed7869b088ef6c15e78e97aff53f895e24c681d9a2884ca3c3bdc67ec750d
3
- size 7104273
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1856a2f5458a33a206116943a4e5f1c122434bbb057cce80e8c1d338db482342
3
+ size 6509073
parser/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e4811f6843c371af6a4cd65e4ccf66817bf6e64fc1face9da55418f9ad73f78a
3
  size 312109
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:481906d5788929f35c553036175961b9c88a59972f0fcbeaf3c43980f60a2c4e
3
  size 312109
parser/moves CHANGED
@@ -1 +1 @@
1
- ��moves� {"0":{"":86134},"1":{"":90421},"2":{"case":22293,"punct":9078,"det":9035,"nsubj":7080,"advmod":6417,"mark":5380,"cc":5367,"aux":4002,"obl":2028,"nummod":1887,"expl:pv":1796,"cop":1712,"aux:pass":1372,"amod":1370,"nsubj:pass":1013,"expl:pass":910,"parataxis":878,"obj":868,"advcl":713,"iobj":564,"expl:poss":469,"expl":393,"nmod":203,"nsubj||csubj":155,"nmod:tmod":153,"expl:impers":102,"xcomp":97,"advmod:tmod":84,"obl:pmod":74,"cc:preconj":63,"csubj":59,"nsubj:pass||csubj":57,"obj||ccomp":45,"orphan":32,"advcl:tcl":30,"dep":0},"3":{"nmod":16696,"punct":14500,"amod":9699,"obl":7775,"conj":7286,"fixed":5485,"obj":5462,"acl":4105,"advmod":2099,"advcl":2049,"ccomp":1932,"nummod":1667,"nsubj":1280,"obl:pmod":1208,"flat":1167,"det":1035,"appos":915,"xcomp":891,"iobj":803,"obl:agent":719,"csubj":632,"nsubj:pass":554,"parataxis":435,"case":434,"nmod:tmod":283,"ccomp:pmod":178,"cc":123,"cop":100,"expl:pv":86,"goeswith":72,"expl":55,"compound":52,"advcl:tcl":52,"csubj:pass":49,"expl:poss":35,"vocative":31,"dep":0},"4":{"ROOT":8021}}�cfg��neg_key�
1
+ ��moves� {"0":{"":86376},"1":{"":90696},"2":{"case":22303,"punct":9088,"det":9042,"nsubj":7088,"advmod":6428,"mark":5425,"cc":5402,"aux":4012,"obl":2033,"nummod":1890,"expl:pv":1817,"cop":1713,"amod":1377,"aux:pass":1372,"nsubj:pass":1015,"expl:pass":912,"parataxis":895,"obj":884,"advcl":717,"iobj":581,"expl:poss":475,"expl":397,"nmod":203,"nsubj||csubj":155,"nmod:tmod":152,"expl:impers":102,"xcomp":97,"advmod:tmod":83,"obl:pmod":74,"cc:preconj":63,"csubj":59,"nsubj:pass||csubj":57,"obj||ccomp":46,"orphan":32,"advcl:tcl":30,"dep":0},"3":{"nmod":16741,"punct":14596,"amod":9707,"obl":7797,"conj":7308,"fixed":5514,"obj":5466,"acl":4105,"advmod":2102,"advcl":2057,"ccomp":1935,"nummod":1670,"nsubj":1290,"obl:pmod":1211,"flat":1170,"det":1040,"appos":917,"xcomp":893,"iobj":805,"obl:agent":718,"csubj":633,"nsubj:pass":554,"parataxis":436,"case":434,"nmod:tmod":282,"ccomp:pmod":179,"cc":124,"cop":101,"expl:pv":86,"goeswith":72,"expl":55,"compound":52,"advcl:tcl":52,"csubj:pass":49,"expl:poss":35,"vocative":33,"dep":0},"4":{"ROOT":8043}}�cfg��neg_key�
ro_core_news_md-any-py3-none-any.whl CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:76845c3f800eae6e4f52e3066ee5a28bf103bdbbe4db9b0649a72f4b8d897f98
3
- size 46220322
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a74de89ce43e999e37d3b174505e588627cd7093d25a6f4b7ffcb01ae418e0f1
3
+ size 42464778
senter/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e36ad4f74aa0333a33a5352443f9bf265d7b32fc7552167638cff69be63c1114
3
- size 219901
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e622477a61b23f1c605a05506f9ee69884a5d0e972e9a225d999c750bf359ae0
3
+ size 219953
tagger/cfg CHANGED
@@ -478,5 +478,6 @@
478
  "Yp-sr",
479
  "Yr"
480
  ],
 
481
  "overwrite":false
482
  }
478
  "Yp-sr",
479
  "Yr"
480
  ],
481
+ "neg_prefix":"!",
482
  "overwrite":false
483
  }
tagger/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f9ec3ea800ab53cbbdcee08fd3435376a8d8d4a0fa5526e02de97671736eb2e7
3
- size 185466
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4b6990f8232ec048d0554c798214fefb2364a6a4b73c03724fb71a903ae6d2a0
3
+ size 185518
tok2vec/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:bc6d2cc3f7c76dd48650cf40fc894366a14a40f1bc6cd11d112feb4d40c8c598
3
- size 6960804
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5473723565043a410429f8d14663781c0d856dd5fdba05ea8697944af838b25b
3
+ size 6365604
tokenizer CHANGED
@@ -1,3 +1,3 @@
1
- ��prefix_search�
2
  ��A�
3
- � ��A� �'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�._.��A�._.�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�1-A��A�1-A�1-UL��A�1-UL�1-Ul��A�1-Ul�1-a��A�1-a�1-ul��A�1-ul�10-A��A�10-A�10-LEA��A�10-LEA�10-Lea��A�10-Lea�10-a��A�10-a�10-lea��A�10-lea�11-A��A�11-A�11-LEA��A�11-LEA�11-Lea��A�11-Lea�11-a��A�11-a�11-lea��A�11-lea�12-A��A�12-A�12-LEA��A�12-LEA�12-Lea��A�12-Lea�12-a��A�12-a�12-lea��A�12-lea�2-A��A�2-A�2-LEA��A�2-LEA�2-Lea��A�2-Lea�2-a��A�2-a�2-lea��A�2-lea�3-A��A�3-A�3-LEA��A�3-LEA�3-Lea��A�3-Lea�3-a��A�3-a�3-lea��A�3-lea�4-A��A�4-A�4-LEA��A�4-LEA�4-Lea��A�4-Lea�4-a��A�4-a�4-lea��A�4-lea�5-A��A�5-A�5-LEA��A�5-LEA�5-Lea��A�5-Lea�5-a��A�5-a�5-lea��A�5-lea�6-A��A�6-A�6-LEA��A�6-LEA�6-Lea��A�6-Lea�6-a��A�6-a�6-lea��A�6-lea�7-A��A�7-A�7-LEA��A�7-LEA�7-Lea��A�7-Lea�7-a��A�7-a�7-lea��A�7-lea�8)��A�8)�8-)��A�8-)�8-A��A�8-A�8-D��A�8-D�8-LEA��A�8-LEA�8-Lea��A�8-Lea�8-a��A�8-a�8-lea��A�8-lea�8D��A�8D�9-A��A�9-A�9-LEA��A�9-LEA�9-Lea��A�9-Lea�9-a��A�9-a�9-lea��A�9-lea�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=[��A�=[�=]��A�=]�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o��A�>:o�><(((*>��A�><(((*>�@_@��A�@_@�A.C.��A�A.C.�A.F.��A�A.F.�A.M.��A�A.M.�A.R.��A�A.R.�AL.��A�AL.�ALIN.��A�ALIN.�ART.��A�ART.�AUG.��A�AUG.�Al.��A�Al.�Alin.��A�Alin.�Art.��A�Art.�Aug.��A�Aug.�BD.��A�BD.�Bd.��A�Bd.�C++��A�C++�D-L��A�D-L�D-LUI��A�D-LUI�D-Lui��A�D-Lui�D-NEI��A�D-NEI�D-Nei��A�D-Nei�D-VOASTRA��A�D-VOASTRA�D-VOASTRĂ��A�D-VOASTRĂ�D-Voastra��A�D-Voastra�D-Voastră��A�D-Voastră�D.P.D.V.��A�D.P.D.V.�DEM.��A�DEM.�DPDV��A�DPDV�DR.��A�DR.�DVS.��A�DVS.�Dem.��A�Dem.�Dpdv��A�Dpdv�Dr.��A�Dr.�Dvs.��A�Dvs.�ETC.��A�ETC.�EX.��A�EX.�Etc.��A�Etc.�Ex.��A�Ex.�FIG.��A�FIG.�FR.��A�FR.�Fig.��A�Fig.�Fr.��A�Fr.�GH.��A�GH.�GR.��A�GR.�Gh.��A�Gh.�Gr.��A�Gr.�IAN.��A�IAN.�ING.��A�ING.�INGR.��A�INGR.�INTR-ADEVAR��A�INTR-ADEVAR�INTR-ADEVĂR��A�INTR-ADEVĂR�Ian.��A�Ian.�Ing.��A�Ing.�Ingr.��A�Ingr.�Intr-Adevar��A�Intr-Adevar�Intr-Adevăr��A�Intr-Adevăr�LIT.��A�LIT.�LT.��A�LT.�Lit.��A�Lit.�Lt.��A�Lt.�NR.��A�NR.�Nr.��A�Nr.�O.O��A�O.O�O.o��A�O.o�OBS.��A�OBS.�O_O��A�O_O�O_o��A�O_o�Obs.��A�Obs.�P.A.��A�P.A.�P.M.��A�P.M.�PCT.��A�PCT.�PREP.��A�PREP.�PROF.��A�PROF.�Pct.��A�Pct.�Prep.��A�Prep.�Prof.��A�Prof.�ROM.��A�ROM.�Rom.��A�Rom.�S.A.��A�S.A.�S.A.M.D.��A�S.A.M.D.�SAMD.��A�SAMD.�SF.��A�SF.�ST.��A�ST.�STR.��A�STR.�Samd.��A�Samd.�Sf.��A�Sf.�St.��A�St.�Str.��A�Str.�TEL.��A�TEL.�Tel.��A�Tel.�UNIV.��A�UNIV.�Univ.��A�Univ.�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�[=��A�[=�\")��A�\")�\n��A�\n�\t��A�\t�]=��A�]=�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�a.��A�a.�a.c.��A�a.c.�a.f.��A�a.f.�a.m.��A�a.m.�a.r.��A�a.r.�al.��A�al.�alin.��A�alin.�art.��A�art.�aug.��A�aug.�b.��A�b.�bd.��A�bd.�c.��A�c.�d-l��A�d-l�d-lui��A�d-lui�d-nei��A�d-nei�d-voastra��A�d-voastra�d-voastră��A�d-voastră�d.��A�d.�d.p.d.v.��A�d.p.d.v.�dem.��A�dem.�dpdv��A�dpdv�dr.��A�dr.�dvs.��A�dvs.�e.��A�e.�etc.��A�etc.�ex.��A�ex.�f.��A�f.�fig.��A�fig.�fr.��A�fr.�g.��A�g.�gh.��A�gh.�gr.��A�gr.�h.��A�h.�i.��A�i.�ian.��A�ian.�ing.��A�ing.�ingr.��A�ingr.�intr-adevar��A�intr-adevar�intr-adevăr��A�intr-adevăr�j.��A�j.�k.��A�k.�l.��A�l.�lit.��A�lit.�lt.��A�lt.�m.��A�m.�n.��A�n.�nr.��A�nr.�o.��A�o.�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�obs.��A�obs.�p.��A�p.�p.a.��A�p.a.�p.m.��A�p.m.�pct.��A�pct.�prep.��A�prep.�prof.��A�prof.�q.��A�q.�r.��A�r.�rom.��A�rom.�s.��A�s.�s.a.��A�s.a.�s.a.m.d.��A�s.a.m.d.�samd.��A�samd.�sf.��A�sf.�st.��A�st.�str.��A�str.�t.��A�t.�tel.��A�tel.�u.��A�u.�univ.��A�univ.�v.��A�v.�v.v��A�v.v�v_v��A�v_v�w.��A�w.�x.��A�x.�xD��A�xD�xDD��A�xDD�y.��A�y.�z.��A�z.� ��A� C� �¯\(ツ)/¯��A�¯\(ツ)/¯�°C.��A�°�A�C�A�.�°F.��A�°�A�F�A�.�°K.��A�°�A�K�A�.�°c.��A�°�A�c�A�.�°f.��A�°�A�f�A�.�°k.��A�°�A�k�A�.�ÎNGR.��A�ÎNGR.�ÎNTR-ADEVAR��A�ÎNTR-ADEVAR�ÎNTR-ADEVĂR��A�ÎNTR-ADEVĂR�Îngr.��A�Îngr.�Într-Adevar��A�Într-Adevar�Într-Adevăr��A�Într-Adevăr�ä.��A�ä.�îngr.��A�îngr.�într-adevar��A�într-adevar�într-adevăr��A�într-adevăr�ö.��A�ö.�ü.��A�ü.�Ş.A.��A�Ş.A.�Ş.A.M.D.��A�Ş.A.M.D.�ŞAMD.��A�ŞAMD.�ŞT.��A�ŞT.�Şamd.��A�Şamd.�Şt.��A�Şt.�ş.a.��A�ş.a.�ş.a.m.d.��A�ş.a.m.d.�şamd.��A�şamd.�şt.��A�şt.�Ș.A.��A�Ș.A.�Ș.A.M.D.��A�Ș.A.M.D.�ȘAMD.��A�ȘAMD.�ȘT.��A�ȘT.�Șamd.��A�Șamd.�Șt.��A�Șt.�ș.a.��A�ș.a.�ș.a.m.d.��A�ș.a.m.d.�șamd.��A�șamd.�șt.��A�șt.�ಠ_ಠ��A�ಠ_ಠ�ಠ︵ಠ��A�ಠ︵ಠ�—��A�—�’��A�’�’’��A�’’
1
+ ��prefix_search�
2
  ��A�
3
+ � ��A� �'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�._.��A�._.�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�1-A��A�1-A�1-UL��A�1-UL�1-Ul��A�1-Ul�1-a��A�1-a�1-ul��A�1-ul�10-A��A�10-A�10-LEA��A�10-LEA�10-Lea��A�10-Lea�10-a��A�10-a�10-lea��A�10-lea�11-A��A�11-A�11-LEA��A�11-LEA�11-Lea��A�11-Lea�11-a��A�11-a�11-lea��A�11-lea�12-A��A�12-A�12-LEA��A�12-LEA�12-Lea��A�12-Lea�12-a��A�12-a�12-lea��A�12-lea�2-A��A�2-A�2-LEA��A�2-LEA�2-Lea��A�2-Lea�2-a��A�2-a�2-lea��A�2-lea�3-A��A�3-A�3-LEA��A�3-LEA�3-Lea��A�3-Lea�3-a��A�3-a�3-lea��A�3-lea�4-A��A�4-A�4-LEA��A�4-LEA�4-Lea��A�4-Lea�4-a��A�4-a�4-lea��A�4-lea�5-A��A�5-A�5-LEA��A�5-LEA�5-Lea��A�5-Lea�5-a��A�5-a�5-lea��A�5-lea�6-A��A�6-A�6-LEA��A�6-LEA�6-Lea��A�6-Lea�6-a��A�6-a�6-lea��A�6-lea�7-A��A�7-A�7-LEA��A�7-LEA�7-Lea��A�7-Lea�7-a��A�7-a�7-lea��A�7-lea�8)��A�8)�8-)��A�8-)�8-A��A�8-A�8-D��A�8-D�8-LEA��A�8-LEA�8-Lea��A�8-Lea�8-a��A�8-a�8-lea��A�8-lea�8D��A�8D�9-A��A�9-A�9-LEA��A�9-LEA�9-Lea��A�9-Lea�9-a��A�9-a�9-lea��A�9-lea�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=[��A�=[�=]��A�=]�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o��A�>:o�><(((*>��A�><(((*>�@_@��A�@_@�A.C.��A�A.C.�A.F.��A�A.F.�A.M.��A�A.M.�A.R.��A�A.R.�AL.��A�AL.�ALIN.��A�ALIN.�ART.��A�ART.�AUG.��A�AUG.�Al.��A�Al.�Alin.��A�Alin.�Art.��A�Art.�Aug.��A�Aug.�BD.��A�BD.�Bd.��A�Bd.�C++��A�C++�D-L��A�D-L�D-LUI��A�D-LUI�D-Lui��A�D-Lui�D-NEI��A�D-NEI�D-Nei��A�D-Nei�D-VOASTRA��A�D-VOASTRA�D-VOASTRĂ��A�D-VOASTRĂ�D-Voastra��A�D-Voastra�D-Voastră��A�D-Voastră�D.P.D.V.��A�D.P.D.V.�DEM.��A�DEM.�DPDV��A�DPDV�DR.��A�DR.�DVS.��A�DVS.�Dem.��A�Dem.�Dpdv��A�Dpdv�Dr.��A�Dr.�Dvs.��A�Dvs.�ETC.��A�ETC.�EX.��A�EX.�Etc.��A�Etc.�Ex.��A�Ex.�FIG.��A�FIG.�FR.��A�FR.�Fig.��A�Fig.�Fr.��A�Fr.�GH.��A�GH.�GR.��A�GR.�Gh.��A�Gh.�Gr.��A�Gr.�IAN.��A�IAN.�ING.��A�ING.�INGR.��A�INGR.�INTR-ADEVAR��A�INTR-ADEVAR�INTR-ADEVĂR��A�INTR-ADEVĂR�Ian.��A�Ian.�Ing.��A�Ing.�Ingr.��A�Ingr.�Intr-Adevar��A�Intr-Adevar�Intr-Adevăr��A�Intr-Adevăr�LIT.��A�LIT.�LT.��A�LT.�Lit.��A�Lit.�Lt.��A�Lt.�NR.��A�NR.�Nr.��A�Nr.�O.O��A�O.O�O.o��A�O.o�OBS.��A�OBS.�O_O��A�O_O�O_o��A�O_o�Obs.��A�Obs.�P.A.��A�P.A.�P.M.��A�P.M.�PCT.��A�PCT.�PREP.��A�PREP.�PROF.��A�PROF.�Pct.��A�Pct.�Prep.��A�Prep.�Prof.��A�Prof.�ROM.��A�ROM.�Rom.��A�Rom.�S.A.��A�S.A.�S.A.M.D.��A�S.A.M.D.�SAMD.��A�SAMD.�SF.��A�SF.�ST.��A�ST.�STR.��A�STR.�Samd.��A�Samd.�Sf.��A�Sf.�St.��A�St.�Str.��A�Str.�TEL.��A�TEL.�Tel.��A�Tel.�UNIV.��A�UNIV.�Univ.��A�Univ.�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�[=��A�[=�\")��A�\")�\n��A�\n�\t��A�\t�]=��A�]=�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�a.��A�a.�a.c.��A�a.c.�a.f.��A�a.f.�a.m.��A�a.m.�a.r.��A�a.r.�al.��A�al.�alin.��A�alin.�art.��A�art.�aug.��A�aug.�b.��A�b.�bd.��A�bd.�c.��A�c.�d-l��A�d-l�d-lui��A�d-lui�d-nei��A�d-nei�d-voastra��A�d-voastra�d-voastră��A�d-voastră�d.��A�d.�d.p.d.v.��A�d.p.d.v.�dem.��A�dem.�dpdv��A�dpdv�dr.��A�dr.�dvs.��A�dvs.�e.��A�e.�etc.��A�etc.�ex.��A�ex.�f.��A�f.�fig.��A�fig.�fr.��A�fr.�g.��A�g.�gh.��A�gh.�gr.��A�gr.�h.��A�h.�i.��A�i.�ian.��A�ian.�ing.��A�ing.�ingr.��A�ingr.�intr-adevar��A�intr-adevar�intr-adevăr��A�intr-adevăr�j.��A�j.�k.��A�k.�l.��A�l.�lit.��A�lit.�lt.��A�lt.�m.��A�m.�n.��A�n.�nr.��A�nr.�o.��A�o.�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�obs.��A�obs.�p.��A�p.�p.a.��A�p.a.�p.m.��A�p.m.�pct.��A�pct.�prep.��A�prep.�prof.��A�prof.�q.��A�q.�r.��A�r.�rom.��A�rom.�s.��A�s.�s.a.��A�s.a.�s.a.m.d.��A�s.a.m.d.�samd.��A�samd.�sf.��A�sf.�st.��A�st.�str.��A�str.�t.��A�t.�tel.��A�tel.�u.��A�u.�univ.��A�univ.�v.��A�v.�v.v��A�v.v�v_v��A�v_v�w.��A�w.�x.��A�x.�xD��A�xD�xDD��A�xDD�y.��A�y.�z.��A�z.� ��A� C� �¯\(ツ)/¯��A�¯\(ツ)/¯�°C.��A�°�A�C�A�.�°F.��A�°�A�F�A�.�°K.��A�°�A�K�A�.�°c.��A�°�A�c�A�.�°f.��A�°�A�f�A�.�°k.��A�°�A�k�A�.�ÎNGR.��A�ÎNGR.�ÎNTR-ADEVAR��A�ÎNTR-ADEVAR�ÎNTR-ADEVĂR��A�ÎNTR-ADEVĂR�Îngr.��A�Îngr.�Într-Adevar��A�Într-Adevar�Într-Adevăr��A�Într-Adevăr�ä.��A�ä.�îngr.��A�îngr.�într-adevar��A�într-adevar�într-adevăr��A�într-adevăr�ö.��A�ö.�ü.��A�ü.�Ş.A.��A�Ş.A.�Ş.A.M.D.��A�Ş.A.M.D.�ŞAMD.��A�ŞAMD.�ŞT.��A�ŞT.�Şamd.��A�Şamd.�Şt.��A�Şt.�ş.a.��A�ş.a.�ş.a.m.d.��A�ş.a.m.d.�şamd.��A�şamd.�şt.��A�şt.�Ș.A.��A�Ș.A.�Ș.A.M.D.��A�Ș.A.M.D.�ȘAMD.��A�ȘAMD.�ȘT.��A�ȘT.�Șamd.��A�Șamd.�Șt.��A�Șt.�ș.a.��A�ș.a.�ș.a.m.d.��A�ș.a.m.d.�șamd.��A�șamd.�șt.��A�șt.�ಠ_ಠ��A�ಠ_ಠ�ಠ︵ಠ��A�ಠ︵ಠ�—��A�—�’��A�’�’’��A�’’�faster_heuristics�
vocab/strings.json CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4534edb1d1b8e8017538d692a57054e6179b5b351805c50502b2f0ef77b79ec7
3
- size 10070837
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:28c8c63ae8c2e0f7c605f24168d34a72a8c2ef313589da6374324ba9d3863ae5
3
+ size 10103432