EC2 Default User commited on
Commit
48119ad
1 Parent(s): 7e25a4f

Update spaCy pipeline

Browse files
LICENSES_SOURCES CHANGED
@@ -378,6 +378,8 @@ Creative Commons Notice
378
  * License: CC BY 4.0
379
 
380
  ```
 
 
381
  By exercising the Licensed Rights (defined below), You accept and agree to be bound by the terms and conditions of this Creative Commons Attribution 4.0 International Public License ("Public License"). To the extent this Public License may be interpreted as a contract, You are granted the Licensed Rights in consideration of Your acceptance of these terms and conditions, and the Licensor grants You such rights in consideration of benefits the Licensor receives from making the Licensed Material available under these terms and conditions.
382
 
383
  Section 1 – Definitions.
@@ -467,557 +469,6 @@ Nothing in this Public License constitutes or may be interpreted as a limitation
467
 
468
 
469
 
470
- # Lemmatization Lists
471
-
472
- * Author: Michal Měchura
473
- * URL: https://github.com/michmech/lemmatization-lists/
474
- * License: ODbL
475
-
476
- ```
477
- ## ODC Open Database License (ODbL)
478
-
479
- ### Preamble
480
-
481
- The Open Database License (ODbL) is a license agreement intended to
482
- allow users to freely share, modify, and use this Database while
483
- maintaining this same freedom for others. Many databases are covered by
484
- copyright, and therefore this document licenses these rights. Some
485
- jurisdictions, mainly in the European Union, have specific rights that
486
- cover databases, and so the ODbL addresses these rights, too. Finally,
487
- the ODbL is also an agreement in contract for users of this Database to
488
- act in certain ways in return for accessing this Database.
489
-
490
- Databases can contain a wide variety of types of content (images,
491
- audiovisual material, and sounds all in the same database, for example),
492
- and so the ODbL only governs the rights over the Database, and not the
493
- contents of the Database individually. Licensors should use the ODbL
494
- together with another license for the contents, if the contents have a
495
- single set of rights that uniformly covers all of the contents. If the
496
- contents have multiple sets of different rights, Licensors should
497
- describe what rights govern what contents together in the individual
498
- record or in some other way that clarifies what rights apply.
499
-
500
- Sometimes the contents of a database, or the database itself, can be
501
- covered by other rights not addressed here (such as private contracts,
502
- trade mark over the name, or privacy rights / data protection rights
503
- over information in the contents), and so you are advised that you may
504
- have to consult other documents or clear other rights before doing
505
- activities not covered by this License.
506
-
507
- ------
508
-
509
- The Licensor (as defined below)
510
-
511
- and
512
-
513
- You (as defined below)
514
-
515
- agree as follows:
516
-
517
- ### 1.0 Definitions of Capitalised Words
518
-
519
- "Collective Database" – Means this Database in unmodified form as part
520
- of a collection of independent databases in themselves that together are
521
- assembled into a collective whole. A work that constitutes a Collective
522
- Database will not be considered a Derivative Database.
523
-
524
- "Convey" – As a verb, means Using the Database, a Derivative Database,
525
- or the Database as part of a Collective Database in any way that enables
526
- a Person to make or receive copies of the Database or a Derivative
527
- Database. Conveying does not include interaction with a user through a
528
- computer network, or creating and Using a Produced Work, where no
529
- transfer of a copy of the Database or a Derivative Database occurs.
530
- "Contents" – The contents of this Database, which includes the
531
- information, independent works, or other material collected into the
532
- Database. For example, the contents of the Database could be factual
533
- data or works such as images, audiovisual material, text, or sounds.
534
-
535
- "Database" – A collection of material (the Contents) arranged in a
536
- systematic or methodical way and individually accessible by electronic
537
- or other means offered under the terms of this License.
538
-
539
- "Database Directive" – Means Directive 96/9/EC of the European
540
- Parliament and of the Council of 11 March 1996 on the legal protection
541
- of databases, as amended or succeeded.
542
-
543
- "Database Right" – Means rights resulting from the Chapter III ("sui
544
- generis") rights in the Database Directive (as amended and as transposed
545
- by member states), which includes the Extraction and Re-utilisation of
546
- the whole or a Substantial part of the Contents, as well as any similar
547
- rights available in the relevant jurisdiction under Section 10.4.
548
-
549
- "Derivative Database" – Means a database based upon the Database, and
550
- includes any translation, adaptation, arrangement, modification, or any
551
- other alteration of the Database or of a Substantial part of the
552
- Contents. This includes, but is not limited to, Extracting or
553
- Re-utilising the whole or a Substantial part of the Contents in a new
554
- Database.
555
-
556
- "Extraction" – Means the permanent or temporary transfer of all or a
557
- Substantial part of the Contents to another medium by any means or in
558
- any form.
559
-
560
- "License" – Means this license agreement and is both a license of rights
561
- such as copyright and Database Rights and an agreement in contract.
562
-
563
- "Licensor" – Means the Person that offers the Database under the terms
564
- of this License.
565
-
566
- "Person" – Means a natural or legal person or a body of persons
567
- corporate or incorporate.
568
-
569
- "Produced Work" – a work (such as an image, audiovisual material, text,
570
- or sounds) resulting from using the whole or a Substantial part of the
571
- Contents (via a search or other query) from this Database, a Derivative
572
- Database, or this Database as part of a Collective Database.
573
-
574
- "Publicly" – means to Persons other than You or under Your control by
575
- either more than 50% ownership or by the power to direct their
576
- activities (such as contracting with an independent consultant).
577
-
578
- "Re-utilisation" – means any form of making available to the public all
579
- or a Substantial part of the Contents by the distribution of copies, by
580
- renting, by online or other forms of transmission.
581
-
582
- "Substantial" – Means substantial in terms of quantity or quality or a
583
- combination of both. The repeated and systematic Extraction or
584
- Re-utilisation of insubstantial parts of the Contents may amount to the
585
- Extraction or Re-utilisation of a Substantial part of the Contents.
586
-
587
- "Use" – As a verb, means doing any act that is restricted by copyright
588
- or Database Rights whether in the original medium or any other; and
589
- includes without limitation distributing, copying, publicly performing,
590
- publicly displaying, and preparing derivative works of the Database, as
591
- well as modifying the Database as may be technically necessary to use it
592
- in a different mode or format.
593
-
594
- "You" – Means a Person exercising rights under this License who has not
595
- previously violated the terms of this License with respect to the
596
- Database, or who has received express permission from the Licensor to
597
- exercise rights under this License despite a previous violation.
598
-
599
- Words in the singular include the plural and vice versa.
600
-
601
- ### 2.0 What this License covers
602
-
603
- 2.1. Legal effect of this document. This License is:
604
-
605
- a. A license of applicable copyright and neighbouring rights;
606
-
607
- b. A license of the Database Right; and
608
-
609
- c. An agreement in contract between You and the Licensor.
610
-
611
- 2.2 Legal rights covered. This License covers the legal rights in the
612
- Database, including:
613
-
614
- a. Copyright. Any copyright or neighbouring rights in the Database.
615
- The copyright licensed includes any individual elements of the
616
- Database, but does not cover the copyright over the Contents
617
- independent of this Database. See Section 2.4 for details. Copyright
618
- law varies between jurisdictions, but is likely to cover: the Database
619
- model or schema, which is the structure, arrangement, and organisation
620
- of the Database, and can also include the Database tables and table
621
- indexes; the data entry and output sheets; and the Field names of
622
- Contents stored in the Database;
623
-
624
- b. Database Rights. Database Rights only extend to the Extraction and
625
- Re-utilisation of the whole or a Substantial part of the Contents.
626
- Database Rights can apply even when there is no copyright over the
627
- Database. Database Rights can also apply when the Contents are removed
628
- from the Database and are selected and arranged in a way that would
629
- not infringe any applicable copyright; and
630
-
631
- c. Contract. This is an agreement between You and the Licensor for
632
- access to the Database. In return you agree to certain conditions of
633
- use on this access as outlined in this License.
634
-
635
- 2.3 Rights not covered.
636
-
637
- a. This License does not apply to computer programs used in the making
638
- or operation of the Database;
639
-
640
- b. This License does not cover any patents over the Contents or the
641
- Database; and
642
-
643
- c. This License does not cover any trademarks associated with the
644
- Database.
645
-
646
- 2.4 Relationship to Contents in the Database. The individual items of
647
- the Contents contained in this Database may be covered by other rights,
648
- including copyright, patent, data protection, privacy, or personality
649
- rights, and this License does not cover any rights (other than Database
650
- Rights or in contract) in individual Contents contained in the Database.
651
- For example, if used on a Database of images (the Contents), this
652
- License would not apply to copyright over individual images, which could
653
- have their own separate licenses, or one single license covering all of
654
- the rights over the images.
655
-
656
- ### 3.0 Rights granted
657
-
658
- 3.1 Subject to the terms and conditions of this License, the Licensor
659
- grants to You a worldwide, royalty-free, non-exclusive, terminable (but
660
- only under Section 9) license to Use the Database for the duration of
661
- any applicable copyright and Database Rights. These rights explicitly
662
- include commercial use, and do not exclude any field of endeavour. To
663
- the extent possible in the relevant jurisdiction, these rights may be
664
- exercised in all media and formats whether now known or created in the
665
- future.
666
-
667
- The rights granted cover, for example:
668
-
669
- a. Extraction and Re-utilisation of the whole or a Substantial part of
670
- the Contents;
671
-
672
- b. Creation of Derivative Databases;
673
-
674
- c. Creation of Collective Databases;
675
-
676
- d. Creation of temporary or permanent reproductions by any means and
677
- in any form, in whole or in part, including of any Derivative
678
- Databases or as a part of Collective Databases; and
679
-
680
- e. Distribution, communication, display, lending, making available, or
681
- performance to the public by any means and in any form, in whole or in
682
- part, including of any Derivative Database or as a part of Collective
683
- Databases.
684
-
685
- 3.2 Compulsory license schemes. For the avoidance of doubt:
686
-
687
- a. Non-waivable compulsory license schemes. In those jurisdictions in
688
- which the right to collect royalties through any statutory or
689
- compulsory licensing scheme cannot be waived, the Licensor reserves
690
- the exclusive right to collect such royalties for any exercise by You
691
- of the rights granted under this License;
692
-
693
- b. Waivable compulsory license schemes. In those jurisdictions in
694
- which the right to collect royalties through any statutory or
695
- compulsory licensing scheme can be waived, the Licensor waives the
696
- exclusive right to collect such royalties for any exercise by You of
697
- the rights granted under this License; and,
698
-
699
- c. Voluntary license schemes. The Licensor waives the right to collect
700
- royalties, whether individually or, in the event that the Licensor is
701
- a member of a collecting society that administers voluntary licensing
702
- schemes, via that society, from any exercise by You of the rights
703
- granted under this License.
704
-
705
- 3.3 The right to release the Database under different terms, or to stop
706
- distributing or making available the Database, is reserved. Note that
707
- this Database may be multiple-licensed, and so You may have the choice
708
- of using alternative licenses for this Database. Subject to Section
709
- 10.4, all other rights not expressly granted by Licensor are reserved.
710
-
711
- ### 4.0 Conditions of Use
712
-
713
- 4.1 The rights granted in Section 3 above are expressly made subject to
714
- Your complying with the following conditions of use. These are important
715
- conditions of this License, and if You fail to follow them, You will be
716
- in material breach of its terms.
717
-
718
- 4.2 Notices. If You Publicly Convey this Database, any Derivative
719
- Database, or the Database as part of a Collective Database, then You
720
- must:
721
-
722
- a. Do so only under the terms of this License or another license
723
- permitted under Section 4.4;
724
-
725
- b. Include a copy of this License (or, as applicable, a license
726
- permitted under Section 4.4) or its Uniform Resource Identifier (URI)
727
- with the Database or Derivative Database, including both in the
728
- Database or Derivative Database and in any relevant documentation; and
729
-
730
- c. Keep intact any copyright or Database Right notices and notices
731
- that refer to this License.
732
-
733
- d. If it is not possible to put the required notices in a particular
734
- file due to its structure, then You must include the notices in a
735
- location (such as a relevant directory) where users would be likely to
736
- look for it.
737
-
738
- 4.3 Notice for using output (Contents). Creating and Using a Produced
739
- Work does not require the notice in Section 4.2. However, if you
740
- Publicly Use a Produced Work, You must include a notice associated with
741
- the Produced Work reasonably calculated to make any Person that uses,
742
- views, accesses, interacts with, or is otherwise exposed to the Produced
743
- Work aware that Content was obtained from the Database, Derivative
744
- Database, or the Database as part of a Collective Database, and that it
745
- is available under this License.
746
-
747
- a. Example notice. The following text will satisfy notice under
748
- Section 4.3:
749
-
750
- Contains information from DATABASE NAME, which is made available
751
- here under the Open Database License (ODbL).
752
-
753
- DATABASE NAME should be replaced with the name of the Database and a
754
- hyperlink to the URI of the Database. "Open Database License" should
755
- contain a hyperlink to the URI of the text of this License. If
756
- hyperlinks are not possible, You should include the plain text of the
757
- required URI's with the above notice.
758
-
759
- 4.4 Share alike.
760
-
761
- a. Any Derivative Database that You Publicly Use must be only under
762
- the terms of:
763
-
764
- i. This License;
765
-
766
- ii. A later version of this License similar in spirit to this
767
- License; or
768
-
769
- iii. A compatible license.
770
-
771
- If You license the Derivative Database under one of the licenses
772
- mentioned in (iii), You must comply with the terms of that license.
773
-
774
- b. For the avoidance of doubt, Extraction or Re-utilisation of the
775
- whole or a Substantial part of the Contents into a new database is a
776
- Derivative Database and must comply with Section 4.4.
777
-
778
- c. Derivative Databases and Produced Works. A Derivative Database is
779
- Publicly Used and so must comply with Section 4.4. if a Produced Work
780
- created from the Derivative Database is Publicly Used.
781
-
782
- d. Share Alike and additional Contents. For the avoidance of doubt,
783
- You must not add Contents to Derivative Databases under Section 4.4 a
784
- that are incompatible with the rights granted under this License.
785
-
786
- e. Compatible licenses. Licensors may authorise a proxy to determine
787
- compatible licenses under Section 4.4 a iii. If they do so, the
788
- authorised proxy's public statement of acceptance of a compatible
789
- license grants You permission to use the compatible license.
790
-
791
-
792
- 4.5 Limits of Share Alike. The requirements of Section 4.4 do not apply
793
- in the following:
794
-
795
- a. For the avoidance of doubt, You are not required to license
796
- Collective Databases under this License if You incorporate this
797
- Database or a Derivative Database in the collection, but this License
798
- still applies to this Database or a Derivative Database as a part of
799
- the Collective Database;
800
-
801
- b. Using this Database, a Derivative Database, or this Database as
802
- part of a Collective Database to create a Produced Work does not
803
- create a Derivative Database for purposes of Section 4.4; and
804
-
805
- c. Use of a Derivative Database internally within an organisation is
806
- not to the public and therefore does not fall under the requirements
807
- of Section 4.4.
808
-
809
- 4.6 Access to Derivative Databases. If You Publicly Use a Derivative
810
- Database or a Produced Work from a Derivative Database, You must also
811
- offer to recipients of the Derivative Database or Produced Work a copy
812
- in a machine readable form of:
813
-
814
- a. The entire Derivative Database; or
815
-
816
- b. A file containing all of the alterations made to the Database or
817
- the method of making the alterations to the Database (such as an
818
- algorithm), including any additional Contents, that make up all the
819
- differences between the Database and the Derivative Database.
820
-
821
- The Derivative Database (under a.) or alteration file (under b.) must be
822
- available at no more than a reasonable production cost for physical
823
- distributions and free of charge if distributed over the internet.
824
-
825
- 4.7 Technological measures and additional terms
826
-
827
- a. This License does not allow You to impose (except subject to
828
- Section 4.7 b.) any terms or any technological measures on the
829
- Database, a Derivative Database, or the whole or a Substantial part of
830
- the Contents that alter or restrict the terms of this License, or any
831
- rights granted under it, or have the effect or intent of restricting
832
- the ability of any person to exercise those rights.
833
-
834
- b. Parallel distribution. You may impose terms or technological
835
- measures on the Database, a Derivative Database, or the whole or a
836
- Substantial part of the Contents (a "Restricted Database") in
837
- contravention of Section 4.74 a. only if You also make a copy of the
838
- Database or a Derivative Database available to the recipient of the
839
- Restricted Database:
840
-
841
- i. That is available without additional fee;
842
-
843
- ii. That is available in a medium that does not alter or restrict
844
- the terms of this License, or any rights granted under it, or have
845
- the effect or intent of restricting the ability of any person to
846
- exercise those rights (an "Unrestricted Database"); and
847
-
848
- iii. The Unrestricted Database is at least as accessible to the
849
- recipient as a practical matter as the Restricted Database.
850
-
851
- c. For the avoidance of doubt, You may place this Database or a
852
- Derivative Database in an authenticated environment, behind a
853
- password, or within a similar access control scheme provided that You
854
- do not alter or restrict the terms of this License or any rights
855
- granted under it or have the effect or intent of restricting the
856
- ability of any person to exercise those rights.
857
-
858
- 4.8 Licensing of others. You may not sublicense the Database. Each time
859
- You communicate the Database, the whole or Substantial part of the
860
- Contents, or any Derivative Database to anyone else in any way, the
861
- Licensor offers to the recipient a license to the Database on the same
862
- terms and conditions as this License. You are not responsible for
863
- enforcing compliance by third parties with this License, but You may
864
- enforce any rights that You have over a Derivative Database. You are
865
- solely responsible for any modifications of a Derivative Database made
866
- by You or another Person at Your direction. You may not impose any
867
- further restrictions on the exercise of the rights granted or affirmed
868
- under this License.
869
-
870
- ### 5.0 Moral rights
871
-
872
- 5.1 Moral rights. This section covers moral rights, including any rights
873
- to be identified as the author of the Database or to object to treatment
874
- that would otherwise prejudice the author's honour and reputation, or
875
- any other derogatory treatment:
876
-
877
- a. For jurisdictions allowing waiver of moral rights, Licensor waives
878
- all moral rights that Licensor may have in the Database to the fullest
879
- extent possible by the law of the relevant jurisdiction under Section
880
- 10.4;
881
-
882
- b. If waiver of moral rights under Section 5.1 a in the relevant
883
- jurisdiction is not possible, Licensor agrees not to assert any moral
884
- rights over the Database and waives all claims in moral rights to the
885
- fullest extent possible by the law of the relevant jurisdiction under
886
- Section 10.4; and
887
-
888
- c. For jurisdictions not allowing waiver or an agreement not to assert
889
- moral rights under Section 5.1 a and b, the author may retain their
890
- moral rights over certain aspects of the Database.
891
-
892
- Please note that some jurisdictions do not allow for the waiver of moral
893
- rights, and so moral rights may still subsist over the Database in some
894
- jurisdictions.
895
-
896
- ### 6.0 Fair dealing, Database exceptions, and other rights not affected
897
-
898
- 6.1 This License does not affect any rights that You or anyone else may
899
- independently have under any applicable law to make any use of this
900
- Database, including without limitation:
901
-
902
- a. Exceptions to the Database Right including: Extraction of Contents
903
- from non-electronic Databases for private purposes, Extraction for
904
- purposes of illustration for teaching or scientific research, and
905
- Extraction or Re-utilisation for public security or an administrative
906
- or judicial procedure.
907
-
908
- b. Fair dealing, fair use, or any other legally recognised limitation
909
- or exception to infringement of copyright or other applicable laws.
910
-
911
- 6.2 This License does not affect any rights of lawful users to Extract
912
- and Re-utilise insubstantial parts of the Contents, evaluated
913
- quantitatively or qualitatively, for any purposes whatsoever, including
914
- creating a Derivative Database (subject to other rights over the
915
- Contents, see Section 2.4). The repeated and systematic Extraction or
916
- Re-utilisation of insubstantial parts of the Contents may however amount
917
- to the Extraction or Re-utilisation of a Substantial part of the
918
- Contents.
919
-
920
- ### 7.0 Warranties and Disclaimer
921
-
922
- 7.1 The Database is licensed by the Licensor "as is" and without any
923
- warranty of any kind, either express, implied, or arising by statute,
924
- custom, course of dealing, or trade usage. Licensor specifically
925
- disclaims any and all implied warranties or conditions of title,
926
- non-infringement, accuracy or completeness, the presence or absence of
927
- errors, fitness for a particular purpose, merchantability, or otherwise.
928
- Some jurisdictions do not allow the exclusion of implied warranties, so
929
- this exclusion may not apply to You.
930
-
931
- ### 8.0 Limitation of liability
932
-
933
- 8.1 Subject to any liability that may not be excluded or limited by law,
934
- the Licensor is not liable for, and expressly excludes, all liability
935
- for loss or damage however and whenever caused to anyone by any use
936
- under this License, whether by You or by anyone else, and whether caused
937
- by any fault on the part of the Licensor or not. This exclusion of
938
- liability includes, but is not limited to, any special, incidental,
939
- consequential, punitive, or exemplary damages such as loss of revenue,
940
- data, anticipated profits, and lost business. This exclusion applies
941
- even if the Licensor has been advised of the possibility of such
942
- damages.
943
-
944
- 8.2 If liability may not be excluded by law, it is limited to actual and
945
- direct financial loss to the extent it is caused by proved negligence on
946
- the part of the Licensor.
947
-
948
- ### 9.0 Termination of Your rights under this License
949
-
950
- 9.1 Any breach by You of the terms and conditions of this License
951
- automatically terminates this License with immediate effect and without
952
- notice to You. For the avoidance of doubt, Persons who have received the
953
- Database, the whole or a Substantial part of the Contents, Derivative
954
- Databases, or the Database as part of a Collective Database from You
955
- under this License will not have their licenses terminated provided
956
- their use is in full compliance with this License or a license granted
957
- under Section 4.8 of this License. Sections 1, 2, 7, 8, 9 and 10 will
958
- survive any termination of this License.
959
-
960
- 9.2 If You are not in breach of the terms of this License, the Licensor
961
- will not terminate Your rights under it.
962
-
963
- 9.3 Unless terminated under Section 9.1, this License is granted to You
964
- for the duration of applicable rights in the Database.
965
-
966
- 9.4 Reinstatement of rights. If you cease any breach of the terms and
967
- conditions of this License, then your full rights under this License
968
- will be reinstated:
969
-
970
- a. Provisionally and subject to permanent termination until the 60th
971
- day after cessation of breach;
972
-
973
- b. Permanently on the 60th day after cessation of breach unless
974
- otherwise reasonably notified by the Licensor; or
975
-
976
- c. Permanently if reasonably notified by the Licensor of the
977
- violation, this is the first time You have received notice of
978
- violation of this License from the Licensor, and You cure the
979
- violation prior to 30 days after your receipt of the notice.
980
-
981
- Persons subject to permanent termination of rights are not eligible to
982
- be a recipient and receive a license under Section 4.8.
983
-
984
- 9.5 Notwithstanding the above, Licensor reserves the right to release
985
- the Database under different license terms or to stop distributing or
986
- making available the Database. Releasing the Database under different
987
- license terms or stopping the distribution of the Database will not
988
- withdraw this License (or any other license that has been, or is
989
- required to be, granted under the terms of this License), and this
990
- License will continue in full force and effect unless terminated as
991
- stated above.
992
-
993
- ### 10.0 General
994
-
995
- 10.1 If any provision of this License is held to be invalid or
996
- unenforceable, that must not affect the validity or enforceability of
997
- the remainder of the terms and conditions of this License and each
998
- remaining provision of this License shall be valid and enforced to the
999
- fullest extent permitted by law.
1000
-
1001
- 10.2 This License is the entire agreement between the parties with
1002
- respect to the rights granted here over the Database. It replaces any
1003
- earlier understandings, agreements or representations with respect to
1004
- the Database.
1005
-
1006
- 10.3 If You are in breach of the terms of this License, You will not be
1007
- entitled to rely on the terms of this License or to complain of any
1008
- breach by the Licensor.
1009
-
1010
- 10.4 Choice of law. This License takes effect in and will be governed by
1011
- the laws of the relevant jurisdiction in which the License terms are
1012
- sought to be enforced. If the standard suite of rights granted under
1013
- applicable copyright law and Database Rights in the relevant
1014
- jurisdiction includes additional rights not granted under this License,
1015
- these additional rights are granted in this License in order to meet the
1016
- terms of this License.```
1017
-
1018
-
1019
-
1020
-
1021
  # Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)
1022
 
1023
  * Author: Explosion
378
  * License: CC BY 4.0
379
 
380
  ```
381
+ Creative Commons Attribution 4.0 International Public License
382
+
383
  By exercising the Licensed Rights (defined below), You accept and agree to be bound by the terms and conditions of this Creative Commons Attribution 4.0 International Public License ("Public License"). To the extent this Public License may be interpreted as a contract, You are granted the Licensed Rights in consideration of Your acceptance of these terms and conditions, and the Licensor grants You such rights in consideration of benefits the Licensor receives from making the Licensed Material available under these terms and conditions.
384
 
385
  Section 1 – Definitions.
469
 
470
 
471
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
472
  # Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)
473
 
474
  * Author: Explosion
README.md CHANGED
@@ -14,61 +14,76 @@ model-index:
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
- value: 0.8753707462
18
  - name: NER Recall
19
  type: recall
20
- value: 0.8744493392
21
  - name: NER F Score
22
  type: f_score
23
- value: 0.8749098001
 
 
 
 
 
 
 
24
  - task:
25
  name: POS
26
  type: token-classification
27
  metrics:
28
- - name: POS Accuracy
29
  type: accuracy
30
- value: 0.9707455175
31
  - task:
32
- name: SENTER
33
  type: token-classification
34
  metrics:
35
- - name: SENTER Precision
36
- type: precision
37
- value: 0.9685314685
38
- - name: SENTER Recall
39
- type: recall
40
- value: 0.9822695035
41
- - name: SENTER F Score
42
- type: f_score
43
- value: 0.9753521127
44
  - task:
45
- name: UNLABELED_DEPENDENCIES
46
  type: token-classification
47
  metrics:
48
- - name: Unlabeled Dependencies Accuracy
49
  type: accuracy
50
- value: 0.9097643791
 
 
 
 
 
 
 
51
  - task:
52
  name: LABELED_DEPENDENCIES
53
  type: token-classification
54
  metrics:
55
- - name: Labeled Dependencies Accuracy
56
- type: accuracy
57
- value: 0.9097643791
 
 
 
 
 
 
 
58
  ---
59
  ### Details: https://spacy.io/models/it#it_core_news_md
60
 
61
- Italian pipeline optimized for CPU. Components: tok2vec, morphologizer, tagger, parser, senter, ner, attribute_ruler, lemmatizer.
62
 
63
  | Feature | Description |
64
  | --- | --- |
65
  | **Name** | `it_core_news_md` |
66
- | **Version** | `3.2.0` |
67
- | **spaCy** | `>=3.2.0,<3.3.0` |
68
- | **Default Pipeline** | `tok2vec`, `morphologizer`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
69
- | **Components** | `tok2vec`, `morphologizer`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
70
  | **Vectors** | 500000 keys, 20000 unique vectors (300 dimensions) |
71
- | **Sources** | [UD Italian ISDT v2.8](https://github.com/UniversalDependencies/UD_Italian-ISDT) (Bosco, Cristina; Lenci, Alessandro; Montemagni, Simonetta; Simi, Maria)<br />[WikiNER](https://figshare.com/articles/Learning_multilingual_named_entity_recognition_from_Wikipedia/5462500) (Joel Nothman, Nicky Ringland, Will Radford, Tara Murphy, James R Curran)<br />[Lemmatization Lists](https://github.com/michmech/lemmatization-lists/) (Michal Měchura)<br />[Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)](https://spacy.io) (Explosion) |
72
  | **License** | `CC BY-NC-SA 3.0` |
73
  | **Author** | [Explosion](https://explosion.ai) |
74
 
@@ -76,14 +91,13 @@ Italian pipeline optimized for CPU. Components: tok2vec, morphologizer, tagger,
76
 
77
  <details>
78
 
79
- <summary>View label scheme (443 labels for 5 components)</summary>
80
 
81
  | Component | Labels |
82
  | --- | --- |
83
  | **`morphologizer`** | `POS=PROPN`, `POS=PUNCT`, `Gender=Masc\|POS=NOUN`, `Definite=Def\|Gender=Fem\|Number=Sing\|POS=ADP\|PronType=Art`, `Gender=Fem\|Number=Sing\|POS=NOUN`, `Gender=Masc\|Number=Sing\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=NOUN`, `Definite=Ind\|Gender=Fem\|Number=Sing\|POS=DET\|PronType=Art`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=AUX\|Tense=Past\|VerbForm=Part`, `POS=AUX\|VerbForm=Inf`, `Gender=Fem\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `POS=ADP`, `Gender=Fem\|Number=Sing\|POS=ADJ`, `POS=PRON\|PronType=Rel`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Definite=Def\|Gender=Masc\|Number=Plur\|POS=DET\|PronType=Art`, `Gender=Masc\|Number=Plur\|POS=NOUN`, `Definite=Def\|Gender=Fem\|Number=Sing\|POS=DET\|PronType=Art`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Ind`, `Definite=Def\|Gender=Masc\|Number=Plur\|POS=ADP\|PronType=Art`, `Number=Plur\|POS=ADJ`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Definite=Def\|Number=Sing\|POS=DET\|PronType=Art`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `POS=VERB\|VerbForm=Inf`, `Definite=Ind\|Gender=Masc\|Number=Sing\|POS=DET\|PronType=Art`, `Number=Sing\|POS=ADJ`, `POS=CCONJ`, `NumType=Card\|POS=NUM`, `Definite=Def\|Gender=Masc\|Number=Sing\|POS=ADP\|PronType=Art`, `Definite=Def\|Gender=Fem\|Number=Plur\|POS=ADP\|PronType=Art`, `Gender=Fem\|Number=Plur\|POS=NOUN`, `Clitic=Yes\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Fem\|Number=Plur\|POS=ADJ`, `Gender=Fem\|Number=Plur\|POS=DET\|Poss=Yes\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=ADJ`, `Definite=Def\|Number=Sing\|POS=ADP\|PronType=Art`, `Definite=Def\|Gender=Masc\|Number=Sing\|POS=DET\|PronType=Art`, `Gender=Masc\|NumType=Ord\|Number=Sing\|POS=ADJ`, `POS=ADV`, `POS=NOUN`, `Number=Sing\|POS=NOUN`, `POS=VERB\|VerbForm=Ger`, `Gender=Masc\|Number=Sing\|POS=DET\|Poss=Yes\|PronType=Prs`, `POS=INTJ`, `Clitic=Yes\|Number=Sing\|POS=PRON\|Person=2\|PronType=Prs`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=AUX\|Tense=Past\|VerbForm=Part`, `Definite=Def\|Gender=Fem\|Number=Plur\|POS=DET\|PronType=Art`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Gender=Fem\|POS=NOUN`, `Gender=Fem\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Tot`, `Mood=Cnd\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Mood=Cnd\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Ind`, `Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Number=Plur\|POS=NOUN`, `POS=SCONJ`, `Number=Sing\|POS=DET\|PronType=Ind`, `POS=ADV\|PronType=Neg`, `Clitic=Yes\|POS=VERB\|PronType=Prs\|VerbForm=Inf`, `Gender=Fem\|Number=Plur\|POS=AUX\|Tense=Past\|VerbForm=Part`, `Gender=Fem\|Number=Plur\|POS=DET\|PronType=Ind`, `Gender=Fem\|Number=Sing\|POS=PRON\|PronType=Ind`, `POS=ADJ`, `Number=Sing\|POS=PRON\|PronType=Rel`, `Gender=Fem\|NumType=Ord\|Number=Sing\|POS=ADJ`, `Number=Sing\|POS=PRON\|PronType=Ind`, `Gender=Masc\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=AUX\|Tense=Past\|VerbForm=Part`, `Clitic=Yes\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `POS=DET\|Poss=Yes\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=DET\|Poss=Yes\|PronType=Prs`, `Mood=Sub\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=DET\|PronType=Ind`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Dem`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Past\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Gender=Fem\|Number=Sing\|POS=DET\|Poss=Yes\|PronType=Prs`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Dem`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|POS=PRON\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=DET\|PronType=Tot`, `Clitic=Yes\|Gender=Masc\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Clitic=Yes\|Number=Sing\|POS=PRON\|Person=1\|PronType=Prs`, `Degree=Cmp\|Number=Plur\|POS=ADJ`, `Clitic=Yes\|Gender=Masc\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Degree=Cmp\|Number=Sing\|POS=ADJ`, `Gender=Masc\|Number=Plur\|POS=DET\|PronType=Dem`, `Degree=Abs\|POS=ADV`, `Clitic=Yes\|Gender=Fem\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=AUX\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Dem`, `POS=DET\|PronType=Exc`, `Number=Plur\|POS=PRON\|Person=1\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Number=Plur\|POS=PRON\|Person=1\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Past\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Past\|VerbForm=Fin`, `Number=Sing\|POS=DET\|PronType=Dem`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Past\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Gender=Fem\|NumType=Ord\|Number=Plur\|POS=ADJ`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Past\|VerbForm=Fin`, `Number=Sing\|POS=DET\|PronType=Int`, `POS=PRON\|PronType=Int`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Past\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Ind`, `Number=Sing\|POS=ADP`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Foreign=Yes\|POS=X`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|POS=AUX\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Gender=Masc\|Mood=Imp\|Number=Plur,Sing\|POS=VERB\|Person=1,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Sing\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=PRON\|Poss=Yes\|PronType=Prs`, `Number=Plur\|POS=VERB\|Tense=Pres\|VerbForm=Part`, `POS=INTJ\|Polarity=Neg`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|PronType=Rel`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Ind`, `Gender=Fem\|Number=Sing\|POS=PRON\|PronType=Dem`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=DET\|PronType=Dem`, `Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Rel`, `Clitic=Yes\|Number=Plur\|POS=VERB\|Person=1\|PronType=Prs\|VerbForm=Ger`, `POS=INTJ\|Polarity=Pos`, `Gender=Fem\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Int`, `POS=DET\|PronType=Int`, `Gender=Masc\|NumType=Ord\|Number=Plur\|POS=ADJ`, `Gender=Fem\|Number=Plur\|POS=DET\|PronType=Int`, `Mood=Cnd\|Number=Plur\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `POS=PRON\|Person=3\|PronType=Prs`, `Degree=Abs\|Gender=Masc\|Number=Plur\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Ind`, `Number=Sing\|POS=PRON\|Person=1\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Dem`, `Clitic=Yes\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Clitic=Yes\|Gender=Fem\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Gender=Fem\|POS=PRON\|Person=3\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Degree=Abs\|Gender=Fem\|Number=Sing\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Tot`, `Clitic=Yes\|POS=AUX\|PronType=Prs\|VerbForm=Inf`, `Gender=Fem\|Number=Plur\|POS=DET\|PronType=Tot`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Dem`, `Degree=Abs\|Gender=Masc\|Number=Sing\|POS=ADJ`, `NumType=Ord\|POS=ADJ`, `POS=DET\|PronType=Rel`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Rel`, `Gender=Masc\|Number=Plur\|POS=PRON\|Poss=Yes\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Imp\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Number=Sing\|POS=PRON\|Person=2\|PronType=Prs`, `Mood=Cnd\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Number=Plur\|POS=PRON\|Person=2\|PronType=Prs`, `Clitic=Yes\|Number=Sing\|POS=VERB\|Person=1\|PronType=Prs\|VerbForm=Inf`, `Mood=Imp\|Number=Sing\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=2\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Mood=Cnd\|Number=Sing\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|POS=VERB\|PronType=Prs\|VerbForm=Ger`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Cnd\|Number=Sing\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Number=Sing\|POS=PRON\|PronType=Int`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=2\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|Person=2\|PronType=Prs`, `Clitic=Yes\|Number=Plur\|POS=VERB\|Person=2\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Number=Plur\|POS=VERB\|Person=1\|PronType=Prs\|VerbForm=Inf`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=2\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Definite=Def\|POS=DET\|PronType=Art`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `POS=SYM`, `Clitic=Yes\|Mood=Imp\|Number=Sing\|POS=VERB\|Person=2\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Mood=Imp\|Number=Sing\|POS=VERB\|Person=2,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=2\|Tense=Fut\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Degree=Abs\|Gender=Fem\|Number=Plur\|POS=ADJ`, `Number=Sing\|POS=PRON\|PronType=Dem`, `POS=AUX\|VerbForm=Ger`, `Gender=Masc\|Number=Sing\|POS=PRON\|Poss=Yes\|PronType=Prs`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs\|VerbForm=Inf`, `POS=PRON\|PronType=Ind`, `Clitic=Yes\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=1\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `POS=X`, `Gender=Masc\|POS=ADJ`, `Clitic=Yes\|Gender=Fem\|Number=Sing\|POS=AUX\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Gender=Fem\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Mood=Cnd\|Number=Sing\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Number=Sing\|POS=VERB\|Person=2\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Gender=Fem\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=2\|Tense=Imp\|VerbForm=Fin`, `POS=PART`, `Number=Sing\|POS=VERB\|Tense=Pres\|VerbForm=Part`, `NumType=Ord\|Number=Sing\|POS=ADJ`, `Number=Plur\|POS=DET\|PronType=Int`, `Clitic=Yes\|Mood=Sub\|Number=Plur\|POS=VERB\|Person=1\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=DET\|PronType=Rel`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Clitic=Yes\|Number=Sing\|POS=VERB\|Person=1\|PronType=Prs\|VerbForm=Ger`, `Clitic=Yes\|Number=Sing\|POS=AUX\|Person=1\|PronType=Prs\|VerbForm=Ger`, `Clitic=Yes\|Gender=Masc\|Number=Plur\|POS=AUX\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Mood=Imp\|Number=Plur,Sing\|POS=VERB\|Person=1,2\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Imp\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `NumType=Range\|POS=NUM`, `Number=Plur\|POS=PRON\|PronType=Dem`, `POS=VERB\|Tense=Past\|VerbForm=Part`, `Clitic=Yes\|POS=ADV\|PronType=Prs`, `Clitic=Yes\|Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|POS=PRON\|PronType=Rel`, `Clitic=Yes\|Gender=Masc\|Mood=Imp\|Number=Plur,Sing\|POS=VERB\|Person=2,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Number=Sing\|POS=AUX\|Person=2\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Number=Sing\|POS=VERB\|Person=2\|PronType=Prs\|VerbForm=Ger`, `Mood=Imp\|Number=Sing\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|Mood=Imp\|Number=Sing\|POS=VERB\|Person=2,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Past\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Clitic=Yes\|Gender=Masc\|Number=Plur,Sing\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Definite=Ind\|POS=DET\|PronType=Art`, `Clitic=Yes\|Gender=Fem,Masc\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Definite=Ind\|Gender=Masc\|Number=Plur\|POS=DET\|PronType=Art`, `Definite=Def\|Number=Plur\|POS=ADP\|PronType=Art`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `POS=DET\|PronType=Ind`, `Number=Plur\|POS=DET\|PronType=Dem`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Number=Plur\|POS=DET\|PronType=Tot`, `Clitic=Yes\|POS=AUX\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Number=Plur\|POS=PRON\|PronType=Ind`, `Clitic=Yes\|Gender=Fem,Masc\|Number=Plur,Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Clitic=Yes\|Number=Plur\|POS=VERB\|PronType=Prs\|VerbForm=Inf`, `Number=Plur\|POS=PRON\|Poss=Yes\|PronType=Prs`, `Number=Sing\|POS=PRON\|Poss=Yes\|PronType=Prs`, `Number=Plur\|POS=ADP`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=ADV\|Person=3\|PronType=Prs`, `Clitic=Yes\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=1,2\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Mood=Sub\|Number=Sing\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Cnd\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=ADV\|Person=3\|PronType=Prs`, `POS=DET\|PronType=Tot`, `POS=PRON\|PronType=Dem`, `Clitic=Yes\|Gender=Masc\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=2,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Definite=Ind\|Number=Sing\|POS=DET\|PronType=Art`, `NumType=Ord\|POS=NUM`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Gender=Masc\|POS=DET\|PronType=Dem`, `Clitic=Yes\|Gender=Masc\|Number=Plur,Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Gender=Masc\|Number=Sing\|POS=NOUN\|Tense=Past\|VerbForm=Part`, `Gender=Masc\|Number=Plur\|POS=DET\|PronType=Int`, `Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Int`, `Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Int`, `Mood=Imp\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Int`, `Gender=Fem\|Number=Sing\|POS=PRON\|PronType=Int`, `Number=Plur\|POS=PRON\|PronType=Int`, `Mood=Cnd\|Number=Sing\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Int`, `Clitic=Yes\|Number=Plur\|POS=PRON\|PronType=Prs`, `Foreign=Yes\|Number=Sing\|POS=X`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Fut\|VerbForm=Fin`, `POS=PRON\|PronType=Prs`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=2\|Tense=Imp\|VerbForm=Fin`, `POS=SCONJ\|PronType=Rel`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `POS=PRON\|Person=3\|PronType=Rel`, `Clitic=Yes\|Number=Plur\|POS=VERB\|Person=2\|PronType=Prs\|VerbForm=Ger`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=3\|VerbForm=Fin`, `Clitic=Yes\|Mood=Ind\|Number=Sing\|POS=VERB\|Person=1,3\|PronType=Prs\|Tense=Past\|VerbForm=Fin`, `Mood=Ind\|POS=VERB\|Tense=Pres\|VerbForm=Fin`, `Degree=Cmp\|POS=ADJ`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=2\|Tense=Fut\|VerbForm=Fin`, `Definite=Def\|Number=Plur\|POS=DET\|PronType=Art`, `Number=Sing\|POS=DET\|Poss=Yes\|PronType=Prs`, `Gender=Masc\|Number=Sing\|POS=ADP`, `Gender=Fem\|POS=ADJ`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=2,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Masc\|POS=DET\|Poss=Yes\|PronType=Prs`, `Gender=Fem\|Number=Plur\|POS=PROPN`, `Definite=Ind\|Gender=Fem\|Number=Plur\|POS=DET\|PronType=Art`, `Number=Sing\|POS=DET\|PronType=Art`, `Gender=Fem\|Number=Sing\|POS=ADJ\|Poss=Yes\|PronType=Prs`, `Foreign=Yes\|POS=NOUN`, `Clitic=Yes\|Gender=Fem\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=1,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=1,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=DET`, `Clitic=Yes\|Gender=Fem\|Mood=Imp\|Number=Plur,Sing\|POS=VERB\|Person=1,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=DET`, `Number=Sing\|POS=X`, `Foreign=Yes\|Gender=Masc\|POS=X`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Prs`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Prs`, `Clitic=Yes\|Definite=Def\|Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Art`, `Gender=Masc\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Fin`, `Definite=Def\|Gender=Fem\|POS=DET`, `Definite=Def\|POS=DET`, `Foreign=Yes\|POS=PROPN`, `NumType=Card\|POS=PROPN`, `Gender=Fem\|Number=Sing\|POS=DET`, `Degree=Abs\|Gender=Masc\|Number=Sing\|POS=ADV`, `Gender=Masc\|Number=Plur\|POS=NOUN\|Tense=Past\|VerbForm=Part`, `Mood=Imp\|Number=Plur\|POS=VERB\|Person=2`, `Clitic=Yes\|Number=Plur\|POS=AUX\|Person=1\|PronType=Prs\|VerbForm=Inf`, `Gender=Masc\|Number=Sing\|POS=DET`, `Number=Sing\|POS=DET`, `Gender=Masc\|Number=Sing\|POS=PRON`, `POS=DET` |
84
  | **`tagger`** | `A`, `AP`, `B`, `BN`, `B_PC`, `CC`, `CS`, `DD`, `DE`, `DI`, `DQ`, `DR`, `E`, `E_RD`, `FB`, `FC`, `FF`, `FS`, `I`, `N`, `NO`, `PART`, `PC`, `PC_PC`, `PD`, `PE`, `PI`, `PP`, `PQ`, `PR`, `RD`, `RI`, `S`, `SP`, `SW`, `SYM`, `T`, `V`, `VA`, `VA_PC`, `VM`, `VM_PC`, `VM_PC_PC`, `V_B`, `V_PC`, `V_PC_PC`, `X` |
85
  | **`parser`** | `ROOT`, `acl`, `acl:relcl`, `advcl`, `advmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `cc`, `ccomp`, `compound`, `conj`, `cop`, `csubj`, `dep`, `det`, `det:poss`, `det:predet`, `discourse`, `expl`, `expl:impers`, `expl:pass`, `fixed`, `flat`, `flat:foreign`, `flat:name`, `iobj`, `mark`, `nmod`, `nsubj`, `nsubj:pass`, `nummod`, `obj`, `obl`, `obl:agent`, `parataxis`, `punct`, `vocative`, `xcomp` |
86
- | **`senter`** | `I`, `S` |
87
  | **`ner`** | `LOC`, `MISC`, `ORG`, `PER` |
88
 
89
  </details>
@@ -92,22 +106,22 @@ Italian pipeline optimized for CPU. Components: tok2vec, morphologizer, tagger,
92
 
93
  | Type | Score |
94
  | --- | --- |
95
- | `TAG_ACC` | 97.07 |
96
- | `SENTS_P` | 96.85 |
97
- | `SENTS_R` | 98.23 |
98
- | `SENTS_F` | 97.54 |
99
- | `ENTS_P` | 87.54 |
100
- | `ENTS_R` | 87.44 |
101
- | `ENTS_F` | 87.49 |
102
  | `TOKEN_ACC` | 99.96 |
103
  | `TOKEN_P` | 99.80 |
104
  | `TOKEN_R` | 99.78 |
105
  | `TOKEN_F` | 99.79 |
106
- | `POS_ACC` | 97.46 |
107
- | `MORPH_ACC` | 97.20 |
108
- | `MORPH_MICRO_P` | 98.64 |
109
  | `MORPH_MICRO_R` | 98.07 |
110
- | `MORPH_MICRO_F` | 98.35 |
111
- | `DEP_UAS` | 90.98 |
 
 
 
 
112
  | `DEP_LAS` | 87.50 |
113
- | `LEMMA_ACC` | 86.59 |
 
 
 
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
+ value: 0.8742779079
18
  - name: NER Recall
19
  type: recall
20
+ value: 0.8732213169
21
  - name: NER F Score
22
  type: f_score
23
+ value: 0.873749293
24
+ - task:
25
+ name: TAG
26
+ type: token-classification
27
+ metrics:
28
+ - name: TAG (XPOS) Accuracy
29
+ type: accuracy
30
+ value: 0.9704322818
31
  - task:
32
  name: POS
33
  type: token-classification
34
  metrics:
35
+ - name: POS (UPOS) Accuracy
36
  type: accuracy
37
+ value: 0.9743866271
38
  - task:
39
+ name: MORPH
40
  type: token-classification
41
  metrics:
42
+ - name: Morph (UFeats) Accuracy
43
+ type: accuracy
44
+ value: 0.971641724
 
 
 
 
 
 
45
  - task:
46
+ name: LEMMA
47
  type: token-classification
48
  metrics:
49
+ - name: Lemma Accuracy
50
  type: accuracy
51
+ value: 0.972232207
52
+ - task:
53
+ name: UNLABELED_DEPENDENCIES
54
+ type: token-classification
55
+ metrics:
56
+ - name: Unlabeled Attachment Score (UAS)
57
+ type: f_score
58
+ value: 0.9112894926
59
  - task:
60
  name: LABELED_DEPENDENCIES
61
  type: token-classification
62
  metrics:
63
+ - name: Labeled Attachment Score (LAS)
64
+ type: f_score
65
+ value: 0.8750192951
66
+ - task:
67
+ name: SENTS
68
+ type: token-classification
69
+ metrics:
70
+ - name: Sentences F-Score
71
+ type: f_score
72
+ value: 0.9718309859
73
  ---
74
  ### Details: https://spacy.io/models/it#it_core_news_md
75
 
76
+ Italian pipeline optimized for CPU. Components: tok2vec, morphologizer, tagger, parser, lemmatizer (trainable_lemmatizer), senter, ner.
77
 
78
  | Feature | Description |
79
  | --- | --- |
80
  | **Name** | `it_core_news_md` |
81
+ | **Version** | `3.3.0` |
82
+ | **spaCy** | `>=3.3.0.dev0,<3.4.0` |
83
+ | **Default Pipeline** | `tok2vec`, `morphologizer`, `tagger`, `parser`, `lemmatizer`, `attribute_ruler`, `ner` |
84
+ | **Components** | `tok2vec`, `morphologizer`, `tagger`, `parser`, `lemmatizer`, `senter`, `attribute_ruler`, `ner` |
85
  | **Vectors** | 500000 keys, 20000 unique vectors (300 dimensions) |
86
+ | **Sources** | [UD Italian ISDT v2.8](https://github.com/UniversalDependencies/UD_Italian-ISDT) (Bosco, Cristina; Lenci, Alessandro; Montemagni, Simonetta; Simi, Maria)<br />[WikiNER](https://figshare.com/articles/Learning_multilingual_named_entity_recognition_from_Wikipedia/5462500) (Joel Nothman, Nicky Ringland, Will Radford, Tara Murphy, James R Curran)<br />[Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)](https://spacy.io) (Explosion) |
87
  | **License** | `CC BY-NC-SA 3.0` |
88
  | **Author** | [Explosion](https://explosion.ai) |
89
 
91
 
92
  <details>
93
 
94
+ <summary>View label scheme (441 labels for 4 components)</summary>
95
 
96
  | Component | Labels |
97
  | --- | --- |
98
  | **`morphologizer`** | `POS=PROPN`, `POS=PUNCT`, `Gender=Masc\|POS=NOUN`, `Definite=Def\|Gender=Fem\|Number=Sing\|POS=ADP\|PronType=Art`, `Gender=Fem\|Number=Sing\|POS=NOUN`, `Gender=Masc\|Number=Sing\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=NOUN`, `Definite=Ind\|Gender=Fem\|Number=Sing\|POS=DET\|PronType=Art`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=AUX\|Tense=Past\|VerbForm=Part`, `POS=AUX\|VerbForm=Inf`, `Gender=Fem\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `POS=ADP`, `Gender=Fem\|Number=Sing\|POS=ADJ`, `POS=PRON\|PronType=Rel`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Definite=Def\|Gender=Masc\|Number=Plur\|POS=DET\|PronType=Art`, `Gender=Masc\|Number=Plur\|POS=NOUN`, `Definite=Def\|Gender=Fem\|Number=Sing\|POS=DET\|PronType=Art`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Ind`, `Definite=Def\|Gender=Masc\|Number=Plur\|POS=ADP\|PronType=Art`, `Number=Plur\|POS=ADJ`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Definite=Def\|Number=Sing\|POS=DET\|PronType=Art`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `POS=VERB\|VerbForm=Inf`, `Definite=Ind\|Gender=Masc\|Number=Sing\|POS=DET\|PronType=Art`, `Number=Sing\|POS=ADJ`, `POS=CCONJ`, `NumType=Card\|POS=NUM`, `Definite=Def\|Gender=Masc\|Number=Sing\|POS=ADP\|PronType=Art`, `Definite=Def\|Gender=Fem\|Number=Plur\|POS=ADP\|PronType=Art`, `Gender=Fem\|Number=Plur\|POS=NOUN`, `Clitic=Yes\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Fem\|Number=Plur\|POS=ADJ`, `Gender=Fem\|Number=Plur\|POS=DET\|Poss=Yes\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=ADJ`, `Definite=Def\|Number=Sing\|POS=ADP\|PronType=Art`, `Definite=Def\|Gender=Masc\|Number=Sing\|POS=DET\|PronType=Art`, `Gender=Masc\|NumType=Ord\|Number=Sing\|POS=ADJ`, `POS=ADV`, `POS=NOUN`, `Number=Sing\|POS=NOUN`, `POS=VERB\|VerbForm=Ger`, `Gender=Masc\|Number=Sing\|POS=DET\|Poss=Yes\|PronType=Prs`, `POS=INTJ`, `Clitic=Yes\|Number=Sing\|POS=PRON\|Person=2\|PronType=Prs`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=AUX\|Tense=Past\|VerbForm=Part`, `Definite=Def\|Gender=Fem\|Number=Plur\|POS=DET\|PronType=Art`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Gender=Fem\|POS=NOUN`, `Gender=Fem\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Tot`, `Mood=Cnd\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Mood=Cnd\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Ind`, `Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Number=Plur\|POS=NOUN`, `POS=SCONJ`, `Number=Sing\|POS=DET\|PronType=Ind`, `POS=ADV\|PronType=Neg`, `Clitic=Yes\|POS=VERB\|PronType=Prs\|VerbForm=Inf`, `Gender=Fem\|Number=Plur\|POS=AUX\|Tense=Past\|VerbForm=Part`, `Gender=Fem\|Number=Plur\|POS=DET\|PronType=Ind`, `Gender=Fem\|Number=Sing\|POS=PRON\|PronType=Ind`, `POS=ADJ`, `Number=Sing\|POS=PRON\|PronType=Rel`, `Gender=Fem\|NumType=Ord\|Number=Sing\|POS=ADJ`, `Number=Sing\|POS=PRON\|PronType=Ind`, `Gender=Masc\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=AUX\|Tense=Past\|VerbForm=Part`, `Clitic=Yes\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `POS=DET\|Poss=Yes\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=DET\|Poss=Yes\|PronType=Prs`, `Mood=Sub\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=DET\|PronType=Ind`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Dem`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Past\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Gender=Fem\|Number=Sing\|POS=DET\|Poss=Yes\|PronType=Prs`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Dem`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|POS=PRON\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=DET\|PronType=Tot`, `Clitic=Yes\|Gender=Masc\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Clitic=Yes\|Number=Sing\|POS=PRON\|Person=1\|PronType=Prs`, `Degree=Cmp\|Number=Plur\|POS=ADJ`, `Clitic=Yes\|Gender=Masc\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Degree=Cmp\|Number=Sing\|POS=ADJ`, `Gender=Masc\|Number=Plur\|POS=DET\|PronType=Dem`, `Degree=Abs\|POS=ADV`, `Clitic=Yes\|Gender=Fem\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=AUX\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Dem`, `POS=DET\|PronType=Exc`, `Number=Plur\|POS=PRON\|Person=1\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Number=Plur\|POS=PRON\|Person=1\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Past\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Past\|VerbForm=Fin`, `Number=Sing\|POS=DET\|PronType=Dem`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Past\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Gender=Fem\|NumType=Ord\|Number=Plur\|POS=ADJ`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Past\|VerbForm=Fin`, `Number=Sing\|POS=DET\|PronType=Int`, `POS=PRON\|PronType=Int`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Past\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Ind`, `Number=Sing\|POS=ADP`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Foreign=Yes\|POS=X`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|POS=AUX\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Gender=Masc\|Mood=Imp\|Number=Plur,Sing\|POS=VERB\|Person=1,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Sing\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=PRON\|Poss=Yes\|PronType=Prs`, `Number=Plur\|POS=VERB\|Tense=Pres\|VerbForm=Part`, `POS=INTJ\|Polarity=Neg`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|PronType=Rel`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Ind`, `Gender=Fem\|Number=Sing\|POS=PRON\|PronType=Dem`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=DET\|PronType=Dem`, `Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Rel`, `Clitic=Yes\|Number=Plur\|POS=VERB\|Person=1\|PronType=Prs\|VerbForm=Ger`, `POS=INTJ\|Polarity=Pos`, `Gender=Fem\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Int`, `POS=DET\|PronType=Int`, `Gender=Masc\|NumType=Ord\|Number=Plur\|POS=ADJ`, `Gender=Fem\|Number=Plur\|POS=DET\|PronType=Int`, `Mood=Cnd\|Number=Plur\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `POS=PRON\|Person=3\|PronType=Prs`, `Degree=Abs\|Gender=Masc\|Number=Plur\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Ind`, `Number=Sing\|POS=PRON\|Person=1\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Dem`, `Clitic=Yes\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Clitic=Yes\|Gender=Fem\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Gender=Fem\|POS=PRON\|Person=3\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Degree=Abs\|Gender=Fem\|Number=Sing\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Tot`, `Clitic=Yes\|POS=AUX\|PronType=Prs\|VerbForm=Inf`, `Gender=Fem\|Number=Plur\|POS=DET\|PronType=Tot`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Dem`, `Degree=Abs\|Gender=Masc\|Number=Sing\|POS=ADJ`, `NumType=Ord\|POS=ADJ`, `POS=DET\|PronType=Rel`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Rel`, `Gender=Masc\|Number=Plur\|POS=PRON\|Poss=Yes\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Imp\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Number=Sing\|POS=PRON\|Person=2\|PronType=Prs`, `Mood=Cnd\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Number=Plur\|POS=PRON\|Person=2\|PronType=Prs`, `Clitic=Yes\|Number=Sing\|POS=VERB\|Person=1\|PronType=Prs\|VerbForm=Inf`, `Mood=Imp\|Number=Sing\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=2\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Mood=Cnd\|Number=Sing\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|POS=VERB\|PronType=Prs\|VerbForm=Ger`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Cnd\|Number=Sing\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Number=Sing\|POS=PRON\|PronType=Int`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=2\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|Person=2\|PronType=Prs`, `Clitic=Yes\|Number=Plur\|POS=VERB\|Person=2\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Number=Plur\|POS=VERB\|Person=1\|PronType=Prs\|VerbForm=Inf`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=2\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Definite=Def\|POS=DET\|PronType=Art`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `POS=SYM`, `Clitic=Yes\|Mood=Imp\|Number=Sing\|POS=VERB\|Person=2\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Mood=Imp\|Number=Sing\|POS=VERB\|Person=2,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=2\|Tense=Fut\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Degree=Abs\|Gender=Fem\|Number=Plur\|POS=ADJ`, `Number=Sing\|POS=PRON\|PronType=Dem`, `POS=AUX\|VerbForm=Ger`, `Gender=Masc\|Number=Sing\|POS=PRON\|Poss=Yes\|PronType=Prs`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs\|VerbForm=Inf`, `POS=PRON\|PronType=Ind`, `Clitic=Yes\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=1\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `POS=X`, `Gender=Masc\|POS=ADJ`, `Clitic=Yes\|Gender=Fem\|Number=Sing\|POS=AUX\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Gender=Fem\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Mood=Cnd\|Number=Sing\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Number=Sing\|POS=VERB\|Person=2\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Gender=Fem\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=2\|Tense=Imp\|VerbForm=Fin`, `POS=PART`, `Number=Sing\|POS=VERB\|Tense=Pres\|VerbForm=Part`, `NumType=Ord\|Number=Sing\|POS=ADJ`, `Number=Plur\|POS=DET\|PronType=Int`, `Clitic=Yes\|Mood=Sub\|Number=Plur\|POS=VERB\|Person=1\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=DET\|PronType=Rel`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Clitic=Yes\|Number=Sing\|POS=VERB\|Person=1\|PronType=Prs\|VerbForm=Ger`, `Clitic=Yes\|Number=Sing\|POS=AUX\|Person=1\|PronType=Prs\|VerbForm=Ger`, `Clitic=Yes\|Gender=Masc\|Number=Plur\|POS=AUX\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Mood=Imp\|Number=Plur,Sing\|POS=VERB\|Person=1,2\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Imp\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `NumType=Range\|POS=NUM`, `Number=Plur\|POS=PRON\|PronType=Dem`, `POS=VERB\|Tense=Past\|VerbForm=Part`, `Clitic=Yes\|POS=ADV\|PronType=Prs`, `Clitic=Yes\|Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|POS=PRON\|PronType=Rel`, `Clitic=Yes\|Gender=Masc\|Mood=Imp\|Number=Plur,Sing\|POS=VERB\|Person=2,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Number=Sing\|POS=AUX\|Person=2\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Number=Sing\|POS=VERB\|Person=2\|PronType=Prs\|VerbForm=Ger`, `Mood=Imp\|Number=Sing\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|Mood=Imp\|Number=Sing\|POS=VERB\|Person=2,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Past\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Clitic=Yes\|Gender=Masc\|Number=Plur,Sing\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Definite=Ind\|POS=DET\|PronType=Art`, `Clitic=Yes\|Gender=Fem,Masc\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Definite=Ind\|Gender=Masc\|Number=Plur\|POS=DET\|PronType=Art`, `Definite=Def\|Number=Plur\|POS=ADP\|PronType=Art`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `POS=DET\|PronType=Ind`, `Number=Plur\|POS=DET\|PronType=Dem`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Number=Plur\|POS=DET\|PronType=Tot`, `Clitic=Yes\|POS=AUX\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Number=Plur\|POS=PRON\|PronType=Ind`, `Clitic=Yes\|Gender=Fem,Masc\|Number=Plur,Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Clitic=Yes\|Number=Plur\|POS=VERB\|PronType=Prs\|VerbForm=Inf`, `Number=Plur\|POS=PRON\|Poss=Yes\|PronType=Prs`, `Number=Sing\|POS=PRON\|Poss=Yes\|PronType=Prs`, `Number=Plur\|POS=ADP`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=ADV\|Person=3\|PronType=Prs`, `Clitic=Yes\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=1,2\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Mood=Sub\|Number=Sing\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Cnd\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=ADV\|Person=3\|PronType=Prs`, `POS=DET\|PronType=Tot`, `POS=PRON\|PronType=Dem`, `Clitic=Yes\|Gender=Masc\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=2,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Definite=Ind\|Number=Sing\|POS=DET\|PronType=Art`, `NumType=Ord\|POS=NUM`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Gender=Masc\|POS=DET\|PronType=Dem`, `Clitic=Yes\|Gender=Masc\|Number=Plur,Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Gender=Masc\|Number=Sing\|POS=NOUN\|Tense=Past\|VerbForm=Part`, `Gender=Masc\|Number=Plur\|POS=DET\|PronType=Int`, `Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Int`, `Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Int`, `Mood=Imp\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Int`, `Gender=Fem\|Number=Sing\|POS=PRON\|PronType=Int`, `Number=Plur\|POS=PRON\|PronType=Int`, `Mood=Cnd\|Number=Sing\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Int`, `Clitic=Yes\|Number=Plur\|POS=PRON\|PronType=Prs`, `Foreign=Yes\|Number=Sing\|POS=X`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Fut\|VerbForm=Fin`, `POS=PRON\|PronType=Prs`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=2\|Tense=Imp\|VerbForm=Fin`, `POS=SCONJ\|PronType=Rel`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `POS=PRON\|Person=3\|PronType=Rel`, `Clitic=Yes\|Number=Plur\|POS=VERB\|Person=2\|PronType=Prs\|VerbForm=Ger`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=3\|VerbForm=Fin`, `Clitic=Yes\|Mood=Ind\|Number=Sing\|POS=VERB\|Person=1,3\|PronType=Prs\|Tense=Past\|VerbForm=Fin`, `Mood=Ind\|POS=VERB\|Tense=Pres\|VerbForm=Fin`, `Degree=Cmp\|POS=ADJ`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=2\|Tense=Fut\|VerbForm=Fin`, `Definite=Def\|Number=Plur\|POS=DET\|PronType=Art`, `Number=Sing\|POS=DET\|Poss=Yes\|PronType=Prs`, `Gender=Masc\|Number=Sing\|POS=ADP`, `Gender=Fem\|POS=ADJ`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=2,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Masc\|POS=DET\|Poss=Yes\|PronType=Prs`, `Gender=Fem\|Number=Plur\|POS=PROPN`, `Definite=Ind\|Gender=Fem\|Number=Plur\|POS=DET\|PronType=Art`, `Number=Sing\|POS=DET\|PronType=Art`, `Gender=Fem\|Number=Sing\|POS=ADJ\|Poss=Yes\|PronType=Prs`, `Foreign=Yes\|POS=NOUN`, `Clitic=Yes\|Gender=Fem\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=1,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=1,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=DET`, `Clitic=Yes\|Gender=Fem\|Mood=Imp\|Number=Plur,Sing\|POS=VERB\|Person=1,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=DET`, `Number=Sing\|POS=X`, `Foreign=Yes\|Gender=Masc\|POS=X`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Prs`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Prs`, `Clitic=Yes\|Definite=Def\|Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Art`, `Gender=Masc\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Fin`, `Definite=Def\|Gender=Fem\|POS=DET`, `Definite=Def\|POS=DET`, `Foreign=Yes\|POS=PROPN`, `NumType=Card\|POS=PROPN`, `Gender=Fem\|Number=Sing\|POS=DET`, `Degree=Abs\|Gender=Masc\|Number=Sing\|POS=ADV`, `Gender=Masc\|Number=Plur\|POS=NOUN\|Tense=Past\|VerbForm=Part`, `Mood=Imp\|Number=Plur\|POS=VERB\|Person=2`, `Clitic=Yes\|Number=Plur\|POS=AUX\|Person=1\|PronType=Prs\|VerbForm=Inf`, `Gender=Masc\|Number=Sing\|POS=DET`, `Number=Sing\|POS=DET`, `Gender=Masc\|Number=Sing\|POS=PRON`, `POS=DET` |
99
  | **`tagger`** | `A`, `AP`, `B`, `BN`, `B_PC`, `CC`, `CS`, `DD`, `DE`, `DI`, `DQ`, `DR`, `E`, `E_RD`, `FB`, `FC`, `FF`, `FS`, `I`, `N`, `NO`, `PART`, `PC`, `PC_PC`, `PD`, `PE`, `PI`, `PP`, `PQ`, `PR`, `RD`, `RI`, `S`, `SP`, `SW`, `SYM`, `T`, `V`, `VA`, `VA_PC`, `VM`, `VM_PC`, `VM_PC_PC`, `V_B`, `V_PC`, `V_PC_PC`, `X` |
100
  | **`parser`** | `ROOT`, `acl`, `acl:relcl`, `advcl`, `advmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `cc`, `ccomp`, `compound`, `conj`, `cop`, `csubj`, `dep`, `det`, `det:poss`, `det:predet`, `discourse`, `expl`, `expl:impers`, `expl:pass`, `fixed`, `flat`, `flat:foreign`, `flat:name`, `iobj`, `mark`, `nmod`, `nsubj`, `nsubj:pass`, `nummod`, `obj`, `obl`, `obl:agent`, `parataxis`, `punct`, `vocative`, `xcomp` |
 
101
  | **`ner`** | `LOC`, `MISC`, `ORG`, `PER` |
102
 
103
  </details>
106
 
107
  | Type | Score |
108
  | --- | --- |
 
 
 
 
 
 
 
109
  | `TOKEN_ACC` | 99.96 |
110
  | `TOKEN_P` | 99.80 |
111
  | `TOKEN_R` | 99.78 |
112
  | `TOKEN_F` | 99.79 |
113
+ | `POS_ACC` | 97.44 |
114
+ | `MORPH_ACC` | 97.16 |
115
+ | `MORPH_MICRO_P` | 98.68 |
116
  | `MORPH_MICRO_R` | 98.07 |
117
+ | `MORPH_MICRO_F` | 98.37 |
118
+ | `TAG_ACC` | 97.04 |
119
+ | `SENTS_P` | 96.50 |
120
+ | `SENTS_R` | 97.87 |
121
+ | `SENTS_F` | 97.18 |
122
+ | `DEP_UAS` | 91.13 |
123
  | `DEP_LAS` | 87.50 |
124
+ | `LEMMA_ACC` | 97.22 |
125
+ | `ENTS_P` | 87.43 |
126
+ | `ENTS_R` | 87.32 |
127
+ | `ENTS_F` | 87.37 |
accuracy.json CHANGED
@@ -1,58 +1,28 @@
1
  {
2
- "tag_acc": 0.9707455175,
3
- "sents_p": 0.9685314685,
4
- "sents_r": 0.9822695035,
5
- "sents_f": 0.9753521127,
6
- "ents_p": 0.8753707462,
7
- "ents_r": 0.8744493392,
8
- "ents_f": 0.8749098001,
9
- "ents_per_type": {
10
- "ORG": {
11
- "p": 0.0,
12
- "r": 0.0,
13
- "f": 0.0
14
- },
15
- "LOC": {
16
- "p": 0.0,
17
- "r": 0.0,
18
- "f": 0.0
19
- },
20
- "PER": {
21
- "p": 0.0,
22
- "r": 0.0,
23
- "f": 0.0
24
- },
25
- "MISC": {
26
- "p": 0.0,
27
- "r": 0.0,
28
- "f": 0.0
29
- }
30
- },
31
- "speed": 9476.0962556217,
32
  "token_acc": 0.9996405141,
33
  "token_p": 0.9980235379,
34
  "token_r": 0.9978442468,
35
  "token_f": 0.9979338843,
36
- "pos_acc": 0.9746101649,
37
- "morph_acc": 0.9720449438,
38
- "morph_micro_p": 0.9863999603,
39
- "morph_micro_r": 0.980704698,
40
- "morph_micro_f": 0.9835440845,
41
  "morph_per_feat": {
42
  "Gender": {
43
- "p": 0.9903510573,
44
- "r": 0.9848917926,
45
- "f": 0.9876138806
46
  },
47
  "Number": {
48
- "p": 0.9924095607,
49
  "r": 0.9866730893,
50
- "f": 0.9895330113
51
  },
52
  "NumType": {
53
- "p": 0.9847328244,
54
- "r": 0.9520295203,
55
- "f": 0.9681050657
56
  },
57
  "Definite": {
58
  "p": 0.9964454976,
@@ -60,133 +30,137 @@
60
  "f": 0.9970361589
61
  },
62
  "PronType": {
63
- "p": 0.9916839917,
64
- "r": 0.9863523573,
65
- "f": 0.989010989
66
  },
67
  "Mood": {
68
- "p": 0.9588377724,
69
- "r": 0.9507803121,
70
- "f": 0.9547920434
71
  },
72
  "Person": {
73
- "p": 0.972575906,
74
- "r": 0.9612778316,
75
- "f": 0.9668938656
76
  },
77
  "Tense": {
78
- "p": 0.9607173356,
79
- "r": 0.9574468085,
80
- "f": 0.9590792839
81
  },
82
  "VerbForm": {
83
- "p": 0.9735927728,
84
- "r": 0.9708939709,
85
- "f": 0.972241499
86
  },
87
  "Degree": {
88
- "p": 0.8235294118,
89
- "r": 0.8235294118,
90
- "f": 0.8235294118
91
  },
92
  "Clitic": {
93
- "p": 1.0,
94
- "r": 0.9786096257,
95
- "f": 0.9891891892
96
  },
97
  "Poss": {
98
- "p": 0.9852941176,
99
  "r": 1.0,
100
- "f": 0.9925925926
101
  },
102
  "Polarity": {
103
  "p": 1.0,
104
- "r": 0.6666666667,
105
- "f": 0.8
106
  },
107
  "Foreign": {
108
- "p": 1.0,
109
- "r": 0.4,
110
- "f": 0.5714285714
111
  }
112
  },
113
- "dep_uas": 0.9097643791,
114
- "dep_las": 0.8749871386,
 
 
 
 
115
  "dep_las_per_type": {
116
  "root": {
117
- "p": 0.8898601399,
118
- "r": 0.9024822695,
119
- "f": 0.8961267606
120
  },
121
  "flat:name": {
122
- "p": 0.9230769231,
123
- "r": 0.9113924051,
124
- "f": 0.9171974522
125
  },
126
  "case": {
127
- "p": 0.9811435523,
128
- "r": 0.9817407182,
129
- "f": 0.9814420444
130
  },
131
  "nmod": {
132
- "p": 0.8126888218,
133
- "r": 0.8251533742,
134
- "f": 0.8188736682
135
  },
136
  "nummod": {
137
- "p": 0.8895027624,
138
- "r": 0.875,
139
- "f": 0.8821917808
140
  },
141
  "det": {
142
- "p": 0.976146789,
143
- "r": 0.9779411765,
144
- "f": 0.9770431589
145
  },
146
  "nsubj": {
147
- "p": 0.8474903475,
148
- "r": 0.8474903475,
149
- "f": 0.8474903475
150
  },
151
  "aux": {
152
- "p": 0.9311926606,
153
- "r": 0.935483871,
154
- "f": 0.9333333333
155
  },
156
  "advmod": {
157
- "p": 0.8159090909,
158
- "r": 0.8233944954,
159
- "f": 0.8196347032
160
  },
161
  "obj": {
162
- "p": 0.8684210526,
163
- "r": 0.9075,
164
- "f": 0.8875305623
165
  },
166
  "cc": {
167
- "p": 0.9242902208,
168
- "r": 0.8987730061,
169
- "f": 0.9113530327
170
  },
171
  "conj": {
172
- "p": 0.670984456,
173
- "r": 0.7,
174
- "f": 0.6851851852
175
  },
176
  "det:predet": {
177
- "p": 0.9473684211,
178
  "r": 1.0,
179
- "f": 0.972972973
180
  },
181
  "amod": {
182
- "p": 0.9081325301,
183
- "r": 0.904047976,
184
- "f": 0.9060856499
185
  },
186
  "mark": {
187
- "p": 0.9262295082,
188
- "r": 0.9338842975,
189
- "f": 0.9300411523
190
  },
191
  "cop": {
192
  "p": 0.828358209,
@@ -194,24 +168,24 @@
194
  "f": 0.8538461538
195
  },
196
  "xcomp": {
197
- "p": 0.7578947368,
198
- "r": 0.75,
199
- "f": 0.7539267016
200
  },
201
  "obl": {
202
- "p": 0.8242612753,
203
- "r": 0.7748538012,
204
- "f": 0.7987942728
205
  },
206
  "acl:relcl": {
207
- "p": 0.7923076923,
208
- "r": 0.7518248175,
209
- "f": 0.7715355805
210
  },
211
  "acl": {
212
- "p": 0.6363636364,
213
- "r": 0.6086956522,
214
- "f": 0.6222222222
215
  },
216
  "ccomp": {
217
  "p": 0.7,
@@ -219,14 +193,14 @@
219
  "f": 0.6885245902
220
  },
221
  "expl": {
222
- "p": 0.9178082192,
223
- "r": 0.858974359,
224
- "f": 0.8874172185
225
  },
226
  "nsubj:pass": {
227
- "p": 0.8289473684,
228
- "r": 0.7777777778,
229
- "f": 0.8025477707
230
  },
231
  "aux:pass": {
232
  "p": 0.8658536585,
@@ -234,19 +208,24 @@
234
  "f": 0.8819875776
235
  },
236
  "parataxis": {
237
- "p": 0.5,
238
- "r": 0.1290322581,
239
- "f": 0.2051282051
 
 
 
 
 
240
  },
241
  "advcl": {
242
- "p": 0.5961538462,
243
- "r": 0.6549295775,
244
- "f": 0.6241610738
245
  },
246
  "det:poss": {
247
- "p": 0.9855072464,
248
  "r": 1.0,
249
- "f": 0.9927007299
250
  },
251
  "flat": {
252
  "p": 1.0,
@@ -254,39 +233,34 @@
254
  "f": 1.0
255
  },
256
  "appos": {
257
- "p": 0.5,
258
- "r": 0.4615384615,
259
- "f": 0.48
260
  },
261
  "obl:agent": {
262
- "p": 0.9487179487,
263
  "r": 0.880952381,
264
- "f": 0.9135802469
265
- },
266
- "dep": {
267
- "p": 0.0,
268
- "r": 0.0,
269
- "f": 0.0
270
  },
271
  "iobj": {
272
- "p": 0.8260869565,
273
- "r": 0.95,
274
- "f": 0.8837209302
275
  },
276
  "expl:impers": {
277
- "p": 0.5333333333,
278
  "r": 1.0,
279
- "f": 0.6956521739
280
  },
281
  "csubj": {
282
- "p": 0.75,
283
- "r": 0.4615384615,
284
- "f": 0.5714285714
285
  },
286
  "compound": {
287
- "p": 0.64,
288
- "r": 0.6153846154,
289
- "f": 0.6274509804
290
  },
291
  "discourse": {
292
  "p": 0.0,
@@ -294,30 +268,56 @@
294
  "f": 0.0
295
  },
296
  "fixed": {
297
- "p": 0.8461538462,
298
  "r": 0.7857142857,
299
- "f": 0.8148148148
300
  },
301
  "expl:pass": {
302
- "p": 0.5384615385,
303
- "r": 0.6363636364,
304
- "f": 0.5833333333
305
  },
306
- "flat:foreign": {
307
- "p": 0.5,
308
  "r": 0.3333333333,
309
- "f": 0.4
310
  },
311
  "orphan": {
312
  "p": 0.0,
313
  "r": 0.0,
314
  "f": 0.0
315
  },
316
- "vocative": {
317
- "p": 0.5,
318
  "r": 0.3333333333,
319
- "f": 0.4
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
320
  }
321
  },
322
- "lemma_acc": 0.8659237958
323
  }
1
  {
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2
  "token_acc": 0.9996405141,
3
  "token_p": 0.9980235379,
4
  "token_r": 0.9978442468,
5
  "token_f": 0.9979338843,
6
+ "pos_acc": 0.9743866271,
7
+ "morph_acc": 0.971641724,
8
+ "morph_micro_p": 0.9867911411,
9
+ "morph_micro_r": 0.9806553494,
10
+ "morph_micro_f": 0.9837136775,
11
  "morph_per_feat": {
12
  "Gender": {
13
+ "p": 0.990151826,
14
+ "r": 0.9853001225,
15
+ "f": 0.9877200164
16
  },
17
  "Number": {
18
+ "p": 0.9933721306,
19
  "r": 0.9866730893,
20
+ "f": 0.9900112776
21
  },
22
  "NumType": {
23
+ "p": 0.9886792453,
24
+ "r": 0.9667896679,
25
+ "f": 0.9776119403
26
  },
27
  "Definite": {
28
  "p": 0.9964454976,
30
  "f": 0.9970361589
31
  },
32
  "PronType": {
33
+ "p": 0.9904643449,
34
+ "r": 0.988006617,
35
+ "f": 0.9892339545
36
  },
37
  "Mood": {
38
+ "p": 0.9621489621,
39
+ "r": 0.9459783914,
40
+ "f": 0.9539951574
41
  },
42
  "Person": {
43
+ "p": 0.9703849951,
44
+ "r": 0.9515972894,
45
+ "f": 0.9608993157
46
  },
47
  "Tense": {
48
+ "p": 0.9640718563,
49
+ "r": 0.9591489362,
50
+ "f": 0.9616040956
51
  },
52
  "VerbForm": {
53
+ "p": 0.9735744089,
54
+ "r": 0.9702009702,
55
+ "f": 0.9718847622
56
  },
57
  "Degree": {
58
+ "p": 0.8823529412,
59
+ "r": 0.8823529412,
60
+ "f": 0.8823529412
61
  },
62
  "Clitic": {
63
+ "p": 0.9945945946,
64
+ "r": 0.9839572193,
65
+ "f": 0.9892473118
66
  },
67
  "Poss": {
68
+ "p": 1.0,
69
  "r": 1.0,
70
+ "f": 1.0
71
  },
72
  "Polarity": {
73
  "p": 1.0,
74
+ "r": 1.0,
75
+ "f": 1.0
76
  },
77
  "Foreign": {
78
+ "p": 0.5,
79
+ "r": 0.2,
80
+ "f": 0.2857142857
81
  }
82
  },
83
+ "tag_acc": 0.9704322818,
84
+ "sents_p": 0.965034965,
85
+ "sents_r": 0.9787234043,
86
+ "sents_f": 0.9718309859,
87
+ "dep_uas": 0.9112894926,
88
+ "dep_las": 0.8750192951,
89
  "dep_las_per_type": {
90
  "root": {
91
+ "p": 0.8933566434,
92
+ "r": 0.9060283688,
93
+ "f": 0.8996478873
94
  },
95
  "flat:name": {
96
+ "p": 0.9044585987,
97
+ "r": 0.8987341772,
98
+ "f": 0.9015873016
99
  },
100
  "case": {
101
+ "p": 0.978749241,
102
+ "r": 0.9811320755,
103
+ "f": 0.9799392097
104
  },
105
  "nmod": {
106
+ "p": 0.8202247191,
107
+ "r": 0.8210633947,
108
+ "f": 0.8206438426
109
  },
110
  "nummod": {
111
+ "p": 0.912568306,
112
+ "r": 0.9076086957,
113
+ "f": 0.9100817439
114
  },
115
  "det": {
116
+ "p": 0.9760809568,
117
+ "r": 0.9751838235,
118
+ "f": 0.9756321839
119
  },
120
  "nsubj": {
121
+ "p": 0.8664047151,
122
+ "r": 0.8513513514,
123
+ "f": 0.858812074
124
  },
125
  "aux": {
126
+ "p": 0.9493087558,
127
+ "r": 0.9493087558,
128
+ "f": 0.9493087558
129
  },
130
  "advmod": {
131
+ "p": 0.8139013453,
132
+ "r": 0.8325688073,
133
+ "f": 0.8231292517
134
  },
135
  "obj": {
136
+ "p": 0.8613138686,
137
+ "r": 0.885,
138
+ "f": 0.8729963009
139
  },
140
  "cc": {
141
+ "p": 0.9065420561,
142
+ "r": 0.8926380368,
143
+ "f": 0.8995363215
144
  },
145
  "conj": {
146
+ "p": 0.7012987013,
147
+ "r": 0.7297297297,
148
+ "f": 0.7152317881
149
  },
150
  "det:predet": {
151
+ "p": 0.9,
152
  "r": 1.0,
153
+ "f": 0.9473684211
154
  },
155
  "amod": {
156
+ "p": 0.9221374046,
157
+ "r": 0.9055472264,
158
+ "f": 0.9137670197
159
  },
160
  "mark": {
161
+ "p": 0.9265306122,
162
+ "r": 0.9380165289,
163
+ "f": 0.932238193
164
  },
165
  "cop": {
166
  "p": 0.828358209,
168
  "f": 0.8538461538
169
  },
170
  "xcomp": {
171
+ "p": 0.7717391304,
172
+ "r": 0.7395833333,
173
+ "f": 0.7553191489
174
  },
175
  "obl": {
176
+ "p": 0.8104776579,
177
+ "r": 0.769005848,
178
+ "f": 0.7891972993
179
  },
180
  "acl:relcl": {
181
+ "p": 0.7368421053,
182
+ "r": 0.7153284672,
183
+ "f": 0.7259259259
184
  },
185
  "acl": {
186
+ "p": 0.6476190476,
187
+ "r": 0.5913043478,
188
+ "f": 0.6181818182
189
  },
190
  "ccomp": {
191
  "p": 0.7,
193
  "f": 0.6885245902
194
  },
195
  "expl": {
196
+ "p": 0.9189189189,
197
+ "r": 0.8717948718,
198
+ "f": 0.8947368421
199
  },
200
  "nsubj:pass": {
201
+ "p": 0.7831325301,
202
+ "r": 0.8024691358,
203
+ "f": 0.7926829268
204
  },
205
  "aux:pass": {
206
  "p": 0.8658536585,
208
  "f": 0.8819875776
209
  },
210
  "parataxis": {
211
+ "p": 0.5454545455,
212
+ "r": 0.1935483871,
213
+ "f": 0.2857142857
214
+ },
215
+ "dep": {
216
+ "p": 0.0,
217
+ "r": 0.0,
218
+ "f": 0.0
219
  },
220
  "advcl": {
221
+ "p": 0.5732484076,
222
+ "r": 0.6338028169,
223
+ "f": 0.602006689
224
  },
225
  "det:poss": {
226
+ "p": 0.9577464789,
227
  "r": 1.0,
228
+ "f": 0.9784172662
229
  },
230
  "flat": {
231
  "p": 1.0,
233
  "f": 1.0
234
  },
235
  "appos": {
236
+ "p": 0.5757575758,
237
+ "r": 0.4871794872,
238
+ "f": 0.5277777778
239
  },
240
  "obl:agent": {
241
+ "p": 0.8409090909,
242
  "r": 0.880952381,
243
+ "f": 0.8604651163
 
 
 
 
 
244
  },
245
  "iobj": {
246
+ "p": 0.7619047619,
247
+ "r": 0.8,
248
+ "f": 0.7804878049
249
  },
250
  "expl:impers": {
251
+ "p": 0.6666666667,
252
  "r": 1.0,
253
+ "f": 0.8
254
  },
255
  "csubj": {
256
+ "p": 0.625,
257
+ "r": 0.3846153846,
258
+ "f": 0.4761904762
259
  },
260
  "compound": {
261
+ "p": 0.5862068966,
262
+ "r": 0.6538461538,
263
+ "f": 0.6181818182
264
  },
265
  "discourse": {
266
  "p": 0.0,
268
  "f": 0.0
269
  },
270
  "fixed": {
271
+ "p": 0.7333333333,
272
  "r": 0.7857142857,
273
+ "f": 0.7586206897
274
  },
275
  "expl:pass": {
276
+ "p": 0.6923076923,
277
+ "r": 0.8181818182,
278
+ "f": 0.75
279
  },
280
+ "vocative": {
281
+ "p": 0.25,
282
  "r": 0.3333333333,
283
+ "f": 0.2857142857
284
  },
285
  "orphan": {
286
  "p": 0.0,
287
  "r": 0.0,
288
  "f": 0.0
289
  },
290
+ "flat:foreign": {
291
+ "p": 0.3333333333,
292
  "r": 0.3333333333,
293
+ "f": 0.3333333333
294
+ }
295
+ },
296
+ "lemma_acc": 0.972232207,
297
+ "ents_p": 0.8742779079,
298
+ "ents_r": 0.8732213169,
299
+ "ents_f": 0.873749293,
300
+ "ents_per_type": {
301
+ "LOC": {
302
+ "p": 0.8831085177,
303
+ "r": 0.9129259694,
304
+ "f": 0.8977697315
305
+ },
306
+ "PER": {
307
+ "p": 0.9104382357,
308
+ "r": 0.9125088842,
309
+ "f": 0.9114723839
310
+ },
311
+ "MISC": {
312
+ "p": 0.7848994296,
313
+ "r": 0.7184666117,
314
+ "f": 0.750215208
315
+ },
316
+ "ORG": {
317
+ "p": 0.8381488737,
318
+ "r": 0.7737341772,
319
+ "f": 0.8046544429
320
  }
321
  },
322
+ "speed": 9628.8129127039
323
  }
attribute_ruler/patterns CHANGED
Binary files a/attribute_ruler/patterns and b/attribute_ruler/patterns differ
config.cfg CHANGED
@@ -10,7 +10,7 @@ seed = 0
10
 
11
  [nlp]
12
  lang = "it"
13
- pipeline = ["tok2vec","morphologizer","tagger","parser","senter","attribute_ruler","lemmatizer","ner"]
14
  disabled = ["senter"]
15
  before_creation = null
16
  after_creation = null
@@ -26,11 +26,22 @@ scorer = {"@scorers":"spacy.attribute_ruler_scorer.v1"}
26
  validate = false
27
 
28
  [components.lemmatizer]
29
- factory = "lemmatizer"
30
- mode = "pos_lookup"
31
- model = null
32
  overwrite = false
33
  scorer = {"@scorers":"spacy.lemmatizer_scorer.v1"}
 
 
 
 
 
 
 
 
 
 
 
34
 
35
  [components.morphologizer]
36
  factory = "morphologizer"
@@ -39,8 +50,9 @@ overwrite = true
39
  scorer = {"@scorers":"spacy.morphologizer_scorer.v1"}
40
 
41
  [components.morphologizer.model]
42
- @architectures = "spacy.Tagger.v1"
43
  nO = null
 
44
 
45
  [components.morphologizer.model.tok2vec]
46
  @architectures = "spacy.Tok2VecListener.v1"
@@ -70,7 +82,7 @@ nO = null
70
  @architectures = "spacy.MultiHashEmbed.v2"
71
  width = 96
72
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
73
- rows = [5000,2500,2500,2500,100]
74
  include_static_vectors = true
75
 
76
  [components.ner.model.tok2vec.encode]
@@ -108,8 +120,9 @@ overwrite = false
108
  scorer = {"@scorers":"spacy.senter_scorer.v1"}
109
 
110
  [components.senter.model]
111
- @architectures = "spacy.Tagger.v1"
112
  nO = null
 
113
 
114
  [components.senter.model.tok2vec]
115
  @architectures = "spacy.Tok2Vec.v2"
@@ -130,12 +143,14 @@ maxout_pieces = 2
130
 
131
  [components.tagger]
132
  factory = "tagger"
 
133
  overwrite = false
134
  scorer = {"@scorers":"spacy.tagger_scorer.v1"}
135
 
136
  [components.tagger.model]
137
- @architectures = "spacy.Tagger.v1"
138
  nO = null
 
139
 
140
  [components.tagger.model.tok2vec]
141
  @architectures = "spacy.Tok2VecListener.v1"
@@ -152,7 +167,7 @@ factory = "tok2vec"
152
  @architectures = "spacy.MultiHashEmbed.v2"
153
  width = ${components.tok2vec.model.encode:width}
154
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
155
- rows = [5000,2500,2500,2500,100]
156
  include_static_vectors = true
157
 
158
  [components.tok2vec.model.encode]
@@ -189,7 +204,7 @@ dropout = 0.1
189
  accumulate_gradient = 1
190
  patience = 5000
191
  max_epochs = 0
192
- max_steps = 0
193
  eval_frequency = 1000
194
  frozen_components = []
195
  before_to_disk = null
@@ -224,18 +239,18 @@ eps = 0.00000001
224
  learn_rate = 0.001
225
 
226
  [training.score_weights]
227
- pos_acc = 0.06
228
- morph_acc = 0.05
229
  morph_per_feat = null
230
- tag_acc = 0.06
231
  dep_uas = 0.0
232
- dep_las = 0.16
233
  dep_las_per_type = null
234
  sents_p = null
235
  sents_r = null
236
- sents_f = 0.02
237
- lemma_acc = 0.5
238
- ents_f = 0.16
239
  ents_p = 0.0
240
  ents_r = 0.0
241
  ents_per_type = null
@@ -252,6 +267,13 @@ after_init = null
252
 
253
  [initialize.components]
254
 
 
 
 
 
 
 
 
255
  [initialize.components.morphologizer]
256
 
257
  [initialize.components.morphologizer.labels]
10
 
11
  [nlp]
12
  lang = "it"
13
+ pipeline = ["tok2vec","morphologizer","tagger","parser","lemmatizer","senter","attribute_ruler","ner"]
14
  disabled = ["senter"]
15
  before_creation = null
16
  after_creation = null
26
  validate = false
27
 
28
  [components.lemmatizer]
29
+ factory = "trainable_lemmatizer"
30
+ backoff = "orth"
31
+ min_tree_freq = 3
32
  overwrite = false
33
  scorer = {"@scorers":"spacy.lemmatizer_scorer.v1"}
34
+ top_k = 1
35
+
36
+ [components.lemmatizer.model]
37
+ @architectures = "spacy.Tagger.v2"
38
+ nO = null
39
+ normalize = false
40
+
41
+ [components.lemmatizer.model.tok2vec]
42
+ @architectures = "spacy.Tok2VecListener.v1"
43
+ width = ${components.tok2vec.model.encode:width}
44
+ upstream = "tok2vec"
45
 
46
  [components.morphologizer]
47
  factory = "morphologizer"
50
  scorer = {"@scorers":"spacy.morphologizer_scorer.v1"}
51
 
52
  [components.morphologizer.model]
53
+ @architectures = "spacy.Tagger.v2"
54
  nO = null
55
+ normalize = false
56
 
57
  [components.morphologizer.model.tok2vec]
58
  @architectures = "spacy.Tok2VecListener.v1"
82
  @architectures = "spacy.MultiHashEmbed.v2"
83
  width = 96
84
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
85
+ rows = [5000,1000,2500,2500,50]
86
  include_static_vectors = true
87
 
88
  [components.ner.model.tok2vec.encode]
120
  scorer = {"@scorers":"spacy.senter_scorer.v1"}
121
 
122
  [components.senter.model]
123
+ @architectures = "spacy.Tagger.v2"
124
  nO = null
125
+ normalize = false
126
 
127
  [components.senter.model.tok2vec]
128
  @architectures = "spacy.Tok2Vec.v2"
143
 
144
  [components.tagger]
145
  factory = "tagger"
146
+ neg_prefix = "!"
147
  overwrite = false
148
  scorer = {"@scorers":"spacy.tagger_scorer.v1"}
149
 
150
  [components.tagger.model]
151
+ @architectures = "spacy.Tagger.v2"
152
  nO = null
153
+ normalize = false
154
 
155
  [components.tagger.model.tok2vec]
156
  @architectures = "spacy.Tok2VecListener.v1"
167
  @architectures = "spacy.MultiHashEmbed.v2"
168
  width = ${components.tok2vec.model.encode:width}
169
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
170
+ rows = [5000,1000,2500,2500,50]
171
  include_static_vectors = true
172
 
173
  [components.tok2vec.model.encode]
204
  accumulate_gradient = 1
205
  patience = 5000
206
  max_epochs = 0
207
+ max_steps = 100000
208
  eval_frequency = 1000
209
  frozen_components = []
210
  before_to_disk = null
239
  learn_rate = 0.001
240
 
241
  [training.score_weights]
242
+ pos_acc = 0.1
243
+ morph_acc = 0.09
244
  morph_per_feat = null
245
+ tag_acc = 0.1
246
  dep_uas = 0.0
247
+ dep_las = 0.29
248
  dep_las_per_type = null
249
  sents_p = null
250
  sents_r = null
251
+ sents_f = 0.04
252
+ lemma_acc = 0.1
253
+ ents_f = 0.29
254
  ents_p = 0.0
255
  ents_r = 0.0
256
  ents_per_type = null
267
 
268
  [initialize.components]
269
 
270
+ [initialize.components.lemmatizer]
271
+
272
+ [initialize.components.lemmatizer.labels]
273
+ @readers = "spacy.read_labels.v1"
274
+ path = "corpus/labels/trainable_lemmatizer.json"
275
+ require = false
276
+
277
  [initialize.components.morphologizer]
278
 
279
  [initialize.components.morphologizer.labels]
it_core_news_md-any-py3-none-any.whl CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5f5ebe629f3664af129cbf53192f820c0f1a6de0e7274cb671e91d10be2f2d60
3
- size 50723753
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:606adcc8d47e4230bfc19287586d339497164d9f1baeef311385d339bfdb80bc
3
+ size 42382058
lemmatizer/cfg ADDED
@@ -0,0 +1,729 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "labels":[
3
+ 1,
4
+ 2,
5
+ 6,
6
+ 8,
7
+ 10,
8
+ 12,
9
+ 15,
10
+ 17,
11
+ 19,
12
+ 21,
13
+ 23,
14
+ 25,
15
+ 28,
16
+ 31,
17
+ 35,
18
+ 37,
19
+ 39,
20
+ 41,
21
+ 43,
22
+ 45,
23
+ 47,
24
+ 49,
25
+ 50,
26
+ 52,
27
+ 54,
28
+ 56,
29
+ 58,
30
+ 60,
31
+ 63,
32
+ 66,
33
+ 69,
34
+ 71,
35
+ 74,
36
+ 76,
37
+ 78,
38
+ 80,
39
+ 82,
40
+ 84,
41
+ 87,
42
+ 89,
43
+ 92,
44
+ 95,
45
+ 97,
46
+ 99,
47
+ 101,
48
+ 104,
49
+ 106,
50
+ 108,
51
+ 110,
52
+ 112,
53
+ 113,
54
+ 115,
55
+ 117,
56
+ 119,
57
+ 121,
58
+ 123,
59
+ 125,
60
+ 127,
61
+ 128,
62
+ 130,
63
+ 133,
64
+ 135,
65
+ 139,
66
+ 141,
67
+ 142,
68
+ 144,
69
+ 145,
70
+ 147,
71
+ 150,
72
+ 152,
73
+ 154,
74
+ 157,
75
+ 159,
76
+ 160,
77
+ 162,
78
+ 164,
79
+ 165,
80
+ 167,
81
+ 169,
82
+ 171,
83
+ 174,
84
+ 176,
85
+ 178,
86
+ 181,
87
+ 184,
88
+ 187,
89
+ 189,
90
+ 191,
91
+ 195,
92
+ 196,
93
+ 198,
94
+ 200,
95
+ 202,
96
+ 205,
97
+ 207,
98
+ 208,
99
+ 210,
100
+ 212,
101
+ 214,
102
+ 216,
103
+ 218,
104
+ 221,
105
+ 223,
106
+ 226,
107
+ 228,
108
+ 229,
109
+ 231,
110
+ 233,
111
+ 234,
112
+ 235,
113
+ 237,
114
+ 239,
115
+ 241,
116
+ 243,
117
+ 244,
118
+ 247,
119
+ 249,
120
+ 250,
121
+ 251,
122
+ 253,
123
+ 255,
124
+ 257,
125
+ 259,
126
+ 261,
127
+ 263,
128
+ 265,
129
+ 269,
130
+ 271,
131
+ 273,
132
+ 275,
133
+ 277,
134
+ 279,
135
+ 281,
136
+ 283,
137
+ 285,
138
+ 287,
139
+ 289,
140
+ 291,
141
+ 294,
142
+ 297,
143
+ 299,
144
+ 301,
145
+ 302,
146
+ 303,
147
+ 305,
148
+ 307,
149
+ 309,
150
+ 312,
151
+ 314,
152
+ 315,
153
+ 318,
154
+ 320,
155
+ 322,
156
+ 324,
157
+ 326,
158
+ 328,
159
+ 331,
160
+ 334,
161
+ 335,
162
+ 338,
163
+ 340,
164
+ 342,
165
+ 347,
166
+ 349,
167
+ 353,
168
+ 354,
169
+ 356,
170
+ 360,
171
+ 363,
172
+ 365,
173
+ 366,
174
+ 368,
175
+ 370,
176
+ 372,
177
+ 374,
178
+ 376,
179
+ 377,
180
+ 378,
181
+ 379,
182
+ 382,
183
+ 383,
184
+ 385,
185
+ 389,
186
+ 392,
187
+ 393,
188
+ 396,
189
+ 398,
190
+ 402,
191
+ 403,
192
+ 405,
193
+ 407,
194
+ 151,
195
+ 409,
196
+ 410,
197
+ 413,
198
+ 415,
199
+ 417,
200
+ 420,
201
+ 423,
202
+ 427,
203
+ 428,
204
+ 430,
205
+ 432,
206
+ 433,
207
+ 435,
208
+ 437,
209
+ 439,
210
+ 441,
211
+ 445,
212
+ 446,
213
+ 449,
214
+ 451,
215
+ 453,
216
+ 454,
217
+ 455,
218
+ 457,
219
+ 458,
220
+ 461,
221
+ 463,
222
+ 465,
223
+ 170,
224
+ 467,
225
+ 469,
226
+ 471,
227
+ 473,
228
+ 476,
229
+ 477,
230
+ 478,
231
+ 479,
232
+ 481,
233
+ 483,
234
+ 485,
235
+ 488,
236
+ 489,
237
+ 491,
238
+ 494,
239
+ 498,
240
+ 500,
241
+ 502,
242
+ 504,
243
+ 507,
244
+ 509,
245
+ 511,
246
+ 515,
247
+ 518,
248
+ 520,
249
+ 521,
250
+ 522,
251
+ 523,
252
+ 525,
253
+ 527,
254
+ 530,
255
+ 531,
256
+ 533,
257
+ 535,
258
+ 538,
259
+ 539,
260
+ 542,
261
+ 544,
262
+ 546,
263
+ 548,
264
+ 549,
265
+ 550,
266
+ 553,
267
+ 555,
268
+ 558,
269
+ 560,
270
+ 561,
271
+ 562,
272
+ 564,
273
+ 565,
274
+ 567,
275
+ 570,
276
+ 573,
277
+ 575,
278
+ 578,
279
+ 579,
280
+ 582,
281
+ 584,
282
+ 586,
283
+ 588,
284
+ 590,
285
+ 592,
286
+ 594,
287
+ 596,
288
+ 598,
289
+ 601,
290
+ 602,
291
+ 603,
292
+ 605,
293
+ 607,
294
+ 610,
295
+ 611,
296
+ 612,
297
+ 614,
298
+ 616,
299
+ 618,
300
+ 620,
301
+ 623,
302
+ 625,
303
+ 628,
304
+ 630,
305
+ 632,
306
+ 634,
307
+ 635,
308
+ 638,
309
+ 639,
310
+ 641,
311
+ 642,
312
+ 643,
313
+ 647,
314
+ 650,
315
+ 654,
316
+ 656,
317
+ 657,
318
+ 658,
319
+ 660,
320
+ 662,
321
+ 663,
322
+ 665,
323
+ 668,
324
+ 669,
325
+ 673,
326
+ 675,
327
+ 678,
328
+ 680,
329
+ 682,
330
+ 684,
331
+ 686,
332
+ 688,
333
+ 690,
334
+ 693,
335
+ 695,
336
+ 697,
337
+ 699,
338
+ 701,
339
+ 703,
340
+ 705,
341
+ 706,
342
+ 709,
343
+ 711,
344
+ 713,
345
+ 716,
346
+ 718,
347
+ 720,
348
+ 722,
349
+ 724,
350
+ 726,
351
+ 727,
352
+ 730,
353
+ 732,
354
+ 733,
355
+ 734,
356
+ 736,
357
+ 738,
358
+ 741,
359
+ 744,
360
+ 747,
361
+ 749,
362
+ 751,
363
+ 752,
364
+ 754,
365
+ 757,
366
+ 761,
367
+ 762,
368
+ 764,
369
+ 766,
370
+ 770,
371
+ 772,
372
+ 774,
373
+ 776,
374
+ 777,
375
+ 780,
376
+ 782,
377
+ 785,
378
+ 787,
379
+ 789,
380
+ 791,
381
+ 792,
382
+ 794,
383
+ 796,
384
+ 798,
385
+ 800,
386
+ 802,
387
+ 805,
388
+ 807,
389
+ 808,
390
+ 810,
391
+ 812,
392
+ 813,
393
+ 814,
394
+ 816,
395
+ 818,
396
+ 822,
397
+ 824,
398
+ 826,
399
+ 828,
400
+ 829,
401
+ 832,
402
+ 833,
403
+ 835,
404
+ 837,
405
+ 838,
406
+ 839,
407
+ 840,
408
+ 841,
409
+ 842,
410
+ 844,
411
+ 847,
412
+ 850,
413
+ 853,
414
+ 855,
415
+ 856,
416
+ 859,
417
+ 860,
418
+ 862,
419
+ 863,
420
+ 864,
421
+ 865,
422
+ 867,
423
+ 868,
424
+ 870,
425
+ 873,
426
+ 875,
427
+ 877,
428
+ 879,
429
+ 880,
430
+ 883,
431
+ 886,
432
+ 887,
433
+ 890,
434
+ 891,
435
+ 892,
436
+ 894,
437
+ 897,
438
+ 898,
439
+ 901,
440
+ 904,
441
+ 905,
442
+ 906,
443
+ 872,
444
+ 907,
445
+ 908,
446
+ 909,
447
+ 910,
448
+ 911,
449
+ 913,
450
+ 914,
451
+ 915,
452
+ 916,
453
+ 919,
454
+ 920,
455
+ 922,
456
+ 925,
457
+ 926,
458
+ 930,
459
+ 932,
460
+ 934,
461
+ 935,
462
+ 936,
463
+ 938,
464
+ 939,
465
+ 941,
466
+ 944,
467
+ 946,
468
+ 949,
469
+ 950,
470
+ 954,
471
+ 955,
472
+ 958,
473
+ 960,
474
+ 964,
475
+ 967,
476
+ 970,
477
+ 973,
478
+ 974,
479
+ 976,
480
+ 977,
481
+ 979,
482
+ 982,
483
+ 983,
484
+ 984,
485
+ 986,
486
+ 987,
487
+ 989,
488
+ 990,
489
+ 993,
490
+ 995,
491
+ 996,
492
+ 999,
493
+ 1001,
494
+ 1004,
495
+ 1006,
496
+ 1008,
497
+ 1009,
498
+ 1011,
499
+ 1013,
500
+ 1015,
501
+ 1017,
502
+ 1018,
503
+ 1019,
504
+ 1021,
505
+ 1023,
506
+ 1025,
507
+ 1027,
508
+ 1028,
509
+ 1031,
510
+ 1034,
511
+ 1036,
512
+ 1039,
513
+ 1041,
514
+ 1043,
515
+ 1044,
516
+ 1045,
517
+ 1047,
518
+ 1049,
519
+ 1051,
520
+ 1053,
521
+ 1055,
522
+ 1058,
523
+ 1059,
524
+ 1061,
525
+ 1062,
526
+ 1065,
527
+ 1067,
528
+ 1070,
529
+ 1071,
530
+ 1072,
531
+ 1074,
532
+ 1078,
533
+ 1080,
534
+ 1081,
535
+ 1082,
536
+ 1085,
537
+ 1086,
538
+ 1087,
539
+ 1089,
540
+ 1090,
541
+ 1091,
542
+ 1093,
543
+ 1094,
544
+ 1095,
545
+ 1097,
546
+ 1098,
547
+ 1099,
548
+ 1101,
549
+ 1104,
550
+ 1105,
551
+ 1106,
552
+ 1107,
553
+ 1110,
554
+ 1111,
555
+ 1114,
556
+ 1116,
557
+ 1117,
558
+ 1118,
559
+ 1120,
560
+ 1125,
561
+ 1127,
562
+ 1129,
563
+ 1130,
564
+ 1132,
565
+ 1136,
566
+ 1137,
567
+ 1138,
568
+ 1139,
569
+ 1142,
570
+ 1143,
571
+ 1145,
572
+ 1146,
573
+ 1147,
574
+ 1149,
575
+ 1150,
576
+ 1153,
577
+ 1155,
578
+ 1156,
579
+ 1157,
580
+ 1158,
581
+ 1161,
582
+ 1162,
583
+ 1165,
584
+ 1168,
585
+ 1170,
586
+ 1173,
587
+ 1175,
588
+ 1177,
589
+ 1178,
590
+ 1180,
591
+ 1182,
592
+ 1184,
593
+ 1186,
594
+ 1187,
595
+ 1188,
596
+ 1190,
597
+ 1192,
598
+ 1193,
599
+ 1195,
600
+ 1197,
601
+ 1198,
602
+ 1200,
603
+ 1201,
604
+ 1202,
605
+ 1203,
606
+ 1204,
607
+ 1207,
608
+ 1208,
609
+ 1210,
610
+ 1211,
611
+ 1212,
612
+ 1214,
613
+ 1215,
614
+ 1217,
615
+ 1219,
616
+ 1221,
617
+ 1223,
618
+ 1225,
619
+ 1227,
620
+ 1231,
621
+ 1232,
622
+ 1233,
623
+ 1234,
624
+ 1236,
625
+ 1239,
626
+ 1240,
627
+ 1241,
628
+ 1242,
629
+ 1244,
630
+ 1246,
631
+ 1247,
632
+ 1248,
633
+ 1250,
634
+ 1251,
635
+ 1254,
636
+ 1255,
637
+ 1257,
638
+ 1259,
639
+ 1261,
640
+ 1264,
641
+ 1266,
642
+ 1267,
643
+ 1268,
644
+ 1270,
645
+ 1272,
646
+ 1276,
647
+ 1279,
648
+ 1280,
649
+ 1281,
650
+ 1284,
651
+ 1285,
652
+ 1288,
653
+ 1289,
654
+ 1291,
655
+ 1293,
656
+ 1294,
657
+ 1296,
658
+ 937,
659
+ 1297,
660
+ 1299,
661
+ 1302,
662
+ 1303,
663
+ 1304,
664
+ 1305,
665
+ 1308,
666
+ 1311,
667
+ 1313,
668
+ 1314,
669
+ 1317,
670
+ 1318,
671
+ 1319,
672
+ 1321,
673
+ 1324,
674
+ 1325,
675
+ 1327,
676
+ 1328,
677
+ 1330,
678
+ 1332,
679
+ 1334,
680
+ 1335,
681
+ 1337,
682
+ 1338,
683
+ 1340,
684
+ 1343,
685
+ 1345,
686
+ 1347,
687
+ 1349,
688
+ 1351,
689
+ 1352,
690
+ 1355,
691
+ 1356,
692
+ 1358,
693
+ 1360,
694
+ 1362,
695
+ 1364,
696
+ 1365,
697
+ 1367,
698
+ 1370,
699
+ 1371,
700
+ 1372,
701
+ 1374,
702
+ 1376,
703
+ 1378,
704
+ 1379,
705
+ 1380,
706
+ 1381,
707
+ 1383,
708
+ 1384,
709
+ 1385,
710
+ 1387,
711
+ 1388,
712
+ 1392,
713
+ 1393,
714
+ 1394,
715
+ 1395,
716
+ 1396,
717
+ 1397,
718
+ 1398,
719
+ 1399,
720
+ 1401,
721
+ 1402,
722
+ 1403,
723
+ 1404,
724
+ 1406,
725
+ 1409,
726
+ 1411,
727
+ 1412
728
+ ]
729
+ }
lemmatizer/{lookups/lookups.bin → model} RENAMED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:fba5a704a2baddb2914660c0e0bb3fc5c8d7e9099a10cb708896959b7533d917
3
- size 14835061
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6286d5f76074ec77e2a0e905a78cd85ead2a6a713e8ae13092c8f50eb2bfbdb1
3
+ size 281742
lemmatizer/trees ADDED
Binary file (140 kB). View file
meta.json CHANGED
@@ -1,14 +1,14 @@
1
  {
2
  "lang":"it",
3
  "name":"core_news_md",
4
- "version":"3.2.0",
5
- "description":"Italian pipeline optimized for CPU. Components: tok2vec, morphologizer, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"CC BY-NC-SA 3.0",
10
- "spacy_version":">=3.2.0,<3.3.0",
11
- "spacy_git_version":"bb26550e2",
12
  "vectors":{
13
  "width":300,
14
  "vectors":20000,
@@ -462,15 +462,8 @@
462
  "vocative",
463
  "xcomp"
464
  ],
465
- "senter":[
466
- "I",
467
- "S"
468
- ],
469
  "attribute_ruler":[
470
 
471
- ],
472
- "lemmatizer":[
473
-
474
  ],
475
  "ner":[
476
  "LOC",
@@ -484,8 +477,8 @@
484
  "morphologizer",
485
  "tagger",
486
  "parser",
487
- "attribute_ruler",
488
  "lemmatizer",
 
489
  "ner"
490
  ],
491
  "components":[
@@ -493,69 +486,39 @@
493
  "morphologizer",
494
  "tagger",
495
  "parser",
 
496
  "senter",
497
  "attribute_ruler",
498
- "lemmatizer",
499
  "ner"
500
  ],
501
  "disabled":[
502
  "senter"
503
  ],
504
  "performance":{
505
- "tag_acc":0.9707455175,
506
- "sents_p":0.9685314685,
507
- "sents_r":0.9822695035,
508
- "sents_f":0.9753521127,
509
- "ents_p":0.8753707462,
510
- "ents_r":0.8744493392,
511
- "ents_f":0.8749098001,
512
- "ents_per_type":{
513
- "ORG":{
514
- "p":0.0,
515
- "r":0.0,
516
- "f":0.0
517
- },
518
- "LOC":{
519
- "p":0.0,
520
- "r":0.0,
521
- "f":0.0
522
- },
523
- "PER":{
524
- "p":0.0,
525
- "r":0.0,
526
- "f":0.0
527
- },
528
- "MISC":{
529
- "p":0.0,
530
- "r":0.0,
531
- "f":0.0
532
- }
533
- },
534
- "speed":9476.0962556217,
535
  "token_acc":0.9996405141,
536
  "token_p":0.9980235379,
537
  "token_r":0.9978442468,
538
  "token_f":0.9979338843,
539
- "pos_acc":0.9746101649,
540
- "morph_acc":0.9720449438,
541
- "morph_micro_p":0.9863999603,
542
- "morph_micro_r":0.980704698,
543
- "morph_micro_f":0.9835440845,
544
  "morph_per_feat":{
545
  "Gender":{
546
- "p":0.9903510573,
547
- "r":0.9848917926,
548
- "f":0.9876138806
549
  },
550
  "Number":{
551
- "p":0.9924095607,
552
  "r":0.9866730893,
553
- "f":0.9895330113
554
  },
555
  "NumType":{
556
- "p":0.9847328244,
557
- "r":0.9520295203,
558
- "f":0.9681050657
559
  },
560
  "Definite":{
561
  "p":0.9964454976,
@@ -563,133 +526,137 @@
563
  "f":0.9970361589
564
  },
565
  "PronType":{
566
- "p":0.9916839917,
567
- "r":0.9863523573,
568
- "f":0.989010989
569
  },
570
  "Mood":{
571
- "p":0.9588377724,
572
- "r":0.9507803121,
573
- "f":0.9547920434
574
  },
575
  "Person":{
576
- "p":0.972575906,
577
- "r":0.9612778316,
578
- "f":0.9668938656
579
  },
580
  "Tense":{
581
- "p":0.9607173356,
582
- "r":0.9574468085,
583
- "f":0.9590792839
584
  },
585
  "VerbForm":{
586
- "p":0.9735927728,
587
- "r":0.9708939709,
588
- "f":0.972241499
589
  },
590
  "Degree":{
591
- "p":0.8235294118,
592
- "r":0.8235294118,
593
- "f":0.8235294118
594
  },
595
  "Clitic":{
596
- "p":1.0,
597
- "r":0.9786096257,
598
- "f":0.9891891892
599
  },
600
  "Poss":{
601
- "p":0.9852941176,
602
  "r":1.0,
603
- "f":0.9925925926
604
  },
605
  "Polarity":{
606
  "p":1.0,
607
- "r":0.6666666667,
608
- "f":0.8
609
  },
610
  "Foreign":{
611
- "p":1.0,
612
- "r":0.4,
613
- "f":0.5714285714
614
  }
615
  },
616
- "dep_uas":0.9097643791,
617
- "dep_las":0.8749871386,
 
 
 
 
618
  "dep_las_per_type":{
619
  "root":{
620
- "p":0.8898601399,
621
- "r":0.9024822695,
622
- "f":0.8961267606
623
  },
624
  "flat:name":{
625
- "p":0.9230769231,
626
- "r":0.9113924051,
627
- "f":0.9171974522
628
  },
629
  "case":{
630
- "p":0.9811435523,
631
- "r":0.9817407182,
632
- "f":0.9814420444
633
  },
634
  "nmod":{
635
- "p":0.8126888218,
636
- "r":0.8251533742,
637
- "f":0.8188736682
638
  },
639
  "nummod":{
640
- "p":0.8895027624,
641
- "r":0.875,
642
- "f":0.8821917808
643
  },
644
  "det":{
645
- "p":0.976146789,
646
- "r":0.9779411765,
647
- "f":0.9770431589
648
  },
649
  "nsubj":{
650
- "p":0.8474903475,
651
- "r":0.8474903475,
652
- "f":0.8474903475
653
  },
654
  "aux":{
655
- "p":0.9311926606,
656
- "r":0.935483871,
657
- "f":0.9333333333
658
  },
659
  "advmod":{
660
- "p":0.8159090909,
661
- "r":0.8233944954,
662
- "f":0.8196347032
663
  },
664
  "obj":{
665
- "p":0.8684210526,
666
- "r":0.9075,
667
- "f":0.8875305623
668
  },
669
  "cc":{
670
- "p":0.9242902208,
671
- "r":0.8987730061,
672
- "f":0.9113530327
673
  },
674
  "conj":{
675
- "p":0.670984456,
676
- "r":0.7,
677
- "f":0.6851851852
678
  },
679
  "det:predet":{
680
- "p":0.9473684211,
681
  "r":1.0,
682
- "f":0.972972973
683
  },
684
  "amod":{
685
- "p":0.9081325301,
686
- "r":0.904047976,
687
- "f":0.9060856499
688
  },
689
  "mark":{
690
- "p":0.9262295082,
691
- "r":0.9338842975,
692
- "f":0.9300411523
693
  },
694
  "cop":{
695
  "p":0.828358209,
@@ -697,24 +664,24 @@
697
  "f":0.8538461538
698
  },
699
  "xcomp":{
700
- "p":0.7578947368,
701
- "r":0.75,
702
- "f":0.7539267016
703
  },
704
  "obl":{
705
- "p":0.8242612753,
706
- "r":0.7748538012,
707
- "f":0.7987942728
708
  },
709
  "acl:relcl":{
710
- "p":0.7923076923,
711
- "r":0.7518248175,
712
- "f":0.7715355805
713
  },
714
  "acl":{
715
- "p":0.6363636364,
716
- "r":0.6086956522,
717
- "f":0.6222222222
718
  },
719
  "ccomp":{
720
  "p":0.7,
@@ -722,14 +689,14 @@
722
  "f":0.6885245902
723
  },
724
  "expl":{
725
- "p":0.9178082192,
726
- "r":0.858974359,
727
- "f":0.8874172185
728
  },
729
  "nsubj:pass":{
730
- "p":0.8289473684,
731
- "r":0.7777777778,
732
- "f":0.8025477707
733
  },
734
  "aux:pass":{
735
  "p":0.8658536585,
@@ -737,19 +704,24 @@
737
  "f":0.8819875776
738
  },
739
  "parataxis":{
740
- "p":0.5,
741
- "r":0.1290322581,
742
- "f":0.2051282051
 
 
 
 
 
743
  },
744
  "advcl":{
745
- "p":0.5961538462,
746
- "r":0.6549295775,
747
- "f":0.6241610738
748
  },
749
  "det:poss":{
750
- "p":0.9855072464,
751
  "r":1.0,
752
- "f":0.9927007299
753
  },
754
  "flat":{
755
  "p":1.0,
@@ -757,39 +729,34 @@
757
  "f":1.0
758
  },
759
  "appos":{
760
- "p":0.5,
761
- "r":0.4615384615,
762
- "f":0.48
763
  },
764
  "obl:agent":{
765
- "p":0.9487179487,
766
  "r":0.880952381,
767
- "f":0.9135802469
768
- },
769
- "dep":{
770
- "p":0.0,
771
- "r":0.0,
772
- "f":0.0
773
  },
774
  "iobj":{
775
- "p":0.8260869565,
776
- "r":0.95,
777
- "f":0.8837209302
778
  },
779
  "expl:impers":{
780
- "p":0.5333333333,
781
  "r":1.0,
782
- "f":0.6956521739
783
  },
784
  "csubj":{
785
- "p":0.75,
786
- "r":0.4615384615,
787
- "f":0.5714285714
788
  },
789
  "compound":{
790
- "p":0.64,
791
- "r":0.6153846154,
792
- "f":0.6274509804
793
  },
794
  "discourse":{
795
  "p":0.0,
@@ -797,32 +764,58 @@
797
  "f":0.0
798
  },
799
  "fixed":{
800
- "p":0.8461538462,
801
  "r":0.7857142857,
802
- "f":0.8148148148
803
  },
804
  "expl:pass":{
805
- "p":0.5384615385,
806
- "r":0.6363636364,
807
- "f":0.5833333333
808
  },
809
- "flat:foreign":{
810
- "p":0.5,
811
  "r":0.3333333333,
812
- "f":0.4
813
  },
814
  "orphan":{
815
  "p":0.0,
816
  "r":0.0,
817
  "f":0.0
818
  },
819
- "vocative":{
820
- "p":0.5,
821
  "r":0.3333333333,
822
- "f":0.4
823
  }
824
  },
825
- "lemma_acc":0.8659237958
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
826
  },
827
  "sources":[
828
  {
@@ -837,12 +830,6 @@
837
  "license":"CC BY 4.0",
838
  "author":"Joel Nothman, Nicky Ringland, Will Radford, Tara Murphy, James R Curran"
839
  },
840
- {
841
- "name":"Lemmatization Lists",
842
- "url":"https://github.com/michmech/lemmatization-lists/",
843
- "license":"ODbL",
844
- "author":"Michal M\u011bchura"
845
- },
846
  {
847
  "name":"Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)",
848
  "url":"https://spacy.io",
1
  {
2
  "lang":"it",
3
  "name":"core_news_md",
4
+ "version":"3.3.0",
5
+ "description":"Italian pipeline optimized for CPU. Components: tok2vec, morphologizer, tagger, parser, lemmatizer (trainable_lemmatizer), senter, ner.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"CC BY-NC-SA 3.0",
10
+ "spacy_version":">=3.3.0.dev0,<3.4.0",
11
+ "spacy_git_version":"849bef2de",
12
  "vectors":{
13
  "width":300,
14
  "vectors":20000,
462
  "vocative",
463
  "xcomp"
464
  ],
 
 
 
 
465
  "attribute_ruler":[
466
 
 
 
 
467
  ],
468
  "ner":[
469
  "LOC",
477
  "morphologizer",
478
  "tagger",
479
  "parser",
 
480
  "lemmatizer",
481
+ "attribute_ruler",
482
  "ner"
483
  ],
484
  "components":[
486
  "morphologizer",
487
  "tagger",
488
  "parser",
489
+ "lemmatizer",
490
  "senter",
491
  "attribute_ruler",
 
492
  "ner"
493
  ],
494
  "disabled":[
495
  "senter"
496
  ],
497
  "performance":{
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
498
  "token_acc":0.9996405141,
499
  "token_p":0.9980235379,
500
  "token_r":0.9978442468,
501
  "token_f":0.9979338843,
502
+ "pos_acc":0.9743866271,
503
+ "morph_acc":0.971641724,
504
+ "morph_micro_p":0.9867911411,
505
+ "morph_micro_r":0.9806553494,
506
+ "morph_micro_f":0.9837136775,
507
  "morph_per_feat":{
508
  "Gender":{
509
+ "p":0.990151826,
510
+ "r":0.9853001225,
511
+ "f":0.9877200164
512
  },
513
  "Number":{
514
+ "p":0.9933721306,
515
  "r":0.9866730893,
516
+ "f":0.9900112776
517
  },
518
  "NumType":{
519
+ "p":0.9886792453,
520
+ "r":0.9667896679,
521
+ "f":0.9776119403
522
  },
523
  "Definite":{
524
  "p":0.9964454976,
526
  "f":0.9970361589
527
  },
528
  "PronType":{
529
+ "p":0.9904643449,
530
+ "r":0.988006617,
531
+ "f":0.9892339545
532
  },
533
  "Mood":{
534
+ "p":0.9621489621,
535
+ "r":0.9459783914,
536
+ "f":0.9539951574
537
  },
538
  "Person":{
539
+ "p":0.9703849951,
540
+ "r":0.9515972894,
541
+ "f":0.9608993157
542
  },
543
  "Tense":{
544
+ "p":0.9640718563,
545
+ "r":0.9591489362,
546
+ "f":0.9616040956
547
  },
548
  "VerbForm":{
549
+ "p":0.9735744089,
550
+ "r":0.9702009702,
551
+ "f":0.9718847622
552
  },
553
  "Degree":{
554
+ "p":0.8823529412,
555
+ "r":0.8823529412,
556
+ "f":0.8823529412
557
  },
558
  "Clitic":{
559
+ "p":0.9945945946,
560
+ "r":0.9839572193,
561
+ "f":0.9892473118
562
  },
563
  "Poss":{
564
+ "p":1.0,
565
  "r":1.0,
566
+ "f":1.0
567
  },
568
  "Polarity":{
569
  "p":1.0,
570
+ "r":1.0,
571
+ "f":1.0
572
  },
573
  "Foreign":{
574
+ "p":0.5,
575
+ "r":0.2,
576
+ "f":0.2857142857
577
  }
578
  },
579
+ "tag_acc":0.9704322818,
580
+ "sents_p":0.965034965,
581
+ "sents_r":0.9787234043,
582
+ "sents_f":0.9718309859,
583
+ "dep_uas":0.9112894926,
584
+ "dep_las":0.8750192951,
585
  "dep_las_per_type":{
586
  "root":{
587
+ "p":0.8933566434,
588
+ "r":0.9060283688,
589
+ "f":0.8996478873
590
  },
591
  "flat:name":{
592
+ "p":0.9044585987,
593
+ "r":0.8987341772,
594
+ "f":0.9015873016
595
  },
596
  "case":{
597
+ "p":0.978749241,
598
+ "r":0.9811320755,
599
+ "f":0.9799392097
600
  },
601
  "nmod":{
602
+ "p":0.8202247191,
603
+ "r":0.8210633947,
604
+ "f":0.8206438426
605
  },
606
  "nummod":{
607
+ "p":0.912568306,
608
+ "r":0.9076086957,
609
+ "f":0.9100817439
610
  },
611
  "det":{
612
+ "p":0.9760809568,
613
+ "r":0.9751838235,
614
+ "f":0.9756321839
615
  },
616
  "nsubj":{
617
+ "p":0.8664047151,
618
+ "r":0.8513513514,
619
+ "f":0.858812074
620
  },
621
  "aux":{
622
+ "p":0.9493087558,
623
+ "r":0.9493087558,
624
+ "f":0.9493087558
625
  },
626
  "advmod":{
627
+ "p":0.8139013453,
628
+ "r":0.8325688073,
629
+ "f":0.8231292517
630
  },
631
  "obj":{
632
+ "p":0.8613138686,
633
+ "r":0.885,
634
+ "f":0.8729963009
635
  },
636
  "cc":{
637
+ "p":0.9065420561,
638
+ "r":0.8926380368,
639
+ "f":0.8995363215
640
  },
641
  "conj":{
642
+ "p":0.7012987013,
643
+ "r":0.7297297297,
644
+ "f":0.7152317881
645
  },
646
  "det:predet":{
647
+ "p":0.9,
648
  "r":1.0,
649
+ "f":0.9473684211
650
  },
651
  "amod":{
652
+ "p":0.9221374046,
653
+ "r":0.9055472264,
654
+ "f":0.9137670197
655
  },
656
  "mark":{
657
+ "p":0.9265306122,
658
+ "r":0.9380165289,
659
+ "f":0.932238193
660
  },
661
  "cop":{
662
  "p":0.828358209,
664
  "f":0.8538461538
665
  },
666
  "xcomp":{
667
+ "p":0.7717391304,
668
+ "r":0.7395833333,
669
+ "f":0.7553191489
670
  },
671
  "obl":{
672
+ "p":0.8104776579,
673
+ "r":0.769005848,
674
+ "f":0.7891972993
675
  },
676
  "acl:relcl":{
677
+ "p":0.7368421053,
678
+ "r":0.7153284672,
679
+ "f":0.7259259259
680
  },
681
  "acl":{
682
+ "p":0.6476190476,
683
+ "r":0.5913043478,
684
+ "f":0.6181818182
685
  },
686
  "ccomp":{
687
  "p":0.7,
689
  "f":0.6885245902
690
  },
691
  "expl":{
692
+ "p":0.9189189189,
693
+ "r":0.8717948718,
694
+ "f":0.8947368421
695
  },
696
  "nsubj:pass":{
697
+ "p":0.7831325301,
698
+ "r":0.8024691358,
699
+ "f":0.7926829268
700
  },
701
  "aux:pass":{
702
  "p":0.8658536585,
704
  "f":0.8819875776
705
  },
706
  "parataxis":{
707
+ "p":0.5454545455,
708
+ "r":0.1935483871,
709
+ "f":0.2857142857
710
+ },
711
+ "dep":{
712
+ "p":0.0,
713
+ "r":0.0,
714
+ "f":0.0
715
  },
716
  "advcl":{
717
+ "p":0.5732484076,
718
+ "r":0.6338028169,
719
+ "f":0.602006689
720
  },
721
  "det:poss":{
722
+ "p":0.9577464789,
723
  "r":1.0,
724
+ "f":0.9784172662
725
  },
726
  "flat":{
727
  "p":1.0,
729
  "f":1.0
730
  },
731
  "appos":{
732
+ "p":0.5757575758,
733
+ "r":0.4871794872,
734
+ "f":0.5277777778
735
  },
736
  "obl:agent":{
737
+ "p":0.8409090909,
738
  "r":0.880952381,
739
+ "f":0.8604651163
 
 
 
 
 
740
  },
741
  "iobj":{
742
+ "p":0.7619047619,
743
+ "r":0.8,
744
+ "f":0.7804878049
745
  },
746
  "expl:impers":{
747
+ "p":0.6666666667,
748
  "r":1.0,
749
+ "f":0.8
750
  },
751
  "csubj":{
752
+ "p":0.625,
753
+ "r":0.3846153846,
754
+ "f":0.4761904762
755
  },
756
  "compound":{
757
+ "p":0.5862068966,
758
+ "r":0.6538461538,
759
+ "f":0.6181818182
760
  },
761
  "discourse":{
762
  "p":0.0,
764
  "f":0.0
765
  },
766
  "fixed":{
767
+ "p":0.7333333333,
768
  "r":0.7857142857,
769
+ "f":0.7586206897
770
  },
771
  "expl:pass":{
772
+ "p":0.6923076923,
773
+ "r":0.8181818182,
774
+ "f":0.75
775
  },
776
+ "vocative":{
777
+ "p":0.25,
778
  "r":0.3333333333,
779
+ "f":0.2857142857
780
  },
781
  "orphan":{
782
  "p":0.0,
783
  "r":0.0,
784
  "f":0.0
785
  },
786
+ "flat:foreign":{
787
+ "p":0.3333333333,
788
  "r":0.3333333333,
789
+ "f":0.3333333333
790
  }
791
  },
792
+ "lemma_acc":0.972232207,
793
+ "ents_p":0.8742779079,
794
+ "ents_r":0.8732213169,
795
+ "ents_f":0.873749293,
796
+ "ents_per_type":{
797
+ "LOC":{
798
+ "p":0.8831085177,
799
+ "r":0.9129259694,
800
+ "f":0.8977697315
801
+ },
802
+ "PER":{
803
+ "p":0.9104382357,
804
+ "r":0.9125088842,
805
+ "f":0.9114723839
806
+ },
807
+ "MISC":{
808
+ "p":0.7848994296,
809
+ "r":0.7184666117,
810
+ "f":0.750215208
811
+ },
812
+ "ORG":{
813
+ "p":0.8381488737,
814
+ "r":0.7737341772,
815
+ "f":0.8046544429
816
+ }
817
+ },
818
+ "speed":9628.8129127039
819
  },
820
  "sources":[
821
  {
830
  "license":"CC BY 4.0",
831
  "author":"Joel Nothman, Nicky Ringland, Will Radford, Tara Murphy, James R Curran"
832
  },
 
 
 
 
 
 
833
  {
834
  "name":"Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)",
835
  "url":"https://spacy.io",
morphologizer/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:72ddc0c9f5d5df475cac4db1510d9d0d3d47a8984dd9466bff5df81cc513bc0d
3
- size 135802
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:336dd36775985d8fba4a0287dcaff833b71b38e8f54826037cebdc23e32f7eed
3
+ size 135854
ner/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:85a59120e6f27f2635a2ca77f437fdff60277e0af5f74de8982ad9aadba6d70c
3
- size 7091792
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c3176bea04c7fe34c4814da000fe243d6b27e0f6c9ffdbcdf90b76d7c22a9d9e
3
+ size 6496592
parser/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:cf85c48c09234b5ad0dba0365327a486f8ee2f85255a627ecb944c0df897aa25
3
  size 307688
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f2901739818acbe43be7d410a6ea7d66bbc101edec39e74e86d87c45a1d58b18
3
  size 307688
parser/moves CHANGED
@@ -1 +1 @@
1
- ��moves�#{"0":{"":131175},"1":{"":112873},"2":{"case":38562,"det":25835,"nsubj":9091,"advmod":7696,"cc":7544,"punct":6409,"mark":5854,"aux":5614,"amod":4663,"obl":4048,"cop":2625,"aux:pass":2047,"nummod":1979,"det:poss":1677,"expl":1601,"nsubj:pass":1416,"obj":1098,"advcl":962,"nmod":473,"iobj":449,"det:predet":378,"expl:impers":372,"expl:pass":314,"parataxis":90,"vocative":79,"csubj":55,"discourse":48,"acl":40,"dep":0},"3":{"punct":24646,"nmod":21570,"obl":11821,"amod":10763,"conj":9378,"obj":8029,"flat:name":3183,"acl:relcl":2890,"nsubj":2775,"acl":2675,"advcl":2515,"xcomp":2069,"advmod":1947,"ccomp":1334,"nummod":1227,"obl:agent":1027,"appos":827,"nsubj:pass":708,"compound":699,"fixed":662,"cop":575,"flat":532,"parataxis":282,"csubj":243,"flat:foreign":136,"det:poss":63,"dep":0},"4":{"ROOT":13117}}�cfg��neg_key�
1
+ ��moves�#{"0":{"":131325},"1":{"":113172},"2":{"case":38597,"det":25851,"nsubj":9089,"advmod":7698,"cc":7550,"punct":6456,"mark":5855,"aux":5614,"amod":4665,"obl":4048,"cop":2623,"aux:pass":2046,"nummod":2027,"det:poss":1677,"expl":1601,"nsubj:pass":1416,"obj":1096,"advcl":962,"nmod":473,"iobj":449,"det:predet":378,"expl:impers":372,"expl:pass":314,"parataxis":90,"vocative":79,"csubj":55,"discourse":48,"acl":40,"dep":0},"3":{"punct":24838,"nmod":21623,"obl":11817,"amod":10759,"conj":9391,"obj":8029,"flat:name":3204,"acl:relcl":2892,"nsubj":2778,"acl":2676,"advcl":2517,"xcomp":2070,"advmod":1949,"ccomp":1334,"nummod":1233,"obl:agent":1023,"appos":827,"nsubj:pass":709,"compound":709,"fixed":664,"cop":575,"flat":535,"parataxis":283,"csubj":243,"flat:foreign":136,"det:poss":63,"dep":0},"4":{"ROOT":13121}}�cfg��neg_key�
senter/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:55e7fc00de2cf424fe2af914897bb60c862847c9e25ee2615af3277fc1e00ec9
3
- size 219901
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:956ff342584a1ecca4e660307d1759209b58baba5d95e4a78db9d73227f20386
3
+ size 219953
tagger/cfg CHANGED
@@ -48,5 +48,6 @@
48
  "V_PC_PC",
49
  "X"
50
  ],
 
51
  "overwrite":false
52
  }
48
  "V_PC_PC",
49
  "X"
50
  ],
51
+ "neg_prefix":"!",
52
  "overwrite":false
53
  }
tagger/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:281ad9580604c58f33fbc09f60a205d4870d9384899a5e2ee0cb189475de0f26
3
- size 18613
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a5e305599c90fd4b1191b9a0dd3326c738e0381c2e5a6f314563718b5a7c5dcd
3
+ size 18665
tok2vec/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:53239b092aa520a3711ccdcfafc4b1d190f2a91e06b91234ee6d27711132c47c
3
- size 6960804
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d9bfbf8712ed4b0e352c981b25518a5a6ff6807fc54613b220ecb09a03233640
3
+ size 6365604
tokenizer CHANGED
@@ -1,3 +1,3 @@
1
- ��prefix_search� �^'[0-9][0-9]|^[0-9]+°|^§|^%|^=|^—|^–|^\+(?![0-9])|^…|^……|^,|^:|^;|^\!|^\?|^¿|^؟|^¡|^\(|^\)|^\[|^\]|^\{|^\}|^<|^>|^_|^#|^\*|^&|^。|^?|^!|^,|^、|^;|^:|^~|^·|^।|^،|^۔|^؛|^٪|^\.\.+|^…|^\'|^"|^”|^“|^`|^‘|^´|^’|^‚|^,|^„|^»|^«|^「|^」|^『|^』|^(|^)|^〔|^〕|^【|^】|^《|^》|^〈|^〉|^\$|^£|^€|^¥|^฿|^US\$|^C\$|^A\$|^₽|^﷼|^₴|^₠|^₡|^₢|^₣|^₤|^₥|^₦|^₧|^₨|^₩|^₪|^₫|^€|^₭|^₮|^₯|^₰|^₱|^₲|^₳|^₴|^₵|^₶|^₷|^₸|^₹|^₺|^₻|^₼|^₽|^₾|^₿|^[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]�suffix_search�2"…$|……$|,$|:$|;$|\!$|\?$|¿$|؟$|¡$|\($|\)$|\[$|\]$|\{$|\}$|<$|>$|_$|#$|\*$|&$|。$|?$|!$|,$|、$|;$|:$|~$|·$|।$|،$|۔$|؛$|٪$|\.\.+$|…$|\'$|"$|”$|“$|`$|‘$|´$|’$|‚$|,$|„$|»$|«$|「$|」$|『$|』$|($|)$|〔$|〕$|【$|】$|《$|》$|〈$|〉$|[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]$|'s$|'S$|’s$|’S$|—$|–$|(?<=[0-9])\+$|(?<=°[FfCcKk])\.$|(?<=[0-9])(?:\$|£|€|¥|฿|US\$|C\$|A\$|₽|﷼|₴|₠|₡|₢|₣|₤|₥|₦|₧|₨|₩|₪|₫|€|₭|₮|₯|₰|₱|₲|₳|₴|₵|₶|₷|₸|₹|₺|₻|₼|₽|₾|₿)$|(?<=[0-9])(?:km|km²|km³|m|m²|m³|dm|dm²|dm³|cm|cm²|cm³|mm|mm²|mm³|ha|µm|nm|yd|in|ft|kg|g|mg|µg|t|lb|oz|m/s|km/h|kmh|mph|hPa|Pa|mbar|mb|MB|kb|KB|gb|GB|tb|TB|T|G|M|K|%|км|км²|км³|м|м²|м³|дм|дм²|дм³|см|см²|см³|мм|мм²|мм³|нм|кг|г|мг|м/с|км/ч|кПа|Па|мбар|Кб|КБ|кб|Мб|МБ|мб|Гб|ГБ|гб|Тб|ТБ|тбكم|كم²|كم³|م|م²|م³|سم|سم²|سم³|مم|مم²|مم³|كم|غرام|جرام|جم|كغ|ملغ|كوب|اكواب)$|(?<=[0-9a-z\uFF41-\uFF5A\u00DF-\u00F6\u00F8-\u00FF\u0101\u0103\u0105\u0107\u0109\u010B\u010D\u010F\u0111\u0113\u0115\u0117\u0119\u011B\u011D\u011F\u0121\u0123\u0125\u0127\u0129\u012B\u012D\u012F\u0131\u0133\u0135\u0137\u0138\u013A\u013C\u013E\u0140\u0142\u0144\u0146\u0148\u0149\u014B\u014D\u014F\u0151\u0153\u0155\u0157\u0159\u015B\u015D\u015F\u0161\u0163\u0165\u0167\u0169\u016B\u016D\u016F\u0171\u0173\u0175\u0177\u017A\u017C\u017E\u017F\u0180\u0183\u0185\u0188\u018C\u018D\u0192\u0195\u0199-\u019B\u019E\u01A1\u01A3\u01A5\u01A8\u01AA\u01AB\u01AD\u01B0\u01B4\u01B6\u01B9\u01BA\u01BD-\u01BF\u01C6\u01C9\u01CC\u01CE\u01D0\u01D2\u01D4\u01D6\u01D8\u01DA\u01DC\u01DD\u01DF\u01E1\u01E3\u01E5\u01E7\u01E9\u01EB\u01ED\u01EF\u01F0\u01F3\u01F5\u01F9\u01FB\u01FD\u01FF\u0201\u0203\u0205\u0207\u0209\u020B\u020D\u020F\u0211\u0213\u0215\u0217\u0219\u021B\u021D\u021F\u0221\u0223\u0225\u0227\u0229\u022B\u022D\u022F\u0231\u0233-\u0239\u023C\u023F\u0240\u0242\u0247\u0249\u024B\u024D\u024F\u2C61\u2C65\u2C66\u2C68\u2C6A\u2C6C\u2C71\u2C73\u2C74\u2C76-\u2C7B\uA723\uA725\uA727\uA729\uA72B\uA72D\uA72F-\uA731\uA733\uA735\uA737\uA739\uA73B\uA73D\uA73F\uA741\uA743\uA745\uA747\uA749\uA74B\uA74D\uA74F\uA751\uA753\uA755\uA757\uA759\uA75B\uA75D\uA75F\uA761\uA763\uA765\uA767\uA769\uA76B\uA76D\uA76F\uA771-\uA778\uA77A\uA77C\uA77F\uA781\uA783\uA785\uA787\uA78C\uA78E\uA791\uA793-\uA795\uA797\uA799\uA79B\uA79D\uA79F\uA7A1\uA7A3\uA7A5\uA7A7\uA7A9\uA7AF\uA7B5\uA7B7\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E01\u1E03\u1E05\u1E07\u1E09\u1E0B\u1E0D\u1E0F\u1E11\u1E13\u1E15\u1E17\u1E19\u1E1B\u1E1D\u1E1F\u1E21\u1E23\u1E25\u1E27\u1E29\u1E2B\u1E2D\u1E2F\u1E31\u1E33\u1E35\u1E37\u1E39\u1E3B\u1E3D\u1E3F\u1E41\u1E43\u1E45\u1E47\u1E49\u1E4B\u1E4D\u1E4F\u1E51\u1E53\u1E55\u1E57\u1E59\u1E5B\u1E5D\u1E5F\u1E61\u1E63\u1E65\u1E67\u1E69\u1E6B\u1E6D\u1E6F\u1E71\u1E73\u1E75\u1E77\u1E79\u1E7B\u1E7D\u1E7F\u1E81\u1E83\u1E85\u1E87\u1E89\u1E8B\u1E8D\u1E8F\u1E91\u1E93\u1E95-\u1E9D\u1E9F\u1EA1\u1EA3\u1EA5\u1EA7\u1EA9\u1EAB\u1EAD\u1EAF\u1EB1\u1EB3\u1EB5\u1EB7\u1EB9\u1EBB\u1EBD\u1EBF\u1EC1\u1EC3\u1EC5\u1EC7\u1EC9\u1ECB\u1ECD\u1ECF\u1ED1\u1ED3\u1ED5\u1ED7\u1ED9\u1EDB\u1EDD\u1EDF\u1EE1\u1EE3\u1EE5\u1EE7\u1EE9\u1EEB\u1EED\u1EEF\u1EF1\u1EF3\u1EF5\u1EF7\u1EF9\u1EFB\u1EFD\u1EFFёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґѓѕјљњќѐѝ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F%²\-\+…|……|,|:|;|\!|\?|¿|؟|¡|\(|\)|\[|\]|\{|\}|<|>|_|#|\*|&|。|?|!|,|、|;|:|~|·|।|،|۔|؛|٪(?:\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉)])\.$|(?<=[A-Z\uFF21-\uFF3A\u00C0-\u00D6\u00D8-\u00DE\u0100\u0102\u0104\u0106\u0108\u010A\u010C\u010E\u0110\u0112\u0114\u0116\u0118\u011A\u011C\u011E\u0120\u0122\u0124\u0126\u0128\u012A\u012C\u012E\u0130\u0132\u0134\u0136\u0139\u013B\u013D\u013F\u0141\u0143\u0145\u0147\u014A\u014C\u014E\u0150\u0152\u0154\u0156\u0158\u015A\u015C\u015E\u0160\u0162\u0164\u0166\u0168\u016A\u016C\u016E\u0170\u0172\u0174\u0176\u0178\u0179\u017B\u017D\u0181\u0182\u0184\u0186\u0187\u0189-\u018B\u018E-\u0191\u0193\u0194\u0196-\u0198\u019C\u019D\u019F\u01A0\u01A2\u01A4\u01A6\u01A7\u01A9\u01AC\u01AE\u01AF\u01B1-\u01B3\u01B5\u01B7\u01B8\u01BC\u01C4\u01C7\u01CA\u01CD\u01CF\u01D1\u01D3\u01D5\u01D7\u01D9\u01DB\u01DE\u01E0\u01E2\u01E4\u01E6\u01E8\u01EA\u01EC\u01EE\u01F1\u01F4\u01F6-\u01F8\u01FA\u01FC\u01FE\u0200\u0202\u0204\u0206\u0208\u020A\u020C\u020E\u0210\u0212\u0214\u0216\u0218\u021A\u021C\u021E\u0220\u0222\u0224\u0226\u0228\u022A\u022C\u022E\u0230\u0232\u023A\u023B\u023D\u023E\u0241\u0243-\u0246\u0248\u024A\u024C\u024E\u2C60\u2C62-\u2C64\u2C67\u2C69\u2C6B\u2C6D-\u2C70\u2C72\u2C75\u2C7E\u2C7F\uA722\uA724\uA726\uA728\uA72A\uA72C\uA72E\uA732\uA734\uA736\uA738\uA73A\uA73C\uA73E\uA740\uA742\uA744\uA746\uA748\uA74A\uA74C\uA74E\uA750\uA752\uA754\uA756\uA758\uA75A\uA75C\uA75E\uA760\uA762\uA764\uA766\uA768\uA76A\uA76C\uA76E\uA779\uA77B\uA77D\uA77E\uA780\uA782\uA784\uA786\uA78B\uA78D\uA790\uA792\uA796\uA798\uA79A\uA79C\uA79E\uA7A0\uA7A2\uA7A4\uA7A6\uA7A8\uA7AA-\uA7AE\uA7B0-\uA7B4\uA7B6\uA7B8\u1E00\u1E02\u1E04\u1E06\u1E08\u1E0A\u1E0C\u1E0E\u1E10\u1E12\u1E14\u1E16\u1E18\u1E1A\u1E1C\u1E1E\u1E20\u1E22\u1E24\u1E26\u1E28\u1E2A\u1E2C\u1E2E\u1E30\u1E32\u1E34\u1E36\u1E38\u1E3A\u1E3C\u1E3E\u1E40\u1E42\u1E44\u1E46\u1E48\u1E4A\u1E4C\u1E4E\u1E50\u1E52\u1E54\u1E56\u1E58\u1E5A\u1E5C\u1E5E\u1E60\u1E62\u1E64\u1E66\u1E68\u1E6A\u1E6C\u1E6E\u1E70\u1E72\u1E74\u1E76\u1E78\u1E7A\u1E7C\u1E7E\u1E80\u1E82\u1E84\u1E86\u1E88\u1E8A\u1E8C\u1E8E\u1E90\u1E92\u1E94\u1E9E\u1EA0\u1EA2\u1EA4\u1EA6\u1EA8\u1EAA\u1EAC\u1EAE\u1EB0\u1EB2\u1EB4\u1EB6\u1EB8\u1EBA\u1EBC\u1EBE\u1EC0\u1EC2\u1EC4\u1EC6\u1EC8\u1ECA\u1ECC\u1ECE\u1ED0\u1ED2\u1ED4\u1ED6\u1ED8\u1EDA\u1EDC\u1EDE\u1EE0\u1EE2\u1EE4\u1EE6\u1EE8\u1EEA\u1EEC\u1EEE\u1EF0\u1EF2\u1EF4\u1EF6\u1EF8\u1EFA\u1EFC\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F][A-Z\uFF21-\uFF3A\u00C0-\u00D6\u00D8-\u00DE\u0100\u0102\u0104\u0106\u0108\u010A\u010C\u010E\u0110\u0112\u0114\u0116\u0118\u011A\u011C\u011E\u0120\u0122\u0124\u0126\u0128\u012A\u012C\u012E\u0130\u0132\u0134\u0136\u0139\u013B\u013D\u013F\u0141\u0143\u0145\u0147\u014A\u014C\u014E\u0150\u0152\u0154\u0156\u0158\u015A\u015C\u015E\u0160\u0162\u0164\u0166\u0168\u016A\u016C\u016E\u0170\u0172\u0174\u0176\u0178\u0179\u017B\u017D\u0181\u0182\u0184\u0186\u0187\u0189-\u018B\u018E-\u0191\u0193\u0194\u0196-\u0198\u019C\u019D\u019F\u01A0\u01A2\u01A4\u01A6\u01A7\u01A9\u01AC\u01AE\u01AF\u01B1-\u01B3\u01B5\u01B7\u01B8\u01BC\u01C4\u01C7\u01CA\u01CD\u01CF\u01D1\u01D3\u01D5\u01D7\u01D9\u01DB\u01DE\u01E0\u01E2\u01E4\u01E6\u01E8\u01EA\u01EC\u01EE\u01F1\u01F4\u01F6-\u01F8\u01FA\u01FC\u01FE\u0200\u0202\u0204\u0206\u0208\u020A\u020C\u020E\u0210\u0212\u0214\u0216\u0218\u021A\u021C\u021E\u0220\u0222\u0224\u0226\u0228\u022A\u022C\u022E\u0230\u0232\u023A\u023B\u023D\u023E\u0241\u0243-\u0246\u0248\u024A\u024C\u024E\u2C60\u2C62-\u2C64\u2C67\u2C69\u2C6B\u2C6D-\u2C70\u2C72\u2C75\u2C7E\u2C7F\uA722\uA724\uA726\uA728\uA72A\uA72C\uA72E\uA732\uA734\uA736\uA738\uA73A\uA73C\uA73E\uA740\uA742\uA744\uA746\uA748\uA74A\uA74C\uA74E\uA750\uA752\uA754\uA756\uA758\uA75A\uA75C\uA75E\uA760\uA762\uA764\uA766\uA768\uA76A\uA76C\uA76E\uA779\uA77B\uA77D\uA77E\uA780\uA782\uA784\uA786\uA78B\uA78D\uA790\uA792\uA796\uA798\uA79A\uA79C\uA79E\uA7A0\uA7A2\uA7A4\uA7A6\uA7A8\uA7AA-\uA7AE\uA7B0-\uA7B4\uA7B6\uA7B8\u1E00\u1E02\u1E04\u1E06\u1E08\u1E0A\u1E0C\u1E0E\u1E10\u1E12\u1E14\u1E16\u1E18\u1E1A\u1E1C\u1E1E\u1E20\u1E22\u1E24\u1E26\u1E28\u1E2A\u1E2C\u1E2E\u1E30\u1E32\u1E34\u1E36\u1E38\u1E3A\u1E3C\u1E3E\u1E40\u1E42\u1E44\u1E46\u1E48\u1E4A\u1E4C\u1E4E\u1E50\u1E52\u1E54\u1E56\u1E58\u1E5A\u1E5C\u1E5E\u1E60\u1E62\u1E64\u1E66\u1E68\u1E6A\u1E6C\u1E6E\u1E70\u1E72\u1E74\u1E76\u1E78\u1E7A\u1E7C\u1E7E\u1E80\u1E82\u1E84\u1E86\u1E88\u1E8A\u1E8C\u1E8E\u1E90\u1E92\u1E94\u1E9E\u1EA0\u1EA2\u1EA4\u1EA6\u1EA8\u1EAA\u1EAC\u1EAE\u1EB0\u1EB2\u1EB4\u1EB6\u1EB8\u1EBA\u1EBC\u1EBE\u1EC0\u1EC2\u1EC4\u1EC6\u1EC8\u1ECA\u1ECC\u1ECE\u1ED0\u1ED2\u1ED4\u1ED6\u1ED8\u1EDA\u1EDC\u1EDE\u1EE0\u1EE2\u1EE4\u1EE6\u1EE8\u1EEA\u1EEC\u1EEE\u1EF0\u1EF2\u1EF4\u1EF6\u1EF8\u1EFA\u1EFC\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])\.$�infix_finditer�N�\.\.+|…|[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]|(?<=[0-9])[+\-\*^](?=[0-9-])|(?<=[a-z\uFF41-\uFF5A\u00DF-\u00F6\u00F8-\u00FF\u0101\u0103\u0105\u0107\u0109\u010B\u010D\u010F\u0111\u0113\u0115\u0117\u0119\u011B\u011D\u011F\u0121\u0123\u0125\u0127\u0129\u012B\u012D\u012F\u0131\u0133\u0135\u0137\u0138\u013A\u013C\u013E\u0140\u0142\u0144\u0146\u0148\u0149\u014B\u014D\u014F\u0151\u0153\u0155\u0157\u0159\u015B\u015D\u015F\u0161\u0163\u0165\u0167\u0169\u016B\u016D\u016F\u0171\u0173\u0175\u0177\u017A\u017C\u017E\u017F\u0180\u0183\u0185\u0188\u018C\u018D\u0192\u0195\u0199-\u019B\u019E\u01A1\u01A3\u01A5\u01A8\u01AA\u01AB\u01AD\u01B0\u01B4\u01B6\u01B9\u01BA\u01BD-\u01BF\u01C6\u01C9\u01CC\u01CE\u01D0\u01D2\u01D4\u01D6\u01D8\u01DA\u01DC\u01DD\u01DF\u01E1\u01E3\u01E5\u01E7\u01E9\u01EB\u01ED\u01EF\u01F0\u01F3\u01F5\u01F9\u01FB\u01FD\u01FF\u0201\u0203\u0205\u0207\u0209\u020B\u020D\u020F\u0211\u0213\u0215\u0217\u0219\u021B\u021D\u021F\u0221\u0223\u0225\u0227\u0229\u022B\u022D\u022F\u0231\u0233-\u0239\u023C\u023F\u0240\u0242\u0247\u0249\u024B\u024D\u024F\u2C61\u2C65\u2C66\u2C68\u2C6A\u2C6C\u2C71\u2C73\u2C74\u2C76-\u2C7B\uA723\uA725\uA727\uA729\uA72B\uA72D\uA72F-\uA731\uA733\uA735\uA737\uA739\uA73B\uA73D\uA73F\uA741\uA743\uA745\uA747\uA749\uA74B\uA74D\uA74F\uA751\uA753\uA755\uA757\uA759\uA75B\uA75D\uA75F\uA761\uA763\uA765\uA767\uA769\uA76B\uA76D\uA76F\uA771-\uA778\uA77A\uA77C\uA77F\uA781\uA783\uA785\uA787\uA78C\uA78E\uA791\uA793-\uA795\uA797\uA799\uA79B\uA79D\uA79F\uA7A1\uA7A3\uA7A5\uA7A7\uA7A9\uA7AF\uA7B5\uA7B7\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E01\u1E03\u1E05\u1E07\u1E09\u1E0B\u1E0D\u1E0F\u1E11\u1E13\u1E15\u1E17\u1E19\u1E1B\u1E1D\u1E1F\u1E21\u1E23\u1E25\u1E27\u1E29\u1E2B\u1E2D\u1E2F\u1E31\u1E33\u1E35\u1E37\u1E39\u1E3B\u1E3D\u1E3F\u1E41\u1E43\u1E45\u1E47\u1E49\u1E4B\u1E4D\u1E4F\u1E51\u1E53\u1E55\u1E57\u1E59\u1E5B\u1E5D\u1E5F\u1E61\u1E63\u1E65\u1E67\u1E69\u1E6B\u1E6D\u1E6F\u1E71\u1E73\u1E75\u1E77\u1E79\u1E7B\u1E7D\u1E7F\u1E81\u1E83\u1E85\u1E87\u1E89\u1E8B\u1E8D\u1E8F\u1E91\u1E93\u1E95-\u1E9D\u1E9F\u1EA1\u1EA3\u1EA5\u1EA7\u1EA9\u1EAB\u1EAD\u1EAF\u1EB1\u1EB3\u1EB5\u1EB7\u1EB9\u1EBB\u1EBD\u1EBF\u1EC1\u1EC3\u1EC5\u1EC7\u1EC9\u1ECB\u1ECD\u1ECF\u1ED1\u1ED3\u1ED5\u1ED7\u1ED9\u1EDB\u1EDD\u1EDF\u1EE1\u1EE3\u1EE5\u1EE7\u1EE9\u1EEB\u1EED\u1EEF\u1EF1\u1EF3\u1EF5\u1EF7\u1EF9\u1EFB\u1EFD\u1EFFёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґѓѕјљњќѐѝ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉])\.(?=[A-Z\uFF21-\uFF3A\u00C0-\u00D6\u00D8-\u00DE\u0100\u0102\u0104\u0106\u0108\u010A\u010C\u010E\u0110\u0112\u0114\u0116\u0118\u011A\u011C\u011E\u0120\u0122\u0124\u0126\u0128\u012A\u012C\u012E\u0130\u0132\u0134\u0136\u0139\u013B\u013D\u013F\u0141\u0143\u0145\u0147\u014A\u014C\u014E\u0150\u0152\u0154\u0156\u0158\u015A\u015C\u015E\u0160\u0162\u0164\u0166\u0168\u016A\u016C\u016E\u0170\u0172\u0174\u0176\u0178\u0179\u017B\u017D\u0181\u0182\u0184\u0186\u0187\u0189-\u018B\u018E-\u0191\u0193\u0194\u0196-\u0198\u019C\u019D\u019F\u01A0\u01A2\u01A4\u01A6\u01A7\u01A9\u01AC\u01AE\u01AF\u01B1-\u01B3\u01B5\u01B7\u01B8\u01BC\u01C4\u01C7\u01CA\u01CD\u01CF\u01D1\u01D3\u01D5\u01D7\u01D9\u01DB\u01DE\u01E0\u01E2\u01E4\u01E6\u01E8\u01EA\u01EC\u01EE\u01F1\u01F4\u01F6-\u01F8\u01FA\u01FC\u01FE\u0200\u0202\u0204\u0206\u0208\u020A\u020C\u020E\u0210\u0212\u0214\u0216\u0218\u021A\u021C\u021E\u0220\u0222\u0224\u0226\u0228\u022A\u022C\u022E\u0230\u0232\u023A\u023B\u023D\u023E\u0241\u0243-\u0246\u0248\u024A\u024C\u024E\u2C60\u2C62-\u2C64\u2C67\u2C69\u2C6B\u2C6D-\u2C70\u2C72\u2C75\u2C7E\u2C7F\uA722\uA724\uA726\uA728\uA72A\uA72C\uA72E\uA732\uA734\uA736\uA738\uA73A\uA73C\uA73E\uA740\uA742\uA744\uA746\uA748\uA74A\uA74C\uA74E\uA750\uA752\uA754\uA756\uA758\uA75A\uA75C\uA75E\uA760\uA762\uA764\uA766\uA768\uA76A\uA76C\uA76E\uA779\uA77B\uA77D\uA77E\uA780\uA782\uA784\uA786\uA78B\uA78D\uA790\uA792\uA796\uA798\uA79A\uA79C\uA79E\uA7A0\uA7A2\uA7A4\uA7A6\uA7A8\uA7AA-\uA7AE\uA7B0-\uA7B4\uA7B6\uA7B8\u1E00\u1E02\u1E04\u1E06\u1E08\u1E0A\u1E0C\u1E0E\u1E10\u1E12\u1E14\u1E16\u1E18\u1E1A\u1E1C\u1E1E\u1E20\u1E22\u1E24\u1E26\u1E28\u1E2A\u1E2C\u1E2E\u1E30\u1E32\u1E34\u1E36\u1E38\u1E3A\u1E3C\u1E3E\u1E40\u1E42\u1E44\u1E46\u1E48\u1E4A\u1E4C\u1E4E\u1E50\u1E52\u1E54\u1E56\u1E58\u1E5A\u1E5C\u1E5E\u1E60\u1E62\u1E64\u1E66\u1E68\u1E6A\u1E6C\u1E6E\u1E70\u1E72\u1E74\u1E76\u1E78\u1E7A\u1E7C\u1E7E\u1E80\u1E82\u1E84\u1E86\u1E88\u1E8A\u1E8C\u1E8E\u1E90\u1E92\u1E94\u1E9E\u1EA0\u1EA2\u1EA4\u1EA6\u1EA8\u1EAA\u1EAC\u1EAE\u1EB0\u1EB2\u1EB4\u1EB6\u1EB8\u1EBA\u1EBC\u1EBE\u1EC0\u1EC2\u1EC4\u1EC6\u1EC8\u1ECA\u1ECC\u1ECE\u1ED0\u1ED2\u1ED4\u1ED6\u1ED8\u1EDA\u1EDC\u1EDE\u1EE0\u1EE2\u1EE4\u1EE6\u1EE8\u1EEA\u1EEC\u1EEE\u1EF0\u1EF2\u1EF4\u1EF6\u1EF8\u1EFA\u1EFC\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F]),(?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])(?:-|–|—|--|---|——|~)(?=[a-z\uFF41-\uFF5A\u00DF-\u00F6\u00F8-\u00FF\u0101\u0103\u0105\u0107\u0109\u010B\u010D\u010F\u0111\u0113\u0115\u0117\u0119\u011B\u011D\u011F\u0121\u0123\u0125\u0127\u0129\u012B\u012D\u012F\u0131\u0133\u0135\u0137\u0138\u013A\u013C\u013E\u0140\u0142\u0144\u0146\u0148\u0149\u014B\u014D\u014F\u0151\u0153\u0155\u0157\u0159\u015B\u015D\u015F\u0161\u0163\u0165\u0167\u0169\u016B\u016D\u016F\u0171\u0173\u0175\u0177\u017A\u017C\u017E\u017F\u0180\u0183\u0185\u0188\u018C\u018D\u0192\u0195\u0199-\u019B\u019E\u01A1\u01A3\u01A5\u01A8\u01AA\u01AB\u01AD\u01B0\u01B4\u01B6\u01B9\u01BA\u01BD-\u01BF\u01C6\u01C9\u01CC\u01CE\u01D0\u01D2\u01D4\u01D6\u01D8\u01DA\u01DC\u01DD\u01DF\u01E1\u01E3\u01E5\u01E7\u01E9\u01EB\u01ED\u01EF\u01F0\u01F3\u01F5\u01F9\u01FB\u01FD\u01FF\u0201\u0203\u0205\u0207\u0209\u020B\u020D\u020F\u0211\u0213\u0215\u0217\u0219\u021B\u021D\u021F\u0221\u0223\u0225\u0227\u0229\u022B\u022D\u022F\u0231\u0233-\u0239\u023C\u023F\u0240\u0242\u0247\u0249\u024B\u024D\u024F\u2C61\u2C65\u2C66\u2C68\u2C6A\u2C6C\u2C71\u2C73\u2C74\u2C76-\u2C7B\uA723\uA725\uA727\uA729\uA72B\uA72D\uA72F-\uA731\uA733\uA735\uA737\uA739\uA73B\uA73D\uA73F\uA741\uA743\uA745\uA747\uA749\uA74B\uA74D\uA74F\uA751\uA753\uA755\uA757\uA759\uA75B\uA75D\uA75F\uA761\uA763\uA765\uA767\uA769\uA76B\uA76D\uA76F\uA771-\uA778\uA77A\uA77C\uA77F\uA781\uA783\uA785\uA787\uA78C\uA78E\uA791\uA793-\uA795\uA797\uA799\uA79B\uA79D\uA79F\uA7A1\uA7A3\uA7A5\uA7A7\uA7A9\uA7AF\uA7B5\uA7B7\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E01\u1E03\u1E05\u1E07\u1E09\u1E0B\u1E0D\u1E0F\u1E11\u1E13\u1E15\u1E17\u1E19\u1E1B\u1E1D\u1E1F\u1E21\u1E23\u1E25\u1E27\u1E29\u1E2B\u1E2D\u1E2F\u1E31\u1E33\u1E35\u1E37\u1E39\u1E3B\u1E3D\u1E3F\u1E41\u1E43\u1E45\u1E47\u1E49\u1E4B\u1E4D\u1E4F\u1E51\u1E53\u1E55\u1E57\u1E59\u1E5B\u1E5D\u1E5F\u1E61\u1E63\u1E65\u1E67\u1E69\u1E6B\u1E6D\u1E6F\u1E71\u1E73\u1E75\u1E77\u1E79\u1E7B\u1E7D\u1E7F\u1E81\u1E83\u1E85\u1E87\u1E89\u1E8B\u1E8D\u1E8F\u1E91\u1E93\u1E95-\u1E9D\u1E9F\u1EA1\u1EA3\u1EA5\u1EA7\u1EA9\u1EAB\u1EAD\u1EAF\u1EB1\u1EB3\u1EB5\u1EB7\u1EB9\u1EBB\u1EBD\u1EBF\u1EC1\u1EC3\u1EC5\u1EC7\u1EC9\u1ECB\u1ECD\u1ECF\u1ED1\u1ED3\u1ED5\u1ED7\u1ED9\u1EDB\u1EDD\u1EDF\u1EE1\u1EE3\u1EE5\u1EE7\u1EE9\u1EEB\u1EED\u1EEF\u1EF1\u1EF3\u1EF5\u1EF7\u1EF9\u1EFB\u1EFD\u1EFFёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґѓѕјљњќѐѝ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F0-9])[:<>=\/](?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F]['’])(?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F0-9\"])�token_match��url_match�
2
  ��A�
3
- � ��A� �'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�..��A�..�....��A�....�._.��A�._.�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�8)��A�8)�8-)��A�8-)�8-D��A�8-D�8D��A�8D�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=[��A�=[�=]��A�=]�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o��A�>:o�><(((*>��A�><(((*>�@_@��A�@_@�Art.��A�Art.�Avv.��A�Avv.�C++��A�C++�C.so��A�C.so�Civ.��A�Civ.�Cod.��A�Cod.�Cost.��A�Cost.�E'��A�E'�E’��A�E’�Jr.��A�Jr.�L'art.��A�L'�A�art.�L’art.��A�L’�A�art.�O.O��A�O.O�O.o��A�O.o�O_O��A�O_O�O_o��A�O_o�Proc.��A�Proc.�St.��A�St.�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�[=��A�[=�\")��A�\")�\n��A�\n�\t��A�\t�]=��A�]=�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�a.��A�a.�a.C.��A�a.C.�al.��A�al.�all'art.��A�all'�A�art.�all-path��A�all-path�all’art.��A�all’�A�art.�art.��A�art.�artt.��A�artt.�att.��A�att.�avv.��A�avv.�b.��A�b.�by-pass��A�by-pass�c.��A�c.�c.d.��A�c.d.�c/c��A�c/c�centro-sinistra��A�centro-sinistra�check-up��A�check-up�cm.��A�cm.�col.��A�col.�d.��A�d.�d.C.��A�d.C.�dall'art.��A�dall'�A�art.�dall’art.��A�dall’�A�art.�de"��A�de"�dell'art.��A�dell'�A�art.�dell’art.��A�dell’�A�art.�distr.��A�distr.�e-mail��A�e-mail�e.��A�e.�e/o��A�e/o�ecc.��A�ecc.�etc.��A�etc.�f.��A�f.�g.��A�g.�h.��A�h.�i.��A�i.�j.��A�j.�k.��A�k.�l'art.��A�l'�A�art.�l.��A�l.�l’art.��A�l’�A�art.�m.��A�m.�n.��A�n.�nell'art.��A�nell'�A�art.�nell’art.��A�nell’�A�art.�nord-est��A�nord-est�n°��A�n°�o.��A�o.�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�p.��A�p.�pag.��A�pag.�po'��A�po'�po’��A�po’�prof.��A�prof.�q.��A�q.�r.��A�r.�s.��A�s.�s.n.c��A�s.n.c�s.p.a.��A�s.p.a.�s.r.l��A�s.r.l�sett.��A�sett.�sett..��A�sett.�A�.�ss.��A�ss.�t.��A�t.�tel.��A�tel.�u.��A�u.�v.��A�v.�v.v��A�v.v�v_v��A�v_v�w.��A�w.�week-end��A�week-end�x.��A�x.�xD��A�xD�xDD��A�xDD�y.��A�y.�z.��A�z.� ��A� C� �¯\(ツ)/¯��A�¯\(ツ)/¯�°C.��A�°�A�C�A�.�°F.��A�°�A�F�A�.�°K.��A�°�A�K�A�.�°c.��A�°�A�c�A�.�°f.��A�°�A�f�A�.�°k.��A�°�A�k�A�.�ä.��A�ä.�ö.��A�ö.�ü.��A�ü.�ಠ_ಠ��A�ಠ_ಠ�ಠ︵ಠ��A�ಠ︵ಠ�—��A�—�’��A�’�’’��A�’’
1
+ ��prefix_search� �^'[0-9][0-9]|^[0-9]+°|^§|^%|^=|^—|^–|^\+(?![0-9])|^…|^……|^,|^:|^;|^\!|^\?|^¿|^؟|^¡|^\(|^\)|^\[|^\]|^\{|^\}|^<|^>|^_|^#|^\*|^&|^。|^?|^!|^,|^、|^;|^:|^~|^·|^।|^،|^۔|^؛|^٪|^\.\.+|^…|^\'|^"|^”|^“|^`|^‘|^´|^’|^‚|^,|^„|^»|^«|^「|^」|^『|^』|^(|^)|^〔|^〕|^【|^】|^《|^》|^〈|^〉|^\$|^£|^€|^¥|^฿|^US\$|^C\$|^A\$|^₽|^﷼|^₴|^₠|^₡|^₢|^₣|^₤|^₥|^₦|^₧|^₨|^₩|^₪|^₫|^€|^₭|^₮|^₯|^₰|^₱|^₲|^₳|^₴|^₵|^₶|^₷|^₸|^₹|^₺|^₻|^₼|^₽|^₾|^₿|^[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]�suffix_search�2y…$|……$|,$|:$|;$|\!$|\?$|¿$|؟$|¡$|\($|\)$|\[$|\]$|\{$|\}$|<$|>$|_$|#$|\*$|&$|。$|?$|!$|,$|、$|;$|:$|~$|·$|।$|،$|۔$|؛$|٪$|\.\.+$|…$|\'$|"$|”$|“$|`$|‘$|´$|’$|‚$|,$|„$|»$|«$|「$|」$|『$|』$|($|)$|〔$|〕$|【$|】$|《$|》$|〈$|〉$|[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]$|'s$|'S$|’s$|’S$|—$|–$|(?<=[0-9])\+$|(?<=°[FfCcKk])\.$|(?<=[0-9])(?:\$|£|€|¥|฿|US\$|C\$|A\$|₽|﷼|₴|₠|₡|₢|₣|₤|₥|₦|₧|₨|₩|₪|₫|€|₭|₮|₯|₰|₱|₲|₳|₴|₵|₶|₷|₸|₹|₺|₻|₼|₽|₾|₿)$|(?<=[0-9])(?:km|km²|km³|m|m²|m³|dm|dm²|dm³|cm|cm²|cm³|mm|mm²|mm³|ha|µm|nm|yd|in|ft|kg|g|mg|µg|t|lb|oz|m/s|km/h|kmh|mph|hPa|Pa|mbar|mb|MB|kb|KB|gb|GB|tb|TB|T|G|M|K|%|км|км²|км³|м|м²|м³|дм|дм²|дм³|см|см²|см³|мм|мм²|мм³|нм|кг|г|мг|м/с|км/ч|кПа|Па|мбар|Кб|КБ|кб|Мб|МБ|мб|Гб|ГБ|гб|Тб|ТБ|тбكم|كم²|كم³|م|م²|م³|سم|سم²|سم³|مم|مم²|مم³|كم|غرام|جرام|جم|كغ|ملغ|كوب|اكواب)$|(?<=[0-9a-z\uFF41-\uFF5A\u00DF-\u00F6\u00F8-\u00FF\u0101\u0103\u0105\u0107\u0109\u010B\u010D\u010F\u0111\u0113\u0115\u0117\u0119\u011B\u011D\u011F\u0121\u0123\u0125\u0127\u0129\u012B\u012D\u012F\u0131\u0133\u0135\u0137\u0138\u013A\u013C\u013E\u0140\u0142\u0144\u0146\u0148\u0149\u014B\u014D\u014F\u0151\u0153\u0155\u0157\u0159\u015B\u015D\u015F\u0161\u0163\u0165\u0167\u0169\u016B\u016D\u016F\u0171\u0173\u0175\u0177\u017A\u017C\u017E\u017F\u0180\u0183\u0185\u0188\u018C\u018D\u0192\u0195\u0199-\u019B\u019E\u01A1\u01A3\u01A5\u01A8\u01AA\u01AB\u01AD\u01B0\u01B4\u01B6\u01B9\u01BA\u01BD-\u01BF\u01C6\u01C9\u01CC\u01CE\u01D0\u01D2\u01D4\u01D6\u01D8\u01DA\u01DC\u01DD\u01DF\u01E1\u01E3\u01E5\u01E7\u01E9\u01EB\u01ED\u01EF\u01F0\u01F3\u01F5\u01F9\u01FB\u01FD\u01FF\u0201\u0203\u0205\u0207\u0209\u020B\u020D\u020F\u0211\u0213\u0215\u0217\u0219\u021B\u021D\u021F\u0221\u0223\u0225\u0227\u0229\u022B\u022D\u022F\u0231\u0233-\u0239\u023C\u023F\u0240\u0242\u0247\u0249\u024B\u024D\u024F\u2C61\u2C65\u2C66\u2C68\u2C6A\u2C6C\u2C71\u2C73\u2C74\u2C76-\u2C7B\uA723\uA725\uA727\uA729\uA72B\uA72D\uA72F-\uA731\uA733\uA735\uA737\uA739\uA73B\uA73D\uA73F\uA741\uA743\uA745\uA747\uA749\uA74B\uA74D\uA74F\uA751\uA753\uA755\uA757\uA759\uA75B\uA75D\uA75F\uA761\uA763\uA765\uA767\uA769\uA76B\uA76D\uA76F\uA771-\uA778\uA77A\uA77C\uA77F\uA781\uA783\uA785\uA787\uA78C\uA78E\uA791\uA793-\uA795\uA797\uA799\uA79B\uA79D\uA79F\uA7A1\uA7A3\uA7A5\uA7A7\uA7A9\uA7AF\uA7B5\uA7B7\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E01\u1E03\u1E05\u1E07\u1E09\u1E0B\u1E0D\u1E0F\u1E11\u1E13\u1E15\u1E17\u1E19\u1E1B\u1E1D\u1E1F\u1E21\u1E23\u1E25\u1E27\u1E29\u1E2B\u1E2D\u1E2F\u1E31\u1E33\u1E35\u1E37\u1E39\u1E3B\u1E3D\u1E3F\u1E41\u1E43\u1E45\u1E47\u1E49\u1E4B\u1E4D\u1E4F\u1E51\u1E53\u1E55\u1E57\u1E59\u1E5B\u1E5D\u1E5F\u1E61\u1E63\u1E65\u1E67\u1E69\u1E6B\u1E6D\u1E6F\u1E71\u1E73\u1E75\u1E77\u1E79\u1E7B\u1E7D\u1E7F\u1E81\u1E83\u1E85\u1E87\u1E89\u1E8B\u1E8D\u1E8F\u1E91\u1E93\u1E95-\u1E9D\u1E9F\u1EA1\u1EA3\u1EA5\u1EA7\u1EA9\u1EAB\u1EAD\u1EAF\u1EB1\u1EB3\u1EB5\u1EB7\u1EB9\u1EBB\u1EBD\u1EBF\u1EC1\u1EC3\u1EC5\u1EC7\u1EC9\u1ECB\u1ECD\u1ECF\u1ED1\u1ED3\u1ED5\u1ED7\u1ED9\u1EDB\u1EDD\u1EDF\u1EE1\u1EE3\u1EE5\u1EE7\u1EE9\u1EEB\u1EED\u1EEF\u1EF1\u1EF3\u1EF5\u1EF7\u1EF9\u1EFB\u1EFD\u1EFFёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґѓѕјљњќѐѝ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F%²\-\+…|……|,|:|;|\!|\?|¿|؟|¡|\(|\)|\[|\]|\{|\}|<|>|_|#|\*|&|。|?|!|,|、|;|:|~|·|।|،|۔|؛|٪(?:\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉)])\.$|(?<=[A-Z\uFF21-\uFF3A\u00C0-\u00D6\u00D8-\u00DE\u0100\u0102\u0104\u0106\u0108\u010A\u010C\u010E\u0110\u0112\u0114\u0116\u0118\u011A\u011C\u011E\u0120\u0122\u0124\u0126\u0128\u012A\u012C\u012E\u0130\u0132\u0134\u0136\u0139\u013B\u013D\u013F\u0141\u0143\u0145\u0147\u014A\u014C\u014E\u0150\u0152\u0154\u0156\u0158\u015A\u015C\u015E\u0160\u0162\u0164\u0166\u0168\u016A\u016C\u016E\u0170\u0172\u0174\u0176\u0178\u0179\u017B\u017D\u0181\u0182\u0184\u0186\u0187\u0189-\u018B\u018E-\u0191\u0193\u0194\u0196-\u0198\u019C\u019D\u019F\u01A0\u01A2\u01A4\u01A6\u01A7\u01A9\u01AC\u01AE\u01AF\u01B1-\u01B3\u01B5\u01B7\u01B8\u01BC\u01C4\u01C7\u01CA\u01CD\u01CF\u01D1\u01D3\u01D5\u01D7\u01D9\u01DB\u01DE\u01E0\u01E2\u01E4\u01E6\u01E8\u01EA\u01EC\u01EE\u01F1\u01F4\u01F6-\u01F8\u01FA\u01FC\u01FE\u0200\u0202\u0204\u0206\u0208\u020A\u020C\u020E\u0210\u0212\u0214\u0216\u0218\u021A\u021C\u021E\u0220\u0222\u0224\u0226\u0228\u022A\u022C\u022E\u0230\u0232\u023A\u023B\u023D\u023E\u0241\u0243-\u0246\u0248\u024A\u024C\u024E\u2C60\u2C62-\u2C64\u2C67\u2C69\u2C6B\u2C6D-\u2C70\u2C72\u2C75\u2C7E\u2C7F\uA722\uA724\uA726\uA728\uA72A\uA72C\uA72E\uA732\uA734\uA736\uA738\uA73A\uA73C\uA73E\uA740\uA742\uA744\uA746\uA748\uA74A\uA74C\uA74E\uA750\uA752\uA754\uA756\uA758\uA75A\uA75C\uA75E\uA760\uA762\uA764\uA766\uA768\uA76A\uA76C\uA76E\uA779\uA77B\uA77D\uA77E\uA780\uA782\uA784\uA786\uA78B\uA78D\uA790\uA792\uA796\uA798\uA79A\uA79C\uA79E\uA7A0\uA7A2\uA7A4\uA7A6\uA7A8\uA7AA-\uA7AE\uA7B0-\uA7B4\uA7B6\uA7B8\u1E00\u1E02\u1E04\u1E06\u1E08\u1E0A\u1E0C\u1E0E\u1E10\u1E12\u1E14\u1E16\u1E18\u1E1A\u1E1C\u1E1E\u1E20\u1E22\u1E24\u1E26\u1E28\u1E2A\u1E2C\u1E2E\u1E30\u1E32\u1E34\u1E36\u1E38\u1E3A\u1E3C\u1E3E\u1E40\u1E42\u1E44\u1E46\u1E48\u1E4A\u1E4C\u1E4E\u1E50\u1E52\u1E54\u1E56\u1E58\u1E5A\u1E5C\u1E5E\u1E60\u1E62\u1E64\u1E66\u1E68\u1E6A\u1E6C\u1E6E\u1E70\u1E72\u1E74\u1E76\u1E78\u1E7A\u1E7C\u1E7E\u1E80\u1E82\u1E84\u1E86\u1E88\u1E8A\u1E8C\u1E8E\u1E90\u1E92\u1E94\u1E9E\u1EA0\u1EA2\u1EA4\u1EA6\u1EA8\u1EAA\u1EAC\u1EAE\u1EB0\u1EB2\u1EB4\u1EB6\u1EB8\u1EBA\u1EBC\u1EBE\u1EC0\u1EC2\u1EC4\u1EC6\u1EC8\u1ECA\u1ECC\u1ECE\u1ED0\u1ED2\u1ED4\u1ED6\u1ED8\u1EDA\u1EDC\u1EDE\u1EE0\u1EE2\u1EE4\u1EE6\u1EE8\u1EEA\u1EEC\u1EEE\u1EF0\u1EF2\u1EF4\u1EF6\u1EF8\u1EFA\u1EFC\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F][A-Z\uFF21-\uFF3A\u00C0-\u00D6\u00D8-\u00DE\u0100\u0102\u0104\u0106\u0108\u010A\u010C\u010E\u0110\u0112\u0114\u0116\u0118\u011A\u011C\u011E\u0120\u0122\u0124\u0126\u0128\u012A\u012C\u012E\u0130\u0132\u0134\u0136\u0139\u013B\u013D\u013F\u0141\u0143\u0145\u0147\u014A\u014C\u014E\u0150\u0152\u0154\u0156\u0158\u015A\u015C\u015E\u0160\u0162\u0164\u0166\u0168\u016A\u016C\u016E\u0170\u0172\u0174\u0176\u0178\u0179\u017B\u017D\u0181\u0182\u0184\u0186\u0187\u0189-\u018B\u018E-\u0191\u0193\u0194\u0196-\u0198\u019C\u019D\u019F\u01A0\u01A2\u01A4\u01A6\u01A7\u01A9\u01AC\u01AE\u01AF\u01B1-\u01B3\u01B5\u01B7\u01B8\u01BC\u01C4\u01C7\u01CA\u01CD\u01CF\u01D1\u01D3\u01D5\u01D7\u01D9\u01DB\u01DE\u01E0\u01E2\u01E4\u01E6\u01E8\u01EA\u01EC\u01EE\u01F1\u01F4\u01F6-\u01F8\u01FA\u01FC\u01FE\u0200\u0202\u0204\u0206\u0208\u020A\u020C\u020E\u0210\u0212\u0214\u0216\u0218\u021A\u021C\u021E\u0220\u0222\u0224\u0226\u0228\u022A\u022C\u022E\u0230\u0232\u023A\u023B\u023D\u023E\u0241\u0243-\u0246\u0248\u024A\u024C\u024E\u2C60\u2C62-\u2C64\u2C67\u2C69\u2C6B\u2C6D-\u2C70\u2C72\u2C75\u2C7E\u2C7F\uA722\uA724\uA726\uA728\uA72A\uA72C\uA72E\uA732\uA734\uA736\uA738\uA73A\uA73C\uA73E\uA740\uA742\uA744\uA746\uA748\uA74A\uA74C\uA74E\uA750\uA752\uA754\uA756\uA758\uA75A\uA75C\uA75E\uA760\uA762\uA764\uA766\uA768\uA76A\uA76C\uA76E\uA779\uA77B\uA77D\uA77E\uA780\uA782\uA784\uA786\uA78B\uA78D\uA790\uA792\uA796\uA798\uA79A\uA79C\uA79E\uA7A0\uA7A2\uA7A4\uA7A6\uA7A8\uA7AA-\uA7AE\uA7B0-\uA7B4\uA7B6\uA7B8\u1E00\u1E02\u1E04\u1E06\u1E08\u1E0A\u1E0C\u1E0E\u1E10\u1E12\u1E14\u1E16\u1E18\u1E1A\u1E1C\u1E1E\u1E20\u1E22\u1E24\u1E26\u1E28\u1E2A\u1E2C\u1E2E\u1E30\u1E32\u1E34\u1E36\u1E38\u1E3A\u1E3C\u1E3E\u1E40\u1E42\u1E44\u1E46\u1E48\u1E4A\u1E4C\u1E4E\u1E50\u1E52\u1E54\u1E56\u1E58\u1E5A\u1E5C\u1E5E\u1E60\u1E62\u1E64\u1E66\u1E68\u1E6A\u1E6C\u1E6E\u1E70\u1E72\u1E74\u1E76\u1E78\u1E7A\u1E7C\u1E7E\u1E80\u1E82\u1E84\u1E86\u1E88\u1E8A\u1E8C\u1E8E\u1E90\u1E92\u1E94\u1E9E\u1EA0\u1EA2\u1EA4\u1EA6\u1EA8\u1EAA\u1EAC\u1EAE\u1EB0\u1EB2\u1EB4\u1EB6\u1EB8\u1EBA\u1EBC\u1EBE\u1EC0\u1EC2\u1EC4\u1EC6\u1EC8\u1ECA\u1ECC\u1ECE\u1ED0\u1ED2\u1ED4\u1ED6\u1ED8\u1EDA\u1EDC\u1EDE\u1EE0\u1EE2\u1EE4\u1EE6\u1EE8\u1EEA\u1EEC\u1EEE\u1EF0\u1EF2\u1EF4\u1EF6\u1EF8\u1EFA\u1EFC\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])\.$�infix_finditer�O�\.\.+|…|[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]|(?<=[0-9])[+\-\*^](?=[0-9-])|(?<=[a-z\uFF41-\uFF5A\u00DF-\u00F6\u00F8-\u00FF\u0101\u0103\u0105\u0107\u0109\u010B\u010D\u010F\u0111\u0113\u0115\u0117\u0119\u011B\u011D\u011F\u0121\u0123\u0125\u0127\u0129\u012B\u012D\u012F\u0131\u0133\u0135\u0137\u0138\u013A\u013C\u013E\u0140\u0142\u0144\u0146\u0148\u0149\u014B\u014D\u014F\u0151\u0153\u0155\u0157\u0159\u015B\u015D\u015F\u0161\u0163\u0165\u0167\u0169\u016B\u016D\u016F\u0171\u0173\u0175\u0177\u017A\u017C\u017E\u017F\u0180\u0183\u0185\u0188\u018C\u018D\u0192\u0195\u0199-\u019B\u019E\u01A1\u01A3\u01A5\u01A8\u01AA\u01AB\u01AD\u01B0\u01B4\u01B6\u01B9\u01BA\u01BD-\u01BF\u01C6\u01C9\u01CC\u01CE\u01D0\u01D2\u01D4\u01D6\u01D8\u01DA\u01DC\u01DD\u01DF\u01E1\u01E3\u01E5\u01E7\u01E9\u01EB\u01ED\u01EF\u01F0\u01F3\u01F5\u01F9\u01FB\u01FD\u01FF\u0201\u0203\u0205\u0207\u0209\u020B\u020D\u020F\u0211\u0213\u0215\u0217\u0219\u021B\u021D\u021F\u0221\u0223\u0225\u0227\u0229\u022B\u022D\u022F\u0231\u0233-\u0239\u023C\u023F\u0240\u0242\u0247\u0249\u024B\u024D\u024F\u2C61\u2C65\u2C66\u2C68\u2C6A\u2C6C\u2C71\u2C73\u2C74\u2C76-\u2C7B\uA723\uA725\uA727\uA729\uA72B\uA72D\uA72F-\uA731\uA733\uA735\uA737\uA739\uA73B\uA73D\uA73F\uA741\uA743\uA745\uA747\uA749\uA74B\uA74D\uA74F\uA751\uA753\uA755\uA757\uA759\uA75B\uA75D\uA75F\uA761\uA763\uA765\uA767\uA769\uA76B\uA76D\uA76F\uA771-\uA778\uA77A\uA77C\uA77F\uA781\uA783\uA785\uA787\uA78C\uA78E\uA791\uA793-\uA795\uA797\uA799\uA79B\uA79D\uA79F\uA7A1\uA7A3\uA7A5\uA7A7\uA7A9\uA7AF\uA7B5\uA7B7\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E01\u1E03\u1E05\u1E07\u1E09\u1E0B\u1E0D\u1E0F\u1E11\u1E13\u1E15\u1E17\u1E19\u1E1B\u1E1D\u1E1F\u1E21\u1E23\u1E25\u1E27\u1E29\u1E2B\u1E2D\u1E2F\u1E31\u1E33\u1E35\u1E37\u1E39\u1E3B\u1E3D\u1E3F\u1E41\u1E43\u1E45\u1E47\u1E49\u1E4B\u1E4D\u1E4F\u1E51\u1E53\u1E55\u1E57\u1E59\u1E5B\u1E5D\u1E5F\u1E61\u1E63\u1E65\u1E67\u1E69\u1E6B\u1E6D\u1E6F\u1E71\u1E73\u1E75\u1E77\u1E79\u1E7B\u1E7D\u1E7F\u1E81\u1E83\u1E85\u1E87\u1E89\u1E8B\u1E8D\u1E8F\u1E91\u1E93\u1E95-\u1E9D\u1E9F\u1EA1\u1EA3\u1EA5\u1EA7\u1EA9\u1EAB\u1EAD\u1EAF\u1EB1\u1EB3\u1EB5\u1EB7\u1EB9\u1EBB\u1EBD\u1EBF\u1EC1\u1EC3\u1EC5\u1EC7\u1EC9\u1ECB\u1ECD\u1ECF\u1ED1\u1ED3\u1ED5\u1ED7\u1ED9\u1EDB\u1EDD\u1EDF\u1EE1\u1EE3\u1EE5\u1EE7\u1EE9\u1EEB\u1EED\u1EEF\u1EF1\u1EF3\u1EF5\u1EF7\u1EF9\u1EFB\u1EFD\u1EFFёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґѓѕјљњќѐѝ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉])\.(?=[A-Z\uFF21-\uFF3A\u00C0-\u00D6\u00D8-\u00DE\u0100\u0102\u0104\u0106\u0108\u010A\u010C\u010E\u0110\u0112\u0114\u0116\u0118\u011A\u011C\u011E\u0120\u0122\u0124\u0126\u0128\u012A\u012C\u012E\u0130\u0132\u0134\u0136\u0139\u013B\u013D\u013F\u0141\u0143\u0145\u0147\u014A\u014C\u014E\u0150\u0152\u0154\u0156\u0158\u015A\u015C\u015E\u0160\u0162\u0164\u0166\u0168\u016A\u016C\u016E\u0170\u0172\u0174\u0176\u0178\u0179\u017B\u017D\u0181\u0182\u0184\u0186\u0187\u0189-\u018B\u018E-\u0191\u0193\u0194\u0196-\u0198\u019C\u019D\u019F\u01A0\u01A2\u01A4\u01A6\u01A7\u01A9\u01AC\u01AE\u01AF\u01B1-\u01B3\u01B5\u01B7\u01B8\u01BC\u01C4\u01C7\u01CA\u01CD\u01CF\u01D1\u01D3\u01D5\u01D7\u01D9\u01DB\u01DE\u01E0\u01E2\u01E4\u01E6\u01E8\u01EA\u01EC\u01EE\u01F1\u01F4\u01F6-\u01F8\u01FA\u01FC\u01FE\u0200\u0202\u0204\u0206\u0208\u020A\u020C\u020E\u0210\u0212\u0214\u0216\u0218\u021A\u021C\u021E\u0220\u0222\u0224\u0226\u0228\u022A\u022C\u022E\u0230\u0232\u023A\u023B\u023D\u023E\u0241\u0243-\u0246\u0248\u024A\u024C\u024E\u2C60\u2C62-\u2C64\u2C67\u2C69\u2C6B\u2C6D-\u2C70\u2C72\u2C75\u2C7E\u2C7F\uA722\uA724\uA726\uA728\uA72A\uA72C\uA72E\uA732\uA734\uA736\uA738\uA73A\uA73C\uA73E\uA740\uA742\uA744\uA746\uA748\uA74A\uA74C\uA74E\uA750\uA752\uA754\uA756\uA758\uA75A\uA75C\uA75E\uA760\uA762\uA764\uA766\uA768\uA76A\uA76C\uA76E\uA779\uA77B\uA77D\uA77E\uA780\uA782\uA784\uA786\uA78B\uA78D\uA790\uA792\uA796\uA798\uA79A\uA79C\uA79E\uA7A0\uA7A2\uA7A4\uA7A6\uA7A8\uA7AA-\uA7AE\uA7B0-\uA7B4\uA7B6\uA7B8\u1E00\u1E02\u1E04\u1E06\u1E08\u1E0A\u1E0C\u1E0E\u1E10\u1E12\u1E14\u1E16\u1E18\u1E1A\u1E1C\u1E1E\u1E20\u1E22\u1E24\u1E26\u1E28\u1E2A\u1E2C\u1E2E\u1E30\u1E32\u1E34\u1E36\u1E38\u1E3A\u1E3C\u1E3E\u1E40\u1E42\u1E44\u1E46\u1E48\u1E4A\u1E4C\u1E4E\u1E50\u1E52\u1E54\u1E56\u1E58\u1E5A\u1E5C\u1E5E\u1E60\u1E62\u1E64\u1E66\u1E68\u1E6A\u1E6C\u1E6E\u1E70\u1E72\u1E74\u1E76\u1E78\u1E7A\u1E7C\u1E7E\u1E80\u1E82\u1E84\u1E86\u1E88\u1E8A\u1E8C\u1E8E\u1E90\u1E92\u1E94\u1E9E\u1EA0\u1EA2\u1EA4\u1EA6\u1EA8\u1EAA\u1EAC\u1EAE\u1EB0\u1EB2\u1EB4\u1EB6\u1EB8\u1EBA\u1EBC\u1EBE\u1EC0\u1EC2\u1EC4\u1EC6\u1EC8\u1ECA\u1ECC\u1ECE\u1ED0\u1ED2\u1ED4\u1ED6\u1ED8\u1EDA\u1EDC\u1EDE\u1EE0\u1EE2\u1EE4\u1EE6\u1EE8\u1EEA\u1EEC\u1EEE\u1EF0\u1EF2\u1EF4\u1EF6\u1EF8\u1EFA\u1EFC\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F]),(?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])(?:-|–|—|--|---|——|~)(?=[a-z\uFF41-\uFF5A\u00DF-\u00F6\u00F8-\u00FF\u0101\u0103\u0105\u0107\u0109\u010B\u010D\u010F\u0111\u0113\u0115\u0117\u0119\u011B\u011D\u011F\u0121\u0123\u0125\u0127\u0129\u012B\u012D\u012F\u0131\u0133\u0135\u0137\u0138\u013A\u013C\u013E\u0140\u0142\u0144\u0146\u0148\u0149\u014B\u014D\u014F\u0151\u0153\u0155\u0157\u0159\u015B\u015D\u015F\u0161\u0163\u0165\u0167\u0169\u016B\u016D\u016F\u0171\u0173\u0175\u0177\u017A\u017C\u017E\u017F\u0180\u0183\u0185\u0188\u018C\u018D\u0192\u0195\u0199-\u019B\u019E\u01A1\u01A3\u01A5\u01A8\u01AA\u01AB\u01AD\u01B0\u01B4\u01B6\u01B9\u01BA\u01BD-\u01BF\u01C6\u01C9\u01CC\u01CE\u01D0\u01D2\u01D4\u01D6\u01D8\u01DA\u01DC\u01DD\u01DF\u01E1\u01E3\u01E5\u01E7\u01E9\u01EB\u01ED\u01EF\u01F0\u01F3\u01F5\u01F9\u01FB\u01FD\u01FF\u0201\u0203\u0205\u0207\u0209\u020B\u020D\u020F\u0211\u0213\u0215\u0217\u0219\u021B\u021D\u021F\u0221\u0223\u0225\u0227\u0229\u022B\u022D\u022F\u0231\u0233-\u0239\u023C\u023F\u0240\u0242\u0247\u0249\u024B\u024D\u024F\u2C61\u2C65\u2C66\u2C68\u2C6A\u2C6C\u2C71\u2C73\u2C74\u2C76-\u2C7B\uA723\uA725\uA727\uA729\uA72B\uA72D\uA72F-\uA731\uA733\uA735\uA737\uA739\uA73B\uA73D\uA73F\uA741\uA743\uA745\uA747\uA749\uA74B\uA74D\uA74F\uA751\uA753\uA755\uA757\uA759\uA75B\uA75D\uA75F\uA761\uA763\uA765\uA767\uA769\uA76B\uA76D\uA76F\uA771-\uA778\uA77A\uA77C\uA77F\uA781\uA783\uA785\uA787\uA78C\uA78E\uA791\uA793-\uA795\uA797\uA799\uA79B\uA79D\uA79F\uA7A1\uA7A3\uA7A5\uA7A7\uA7A9\uA7AF\uA7B5\uA7B7\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E01\u1E03\u1E05\u1E07\u1E09\u1E0B\u1E0D\u1E0F\u1E11\u1E13\u1E15\u1E17\u1E19\u1E1B\u1E1D\u1E1F\u1E21\u1E23\u1E25\u1E27\u1E29\u1E2B\u1E2D\u1E2F\u1E31\u1E33\u1E35\u1E37\u1E39\u1E3B\u1E3D\u1E3F\u1E41\u1E43\u1E45\u1E47\u1E49\u1E4B\u1E4D\u1E4F\u1E51\u1E53\u1E55\u1E57\u1E59\u1E5B\u1E5D\u1E5F\u1E61\u1E63\u1E65\u1E67\u1E69\u1E6B\u1E6D\u1E6F\u1E71\u1E73\u1E75\u1E77\u1E79\u1E7B\u1E7D\u1E7F\u1E81\u1E83\u1E85\u1E87\u1E89\u1E8B\u1E8D\u1E8F\u1E91\u1E93\u1E95-\u1E9D\u1E9F\u1EA1\u1EA3\u1EA5\u1EA7\u1EA9\u1EAB\u1EAD\u1EAF\u1EB1\u1EB3\u1EB5\u1EB7\u1EB9\u1EBB\u1EBD\u1EBF\u1EC1\u1EC3\u1EC5\u1EC7\u1EC9\u1ECB\u1ECD\u1ECF\u1ED1\u1ED3\u1ED5\u1ED7\u1ED9\u1EDB\u1EDD\u1EDF\u1EE1\u1EE3\u1EE5\u1EE7\u1EE9\u1EEB\u1EED\u1EEF\u1EF1\u1EF3\u1EF5\u1EF7\u1EF9\u1EFB\u1EFD\u1EFFёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґѓѕјљњќѐѝ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F0-9])[:<>=\/](?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F]['’])(?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F0-9\"])�token_match��url_match�
2
  ��A�
3
+ � ��A� �'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�..��A�..�....��A�....�._.��A�._.�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�8)��A�8)�8-)��A�8-)�8-D��A�8-D�8D��A�8D�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=[��A�=[�=]��A�=]�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o��A�>:o�><(((*>��A�><(((*>�@_@��A�@_@�Art.��A�Art.�Avv.��A�Avv.�C++��A�C++�C.so��A�C.so�Civ.��A�Civ.�Cod.��A�Cod.�Cost.��A�Cost.�E'��A�E'�E’��A�E’�Jr.��A�Jr.�L'art.��A�L'�A�art.�L’art.��A�L’�A�art.�O.O��A�O.O�O.o��A�O.o�O_O��A�O_O�O_o��A�O_o�Proc.��A�Proc.�St.��A�St.�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�[=��A�[=�\")��A�\")�\n��A�\n�\t��A�\t�]=��A�]=�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�a.��A�a.�a.C.��A�a.C.�al.��A�al.�all'art.��A�all'�A�art.�all-path��A�all-path�all’art.��A�all’�A�art.�art.��A�art.�artt.��A�artt.�att.��A�att.�avv.��A�avv.�b.��A�b.�by-pass��A�by-pass�c.��A�c.�c.d.��A�c.d.�c/c��A�c/c�centro-sinistra��A�centro-sinistra�check-up��A�check-up�cm.��A�cm.�col.��A�col.�d.��A�d.�d.C.��A�d.C.�dall'art.��A�dall'�A�art.�dall’art.��A�dall’�A�art.�de"��A�de"�dell'art.��A�dell'�A�art.�dell’art.��A�dell’�A�art.�distr.��A�distr.�e-mail��A�e-mail�e.��A�e.�e/o��A�e/o�ecc.��A�ecc.�etc.��A�etc.�f.��A�f.�g.��A�g.�h.��A�h.�i.��A�i.�j.��A�j.�k.��A�k.�l'art.��A�l'�A�art.�l.��A�l.�l’art.��A�l’�A�art.�m.��A�m.�n.��A�n.�nell'art.��A�nell'�A�art.�nell’art.��A�nell’�A�art.�nord-est��A�nord-est�n°��A�n°�o.��A�o.�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�p.��A�p.�pag.��A�pag.�po'��A�po'�po’��A�po’�prof.��A�prof.�q.��A�q.�r.��A�r.�s.��A�s.�s.n.c��A�s.n.c�s.p.a.��A�s.p.a.�s.r.l��A�s.r.l�sett.��A�sett.�sett..��A�sett.�A�.�ss.��A�ss.�t.��A�t.�tel.��A�tel.�u.��A�u.�v.��A�v.�v.v��A�v.v�v_v��A�v_v�w.��A�w.�week-end��A�week-end�x.��A�x.�xD��A�xD�xDD��A�xDD�y.��A�y.�z.��A�z.� ��A� C� �¯\(ツ)/¯��A�¯\(ツ)/¯�°C.��A�°�A�C�A�.�°F.��A�°�A�F�A�.�°K.��A�°�A�K�A�.�°c.��A�°�A�c�A�.�°f.��A�°�A�f�A�.�°k.��A�°�A�k�A�.�ä.��A�ä.�ö.��A�ö.�ü.��A�ü.�ಠ_ಠ��A�ಠ_ಠ�ಠ︵ಠ��A�ಠ︵ಠ�—��A�—�’��A�’�’’��A�’’�faster_heuristics�
vocab/strings.json CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e9a5ce4d3230dcca5afcb829c031af7095216d98f72cac1d6d33b2ded40cf9cb
3
- size 9668632
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a9c235bd6fcf56d330b2794a2d6078c4a8fe8871b285decc41edaf729f427ad7
3
+ size 9683813