EC2 Default User committed on
Commit
d8577ad
1 Parent(s): 7fcea77

Update spaCy pipeline

LICENSES_SOURCES CHANGED
@@ -378,6 +378,8 @@ Creative Commons Notice
378
  * License: CC BY 4.0
379
 
380
  ```
381
  By exercising the Licensed Rights (defined below), You accept and agree to be bound by the terms and conditions of this Creative Commons Attribution 4.0 International Public License ("Public License"). To the extent this Public License may be interpreted as a contract, You are granted the Licensed Rights in consideration of Your acceptance of these terms and conditions, and the Licensor grants You such rights in consideration of benefits the Licensor receives from making the Licensed Material available under these terms and conditions.
382
 
383
  Section 1 – Definitions.
@@ -467,554 +469,3 @@ Nothing in this Public License constitutes or may be interpreted as a limitation
467
 
468
 
469
 
470
- # Lemmatization Lists
471
-
472
- * Author: Michal Měchura
473
- * URL: https://github.com/michmech/lemmatization-lists/
474
- * License: ODbL
475
-
476
- ```
477
- ## ODC Open Database License (ODbL)
478
-
479
- ### Preamble
480
-
481
- The Open Database License (ODbL) is a license agreement intended to
482
- allow users to freely share, modify, and use this Database while
483
- maintaining this same freedom for others. Many databases are covered by
484
- copyright, and therefore this document licenses these rights. Some
485
- jurisdictions, mainly in the European Union, have specific rights that
486
- cover databases, and so the ODbL addresses these rights, too. Finally,
487
- the ODbL is also an agreement in contract for users of this Database to
488
- act in certain ways in return for accessing this Database.
489
-
490
- Databases can contain a wide variety of types of content (images,
491
- audiovisual material, and sounds all in the same database, for example),
492
- and so the ODbL only governs the rights over the Database, and not the
493
- contents of the Database individually. Licensors should use the ODbL
494
- together with another license for the contents, if the contents have a
495
- single set of rights that uniformly covers all of the contents. If the
496
- contents have multiple sets of different rights, Licensors should
497
- describe what rights govern what contents together in the individual
498
- record or in some other way that clarifies what rights apply.
499
-
500
- Sometimes the contents of a database, or the database itself, can be
501
- covered by other rights not addressed here (such as private contracts,
502
- trade mark over the name, or privacy rights / data protection rights
503
- over information in the contents), and so you are advised that you may
504
- have to consult other documents or clear other rights before doing
505
- activities not covered by this License.
506
-
507
- ------
508
-
509
- The Licensor (as defined below)
510
-
511
- and
512
-
513
- You (as defined below)
514
-
515
- agree as follows:
516
-
517
- ### 1.0 Definitions of Capitalised Words
518
-
519
- "Collective Database" – Means this Database in unmodified form as part
520
- of a collection of independent databases in themselves that together are
521
- assembled into a collective whole. A work that constitutes a Collective
522
- Database will not be considered a Derivative Database.
523
-
524
- "Convey" – As a verb, means Using the Database, a Derivative Database,
525
- or the Database as part of a Collective Database in any way that enables
526
- a Person to make or receive copies of the Database or a Derivative
527
- Database. Conveying does not include interaction with a user through a
528
- computer network, or creating and Using a Produced Work, where no
529
- transfer of a copy of the Database or a Derivative Database occurs.
530
- "Contents" – The contents of this Database, which includes the
531
- information, independent works, or other material collected into the
532
- Database. For example, the contents of the Database could be factual
533
- data or works such as images, audiovisual material, text, or sounds.
534
-
535
- "Database" – A collection of material (the Contents) arranged in a
536
- systematic or methodical way and individually accessible by electronic
537
- or other means offered under the terms of this License.
538
-
539
- "Database Directive" – Means Directive 96/9/EC of the European
540
- Parliament and of the Council of 11 March 1996 on the legal protection
541
- of databases, as amended or succeeded.
542
-
543
- "Database Right" – Means rights resulting from the Chapter III ("sui
544
- generis") rights in the Database Directive (as amended and as transposed
545
- by member states), which includes the Extraction and Re-utilisation of
546
- the whole or a Substantial part of the Contents, as well as any similar
547
- rights available in the relevant jurisdiction under Section 10.4.
548
-
549
- "Derivative Database" – Means a database based upon the Database, and
550
- includes any translation, adaptation, arrangement, modification, or any
551
- other alteration of the Database or of a Substantial part of the
552
- Contents. This includes, but is not limited to, Extracting or
553
- Re-utilising the whole or a Substantial part of the Contents in a new
554
- Database.
555
-
556
- "Extraction" – Means the permanent or temporary transfer of all or a
557
- Substantial part of the Contents to another medium by any means or in
558
- any form.
559
-
560
- "License" – Means this license agreement and is both a license of rights
561
- such as copyright and Database Rights and an agreement in contract.
562
-
563
- "Licensor" – Means the Person that offers the Database under the terms
564
- of this License.
565
-
566
- "Person" – Means a natural or legal person or a body of persons
567
- corporate or incorporate.
568
-
569
- "Produced Work" – a work (such as an image, audiovisual material, text,
570
- or sounds) resulting from using the whole or a Substantial part of the
571
- Contents (via a search or other query) from this Database, a Derivative
572
- Database, or this Database as part of a Collective Database.
573
-
574
- "Publicly" – means to Persons other than You or under Your control by
575
- either more than 50% ownership or by the power to direct their
576
- activities (such as contracting with an independent consultant).
577
-
578
- "Re-utilisation" – means any form of making available to the public all
579
- or a Substantial part of the Contents by the distribution of copies, by
580
- renting, by online or other forms of transmission.
581
-
582
- "Substantial" – Means substantial in terms of quantity or quality or a
583
- combination of both. The repeated and systematic Extraction or
584
- Re-utilisation of insubstantial parts of the Contents may amount to the
585
- Extraction or Re-utilisation of a Substantial part of the Contents.
586
-
587
- "Use" – As a verb, means doing any act that is restricted by copyright
588
- or Database Rights whether in the original medium or any other; and
589
- includes without limitation distributing, copying, publicly performing,
590
- publicly displaying, and preparing derivative works of the Database, as
591
- well as modifying the Database as may be technically necessary to use it
592
- in a different mode or format.
593
-
594
- "You" – Means a Person exercising rights under this License who has not
595
- previously violated the terms of this License with respect to the
596
- Database, or who has received express permission from the Licensor to
597
- exercise rights under this License despite a previous violation.
598
-
599
- Words in the singular include the plural and vice versa.
600
-
601
- ### 2.0 What this License covers
602
-
603
- 2.1. Legal effect of this document. This License is:
604
-
605
- a. A license of applicable copyright and neighbouring rights;
606
-
607
- b. A license of the Database Right; and
608
-
609
- c. An agreement in contract between You and the Licensor.
610
-
611
- 2.2 Legal rights covered. This License covers the legal rights in the
612
- Database, including:
613
-
614
- a. Copyright. Any copyright or neighbouring rights in the Database.
615
- The copyright licensed includes any individual elements of the
616
- Database, but does not cover the copyright over the Contents
617
- independent of this Database. See Section 2.4 for details. Copyright
618
- law varies between jurisdictions, but is likely to cover: the Database
619
- model or schema, which is the structure, arrangement, and organisation
620
- of the Database, and can also include the Database tables and table
621
- indexes; the data entry and output sheets; and the Field names of
622
- Contents stored in the Database;
623
-
624
- b. Database Rights. Database Rights only extend to the Extraction and
625
- Re-utilisation of the whole or a Substantial part of the Contents.
626
- Database Rights can apply even when there is no copyright over the
627
- Database. Database Rights can also apply when the Contents are removed
628
- from the Database and are selected and arranged in a way that would
629
- not infringe any applicable copyright; and
630
-
631
- c. Contract. This is an agreement between You and the Licensor for
632
- access to the Database. In return you agree to certain conditions of
633
- use on this access as outlined in this License.
634
-
635
- 2.3 Rights not covered.
636
-
637
- a. This License does not apply to computer programs used in the making
638
- or operation of the Database;
639
-
640
- b. This License does not cover any patents over the Contents or the
641
- Database; and
642
-
643
- c. This License does not cover any trademarks associated with the
644
- Database.
645
-
646
- 2.4 Relationship to Contents in the Database. The individual items of
647
- the Contents contained in this Database may be covered by other rights,
648
- including copyright, patent, data protection, privacy, or personality
649
- rights, and this License does not cover any rights (other than Database
650
- Rights or in contract) in individual Contents contained in the Database.
651
- For example, if used on a Database of images (the Contents), this
652
- License would not apply to copyright over individual images, which could
653
- have their own separate licenses, or one single license covering all of
654
- the rights over the images.
655
-
656
- ### 3.0 Rights granted
657
-
658
- 3.1 Subject to the terms and conditions of this License, the Licensor
659
- grants to You a worldwide, royalty-free, non-exclusive, terminable (but
660
- only under Section 9) license to Use the Database for the duration of
661
- any applicable copyright and Database Rights. These rights explicitly
662
- include commercial use, and do not exclude any field of endeavour. To
663
- the extent possible in the relevant jurisdiction, these rights may be
664
- exercised in all media and formats whether now known or created in the
665
- future.
666
-
667
- The rights granted cover, for example:
668
-
669
- a. Extraction and Re-utilisation of the whole or a Substantial part of
670
- the Contents;
671
-
672
- b. Creation of Derivative Databases;
673
-
674
- c. Creation of Collective Databases;
675
-
676
- d. Creation of temporary or permanent reproductions by any means and
677
- in any form, in whole or in part, including of any Derivative
678
- Databases or as a part of Collective Databases; and
679
-
680
- e. Distribution, communication, display, lending, making available, or
681
- performance to the public by any means and in any form, in whole or in
682
- part, including of any Derivative Database or as a part of Collective
683
- Databases.
684
-
685
- 3.2 Compulsory license schemes. For the avoidance of doubt:
686
-
687
- a. Non-waivable compulsory license schemes. In those jurisdictions in
688
- which the right to collect royalties through any statutory or
689
- compulsory licensing scheme cannot be waived, the Licensor reserves
690
- the exclusive right to collect such royalties for any exercise by You
691
- of the rights granted under this License;
692
-
693
- b. Waivable compulsory license schemes. In those jurisdictions in
694
- which the right to collect royalties through any statutory or
695
- compulsory licensing scheme can be waived, the Licensor waives the
696
- exclusive right to collect such royalties for any exercise by You of
697
- the rights granted under this License; and,
698
-
699
- c. Voluntary license schemes. The Licensor waives the right to collect
700
- royalties, whether individually or, in the event that the Licensor is
701
- a member of a collecting society that administers voluntary licensing
702
- schemes, via that society, from any exercise by You of the rights
703
- granted under this License.
704
-
705
- 3.3 The right to release the Database under different terms, or to stop
706
- distributing or making available the Database, is reserved. Note that
707
- this Database may be multiple-licensed, and so You may have the choice
708
- of using alternative licenses for this Database. Subject to Section
709
- 10.4, all other rights not expressly granted by Licensor are reserved.
710
-
711
- ### 4.0 Conditions of Use
712
-
713
- 4.1 The rights granted in Section 3 above are expressly made subject to
714
- Your complying with the following conditions of use. These are important
715
- conditions of this License, and if You fail to follow them, You will be
716
- in material breach of its terms.
717
-
718
- 4.2 Notices. If You Publicly Convey this Database, any Derivative
719
- Database, or the Database as part of a Collective Database, then You
720
- must:
721
-
722
- a. Do so only under the terms of this License or another license
723
- permitted under Section 4.4;
724
-
725
- b. Include a copy of this License (or, as applicable, a license
726
- permitted under Section 4.4) or its Uniform Resource Identifier (URI)
727
- with the Database or Derivative Database, including both in the
728
- Database or Derivative Database and in any relevant documentation; and
729
-
730
- c. Keep intact any copyright or Database Right notices and notices
731
- that refer to this License.
732
-
733
- d. If it is not possible to put the required notices in a particular
734
- file due to its structure, then You must include the notices in a
735
- location (such as a relevant directory) where users would be likely to
736
- look for it.
737
-
738
- 4.3 Notice for using output (Contents). Creating and Using a Produced
739
- Work does not require the notice in Section 4.2. However, if you
740
- Publicly Use a Produced Work, You must include a notice associated with
741
- the Produced Work reasonably calculated to make any Person that uses,
742
- views, accesses, interacts with, or is otherwise exposed to the Produced
743
- Work aware that Content was obtained from the Database, Derivative
744
- Database, or the Database as part of a Collective Database, and that it
745
- is available under this License.
746
-
747
- a. Example notice. The following text will satisfy notice under
748
- Section 4.3:
749
-
750
- Contains information from DATABASE NAME, which is made available
751
- here under the Open Database License (ODbL).
752
-
753
- DATABASE NAME should be replaced with the name of the Database and a
754
- hyperlink to the URI of the Database. "Open Database License" should
755
- contain a hyperlink to the URI of the text of this License. If
756
- hyperlinks are not possible, You should include the plain text of the
757
- required URI's with the above notice.
758
-
759
- 4.4 Share alike.
760
-
761
- a. Any Derivative Database that You Publicly Use must be only under
762
- the terms of:
763
-
764
- i. This License;
765
-
766
- ii. A later version of this License similar in spirit to this
767
- License; or
768
-
769
- iii. A compatible license.
770
-
771
- If You license the Derivative Database under one of the licenses
772
- mentioned in (iii), You must comply with the terms of that license.
773
-
774
- b. For the avoidance of doubt, Extraction or Re-utilisation of the
775
- whole or a Substantial part of the Contents into a new database is a
776
- Derivative Database and must comply with Section 4.4.
777
-
778
- c. Derivative Databases and Produced Works. A Derivative Database is
779
- Publicly Used and so must comply with Section 4.4. if a Produced Work
780
- created from the Derivative Database is Publicly Used.
781
-
782
- d. Share Alike and additional Contents. For the avoidance of doubt,
783
- You must not add Contents to Derivative Databases under Section 4.4 a
784
- that are incompatible with the rights granted under this License.
785
-
786
- e. Compatible licenses. Licensors may authorise a proxy to determine
787
- compatible licenses under Section 4.4 a iii. If they do so, the
788
- authorised proxy's public statement of acceptance of a compatible
789
- license grants You permission to use the compatible license.
790
-
791
-
792
- 4.5 Limits of Share Alike. The requirements of Section 4.4 do not apply
793
- in the following:
794
-
795
- a. For the avoidance of doubt, You are not required to license
796
- Collective Databases under this License if You incorporate this
797
- Database or a Derivative Database in the collection, but this License
798
- still applies to this Database or a Derivative Database as a part of
799
- the Collective Database;
800
-
801
- b. Using this Database, a Derivative Database, or this Database as
802
- part of a Collective Database to create a Produced Work does not
803
- create a Derivative Database for purposes of Section 4.4; and
804
-
805
- c. Use of a Derivative Database internally within an organisation is
806
- not to the public and therefore does not fall under the requirements
807
- of Section 4.4.
808
-
809
- 4.6 Access to Derivative Databases. If You Publicly Use a Derivative
810
- Database or a Produced Work from a Derivative Database, You must also
811
- offer to recipients of the Derivative Database or Produced Work a copy
812
- in a machine readable form of:
813
-
814
- a. The entire Derivative Database; or
815
-
816
- b. A file containing all of the alterations made to the Database or
817
- the method of making the alterations to the Database (such as an
818
- algorithm), including any additional Contents, that make up all the
819
- differences between the Database and the Derivative Database.
820
-
821
- The Derivative Database (under a.) or alteration file (under b.) must be
822
- available at no more than a reasonable production cost for physical
823
- distributions and free of charge if distributed over the internet.
824
-
825
- 4.7 Technological measures and additional terms
826
-
827
- a. This License does not allow You to impose (except subject to
828
- Section 4.7 b.) any terms or any technological measures on the
829
- Database, a Derivative Database, or the whole or a Substantial part of
830
- the Contents that alter or restrict the terms of this License, or any
831
- rights granted under it, or have the effect or intent of restricting
832
- the ability of any person to exercise those rights.
833
-
834
- b. Parallel distribution. You may impose terms or technological
835
- measures on the Database, a Derivative Database, or the whole or a
836
- Substantial part of the Contents (a "Restricted Database") in
837
- contravention of Section 4.74 a. only if You also make a copy of the
838
- Database or a Derivative Database available to the recipient of the
839
- Restricted Database:
840
-
841
- i. That is available without additional fee;
842
-
843
- ii. That is available in a medium that does not alter or restrict
844
- the terms of this License, or any rights granted under it, or have
845
- the effect or intent of restricting the ability of any person to
846
- exercise those rights (an "Unrestricted Database"); and
847
-
848
- iii. The Unrestricted Database is at least as accessible to the
849
- recipient as a practical matter as the Restricted Database.
850
-
851
- c. For the avoidance of doubt, You may place this Database or a
852
- Derivative Database in an authenticated environment, behind a
853
- password, or within a similar access control scheme provided that You
854
- do not alter or restrict the terms of this License or any rights
855
- granted under it or have the effect or intent of restricting the
856
- ability of any person to exercise those rights.
857
-
858
- 4.8 Licensing of others. You may not sublicense the Database. Each time
859
- You communicate the Database, the whole or Substantial part of the
860
- Contents, or any Derivative Database to anyone else in any way, the
861
- Licensor offers to the recipient a license to the Database on the same
862
- terms and conditions as this License. You are not responsible for
863
- enforcing compliance by third parties with this License, but You may
864
- enforce any rights that You have over a Derivative Database. You are
865
- solely responsible for any modifications of a Derivative Database made
866
- by You or another Person at Your direction. You may not impose any
867
- further restrictions on the exercise of the rights granted or affirmed
868
- under this License.
869
-
870
- ### 5.0 Moral rights
871
-
872
- 5.1 Moral rights. This section covers moral rights, including any rights
873
- to be identified as the author of the Database or to object to treatment
874
- that would otherwise prejudice the author's honour and reputation, or
875
- any other derogatory treatment:
876
-
877
- a. For jurisdictions allowing waiver of moral rights, Licensor waives
878
- all moral rights that Licensor may have in the Database to the fullest
879
- extent possible by the law of the relevant jurisdiction under Section
880
- 10.4;
881
-
882
- b. If waiver of moral rights under Section 5.1 a in the relevant
883
- jurisdiction is not possible, Licensor agrees not to assert any moral
884
- rights over the Database and waives all claims in moral rights to the
885
- fullest extent possible by the law of the relevant jurisdiction under
886
- Section 10.4; and
887
-
888
- c. For jurisdictions not allowing waiver or an agreement not to assert
889
- moral rights under Section 5.1 a and b, the author may retain their
890
- moral rights over certain aspects of the Database.
891
-
892
- Please note that some jurisdictions do not allow for the waiver of moral
893
- rights, and so moral rights may still subsist over the Database in some
894
- jurisdictions.
895
-
896
- ### 6.0 Fair dealing, Database exceptions, and other rights not affected
897
-
898
- 6.1 This License does not affect any rights that You or anyone else may
899
- independently have under any applicable law to make any use of this
900
- Database, including without limitation:
901
-
902
- a. Exceptions to the Database Right including: Extraction of Contents
903
- from non-electronic Databases for private purposes, Extraction for
904
- purposes of illustration for teaching or scientific research, and
905
- Extraction or Re-utilisation for public security or an administrative
906
- or judicial procedure.
907
-
908
- b. Fair dealing, fair use, or any other legally recognised limitation
909
- or exception to infringement of copyright or other applicable laws.
910
-
911
- 6.2 This License does not affect any rights of lawful users to Extract
912
- and Re-utilise insubstantial parts of the Contents, evaluated
913
- quantitatively or qualitatively, for any purposes whatsoever, including
914
- creating a Derivative Database (subject to other rights over the
915
- Contents, see Section 2.4). The repeated and systematic Extraction or
916
- Re-utilisation of insubstantial parts of the Contents may however amount
917
- to the Extraction or Re-utilisation of a Substantial part of the
918
- Contents.
919
-
920
- ### 7.0 Warranties and Disclaimer
921
-
922
- 7.1 The Database is licensed by the Licensor "as is" and without any
923
- warranty of any kind, either express, implied, or arising by statute,
924
- custom, course of dealing, or trade usage. Licensor specifically
925
- disclaims any and all implied warranties or conditions of title,
926
- non-infringement, accuracy or completeness, the presence or absence of
927
- errors, fitness for a particular purpose, merchantability, or otherwise.
928
- Some jurisdictions do not allow the exclusion of implied warranties, so
929
- this exclusion may not apply to You.
930
-
931
- ### 8.0 Limitation of liability
932
-
933
- 8.1 Subject to any liability that may not be excluded or limited by law,
934
- the Licensor is not liable for, and expressly excludes, all liability
935
- for loss or damage however and whenever caused to anyone by any use
936
- under this License, whether by You or by anyone else, and whether caused
937
- by any fault on the part of the Licensor or not. This exclusion of
938
- liability includes, but is not limited to, any special, incidental,
939
- consequential, punitive, or exemplary damages such as loss of revenue,
940
- data, anticipated profits, and lost business. This exclusion applies
941
- even if the Licensor has been advised of the possibility of such
942
- damages.
943
-
944
- 8.2 If liability may not be excluded by law, it is limited to actual and
945
- direct financial loss to the extent it is caused by proved negligence on
946
- the part of the Licensor.
947
-
948
- ### 9.0 Termination of Your rights under this License
949
-
950
- 9.1 Any breach by You of the terms and conditions of this License
951
- automatically terminates this License with immediate effect and without
952
- notice to You. For the avoidance of doubt, Persons who have received the
953
- Database, the whole or a Substantial part of the Contents, Derivative
954
- Databases, or the Database as part of a Collective Database from You
955
- under this License will not have their licenses terminated provided
956
- their use is in full compliance with this License or a license granted
957
- under Section 4.8 of this License. Sections 1, 2, 7, 8, 9 and 10 will
958
- survive any termination of this License.
959
-
960
- 9.2 If You are not in breach of the terms of this License, the Licensor
961
- will not terminate Your rights under it.
962
-
963
- 9.3 Unless terminated under Section 9.1, this License is granted to You
964
- for the duration of applicable rights in the Database.
965
-
966
- 9.4 Reinstatement of rights. If you cease any breach of the terms and
967
- conditions of this License, then your full rights under this License
968
- will be reinstated:
969
-
970
- a. Provisionally and subject to permanent termination until the 60th
971
- day after cessation of breach;
972
-
973
- b. Permanently on the 60th day after cessation of breach unless
974
- otherwise reasonably notified by the Licensor; or
975
-
976
- c. Permanently if reasonably notified by the Licensor of the
977
- violation, this is the first time You have received notice of
978
- violation of this License from the Licensor, and You cure the
979
- violation prior to 30 days after your receipt of the notice.
980
-
981
- Persons subject to permanent termination of rights are not eligible to
982
- be a recipient and receive a license under Section 4.8.
983
-
984
- 9.5 Notwithstanding the above, Licensor reserves the right to release
985
- the Database under different license terms or to stop distributing or
986
- making available the Database. Releasing the Database under different
987
- license terms or stopping the distribution of the Database will not
988
- withdraw this License (or any other license that has been, or is
989
- required to be, granted under the terms of this License), and this
990
- License will continue in full force and effect unless terminated as
991
- stated above.
992
-
993
- ### 10.0 General
994
-
995
- 10.1 If any provision of this License is held to be invalid or
996
- unenforceable, that must not affect the validity or enforceability of
997
- the remainder of the terms and conditions of this License and each
998
- remaining provision of this License shall be valid and enforced to the
999
- fullest extent permitted by law.
1000
-
1001
- 10.2 This License is the entire agreement between the parties with
1002
- respect to the rights granted here over the Database. It replaces any
1003
- earlier understandings, agreements or representations with respect to
1004
- the Database.
1005
-
1006
- 10.3 If You are in breach of the terms of this License, You will not be
1007
- entitled to rely on the terms of this License or to complain of any
1008
- breach by the Licensor.
1009
-
1010
- 10.4 Choice of law. This License takes effect in and will be governed by
1011
- the laws of the relevant jurisdiction in which the License terms are
1012
- sought to be enforced. If the standard suite of rights granted under
1013
- applicable copyright law and Database Rights in the relevant
1014
- jurisdiction includes additional rights not granted under this License,
1015
- these additional rights are granted in this License in order to meet the
1016
- terms of this License.```
1017
-
1018
-
1019
-
1020
-
 
378
  * License: CC BY 4.0
379
 
380
  ```
381
+ Creative Commons Attribution 4.0 International Public License
382
+
383
  By exercising the Licensed Rights (defined below), You accept and agree to be bound by the terms and conditions of this Creative Commons Attribution 4.0 International Public License ("Public License"). To the extent this Public License may be interpreted as a contract, You are granted the Licensed Rights in consideration of Your acceptance of these terms and conditions, and the Licensor grants You such rights in consideration of benefits the Licensor receives from making the Licensed Material available under these terms and conditions.
384
 
385
  Section 1 – Definitions.
 
469
 
470
 
471
 
README.md CHANGED
@@ -14,61 +14,76 @@ model-index:
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
- value: 0.8556688921
18
  - name: NER Recall
19
  type: recall
20
- value: 0.8519161046
21
  - name: NER F Score
22
  type: f_score
23
- value: 0.8537883746
24
  - task:
25
  name: POS
26
  type: token-classification
27
  metrics:
28
- - name: POS Accuracy
29
  type: accuracy
30
- value: 0.9652631106
31
  - task:
32
- name: SENTER
33
  type: token-classification
34
  metrics:
35
- - name: SENTER Precision
36
- type: precision
37
- value: 0.9718804921
38
- - name: SENTER Recall
39
- type: recall
40
- value: 0.9804964539
41
- - name: SENTER F Score
42
- type: f_score
43
- value: 0.9761694616
44
  - task:
45
- name: UNLABELED_DEPENDENCIES
46
  type: token-classification
47
  metrics:
48
- - name: Unlabeled Dependencies Accuracy
49
  type: accuracy
50
- value: 0.8970414201
51
  - task:
52
  name: LABELED_DEPENDENCIES
53
  type: token-classification
54
  metrics:
55
- - name: Labeled Dependencies Accuracy
56
- type: accuracy
57
- value: 0.8970414201
 
 
 
 
 
 
 
58
  ---
  ### Details: https://spacy.io/models/it#it_core_news_sm

- Italian pipeline optimized for CPU. Components: tok2vec, morphologizer, tagger, parser, senter, ner, attribute_ruler, lemmatizer.

  | Feature | Description |
  | --- | --- |
  | **Name** | `it_core_news_sm` |
- | **Version** | `3.2.0` |
- | **spaCy** | `>=3.2.0,<3.3.0` |
- | **Default Pipeline** | `tok2vec`, `morphologizer`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
- | **Components** | `tok2vec`, `morphologizer`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
  | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
- | **Sources** | [UD Italian ISDT v2.8](https://github.com/UniversalDependencies/UD_Italian-ISDT) (Bosco, Cristina; Lenci, Alessandro; Montemagni, Simonetta; Simi, Maria)<br />[WikiNER](https://figshare.com/articles/Learning_multilingual_named_entity_recognition_from_Wikipedia/5462500) (Joel Nothman, Nicky Ringland, Will Radford, Tara Murphy, James R Curran)<br />[Lemmatization Lists](https://github.com/michmech/lemmatization-lists/) (Michal Měchura) |
  | **License** | `CC BY-NC-SA 3.0` |
  | **Author** | [Explosion](https://explosion.ai) |

@@ -76,14 +91,13 @@ Italian pipeline optimized for CPU. Components: tok2vec, morphologizer, tagger,

  <details>

- <summary>View label scheme (443 labels for 5 components)</summary>

  | Component | Labels |
  | --- | --- |
  | **`morphologizer`** | `POS=PROPN`, `POS=PUNCT`, `Gender=Masc\|POS=NOUN`, `Definite=Def\|Gender=Fem\|Number=Sing\|POS=ADP\|PronType=Art`, `Gender=Fem\|Number=Sing\|POS=NOUN`, `Gender=Masc\|Number=Sing\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=NOUN`, `Definite=Ind\|Gender=Fem\|Number=Sing\|POS=DET\|PronType=Art`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=AUX\|Tense=Past\|VerbForm=Part`, `POS=AUX\|VerbForm=Inf`, `Gender=Fem\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `POS=ADP`, `Gender=Fem\|Number=Sing\|POS=ADJ`, `POS=PRON\|PronType=Rel`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Definite=Def\|Gender=Masc\|Number=Plur\|POS=DET\|PronType=Art`, `Gender=Masc\|Number=Plur\|POS=NOUN`, `Definite=Def\|Gender=Fem\|Number=Sing\|POS=DET\|PronType=Art`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Ind`, `Definite=Def\|Gender=Masc\|Number=Plur\|POS=ADP\|PronType=Art`, `Number=Plur\|POS=ADJ`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Definite=Def\|Number=Sing\|POS=DET\|PronType=Art`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `POS=VERB\|VerbForm=Inf`, `Definite=Ind\|Gender=Masc\|Number=Sing\|POS=DET\|PronType=Art`, `Number=Sing\|POS=ADJ`, `POS=CCONJ`, `NumType=Card\|POS=NUM`, `Definite=Def\|Gender=Masc\|Number=Sing\|POS=ADP\|PronType=Art`, `Definite=Def\|Gender=Fem\|Number=Plur\|POS=ADP\|PronType=Art`, `Gender=Fem\|Number=Plur\|POS=NOUN`, `Clitic=Yes\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Fem\|Number=Plur\|POS=ADJ`, `Gender=Fem\|Number=Plur\|POS=DET\|Poss=Yes\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=ADJ`, `Definite=Def\|Number=Sing\|POS=ADP\|PronType=Art`, `Definite=Def\|Gender=Masc\|Number=Sing\|POS=DET\|PronType=Art`, `Gender=Masc\|NumType=Ord\|Number=Sing\|POS=ADJ`, `POS=ADV`, `POS=NOUN`, `Number=Sing\|POS=NOUN`, `POS=VERB\|VerbForm=Ger`, 
`Gender=Masc\|Number=Sing\|POS=DET\|Poss=Yes\|PronType=Prs`, `POS=INTJ`, `Clitic=Yes\|Number=Sing\|POS=PRON\|Person=2\|PronType=Prs`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=AUX\|Tense=Past\|VerbForm=Part`, `Definite=Def\|Gender=Fem\|Number=Plur\|POS=DET\|PronType=Art`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Gender=Fem\|POS=NOUN`, `Gender=Fem\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Tot`, `Mood=Cnd\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Mood=Cnd\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Ind`, `Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Number=Plur\|POS=NOUN`, `POS=SCONJ`, `Number=Sing\|POS=DET\|PronType=Ind`, `POS=ADV\|PronType=Neg`, `Clitic=Yes\|POS=VERB\|PronType=Prs\|VerbForm=Inf`, `Gender=Fem\|Number=Plur\|POS=AUX\|Tense=Past\|VerbForm=Part`, `Gender=Fem\|Number=Plur\|POS=DET\|PronType=Ind`, `Gender=Fem\|Number=Sing\|POS=PRON\|PronType=Ind`, `POS=ADJ`, `Number=Sing\|POS=PRON\|PronType=Rel`, `Gender=Fem\|NumType=Ord\|Number=Sing\|POS=ADJ`, `Number=Sing\|POS=PRON\|PronType=Ind`, `Gender=Masc\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=AUX\|Tense=Past\|VerbForm=Part`, `Clitic=Yes\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `POS=DET\|Poss=Yes\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=DET\|Poss=Yes\|PronType=Prs`, `Mood=Sub\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=DET\|PronType=Ind`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Dem`, 
`Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Past\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Gender=Fem\|Number=Sing\|POS=DET\|Poss=Yes\|PronType=Prs`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Dem`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|POS=PRON\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=DET\|PronType=Tot`, `Clitic=Yes\|Gender=Masc\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Clitic=Yes\|Number=Sing\|POS=PRON\|Person=1\|PronType=Prs`, `Degree=Cmp\|Number=Plur\|POS=ADJ`, `Clitic=Yes\|Gender=Masc\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Degree=Cmp\|Number=Sing\|POS=ADJ`, `Gender=Masc\|Number=Plur\|POS=DET\|PronType=Dem`, `Degree=Abs\|POS=ADV`, `Clitic=Yes\|Gender=Fem\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=AUX\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Dem`, `POS=DET\|PronType=Exc`, `Number=Plur\|POS=PRON\|Person=1\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Number=Plur\|POS=PRON\|Person=1\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Past\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Past\|VerbForm=Fin`, `Number=Sing\|POS=DET\|PronType=Dem`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Past\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Gender=Fem\|NumType=Ord\|Number=Plur\|POS=ADJ`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Past\|VerbForm=Fin`, `Number=Sing\|POS=DET\|PronType=Int`, 
`POS=PRON\|PronType=Int`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Past\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Ind`, `Number=Sing\|POS=ADP`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Foreign=Yes\|POS=X`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|POS=AUX\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Gender=Masc\|Mood=Imp\|Number=Plur,Sing\|POS=VERB\|Person=1,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Sing\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=PRON\|Poss=Yes\|PronType=Prs`, `Number=Plur\|POS=VERB\|Tense=Pres\|VerbForm=Part`, `POS=INTJ\|Polarity=Neg`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|PronType=Rel`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Ind`, `Gender=Fem\|Number=Sing\|POS=PRON\|PronType=Dem`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=DET\|PronType=Dem`, `Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Rel`, `Clitic=Yes\|Number=Plur\|POS=VERB\|Person=1\|PronType=Prs\|VerbForm=Ger`, `POS=INTJ\|Polarity=Pos`, `Gender=Fem\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Int`, `POS=DET\|PronType=Int`, `Gender=Masc\|NumType=Ord\|Number=Plur\|POS=ADJ`, `Gender=Fem\|Number=Plur\|POS=DET\|PronType=Int`, `Mood=Cnd\|Number=Plur\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `POS=PRON\|Person=3\|PronType=Prs`, `Degree=Abs\|Gender=Masc\|Number=Plur\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Ind`, `Number=Sing\|POS=PRON\|Person=1\|PronType=Prs`, 
`Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Dem`, `Clitic=Yes\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Clitic=Yes\|Gender=Fem\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Gender=Fem\|POS=PRON\|Person=3\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Degree=Abs\|Gender=Fem\|Number=Sing\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Tot`, `Clitic=Yes\|POS=AUX\|PronType=Prs\|VerbForm=Inf`, `Gender=Fem\|Number=Plur\|POS=DET\|PronType=Tot`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Dem`, `Degree=Abs\|Gender=Masc\|Number=Sing\|POS=ADJ`, `NumType=Ord\|POS=ADJ`, `POS=DET\|PronType=Rel`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Rel`, `Gender=Masc\|Number=Plur\|POS=PRON\|Poss=Yes\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Imp\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Number=Sing\|POS=PRON\|Person=2\|PronType=Prs`, `Mood=Cnd\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Number=Plur\|POS=PRON\|Person=2\|PronType=Prs`, `Clitic=Yes\|Number=Sing\|POS=VERB\|Person=1\|PronType=Prs\|VerbForm=Inf`, `Mood=Imp\|Number=Sing\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=2\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Mood=Cnd\|Number=Sing\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, 
`Clitic=Yes\|POS=VERB\|PronType=Prs\|VerbForm=Ger`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Cnd\|Number=Sing\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Number=Sing\|POS=PRON\|PronType=Int`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=2\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|Person=2\|PronType=Prs`, `Clitic=Yes\|Number=Plur\|POS=VERB\|Person=2\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Number=Plur\|POS=VERB\|Person=1\|PronType=Prs\|VerbForm=Inf`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=2\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Definite=Def\|POS=DET\|PronType=Art`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `POS=SYM`, `Clitic=Yes\|Mood=Imp\|Number=Sing\|POS=VERB\|Person=2\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Mood=Imp\|Number=Sing\|POS=VERB\|Person=2,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=2\|Tense=Fut\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Degree=Abs\|Gender=Fem\|Number=Plur\|POS=ADJ`, `Number=Sing\|POS=PRON\|PronType=Dem`, `POS=AUX\|VerbForm=Ger`, `Gender=Masc\|Number=Sing\|POS=PRON\|Poss=Yes\|PronType=Prs`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs\|VerbForm=Inf`, `POS=PRON\|PronType=Ind`, `Clitic=Yes\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=1\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `POS=X`, `Gender=Masc\|POS=ADJ`, `Clitic=Yes\|Gender=Fem\|Number=Sing\|POS=AUX\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Gender=Fem\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, 
`Gender=Masc\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Mood=Cnd\|Number=Sing\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Number=Sing\|POS=VERB\|Person=2\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Gender=Fem\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=2\|Tense=Imp\|VerbForm=Fin`, `POS=PART`, `Number=Sing\|POS=VERB\|Tense=Pres\|VerbForm=Part`, `NumType=Ord\|Number=Sing\|POS=ADJ`, `Number=Plur\|POS=DET\|PronType=Int`, `Clitic=Yes\|Mood=Sub\|Number=Plur\|POS=VERB\|Person=1\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=DET\|PronType=Rel`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Clitic=Yes\|Number=Sing\|POS=VERB\|Person=1\|PronType=Prs\|VerbForm=Ger`, `Clitic=Yes\|Number=Sing\|POS=AUX\|Person=1\|PronType=Prs\|VerbForm=Ger`, `Clitic=Yes\|Gender=Masc\|Number=Plur\|POS=AUX\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Mood=Imp\|Number=Plur,Sing\|POS=VERB\|Person=1,2\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Imp\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `NumType=Range\|POS=NUM`, `Number=Plur\|POS=PRON\|PronType=Dem`, `POS=VERB\|Tense=Past\|VerbForm=Part`, `Clitic=Yes\|POS=ADV\|PronType=Prs`, `Clitic=Yes\|Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|POS=PRON\|PronType=Rel`, `Clitic=Yes\|Gender=Masc\|Mood=Imp\|Number=Plur,Sing\|POS=VERB\|Person=2,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Number=Sing\|POS=AUX\|Person=2\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Number=Sing\|POS=VERB\|Person=2\|PronType=Prs\|VerbForm=Ger`, `Mood=Imp\|Number=Sing\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, 
`Clitic=Yes\|Gender=Fem\|Mood=Imp\|Number=Sing\|POS=VERB\|Person=2,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Past\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Clitic=Yes\|Gender=Masc\|Number=Plur,Sing\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Definite=Ind\|POS=DET\|PronType=Art`, `Clitic=Yes\|Gender=Fem,Masc\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Definite=Ind\|Gender=Masc\|Number=Plur\|POS=DET\|PronType=Art`, `Definite=Def\|Number=Plur\|POS=ADP\|PronType=Art`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `POS=DET\|PronType=Ind`, `Number=Plur\|POS=DET\|PronType=Dem`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Number=Plur\|POS=DET\|PronType=Tot`, `Clitic=Yes\|POS=AUX\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Number=Plur\|POS=PRON\|PronType=Ind`, `Clitic=Yes\|Gender=Fem,Masc\|Number=Plur,Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Clitic=Yes\|Number=Plur\|POS=VERB\|PronType=Prs\|VerbForm=Inf`, `Number=Plur\|POS=PRON\|Poss=Yes\|PronType=Prs`, `Number=Sing\|POS=PRON\|Poss=Yes\|PronType=Prs`, `Number=Plur\|POS=ADP`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=ADV\|Person=3\|PronType=Prs`, `Clitic=Yes\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=1,2\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Mood=Sub\|Number=Sing\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Cnd\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=ADV\|Person=3\|PronType=Prs`, `POS=DET\|PronType=Tot`, `POS=PRON\|PronType=Dem`, 
`Clitic=Yes\|Gender=Masc\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=2,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Definite=Ind\|Number=Sing\|POS=DET\|PronType=Art`, `NumType=Ord\|POS=NUM`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Gender=Masc\|POS=DET\|PronType=Dem`, `Clitic=Yes\|Gender=Masc\|Number=Plur,Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Gender=Masc\|Number=Sing\|POS=NOUN\|Tense=Past\|VerbForm=Part`, `Gender=Masc\|Number=Plur\|POS=DET\|PronType=Int`, `Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Int`, `Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Int`, `Mood=Imp\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Int`, `Gender=Fem\|Number=Sing\|POS=PRON\|PronType=Int`, `Number=Plur\|POS=PRON\|PronType=Int`, `Mood=Cnd\|Number=Sing\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Int`, `Clitic=Yes\|Number=Plur\|POS=PRON\|PronType=Prs`, `Foreign=Yes\|Number=Sing\|POS=X`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Fut\|VerbForm=Fin`, `POS=PRON\|PronType=Prs`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=2\|Tense=Imp\|VerbForm=Fin`, `POS=SCONJ\|PronType=Rel`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `POS=PRON\|Person=3\|PronType=Rel`, `Clitic=Yes\|Number=Plur\|POS=VERB\|Person=2\|PronType=Prs\|VerbForm=Ger`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=3\|VerbForm=Fin`, `Clitic=Yes\|Mood=Ind\|Number=Sing\|POS=VERB\|Person=1,3\|PronType=Prs\|Tense=Past\|VerbForm=Fin`, 
`Mood=Ind\|POS=VERB\|Tense=Pres\|VerbForm=Fin`, `Degree=Cmp\|POS=ADJ`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=2\|Tense=Fut\|VerbForm=Fin`, `Definite=Def\|Number=Plur\|POS=DET\|PronType=Art`, `Number=Sing\|POS=DET\|Poss=Yes\|PronType=Prs`, `Gender=Masc\|Number=Sing\|POS=ADP`, `Gender=Fem\|POS=ADJ`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=2,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Masc\|POS=DET\|Poss=Yes\|PronType=Prs`, `Gender=Fem\|Number=Plur\|POS=PROPN`, `Definite=Ind\|Gender=Fem\|Number=Plur\|POS=DET\|PronType=Art`, `Number=Sing\|POS=DET\|PronType=Art`, `Gender=Fem\|Number=Sing\|POS=ADJ\|Poss=Yes\|PronType=Prs`, `Foreign=Yes\|POS=NOUN`, `Clitic=Yes\|Gender=Fem\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=1,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=1,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=DET`, `Clitic=Yes\|Gender=Fem\|Mood=Imp\|Number=Plur,Sing\|POS=VERB\|Person=1,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=DET`, `Number=Sing\|POS=X`, `Foreign=Yes\|Gender=Masc\|POS=X`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Prs`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Prs`, `Clitic=Yes\|Definite=Def\|Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Art`, `Gender=Masc\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Fin`, `Definite=Def\|Gender=Fem\|POS=DET`, `Definite=Def\|POS=DET`, `Foreign=Yes\|POS=PROPN`, `NumType=Card\|POS=PROPN`, `Gender=Fem\|Number=Sing\|POS=DET`, `Degree=Abs\|Gender=Masc\|Number=Sing\|POS=ADV`, `Gender=Masc\|Number=Plur\|POS=NOUN\|Tense=Past\|VerbForm=Part`, `Mood=Imp\|Number=Plur\|POS=VERB\|Person=2`, 
`Clitic=Yes\|Number=Plur\|POS=AUX\|Person=1\|PronType=Prs\|VerbForm=Inf`, `Gender=Masc\|Number=Sing\|POS=DET`, `Number=Sing\|POS=DET`, `Gender=Masc\|Number=Sing\|POS=PRON`, `POS=DET` |
  | **`tagger`** | `A`, `AP`, `B`, `BN`, `B_PC`, `CC`, `CS`, `DD`, `DE`, `DI`, `DQ`, `DR`, `E`, `E_RD`, `FB`, `FC`, `FF`, `FS`, `I`, `N`, `NO`, `PART`, `PC`, `PC_PC`, `PD`, `PE`, `PI`, `PP`, `PQ`, `PR`, `RD`, `RI`, `S`, `SP`, `SW`, `SYM`, `T`, `V`, `VA`, `VA_PC`, `VM`, `VM_PC`, `VM_PC_PC`, `V_B`, `V_PC`, `V_PC_PC`, `X` |
  | **`parser`** | `ROOT`, `acl`, `acl:relcl`, `advcl`, `advmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `cc`, `ccomp`, `compound`, `conj`, `cop`, `csubj`, `dep`, `det`, `det:poss`, `det:predet`, `discourse`, `expl`, `expl:impers`, `expl:pass`, `fixed`, `flat`, `flat:foreign`, `flat:name`, `iobj`, `mark`, `nmod`, `nsubj`, `nsubj:pass`, `nummod`, `obj`, `obl`, `obl:agent`, `parataxis`, `punct`, `vocative`, `xcomp` |
- | **`senter`** | `I`, `S` |
  | **`ner`** | `LOC`, `MISC`, `ORG`, `PER` |

  </details>
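The morphologizer labels listed above pack the coarse part-of-speech tag and the Universal Dependencies features into a single `|`-separated string (the `\|` in the table is only Markdown escaping). As a minimal sketch, assuming every label follows that `Name=Value` layout, a label can be split into its `POS` value and a feature mapping. `parse_morph_label` is a hypothetical helper written for illustration, not part of spaCy.

```python
from typing import Dict, List, Optional, Tuple


def parse_morph_label(label: str) -> Tuple[Optional[str], Dict[str, List[str]]]:
    """Split a label such as 'Gender=Masc|Number=Sing|POS=NOUN'.

    Returns (pos, features). Each feature maps to a list of values,
    because a few labels carry several (e.g. 'Number=Plur,Sing').
    """
    # Undo the Markdown table escaping if the label was copied from the README.
    label = label.replace("\\|", "|")
    pos: Optional[str] = None
    features: Dict[str, List[str]] = {}
    for pair in label.split("|"):
        if not pair:
            continue
        name, _, value = pair.partition("=")
        if name == "POS":
            pos = value
        else:
            features[name] = value.split(",")
    return pos, features


if __name__ == "__main__":
    print(parse_morph_label("Gender=Masc|Number=Sing|POS=NOUN"))
    print(parse_morph_label(r"Clitic=Yes\|Number=Plur,Sing\|POS=VERB\|Person=1,2"))
```

Keeping the values as lists avoids special-casing the handful of labels with comma-separated alternatives.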
@@ -96,18 +110,18 @@ Italian pipeline optimized for CPU. Components: tok2vec, morphologizer, tagger,
  | `TOKEN_P` | 99.80 |
  | `TOKEN_R` | 99.78 |
  | `TOKEN_F` | 99.79 |
- | `POS_ACC` | 96.95 |
- | `MORPH_ACC` | 96.87 |
- | `MORPH_MICRO_P` | 98.44 |
- | `MORPH_MICRO_R` | 97.67 |
- | `MORPH_MICRO_F` | 98.05 |
- | `TAG_ACC` | 96.53 |
- | `SENTS_P` | 97.19 |
- | `SENTS_R` | 98.05 |
- | `SENTS_F` | 97.62 |
- | `DEP_UAS` | 89.70 |
- | `DEP_LAS` | 85.74 |
- | `LEMMA_ACC` | 86.40 |
- | `ENTS_P` | 85.57 |
- | `ENTS_R` | 85.19 |
- | `ENTS_F` | 85.38 |
 
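The `ENTS_P`, `ENTS_R`, and `ENTS_F` rows in the table above appear to be the front-matter `value` fields expressed as percentages, with the F-score being the harmonic mean of precision and recall. A small sketch that checks this against the removed 3.2.0 NER figures; the helper names are illustrative and not spaCy API.

```python
def f_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (F1)."""
    if precision + recall == 0.0:
        return 0.0
    return 2.0 * precision * recall / (precision + recall)


def as_percent(value: float) -> float:
    """Convert a 0-1 fraction to the two-decimal percentage used in the table."""
    return round(value * 100.0, 2)


if __name__ == "__main__":
    # NER precision and recall from the old README front matter.
    p, r = 0.8556688921, 0.8519161046
    print(as_percent(p), as_percent(r), as_percent(f_score(p, r)))
```

The same check can be run on the updated values (`0.8546443384`, `0.8515457487`) to confirm the new `NER F Score` entry.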
  metrics:
  - name: NER Precision
  type: precision
+ value: 0.8546443384
  - name: NER Recall
  type: recall
+ value: 0.8515457487
  - name: NER F Score
  type: f_score
+ value: 0.8530922299
+ - task:
+ name: TAG
+ type: token-classification
+ metrics:
+ - name: TAG (XPOS) Accuracy
+ type: accuracy
+ value: 0.9655792217
  - task:
  name: POS
  type: token-classification
  metrics:
+ - name: POS (UPOS) Accuracy
  type: accuracy
+ value: 0.9697568867
  - task:
+ name: MORPH
  type: token-classification
  metrics:
+ - name: Morph (UFeats) Accuracy
+ type: accuracy
+ value: 0.9683595506
  - task:
+ name: LEMMA
  type: token-classification
  metrics:
+ - name: Lemma Accuracy
  type: accuracy
+ value: 0.9705248023
+ - task:
+ name: UNLABELED_DEPENDENCIES
+ type: token-classification
+ metrics:
+ - name: Unlabeled Attachment Score (UAS)
+ type: f_score
+ value: 0.8962288419
  - task:
  name: LABELED_DEPENDENCIES
  type: token-classification
  metrics:
+ - name: Labeled Attachment Score (LAS)
+ type: f_score
+ value: 0.8560991923
+ - task:
+ name: SENTS
+ type: token-classification
+ metrics:
+ - name: Sentences F-Score
+ type: f_score
+ value: 0.9744942832
  ---
  ### Details: https://spacy.io/models/it#it_core_news_sm

+ Italian pipeline optimized for CPU. Components: tok2vec, morphologizer, tagger, parser, lemmatizer (trainable_lemmatizer), senter, ner.

  | Feature | Description |
  | --- | --- |
  | **Name** | `it_core_news_sm` |
+ | **Version** | `3.3.0` |
+ | **spaCy** | `>=3.3.0.dev0,<3.4.0` |
+ | **Default Pipeline** | `tok2vec`, `morphologizer`, `tagger`, `parser`, `lemmatizer`, `attribute_ruler`, `ner` |
+ | **Components** | `tok2vec`, `morphologizer`, `tagger`, `parser`, `lemmatizer`, `senter`, `attribute_ruler`, `ner` |
  | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
+ | **Sources** | [UD Italian ISDT v2.8](https://github.com/UniversalDependencies/UD_Italian-ISDT) (Bosco, Cristina; Lenci, Alessandro; Montemagni, Simonetta; Simi, Maria)<br />[WikiNER](https://figshare.com/articles/Learning_multilingual_named_entity_recognition_from_Wikipedia/5462500) (Joel Nothman, Nicky Ringland, Will Radford, Tara Murphy, James R Curran) |
  | **License** | `CC BY-NC-SA 3.0` |
  | **Author** | [Explosion](https://explosion.ai) |
 
 

  <details>

+ <summary>View label scheme (441 labels for 4 components)</summary>

  | Component | Labels |
  | --- | --- |
  | **`morphologizer`** | `POS=PROPN`, `POS=PUNCT`, `Gender=Masc\|POS=NOUN`, `Definite=Def\|Gender=Fem\|Number=Sing\|POS=ADP\|PronType=Art`, `Gender=Fem\|Number=Sing\|POS=NOUN`, `Gender=Masc\|Number=Sing\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=NOUN`, `Definite=Ind\|Gender=Fem\|Number=Sing\|POS=DET\|PronType=Art`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=AUX\|Tense=Past\|VerbForm=Part`, `POS=AUX\|VerbForm=Inf`, `Gender=Fem\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `POS=ADP`, `Gender=Fem\|Number=Sing\|POS=ADJ`, `POS=PRON\|PronType=Rel`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Definite=Def\|Gender=Masc\|Number=Plur\|POS=DET\|PronType=Art`, `Gender=Masc\|Number=Plur\|POS=NOUN`, `Definite=Def\|Gender=Fem\|Number=Sing\|POS=DET\|PronType=Art`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Ind`, `Definite=Def\|Gender=Masc\|Number=Plur\|POS=ADP\|PronType=Art`, `Number=Plur\|POS=ADJ`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Definite=Def\|Number=Sing\|POS=DET\|PronType=Art`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `POS=VERB\|VerbForm=Inf`, `Definite=Ind\|Gender=Masc\|Number=Sing\|POS=DET\|PronType=Art`, `Number=Sing\|POS=ADJ`, `POS=CCONJ`, `NumType=Card\|POS=NUM`, `Definite=Def\|Gender=Masc\|Number=Sing\|POS=ADP\|PronType=Art`, `Definite=Def\|Gender=Fem\|Number=Plur\|POS=ADP\|PronType=Art`, `Gender=Fem\|Number=Plur\|POS=NOUN`, `Clitic=Yes\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Fem\|Number=Plur\|POS=ADJ`, `Gender=Fem\|Number=Plur\|POS=DET\|Poss=Yes\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=ADJ`, `Definite=Def\|Number=Sing\|POS=ADP\|PronType=Art`, `Definite=Def\|Gender=Masc\|Number=Sing\|POS=DET\|PronType=Art`, `Gender=Masc\|NumType=Ord\|Number=Sing\|POS=ADJ`, `POS=ADV`, `POS=NOUN`, `Number=Sing\|POS=NOUN`, `POS=VERB\|VerbForm=Ger`, 
`Gender=Masc\|Number=Sing\|POS=DET\|Poss=Yes\|PronType=Prs`, `POS=INTJ`, `Clitic=Yes\|Number=Sing\|POS=PRON\|Person=2\|PronType=Prs`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=AUX\|Tense=Past\|VerbForm=Part`, `Definite=Def\|Gender=Fem\|Number=Plur\|POS=DET\|PronType=Art`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Gender=Fem\|POS=NOUN`, `Gender=Fem\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Tot`, `Mood=Cnd\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Mood=Cnd\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Ind`, `Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Number=Plur\|POS=NOUN`, `POS=SCONJ`, `Number=Sing\|POS=DET\|PronType=Ind`, `POS=ADV\|PronType=Neg`, `Clitic=Yes\|POS=VERB\|PronType=Prs\|VerbForm=Inf`, `Gender=Fem\|Number=Plur\|POS=AUX\|Tense=Past\|VerbForm=Part`, `Gender=Fem\|Number=Plur\|POS=DET\|PronType=Ind`, `Gender=Fem\|Number=Sing\|POS=PRON\|PronType=Ind`, `POS=ADJ`, `Number=Sing\|POS=PRON\|PronType=Rel`, `Gender=Fem\|NumType=Ord\|Number=Sing\|POS=ADJ`, `Number=Sing\|POS=PRON\|PronType=Ind`, `Gender=Masc\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=AUX\|Tense=Past\|VerbForm=Part`, `Clitic=Yes\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `POS=DET\|Poss=Yes\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=DET\|Poss=Yes\|PronType=Prs`, `Mood=Sub\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=DET\|PronType=Ind`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Dem`, 
`Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Past\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Gender=Fem\|Number=Sing\|POS=DET\|Poss=Yes\|PronType=Prs`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Dem`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|POS=PRON\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=DET\|PronType=Tot`, `Clitic=Yes\|Gender=Masc\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Clitic=Yes\|Number=Sing\|POS=PRON\|Person=1\|PronType=Prs`, `Degree=Cmp\|Number=Plur\|POS=ADJ`, `Clitic=Yes\|Gender=Masc\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Degree=Cmp\|Number=Sing\|POS=ADJ`, `Gender=Masc\|Number=Plur\|POS=DET\|PronType=Dem`, `Degree=Abs\|POS=ADV`, `Clitic=Yes\|Gender=Fem\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=AUX\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Dem`, `POS=DET\|PronType=Exc`, `Number=Plur\|POS=PRON\|Person=1\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Number=Plur\|POS=PRON\|Person=1\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Past\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Past\|VerbForm=Fin`, `Number=Sing\|POS=DET\|PronType=Dem`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Past\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Gender=Fem\|NumType=Ord\|Number=Plur\|POS=ADJ`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Past\|VerbForm=Fin`, `Number=Sing\|POS=DET\|PronType=Int`, 
`POS=PRON\|PronType=Int`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Past\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Ind`, `Number=Sing\|POS=ADP`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Foreign=Yes\|POS=X`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|POS=AUX\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Gender=Masc\|Mood=Imp\|Number=Plur,Sing\|POS=VERB\|Person=1,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Sing\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=PRON\|Poss=Yes\|PronType=Prs`, `Number=Plur\|POS=VERB\|Tense=Pres\|VerbForm=Part`, `POS=INTJ\|Polarity=Neg`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|PronType=Rel`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Ind`, `Gender=Fem\|Number=Sing\|POS=PRON\|PronType=Dem`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=DET\|PronType=Dem`, `Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Rel`, `Clitic=Yes\|Number=Plur\|POS=VERB\|Person=1\|PronType=Prs\|VerbForm=Ger`, `POS=INTJ\|Polarity=Pos`, `Gender=Fem\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Int`, `POS=DET\|PronType=Int`, `Gender=Masc\|NumType=Ord\|Number=Plur\|POS=ADJ`, `Gender=Fem\|Number=Plur\|POS=DET\|PronType=Int`, `Mood=Cnd\|Number=Plur\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `POS=PRON\|Person=3\|PronType=Prs`, `Degree=Abs\|Gender=Masc\|Number=Plur\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Ind`, `Number=Sing\|POS=PRON\|Person=1\|PronType=Prs`, 
`Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Dem`, `Clitic=Yes\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Clitic=Yes\|Gender=Fem\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Gender=Fem\|POS=PRON\|Person=3\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Degree=Abs\|Gender=Fem\|Number=Sing\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Tot`, `Clitic=Yes\|POS=AUX\|PronType=Prs\|VerbForm=Inf`, `Gender=Fem\|Number=Plur\|POS=DET\|PronType=Tot`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Dem`, `Degree=Abs\|Gender=Masc\|Number=Sing\|POS=ADJ`, `NumType=Ord\|POS=ADJ`, `POS=DET\|PronType=Rel`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Rel`, `Gender=Masc\|Number=Plur\|POS=PRON\|Poss=Yes\|PronType=Prs`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Imp\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Number=Sing\|POS=PRON\|Person=2\|PronType=Prs`, `Mood=Cnd\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Number=Plur\|POS=PRON\|Person=2\|PronType=Prs`, `Clitic=Yes\|Number=Sing\|POS=VERB\|Person=1\|PronType=Prs\|VerbForm=Inf`, `Mood=Imp\|Number=Sing\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=2\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Mood=Cnd\|Number=Sing\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, 
`Clitic=Yes\|POS=VERB\|PronType=Prs\|VerbForm=Ger`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Cnd\|Number=Sing\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Number=Sing\|POS=PRON\|PronType=Int`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=2\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|Person=2\|PronType=Prs`, `Clitic=Yes\|Number=Plur\|POS=VERB\|Person=2\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Number=Plur\|POS=VERB\|Person=1\|PronType=Prs\|VerbForm=Inf`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=2\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Definite=Def\|POS=DET\|PronType=Art`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `POS=SYM`, `Clitic=Yes\|Mood=Imp\|Number=Sing\|POS=VERB\|Person=2\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Mood=Imp\|Number=Sing\|POS=VERB\|Person=2,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=2\|Tense=Fut\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Degree=Abs\|Gender=Fem\|Number=Plur\|POS=ADJ`, `Number=Sing\|POS=PRON\|PronType=Dem`, `POS=AUX\|VerbForm=Ger`, `Gender=Masc\|Number=Sing\|POS=PRON\|Poss=Yes\|PronType=Prs`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs\|VerbForm=Inf`, `POS=PRON\|PronType=Ind`, `Clitic=Yes\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=1\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `POS=X`, `Gender=Masc\|POS=ADJ`, `Clitic=Yes\|Gender=Fem\|Number=Sing\|POS=AUX\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Gender=Fem\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, 
`Gender=Masc\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Mood=Cnd\|Number=Sing\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Number=Sing\|POS=VERB\|Person=2\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Gender=Fem\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=2\|Tense=Imp\|VerbForm=Fin`, `POS=PART`, `Number=Sing\|POS=VERB\|Tense=Pres\|VerbForm=Part`, `NumType=Ord\|Number=Sing\|POS=ADJ`, `Number=Plur\|POS=DET\|PronType=Int`, `Clitic=Yes\|Mood=Sub\|Number=Plur\|POS=VERB\|Person=1\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=DET\|PronType=Rel`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Clitic=Yes\|Number=Sing\|POS=VERB\|Person=1\|PronType=Prs\|VerbForm=Ger`, `Clitic=Yes\|Number=Sing\|POS=AUX\|Person=1\|PronType=Prs\|VerbForm=Ger`, `Clitic=Yes\|Gender=Masc\|Number=Plur\|POS=AUX\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Mood=Imp\|Number=Plur,Sing\|POS=VERB\|Person=1,2\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Imp\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `NumType=Range\|POS=NUM`, `Number=Plur\|POS=PRON\|PronType=Dem`, `POS=VERB\|Tense=Past\|VerbForm=Part`, `Clitic=Yes\|POS=ADV\|PronType=Prs`, `Clitic=Yes\|Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|POS=PRON\|PronType=Rel`, `Clitic=Yes\|Gender=Masc\|Mood=Imp\|Number=Plur,Sing\|POS=VERB\|Person=2,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Number=Sing\|POS=AUX\|Person=2\|PronType=Prs\|VerbForm=Inf`, `Clitic=Yes\|Number=Sing\|POS=VERB\|Person=2\|PronType=Prs\|VerbForm=Ger`, `Mood=Imp\|Number=Sing\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, 
`Clitic=Yes\|Gender=Fem\|Mood=Imp\|Number=Sing\|POS=VERB\|Person=2,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Past\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Clitic=Yes\|Gender=Masc\|Number=Plur,Sing\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `Definite=Ind\|POS=DET\|PronType=Art`, `Clitic=Yes\|Gender=Fem,Masc\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Definite=Ind\|Gender=Masc\|Number=Plur\|POS=DET\|PronType=Art`, `Definite=Def\|Number=Plur\|POS=ADP\|PronType=Art`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Inf`, `POS=DET\|PronType=Ind`, `Number=Plur\|POS=DET\|PronType=Dem`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Number=Plur\|POS=DET\|PronType=Tot`, `Clitic=Yes\|POS=AUX\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Number=Plur\|POS=PRON\|PronType=Ind`, `Clitic=Yes\|Gender=Fem,Masc\|Number=Plur,Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Clitic=Yes\|Number=Plur\|POS=VERB\|PronType=Prs\|VerbForm=Inf`, `Number=Plur\|POS=PRON\|Poss=Yes\|PronType=Prs`, `Number=Sing\|POS=PRON\|Poss=Yes\|PronType=Prs`, `Number=Plur\|POS=ADP`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=ADV\|Person=3\|PronType=Prs`, `Clitic=Yes\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=1,2\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Mood=Sub\|Number=Sing\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Cnd\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=ADV\|Person=3\|PronType=Prs`, `POS=DET\|PronType=Tot`, `POS=PRON\|PronType=Dem`, 
`Clitic=Yes\|Gender=Masc\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=2,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Definite=Ind\|Number=Sing\|POS=DET\|PronType=Art`, `NumType=Ord\|POS=NUM`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=VERB\|Person=3\|PronType=Prs\|VerbForm=Ger`, `Gender=Masc\|POS=DET\|PronType=Dem`, `Clitic=Yes\|Gender=Masc\|Number=Plur,Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Past\|VerbForm=Part`, `Gender=Masc\|Number=Sing\|POS=NOUN\|Tense=Past\|VerbForm=Part`, `Gender=Masc\|Number=Plur\|POS=DET\|PronType=Int`, `Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Int`, `Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Int`, `Mood=Imp\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Int`, `Gender=Fem\|Number=Sing\|POS=PRON\|PronType=Int`, `Number=Plur\|POS=PRON\|PronType=Int`, `Mood=Cnd\|Number=Sing\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Int`, `Clitic=Yes\|Number=Plur\|POS=PRON\|PronType=Prs`, `Foreign=Yes\|Number=Sing\|POS=X`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Fut\|VerbForm=Fin`, `POS=PRON\|PronType=Prs`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=2\|Tense=Imp\|VerbForm=Fin`, `POS=SCONJ\|PronType=Rel`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `POS=PRON\|Person=3\|PronType=Rel`, `Clitic=Yes\|Number=Plur\|POS=VERB\|Person=2\|PronType=Prs\|VerbForm=Ger`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=3\|VerbForm=Fin`, `Clitic=Yes\|Mood=Ind\|Number=Sing\|POS=VERB\|Person=1,3\|PronType=Prs\|Tense=Past\|VerbForm=Fin`, 
`Mood=Ind\|POS=VERB\|Tense=Pres\|VerbForm=Fin`, `Degree=Cmp\|POS=ADJ`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=2\|Tense=Fut\|VerbForm=Fin`, `Definite=Def\|Number=Plur\|POS=DET\|PronType=Art`, `Number=Sing\|POS=DET\|Poss=Yes\|PronType=Prs`, `Gender=Masc\|Number=Sing\|POS=ADP`, `Gender=Fem\|POS=ADJ`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Fem\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=2,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Masc\|POS=DET\|Poss=Yes\|PronType=Prs`, `Gender=Fem\|Number=Plur\|POS=PROPN`, `Definite=Ind\|Gender=Fem\|Number=Plur\|POS=DET\|PronType=Art`, `Number=Sing\|POS=DET\|PronType=Art`, `Gender=Fem\|Number=Sing\|POS=ADJ\|Poss=Yes\|PronType=Prs`, `Foreign=Yes\|POS=NOUN`, `Clitic=Yes\|Gender=Fem\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=1,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Clitic=Yes\|Gender=Masc\|Mood=Imp\|Number=Plur\|POS=VERB\|Person=1,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=DET`, `Clitic=Yes\|Gender=Fem\|Mood=Imp\|Number=Plur,Sing\|POS=VERB\|Person=1,3\|PronType=Prs\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=DET`, `Number=Sing\|POS=X`, `Foreign=Yes\|Gender=Masc\|POS=X`, `Clitic=Yes\|Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Prs`, `Clitic=Yes\|Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Prs`, `Clitic=Yes\|Definite=Def\|Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Art`, `Gender=Masc\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Fin`, `Definite=Def\|Gender=Fem\|POS=DET`, `Definite=Def\|POS=DET`, `Foreign=Yes\|POS=PROPN`, `NumType=Card\|POS=PROPN`, `Gender=Fem\|Number=Sing\|POS=DET`, `Degree=Abs\|Gender=Masc\|Number=Sing\|POS=ADV`, `Gender=Masc\|Number=Plur\|POS=NOUN\|Tense=Past\|VerbForm=Part`, `Mood=Imp\|Number=Plur\|POS=VERB\|Person=2`, 
`Clitic=Yes\|Number=Plur\|POS=AUX\|Person=1\|PronType=Prs\|VerbForm=Inf`, `Gender=Masc\|Number=Sing\|POS=DET`, `Number=Sing\|POS=DET`, `Gender=Masc\|Number=Sing\|POS=PRON`, `POS=DET` |
  | **`tagger`** | `A`, `AP`, `B`, `BN`, `B_PC`, `CC`, `CS`, `DD`, `DE`, `DI`, `DQ`, `DR`, `E`, `E_RD`, `FB`, `FC`, `FF`, `FS`, `I`, `N`, `NO`, `PART`, `PC`, `PC_PC`, `PD`, `PE`, `PI`, `PP`, `PQ`, `PR`, `RD`, `RI`, `S`, `SP`, `SW`, `SYM`, `T`, `V`, `VA`, `VA_PC`, `VM`, `VM_PC`, `VM_PC_PC`, `V_B`, `V_PC`, `V_PC_PC`, `X` |
  | **`parser`** | `ROOT`, `acl`, `acl:relcl`, `advcl`, `advmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `cc`, `ccomp`, `compound`, `conj`, `cop`, `csubj`, `dep`, `det`, `det:poss`, `det:predet`, `discourse`, `expl`, `expl:impers`, `expl:pass`, `fixed`, `flat`, `flat:foreign`, `flat:name`, `iobj`, `mark`, `nmod`, `nsubj`, `nsubj:pass`, `nummod`, `obj`, `obl`, `obl:agent`, `parataxis`, `punct`, `vocative`, `xcomp` |
  | **`ner`** | `LOC`, `MISC`, `ORG`, `PER` |
 
  </details>
 
  | `TOKEN_P` | 99.80 |
  | `TOKEN_R` | 99.78 |
  | `TOKEN_F` | 99.79 |
+ | `POS_ACC` | 96.98 |
+ | `MORPH_ACC` | 96.84 |
+ | `MORPH_MICRO_P` | 98.32 |
+ | `MORPH_MICRO_R` | 97.66 |
+ | `MORPH_MICRO_F` | 97.98 |
+ | `TAG_ACC` | 96.56 |
+ | `SENTS_P` | 96.68 |
+ | `SENTS_R` | 98.23 |
+ | `SENTS_F` | 97.45 |
+ | `DEP_UAS` | 89.62 |
+ | `DEP_LAS` | 85.61 |
+ | `LEMMA_ACC` | 97.05 |
+ | `ENTS_P` | 85.46 |
+ | `ENTS_R` | 85.15 |
+ | `ENTS_F` | 85.31 |
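
The README percentages above match the new `accuracy.json` fractions multiplied by 100 and rounded to two decimals. A minimal Python sketch of that conversion follows, assuming this is how the table is produced; the helper name `to_readme_percent` is hypothetical and not part of the commit.

```python
# Hypothetical helper (not from the commit): format an accuracy.json
# fraction the way the README metrics table displays it.
def to_readme_percent(fraction: float) -> str:
    return f"{fraction * 100:.2f}"

# Values copied from the new side of the accuracy.json diff.
new_scores = {
    "pos_acc": 0.9697568867,
    "lemma_acc": 0.9705248023,
    "ents_f": 0.8530922299,
}

for name, value in new_scores.items():
    # Prints rows such as: | `POS_ACC` | 96.98 |
    print(f"| `{name.upper()}` | {to_readme_percent(value)} |")
```

The three rows printed here agree with the `POS_ACC`, `LEMMA_ACC`, and `ENTS_F` rows of the table.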
accuracy.json CHANGED
@@ -3,66 +3,66 @@
3
  "token_p": 0.9980235379,
4
  "token_r": 0.9978442468,
5
  "token_f": 0.9979338843,
6
- "pos_acc": 0.9694872601,
7
- "morph_acc": 0.9687191011,
8
- "morph_micro_p": 0.9843827713,
9
- "morph_micro_r": 0.9767074615,
10
- "morph_micro_f": 0.9805300966,
11
  "morph_per_feat": {
12
  "Gender": {
13
- "p": 0.987497438,
14
- "r": 0.9836668028,
15
- "f": 0.9855783983
16
  },
17
  "Number": {
18
- "p": 0.9925686591,
19
- "r": 0.9865125241,
20
- "f": 0.9895313255
21
  },
22
  "NumType": {
23
- "p": 0.9773584906,
24
- "r": 0.9557195572,
25
- "f": 0.9664179104
26
  },
27
  "Definite": {
28
- "p": 0.9917501473,
29
  "r": 0.9982206406,
30
- "f": 0.9949748744
31
  },
32
  "PronType": {
33
- "p": 0.9887687188,
34
  "r": 0.9830438379,
35
- "f": 0.9858979676
36
  },
37
  "Mood": {
38
- "p": 0.960591133,
39
- "r": 0.9363745498,
40
- "f": 0.9483282675
41
  },
42
  "Person": {
43
- "p": 0.97002997,
44
- "r": 0.9399806389,
45
- "f": 0.9547689282
46
  },
47
  "Tense": {
48
- "p": 0.9518486672,
49
- "r": 0.9421276596,
50
- "f": 0.9469632164
51
  },
52
  "VerbForm": {
53
- "p": 0.972027972,
54
- "r": 0.9632709633,
55
- "f": 0.9676296554
56
  },
57
  "Degree": {
58
- "p": 0.875,
59
- "r": 0.8235294118,
60
- "f": 0.8484848485
61
  },
62
  "Clitic": {
63
- "p": 1.0,
64
- "r": 0.9465240642,
65
- "f": 0.9725274725
66
  },
67
  "Poss": {
68
  "p": 1.0,
@@ -76,76 +76,76 @@
76
  },
77
  "Foreign": {
78
  "p": 1.0,
79
- "r": 0.8,
80
- "f": 0.8888888889
81
  }
82
  },
83
- "tag_acc": 0.9652631106,
84
- "sents_p": 0.9718804921,
85
- "sents_r": 0.9804964539,
86
- "sents_f": 0.9761694616,
87
- "dep_uas": 0.8970414201,
88
- "dep_las": 0.8574221765,
89
  "dep_las_per_type": {
90
  "root": {
91
- "p": 0.8769771529,
92
- "r": 0.884751773,
93
- "f": 0.880847308
94
  },
95
  "flat:name": {
96
- "p": 0.8757763975,
97
- "r": 0.8924050633,
98
- "f": 0.8840125392
99
  },
100
  "case": {
101
- "p": 0.9739551787,
102
- "r": 0.9786975046,
103
- "f": 0.9763205829
104
  },
105
  "nmod": {
106
- "p": 0.7953394124,
107
- "r": 0.8026584867,
108
- "f": 0.7989821883
109
  },
110
  "nummod": {
111
- "p": 0.8913043478,
112
- "r": 0.8913043478,
113
- "f": 0.8913043478
114
  },
115
  "det": {
116
- "p": 0.9706959707,
117
- "r": 0.9742647059,
118
- "f": 0.9724770642
119
  },
120
  "nsubj": {
121
- "p": 0.8181818182,
122
- "r": 0.8166023166,
123
- "f": 0.8173913043
124
  },
125
  "aux": {
126
- "p": 0.9252336449,
127
- "r": 0.9124423963,
128
- "f": 0.9187935035
129
  },
130
  "advmod": {
131
- "p": 0.8123569794,
132
- "r": 0.8142201835,
133
- "f": 0.8132875143
134
  },
135
  "obj": {
136
- "p": 0.8308823529,
137
- "r": 0.8475,
138
- "f": 0.8391089109
139
  },
140
  "cc": {
141
- "p": 0.9028213166,
142
- "r": 0.8834355828,
143
- "f": 0.8930232558
144
  },
145
  "conj": {
146
- "p": 0.6709183673,
147
- "r": 0.7108108108,
148
- "f": 0.6902887139
149
  },
150
  "det:predet": {
151
  "p": 0.9473684211,
@@ -153,74 +153,74 @@
153
  "f": 0.972972973
154
  },
155
  "amod": {
156
- "p": 0.9044684129,
157
- "r": 0.88005997,
158
- "f": 0.8920972644
159
  },
160
  "mark": {
161
- "p": 0.8821138211,
162
- "r": 0.8966942149,
163
- "f": 0.8893442623
164
  },
165
  "cop": {
166
- "p": 0.8257575758,
167
- "r": 0.8650793651,
168
- "f": 0.8449612403
169
  },
170
  "xcomp": {
171
- "p": 0.6774193548,
172
- "r": 0.65625,
173
- "f": 0.6666666667
174
  },
175
  "obl": {
176
- "p": 0.7908396947,
177
- "r": 0.7573099415,
178
- "f": 0.7737117252
179
  },
180
  "acl:relcl": {
181
- "p": 0.7259259259,
182
- "r": 0.7153284672,
183
- "f": 0.7205882353
184
  },
185
  "acl": {
186
- "p": 0.6914893617,
187
- "r": 0.5652173913,
188
- "f": 0.6220095694
189
  },
190
  "ccomp": {
191
- "p": 0.6909090909,
192
- "r": 0.6129032258,
193
- "f": 0.6495726496
194
  },
195
  "expl": {
196
- "p": 0.8875,
197
- "r": 0.9102564103,
198
- "f": 0.8987341772
199
  },
200
  "nsubj:pass": {
201
- "p": 0.8955223881,
202
- "r": 0.7407407407,
203
- "f": 0.8108108108
204
  },
205
  "aux:pass": {
206
- "p": 0.8414634146,
207
- "r": 0.8734177215,
208
- "f": 0.8571428571
209
  },
210
  "parataxis": {
211
- "p": 0.2631578947,
212
- "r": 0.1612903226,
213
- "f": 0.2
214
- },
215
- "advcl": {
216
- "p": 0.5350318471,
217
- "r": 0.5915492958,
218
- "f": 0.5618729097
219
  },
220
  "obl:agent": {
221
- "p": 0.7380952381,
222
- "r": 0.7380952381,
223
- "f": 0.7380952381
224
  },
225
  "det:poss": {
226
  "p": 0.9714285714,
@@ -233,9 +233,9 @@
233
  "f": 1.0
234
  },
235
  "appos": {
236
- "p": 0.3947368421,
237
- "r": 0.3846153846,
238
- "f": 0.3896103896
239
  },
240
  "dep": {
241
  "p": 0.0,
@@ -243,24 +243,24 @@
243
  "f": 0.0
244
  },
245
  "iobj": {
246
- "p": 0.7619047619,
247
- "r": 0.8,
248
- "f": 0.7804878049
249
- },
250
- "compound": {
251
- "p": 0.6296296296,
252
- "r": 0.6538461538,
253
- "f": 0.641509434
254
  },
255
  "expl:impers": {
256
- "p": 0.6666666667,
257
- "r": 0.75,
258
- "f": 0.7058823529
259
  },
260
  "csubj": {
261
- "p": 0.5454545455,
262
- "r": 0.4615384615,
263
- "f": 0.5
264
  },
265
  "discourse": {
266
  "p": 0.0,
@@ -268,14 +268,14 @@
268
  "f": 0.0
269
  },
270
  "fixed": {
271
- "p": 0.8846153846,
272
- "r": 0.8214285714,
273
- "f": 0.8518518519
274
  },
275
  "expl:pass": {
276
- "p": 0.9,
277
  "r": 0.8181818182,
278
- "f": 0.8571428571
279
  },
280
  "orphan": {
281
  "p": 0.0,
@@ -283,41 +283,41 @@
283
  "f": 0.0
284
  },
285
  "flat:foreign": {
286
- "p": 1.0,
287
- "r": 0.3333333333,
288
- "f": 0.5
289
  },
290
  "vocative": {
291
- "p": 0.3333333333,
292
  "r": 0.3333333333,
293
- "f": 0.3333333333
294
  }
295
  },
296
- "lemma_acc": 0.8640366643,
297
- "ents_p": 0.8556688921,
298
- "ents_r": 0.8519161046,
299
- "ents_f": 0.8537883746,
300
  "ents_per_type": {
301
  "LOC": {
302
- "p": 0.8656947539,
303
- "r": 0.8978065022,
304
- "f": 0.8814582652
305
  },
306
  "PER": {
307
- "p": 0.889485747,
308
- "r": 0.8826581379,
309
- "f": 0.88605879
310
  },
311
  "MISC": {
312
- "p": 0.7636830965,
313
- "r": 0.6940093432,
314
- "f": 0.7271811114
315
  },
316
  "ORG": {
317
- "p": 0.8235586481,
318
- "r": 0.7490958409,
319
- "f": 0.7845643939
320
  }
321
  },
322
- "speed": 10840.1050228952
323
  }
 
3
  "token_p": 0.9980235379,
4
  "token_r": 0.9978442468,
5
  "token_f": 0.9979338843,
6
+ "pos_acc": 0.9697568867,
7
+ "morph_acc": 0.9683595506,
8
+ "morph_micro_p": 0.9831577901,
9
+ "morph_micro_r": 0.9765594157,
10
+ "morph_micro_f": 0.9798474946,
11
  "morph_per_feat": {
12
  "Gender": {
13
+ "p": 0.9887017256,
14
+ "r": 0.982645978,
15
+ "f": 0.9856645505
16
  },
17
  "Number": {
18
+ "p": 0.992398512,
19
+ "r": 0.9852280026,
20
+ "f": 0.9888002578
21
  },
22
  "NumType": {
23
+ "p": 0.9811320755,
24
+ "r": 0.9594095941,
25
+ "f": 0.9701492537
26
  },
27
  "Definite": {
28
+ "p": 0.992920354,
29
  "r": 0.9982206406,
30
+ "f": 0.9955634428
31
  },
32
  "PronType": {
33
+ "p": 0.9875363523,
34
  "r": 0.9830438379,
35
+ "f": 0.9852849741
36
  },
37
  "Mood": {
38
+ "p": 0.9586374696,
39
+ "r": 0.9459783914,
40
+ "f": 0.952265861
41
  },
42
  "Person": {
43
+ "p": 0.9654491609,
44
+ "r": 0.9467570184,
45
+ "f": 0.9560117302
46
  },
47
  "Tense": {
48
+ "p": 0.9443969204,
49
+ "r": 0.9395744681,
50
+ "f": 0.9419795222
51
  },
52
  "VerbForm": {
53
+ "p": 0.9651810585,
54
+ "r": 0.9604989605,
55
+ "f": 0.9628343175
56
  },
57
  "Degree": {
58
+ "p": 0.8333333333,
59
+ "r": 0.8823529412,
60
+ "f": 0.8571428571
61
  },
62
  "Clitic": {
63
+ "p": 0.9832402235,
64
+ "r": 0.9411764706,
65
+ "f": 0.9617486339
66
  },
67
  "Poss": {
68
  "p": 1.0,
 
76
  },
77
  "Foreign": {
78
  "p": 1.0,
79
+ "r": 1.0,
80
+ "f": 1.0
81
  }
82
  },
83
+ "tag_acc": 0.9655792217,
84
+ "sents_p": 0.9668411867,
85
+ "sents_r": 0.9822695035,
86
+ "sents_f": 0.9744942832,
87
+ "dep_uas": 0.8962288419,
88
+ "dep_las": 0.8560991923,
89
  "dep_las_per_type": {
90
  "root": {
91
+ "p": 0.8778359511,
92
+ "r": 0.8918439716,
93
+ "f": 0.8847845207
94
  },
95
  "flat:name": {
96
+ "p": 0.9473684211,
97
+ "r": 0.9113924051,
98
+ "f": 0.9290322581
99
  },
100
  "case": {
101
+ "p": 0.9721718088,
102
+ "r": 0.9780888618,
103
+ "f": 0.9751213592
104
  },
105
  "nmod": {
106
+ "p": 0.7912423625,
107
+ "r": 0.7944785276,
108
+ "f": 0.7928571429
109
  },
110
  "nummod": {
111
+ "p": 0.8888888889,
112
+ "r": 0.8695652174,
113
+ "f": 0.8791208791
114
  },
115
  "det": {
116
+ "p": 0.967063129,
117
+ "r": 0.9715073529,
118
+ "f": 0.9692801467
119
  },
120
  "nsubj": {
121
+ "p": 0.809073724,
122
+ "r": 0.8262548263,
123
+ "f": 0.817574021
124
  },
125
  "aux": {
126
+ "p": 0.9266055046,
127
+ "r": 0.930875576,
128
+ "f": 0.9287356322
129
  },
130
  "advmod": {
131
+ "p": 0.802690583,
132
+ "r": 0.8211009174,
133
+ "f": 0.8117913832
134
  },
135
  "obj": {
136
+ "p": 0.8095238095,
137
+ "r": 0.85,
138
+ "f": 0.8292682927
139
  },
140
  "cc": {
141
+ "p": 0.9034267913,
142
+ "r": 0.8895705521,
143
+ "f": 0.8964451314
144
  },
145
  "conj": {
146
+ "p": 0.6485788114,
147
+ "r": 0.6783783784,
148
+ "f": 0.6631439894
149
  },
150
  "det:predet": {
151
  "p": 0.9473684211,
 
153
  "f": 0.972972973
154
  },
155
  "amod": {
156
+ "p": 0.8977099237,
157
+ "r": 0.8815592204,
158
+ "f": 0.8895612708
159
  },
160
  "mark": {
161
+ "p": 0.9090909091,
162
+ "r": 0.9090909091,
163
+ "f": 0.9090909091
164
  },
165
  "cop": {
166
+ "p": 0.8307692308,
167
+ "r": 0.8571428571,
168
+ "f": 0.84375
169
  },
170
  "xcomp": {
171
+ "p": 0.6835443038,
172
+ "r": 0.5625,
173
+ "f": 0.6171428571
174
  },
175
  "obl": {
176
+ "p": 0.7736434109,
177
+ "r": 0.7295321637,
178
+ "f": 0.7509405568
179
  },
180
  "acl:relcl": {
181
+ "p": 0.7674418605,
182
+ "r": 0.7226277372,
183
+ "f": 0.7443609023
184
  },
185
  "acl": {
186
+ "p": 0.6347826087,
187
+ "r": 0.6347826087,
188
+ "f": 0.6347826087
189
  },
190
  "ccomp": {
191
+ "p": 0.6842105263,
192
+ "r": 0.6290322581,
193
+ "f": 0.6554621849
194
  },
195
  "expl": {
196
+ "p": 0.8974358974,
197
+ "r": 0.8974358974,
198
+ "f": 0.8974358974
199
  },
200
  "nsubj:pass": {
201
+ "p": 0.8157894737,
202
+ "r": 0.7654320988,
203
+ "f": 0.7898089172
204
  },
205
  "aux:pass": {
206
+ "p": 0.8701298701,
207
+ "r": 0.8481012658,
208
+ "f": 0.858974359
209
  },
210
  "parataxis": {
211
+ "p": 0.4117647059,
212
+ "r": 0.2258064516,
213
+ "f": 0.2916666667
214
  },
215
  "obl:agent": {
216
+ "p": 0.7727272727,
217
+ "r": 0.8095238095,
218
+ "f": 0.7906976744
219
+ },
220
+ "advcl": {
221
+ "p": 0.5906040268,
222
+ "r": 0.6197183099,
223
+ "f": 0.6048109966
224
  },
225
  "det:poss": {
226
  "p": 0.9714285714,
 
233
  "f": 1.0
234
  },
235
  "appos": {
236
+ "p": 0.3529411765,
237
+ "r": 0.3076923077,
238
+ "f": 0.3287671233
239
  },
240
  "dep": {
241
  "p": 0.0,
 
243
  "f": 0.0
244
  },
245
  "iobj": {
246
+ "p": 0.8571428571,
247
+ "r": 0.9,
248
+ "f": 0.8780487805
249
  },
250
  "expl:impers": {
251
+ "p": 0.7777777778,
252
+ "r": 0.875,
253
+ "f": 0.8235294118
254
  },
255
  "csubj": {
256
+ "p": 0.5833333333,
257
+ "r": 0.5384615385,
258
+ "f": 0.56
259
+ },
260
+ "compound": {
261
+ "p": 0.5925925926,
262
+ "r": 0.6153846154,
263
+ "f": 0.6037735849
264
  },
265
  "discourse": {
266
  "p": 0.0,
 
268
  "f": 0.0
269
  },
270
  "fixed": {
271
+ "p": 1.0,
272
+ "r": 0.7857142857,
273
+ "f": 0.88
274
  },
275
  "expl:pass": {
276
+ "p": 0.8181818182,
277
  "r": 0.8181818182,
278
+ "f": 0.8181818182
279
  },
280
  "orphan": {
281
  "p": 0.0,
 
283
  "f": 0.0
284
  },
285
  "flat:foreign": {
286
+ "p": 0.75,
287
+ "r": 1.0,
288
+ "f": 0.8571428571
289
  },
290
  "vocative": {
291
+ "p": 0.5,
292
  "r": 0.3333333333,
293
+ "f": 0.4
294
  }
295
  },
296
+ "lemma_acc": 0.9705248023,
297
+ "ents_p": 0.8546443384,
298
+ "ents_r": 0.8515457487,
299
+ "ents_f": 0.8530922299,
300
  "ents_per_type": {
301
  "LOC": {
302
+ "p": 0.8648893588,
303
+ "r": 0.8971406189,
304
+ "f": 0.8807198339
305
  },
306
  "PER": {
307
+ "p": 0.8897909333,
308
+ "r": 0.8802416489,
309
+ "f": 0.884990532
310
  },
311
  "MISC": {
312
+ "p": 0.7606463304,
313
+ "r": 0.6920857378,
314
+ "f": 0.7247482014
315
  },
316
  "ORG": {
317
+ "p": 0.8209137552,
318
+ "r": 0.7594936709,
319
+ "f": 0.7890102149
320
  }
321
  },
322
+ "speed": 10947.1047973316
323
  }
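
Each `p`/`r`/`f` triple in `accuracy.json` is consistent with `f` being the harmonic mean of precision and recall. The Python sketch below checks this against a few triples copied from the new side of the diff. The function name `f_score` is an assumption made for illustration, and the zero guard reflects the labels whose `p` and `f` are both `0.0`.

```python
def f_score(p: float, r: float) -> float:
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if p + r == 0.0:
        return 0.0
    return 2 * p * r / (p + r)

# (p, r, f) triples copied from the new side of the accuracy.json diff.
rows = {
    "fixed": (1.0, 0.7857142857, 0.88),
    "flat:foreign": (0.75, 1.0, 0.8571428571),
    "vocative": (0.5, 0.3333333333, 0.4),
    "Foreign": (1.0, 1.0, 1.0),
}

for label, (p, r, f) in rows.items():
    # The stored values are rounded to 10 decimals, so compare with a tolerance.
    assert abs(f_score(p, r) - f) < 1e-6, label
```

Labels with very few test instances, such as `vocative` or `flat:foreign`, can swing widely between runs, so their per-type changes are weaker evidence than the aggregate scores.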
attribute_ruler/patterns CHANGED
Binary files a/attribute_ruler/patterns and b/attribute_ruler/patterns differ
 
config.cfg CHANGED
@@ -10,7 +10,7 @@ seed = 0
10
 
11
  [nlp]
12
  lang = "it"
13
- pipeline = ["tok2vec","morphologizer","tagger","parser","senter","attribute_ruler","lemmatizer","ner"]
14
  disabled = ["senter"]
15
  before_creation = null
16
  after_creation = null
@@ -26,11 +26,22 @@ scorer = {"@scorers":"spacy.attribute_ruler_scorer.v1"}
26
  validate = false
27
 
28
  [components.lemmatizer]
29
- factory = "lemmatizer"
30
- mode = "pos_lookup"
31
- model = null
32
  overwrite = false
33
  scorer = {"@scorers":"spacy.lemmatizer_scorer.v1"}
34
 
35
  [components.morphologizer]
36
  factory = "morphologizer"
@@ -39,8 +50,9 @@ overwrite = true
39
  scorer = {"@scorers":"spacy.morphologizer_scorer.v1"}
40
 
41
  [components.morphologizer.model]
42
- @architectures = "spacy.Tagger.v1"
43
  nO = null
 
44
 
45
  [components.morphologizer.model.tok2vec]
46
  @architectures = "spacy.Tok2VecListener.v1"
@@ -70,7 +82,7 @@ nO = null
70
  @architectures = "spacy.MultiHashEmbed.v2"
71
  width = 96
72
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
73
- rows = [5000,2500,2500,2500,100]
74
  include_static_vectors = false
75
 
76
  [components.ner.model.tok2vec.encode]
@@ -108,8 +120,9 @@ overwrite = false
108
  scorer = {"@scorers":"spacy.senter_scorer.v1"}
109
 
110
  [components.senter.model]
111
- @architectures = "spacy.Tagger.v1"
112
  nO = null
 
113
 
114
  [components.senter.model.tok2vec]
115
  @architectures = "spacy.Tok2Vec.v2"
@@ -130,12 +143,14 @@ maxout_pieces = 2
130
 
131
  [components.tagger]
132
  factory = "tagger"
 
133
  overwrite = false
134
  scorer = {"@scorers":"spacy.tagger_scorer.v1"}
135
 
136
  [components.tagger.model]
137
- @architectures = "spacy.Tagger.v1"
138
  nO = null
 
139
 
140
  [components.tagger.model.tok2vec]
141
  @architectures = "spacy.Tok2VecListener.v1"
@@ -152,7 +167,7 @@ factory = "tok2vec"
152
  @architectures = "spacy.MultiHashEmbed.v2"
153
  width = ${components.tok2vec.model.encode:width}
154
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
155
- rows = [5000,2500,2500,2500,100]
156
  include_static_vectors = false
157
 
158
  [components.tok2vec.model.encode]
@@ -189,7 +204,7 @@ dropout = 0.1
189
  accumulate_gradient = 1
190
  patience = 5000
191
  max_epochs = 0
192
- max_steps = 0
193
  eval_frequency = 1000
194
  frozen_components = []
195
  before_to_disk = null
@@ -224,18 +239,18 @@ eps = 0.00000001
224
  learn_rate = 0.001
225
 
226
  [training.score_weights]
227
- pos_acc = 0.06
228
- morph_acc = 0.05
229
  morph_per_feat = null
230
- tag_acc = 0.06
231
  dep_uas = 0.0
232
- dep_las = 0.16
233
  dep_las_per_type = null
234
  sents_p = null
235
  sents_r = null
236
- sents_f = 0.02
237
- lemma_acc = 0.5
238
- ents_f = 0.16
239
  ents_p = 0.0
240
  ents_r = 0.0
241
  ents_per_type = null
@@ -252,6 +267,13 @@ after_init = null
252
 
253
  [initialize.components]
254
 
255
  [initialize.components.morphologizer]
256
 
257
  [initialize.components.morphologizer.labels]
 
10
 
11
  [nlp]
12
  lang = "it"
13
+ pipeline = ["tok2vec","morphologizer","tagger","parser","lemmatizer","senter","attribute_ruler","ner"]
14
  disabled = ["senter"]
15
  before_creation = null
16
  after_creation = null
 
26
  validate = false
27
 
28
  [components.lemmatizer]
29
+ factory = "trainable_lemmatizer"
30
+ backoff = "orth"
31
+ min_tree_freq = 3
32
  overwrite = false
33
  scorer = {"@scorers":"spacy.lemmatizer_scorer.v1"}
34
+ top_k = 1
35
+
36
+ [components.lemmatizer.model]
37
+ @architectures = "spacy.Tagger.v2"
38
+ nO = null
39
+ normalize = false
40
+
41
+ [components.lemmatizer.model.tok2vec]
42
+ @architectures = "spacy.Tok2VecListener.v1"
43
+ width = ${components.tok2vec.model.encode:width}
44
+ upstream = "tok2vec"
45
 
46
  [components.morphologizer]
47
  factory = "morphologizer"
 
50
  scorer = {"@scorers":"spacy.morphologizer_scorer.v1"}
51
 
52
  [components.morphologizer.model]
53
+ @architectures = "spacy.Tagger.v2"
54
  nO = null
55
+ normalize = false
56
 
57
  [components.morphologizer.model.tok2vec]
58
  @architectures = "spacy.Tok2VecListener.v1"
 
82
  @architectures = "spacy.MultiHashEmbed.v2"
83
  width = 96
84
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
85
+ rows = [5000,1000,2500,2500,50]
86
  include_static_vectors = false
87
 
88
  [components.ner.model.tok2vec.encode]
 
120
  scorer = {"@scorers":"spacy.senter_scorer.v1"}
121
 
122
  [components.senter.model]
123
+ @architectures = "spacy.Tagger.v2"
124
  nO = null
125
+ normalize = false
126
 
127
  [components.senter.model.tok2vec]
128
  @architectures = "spacy.Tok2Vec.v2"
 
143
 
144
  [components.tagger]
145
  factory = "tagger"
146
+ neg_prefix = "!"
147
  overwrite = false
148
  scorer = {"@scorers":"spacy.tagger_scorer.v1"}
149
 
150
  [components.tagger.model]
151
+ @architectures = "spacy.Tagger.v2"
152
  nO = null
153
+ normalize = false
154
 
155
  [components.tagger.model.tok2vec]
156
  @architectures = "spacy.Tok2VecListener.v1"
 
167
  @architectures = "spacy.MultiHashEmbed.v2"
168
  width = ${components.tok2vec.model.encode:width}
169
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
170
+ rows = [5000,1000,2500,2500,50]
171
  include_static_vectors = false
172
 
173
  [components.tok2vec.model.encode]
 
204
  accumulate_gradient = 1
205
  patience = 5000
206
  max_epochs = 0
207
+ max_steps = 100000
208
  eval_frequency = 1000
209
  frozen_components = []
210
  before_to_disk = null
 
239
  learn_rate = 0.001
240
 
241
  [training.score_weights]
242
+ pos_acc = 0.1
243
+ morph_acc = 0.09
244
  morph_per_feat = null
245
+ tag_acc = 0.1
246
  dep_uas = 0.0
247
+ dep_las = 0.29
248
  dep_las_per_type = null
249
  sents_p = null
250
  sents_r = null
251
+ sents_f = 0.04
252
+ lemma_acc = 0.1
253
+ ents_f = 0.29
254
  ents_p = 0.0
255
  ents_r = 0.0
256
  ents_per_type = null
 
267
 
268
  [initialize.components]
269
 
270
+ [initialize.components.lemmatizer]
271
+
272
+ [initialize.components.lemmatizer.labels]
273
+ @readers = "spacy.read_labels.v1"
274
+ path = "corpus/labels/trainable_lemmatizer.json"
275
+ require = false
276
+
277
  [initialize.components.morphologizer]
278
 
279
  [initialize.components.morphologizer.labels]
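
The `[training.score_weights]` block determines how individual metrics contribute to the single score used to pick the best checkpoint. The Python sketch below assumes the final score is a plain weighted sum of the metrics, with `null` weights simply left out. The function name `weighted_score` is hypothetical, and the inputs are the new weights and new `accuracy.json` values from this commit.

```python
# New weights from [training.score_weights]; entries set to null are omitted.
score_weights = {
    "pos_acc": 0.1, "morph_acc": 0.09, "tag_acc": 0.1, "dep_uas": 0.0,
    "dep_las": 0.29, "sents_f": 0.04, "lemma_acc": 0.1, "ents_f": 0.29,
    "ents_p": 0.0, "ents_r": 0.0,
}

# Matching values from the new side of the accuracy.json diff.
scores = {
    "pos_acc": 0.9697568867, "morph_acc": 0.9683595506,
    "tag_acc": 0.9655792217, "dep_uas": 0.8962288419,
    "dep_las": 0.8560991923, "sents_f": 0.9744942832,
    "lemma_acc": 0.9705248023, "ents_f": 0.8530922299,
    "ents_p": 0.8546443384, "ents_r": 0.8515457487,
}

def weighted_score(scores: dict, weights: dict) -> float:
    """Weighted sum of metrics; a metric missing from `scores` counts as 0.0."""
    return sum(w * scores.get(name, 0.0) for name, w in weights.items())

print(f"sum of weights: {sum(score_weights.values()):.2f}")
print(f"weighted score: {weighted_score(scores, score_weights):.4f}")
```

The displayed weights sum to 1.01 rather than 1.0 on both sides of the diff, which is consistent with each weight having been rounded to two decimals. The main change is that `lemma_acc` drops from 0.5 to 0.1 now that the lemmatizer is trained alongside the other components, with the remainder redistributed to the other metrics.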
it_core_news_sm-any-py3-none-any.whl CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:38f5f87d014e9d8aa8d5447425a5176cd5aef71bdf7950da1be0a40217cd983e
3
- size 21364123
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e9e11b62072f1110ddca05008cfa922c1368c40093977ab7b76f11bb77abcda7
3
+ size 13021864
lemmatizer/cfg ADDED
@@ -0,0 +1,729 @@
1
+ {
2
+ "labels":[
3
+ 1,
4
+ 2,
5
+ 6,
6
+ 8,
7
+ 10,
8
+ 12,
9
+ 15,
10
+ 17,
11
+ 19,
12
+ 21,
13
+ 23,
14
+ 25,
15
+ 28,
16
+ 31,
17
+ 35,
18
+ 37,
19
+ 39,
20
+ 41,
21
+ 43,
22
+ 45,
23
+ 47,
24
+ 49,
25
+ 50,
26
+ 52,
27
+ 54,
28
+ 56,
29
+ 58,
30
+ 60,
31
+ 63,
32
+ 66,
33
+ 69,
34
+ 71,
35
+ 74,
36
+ 76,
37
+ 78,
38
+ 80,
39
+ 82,
40
+ 84,
41
+ 87,
42
+ 89,
43
+ 92,
44
+ 95,
45
+ 97,
46
+ 99,
47
+ 101,
48
+ 104,
49
+ 106,
50
+ 108,
51
+ 110,
52
+ 112,
53
+ 113,
54
+ 115,
55
+ 117,
56
+ 119,
57
+ 121,
58
+ 123,
59
+ 125,
60
+ 127,
61
+ 128,
62
+ 130,
63
+ 133,
64
+ 135,
65
+ 139,
66
+ 141,
67
+ 142,
68
+ 144,
69
+ 145,
70
+ 147,
71
+ 150,
72
+ 152,
73
+ 154,
74
+ 157,
75
+ 159,
76
+ 160,
77
+ 162,
78
+ 164,
79
+ 165,
80
+ 167,
81
+ 169,
82
+ 171,
83
+ 174,
84
+ 176,
85
+ 178,
86
+ 181,
87
+ 184,
88
+ 187,
89
+ 189,
90
+ 191,
91
+ 195,
92
+ 196,
93
+ 198,
94
+ 200,
95
+ 202,
96
+ 205,
97
+ 207,
98
+ 208,
99
+ 210,
100
+ 212,
101
+ 214,
102
+ 216,
103
+ 218,
104
+ 221,
105
+ 223,
106
+ 226,
107
+ 228,
108
+ 229,
109
+ 231,
110
+ 233,
111
+ 234,
112
+ 235,
113
+ 237,
114
+ 239,
115
+ 241,
116
+ 243,
117
+ 244,
118
+ 247,
119
+ 249,
120
+ 250,
121
+ 251,
122
+ 253,
123
+ 255,
124
+ 257,
125
+ 259,
126
+ 261,
127
+ 263,
128
+ 265,
129
+ 269,
130
+ 271,
131
+ 273,
132
+ 275,
133
+ 277,
134
+ 279,
135
+ 281,
136
+ 283,
137
+ 285,
138
+ 287,
139
+ 289,
140
+ 291,
141
+ 294,
142
+ 297,
143
+ 299,
144
+ 301,
145
+ 302,
146
+ 303,
147
+ 305,
148
+ 307,
149
+ 309,
150
+ 312,
151
+ 314,
152
+ 315,
153
+ 318,
154
+ 320,
155
+ 322,
156
+ 324,
157
+ 326,
158
+ 328,
159
+ 331,
160
+ 334,
161
+ 335,
162
+ 338,
163
+ 340,
164
+ 342,
165
+ 347,
166
+ 349,
167
+ 353,
168
+ 354,
169
+ 356,
170
+ 360,
171
+ 363,
172
+ 365,
173
+ 366,
174
+ 368,
175
+ 370,
176
+ 372,
177
+ 374,
178
+ 376,
179
+ 377,
180
+ 378,
181
+ 379,
182
+ 382,
183
+ 383,
184
+ 385,
185
+ 389,
186
+ 392,
187
+ 393,
188
+ 396,
189
+ 398,
190
+ 402,
191
+ 403,
192
+ 405,
193
+ 407,
194
+ 151,
195
+ 409,
196
+ 410,
197
+ 413,
198
+ 415,
199
+ 417,
200
+ 420,
201
+ 423,
202
+ 427,
203
+ 428,
204
+ 430,
205
+ 432,
206
+ 433,
207
+ 435,
208
+ 437,
209
+ 439,
210
+ 441,
211
+ 445,
212
+ 446,
213
+ 449,
214
+ 451,
215
+ 453,
216
+ 454,
217
+ 455,
218
+ 457,
219
+ 458,
220
+ 461,
221
+ 463,
222
+ 465,
223
+ 170,
224
+ 467,
225
+ 469,
226
+ 471,
227
+ 473,
228
+ 476,
229
+ 477,
230
+ 478,
231
+ 479,
232
+ 481,
233
+ 483,
234
+ 485,
235
+ 488,
236
+ 489,
237
+ 491,
238
+ 494,
239
+ 498,
240
+ 500,
241
+ 502,
242
+ 504,
243
+ 507,
244
+ 509,
245
+ 511,
246
+ 515,
247
+ 518,
248
+ 520,
249
+ 521,
250
+ 522,
251
+ 523,
252
+ 525,
253
+ 527,
254
+ 530,
255
+ 531,
256
+ 533,
257
+ 535,
258
+ 538,
259
+ 539,
260
+ 542,
261
+ 544,
262
+ 546,
263
+ 548,
264
+ 549,
265
+ 550,
266
+ 553,
267
+ 555,
268
+ 558,
269
+ 560,
270
+ 561,
271
+ 562,
272
+ 564,
273
+ 565,
274
+ 567,
275
+ 570,
276
+ 573,
277
+ 575,
278
+ 578,
279
+ 579,
280
+ 582,
281
+ 584,
282
+ 586,
283
+ 588,
284
+ 590,
285
+ 592,
286
+ 594,
287
+ 596,
288
+ 598,
289
+ 601,
290
+ 602,
291
+ 603,
292
+ 605,
293
+ 607,
294
+ 610,
295
+ 611,
296
+ 612,
297
+ 614,
298
+ 616,
299
+ 618,
300
+ 620,
301
+ 623,
302
+ 625,
303
+ 628,
304
+ 630,
305
+ 632,
306
+ 634,
307
+ 635,
308
+ 638,
309
+ 639,
310
+ 641,
311
+ 642,
312
+ 643,
313
+ 647,
314
+ 650,
315
+ 654,
316
+ 656,
317
+ 657,
318
+ 658,
319
+ 660,
320
+ 662,
321
+ 663,
322
+ 665,
323
+ 668,
324
+ 669,
325
+ 673,
326
+ 675,
327
+ 678,
328
+ 680,
329
+ 682,
330
+ 684,
331
+ 686,
332
+ 688,
333
+ 690,
334
+ 693,
335
+ 695,
336
+ 697,
337
+ 699,
338
+ 701,
339
+ 703,
340
+ 705,
341
+ 706,
342
+ 709,
343
+ 711,
344
+ 713,
345
+ 716,
346
+ 718,
347
+ 720,
348
+ 722,
349
+ 724,
350
+ 726,
351
+ 727,
352
+ 730,
353
+ 732,
354
+ 733,
355
+ 734,
356
+ 736,
357
+ 738,
358
+ 741,
359
+ 744,
360
+ 747,
361
+ 749,
362
+ 751,
363
+ 752,
364
+ 754,
365
+ 757,
366
+ 761,
367
+ 762,
368
+ 764,
369
+ 766,
370
+ 770,
371
+ 772,
372
+ 774,
373
+ 776,
374
+ 777,
375
+ 780,
376
+ 782,
377
+ 785,
378
+ 787,
379
+ 789,
380
+ 791,
381
+ 792,
382
+ 794,
383
+ 796,
384
+ 798,
385
+ 800,
386
+ 802,
387
+ 805,
388
+ 807,
389
+ 808,
390
+ 810,
391
+ 812,
392
+ 813,
393
+ 814,
394
+ 816,
395
+ 818,
396
+ 822,
397
+ 824,
398
+ 826,
399
+ 828,
400
+ 829,
401
+ 832,
402
+ 833,
403
+ 835,
404
+ 837,
405
+ 838,
406
+ 839,
407
+ 840,
408
+ 841,
409
+ 842,
410
+ 844,
411
+ 847,
412
+ 850,
413
+ 853,
414
+ 855,
415
+ 856,
416
+ 859,
417
+ 860,
418
+ 862,
419
+ 863,
420
+ 864,
421
+ 865,
422
+ 867,
423
+ 868,
424
+ 870,
425
+ 873,
426
+ 875,
427
+ 877,
428
+ 879,
429
+ 880,
430
+ 883,
431
+ 886,
432
+ 887,
433
+ 890,
434
+ 891,
435
+ 892,
436
+ 894,
437
+ 897,
438
+ 898,
439
+ 901,
440
+ 904,
441
+ 905,
442
+ 906,
443
+ 872,
444
+ 907,
445
+ 908,
446
+ 909,
447
+ 910,
448
+ 911,
449
+ 913,
450
+ 914,
451
+ 915,
452
+ 916,
453
+ 919,
454
+ 920,
455
+ 922,
456
+ 925,
457
+ 926,
458
+ 930,
459
+ 932,
460
+ 934,
461
+ 935,
462
+ 936,
463
+ 938,
464
+ 939,
465
+ 941,
466
+ 944,
467
+ 946,
468
+ 949,
469
+ 950,
470
+ 954,
471
+ 955,
472
+ 958,
473
+ 960,
474
+ 964,
475
+ 967,
476
+ 970,
477
+ 973,
478
+ 974,
479
+ 976,
480
+ 977,
481
+ 979,
482
+ 982,
483
+ 983,
484
+ 984,
485
+ 986,
486
+ 987,
487
+ 989,
488
+ 990,
489
+ 993,
490
+ 995,
491
+ 996,
492
+ 999,
493
+ 1001,
494
+ 1004,
495
+ 1006,
496
+ 1008,
497
+ 1009,
498
+ 1011,
499
+ 1013,
500
+ 1015,
501
+ 1017,
502
+ 1018,
503
+ 1019,
504
+ 1021,
505
+ 1023,
506
+ 1025,
507
+ 1027,
508
+ 1028,
509
+ 1031,
510
+ 1034,
511
+ 1036,
512
+ 1039,
513
+ 1041,
514
+ 1043,
515
+ 1044,
516
+ 1045,
517
+ 1047,
518
+ 1049,
519
+ 1051,
520
+ 1053,
521
+ 1055,
522
+ 1058,
523
+ 1059,
524
+ 1061,
525
+ 1062,
526
+ 1065,
527
+ 1067,
528
+ 1070,
529
+ 1071,
530
+ 1072,
531
+ 1074,
532
+ 1078,
533
+ 1080,
534
+ 1081,
535
+ 1082,
536
+ 1085,
537
+ 1086,
538
+ 1087,
539
+ 1089,
540
+ 1090,
541
+ 1091,
542
+ 1093,
543
+ 1094,
544
+ 1095,
545
+ 1097,
546
+ 1098,
547
+ 1099,
548
+ 1101,
549
+ 1104,
550
+ 1105,
551
+ 1106,
552
+ 1107,
553
+ 1110,
554
+ 1111,
555
+ 1114,
556
+ 1116,
557
+ 1117,
558
+ 1118,
559
+ 1120,
560
+ 1125,
561
+ 1127,
562
+ 1129,
563
+ 1130,
564
+ 1132,
565
+ 1136,
566
+ 1137,
567
+ 1138,
568
+ 1139,
569
+ 1142,
570
+ 1143,
571
+ 1145,
572
+ 1146,
573
+ 1147,
574
+ 1149,
575
+ 1150,
576
+ 1153,
577
+ 1155,
578
+ 1156,
579
+ 1157,
580
+ 1158,
581
+ 1161,
582
+ 1162,
583
+ 1165,
584
+ 1168,
585
+ 1170,
586
+ 1173,
587
+ 1175,
588
+ 1177,
589
+ 1178,
590
+ 1180,
591
+ 1182,
592
+ 1184,
593
+ 1186,
594
+ 1187,
595
+ 1188,
596
+ 1190,
597
+ 1192,
598
+ 1193,
599
+ 1195,
600
+ 1197,
601
+ 1198,
602
+ 1200,
603
+ 1201,
604
+ 1202,
605
+ 1203,
606
+ 1204,
607
+ 1207,
608
+ 1208,
609
+ 1210,
610
+ 1211,
611
+ 1212,
612
+ 1214,
613
+ 1215,
614
+ 1217,
615
+ 1219,
616
+ 1221,
617
+ 1223,
618
+ 1225,
619
+ 1227,
620
+ 1231,
621
+ 1232,
622
+ 1233,
623
+ 1234,
624
+ 1236,
625
+ 1239,
626
+ 1240,
627
+ 1241,
628
+ 1242,
629
+ 1244,
630
+ 1246,
631
+ 1247,
632
+ 1248,
633
+ 1250,
634
+ 1251,
635
+ 1254,
636
+ 1255,
637
+ 1257,
638
+ 1259,
639
+ 1261,
640
+ 1264,
641
+ 1266,
642
+ 1267,
643
+ 1268,
644
+ 1270,
645
+ 1272,
646
+ 1276,
647
+ 1279,
648
+ 1280,
649
+ 1281,
650
+ 1284,
651
+ 1285,
652
+ 1288,
653
+ 1289,
654
+ 1291,
655
+ 1293,
656
+ 1294,
657
+ 1296,
658
+ 937,
659
+ 1297,
660
+ 1299,
661
+ 1302,
662
+ 1303,
663
+ 1304,
664
+ 1305,
665
+ 1308,
666
+ 1311,
667
+ 1313,
668
+ 1314,
669
+ 1317,
670
+ 1318,
671
+ 1319,
672
+ 1321,
673
+ 1324,
674
+ 1325,
675
+ 1327,
676
+ 1328,
677
+ 1330,
678
+ 1332,
679
+ 1334,
680
+ 1335,
681
+ 1337,
682
+ 1338,
683
+ 1340,
684
+ 1343,
685
+ 1345,
686
+ 1347,
687
+ 1349,
688
+ 1351,
689
+ 1352,
690
+ 1355,
691
+ 1356,
692
+ 1358,
693
+ 1360,
694
+ 1362,
695
+ 1364,
696
+ 1365,
697
+ 1367,
698
+ 1370,
699
+ 1371,
700
+ 1372,
701
+ 1374,
702
+ 1376,
703
+ 1378,
704
+ 1379,
705
+ 1380,
706
+ 1381,
707
+ 1383,
708
+ 1384,
709
+ 1385,
710
+ 1387,
711
+ 1388,
712
+ 1392,
713
+ 1393,
714
+ 1394,
715
+ 1395,
716
+ 1396,
717
+ 1397,
718
+ 1398,
719
+ 1399,
720
+ 1401,
721
+ 1402,
722
+ 1403,
723
+ 1404,
724
+ 1406,
725
+ 1409,
726
+ 1411,
727
+ 1412
728
+ ]
729
+ }
lemmatizer/{lookups/lookups.bin → model} RENAMED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:fba5a704a2baddb2914660c0e0bb3fc5c8d7e9099a10cb708896959b7533d917
3
- size 14835061
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5a393555e381c31862f2a2b70cd50ec0243b99f06ac51f4aad84870c89ab23ce
3
+ size 281742
lemmatizer/trees ADDED
Binary file (140 kB).
 
meta.json CHANGED
@@ -1,14 +1,14 @@
1
  {
2
  "lang":"it",
3
  "name":"core_news_sm",
4
- "version":"3.2.0",
5
- "description":"Italian pipeline optimized for CPU. Components: tok2vec, morphologizer, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"CC BY-NC-SA 3.0",
10
- "spacy_version":">=3.2.0,<3.3.0",
11
- "spacy_git_version":"bb26550e2",
12
  "vectors":{
13
  "width":0,
14
  "vectors":0,
@@ -462,15 +462,8 @@
462
  "vocative",
463
  "xcomp"
464
  ],
465
- "senter":[
466
- "I",
467
- "S"
468
- ],
469
  "attribute_ruler":[
470
 
471
- ],
472
- "lemmatizer":[
473
-
474
  ],
475
  "ner":[
476
  "LOC",
@@ -484,8 +477,8 @@
484
  "morphologizer",
485
  "tagger",
486
  "parser",
487
- "attribute_ruler",
488
  "lemmatizer",
 
489
  "ner"
490
  ],
491
  "components":[
@@ -493,9 +486,9 @@
493
  "morphologizer",
494
  "tagger",
495
  "parser",
 
496
  "senter",
497
  "attribute_ruler",
498
- "lemmatizer",
499
  "ner"
500
  ],
501
  "disabled":[
@@ -506,66 +499,66 @@
506
  "token_p":0.9980235379,
507
  "token_r":0.9978442468,
508
  "token_f":0.9979338843,
509
- "pos_acc":0.9694872601,
510
- "morph_acc":0.9687191011,
511
- "morph_micro_p":0.9843827713,
512
- "morph_micro_r":0.9767074615,
513
- "morph_micro_f":0.9805300966,
514
  "morph_per_feat":{
515
  "Gender":{
516
- "p":0.987497438,
517
- "r":0.9836668028,
518
- "f":0.9855783983
519
  },
520
  "Number":{
521
- "p":0.9925686591,
522
- "r":0.9865125241,
523
- "f":0.9895313255
524
  },
525
  "NumType":{
526
- "p":0.9773584906,
527
- "r":0.9557195572,
528
- "f":0.9664179104
529
  },
530
  "Definite":{
531
- "p":0.9917501473,
532
  "r":0.9982206406,
533
- "f":0.9949748744
534
  },
535
  "PronType":{
536
- "p":0.9887687188,
537
  "r":0.9830438379,
538
- "f":0.9858979676
539
  },
540
  "Mood":{
541
- "p":0.960591133,
542
- "r":0.9363745498,
543
- "f":0.9483282675
544
  },
545
  "Person":{
546
- "p":0.97002997,
547
- "r":0.9399806389,
548
- "f":0.9547689282
549
  },
550
  "Tense":{
551
- "p":0.9518486672,
552
- "r":0.9421276596,
553
- "f":0.9469632164
554
  },
555
  "VerbForm":{
556
- "p":0.972027972,
557
- "r":0.9632709633,
558
- "f":0.9676296554
559
  },
560
  "Degree":{
561
- "p":0.875,
562
- "r":0.8235294118,
563
- "f":0.8484848485
564
  },
565
  "Clitic":{
566
- "p":1.0,
567
- "r":0.9465240642,
568
- "f":0.9725274725
569
  },
570
  "Poss":{
571
  "p":1.0,
@@ -579,76 +572,76 @@
579
  },
580
  "Foreign":{
581
  "p":1.0,
582
- "r":0.8,
583
- "f":0.8888888889
584
  }
585
  },
586
- "tag_acc":0.9652631106,
587
- "sents_p":0.9718804921,
588
- "sents_r":0.9804964539,
589
- "sents_f":0.9761694616,
590
- "dep_uas":0.8970414201,
591
- "dep_las":0.8574221765,
592
  "dep_las_per_type":{
593
  "root":{
594
- "p":0.8769771529,
595
- "r":0.884751773,
596
- "f":0.880847308
597
  },
598
  "flat:name":{
599
- "p":0.8757763975,
600
- "r":0.8924050633,
601
- "f":0.8840125392
602
  },
603
  "case":{
604
- "p":0.9739551787,
605
- "r":0.9786975046,
606
- "f":0.9763205829
607
  },
608
  "nmod":{
609
- "p":0.7953394124,
610
- "r":0.8026584867,
611
- "f":0.7989821883
612
  },
613
  "nummod":{
614
- "p":0.8913043478,
615
- "r":0.8913043478,
616
- "f":0.8913043478
617
  },
618
  "det":{
619
- "p":0.9706959707,
620
- "r":0.9742647059,
621
- "f":0.9724770642
622
  },
623
  "nsubj":{
624
- "p":0.8181818182,
625
- "r":0.8166023166,
626
- "f":0.8173913043
627
  },
628
  "aux":{
629
- "p":0.9252336449,
630
- "r":0.9124423963,
631
- "f":0.9187935035
632
  },
633
  "advmod":{
634
- "p":0.8123569794,
635
- "r":0.8142201835,
636
- "f":0.8132875143
637
  },
638
  "obj":{
639
- "p":0.8308823529,
640
- "r":0.8475,
641
- "f":0.8391089109
642
  },
643
  "cc":{
644
- "p":0.9028213166,
645
- "r":0.8834355828,
646
- "f":0.8930232558
647
  },
648
  "conj":{
649
- "p":0.6709183673,
650
- "r":0.7108108108,
651
- "f":0.6902887139
652
  },
653
  "det:predet":{
654
  "p":0.9473684211,
@@ -656,74 +649,74 @@
656
  "f":0.972972973
657
  },
658
  "amod":{
659
- "p":0.9044684129,
660
- "r":0.88005997,
661
- "f":0.8920972644
662
  },
663
  "mark":{
664
- "p":0.8821138211,
665
- "r":0.8966942149,
666
- "f":0.8893442623
667
  },
668
  "cop":{
669
- "p":0.8257575758,
670
- "r":0.8650793651,
671
- "f":0.8449612403
672
  },
673
  "xcomp":{
674
- "p":0.6774193548,
675
- "r":0.65625,
676
- "f":0.6666666667
677
  },
678
  "obl":{
679
- "p":0.7908396947,
680
- "r":0.7573099415,
681
- "f":0.7737117252
682
  },
683
  "acl:relcl":{
684
- "p":0.7259259259,
685
- "r":0.7153284672,
686
- "f":0.7205882353
687
  },
688
  "acl":{
689
- "p":0.6914893617,
690
- "r":0.5652173913,
691
- "f":0.6220095694
692
  },
693
  "ccomp":{
694
- "p":0.6909090909,
695
- "r":0.6129032258,
696
- "f":0.6495726496
697
  },
698
  "expl":{
699
- "p":0.8875,
700
- "r":0.9102564103,
701
- "f":0.8987341772
702
  },
703
  "nsubj:pass":{
704
- "p":0.8955223881,
705
- "r":0.7407407407,
706
- "f":0.8108108108
707
  },
708
  "aux:pass":{
709
- "p":0.8414634146,
710
- "r":0.8734177215,
711
- "f":0.8571428571
712
  },
713
  "parataxis":{
714
- "p":0.2631578947,
715
- "r":0.1612903226,
716
- "f":0.2
717
- },
718
- "advcl":{
719
- "p":0.5350318471,
720
- "r":0.5915492958,
721
- "f":0.5618729097
722
  },
723
  "obl:agent":{
724
- "p":0.7380952381,
725
- "r":0.7380952381,
726
- "f":0.7380952381
 
 
 
 
 
727
  },
728
  "det:poss":{
729
  "p":0.9714285714,
@@ -736,9 +729,9 @@
736
  "f":1.0
737
  },
738
  "appos":{
739
- "p":0.3947368421,
740
- "r":0.3846153846,
741
- "f":0.3896103896
742
  },
743
  "dep":{
744
  "p":0.0,
@@ -746,24 +739,24 @@
746
  "f":0.0
747
  },
748
  "iobj":{
749
- "p":0.7619047619,
750
- "r":0.8,
751
- "f":0.7804878049
752
- },
753
- "compound":{
754
- "p":0.6296296296,
755
- "r":0.6538461538,
756
- "f":0.641509434
757
  },
758
  "expl:impers":{
759
- "p":0.6666666667,
760
- "r":0.75,
761
- "f":0.7058823529
762
  },
763
  "csubj":{
764
- "p":0.5454545455,
765
- "r":0.4615384615,
766
- "f":0.5
 
 
 
 
 
767
  },
768
  "discourse":{
769
  "p":0.0,
@@ -771,14 +764,14 @@
771
  "f":0.0
772
  },
773
  "fixed":{
774
- "p":0.8846153846,
775
- "r":0.8214285714,
776
- "f":0.8518518519
777
  },
778
  "expl:pass":{
779
- "p":0.9,
780
  "r":0.8181818182,
781
- "f":0.8571428571
782
  },
783
  "orphan":{
784
  "p":0.0,
@@ -786,43 +779,43 @@
786
  "f":0.0
787
  },
788
  "flat:foreign":{
789
- "p":1.0,
790
- "r":0.3333333333,
791
- "f":0.5
792
  },
793
  "vocative":{
794
- "p":0.3333333333,
795
  "r":0.3333333333,
796
- "f":0.3333333333
797
  }
798
  },
799
- "lemma_acc":0.8640366643,
800
- "ents_p":0.8556688921,
801
- "ents_r":0.8519161046,
802
- "ents_f":0.8537883746,
803
  "ents_per_type":{
804
  "LOC":{
805
- "p":0.8656947539,
806
- "r":0.8978065022,
807
- "f":0.8814582652
808
  },
809
  "PER":{
810
- "p":0.889485747,
811
- "r":0.8826581379,
812
- "f":0.88605879
813
  },
814
  "MISC":{
815
- "p":0.7636830965,
816
- "r":0.6940093432,
817
- "f":0.7271811114
818
  },
819
  "ORG":{
820
- "p":0.8235586481,
821
- "r":0.7490958409,
822
- "f":0.7845643939
823
  }
824
  },
825
- "speed":10840.1050228952
826
  },
827
  "sources":[
828
  {
@@ -836,12 +829,6 @@
836
  "url":"https://figshare.com/articles/Learning_multilingual_named_entity_recognition_from_Wikipedia/5462500",
837
  "license":"CC BY 4.0",
838
  "author":"Joel Nothman, Nicky Ringland, Will Radford, Tara Murphy, James R Curran"
839
- },
840
- {
841
- "name":"Lemmatization Lists",
842
- "url":"https://github.com/michmech/lemmatization-lists/",
843
- "license":"ODbL",
844
- "author":"Michal M\u011bchura"
845
  }
846
  ],
847
  "requirements":[
 
1
  {
2
  "lang":"it",
3
  "name":"core_news_sm",
4
+ "version":"3.3.0",
5
+ "description":"Italian pipeline optimized for CPU. Components: tok2vec, morphologizer, tagger, parser, lemmatizer (trainable_lemmatizer), senter, ner.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"CC BY-NC-SA 3.0",
10
+ "spacy_version":">=3.3.0.dev0,<3.4.0",
11
+ "spacy_git_version":"849bef2de",
12
  "vectors":{
13
  "width":0,
14
  "vectors":0,
 
462
  "vocative",
463
  "xcomp"
464
  ],
 
 
 
 
465
  "attribute_ruler":[
466
 
 
 
 
467
  ],
468
  "ner":[
469
  "LOC",
 
477
  "morphologizer",
478
  "tagger",
479
  "parser",
 
480
  "lemmatizer",
481
+ "attribute_ruler",
482
  "ner"
483
  ],
484
  "components":[
 
486
  "morphologizer",
487
  "tagger",
488
  "parser",
489
+ "lemmatizer",
490
  "senter",
491
  "attribute_ruler",
 
492
  "ner"
493
  ],
494
  "disabled":[
 
499
  "token_p":0.9980235379,
500
  "token_r":0.9978442468,
501
  "token_f":0.9979338843,
502
+ "pos_acc":0.9697568867,
503
+ "morph_acc":0.9683595506,
504
+ "morph_micro_p":0.9831577901,
505
+ "morph_micro_r":0.9765594157,
506
+ "morph_micro_f":0.9798474946,
507
  "morph_per_feat":{
508
  "Gender":{
509
+ "p":0.9887017256,
510
+ "r":0.982645978,
511
+ "f":0.9856645505
512
  },
513
  "Number":{
514
+ "p":0.992398512,
515
+ "r":0.9852280026,
516
+ "f":0.9888002578
517
  },
518
  "NumType":{
519
+ "p":0.9811320755,
520
+ "r":0.9594095941,
521
+ "f":0.9701492537
522
  },
523
  "Definite":{
524
+ "p":0.992920354,
525
  "r":0.9982206406,
526
+ "f":0.9955634428
527
  },
528
  "PronType":{
529
+ "p":0.9875363523,
530
  "r":0.9830438379,
531
+ "f":0.9852849741
532
  },
533
  "Mood":{
534
+ "p":0.9586374696,
535
+ "r":0.9459783914,
536
+ "f":0.952265861
537
  },
538
  "Person":{
539
+ "p":0.9654491609,
540
+ "r":0.9467570184,
541
+ "f":0.9560117302
542
  },
543
  "Tense":{
544
+ "p":0.9443969204,
545
+ "r":0.9395744681,
546
+ "f":0.9419795222
547
  },
548
  "VerbForm":{
549
+ "p":0.9651810585,
550
+ "r":0.9604989605,
551
+ "f":0.9628343175
552
  },
553
  "Degree":{
554
+ "p":0.8333333333,
555
+ "r":0.8823529412,
556
+ "f":0.8571428571
557
  },
558
  "Clitic":{
559
+ "p":0.9832402235,
560
+ "r":0.9411764706,
561
+ "f":0.9617486339
562
  },
563
  "Poss":{
564
  "p":1.0,
 
572
  },
573
  "Foreign":{
574
  "p":1.0,
575
+ "r":1.0,
576
+ "f":1.0
577
  }
578
  },
579
+ "tag_acc":0.9655792217,
580
+ "sents_p":0.9668411867,
581
+ "sents_r":0.9822695035,
582
+ "sents_f":0.9744942832,
583
+ "dep_uas":0.8962288419,
584
+ "dep_las":0.8560991923,
585
  "dep_las_per_type":{
586
  "root":{
587
+ "p":0.8778359511,
588
+ "r":0.8918439716,
589
+ "f":0.8847845207
590
  },
591
  "flat:name":{
592
+ "p":0.9473684211,
593
+ "r":0.9113924051,
594
+ "f":0.9290322581
595
  },
596
  "case":{
597
+ "p":0.9721718088,
598
+ "r":0.9780888618,
599
+ "f":0.9751213592
600
  },
601
  "nmod":{
602
+ "p":0.7912423625,
603
+ "r":0.7944785276,
604
+ "f":0.7928571429
605
  },
606
  "nummod":{
607
+ "p":0.8888888889,
608
+ "r":0.8695652174,
609
+ "f":0.8791208791
610
  },
611
  "det":{
612
+ "p":0.967063129,
613
+ "r":0.9715073529,
614
+ "f":0.9692801467
615
  },
616
  "nsubj":{
617
+ "p":0.809073724,
618
+ "r":0.8262548263,
619
+ "f":0.817574021
620
  },
621
  "aux":{
622
+ "p":0.9266055046,
623
+ "r":0.930875576,
624
+ "f":0.9287356322
625
  },
626
  "advmod":{
627
+ "p":0.802690583,
628
+ "r":0.8211009174,
629
+ "f":0.8117913832
630
  },
631
  "obj":{
632
+ "p":0.8095238095,
633
+ "r":0.85,
634
+ "f":0.8292682927
635
  },
636
  "cc":{
637
+ "p":0.9034267913,
638
+ "r":0.8895705521,
639
+ "f":0.8964451314
640
  },
641
  "conj":{
642
+ "p":0.6485788114,
643
+ "r":0.6783783784,
644
+ "f":0.6631439894
645
  },
646
  "det:predet":{
647
  "p":0.9473684211,
 
649
  "f":0.972972973
650
  },
651
  "amod":{
652
+ "p":0.8977099237,
653
+ "r":0.8815592204,
654
+ "f":0.8895612708
655
  },
656
  "mark":{
657
+ "p":0.9090909091,
658
+ "r":0.9090909091,
659
+ "f":0.9090909091
660
  },
661
  "cop":{
662
+ "p":0.8307692308,
663
+ "r":0.8571428571,
664
+ "f":0.84375
665
  },
666
  "xcomp":{
667
+ "p":0.6835443038,
668
+ "r":0.5625,
669
+ "f":0.6171428571
670
  },
671
  "obl":{
672
+ "p":0.7736434109,
673
+ "r":0.7295321637,
674
+ "f":0.7509405568
675
  },
676
  "acl:relcl":{
677
+ "p":0.7674418605,
678
+ "r":0.7226277372,
679
+ "f":0.7443609023
680
  },
681
  "acl":{
682
+ "p":0.6347826087,
683
+ "r":0.6347826087,
684
+ "f":0.6347826087
685
  },
686
  "ccomp":{
687
+ "p":0.6842105263,
688
+ "r":0.6290322581,
689
+ "f":0.6554621849
690
  },
691
  "expl":{
692
+ "p":0.8974358974,
693
+ "r":0.8974358974,
694
+ "f":0.8974358974
695
  },
696
  "nsubj:pass":{
697
+ "p":0.8157894737,
698
+ "r":0.7654320988,
699
+ "f":0.7898089172
700
  },
701
  "aux:pass":{
702
+ "p":0.8701298701,
703
+ "r":0.8481012658,
704
+ "f":0.858974359
705
  },
706
  "parataxis":{
707
+ "p":0.4117647059,
708
+ "r":0.2258064516,
709
+ "f":0.2916666667
 
 
 
 
 
710
  },
711
  "obl:agent":{
712
+ "p":0.7727272727,
713
+ "r":0.8095238095,
714
+ "f":0.7906976744
715
+ },
716
+ "advcl":{
717
+ "p":0.5906040268,
718
+ "r":0.6197183099,
719
+ "f":0.6048109966
720
  },
721
  "det:poss":{
722
  "p":0.9714285714,
 
729
  "f":1.0
730
  },
731
  "appos":{
732
+ "p":0.3529411765,
733
+ "r":0.3076923077,
734
+ "f":0.3287671233
735
  },
736
  "dep":{
737
  "p":0.0,
 
739
  "f":0.0
740
  },
741
  "iobj":{
742
+ "p":0.8571428571,
743
+ "r":0.9,
744
+ "f":0.8780487805
 
 
 
 
 
745
  },
746
  "expl:impers":{
747
+ "p":0.7777777778,
748
+ "r":0.875,
749
+ "f":0.8235294118
750
  },
751
  "csubj":{
752
+ "p":0.5833333333,
753
+ "r":0.5384615385,
754
+ "f":0.56
755
+ },
756
+ "compound":{
757
+ "p":0.5925925926,
758
+ "r":0.6153846154,
759
+ "f":0.6037735849
760
  },
761
  "discourse":{
762
  "p":0.0,
 
764
  "f":0.0
765
  },
766
  "fixed":{
767
+ "p":1.0,
768
+ "r":0.7857142857,
769
+ "f":0.88
770
  },
771
  "expl:pass":{
772
+ "p":0.8181818182,
773
  "r":0.8181818182,
774
+ "f":0.8181818182
775
  },
776
  "orphan":{
777
  "p":0.0,
 
779
  "f":0.0
780
  },
781
  "flat:foreign":{
782
+ "p":0.75,
783
+ "r":1.0,
784
+ "f":0.8571428571
785
  },
786
  "vocative":{
787
+ "p":0.5,
788
  "r":0.3333333333,
789
+ "f":0.4
790
  }
791
  },
792
+ "lemma_acc":0.9705248023,
793
+ "ents_p":0.8546443384,
794
+ "ents_r":0.8515457487,
795
+ "ents_f":0.8530922299,
796
  "ents_per_type":{
797
  "LOC":{
798
+ "p":0.8648893588,
799
+ "r":0.8971406189,
800
+ "f":0.8807198339
801
  },
802
  "PER":{
803
+ "p":0.8897909333,
804
+ "r":0.8802416489,
805
+ "f":0.884990532
806
  },
807
  "MISC":{
808
+ "p":0.7606463304,
809
+ "r":0.6920857378,
810
+ "f":0.7247482014
811
  },
812
  "ORG":{
813
+ "p":0.8209137552,
814
+ "r":0.7594936709,
815
+ "f":0.7890102149
816
  }
817
  },
818
+ "speed":10947.1047973316
819
  },
820
  "sources":[
821
  {
 
829
  "url":"https://figshare.com/articles/Learning_multilingual_named_entity_recognition_from_Wikipedia/5462500",
830
  "license":"CC BY 4.0",
831
  "author":"Joel Nothman, Nicky Ringland, Will Radford, Tara Murphy, James R Curran"
 
 
 
 
 
 
832
  }
833
  ],
834
  "requirements":[
morphologizer/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:cc5630b06ba1b76f2c6ece9a27c9b7026e62451a8eb773ad803f0b7354a736e2
3
- size 135802
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d0d8a1386d1da17cbe37f7305c4ca8edcc38a9cbd855e575d40bcd2d23eab5ad
3
+ size 135854
ner/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:733a0d24e05d7ec67ed9055ac0981ff7a0cab5ef58146b7e35bcb3189d714b8a
3
- size 6865402
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6e3dc4370b7f998374f4ef96ab984a1075fc98c03db4200c23929f13814a56d5
3
+ size 6270202
parser/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:708d4f56d516b77492b4cbd4a760f329edcfd60432334a72ecbe64a5c1129d06
3
  size 307688
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:657e8fac826b2a12114595e2f6f730c0a481c66ac1880248416601be883547c6
3
  size 307688
parser/moves CHANGED
@@ -1 +1 @@
1
- ��moves�#{"0":{"":131175},"1":{"":112873},"2":{"case":38562,"det":25835,"nsubj":9091,"advmod":7696,"cc":7544,"punct":6409,"mark":5854,"aux":5614,"amod":4663,"obl":4048,"cop":2625,"aux:pass":2047,"nummod":1979,"det:poss":1677,"expl":1601,"nsubj:pass":1416,"obj":1098,"advcl":962,"nmod":473,"iobj":449,"det:predet":378,"expl:impers":372,"expl:pass":314,"parataxis":90,"vocative":79,"csubj":55,"discourse":48,"acl":40,"dep":0},"3":{"punct":24646,"nmod":21570,"obl":11821,"amod":10763,"conj":9378,"obj":8029,"flat:name":3183,"acl:relcl":2890,"nsubj":2775,"acl":2675,"advcl":2515,"xcomp":2069,"advmod":1947,"ccomp":1334,"nummod":1227,"obl:agent":1027,"appos":827,"nsubj:pass":708,"compound":699,"fixed":662,"cop":575,"flat":532,"parataxis":282,"csubj":243,"flat:foreign":136,"det:poss":63,"dep":0},"4":{"ROOT":13117}}�cfg��neg_key�
 
1
+ ��moves�#{"0":{"":131325},"1":{"":113172},"2":{"case":38597,"det":25851,"nsubj":9089,"advmod":7698,"cc":7550,"punct":6456,"mark":5855,"aux":5614,"amod":4665,"obl":4048,"cop":2623,"aux:pass":2046,"nummod":2027,"det:poss":1677,"expl":1601,"nsubj:pass":1416,"obj":1096,"advcl":962,"nmod":473,"iobj":449,"det:predet":378,"expl:impers":372,"expl:pass":314,"parataxis":90,"vocative":79,"csubj":55,"discourse":48,"acl":40,"dep":0},"3":{"punct":24838,"nmod":21623,"obl":11817,"amod":10759,"conj":9391,"obj":8029,"flat:name":3204,"acl:relcl":2892,"nsubj":2778,"acl":2676,"advcl":2517,"xcomp":2070,"advmod":1949,"ccomp":1334,"nummod":1233,"obl:agent":1023,"appos":827,"nsubj:pass":709,"compound":709,"fixed":664,"cop":575,"flat":535,"parataxis":283,"csubj":243,"flat:foreign":136,"det:poss":63,"dep":0},"4":{"ROOT":13121}}�cfg��neg_key�
senter/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8167f9dab252eccb2a5144d4fee0332e952f42ad963f44c30962aae763476db3
3
- size 197037
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:68440516546aa3bb3ff8837d8a3eab5ffba5f3551eae0563adaddb9ec2b3b609
3
+ size 197089
tagger/cfg CHANGED
@@ -48,5 +48,6 @@
48
  "V_PC_PC",
49
  "X"
50
  ],
 
51
  "overwrite":false
52
  }
 
48
  "V_PC_PC",
49
  "X"
50
  ],
51
+ "neg_prefix":"!",
52
  "overwrite":false
53
  }
tagger/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:236438875d0d32d16fac460ef0e5688a0ef955e452e3692907233cdf901501c3
3
- size 18613
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2dc96a997f2e5b53734ab443fd949e61acdf26f0ea33e6f260f3b24edce3832d
3
+ size 18665
tok2vec/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:36ead0a5b356c3b833c3b37beda51700d05d66e8124f548d72c59ca30e810beb
3
- size 6734429
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:14d85cf1896b6960a311e4e23f4f85040d94d98ffb65e8a49a3694a228c436eb
3
+ size 6139229
tokenizer CHANGED
@@ -1,3 +1,3 @@
1
- ��prefix_search� �^'[0-9][0-9]|^[0-9]+°|^§|^%|^=|^—|^–|^\+(?![0-9])|^…|^……|^,|^:|^;|^\!|^\?|^¿|^؟|^¡|^\(|^\)|^\[|^\]|^\{|^\}|^<|^>|^_|^#|^\*|^&|^。|^?|^!|^,|^、|^;|^:|^~|^·|^।|^،|^۔|^؛|^٪|^\.\.+|^…|^\'|^"|^”|^“|^`|^‘|^´|^’|^‚|^,|^„|^»|^«|^「|^」|^『|^』|^(|^)|^〔|^〕|^【|^】|^《|^》|^〈|^〉|^\$|^£|^€|^¥|^฿|^US\$|^C\$|^A\$|^₽|^﷼|^₴|^₠|^₡|^₢|^₣|^₤|^₥|^₦|^₧|^₨|^₩|^₪|^₫|^€|^₭|^₮|^₯|^₰|^₱|^₲|^₳|^₴|^₵|^₶|^₷|^₸|^₹|^₺|^₻|^₼|^₽|^₾|^₿|^[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\
U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]�suffix_search�2"…$|……$|,$|:$|;$|\!$|\?$|¿$|؟$|¡$|\($|\)$|\[$|\]$|\{$|\}$|<$|>$|_$|#$|\*$|&$|。$|?$|!$|,$|、$|;$|:$|~$|·$|।$|،$|۔$|؛$|٪$|\.\.+$|…$|\'$|"$|”$|“$|`$|‘$|´$|’$|‚$|,$|„$|»$|«$|「$|」$|『$|』$|($|)$|〔$|〕$|【$|】$|《$|》$|〈$|〉$|[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u
3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]$|'s$|'S$|’s$|’S$|—$|–$|(?<=[0-9])\+$|(?<=°[FfCcKk])\.$|(?<=[0-9])(?:\$|£|€|¥|฿|US\$|C\$|A\$|₽|﷼|₴|₠|₡|₢|₣|₤|₥|₦|₧|₨|₩|₪|₫|€|₭|₮|₯|₰|₱|₲|₳|₴|₵|₶|₷|₸|₹|₺|₻|₼|₽|₾|₿)$|(?<=[0-9])(?:km|km²|km³|m|m²|m³|dm|dm²|dm³|cm|cm²|cm³|mm|mm²|mm³|ha|µm|nm|yd|in|ft|kg|g|mg|µg|t|lb|oz|m/s|km/h|kmh|mph|hPa|Pa|mbar|mb|MB|kb|KB|gb|GB|tb|TB|T|G|M|K|%|км|км²|км³|м|м²|м³|дм|дм²|дм³|см|см²|см³|мм|мм²|мм³|нм|кг|г|мг|м/с|км/ч|кПа|Па|мбар|Кб|КБ|кб|Мб|МБ|мб|Гб|ГБ|гб|Тб|ТБ|тбكم|كم²|كم³|م|م²|م³|سم|سم²|سم³|مم|مم²|مم³|كم|غرام|جرام|جم|كغ|ملغ|كوب|اكواب)$|(?<=[0-9a-z\uF
F41-\uFF5A\u00DF-\u00F6\u00F8-\u00FF\u0101\u0103\u0105\u0107\u0109\u010B\u010D\u010F\u0111\u0113\u0115\u0117\u0119\u011B\u011D\u011F\u0121\u0123\u0125\u0127\u0129\u012B\u012D\u012F\u0131\u0133\u0135\u0137\u0138\u013A\u013C\u013E\u0140\u0142\u0144\u0146\u0148\u0149\u014B\u014D\u014F\u0151\u0153\u0155\u0157\u0159\u015B\u015D\u015F\u0161\u0163\u0165\u0167\u0169\u016B\u016D\u016F\u0171\u0173\u0175\u0177\u017A\u017C\u017E\u017F\u0180\u0183\u0185\u0188\u018C\u018D\u0192\u0195\u0199-\u019B\u019E\u01A1\u01A3\u01A5\u01A8\u01AA\u01AB\u01AD\u01B0\u01B4\u01B6\u01B9\u01BA\u01BD-\u01BF\u01C6\u01C9\u01CC\u01CE\u01D0\u01D2\u01D4\u01D6\u01D8\u01DA\u01DC\u01DD\u01DF\u01E1\u01E3\u01E5\u01E7\u01E9\u01EB\u01ED\u01EF\u01F0\u01F3\u01F5\u01F9\u01FB\u01FD\u01FF\u0201\u0203\u0205\u0207\u0209\u020B\u020D\u020F\u0211\u0213\u0215\u0217\u0219\u021B\u021D\u021F\u0221\u0223\u0225\u0227\u0229\u022B\u022D\u022F\u0231\u0233-\u0239\u023C\u023F\u0240\u0242\u0247\u0249\u024B\u024D\u024F\u2C61\u2C65\u2C66\u2C68\u2C6A\u2C6C\u2C71\u2C73\u2C74\u2C76-\u2C7B\uA723\uA725\uA727\uA729\uA72B\uA72D\uA72F-\uA731\uA733\uA735\uA737\uA739\uA73B\uA73D\uA73F\uA741\uA743\uA745\uA747\uA749\uA74B\uA74D\uA74F\uA751\uA753\uA755\uA757\uA759\uA75B\uA75D\uA75F\uA761\uA763\uA765\uA767\uA769\uA76B\uA76D\uA76F\uA771-\uA778\uA77A\uA77C\uA77F\uA781\uA783\uA785\uA787\uA78C\uA78E\uA791\uA793-\uA795\uA797\uA799\uA79B\uA79D\uA79F\uA7A1\uA7A3\uA7A5\uA7A7\uA7A9\uA7AF\uA7B5\uA7B7\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E01\u1E03\u1E05\u1E07\u1E09\u1E0B\u1E0D\u1E0F\u1E11\u1E13\u1E15\u1E17\u1E19\u1E1B\u1E1D\u1E1F\u1E21\u1E23\u1E25\u1E27\u1E29\u1E2B\u1E2D\u1E2F\u1E31\u1E33\u1E35\u1E37\u1E39\u1E3B\u1E3D\u1E3F\u1E41\u1E43\u1E45\u1E47\u1E49\u1E4B\u1E4D\u1E4F\u1E51\u1E53\u1E55\u1E57\u1E59\u1E5B\u1E5D\u1E5F\u1E61\u1E63\u1E65\u1E67\u1E69\u1E6B\u1E6D\u1E6F\u1E71\u1E73\u1E75\u1E77\u1E79\u1E7B\u1E7D\u1E7F\u1E81\u1E83\u1E85\u1E87\u1E89\u1E8B\u1E8D\u1E8F\u1E91\u1E93\u1E95-\u1E9D\u1E9F\u1EA1\u1EA3\u1EA5
\u1EA7\u1EA9\u1EAB\u1EAD\u1EAF\u1EB1\u1EB3\u1EB5\u1EB7\u1EB9\u1EBB\u1EBD\u1EBF\u1EC1\u1EC3\u1EC5\u1EC7\u1EC9\u1ECB\u1ECD\u1ECF\u1ED1\u1ED3\u1ED5\u1ED7\u1ED9\u1EDB\u1EDD\u1EDF\u1EE1\u1EE3\u1EE5\u1EE7\u1EE9\u1EEB\u1EED\u1EEF\u1EF1\u1EF3\u1EF5\u1EF7\u1EF9\u1EFB\u1EFD\u1EFFёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґѓѕјљњќѐѝ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F%²\-\+…|……|,|:|;|\!|\?|¿|؟|¡|\(|\)|\[|\]|\{|\}|<|>|_|#|\*|&|。|?|!|,|、|;|:|~|·|।|،|۔|؛|٪(?:\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉)])\.$|(?<=[A-Z\uFF21-\uFF3A\u00C0-\u00D6\u00D8-\u00DE\u0100\u0102\u0104\u0106\u0108\u010A\u010C\u010E\u0110\u0112\u0114\u0116\u0118\u011A\u011C\u011E\u0120\u0122\u0124\u0126\u0128\u012A\u012C\u012E\u0130\u0132\u0134\u0136\u0139\u013B\u013D\u013F\u0141\u0143\u0145\u0147\u014A\u014C\u014E\u0150\u0152\u0154\u0156\u0158\u015A\u015C\u015E\u0160\u0162\u0164\u0166\u0168\u016A\u016C\u016E\u0170\u0172\u0174\u0176\u0178\u0179\u017B\u017D\u0181\u0182\u0184\u0186\u0187\u0189-\u018B\u018E-\u0191\u0193\u0194\u0196-\u0198\u019C\u019D\u019F\u01A0\u01A2\u01A4\u01A6\u01A7\u01A9\u01AC\u01AE\u01AF\u01B1-\u01B3\u01B5\u01B7\u01B8\u01BC\u01C4\u01C7\u01CA\u01CD\u01CF\u01D1\u01D3\u01D5\u01D7\u01D9\u01DB\u01DE\u01E0\u01E2\u01E4\u01E6\u01E8\u01EA\u01EC\u01EE\u01F1\u01F4\u01F6-\u01F8\u01FA\u01FC\u01FE\u0200\u0202\u0204\u0206\u0208\u020A\u020
C\u020E\u0210\u0212\u0214\u0216\u0218\u021A\u021C\u021E\u0220\u0222\u0224\u0226\u0228\u022A\u022C\u022E\u0230\u0232\u023A\u023B\u023D\u023E\u0241\u0243-\u0246\u0248\u024A\u024C\u024E\u2C60\u2C62-\u2C64\u2C67\u2C69\u2C6B\u2C6D-\u2C70\u2C72\u2C75\u2C7E\u2C7F\uA722\uA724\uA726\uA728\uA72A\uA72C\uA72E\uA732\uA734\uA736\uA738\uA73A\uA73C\uA73E\uA740\uA742\uA744\uA746\uA748\uA74A\uA74C\uA74E\uA750\uA752\uA754\uA756\uA758\uA75A\uA75C\uA75E\uA760\uA762\uA764\uA766\uA768\uA76A\uA76C\uA76E\uA779\uA77B\uA77D\uA77E\uA780\uA782\uA784\uA786\uA78B\uA78D\uA790\uA792\uA796\uA798\uA79A\uA79C\uA79E\uA7A0\uA7A2\uA7A4\uA7A6\uA7A8\uA7AA-\uA7AE\uA7B0-\uA7B4\uA7B6\uA7B8\u1E00\u1E02\u1E04\u1E06\u1E08\u1E0A\u1E0C\u1E0E\u1E10\u1E12\u1E14\u1E16\u1E18\u1E1A\u1E1C\u1E1E\u1E20\u1E22\u1E24\u1E26\u1E28\u1E2A\u1E2C\u1E2E\u1E30\u1E32\u1E34\u1E36\u1E38\u1E3A\u1E3C\u1E3E\u1E40\u1E42\u1E44\u1E46\u1E48\u1E4A\u1E4C\u1E4E\u1E50\u1E52\u1E54\u1E56\u1E58\u1E5A\u1E5C\u1E5E\u1E60\u1E62\u1E64\u1E66\u1E68\u1E6A\u1E6C\u1E6E\u1E70\u1E72\u1E74\u1E76\u1E78\u1E7A\u1E7C\u1E7E\u1E80\u1E82\u1E84\u1E86\u1E88\u1E8A\u1E8C\u1E8E\u1E90\u1E92\u1E94\u1E9E\u1EA0\u1EA2\u1EA4\u1EA6\u1EA8\u1EAA\u1EAC\u1EAE\u1EB0\u1EB2\u1EB4\u1EB6\u1EB8\u1EBA\u1EBC\u1EBE\u1EC0\u1EC2\u1EC4\u1EC6\u1EC8\u1ECA\u1ECC\u1ECE\u1ED0\u1ED2\u1ED4\u1ED6\u1ED8\u1EDA\u1EDC\u1EDE\u1EE0\u1EE2\u1EE4\u1EE6\u1EE8\u1EEA\u1EEC\u1EEE\u1EF0\u1EF2\u1EF4\u1EF6\u1EF8\u1EFA\u1EFC\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002
CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F][A-Z\uFF21-\uFF3A\u00C0-\u00D6\u00D8-\u00DE\u0100\u0102\u0104\u0106\u0108\u010A\u010C\u010E\u0110\u0112\u0114\u0116\u0118\u011A\u011C\u011E\u0120\u0122\u0124\u0126\u0128\u012A\u012C\u012E\u0130\u0132\u0134\u0136\u0139\u013B\u013D\u013F\u0141\u0143\u0145\u0147\u014A\u014C\u014E\u0150\u0152\u0154\u0156\u0158\u015A\u015C\u015E\u0160\u0162\u0164\u0166\u0168\u016A\u016C\u016E\u0170\u0172\u0174\u0176\u0178\u0179\u017B\u017D\u0181\u0182\u0184\u0186\u0187\u0189-\u018B\u018E-\u0191\u0193\u0194\u0196-\u0198\u019C\u019D\u019F\u01A0\u01A2\u01A4\u01A6\u01A7\u01A9\u01AC\u01AE\u01AF\u01B1-\u01B3\u01B5\u01B7\u01B8\u01BC\u01C4\u01C7\u01CA\u01CD\u01CF\u01D1\u01D3\u01D5\u01D7\u01D9\u01DB\u01DE\u01E0\u01E2\u01E4\u01E6\u01E8\u01EA\u01EC\u01EE\u01F1\u01F4\u01F6-\u01F8\u01FA\u01FC\u01FE\u0200\u0202\u0204\u0206\u0208\u020A\u020C\u020E\u0210\u0212\u0214\u0216\u0218\u021A\u021C\u021E\u0220\u0222\u0224\u0226\u0228\u022A\u022C\u022E\u0230\u0232\u023A\u023B\u023D\u023E\u0241\u0243-\u0246\u0248\u024A\u024C\u024E\u2C60\u2C62-\u2C64\u2C67\u2C69\u2C6B\u2C6D-\u2C70\u2C72\u2C75\u2C7E\u2C7F\uA722\uA724\uA726\uA728\uA72A\uA72C\uA72E\uA732\uA734\uA736\uA738\uA73A\uA73C\uA73E\uA740\uA742\uA744\uA746\uA748\uA74A\uA74C\uA74E\uA750\uA752\uA754\uA756\uA758\uA75A\uA75C\uA75E\uA760\uA762\uA764\uA766\uA768\uA76A\uA76C\uA76E\uA779\uA77B\uA77D\uA77E\uA780\uA782\uA784\uA786\uA78B\uA78D\uA790\uA792\uA796\uA798\uA79A\uA79C\uA79E\uA7A0\uA7A2\uA7A4\uA7A6\uA7A8\uA7AA-\uA7AE\uA7B0-\uA7B4\uA7B6\uA7B8\u1E00\u1E02\u1E04\u1E06\u1E08\u1E0A\u1E0C\u1E0E\u1E10\u1E12\u1E14\u1E16\u1E18\u1E1A\u1E1C\u1E1E\u1E20\u1E22\u1E24\u1E26\u1E28\u1E2A\u1E2C\u1E2E\u1E30\u1E32\u1E34\u1E36\u1E38\u1E3A\u1E3C\u1E3E\u1E40\u1E42\u1E44\u1E46\u1E48\u1E4A\u1E4C\u1E4E\u1E50\u1E52\u1E54\u1E56\u1E58\u1E5A\u1E5C\u1E5E\u1E60\u1E62\u1E64\u1E66\u1E68\u1E6A\u1E6C\u1E6E\u1E70\u1E
72\u1E74\u1E76\u1E78\u1E7A\u1E7C\u1E7E\u1E80\u1E82\u1E84\u1E86\u1E88\u1E8A\u1E8C\u1E8E\u1E90\u1E92\u1E94\u1E9E\u1EA0\u1EA2\u1EA4\u1EA6\u1EA8\u1EAA\u1EAC\u1EAE\u1EB0\u1EB2\u1EB4\u1EB6\u1EB8\u1EBA\u1EBC\u1EBE\u1EC0\u1EC2\u1EC4\u1EC6\u1EC8\u1ECA\u1ECC\u1ECE\u1ED0\u1ED2\u1ED4\u1ED6\u1ED8\u1EDA\u1EDC\u1EDE\u1EE0\u1EE2\u1EE4\u1EE6\u1EE8\u1EEA\u1EEC\u1EEE\u1EF0\u1EF2\u1EF4\u1EF6\u1EF8\u1EFA\u1EFC\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])\.$�infix_finditer�N�\.\.+|…|[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2
F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]|(?<=[0-9])[+\-\*^](?=[0-9-])|(?<=[a-z\uFF41-\uFF5A\u00DF-\u00F6\u00F8-\u00FF\u0101\u0103\u0105\u0107\u0109\u010B\u010D\u010F\u0111\u0113\u0115\u0117\u0119\u011B\u011D\u011F\u0121\u0123\u0125\u0127\u0129\u012B\u012D\u012F\u0131\u0133\u0135\u0137\u0138\u013A\u013C\u013E\u0140\u0142\u0144\u0146\u0148\u0149\u014B\u014D\u014F\u0151\u0153\u0155\u0157\u0159\u015B\u015D\u015F\u0161
\u0163\u0165\u0167\u0169\u016B\u016D\u016F\u0171\u0173\u0175\u0177\u017A\u017C\u017E\u017F\u0180\u0183\u0185\u0188\u018C\u018D\u0192\u0195\u0199-\u019B\u019E\u01A1\u01A3\u01A5\u01A8\u01AA\u01AB\u01AD\u01B0\u01B4\u01B6\u01B9\u01BA\u01BD-\u01BF\u01C6\u01C9\u01CC\u01CE\u01D0\u01D2\u01D4\u01D6\u01D8\u01DA\u01DC\u01DD\u01DF\u01E1\u01E3\u01E5\u01E7\u01E9\u01EB\u01ED\u01EF\u01F0\u01F3\u01F5\u01F9\u01FB\u01FD\u01FF\u0201\u0203\u0205\u0207\u0209\u020B\u020D\u020F\u0211\u0213\u0215\u0217\u0219\u021B\u021D\u021F\u0221\u0223\u0225\u0227\u0229\u022B\u022D\u022F\u0231\u0233-\u0239\u023C\u023F\u0240\u0242\u0247\u0249\u024B\u024D\u024F\u2C61\u2C65\u2C66\u2C68\u2C6A\u2C6C\u2C71\u2C73\u2C74\u2C76-\u2C7B\uA723\uA725\uA727\uA729\uA72B\uA72D\uA72F-\uA731\uA733\uA735\uA737\uA739\uA73B\uA73D\uA73F\uA741\uA743\uA745\uA747\uA749\uA74B\uA74D\uA74F\uA751\uA753\uA755\uA757\uA759\uA75B\uA75D\uA75F\uA761\uA763\uA765\uA767\uA769\uA76B\uA76D\uA76F\uA771-\uA778\uA77A\uA77C\uA77F\uA781\uA783\uA785\uA787\uA78C\uA78E\uA791\uA793-\uA795\uA797\uA799\uA79B\uA79D\uA79F\uA7A1\uA7A3\uA7A5\uA7A7\uA7A9\uA7AF\uA7B5\uA7B7\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E01\u1E03\u1E05\u1E07\u1E09\u1E0B\u1E0D\u1E0F\u1E11\u1E13\u1E15\u1E17\u1E19\u1E1B\u1E1D\u1E1F\u1E21\u1E23\u1E25\u1E27\u1E29\u1E2B\u1E2D\u1E2F\u1E31\u1E33\u1E35\u1E37\u1E39\u1E3B\u1E3D\u1E3F\u1E41\u1E43\u1E45\u1E47\u1E49\u1E4B\u1E4D\u1E4F\u1E51\u1E53\u1E55\u1E57\u1E59\u1E5B\u1E5D\u1E5F\u1E61\u1E63\u1E65\u1E67\u1E69\u1E6B\u1E6D\u1E6F\u1E71\u1E73\u1E75\u1E77\u1E79\u1E7B\u1E7D\u1E7F\u1E81\u1E83\u1E85\u1E87\u1E89\u1E8B\u1E8D\u1E8F\u1E91\u1E93\u1E95-\u1E9D\u1E9F\u1EA1\u1EA3\u1EA5\u1EA7\u1EA9\u1EAB\u1EAD\u1EAF\u1EB1\u1EB3\u1EB5\u1EB7\u1EB9\u1EBB\u1EBD\u1EBF\u1EC1\u1EC3\u1EC5\u1EC7\u1EC9\u1ECB\u1ECD\u1ECF\u1ED1\u1ED3\u1ED5\u1ED7\u1ED9\u1EDB\u1EDD\u1EDF\u1EE1\u1EE3\u1EE5\u1EE7\u1EE9\u1EEB\u1EED\u1EEF\u1EF1\u1EF3\u1EF5\u1EF7\u1EF9\u1EFB\u1EFD\u1EFFёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґѓѕјљњќѐѝ\u1200-\u137F\u0980-\u09FF\u0
591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉])\.(?=[A-Z\uFF21-\uFF3A\u00C0-\u00D6\u00D8-\u00DE\u0100\u0102\u0104\u0106\u0108\u010A\u010C\u010E\u0110\u0112\u0114\u0116\u0118\u011A\u011C\u011E\u0120\u0122\u0124\u0126\u0128\u012A\u012C\u012E\u0130\u0132\u0134\u0136\u0139\u013B\u013D\u013F\u0141\u0143\u0145\u0147\u014A\u014C\u014E\u0150\u0152\u0154\u0156\u0158\u015A\u015C\u015E\u0160\u0162\u0164\u0166\u0168\u016A\u016C\u016E\u0170\u0172\u0174\u0176\u0178\u0179\u017B\u017D\u0181\u0182\u0184\u0186\u0187\u0189-\u018B\u018E-\u0191\u0193\u0194\u0196-\u0198\u019C\u019D\u019F\u01A0\u01A2\u01A4\u01A6\u01A7\u01A9\u01AC\u01AE\u01AF\u01B1-\u01B3\u01B5\u01B7\u01B8\u01BC\u01C4\u01C7\u01CA\u01CD\u01CF\u01D1\u01D3\u01D5\u01D7\u01D9\u01DB\u01DE\u01E0\u01E2\u01E4\u01E6\u01E8\u01EA\u01EC\u01EE\u01F1\u01F4\u01F6-\u01F8\u01FA\u01FC\u01FE\u0200\u0202\u0204\u0206\u0208\u020A\u020C\u020E\u0210\u0212\u0214\u0216\u0218\u021A\u021C\u021E\u0220\u0222\u0224\u0226\u0228\u022A\u022C\u022E\u0230\u0232\u023A\u023B\u023D\u023E\u0241\u0243-\u0246\u0248\u024A\u024C\u024E\u2C60\u2C62-\u2C64\u2C67\u2C69\u2C6B\u2C6D-\u2C70\u2C72\u2C75\u2C7E\u2C7F\uA722\uA724\uA726\uA728\uA72A\uA72C\uA72E\uA732\uA734\uA736\uA738\uA73A\uA73C\uA73E\uA740\uA742\uA744\uA746\uA748\uA74A\uA74C\uA74E\uA750\uA752\uA754\uA756\uA758\uA75A\uA75C
\uA75E\uA760\uA762\uA764\uA766\uA768\uA76A\uA76C\uA76E\uA779\uA77B\uA77D\uA77E\uA780\uA782\uA784\uA786\uA78B\uA78D\uA790\uA792\uA796\uA798\uA79A\uA79C\uA79E\uA7A0\uA7A2\uA7A4\uA7A6\uA7A8\uA7AA-\uA7AE\uA7B0-\uA7B4\uA7B6\uA7B8\u1E00\u1E02\u1E04\u1E06\u1E08\u1E0A\u1E0C\u1E0E\u1E10\u1E12\u1E14\u1E16\u1E18\u1E1A\u1E1C\u1E1E\u1E20\u1E22\u1E24\u1E26\u1E28\u1E2A\u1E2C\u1E2E\u1E30\u1E32\u1E34\u1E36\u1E38\u1E3A\u1E3C\u1E3E\u1E40\u1E42\u1E44\u1E46\u1E48\u1E4A\u1E4C\u1E4E\u1E50\u1E52\u1E54\u1E56\u1E58\u1E5A\u1E5C\u1E5E\u1E60\u1E62\u1E64\u1E66\u1E68\u1E6A\u1E6C\u1E6E\u1E70\u1E72\u1E74\u1E76\u1E78\u1E7A\u1E7C\u1E7E\u1E80\u1E82\u1E84\u1E86\u1E88\u1E8A\u1E8C\u1E8E\u1E90\u1E92\u1E94\u1E9E\u1EA0\u1EA2\u1EA4\u1EA6\u1EA8\u1EAA\u1EAC\u1EAE\u1EB0\u1EB2\u1EB4\u1EB6\u1EB8\u1EBA\u1EBC\u1EBE\u1EC0\u1EC2\u1EC4\u1EC6\u1EC8\u1ECA\u1ECC\u1ECE\u1ED0\u1ED2\u1ED4\u1ED6\u1ED8\u1EDA\u1EDC\u1EDE\u1EE0\u1EE2\u1EE4\u1EE6\u1EE8\u1EEA\u1EEC\u1EEE\u1EF0\u1EF2\u1EF4\u1EF6\u1EF8\u1EFA\u1EFC\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB
60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F]),(?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800
-\U0002FA1F])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])(?:-|–|—|--|---|——|~)(?=[a-z\uFF41-\uFF5A\u00DF-\u00F6\u00F8-\u00FF\u0101\u0103\u0105\u0107\u0109\u010B\u010D\u010F\u0111\u0113\u0115\u0117\u0119\u011B\u011D\u011F\u0121\u0123\u0125\u0127\u0129\u012B\u012D\u012F\u0131\u0133\u0135\u0137\u0138\u013A\u013C\u013E\u0140\u0142\u0144\u0146\u0148\u0149\u014B\u014D\u014F\u0151\u0153\u0155\u0157\u0159\u015B\u015D\u015F\u0161\u0163\u0165\u0167\u0169\u016B\u016D\u016F\u0171\u0173\u0175\u0177\u017A\u017C\u017E\u017F\u0180\u0183\u0185\u0188\u018C\u018D\u0192\u0195\u0199-\u019B\u019E\u01A1\u01A3\u01A5\u01A8\u01AA\u01AB\u01AD\u01B0\u01B4\u01B6\u01B9\u01BA\u01BD-\u01BF\u01C6\u01C9\u01CC\u01CE\u01D0\u01D2\u01D4\u01D6\u01D8\u01DA\u01DC\u01DD\u01DF\u01E1\u01E3\u01E5\u01E7\u01E9\u01EB\u01ED\u01EF\u01F0\u01F3\u01F5\u01F9\u01FB\u01FD\u01FF\u0201\u0203\u0205\u0207\u0209\u020B\u020D\u020F\u0211\u0213\u0215\u0217\u0219\u021B\u021D\u021
F\u0221\u0223\u0225\u0227\u0229\u022B\u022D\u022F\u0231\u0233-\u0239\u023C\u023F\u0240\u0242\u0247\u0249\u024B\u024D\u024F\u2C61\u2C65\u2C66\u2C68\u2C6A\u2C6C\u2C71\u2C73\u2C74\u2C76-\u2C7B\uA723\uA725\uA727\uA729\uA72B\uA72D\uA72F-\uA731\uA733\uA735\uA737\uA739\uA73B\uA73D\uA73F\uA741\uA743\uA745\uA747\uA749\uA74B\uA74D\uA74F\uA751\uA753\uA755\uA757\uA759\uA75B\uA75D\uA75F\uA761\uA763\uA765\uA767\uA769\uA76B\uA76D\uA76F\uA771-\uA778\uA77A\uA77C\uA77F\uA781\uA783\uA785\uA787\uA78C\uA78E\uA791\uA793-\uA795\uA797\uA799\uA79B\uA79D\uA79F\uA7A1\uA7A3\uA7A5\uA7A7\uA7A9\uA7AF\uA7B5\uA7B7\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E01\u1E03\u1E05\u1E07\u1E09\u1E0B\u1E0D\u1E0F\u1E11\u1E13\u1E15\u1E17\u1E19\u1E1B\u1E1D\u1E1F\u1E21\u1E23\u1E25\u1E27\u1E29\u1E2B\u1E2D\u1E2F\u1E31\u1E33\u1E35\u1E37\u1E39\u1E3B\u1E3D\u1E3F\u1E41\u1E43\u1E45\u1E47\u1E49\u1E4B\u1E4D\u1E4F\u1E51\u1E53\u1E55\u1E57\u1E59\u1E5B\u1E5D\u1E5F\u1E61\u1E63\u1E65\u1E67\u1E69\u1E6B\u1E6D\u1E6F\u1E71\u1E73\u1E75\u1E77\u1E79\u1E7B\u1E7D\u1E7F\u1E81\u1E83\u1E85\u1E87\u1E89\u1E8B\u1E8D\u1E8F\u1E91\u1E93\u1E95-\u1E9D\u1E9F\u1EA1\u1EA3\u1EA5\u1EA7\u1EA9\u1EAB\u1EAD\u1EAF\u1EB1\u1EB3\u1EB5\u1EB7\u1EB9\u1EBB\u1EBD\u1EBF\u1EC1\u1EC3\u1EC5\u1EC7\u1EC9\u1ECB\u1ECD\u1ECF\u1ED1\u1ED3\u1ED5\u1ED7\u1ED9\u1EDB\u1EDD\u1EDF\u1EE1\u1EE3\u1EE5\u1EE7\u1EE9\u1EEB\u1EED\u1EEF\u1EF1\u1EF3\u1EF5\u1EF7\u1EF9\u1EFB\u1EFD\u1EFFёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґѓѕјљњќѐѝ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B
740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F0-9])[:<>=\/](?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\
u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F]['’])(?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u06
4A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F0-9\"])�token_match��url_match�
2
  ��A�
3
- � ��A� �'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�..��A�..�....��A�....�._.��A�._.�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�8)��A�8)�8-)��A�8-)�8-D��A�8-D�8D��A�8D�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=[��A�=[�=]��A�=]�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o��A�>:o�><(((*>��A�><(((*>�@_@��A�@_@�Art.��A�Art.�Avv.��A�Avv.�C++��A�C++�C.so��A�C.so�Civ.��A�Civ.�Cod.��A�Cod.�Cost.��A�Cost.�E'��A�E'�E’��A�E’�Jr.��A�Jr.�L'art.��A�L'�A�art.�L’art.��A�L’�A�art.�O.O��A�O.O�O.o��A�O.o�O_O��A�O_O�O_o��A�O_o�Proc.��A�Proc.�St.��A�St.�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�[=��A�[=�\")��A�\")�\n��A�\n�\t��A�\t�]=��A�]=�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�a.��A�a.�a.C.��A�a.C.�al.��A�al.�all'art.��A�all'�A�art.�all-path��A�all-path�all’art.��A�all’�A�art.�art.��A�art.�artt.��A�artt.�att.��A�att.�avv.��A�avv.�b.��A�b.�by-pass��A�by-pass�c.��A�c.�c.d.��A�c.d.�c/c��A�c/c�centro-sinistra��A�centro-sinistra�check-up��A�check-up�cm.��A�cm.�col.��A�col.�d.��A�d.�d.C.��A�d.C.�dall'art.��A�dall'�A�art.�dall’art.��A�dall’�A�art.�de"��A�de"�dell'art
.��A�dell'�A�art.�dell’art.��A�dell’�A�art.�distr.��A�distr.�e-mail��A�e-mail�e.��A�e.�e/o��A�e/o�ecc.��A�ecc.�etc.��A�etc.�f.��A�f.�g.��A�g.�h.��A�h.�i.��A�i.�j.��A�j.�k.��A�k.�l'art.��A�l'�A�art.�l.��A�l.�l’art.��A�l’�A�art.�m.��A�m.�n.��A�n.�nell'art.��A�nell'�A�art.�nell’art.��A�nell’�A�art.�nord-est��A�nord-est�n°��A�n°�o.��A�o.�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�p.��A�p.�pag.��A�pag.�po'��A�po'�po’��A�po’�prof.��A�prof.�q.��A�q.�r.��A�r.�s.��A�s.�s.n.c��A�s.n.c�s.p.a.��A�s.p.a.�s.r.l��A�s.r.l�sett.��A�sett.�sett..��A�sett.�A�.�ss.��A�ss.�t.��A�t.�tel.��A�tel.�u.��A�u.�v.��A�v.�v.v��A�v.v�v_v��A�v_v�w.��A�w.�week-end��A�week-end�x.��A�x.�xD��A�xD�xDD��A�xDD�y.��A�y.�z.��A�z.� ��A� C� �¯\(ツ)/¯��A�¯\(ツ)/¯�°C.��A�°�A�C�A�.�°F.��A�°�A�F�A�.�°K.��A�°�A�K�A�.�°c.��A�°�A�c�A�.�°f.��A�°�A�f�A�.�°k.��A�°�A�k�A�.�ä.��A�ä.�ö.��A�ö.�ü.��A�ü.�ಠ_ಠ��A�ಠ_ಠ�ಠ︵ಠ��A�ಠ︵ಠ�—��A�—�’��A�’�’’��A�’’
 
1
+ ��prefix_search� �^'[0-9][0-9]|^[0-9]+°|^§|^%|^=|^—|^–|^\+(?![0-9])|^…|^……|^,|^:|^;|^\!|^\?|^¿|^؟|^¡|^\(|^\)|^\[|^\]|^\{|^\}|^<|^>|^_|^#|^\*|^&|^。|^?|^!|^,|^、|^;|^:|^~|^·|^।|^،|^۔|^؛|^٪|^\.\.+|^…|^\'|^"|^”|^“|^`|^‘|^´|^’|^‚|^,|^„|^»|^«|^「|^」|^『|^』|^(|^)|^〔|^〕|^【|^】|^《|^》|^〈|^〉|^\$|^£|^€|^¥|^฿|^US\$|^C\$|^A\$|^₽|^﷼|^₴|^₠|^₡|^₢|^₣|^₤|^₥|^₦|^₧|^₨|^₩|^₪|^₫|^€|^₭|^₮|^₯|^₰|^₱|^₲|^₳|^₴|^₵|^₶|^₷|^₸|^₹|^₺|^₻|^₼|^₽|^₾|^₿|^[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\
U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]�suffix_search�2y…$|……$|,$|:$|;$|\!$|\?$|¿$|؟$|¡$|\($|\)$|\[$|\]$|\{$|\}$|<$|>$|_$|#$|\*$|&$|。$|?$|!$|,$|、$|;$|:$|~$|·$|।$|،$|۔$|؛$|٪$|\.\.+$|…$|\'$|"$|”$|“$|`$|‘$|´$|’$|‚$|,$|„$|»$|«$|「$|」$|『$|』$|($|)$|〔$|〕$|【$|】$|《$|》$|〈$|〉$|[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u
3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]$|'s$|'S$|’s$|’S$|—$|–$|(?<=[0-9])\+$|(?<=°[FfCcKk])\.$|(?<=[0-9])(?:\$|£|€|¥|฿|US\$|C\$|A\$|₽|﷼|₴|₠|₡|₢|₣|₤|₥|₦|₧|₨|₩|₪|₫|€|₭|₮|₯|₰|₱|₲|₳|₴|₵|₶|₷|₸|₹|₺|₻|₼|₽|₾|₿)$|(?<=[0-9])(?:km|km²|km³|m|m²|m³|dm|dm²|dm³|cm|cm²|cm³|mm|mm²|mm³|ha|µm|nm|yd|in|ft|kg|g|mg|µg|t|lb|oz|m/s|km/h|kmh|mph|hPa|Pa|mbar|mb|MB|kb|KB|gb|GB|tb|TB|T|G|M|K|%|км|км²|км³|м|м²|м³|дм|дм²|дм³|см|см²|см³|мм|мм²|мм³|нм|кг|г|мг|м/с|км/ч|кПа|Па|мбар|Кб|КБ|кб|Мб|МБ|мб|Гб|ГБ|гб|Тб|ТБ|тбكم|كم²|كم³|م|م²|م³|سم|سم²|سم³|مم|مم²|مم³|كم|غرام|جرام|جم|كغ|ملغ|كوب|اكواب)$|(?<=[0-9a-z\uF
F41-\uFF5A\u00DF-\u00F6\u00F8-\u00FF\u0101\u0103\u0105\u0107\u0109\u010B\u010D\u010F\u0111\u0113\u0115\u0117\u0119\u011B\u011D\u011F\u0121\u0123\u0125\u0127\u0129\u012B\u012D\u012F\u0131\u0133\u0135\u0137\u0138\u013A\u013C\u013E\u0140\u0142\u0144\u0146\u0148\u0149\u014B\u014D\u014F\u0151\u0153\u0155\u0157\u0159\u015B\u015D\u015F\u0161\u0163\u0165\u0167\u0169\u016B\u016D\u016F\u0171\u0173\u0175\u0177\u017A\u017C\u017E\u017F\u0180\u0183\u0185\u0188\u018C\u018D\u0192\u0195\u0199-\u019B\u019E\u01A1\u01A3\u01A5\u01A8\u01AA\u01AB\u01AD\u01B0\u01B4\u01B6\u01B9\u01BA\u01BD-\u01BF\u01C6\u01C9\u01CC\u01CE\u01D0\u01D2\u01D4\u01D6\u01D8\u01DA\u01DC\u01DD\u01DF\u01E1\u01E3\u01E5\u01E7\u01E9\u01EB\u01ED\u01EF\u01F0\u01F3\u01F5\u01F9\u01FB\u01FD\u01FF\u0201\u0203\u0205\u0207\u0209\u020B\u020D\u020F\u0211\u0213\u0215\u0217\u0219\u021B\u021D\u021F\u0221\u0223\u0225\u0227\u0229\u022B\u022D\u022F\u0231\u0233-\u0239\u023C\u023F\u0240\u0242\u0247\u0249\u024B\u024D\u024F\u2C61\u2C65\u2C66\u2C68\u2C6A\u2C6C\u2C71\u2C73\u2C74\u2C76-\u2C7B\uA723\uA725\uA727\uA729\uA72B\uA72D\uA72F-\uA731\uA733\uA735\uA737\uA739\uA73B\uA73D\uA73F\uA741\uA743\uA745\uA747\uA749\uA74B\uA74D\uA74F\uA751\uA753\uA755\uA757\uA759\uA75B\uA75D\uA75F\uA761\uA763\uA765\uA767\uA769\uA76B\uA76D\uA76F\uA771-\uA778\uA77A\uA77C\uA77F\uA781\uA783\uA785\uA787\uA78C\uA78E\uA791\uA793-\uA795\uA797\uA799\uA79B\uA79D\uA79F\uA7A1\uA7A3\uA7A5\uA7A7\uA7A9\uA7AF\uA7B5\uA7B7\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E01\u1E03\u1E05\u1E07\u1E09\u1E0B\u1E0D\u1E0F\u1E11\u1E13\u1E15\u1E17\u1E19\u1E1B\u1E1D\u1E1F\u1E21\u1E23\u1E25\u1E27\u1E29\u1E2B\u1E2D\u1E2F\u1E31\u1E33\u1E35\u1E37\u1E39\u1E3B\u1E3D\u1E3F\u1E41\u1E43\u1E45\u1E47\u1E49\u1E4B\u1E4D\u1E4F\u1E51\u1E53\u1E55\u1E57\u1E59\u1E5B\u1E5D\u1E5F\u1E61\u1E63\u1E65\u1E67\u1E69\u1E6B\u1E6D\u1E6F\u1E71\u1E73\u1E75\u1E77\u1E79\u1E7B\u1E7D\u1E7F\u1E81\u1E83\u1E85\u1E87\u1E89\u1E8B\u1E8D\u1E8F\u1E91\u1E93\u1E95-\u1E9D\u1E9F\u1EA1\u1EA3\u1EA5
\u1EA7\u1EA9\u1EAB\u1EAD\u1EAF\u1EB1\u1EB3\u1EB5\u1EB7\u1EB9\u1EBB\u1EBD\u1EBF\u1EC1\u1EC3\u1EC5\u1EC7\u1EC9\u1ECB\u1ECD\u1ECF\u1ED1\u1ED3\u1ED5\u1ED7\u1ED9\u1EDB\u1EDD\u1EDF\u1EE1\u1EE3\u1EE5\u1EE7\u1EE9\u1EEB\u1EED\u1EEF\u1EF1\u1EF3\u1EF5\u1EF7\u1EF9\u1EFB\u1EFD\u1EFFёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґѓѕјљњќѐѝ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F%²\-\+…|……|,|:|;|\!|\?|¿|؟|¡|\(|\)|\[|\]|\{|\}|<|>|_|#|\*|&|。|?|!|,|、|;|:|~|·|।|،|۔|؛|٪(?:\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉)])\.$|(?<=[A-Z\uFF21-\uFF3A\u00C0-\u00D6\u00D8-\u00DE\u0100\u0102\u0104\u0106\u0108\u010A\u010C\u010E\u0110\u0112\u0114\u0116\u0118\u011A\u011C\u011E\u0120\u0122\u0124\u0126\u0128\u012A\u012C\u012E\u0130\u0132\u0134\u0136\u0139\u013B\u013D\u013F\u0141\u0143\u0145\u0147\u014A\u014C\u014E\u0150\u0152\u0154\u0156\u0158\u015A\u015C\u015E\u0160\u0162\u0164\u0166\u0168\u016A\u016C\u016E\u0170\u0172\u0174\u0176\u0178\u0179\u017B\u017D\u0181\u0182\u0184\u0186\u0187\u0189-\u018B\u018E-\u0191\u0193\u0194\u0196-\u0198\u019C\u019D\u019F\u01A0\u01A2\u01A4\u01A6\u01A7\u01A9\u01AC\u01AE\u01AF\u01B1-\u01B3\u01B5\u01B7\u01B8\u01BC\u01C4\u01C7\u01CA\u01CD\u01CF\u01D1\u01D3\u01D5\u01D7\u01D9\u01DB\u01DE\u01E0\u01E2\u01E4\u01E6\u01E8\u01EA\u01EC\u01EE\u01F1\u01F4\u01F6-\u01F8\u01FA\u01FC\u01FE\u0200\u0202\u
0204\u0206\u0208\u020A\u020C\u020E\u0210\u0212\u0214\u0216\u0218\u021A\u021C\u021E\u0220\u0222\u0224\u0226\u0228\u022A\u022C\u022E\u0230\u0232\u023A\u023B\u023D\u023E\u0241\u0243-\u0246\u0248\u024A\u024C\u024E\u2C60\u2C62-\u2C64\u2C67\u2C69\u2C6B\u2C6D-\u2C70\u2C72\u2C75\u2C7E\u2C7F\uA722\uA724\uA726\uA728\uA72A\uA72C\uA72E\uA732\uA734\uA736\uA738\uA73A\uA73C\uA73E\uA740\uA742\uA744\uA746\uA748\uA74A\uA74C\uA74E\uA750\uA752\uA754\uA756\uA758\uA75A\uA75C\uA75E\uA760\uA762\uA764\uA766\uA768\uA76A\uA76C\uA76E\uA779\uA77B\uA77D\uA77E\uA780\uA782\uA784\uA786\uA78B\uA78D\uA790\uA792\uA796\uA798\uA79A\uA79C\uA79E\uA7A0\uA7A2\uA7A4\uA7A6\uA7A8\uA7AA-\uA7AE\uA7B0-\uA7B4\uA7B6\uA7B8\u1E00\u1E02\u1E04\u1E06\u1E08\u1E0A\u1E0C\u1E0E\u1E10\u1E12\u1E14\u1E16\u1E18\u1E1A\u1E1C\u1E1E\u1E20\u1E22\u1E24\u1E26\u1E28\u1E2A\u1E2C\u1E2E\u1E30\u1E32\u1E34\u1E36\u1E38\u1E3A\u1E3C\u1E3E\u1E40\u1E42\u1E44\u1E46\u1E48\u1E4A\u1E4C\u1E4E\u1E50\u1E52\u1E54\u1E56\u1E58\u1E5A\u1E5C\u1E5E\u1E60\u1E62\u1E64\u1E66\u1E68\u1E6A\u1E6C\u1E6E\u1E70\u1E72\u1E74\u1E76\u1E78\u1E7A\u1E7C\u1E7E\u1E80\u1E82\u1E84\u1E86\u1E88\u1E8A\u1E8C\u1E8E\u1E90\u1E92\u1E94\u1E9E\u1EA0\u1EA2\u1EA4\u1EA6\u1EA8\u1EAA\u1EAC\u1EAE\u1EB0\u1EB2\u1EB4\u1EB6\u1EB8\u1EBA\u1EBC\u1EBE\u1EC0\u1EC2\u1EC4\u1EC6\u1EC8\u1ECA\u1ECC\u1ECE\u1ED0\u1ED2\u1ED4\u1ED6\u1ED8\u1EDA\u1EDC\u1EDE\u1EE0\u1EE2\u1EE4\u1EE6\u1EE8\u1EEA\u1EEC\u1EEE\u1EF0\u1EF2\u1EF4\u1EF6\u1EF8\u1EFA\u1EFC\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U000
2A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F][A-Z\uFF21-\uFF3A\u00C0-\u00D6\u00D8-\u00DE\u0100\u0102\u0104\u0106\u0108\u010A\u010C\u010E\u0110\u0112\u0114\u0116\u0118\u011A\u011C\u011E\u0120\u0122\u0124\u0126\u0128\u012A\u012C\u012E\u0130\u0132\u0134\u0136\u0139\u013B\u013D\u013F\u0141\u0143\u0145\u0147\u014A\u014C\u014E\u0150\u0152\u0154\u0156\u0158\u015A\u015C\u015E\u0160\u0162\u0164\u0166\u0168\u016A\u016C\u016E\u0170\u0172\u0174\u0176\u0178\u0179\u017B\u017D\u0181\u0182\u0184\u0186\u0187\u0189-\u018B\u018E-\u0191\u0193\u0194\u0196-\u0198\u019C\u019D\u019F\u01A0\u01A2\u01A4\u01A6\u01A7\u01A9\u01AC\u01AE\u01AF\u01B1-\u01B3\u01B5\u01B7\u01B8\u01BC\u01C4\u01C7\u01CA\u01CD\u01CF\u01D1\u01D3\u01D5\u01D7\u01D9\u01DB\u01DE\u01E0\u01E2\u01E4\u01E6\u01E8\u01EA\u01EC\u01EE\u01F1\u01F4\u01F6-\u01F8\u01FA\u01FC\u01FE\u0200\u0202\u0204\u0206\u0208\u020A\u020C\u020E\u0210\u0212\u0214\u0216\u0218\u021A\u021C\u021E\u0220\u0222\u0224\u0226\u0228\u022A\u022C\u022E\u0230\u0232\u023A\u023B\u023D\u023E\u0241\u0243-\u0246\u0248\u024A\u024C\u024E\u2C60\u2C62-\u2C64\u2C67\u2C69\u2C6B\u2C6D-\u2C70\u2C72\u2C75\u2C7E\u2C7F\uA722\uA724\uA726\uA728\uA72A\uA72C\uA72E\uA732\uA734\uA736\uA738\uA73A\uA73C\uA73E\uA740\uA742\uA744\uA746\uA748\uA74A\uA74C\uA74E\uA750\uA752\uA754\uA756\uA758\uA75A\uA75C\uA75E\uA760\uA762\uA764\uA766\uA768\uA76A\uA76C\uA76E\uA779\uA77B\uA77D\uA77E\uA780\uA782\uA784\uA786\uA78B\uA78D\uA790\uA792\uA796\uA798\uA79A\uA79C\uA79E\uA7A0\uA7A2\uA7A4\uA7A6\uA7A8\uA7AA-\uA7AE\uA7B0-\uA7B4\uA7B6\uA7B8\u1E00\u1E02\u1E04\u1E06\u1E08\u1E0A\u1E0C\u1E0E\u1E10\u1E12\u1E14\u1E16\u1E18\u1E1A\u1E1C\u1E1E\u1E20\u1E22\u1E24\u1E26\u1E28\u1E2A\u1E2C\u1E2E\u1E30\u1E32\u1E34\u1E36\u1E38\u1E3A\u1E3C\u1E3E\u1E40\u1E42\u1E44\u1E46\u1E48\u1E4A\u1E4C\u1E4E\u1E50\u1E52\u1E54\u1E56\u1E58\u1E5A\u1E5C\u1E5E\u1E
60\u1E62\u1E64\u1E66\u1E68\u1E6A\u1E6C\u1E6E\u1E70\u1E72\u1E74\u1E76\u1E78\u1E7A\u1E7C\u1E7E\u1E80\u1E82\u1E84\u1E86\u1E88\u1E8A\u1E8C\u1E8E\u1E90\u1E92\u1E94\u1E9E\u1EA0\u1EA2\u1EA4\u1EA6\u1EA8\u1EAA\u1EAC\u1EAE\u1EB0\u1EB2\u1EB4\u1EB6\u1EB8\u1EBA\u1EBC\u1EBE\u1EC0\u1EC2\u1EC4\u1EC6\u1EC8\u1ECA\u1ECC\u1ECE\u1ED0\u1ED2\u1ED4\u1ED6\u1ED8\u1EDA\u1EDC\u1EDE\u1EE0\u1EE2\u1EE4\u1EE6\u1EE8\u1EEA\u1EEC\u1EEE\u1EF0\u1EF2\u1EF4\u1EF6\u1EF8\u1EFA\u1EFC\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])\.$�infix_finditer�O�\.\.+|…|[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u
25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]|(?<=[0-9])[+\-\*^](?=[0-9-])|(?<=[a-z\uFF41-\uFF5A\u00DF-\u00F6\u00F8-\u00FF\u0101\u0103\u0105\u0107\u0109\u010B\u010D\u010F\u0111\u0113\u0115\u0117\u0119\u011B\u011D\u011F\u0121\u0123\u0125\u0127\u0129\u012B\u012D\u012F\u0131\u0133\u0135\u0137\u0138\u013A\u013C\u013E\u0140\u0142\u0144\u0146\u0
148\u0149\u014B\u014D\u014F\u0151\u0153\u0155\u0157\u0159\u015B\u015D\u015F\u0161\u0163\u0165\u0167\u0169\u016B\u016D\u016F\u0171\u0173\u0175\u0177\u017A\u017C\u017E\u017F\u0180\u0183\u0185\u0188\u018C\u018D\u0192\u0195\u0199-\u019B\u019E\u01A1\u01A3\u01A5\u01A8\u01AA\u01AB\u01AD\u01B0\u01B4\u01B6\u01B9\u01BA\u01BD-\u01BF\u01C6\u01C9\u01CC\u01CE\u01D0\u01D2\u01D4\u01D6\u01D8\u01DA\u01DC\u01DD\u01DF\u01E1\u01E3\u01E5\u01E7\u01E9\u01EB\u01ED\u01EF\u01F0\u01F3\u01F5\u01F9\u01FB\u01FD\u01FF\u0201\u0203\u0205\u0207\u0209\u020B\u020D\u020F\u0211\u0213\u0215\u0217\u0219\u021B\u021D\u021F\u0221\u0223\u0225\u0227\u0229\u022B\u022D\u022F\u0231\u0233-\u0239\u023C\u023F\u0240\u0242\u0247\u0249\u024B\u024D\u024F\u2C61\u2C65\u2C66\u2C68\u2C6A\u2C6C\u2C71\u2C73\u2C74\u2C76-\u2C7B\uA723\uA725\uA727\uA729\uA72B\uA72D\uA72F-\uA731\uA733\uA735\uA737\uA739\uA73B\uA73D\uA73F\uA741\uA743\uA745\uA747\uA749\uA74B\uA74D\uA74F\uA751\uA753\uA755\uA757\uA759\uA75B\uA75D\uA75F\uA761\uA763\uA765\uA767\uA769\uA76B\uA76D\uA76F\uA771-\uA778\uA77A\uA77C\uA77F\uA781\uA783\uA785\uA787\uA78C\uA78E\uA791\uA793-\uA795\uA797\uA799\uA79B\uA79D\uA79F\uA7A1\uA7A3\uA7A5\uA7A7\uA7A9\uA7AF\uA7B5\uA7B7\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E01\u1E03\u1E05\u1E07\u1E09\u1E0B\u1E0D\u1E0F\u1E11\u1E13\u1E15\u1E17\u1E19\u1E1B\u1E1D\u1E1F\u1E21\u1E23\u1E25\u1E27\u1E29\u1E2B\u1E2D\u1E2F\u1E31\u1E33\u1E35\u1E37\u1E39\u1E3B\u1E3D\u1E3F\u1E41\u1E43\u1E45\u1E47\u1E49\u1E4B\u1E4D\u1E4F\u1E51\u1E53\u1E55\u1E57\u1E59\u1E5B\u1E5D\u1E5F\u1E61\u1E63\u1E65\u1E67\u1E69\u1E6B\u1E6D\u1E6F\u1E71\u1E73\u1E75\u1E77\u1E79\u1E7B\u1E7D\u1E7F\u1E81\u1E83\u1E85\u1E87\u1E89\u1E8B\u1E8D\u1E8F\u1E91\u1E93\u1E95-\u1E9D\u1E9F\u1EA1\u1EA3\u1EA5\u1EA7\u1EA9\u1EAB\u1EAD\u1EAF\u1EB1\u1EB3\u1EB5\u1EB7\u1EB9\u1EBB\u1EBD\u1EBF\u1EC1\u1EC3\u1EC5\u1EC7\u1EC9\u1ECB\u1ECD\u1ECF\u1ED1\u1ED3\u1ED5\u1ED7\u1ED9\u1EDB\u1EDD\u1EDF\u1EE1\u1EE3\u1EE5\u1EE7\u1EE9\u1EEB\u1EED\u1EEF\u1EF1\u1EF3\u1EF5\u1EF7\u1EF9\u1
EFB\u1EFD\u1EFFёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґѓѕјљњќѐѝ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉])\.(?=[A-Z\uFF21-\uFF3A\u00C0-\u00D6\u00D8-\u00DE\u0100\u0102\u0104\u0106\u0108\u010A\u010C\u010E\u0110\u0112\u0114\u0116\u0118\u011A\u011C\u011E\u0120\u0122\u0124\u0126\u0128\u012A\u012C\u012E\u0130\u0132\u0134\u0136\u0139\u013B\u013D\u013F\u0141\u0143\u0145\u0147\u014A\u014C\u014E\u0150\u0152\u0154\u0156\u0158\u015A\u015C\u015E\u0160\u0162\u0164\u0166\u0168\u016A\u016C\u016E\u0170\u0172\u0174\u0176\u0178\u0179\u017B\u017D\u0181\u0182\u0184\u0186\u0187\u0189-\u018B\u018E-\u0191\u0193\u0194\u0196-\u0198\u019C\u019D\u019F\u01A0\u01A2\u01A4\u01A6\u01A7\u01A9\u01AC\u01AE\u01AF\u01B1-\u01B3\u01B5\u01B7\u01B8\u01BC\u01C4\u01C7\u01CA\u01CD\u01CF\u01D1\u01D3\u01D5\u01D7\u01D9\u01DB\u01DE\u01E0\u01E2\u01E4\u01E6\u01E8\u01EA\u01EC\u01EE\u01F1\u01F4\u01F6-\u01F8\u01FA\u01FC\u01FE\u0200\u0202\u0204\u0206\u0208\u020A\u020C\u020E\u0210\u0212\u0214\u0216\u0218\u021A\u021C\u021E\u0220\u0222\u0224\u0226\u0228\u022A\u022C\u022E\u0230\u0232\u023A\u023B\u023D\u023E\u0241\u0243-\u0246\u0248\u024A\u024C\u024E\u2C60\u2C62-\u2C64\u2C67\u2C69\u2C6B\u2C6D-\u2C70\u2C72\u2C75\u2C7E\u2C7F\uA722\uA724\uA726\uA728\uA72A\uA72C\uA72E\uA732\uA734\uA736\uA738
\uA73A\uA73C\uA73E\uA740\uA742\uA744\uA746\uA748\uA74A\uA74C\uA74E\uA750\uA752\uA754\uA756\uA758\uA75A\uA75C\uA75E\uA760\uA762\uA764\uA766\uA768\uA76A\uA76C\uA76E\uA779\uA77B\uA77D\uA77E\uA780\uA782\uA784\uA786\uA78B\uA78D\uA790\uA792\uA796\uA798\uA79A\uA79C\uA79E\uA7A0\uA7A2\uA7A4\uA7A6\uA7A8\uA7AA-\uA7AE\uA7B0-\uA7B4\uA7B6\uA7B8\u1E00\u1E02\u1E04\u1E06\u1E08\u1E0A\u1E0C\u1E0E\u1E10\u1E12\u1E14\u1E16\u1E18\u1E1A\u1E1C\u1E1E\u1E20\u1E22\u1E24\u1E26\u1E28\u1E2A\u1E2C\u1E2E\u1E30\u1E32\u1E34\u1E36\u1E38\u1E3A\u1E3C\u1E3E\u1E40\u1E42\u1E44\u1E46\u1E48\u1E4A\u1E4C\u1E4E\u1E50\u1E52\u1E54\u1E56\u1E58\u1E5A\u1E5C\u1E5E\u1E60\u1E62\u1E64\u1E66\u1E68\u1E6A\u1E6C\u1E6E\u1E70\u1E72\u1E74\u1E76\u1E78\u1E7A\u1E7C\u1E7E\u1E80\u1E82\u1E84\u1E86\u1E88\u1E8A\u1E8C\u1E8E\u1E90\u1E92\u1E94\u1E9E\u1EA0\u1EA2\u1EA4\u1EA6\u1EA8\u1EAA\u1EAC\u1EAE\u1EB0\u1EB2\u1EB4\u1EB6\u1EB8\u1EBA\u1EBC\u1EBE\u1EC0\u1EC2\u1EC4\u1EC6\u1EC8\u1ECA\u1ECC\u1ECE\u1ED0\u1ED2\u1ED4\u1ED6\u1ED8\u1EDA\u1EDC\u1EDE\u1EE0\u1EE2\u1EE4\u1EE6\u1EE8\u1EEA\u1EEC\u1EEE\u1EF0\u1EF2\u1EF4\u1EF6\u1EF8\u1EFA\u1EFC\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u01
00-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F]),(?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\
U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])(?:-|–|—|--|---|——|~)(?=[a-z\uFF41-\uFF5A\u00DF-\u00F6\u00F8-\u00FF\u0101\u0103\u0105\u0107\u0109\u010B\u010D\u010F\u0111\u0113\u0115\u0117\u0119\u011B\u011D\u011F\u0121\u0123\u0125\u0127\u0129\u012B\u012D\u012F\u0131\u0133\u0135\u0137\u0138\u013A\u013C\u013E\u0140\u0142\u0144\u0146\u0148\u0149\u014B\u014D\u014F\u0151\u0153\u0155\u0157\u0159\u015B\u015D\u015F\u0161\u0163\u0165\u0167\u0169\u016B\u016D\u016F\u0171\u0173\u0175\u0177\u017A\u017C\u017E\u017F\u0180\u0183\u0185\u0188\u018C\u018D\u0192\u0195\u0199-\u019B\u019E\u01A1\u01A3\u01A5\u01A8\u01AA\u01AB\u01AD\u01B0\u01B4\u01B6\u01B9\u01BA\u01BD-\u01BF\u01C6\u01C9\u01CC\u01CE\u01D0\u01D2\u01D4\u01D
6\u01D8\u01DA\u01DC\u01DD\u01DF\u01E1\u01E3\u01E5\u01E7\u01E9\u01EB\u01ED\u01EF\u01F0\u01F3\u01F5\u01F9\u01FB\u01FD\u01FF\u0201\u0203\u0205\u0207\u0209\u020B\u020D\u020F\u0211\u0213\u0215\u0217\u0219\u021B\u021D\u021F\u0221\u0223\u0225\u0227\u0229\u022B\u022D\u022F\u0231\u0233-\u0239\u023C\u023F\u0240\u0242\u0247\u0249\u024B\u024D\u024F\u2C61\u2C65\u2C66\u2C68\u2C6A\u2C6C\u2C71\u2C73\u2C74\u2C76-\u2C7B\uA723\uA725\uA727\uA729\uA72B\uA72D\uA72F-\uA731\uA733\uA735\uA737\uA739\uA73B\uA73D\uA73F\uA741\uA743\uA745\uA747\uA749\uA74B\uA74D\uA74F\uA751\uA753\uA755\uA757\uA759\uA75B\uA75D\uA75F\uA761\uA763\uA765\uA767\uA769\uA76B\uA76D\uA76F\uA771-\uA778\uA77A\uA77C\uA77F\uA781\uA783\uA785\uA787\uA78C\uA78E\uA791\uA793-\uA795\uA797\uA799\uA79B\uA79D\uA79F\uA7A1\uA7A3\uA7A5\uA7A7\uA7A9\uA7AF\uA7B5\uA7B7\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E01\u1E03\u1E05\u1E07\u1E09\u1E0B\u1E0D\u1E0F\u1E11\u1E13\u1E15\u1E17\u1E19\u1E1B\u1E1D\u1E1F\u1E21\u1E23\u1E25\u1E27\u1E29\u1E2B\u1E2D\u1E2F\u1E31\u1E33\u1E35\u1E37\u1E39\u1E3B\u1E3D\u1E3F\u1E41\u1E43\u1E45\u1E47\u1E49\u1E4B\u1E4D\u1E4F\u1E51\u1E53\u1E55\u1E57\u1E59\u1E5B\u1E5D\u1E5F\u1E61\u1E63\u1E65\u1E67\u1E69\u1E6B\u1E6D\u1E6F\u1E71\u1E73\u1E75\u1E77\u1E79\u1E7B\u1E7D\u1E7F\u1E81\u1E83\u1E85\u1E87\u1E89\u1E8B\u1E8D\u1E8F\u1E91\u1E93\u1E95-\u1E9D\u1E9F\u1EA1\u1EA3\u1EA5\u1EA7\u1EA9\u1EAB\u1EAD\u1EAF\u1EB1\u1EB3\u1EB5\u1EB7\u1EB9\u1EBB\u1EBD\u1EBF\u1EC1\u1EC3\u1EC5\u1EC7\u1EC9\u1ECB\u1ECD\u1ECF\u1ED1\u1ED3\u1ED5\u1ED7\u1ED9\u1EDB\u1EDD\u1EDF\u1EE1\u1EE3\u1EE5\u1EE7\u1EE9\u1EEB\u1EED\u1EEF\u1EF1\u1EF3\u1EF5\u1EF7\u1EF9\u1EFB\u1EFD\u1EFFёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґѓѕјљњќѐѝ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30
FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F0-9])[:<>=\/](?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09F
F\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F]['’])(?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF
\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F0-9\"])�token_match��url_match�
  ��A�
+ � ��A� �'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�..��A�..�....��A�....�._.��A�._.�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�8)��A�8)�8-)��A�8-)�8-D��A�8-D�8D��A�8D�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=[��A�=[�=]��A�=]�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o��A�>:o�><(((*>��A�><(((*>�@_@��A�@_@�Art.��A�Art.�Avv.��A�Avv.�C++��A�C++�C.so��A�C.so�Civ.��A�Civ.�Cod.��A�Cod.�Cost.��A�Cost.�E'��A�E'�E’��A�E’�Jr.��A�Jr.�L'art.��A�L'�A�art.�L’art.��A�L’�A�art.�O.O��A�O.O�O.o��A�O.o�O_O��A�O_O�O_o��A�O_o�Proc.��A�Proc.�St.��A�St.�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�[=��A�[=�\")��A�\")�\n��A�\n�\t��A�\t�]=��A�]=�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�a.��A�a.�a.C.��A�a.C.�al.��A�al.�all'art.��A�all'�A�art.�all-path��A�all-path�all’art.��A�all’�A�art.�art.��A�art.�artt.��A�artt.�att.��A�att.�avv.��A�avv.�b.��A�b.�by-pass��A�by-pass�c.��A�c.�c.d.��A�c.d.�c/c��A�c/c�centro-sinistra��A�centro-sinistra�check-up��A�check-up�cm.��A�cm.�col.��A�col.�d.��A�d.�d.C.��A�d.C.�dall'art.��A�dall'�A�art.�dall’art.��A�dall’�A�art.�de"��A�de"�dell'art
.��A�dell'�A�art.�dell’art.��A�dell’�A�art.�distr.��A�distr.�e-mail��A�e-mail�e.��A�e.�e/o��A�e/o�ecc.��A�ecc.�etc.��A�etc.�f.��A�f.�g.��A�g.�h.��A�h.�i.��A�i.�j.��A�j.�k.��A�k.�l'art.��A�l'�A�art.�l.��A�l.�l’art.��A�l’�A�art.�m.��A�m.�n.��A�n.�nell'art.��A�nell'�A�art.�nell’art.��A�nell’�A�art.�nord-est��A�nord-est�n°��A�n°�o.��A�o.�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�p.��A�p.�pag.��A�pag.�po'��A�po'�po’��A�po’�prof.��A�prof.�q.��A�q.�r.��A�r.�s.��A�s.�s.n.c��A�s.n.c�s.p.a.��A�s.p.a.�s.r.l��A�s.r.l�sett.��A�sett.�sett..��A�sett.�A�.�ss.��A�ss.�t.��A�t.�tel.��A�tel.�u.��A�u.�v.��A�v.�v.v��A�v.v�v_v��A�v_v�w.��A�w.�week-end��A�week-end�x.��A�x.�xD��A�xD�xDD��A�xDD�y.��A�y.�z.��A�z.� ��A� C� �¯\(ツ)/¯��A�¯\(ツ)/¯�°C.��A�°�A�C�A�.�°F.��A�°�A�F�A�.�°K.��A�°�A�K�A�.�°c.��A�°�A�c�A�.�°f.��A�°�A�f�A�.�°k.��A�°�A�k�A�.�ä.��A�ä.�ö.��A�ö.�ü.��A�ü.�ಠ_ಠ��A�ಠ_ಠ�ಠ︵ಠ��A�ಠ︵ಠ�—��A�—�’��A�’�’’��A�’’�faster_heuristics�
vocab/strings.json CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:144bf245328ca1addb12aa8fe04a52f496e7bb3915fe6ff99850567e5f922612
- size 2373277
+ oid sha256:8946ef000eb2a2ccace3d1f523ed465321d511745328b2a6373dd45790c157de
+ size 2392188