adrianeboyd commited on
Commit
67f7854
1 Parent(s): 819d5c6

Update spaCy pipeline

Browse files
LICENSES_SOURCES CHANGED
@@ -438,886 +438,6 @@ Creative Commons may be contacted at creativecommons.org.
438
 
439
 
440
 
441
- # Macedonian Corpus
442
-
443
- * Author: Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska
444
- * URL: https://blog.netcetera.com/macedonian-spacy-f3c85484777f
445
- * License: CC BY-SA 4.0
446
-
447
- ```
448
- Attribution-ShareAlike 4.0 International
449
-
450
- =======================================================================
451
-
452
- Creative Commons Corporation ("Creative Commons") is not a law firm and
453
- does not provide legal services or legal advice. Distribution of
454
- Creative Commons public licenses does not create a lawyer-client or
455
- other relationship. Creative Commons makes its licenses and related
456
- information available on an "as-is" basis. Creative Commons gives no
457
- warranties regarding its licenses, any material licensed under their
458
- terms and conditions, or any related information. Creative Commons
459
- disclaims all liability for damages resulting from their use to the
460
- fullest extent possible.
461
-
462
- Using Creative Commons Public Licenses
463
-
464
- Creative Commons public licenses provide a standard set of terms and
465
- conditions that creators and other rights holders may use to share
466
- original works of authorship and other material subject to copyright
467
- and certain other rights specified in the public license below. The
468
- following considerations are for informational purposes only, are not
469
- exhaustive, and do not form part of our licenses.
470
-
471
- Considerations for licensors: Our public licenses are
472
- intended for use by those authorized to give the public
473
- permission to use material in ways otherwise restricted by
474
- copyright and certain other rights. Our licenses are
475
- irrevocable. Licensors should read and understand the terms
476
- and conditions of the license they choose before applying it.
477
- Licensors should also secure all rights necessary before
478
- applying our licenses so that the public can reuse the
479
- material as expected. Licensors should clearly mark any
480
- material not subject to the license. This includes other CC-
481
- licensed material, or material used under an exception or
482
- limitation to copyright. More considerations for licensors:
483
- wiki.creativecommons.org/Considerations_for_licensors
484
-
485
- Considerations for the public: By using one of our public
486
- licenses, a licensor grants the public permission to use the
487
- licensed material under specified terms and conditions. If
488
- the licensor's permission is not necessary for any reason--for
489
- example, because of any applicable exception or limitation to
490
- copyright--then that use is not regulated by the license. Our
491
- licenses grant only permissions under copyright and certain
492
- other rights that a licensor has authority to grant. Use of
493
- the licensed material may still be restricted for other
494
- reasons, including because others have copyright or other
495
- rights in the material. A licensor may make special requests,
496
- such as asking that all changes be marked or described.
497
- Although not required by our licenses, you are encouraged to
498
- respect those requests where reasonable. More considerations
499
- for the public:
500
- wiki.creativecommons.org/Considerations_for_licensees
501
-
502
- =======================================================================
503
-
504
- Creative Commons Attribution-ShareAlike 4.0 International Public
505
- License
506
-
507
- By exercising the Licensed Rights (defined below), You accept and agree
508
- to be bound by the terms and conditions of this Creative Commons
509
- Attribution-ShareAlike 4.0 International Public License ("Public
510
- License"). To the extent this Public License may be interpreted as a
511
- contract, You are granted the Licensed Rights in consideration of Your
512
- acceptance of these terms and conditions, and the Licensor grants You
513
- such rights in consideration of benefits the Licensor receives from
514
- making the Licensed Material available under these terms and
515
- conditions.
516
-
517
-
518
- Section 1 -- Definitions.
519
-
520
- a. Adapted Material means material subject to Copyright and Similar
521
- Rights that is derived from or based upon the Licensed Material
522
- and in which the Licensed Material is translated, altered,
523
- arranged, transformed, or otherwise modified in a manner requiring
524
- permission under the Copyright and Similar Rights held by the
525
- Licensor. For purposes of this Public License, where the Licensed
526
- Material is a musical work, performance, or sound recording,
527
- Adapted Material is always produced where the Licensed Material is
528
- synched in timed relation with a moving image.
529
-
530
- b. Adapter's License means the license You apply to Your Copyright
531
- and Similar Rights in Your contributions to Adapted Material in
532
- accordance with the terms and conditions of this Public License.
533
-
534
- c. BY-SA Compatible License means a license listed at
535
- creativecommons.org/compatiblelicenses, approved by Creative
536
- Commons as essentially the equivalent of this Public License.
537
-
538
- d. Copyright and Similar Rights means copyright and/or similar rights
539
- closely related to copyright including, without limitation,
540
- performance, broadcast, sound recording, and Sui Generis Database
541
- Rights, without regard to how the rights are labeled or
542
- categorized. For purposes of this Public License, the rights
543
- specified in Section 2(b)(1)-(2) are not Copyright and Similar
544
- Rights.
545
-
546
- e. Effective Technological Measures means those measures that, in the
547
- absence of proper authority, may not be circumvented under laws
548
- fulfilling obligations under Article 11 of the WIPO Copyright
549
- Treaty adopted on December 20, 1996, and/or similar international
550
- agreements.
551
-
552
- f. Exceptions and Limitations means fair use, fair dealing, and/or
553
- any other exception or limitation to Copyright and Similar Rights
554
- that applies to Your use of the Licensed Material.
555
-
556
- g. License Elements means the license attributes listed in the name
557
- of a Creative Commons Public License. The License Elements of this
558
- Public License are Attribution and ShareAlike.
559
-
560
- h. Licensed Material means the artistic or literary work, database,
561
- or other material to which the Licensor applied this Public
562
- License.
563
-
564
- i. Licensed Rights means the rights granted to You subject to the
565
- terms and conditions of this Public License, which are limited to
566
- all Copyright and Similar Rights that apply to Your use of the
567
- Licensed Material and that the Licensor has authority to license.
568
-
569
- j. Licensor means the individual(s) or entity(ies) granting rights
570
- under this Public License.
571
-
572
- k. Share means to provide material to the public by any means or
573
- process that requires permission under the Licensed Rights, such
574
- as reproduction, public display, public performance, distribution,
575
- dissemination, communication, or importation, and to make material
576
- available to the public including in ways that members of the
577
- public may access the material from a place and at a time
578
- individually chosen by them.
579
-
580
- l. Sui Generis Database Rights means rights other than copyright
581
- resulting from Directive 96/9/EC of the European Parliament and of
582
- the Council of 11 March 1996 on the legal protection of databases,
583
- as amended and/or succeeded, as well as other essentially
584
- equivalent rights anywhere in the world.
585
-
586
- m. You means the individual or entity exercising the Licensed Rights
587
- under this Public License. Your has a corresponding meaning.
588
-
589
-
590
- Section 2 -- Scope.
591
-
592
- a. License grant.
593
-
594
- 1. Subject to the terms and conditions of this Public License,
595
- the Licensor hereby grants You a worldwide, royalty-free,
596
- non-sublicensable, non-exclusive, irrevocable license to
597
- exercise the Licensed Rights in the Licensed Material to:
598
-
599
- a. reproduce and Share the Licensed Material, in whole or
600
- in part; and
601
-
602
- b. produce, reproduce, and Share Adapted Material.
603
-
604
- 2. Exceptions and Limitations. For the avoidance of doubt, where
605
- Exceptions and Limitations apply to Your use, this Public
606
- License does not apply, and You do not need to comply with
607
- its terms and conditions.
608
-
609
- 3. Term. The term of this Public License is specified in Section
610
- 6(a).
611
-
612
- 4. Media and formats; technical modifications allowed. The
613
- Licensor authorizes You to exercise the Licensed Rights in
614
- all media and formats whether now known or hereafter created,
615
- and to make technical modifications necessary to do so. The
616
- Licensor waives and/or agrees not to assert any right or
617
- authority to forbid You from making technical modifications
618
- necessary to exercise the Licensed Rights, including
619
- technical modifications necessary to circumvent Effective
620
- Technological Measures. For purposes of this Public License,
621
- simply making modifications authorized by this Section 2(a)
622
- (4) never produces Adapted Material.
623
-
624
- 5. Downstream recipients.
625
-
626
- a. Offer from the Licensor -- Licensed Material. Every
627
- recipient of the Licensed Material automatically
628
- receives an offer from the Licensor to exercise the
629
- Licensed Rights under the terms and conditions of this
630
- Public License.
631
-
632
- b. Additional offer from the Licensor -- Adapted Material.
633
- Every recipient of Adapted Material from You
634
- automatically receives an offer from the Licensor to
635
- exercise the Licensed Rights in the Adapted Material
636
- under the conditions of the Adapter's License You apply.
637
-
638
- c. No downstream restrictions. You may not offer or impose
639
- any additional or different terms or conditions on, or
640
- apply any Effective Technological Measures to, the
641
- Licensed Material if doing so restricts exercise of the
642
- Licensed Rights by any recipient of the Licensed
643
- Material.
644
-
645
- 6. No endorsement. Nothing in this Public License constitutes or
646
- may be construed as permission to assert or imply that You
647
- are, or that Your use of the Licensed Material is, connected
648
- with, or sponsored, endorsed, or granted official status by,
649
- the Licensor or others designated to receive attribution as
650
- provided in Section 3(a)(1)(A)(i).
651
-
652
- b. Other rights.
653
-
654
- 1. Moral rights, such as the right of integrity, are not
655
- licensed under this Public License, nor are publicity,
656
- privacy, and/or other similar personality rights; however, to
657
- the extent possible, the Licensor waives and/or agrees not to
658
- assert any such rights held by the Licensor to the limited
659
- extent necessary to allow You to exercise the Licensed
660
- Rights, but not otherwise.
661
-
662
- 2. Patent and trademark rights are not licensed under this
663
- Public License.
664
-
665
- 3. To the extent possible, the Licensor waives any right to
666
- collect royalties from You for the exercise of the Licensed
667
- Rights, whether directly or through a collecting society
668
- under any voluntary or waivable statutory or compulsory
669
- licensing scheme. In all other cases the Licensor expressly
670
- reserves any right to collect such royalties.
671
-
672
-
673
- Section 3 -- License Conditions.
674
-
675
- Your exercise of the Licensed Rights is expressly made subject to the
676
- following conditions.
677
-
678
- a. Attribution.
679
-
680
- 1. If You Share the Licensed Material (including in modified
681
- form), You must:
682
-
683
- a. retain the following if it is supplied by the Licensor
684
- with the Licensed Material:
685
-
686
- i. identification of the creator(s) of the Licensed
687
- Material and any others designated to receive
688
- attribution, in any reasonable manner requested by
689
- the Licensor (including by pseudonym if
690
- designated);
691
-
692
- ii. a copyright notice;
693
-
694
- iii. a notice that refers to this Public License;
695
-
696
- iv. a notice that refers to the disclaimer of
697
- warranties;
698
-
699
- v. a URI or hyperlink to the Licensed Material to the
700
- extent reasonably practicable;
701
-
702
- b. indicate if You modified the Licensed Material and
703
- retain an indication of any previous modifications; and
704
-
705
- c. indicate the Licensed Material is licensed under this
706
- Public License, and include the text of, or the URI or
707
- hyperlink to, this Public License.
708
-
709
- 2. You may satisfy the conditions in Section 3(a)(1) in any
710
- reasonable manner based on the medium, means, and context in
711
- which You Share the Licensed Material. For example, it may be
712
- reasonable to satisfy the conditions by providing a URI or
713
- hyperlink to a resource that includes the required
714
- information.
715
-
716
- 3. If requested by the Licensor, You must remove any of the
717
- information required by Section 3(a)(1)(A) to the extent
718
- reasonably practicable.
719
-
720
- b. ShareAlike.
721
-
722
- In addition to the conditions in Section 3(a), if You Share
723
- Adapted Material You produce, the following conditions also apply.
724
-
725
- 1. The Adapter's License You apply must be a Creative Commons
726
- license with the same License Elements, this version or
727
- later, or a BY-SA Compatible License.
728
-
729
- 2. You must include the text of, or the URI or hyperlink to, the
730
- Adapter's License You apply. You may satisfy this condition
731
- in any reasonable manner based on the medium, means, and
732
- context in which You Share Adapted Material.
733
-
734
- 3. You may not offer or impose any additional or different terms
735
- or conditions on, or apply any Effective Technological
736
- Measures to, Adapted Material that restrict exercise of the
737
- rights granted under the Adapter's License You apply.
738
-
739
-
740
- Section 4 -- Sui Generis Database Rights.
741
-
742
- Where the Licensed Rights include Sui Generis Database Rights that
743
- apply to Your use of the Licensed Material:
744
-
745
- a. for the avoidance of doubt, Section 2(a)(1) grants You the right
746
- to extract, reuse, reproduce, and Share all or a substantial
747
- portion of the contents of the database;
748
-
749
- b. if You include all or a substantial portion of the database
750
- contents in a database in which You have Sui Generis Database
751
- Rights, then the database in which You have Sui Generis Database
752
- Rights (but not its individual contents) is Adapted Material,
753
-
754
- including for purposes of Section 3(b); and
755
- c. You must comply with the conditions in Section 3(a) if You Share
756
- all or a substantial portion of the contents of the database.
757
-
758
- For the avoidance of doubt, this Section 4 supplements and does not
759
- replace Your obligations under this Public License where the Licensed
760
- Rights include other Copyright and Similar Rights.
761
-
762
-
763
- Section 5 -- Disclaimer of Warranties and Limitation of Liability.
764
-
765
- a. UNLESS OTHERWISE SEPARATELY UNDERTAKEN BY THE LICENSOR, TO THE
766
- EXTENT POSSIBLE, THE LICENSOR OFFERS THE LICENSED MATERIAL AS-IS
767
- AND AS-AVAILABLE, AND MAKES NO REPRESENTATIONS OR WARRANTIES OF
768
- ANY KIND CONCERNING THE LICENSED MATERIAL, WHETHER EXPRESS,
769
- IMPLIED, STATUTORY, OR OTHER. THIS INCLUDES, WITHOUT LIMITATION,
770
- WARRANTIES OF TITLE, MERCHANTABILITY, FITNESS FOR A PARTICULAR
771
- PURPOSE, NON-INFRINGEMENT, ABSENCE OF LATENT OR OTHER DEFECTS,
772
- ACCURACY, OR THE PRESENCE OR ABSENCE OF ERRORS, WHETHER OR NOT
773
- KNOWN OR DISCOVERABLE. WHERE DISCLAIMERS OF WARRANTIES ARE NOT
774
- ALLOWED IN FULL OR IN PART, THIS DISCLAIMER MAY NOT APPLY TO YOU.
775
-
776
- b. TO THE EXTENT POSSIBLE, IN NO EVENT WILL THE LICENSOR BE LIABLE
777
- TO YOU ON ANY LEGAL THEORY (INCLUDING, WITHOUT LIMITATION,
778
- NEGLIGENCE) OR OTHERWISE FOR ANY DIRECT, SPECIAL, INDIRECT,
779
- INCIDENTAL, CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES,
780
- COSTS, EXPENSES, OR DAMAGES ARISING OUT OF THIS PUBLIC LICENSE OR
781
- USE OF THE LICENSED MATERIAL, EVEN IF THE LICENSOR HAS BEEN
782
- ADVISED OF THE POSSIBILITY OF SUCH LOSSES, COSTS, EXPENSES, OR
783
- DAMAGES. WHERE A LIMITATION OF LIABILITY IS NOT ALLOWED IN FULL OR
784
- IN PART, THIS LIMITATION MAY NOT APPLY TO YOU.
785
-
786
- c. The disclaimer of warranties and limitation of liability provided
787
- above shall be interpreted in a manner that, to the extent
788
- possible, most closely approximates an absolute disclaimer and
789
- waiver of all liability.
790
-
791
-
792
- Section 6 -- Term and Termination.
793
-
794
- a. This Public License applies for the term of the Copyright and
795
- Similar Rights licensed here. However, if You fail to comply with
796
- this Public License, then Your rights under this Public License
797
- terminate automatically.
798
-
799
- b. Where Your right to use the Licensed Material has terminated under
800
- Section 6(a), it reinstates:
801
-
802
- 1. automatically as of the date the violation is cured, provided
803
- it is cured within 30 days of Your discovery of the
804
- violation; or
805
-
806
- 2. upon express reinstatement by the Licensor.
807
-
808
- For the avoidance of doubt, this Section 6(b) does not affect any
809
- right the Licensor may have to seek remedies for Your violations
810
- of this Public License.
811
-
812
- c. For the avoidance of doubt, the Licensor may also offer the
813
- Licensed Material under separate terms or conditions or stop
814
- distributing the Licensed Material at any time; however, doing so
815
- will not terminate this Public License.
816
-
817
- d. Sections 1, 5, 6, 7, and 8 survive termination of this Public
818
- License.
819
-
820
-
821
- Section 7 -- Other Terms and Conditions.
822
-
823
- a. The Licensor shall not be bound by any additional or different
824
- terms or conditions communicated by You unless expressly agreed.
825
-
826
- b. Any arrangements, understandings, or agreements regarding the
827
- Licensed Material not stated herein are separate from and
828
- independent of the terms and conditions of this Public License.
829
-
830
-
831
- Section 8 -- Interpretation.
832
-
833
- a. For the avoidance of doubt, this Public License does not, and
834
- shall not be interpreted to, reduce, limit, restrict, or impose
835
- conditions on any use of the Licensed Material that could lawfully
836
- be made without permission under this Public License.
837
-
838
- b. To the extent possible, if any provision of this Public License is
839
- deemed unenforceable, it shall be automatically reformed to the
840
- minimum extent necessary to make it enforceable. If the provision
841
- cannot be reformed, it shall be severed from this Public License
842
- without affecting the enforceability of the remaining terms and
843
- conditions.
844
-
845
- c. No term or condition of this Public License will be waived and no
846
- failure to comply consented to unless expressly agreed to by the
847
- Licensor.
848
-
849
- d. Nothing in this Public License constitutes or may be interpreted
850
- as a limitation upon, or waiver of, any privileges and immunities
851
- that apply to the Licensor or You, including from the legal
852
- processes of any jurisdiction or authority.
853
-
854
-
855
- =======================================================================
856
-
857
- Creative Commons is not a party to its public
858
- licenses. Notwithstanding, Creative Commons may elect to apply one of
859
- its public licenses to material it publishes and in those instances
860
- will be considered the “Licensor.” The text of the Creative Commons
861
- public licenses is dedicated to the public domain under the CC0 Public
862
- Domain Dedication. Except for the limited purpose of indicating that
863
- material is shared under a Creative Commons public license or as
864
- otherwise permitted by the Creative Commons policies published at
865
- creativecommons.org/policies, Creative Commons does not authorize the
866
- use of the trademark "Creative Commons" or any other trademark or logo
867
- of Creative Commons without its prior written consent including,
868
- without limitation, in connection with any unauthorized modifications
869
- to any of its public licenses or any other arrangements,
870
- understandings, or agreements concerning use of licensed material. For
871
- the avoidance of doubt, this paragraph does not form part of the
872
- public licenses.
873
-
874
- Creative Commons may be contacted at creativecommons.org.
875
-
876
- ```
877
-
878
-
879
-
880
-
881
- # Macedonian Corpus
882
-
883
- * Author: Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska
884
- * URL: https://blog.netcetera.com/macedonian-spacy-f3c85484777f
885
- * License: CC BY-SA 4.0
886
-
887
- ```
888
- Attribution-ShareAlike 4.0 International
889
-
890
- =======================================================================
891
-
892
- Creative Commons Corporation ("Creative Commons") is not a law firm and
893
- does not provide legal services or legal advice. Distribution of
894
- Creative Commons public licenses does not create a lawyer-client or
895
- other relationship. Creative Commons makes its licenses and related
896
- information available on an "as-is" basis. Creative Commons gives no
897
- warranties regarding its licenses, any material licensed under their
898
- terms and conditions, or any related information. Creative Commons
899
- disclaims all liability for damages resulting from their use to the
900
- fullest extent possible.
901
-
902
- Using Creative Commons Public Licenses
903
-
904
- Creative Commons public licenses provide a standard set of terms and
905
- conditions that creators and other rights holders may use to share
906
- original works of authorship and other material subject to copyright
907
- and certain other rights specified in the public license below. The
908
- following considerations are for informational purposes only, are not
909
- exhaustive, and do not form part of our licenses.
910
-
911
- Considerations for licensors: Our public licenses are
912
- intended for use by those authorized to give the public
913
- permission to use material in ways otherwise restricted by
914
- copyright and certain other rights. Our licenses are
915
- irrevocable. Licensors should read and understand the terms
916
- and conditions of the license they choose before applying it.
917
- Licensors should also secure all rights necessary before
918
- applying our licenses so that the public can reuse the
919
- material as expected. Licensors should clearly mark any
920
- material not subject to the license. This includes other CC-
921
- licensed material, or material used under an exception or
922
- limitation to copyright. More considerations for licensors:
923
- wiki.creativecommons.org/Considerations_for_licensors
924
-
925
- Considerations for the public: By using one of our public
926
- licenses, a licensor grants the public permission to use the
927
- licensed material under specified terms and conditions. If
928
- the licensor's permission is not necessary for any reason--for
929
- example, because of any applicable exception or limitation to
930
- copyright--then that use is not regulated by the license. Our
931
- licenses grant only permissions under copyright and certain
932
- other rights that a licensor has authority to grant. Use of
933
- the licensed material may still be restricted for other
934
- reasons, including because others have copyright or other
935
- rights in the material. A licensor may make special requests,
936
- such as asking that all changes be marked or described.
937
- Although not required by our licenses, you are encouraged to
938
- respect those requests where reasonable. More considerations
939
- for the public:
940
- wiki.creativecommons.org/Considerations_for_licensees
941
-
942
- =======================================================================
943
-
944
- Creative Commons Attribution-ShareAlike 4.0 International Public
945
- License
946
-
947
- By exercising the Licensed Rights (defined below), You accept and agree
948
- to be bound by the terms and conditions of this Creative Commons
949
- Attribution-ShareAlike 4.0 International Public License ("Public
950
- License"). To the extent this Public License may be interpreted as a
951
- contract, You are granted the Licensed Rights in consideration of Your
952
- acceptance of these terms and conditions, and the Licensor grants You
953
- such rights in consideration of benefits the Licensor receives from
954
- making the Licensed Material available under these terms and
955
- conditions.
956
-
957
-
958
- Section 1 -- Definitions.
959
-
960
- a. Adapted Material means material subject to Copyright and Similar
961
- Rights that is derived from or based upon the Licensed Material
962
- and in which the Licensed Material is translated, altered,
963
- arranged, transformed, or otherwise modified in a manner requiring
964
- permission under the Copyright and Similar Rights held by the
965
- Licensor. For purposes of this Public License, where the Licensed
966
- Material is a musical work, performance, or sound recording,
967
- Adapted Material is always produced where the Licensed Material is
968
- synched in timed relation with a moving image.
969
-
970
- b. Adapter's License means the license You apply to Your Copyright
971
- and Similar Rights in Your contributions to Adapted Material in
972
- accordance with the terms and conditions of this Public License.
973
-
974
- c. BY-SA Compatible License means a license listed at
975
- creativecommons.org/compatiblelicenses, approved by Creative
976
- Commons as essentially the equivalent of this Public License.
977
-
978
- d. Copyright and Similar Rights means copyright and/or similar rights
979
- closely related to copyright including, without limitation,
980
- performance, broadcast, sound recording, and Sui Generis Database
981
- Rights, without regard to how the rights are labeled or
982
- categorized. For purposes of this Public License, the rights
983
- specified in Section 2(b)(1)-(2) are not Copyright and Similar
984
- Rights.
985
-
986
- e. Effective Technological Measures means those measures that, in the
987
- absence of proper authority, may not be circumvented under laws
988
- fulfilling obligations under Article 11 of the WIPO Copyright
989
- Treaty adopted on December 20, 1996, and/or similar international
990
- agreements.
991
-
992
- f. Exceptions and Limitations means fair use, fair dealing, and/or
993
- any other exception or limitation to Copyright and Similar Rights
994
- that applies to Your use of the Licensed Material.
995
-
996
- g. License Elements means the license attributes listed in the name
997
- of a Creative Commons Public License. The License Elements of this
998
- Public License are Attribution and ShareAlike.
999
-
1000
- h. Licensed Material means the artistic or literary work, database,
1001
- or other material to which the Licensor applied this Public
1002
- License.
1003
-
1004
- i. Licensed Rights means the rights granted to You subject to the
1005
- terms and conditions of this Public License, which are limited to
1006
- all Copyright and Similar Rights that apply to Your use of the
1007
- Licensed Material and that the Licensor has authority to license.
1008
-
1009
- j. Licensor means the individual(s) or entity(ies) granting rights
1010
- under this Public License.
1011
-
1012
- k. Share means to provide material to the public by any means or
1013
- process that requires permission under the Licensed Rights, such
1014
- as reproduction, public display, public performance, distribution,
1015
- dissemination, communication, or importation, and to make material
1016
- available to the public including in ways that members of the
1017
- public may access the material from a place and at a time
1018
- individually chosen by them.
1019
-
1020
- l. Sui Generis Database Rights means rights other than copyright
1021
- resulting from Directive 96/9/EC of the European Parliament and of
1022
- the Council of 11 March 1996 on the legal protection of databases,
1023
- as amended and/or succeeded, as well as other essentially
1024
- equivalent rights anywhere in the world.
1025
-
1026
- m. You means the individual or entity exercising the Licensed Rights
1027
- under this Public License. Your has a corresponding meaning.
1028
-
1029
-
1030
- Section 2 -- Scope.
1031
-
1032
- a. License grant.
1033
-
1034
- 1. Subject to the terms and conditions of this Public License,
1035
- the Licensor hereby grants You a worldwide, royalty-free,
1036
- non-sublicensable, non-exclusive, irrevocable license to
1037
- exercise the Licensed Rights in the Licensed Material to:
1038
-
1039
- a. reproduce and Share the Licensed Material, in whole or
1040
- in part; and
1041
-
1042
- b. produce, reproduce, and Share Adapted Material.
1043
-
1044
- 2. Exceptions and Limitations. For the avoidance of doubt, where
1045
- Exceptions and Limitations apply to Your use, this Public
1046
- License does not apply, and You do not need to comply with
1047
- its terms and conditions.
1048
-
1049
- 3. Term. The term of this Public License is specified in Section
1050
- 6(a).
1051
-
1052
- 4. Media and formats; technical modifications allowed. The
1053
- Licensor authorizes You to exercise the Licensed Rights in
1054
- all media and formats whether now known or hereafter created,
1055
- and to make technical modifications necessary to do so. The
1056
- Licensor waives and/or agrees not to assert any right or
1057
- authority to forbid You from making technical modifications
1058
- necessary to exercise the Licensed Rights, including
1059
- technical modifications necessary to circumvent Effective
1060
- Technological Measures. For purposes of this Public License,
1061
- simply making modifications authorized by this Section 2(a)
1062
- (4) never produces Adapted Material.
1063
-
1064
- 5. Downstream recipients.
1065
-
1066
- a. Offer from the Licensor -- Licensed Material. Every
1067
- recipient of the Licensed Material automatically
1068
- receives an offer from the Licensor to exercise the
1069
- Licensed Rights under the terms and conditions of this
1070
- Public License.
1071
-
1072
- b. Additional offer from the Licensor -- Adapted Material.
1073
- Every recipient of Adapted Material from You
1074
- automatically receives an offer from the Licensor to
1075
- exercise the Licensed Rights in the Adapted Material
1076
- under the conditions of the Adapter's License You apply.
1077
-
1078
- c. No downstream restrictions. You may not offer or impose
1079
- any additional or different terms or conditions on, or
1080
- apply any Effective Technological Measures to, the
1081
- Licensed Material if doing so restricts exercise of the
1082
- Licensed Rights by any recipient of the Licensed
1083
- Material.
1084
-
1085
- 6. No endorsement. Nothing in this Public License constitutes or
1086
- may be construed as permission to assert or imply that You
1087
- are, or that Your use of the Licensed Material is, connected
1088
- with, or sponsored, endorsed, or granted official status by,
1089
- the Licensor or others designated to receive attribution as
1090
- provided in Section 3(a)(1)(A)(i).
1091
-
1092
- b. Other rights.
1093
-
1094
- 1. Moral rights, such as the right of integrity, are not
1095
- licensed under this Public License, nor are publicity,
1096
- privacy, and/or other similar personality rights; however, to
1097
- the extent possible, the Licensor waives and/or agrees not to
1098
- assert any such rights held by the Licensor to the limited
1099
- extent necessary to allow You to exercise the Licensed
1100
- Rights, but not otherwise.
1101
-
1102
- 2. Patent and trademark rights are not licensed under this
1103
- Public License.
1104
-
1105
- 3. To the extent possible, the Licensor waives any right to
1106
- collect royalties from You for the exercise of the Licensed
1107
- Rights, whether directly or through a collecting society
1108
- under any voluntary or waivable statutory or compulsory
1109
- licensing scheme. In all other cases the Licensor expressly
1110
- reserves any right to collect such royalties.
1111
-
1112
-
1113
- Section 3 -- License Conditions.
1114
-
1115
- Your exercise of the Licensed Rights is expressly made subject to the
1116
- following conditions.
1117
-
1118
- a. Attribution.
1119
-
1120
- 1. If You Share the Licensed Material (including in modified
1121
- form), You must:
1122
-
1123
- a. retain the following if it is supplied by the Licensor
1124
- with the Licensed Material:
1125
-
1126
- i. identification of the creator(s) of the Licensed
1127
- Material and any others designated to receive
1128
- attribution, in any reasonable manner requested by
1129
- the Licensor (including by pseudonym if
1130
- designated);
1131
-
1132
- ii. a copyright notice;
1133
-
1134
- iii. a notice that refers to this Public License;
1135
-
1136
- iv. a notice that refers to the disclaimer of
1137
- warranties;
1138
-
1139
- v. a URI or hyperlink to the Licensed Material to the
1140
- extent reasonably practicable;
1141
-
1142
- b. indicate if You modified the Licensed Material and
1143
- retain an indication of any previous modifications; and
1144
-
1145
- c. indicate the Licensed Material is licensed under this
1146
- Public License, and include the text of, or the URI or
1147
- hyperlink to, this Public License.
1148
-
1149
- 2. You may satisfy the conditions in Section 3(a)(1) in any
1150
- reasonable manner based on the medium, means, and context in
1151
- which You Share the Licensed Material. For example, it may be
1152
- reasonable to satisfy the conditions by providing a URI or
1153
- hyperlink to a resource that includes the required
1154
- information.
1155
-
1156
- 3. If requested by the Licensor, You must remove any of the
1157
- information required by Section 3(a)(1)(A) to the extent
1158
- reasonably practicable.
1159
-
1160
- b. ShareAlike.
1161
-
1162
- In addition to the conditions in Section 3(a), if You Share
1163
- Adapted Material You produce, the following conditions also apply.
1164
-
1165
- 1. The Adapter's License You apply must be a Creative Commons
1166
- license with the same License Elements, this version or
1167
- later, or a BY-SA Compatible License.
1168
-
1169
- 2. You must include the text of, or the URI or hyperlink to, the
1170
- Adapter's License You apply. You may satisfy this condition
1171
- in any reasonable manner based on the medium, means, and
1172
- context in which You Share Adapted Material.
1173
-
1174
- 3. You may not offer or impose any additional or different terms
1175
- or conditions on, or apply any Effective Technological
1176
- Measures to, Adapted Material that restrict exercise of the
1177
- rights granted under the Adapter's License You apply.
1178
-
1179
-
1180
- Section 4 -- Sui Generis Database Rights.
1181
-
1182
- Where the Licensed Rights include Sui Generis Database Rights that
1183
- apply to Your use of the Licensed Material:
1184
-
1185
- a. for the avoidance of doubt, Section 2(a)(1) grants You the right
1186
- to extract, reuse, reproduce, and Share all or a substantial
1187
- portion of the contents of the database;
1188
-
1189
- b. if You include all or a substantial portion of the database
1190
- contents in a database in which You have Sui Generis Database
1191
- Rights, then the database in which You have Sui Generis Database
1192
- Rights (but not its individual contents) is Adapted Material,
1193
-
1194
- including for purposes of Section 3(b); and
1195
- c. You must comply with the conditions in Section 3(a) if You Share
1196
- all or a substantial portion of the contents of the database.
1197
-
1198
- For the avoidance of doubt, this Section 4 supplements and does not
1199
- replace Your obligations under this Public License where the Licensed
1200
- Rights include other Copyright and Similar Rights.
1201
-
1202
-
1203
- Section 5 -- Disclaimer of Warranties and Limitation of Liability.
1204
-
1205
- a. UNLESS OTHERWISE SEPARATELY UNDERTAKEN BY THE LICENSOR, TO THE
1206
- EXTENT POSSIBLE, THE LICENSOR OFFERS THE LICENSED MATERIAL AS-IS
1207
- AND AS-AVAILABLE, AND MAKES NO REPRESENTATIONS OR WARRANTIES OF
1208
- ANY KIND CONCERNING THE LICENSED MATERIAL, WHETHER EXPRESS,
1209
- IMPLIED, STATUTORY, OR OTHER. THIS INCLUDES, WITHOUT LIMITATION,
1210
- WARRANTIES OF TITLE, MERCHANTABILITY, FITNESS FOR A PARTICULAR
1211
- PURPOSE, NON-INFRINGEMENT, ABSENCE OF LATENT OR OTHER DEFECTS,
1212
- ACCURACY, OR THE PRESENCE OR ABSENCE OF ERRORS, WHETHER OR NOT
1213
- KNOWN OR DISCOVERABLE. WHERE DISCLAIMERS OF WARRANTIES ARE NOT
1214
- ALLOWED IN FULL OR IN PART, THIS DISCLAIMER MAY NOT APPLY TO YOU.
1215
-
1216
- b. TO THE EXTENT POSSIBLE, IN NO EVENT WILL THE LICENSOR BE LIABLE
1217
- TO YOU ON ANY LEGAL THEORY (INCLUDING, WITHOUT LIMITATION,
1218
- NEGLIGENCE) OR OTHERWISE FOR ANY DIRECT, SPECIAL, INDIRECT,
1219
- INCIDENTAL, CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES,
1220
- COSTS, EXPENSES, OR DAMAGES ARISING OUT OF THIS PUBLIC LICENSE OR
1221
- USE OF THE LICENSED MATERIAL, EVEN IF THE LICENSOR HAS BEEN
1222
- ADVISED OF THE POSSIBILITY OF SUCH LOSSES, COSTS, EXPENSES, OR
1223
- DAMAGES. WHERE A LIMITATION OF LIABILITY IS NOT ALLOWED IN FULL OR
1224
- IN PART, THIS LIMITATION MAY NOT APPLY TO YOU.
1225
-
1226
- c. The disclaimer of warranties and limitation of liability provided
1227
- above shall be interpreted in a manner that, to the extent
1228
- possible, most closely approximates an absolute disclaimer and
1229
- waiver of all liability.
1230
-
1231
-
1232
- Section 6 -- Term and Termination.
1233
-
1234
- a. This Public License applies for the term of the Copyright and
1235
- Similar Rights licensed here. However, if You fail to comply with
1236
- this Public License, then Your rights under this Public License
1237
- terminate automatically.
1238
-
1239
- b. Where Your right to use the Licensed Material has terminated under
1240
- Section 6(a), it reinstates:
1241
-
1242
- 1. automatically as of the date the violation is cured, provided
1243
- it is cured within 30 days of Your discovery of the
1244
- violation; or
1245
-
1246
- 2. upon express reinstatement by the Licensor.
1247
-
1248
- For the avoidance of doubt, this Section 6(b) does not affect any
1249
- right the Licensor may have to seek remedies for Your violations
1250
- of this Public License.
1251
-
1252
- c. For the avoidance of doubt, the Licensor may also offer the
1253
- Licensed Material under separate terms or conditions or stop
1254
- distributing the Licensed Material at any time; however, doing so
1255
- will not terminate this Public License.
1256
-
1257
- d. Sections 1, 5, 6, 7, and 8 survive termination of this Public
1258
- License.
1259
-
1260
-
1261
- Section 7 -- Other Terms and Conditions.
1262
-
1263
- a. The Licensor shall not be bound by any additional or different
1264
- terms or conditions communicated by You unless expressly agreed.
1265
-
1266
- b. Any arrangements, understandings, or agreements regarding the
1267
- Licensed Material not stated herein are separate from and
1268
- independent of the terms and conditions of this Public License.
1269
-
1270
-
1271
- Section 8 -- Interpretation.
1272
-
1273
- a. For the avoidance of doubt, this Public License does not, and
1274
- shall not be interpreted to, reduce, limit, restrict, or impose
1275
- conditions on any use of the Licensed Material that could lawfully
1276
- be made without permission under this Public License.
1277
-
1278
- b. To the extent possible, if any provision of this Public License is
1279
- deemed unenforceable, it shall be automatically reformed to the
1280
- minimum extent necessary to make it enforceable. If the provision
1281
- cannot be reformed, it shall be severed from this Public License
1282
- without affecting the enforceability of the remaining terms and
1283
- conditions.
1284
-
1285
- c. No term or condition of this Public License will be waived and no
1286
- failure to comply consented to unless expressly agreed to by the
1287
- Licensor.
1288
-
1289
- d. Nothing in this Public License constitutes or may be interpreted
1290
- as a limitation upon, or waiver of, any privileges and immunities
1291
- that apply to the Licensor or You, including from the legal
1292
- processes of any jurisdiction or authority.
1293
-
1294
-
1295
- =======================================================================
1296
-
1297
- Creative Commons is not a party to its public
1298
- licenses. Notwithstanding, Creative Commons may elect to apply one of
1299
- its public licenses to material it publishes and in those instances
1300
- will be considered the “Licensor.” The text of the Creative Commons
1301
- public licenses is dedicated to the public domain under the CC0 Public
1302
- Domain Dedication. Except for the limited purpose of indicating that
1303
- material is shared under a Creative Commons public license or as
1304
- otherwise permitted by the Creative Commons policies published at
1305
- creativecommons.org/policies, Creative Commons does not authorize the
1306
- use of the trademark "Creative Commons" or any other trademark or logo
1307
- of Creative Commons without its prior written consent including,
1308
- without limitation, in connection with any unauthorized modifications
1309
- to any of its public licenses or any other arrangements,
1310
- understandings, or agreements concerning use of licensed material. For
1311
- the avoidance of doubt, this paragraph does not form part of the
1312
- public licenses.
1313
-
1314
- Creative Commons may be contacted at creativecommons.org.
1315
-
1316
- ```
1317
-
1318
-
1319
-
1320
-
1321
  # spaCy lookups data
1322
 
1323
  * Author: Explosion
438
 
439
 
440
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
441
  # spaCy lookups data
442
 
443
  * Author: Explosion
README.md CHANGED
@@ -14,41 +14,41 @@ model-index:
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
- value: 0.7344978166
18
  - name: NER Recall
19
  type: recall
20
- value: 0.7157446809
21
  - name: NER F Score
22
  type: f_score
23
- value: 0.725
24
  - task:
25
  name: POS
26
  type: token-classification
27
  metrics:
28
  - name: POS (UPOS) Accuracy
29
  type: accuracy
30
- value: 0.9190720259
31
  - task:
32
  name: UNLABELED_DEPENDENCIES
33
  type: token-classification
34
  metrics:
35
  - name: Unlabeled Attachment Score (UAS)
36
  type: f_score
37
- value: 0.6417322835
38
  - task:
39
  name: LABELED_DEPENDENCIES
40
  type: token-classification
41
  metrics:
42
  - name: Labeled Attachment Score (LAS)
43
  type: f_score
44
- value: 0.4625984252
45
  - task:
46
  name: SENTS
47
  type: token-classification
48
  metrics:
49
  - name: Sentences F-Score
50
  type: f_score
51
- value: 0.6623376623
52
  ---
53
  ### Details: https://spacy.io/models/mk#mk_core_news_sm
54
 
@@ -57,12 +57,12 @@ Macedonian pipeline optimized for CPU. Components: tok2vec, morphologizer, parse
57
  | Feature | Description |
58
  | --- | --- |
59
  | **Name** | `mk_core_news_sm` |
60
- | **Version** | `3.3.0` |
61
- | **spaCy** | `>=3.3.0.dev0,<3.4.0` |
62
  | **Default Pipeline** | `morphologizer`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
63
  | **Components** | `morphologizer`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
64
  | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
65
- | **Sources** | [Macedonian Corpus](https://blog.netcetera.com/macedonian-spacy-f3c85484777f) (Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska)<br />[Macedonian Corpus](https://blog.netcetera.com/macedonian-spacy-f3c85484777f) (Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska)<br />[Macedonian Corpus](https://blog.netcetera.com/macedonian-spacy-f3c85484777f) (Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska)<br />[spaCy lookups data](https://github.com/explosion/spacy-lookups-data) (Explosion) |
66
  | **License** | `CC BY-SA 4.0` |
67
  | **Author** | [Explosion](https://explosion.ai) |
68
 
@@ -70,11 +70,11 @@ Macedonian pipeline optimized for CPU. Components: tok2vec, morphologizer, parse
70
 
71
  <details>
72
 
73
- <summary>View label scheme (53 labels for 3 components)</summary>
74
 
75
  | Component | Labels |
76
  | --- | --- |
77
- | **`morphologizer`** | `POS=PROPN`, `POS=AUX`, `POS=ADJ`, `POS=NOUN`, `POS=ADP`, `POS=PUNCT`, `POS=CONJ`, `POS=NUM`, `POS=VERB`, `POS=PRON`, `POS=ADV`, `POS=SCONJ`, `POS=PART`, `POS=SYM`, `POS=X`, `_`, `POS=INTJ` |
78
  | **`parser`** | `ROOT`, `advmod`, `att`, `aux`, `cc`, `dep`, `det`, `dobj`, `iobj`, `neg`, `nsubj`, `pobj`, `poss`, `pozm`, `pozv`, `prep`, `punct`, `relcl` |
79
  | **`ner`** | `CARDINAL`, `DATE`, `EVENT`, `FAC`, `GPE`, `LANGUAGE`, `LAW`, `LOC`, `MONEY`, `NORP`, `ORDINAL`, `ORG`, `PERCENT`, `PERSON`, `PRODUCT`, `QUANTITY`, `TIME`, `WORK_OF_ART` |
80
 
@@ -88,12 +88,12 @@ Macedonian pipeline optimized for CPU. Components: tok2vec, morphologizer, parse
88
  | `TOKEN_P` | 100.00 |
89
  | `TOKEN_R` | 100.00 |
90
  | `TOKEN_F` | 100.00 |
91
- | `SENTS_P` | 66.23 |
92
- | `SENTS_R` | 66.23 |
93
- | `SENTS_F` | 66.23 |
94
- | `DEP_UAS` | 64.17 |
95
- | `DEP_LAS` | 46.26 |
96
- | `ENTS_P` | 73.45 |
97
- | `ENTS_R` | 71.57 |
98
- | `ENTS_F` | 72.50 |
99
- | `POS_ACC` | 91.91 |
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
+ value: 0.7252368648
18
  - name: NER Recall
19
  type: recall
20
+ value: 0.7165957447
21
  - name: NER F Score
22
  type: f_score
23
+ value: 0.720890411
24
  - task:
25
  name: POS
26
  type: token-classification
27
  metrics:
28
  - name: POS (UPOS) Accuracy
29
  type: accuracy
30
+ value: 0.9185325061
31
  - task:
32
  name: UNLABELED_DEPENDENCIES
33
  type: token-classification
34
  metrics:
35
  - name: Unlabeled Attachment Score (UAS)
36
  type: f_score
37
+ value: 0.6280667321
38
  - task:
39
  name: LABELED_DEPENDENCIES
40
  type: token-classification
41
  metrics:
42
  - name: Labeled Attachment Score (LAS)
43
  type: f_score
44
+ value: 0.4553483808
45
  - task:
46
  name: SENTS
47
  type: token-classification
48
  metrics:
49
  - name: Sentences F-Score
50
  type: f_score
51
+ value: 0.6802721088
52
  ---
53
  ### Details: https://spacy.io/models/mk#mk_core_news_sm
54
 
57
  | Feature | Description |
58
  | --- | --- |
59
  | **Name** | `mk_core_news_sm` |
60
+ | **Version** | `3.4.0` |
61
+ | **spaCy** | `>=3.4.0,<3.5.0` |
62
  | **Default Pipeline** | `morphologizer`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
63
  | **Components** | `morphologizer`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
64
  | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
65
+ | **Sources** | [Macedonian Corpus](https://blog.netcetera.com/macedonian-spacy-f3c85484777f) (Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska)<br />[spaCy lookups data](https://github.com/explosion/spacy-lookups-data) (Explosion) |
66
  | **License** | `CC BY-SA 4.0` |
67
  | **Author** | [Explosion](https://explosion.ai) |
68
 
70
 
71
  <details>
72
 
73
+ <summary>View label scheme (54 labels for 3 components)</summary>
74
 
75
  | Component | Labels |
76
  | --- | --- |
77
+ | **`morphologizer`** | `POS=PROPN`, `POS=AUX`, `POS=ADJ`, `POS=NOUN`, `POS=ADP`, `POS=PUNCT`, `POS=CONJ`, `POS=NUM`, `POS=VERB`, `POS=PRON`, `POS=ADV`, `POS=SCONJ`, `POS=PART`, `POS=SYM`, `_`, `POS=SPACE`, `POS=X`, `POS=INTJ` |
78
  | **`parser`** | `ROOT`, `advmod`, `att`, `aux`, `cc`, `dep`, `det`, `dobj`, `iobj`, `neg`, `nsubj`, `pobj`, `poss`, `pozm`, `pozv`, `prep`, `punct`, `relcl` |
79
  | **`ner`** | `CARDINAL`, `DATE`, `EVENT`, `FAC`, `GPE`, `LANGUAGE`, `LAW`, `LOC`, `MONEY`, `NORP`, `ORDINAL`, `ORG`, `PERCENT`, `PERSON`, `PRODUCT`, `QUANTITY`, `TIME`, `WORK_OF_ART` |
80
 
88
  | `TOKEN_P` | 100.00 |
89
  | `TOKEN_R` | 100.00 |
90
  | `TOKEN_F` | 100.00 |
91
+ | `SENTS_P` | 71.43 |
92
+ | `SENTS_R` | 64.94 |
93
+ | `SENTS_F` | 68.03 |
94
+ | `DEP_UAS` | 62.81 |
95
+ | `DEP_LAS` | 45.53 |
96
+ | `ENTS_P` | 72.52 |
97
+ | `ENTS_R` | 71.66 |
98
+ | `ENTS_F` | 72.09 |
99
+ | `POS_ACC` | 91.85 |
accuracy.json CHANGED
@@ -3,36 +3,36 @@
3
  "token_p": 1.0,
4
  "token_r": 1.0,
5
  "token_f": 1.0,
6
- "sents_p": 0.6623376623,
7
- "sents_r": 0.6623376623,
8
- "sents_f": 0.6623376623,
9
- "dep_uas": 0.6417322835,
10
- "dep_las": 0.4625984252,
11
  "dep_las_per_type": {
12
  "nsubj": {
13
- "p": 0.5625,
14
- "r": 0.4736842105,
15
- "f": 0.5142857143
16
  },
17
  "root": {
18
- "p": 0.6623376623,
19
- "r": 0.7285714286,
20
- "f": 0.693877551
21
  },
22
  "cc": {
23
- "p": 0.7368421053,
24
- "r": 0.5,
25
- "f": 0.5957446809
26
  },
27
  "relcl": {
28
- "p": 0.347826087,
29
- "r": 0.3076923077,
30
- "f": 0.3265306122
31
  },
32
  "pozm": {
33
- "p": 0.6,
34
  "r": 0.2727272727,
35
- "f": 0.375
36
  },
37
  "poss": {
38
  "p": 0.0,
@@ -40,14 +40,14 @@
40
  "f": 0.0
41
  },
42
  "aux": {
43
- "p": 0.525,
44
  "r": 0.6363636364,
45
- "f": 0.5753424658
46
  },
47
  "prep": {
48
- "p": 0.6615384615,
49
  "r": 0.7166666667,
50
- "f": 0.688
51
  },
52
  "iobj": {
53
  "p": 0.0,
@@ -55,9 +55,9 @@
55
  "f": 0.0
56
  },
57
  "pozv": {
58
- "p": 0.2857142857,
59
  "r": 0.2666666667,
60
- "f": 0.275862069
61
  },
62
  "quantmod": {
63
  "p": 0.0,
@@ -65,9 +65,9 @@
65
  "f": 0.0
66
  },
67
  "att": {
68
- "p": 0.6486486486,
69
- "r": 0.4615384615,
70
- "f": 0.5393258427
71
  },
72
  "det": {
73
  "p": 0.0,
@@ -80,24 +80,24 @@
80
  "f": 0.0
81
  },
82
  "dep": {
83
- "p": 0.0,
84
- "r": 0.0,
85
- "f": 0.0
86
  },
87
  "dobj": {
88
- "p": 0.4035087719,
89
  "r": 0.3833333333,
90
- "f": 0.3931623932
91
  },
92
  "ppdo": {
93
- "p": 0.6,
94
- "r": 0.4,
95
- "f": 0.48
96
  },
97
  "neg": {
98
- "p": 0.625,
99
- "r": 0.4545454545,
100
- "f": 0.5263157895
101
  },
102
  "pobj": {
103
  "p": 0.4054054054,
@@ -160,70 +160,75 @@
160
  "f": 0.0
161
  }
162
  },
163
- "speed": 2310.8975813075,
164
- "ents_p": 0.7344978166,
165
- "ents_r": 0.7157446809,
166
- "ents_f": 0.725,
167
  "ents_per_type": {
168
- "GPE": {
169
- "p": 0.8571428571,
170
- "r": 0.8773584906,
171
- "f": 0.8671328671
172
  },
173
  "LOC": {
174
- "p": 0.7884615385,
175
- "r": 0.4712643678,
176
- "f": 0.5899280576
177
  },
178
- "QUANTITY": {
179
- "p": 0.65,
180
- "r": 0.6341463415,
181
- "f": 0.6419753086
182
  },
183
- "PERCENT": {
184
- "p": 0.7619047619,
185
- "r": 1.0,
186
- "f": 0.8648648649
187
  },
188
  "CARDINAL": {
189
- "p": 0.6132075472,
190
- "r": 0.6914893617,
191
- "f": 0.65
192
  },
193
  "ORG": {
194
- "p": 0.5322580645,
195
- "r": 0.6734693878,
196
- "f": 0.5945945946
197
- },
198
- "NORP": {
199
- "p": 0.5094339623,
200
- "r": 0.4153846154,
201
- "f": 0.4576271186
202
  },
203
  "PERSON": {
204
- "p": 0.6979865772,
205
- "r": 0.6933333333,
206
- "f": 0.6956521739
207
  },
208
  "DATE": {
209
- "p": 0.7638888889,
210
- "r": 0.7746478873,
211
- "f": 0.7692307692
212
  },
213
- "WORK_OF_ART": {
214
- "p": 0.65625,
215
- "r": 0.512195122,
216
- "f": 0.5753424658
217
  },
218
  "ORDINAL": {
219
- "p": 0.5384615385,
220
- "r": 0.6363636364,
221
- "f": 0.5833333333
222
  },
223
- "MONEY": {
224
- "p": 0.0,
225
- "r": 0.0,
226
- "f": 0.0
 
 
 
 
 
 
 
 
 
 
227
  },
228
  "LANGUAGE": {
229
  "p": 0.0,
@@ -231,30 +236,25 @@
231
  "f": 0.0
232
  },
233
  "TIME": {
234
- "p": 1.0,
235
  "r": 0.8333333333,
236
- "f": 0.9090909091
237
- },
238
- "FAC": {
239
- "p": 0.1818181818,
240
- "r": 0.1,
241
- "f": 0.1290322581
242
- },
243
- "LAW": {
244
- "p": 0.25,
245
- "r": 0.3333333333,
246
- "f": 0.2857142857
247
  },
248
  "EVENT": {
249
- "p": 0.625,
250
- "r": 0.5882352941,
251
- "f": 0.6060606061
252
  },
253
  "PRODUCT": {
254
- "p": 1.0,
255
  "r": 0.2,
256
- "f": 0.3333333333
 
 
 
 
 
257
  }
258
  },
259
- "pos_acc": 0.9190720259
260
  }
3
  "token_p": 1.0,
4
  "token_r": 1.0,
5
  "token_f": 1.0,
6
+ "sents_p": 0.7142857143,
7
+ "sents_r": 0.6493506494,
8
+ "sents_f": 0.6802721088,
9
+ "dep_uas": 0.6280667321,
10
+ "dep_las": 0.4553483808,
11
  "dep_las_per_type": {
12
  "nsubj": {
13
+ "p": 0.4210526316,
14
+ "r": 0.4210526316,
15
+ "f": 0.4210526316
16
  },
17
  "root": {
18
+ "p": 0.6571428571,
19
+ "r": 0.6571428571,
20
+ "f": 0.6571428571
21
  },
22
  "cc": {
23
+ "p": 0.8888888889,
24
+ "r": 0.5714285714,
25
+ "f": 0.6956521739
26
  },
27
  "relcl": {
28
+ "p": 0.35,
29
+ "r": 0.2692307692,
30
+ "f": 0.3043478261
31
  },
32
  "pozm": {
33
+ "p": 0.75,
34
  "r": 0.2727272727,
35
+ "f": 0.4
36
  },
37
  "poss": {
38
  "p": 0.0,
40
  "f": 0.0
41
  },
42
  "aux": {
43
+ "p": 0.5675675676,
44
  "r": 0.6363636364,
45
+ "f": 0.6
46
  },
47
  "prep": {
48
+ "p": 0.5733333333,
49
  "r": 0.7166666667,
50
+ "f": 0.637037037
51
  },
52
  "iobj": {
53
  "p": 0.0,
55
  "f": 0.0
56
  },
57
  "pozv": {
58
+ "p": 0.3333333333,
59
  "r": 0.2666666667,
60
+ "f": 0.2962962963
61
  },
62
  "quantmod": {
63
  "p": 0.0,
65
  "f": 0.0
66
  },
67
  "att": {
68
+ "p": 0.6170212766,
69
+ "r": 0.5576923077,
70
+ "f": 0.5858585859
71
  },
72
  "det": {
73
  "p": 0.0,
80
  "f": 0.0
81
  },
82
  "dep": {
83
+ "p": 0.0142857143,
84
+ "r": 0.3333333333,
85
+ "f": 0.0273972603
86
  },
87
  "dobj": {
88
+ "p": 0.3770491803,
89
  "r": 0.3833333333,
90
+ "f": 0.3801652893
91
  },
92
  "ppdo": {
93
+ "p": 0.5,
94
+ "r": 0.1333333333,
95
+ "f": 0.2105263158
96
  },
97
  "neg": {
98
+ "p": 0.75,
99
+ "r": 0.5454545455,
100
+ "f": 0.6315789474
101
  },
102
  "pobj": {
103
  "p": 0.4054054054,
160
  "f": 0.0
161
  }
162
  },
163
+ "speed": 2195.0892767476,
164
+ "ents_p": 0.7252368648,
165
+ "ents_r": 0.7165957447,
166
+ "ents_f": 0.720890411,
167
  "ents_per_type": {
168
+ "NORP": {
169
+ "p": 0.4259259259,
170
+ "r": 0.3538461538,
171
+ "f": 0.3865546218
172
  },
173
  "LOC": {
174
+ "p": 0.6231884058,
175
+ "r": 0.4942528736,
176
+ "f": 0.5512820513
177
  },
178
+ "GPE": {
179
+ "p": 0.875862069,
180
+ "r": 0.8985849057,
181
+ "f": 0.8870779977
182
  },
183
+ "QUANTITY": {
184
+ "p": 0.6170212766,
185
+ "r": 0.7073170732,
186
+ "f": 0.6590909091
187
  },
188
  "CARDINAL": {
189
+ "p": 0.6875,
190
+ "r": 0.7021276596,
191
+ "f": 0.6947368421
192
  },
193
  "ORG": {
194
+ "p": 0.53125,
195
+ "r": 0.693877551,
196
+ "f": 0.6017699115
 
 
 
 
 
197
  },
198
  "PERSON": {
199
+ "p": 0.6734693878,
200
+ "r": 0.66,
201
+ "f": 0.6666666667
202
  },
203
  "DATE": {
204
+ "p": 0.7046979866,
205
+ "r": 0.7394366197,
206
+ "f": 0.7216494845
207
  },
208
+ "MONEY": {
209
+ "p": 0.3333333333,
210
+ "r": 0.5,
211
+ "f": 0.4
212
  },
213
  "ORDINAL": {
214
+ "p": 0.5454545455,
215
+ "r": 0.5454545455,
216
+ "f": 0.5454545455
217
  },
218
+ "PERCENT": {
219
+ "p": 1.0,
220
+ "r": 1.0,
221
+ "f": 1.0
222
+ },
223
+ "WORK_OF_ART": {
224
+ "p": 0.6451612903,
225
+ "r": 0.487804878,
226
+ "f": 0.5555555556
227
+ },
228
+ "FAC": {
229
+ "p": 0.125,
230
+ "r": 0.05,
231
+ "f": 0.0714285714
232
  },
233
  "LANGUAGE": {
234
  "p": 0.0,
236
  "f": 0.0
237
  },
238
  "TIME": {
239
+ "p": 0.7142857143,
240
  "r": 0.8333333333,
241
+ "f": 0.7692307692
 
 
 
 
 
 
 
 
 
 
242
  },
243
  "EVENT": {
244
+ "p": 0.6666666667,
245
+ "r": 0.7058823529,
246
+ "f": 0.6857142857
247
  },
248
  "PRODUCT": {
249
+ "p": 0.5,
250
  "r": 0.2,
251
+ "f": 0.2857142857
252
+ },
253
+ "LAW": {
254
+ "p": 0.0,
255
+ "r": 0.0,
256
+ "f": 0.0
257
  }
258
  },
259
+ "pos_acc": 0.9185325061
260
  }
meta.json CHANGED
@@ -1,14 +1,14 @@
1
  {
2
  "lang":"mk",
3
  "name":"core_news_sm",
4
- "version":"3.3.0",
5
  "description":"Macedonian pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"CC BY-SA 4.0",
10
- "spacy_version":">=3.3.0.dev0,<3.4.0",
11
- "spacy_git_version":"849bef2de",
12
  "vectors":{
13
  "width":0,
14
  "vectors":0,
@@ -31,8 +31,9 @@
31
  "POS=SCONJ",
32
  "POS=PART",
33
  "POS=SYM",
34
- "POS=X",
35
  "_",
 
 
36
  "POS=INTJ"
37
  ],
38
  "parser":[
@@ -105,36 +106,36 @@
105
  "token_p":1.0,
106
  "token_r":1.0,
107
  "token_f":1.0,
108
- "sents_p":0.6623376623,
109
- "sents_r":0.6623376623,
110
- "sents_f":0.6623376623,
111
- "dep_uas":0.6417322835,
112
- "dep_las":0.4625984252,
113
  "dep_las_per_type":{
114
  "nsubj":{
115
- "p":0.5625,
116
- "r":0.4736842105,
117
- "f":0.5142857143
118
  },
119
  "root":{
120
- "p":0.6623376623,
121
- "r":0.7285714286,
122
- "f":0.693877551
123
  },
124
  "cc":{
125
- "p":0.7368421053,
126
- "r":0.5,
127
- "f":0.5957446809
128
  },
129
  "relcl":{
130
- "p":0.347826087,
131
- "r":0.3076923077,
132
- "f":0.3265306122
133
  },
134
  "pozm":{
135
- "p":0.6,
136
  "r":0.2727272727,
137
- "f":0.375
138
  },
139
  "poss":{
140
  "p":0.0,
@@ -142,14 +143,14 @@
142
  "f":0.0
143
  },
144
  "aux":{
145
- "p":0.525,
146
  "r":0.6363636364,
147
- "f":0.5753424658
148
  },
149
  "prep":{
150
- "p":0.6615384615,
151
  "r":0.7166666667,
152
- "f":0.688
153
  },
154
  "iobj":{
155
  "p":0.0,
@@ -157,9 +158,9 @@
157
  "f":0.0
158
  },
159
  "pozv":{
160
- "p":0.2857142857,
161
  "r":0.2666666667,
162
- "f":0.275862069
163
  },
164
  "quantmod":{
165
  "p":0.0,
@@ -167,9 +168,9 @@
167
  "f":0.0
168
  },
169
  "att":{
170
- "p":0.6486486486,
171
- "r":0.4615384615,
172
- "f":0.5393258427
173
  },
174
  "det":{
175
  "p":0.0,
@@ -182,24 +183,24 @@
182
  "f":0.0
183
  },
184
  "dep":{
185
- "p":0.0,
186
- "r":0.0,
187
- "f":0.0
188
  },
189
  "dobj":{
190
- "p":0.4035087719,
191
  "r":0.3833333333,
192
- "f":0.3931623932
193
  },
194
  "ppdo":{
195
- "p":0.6,
196
- "r":0.4,
197
- "f":0.48
198
  },
199
  "neg":{
200
- "p":0.625,
201
- "r":0.4545454545,
202
- "f":0.5263157895
203
  },
204
  "pobj":{
205
  "p":0.4054054054,
@@ -262,70 +263,75 @@
262
  "f":0.0
263
  }
264
  },
265
- "speed":2310.8975813075,
266
- "ents_p":0.7344978166,
267
- "ents_r":0.7157446809,
268
- "ents_f":0.725,
269
  "ents_per_type":{
270
- "GPE":{
271
- "p":0.8571428571,
272
- "r":0.8773584906,
273
- "f":0.8671328671
274
  },
275
  "LOC":{
276
- "p":0.7884615385,
277
- "r":0.4712643678,
278
- "f":0.5899280576
279
  },
280
- "QUANTITY":{
281
- "p":0.65,
282
- "r":0.6341463415,
283
- "f":0.6419753086
284
  },
285
- "PERCENT":{
286
- "p":0.7619047619,
287
- "r":1.0,
288
- "f":0.8648648649
289
  },
290
  "CARDINAL":{
291
- "p":0.6132075472,
292
- "r":0.6914893617,
293
- "f":0.65
294
  },
295
  "ORG":{
296
- "p":0.5322580645,
297
- "r":0.6734693878,
298
- "f":0.5945945946
299
- },
300
- "NORP":{
301
- "p":0.5094339623,
302
- "r":0.4153846154,
303
- "f":0.4576271186
304
  },
305
  "PERSON":{
306
- "p":0.6979865772,
307
- "r":0.6933333333,
308
- "f":0.6956521739
309
  },
310
  "DATE":{
311
- "p":0.7638888889,
312
- "r":0.7746478873,
313
- "f":0.7692307692
314
  },
315
- "WORK_OF_ART":{
316
- "p":0.65625,
317
- "r":0.512195122,
318
- "f":0.5753424658
319
  },
320
  "ORDINAL":{
321
- "p":0.5384615385,
322
- "r":0.6363636364,
323
- "f":0.5833333333
324
  },
325
- "MONEY":{
326
- "p":0.0,
327
- "r":0.0,
328
- "f":0.0
 
 
 
 
 
 
 
 
 
 
329
  },
330
  "LANGUAGE":{
331
  "p":0.0,
@@ -333,46 +339,29 @@
333
  "f":0.0
334
  },
335
  "TIME":{
336
- "p":1.0,
337
  "r":0.8333333333,
338
- "f":0.9090909091
339
- },
340
- "FAC":{
341
- "p":0.1818181818,
342
- "r":0.1,
343
- "f":0.1290322581
344
- },
345
- "LAW":{
346
- "p":0.25,
347
- "r":0.3333333333,
348
- "f":0.2857142857
349
  },
350
  "EVENT":{
351
- "p":0.625,
352
- "r":0.5882352941,
353
- "f":0.6060606061
354
  },
355
  "PRODUCT":{
356
- "p":1.0,
357
  "r":0.2,
358
- "f":0.3333333333
 
 
 
 
 
359
  }
360
  },
361
- "pos_acc":0.9190720259
362
  },
363
  "sources":[
364
- {
365
- "name":"Macedonian Corpus",
366
- "url":"https://blog.netcetera.com/macedonian-spacy-f3c85484777f",
367
- "license":"CC BY-SA 4.0",
368
- "author":"Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska"
369
- },
370
- {
371
- "name":"Macedonian Corpus",
372
- "url":"https://blog.netcetera.com/macedonian-spacy-f3c85484777f",
373
- "license":"CC BY-SA 4.0",
374
- "author":"Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska"
375
- },
376
  {
377
  "name":"Macedonian Corpus",
378
  "url":"https://blog.netcetera.com/macedonian-spacy-f3c85484777f",
1
  {
2
  "lang":"mk",
3
  "name":"core_news_sm",
4
+ "version":"3.4.0",
5
  "description":"Macedonian pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"CC BY-SA 4.0",
10
+ "spacy_version":">=3.4.0,<3.5.0",
11
+ "spacy_git_version":"dd038b536",
12
  "vectors":{
13
  "width":0,
14
  "vectors":0,
31
  "POS=SCONJ",
32
  "POS=PART",
33
  "POS=SYM",
 
34
  "_",
35
+ "POS=SPACE",
36
+ "POS=X",
37
  "POS=INTJ"
38
  ],
39
  "parser":[
106
  "token_p":1.0,
107
  "token_r":1.0,
108
  "token_f":1.0,
109
+ "sents_p":0.7142857143,
110
+ "sents_r":0.6493506494,
111
+ "sents_f":0.6802721088,
112
+ "dep_uas":0.6280667321,
113
+ "dep_las":0.4553483808,
114
  "dep_las_per_type":{
115
  "nsubj":{
116
+ "p":0.4210526316,
117
+ "r":0.4210526316,
118
+ "f":0.4210526316
119
  },
120
  "root":{
121
+ "p":0.6571428571,
122
+ "r":0.6571428571,
123
+ "f":0.6571428571
124
  },
125
  "cc":{
126
+ "p":0.8888888889,
127
+ "r":0.5714285714,
128
+ "f":0.6956521739
129
  },
130
  "relcl":{
131
+ "p":0.35,
132
+ "r":0.2692307692,
133
+ "f":0.3043478261
134
  },
135
  "pozm":{
136
+ "p":0.75,
137
  "r":0.2727272727,
138
+ "f":0.4
139
  },
140
  "poss":{
141
  "p":0.0,
143
  "f":0.0
144
  },
145
  "aux":{
146
+ "p":0.5675675676,
147
  "r":0.6363636364,
148
+ "f":0.6
149
  },
150
  "prep":{
151
+ "p":0.5733333333,
152
  "r":0.7166666667,
153
+ "f":0.637037037
154
  },
155
  "iobj":{
156
  "p":0.0,
158
  "f":0.0
159
  },
160
  "pozv":{
161
+ "p":0.3333333333,
162
  "r":0.2666666667,
163
+ "f":0.2962962963
164
  },
165
  "quantmod":{
166
  "p":0.0,
168
  "f":0.0
169
  },
170
  "att":{
171
+ "p":0.6170212766,
172
+ "r":0.5576923077,
173
+ "f":0.5858585859
174
  },
175
  "det":{
176
  "p":0.0,
183
  "f":0.0
184
  },
185
  "dep":{
186
+ "p":0.0142857143,
187
+ "r":0.3333333333,
188
+ "f":0.0273972603
189
  },
190
  "dobj":{
191
+ "p":0.3770491803,
192
  "r":0.3833333333,
193
+ "f":0.3801652893
194
  },
195
  "ppdo":{
196
+ "p":0.5,
197
+ "r":0.1333333333,
198
+ "f":0.2105263158
199
  },
200
  "neg":{
201
+ "p":0.75,
202
+ "r":0.5454545455,
203
+ "f":0.6315789474
204
  },
205
  "pobj":{
206
  "p":0.4054054054,
263
  "f":0.0
264
  }
265
  },
266
+ "speed":2195.0892767476,
267
+ "ents_p":0.7252368648,
268
+ "ents_r":0.7165957447,
269
+ "ents_f":0.720890411,
270
  "ents_per_type":{
271
+ "NORP":{
272
+ "p":0.4259259259,
273
+ "r":0.3538461538,
274
+ "f":0.3865546218
275
  },
276
  "LOC":{
277
+ "p":0.6231884058,
278
+ "r":0.4942528736,
279
+ "f":0.5512820513
280
  },
281
+ "GPE":{
282
+ "p":0.875862069,
283
+ "r":0.8985849057,
284
+ "f":0.8870779977
285
  },
286
+ "QUANTITY":{
287
+ "p":0.6170212766,
288
+ "r":0.7073170732,
289
+ "f":0.6590909091
290
  },
291
  "CARDINAL":{
292
+ "p":0.6875,
293
+ "r":0.7021276596,
294
+ "f":0.6947368421
295
  },
296
  "ORG":{
297
+ "p":0.53125,
298
+ "r":0.693877551,
299
+ "f":0.6017699115
 
 
 
 
 
300
  },
301
  "PERSON":{
302
+ "p":0.6734693878,
303
+ "r":0.66,
304
+ "f":0.6666666667
305
  },
306
  "DATE":{
307
+ "p":0.7046979866,
308
+ "r":0.7394366197,
309
+ "f":0.7216494845
310
  },
311
+ "MONEY":{
312
+ "p":0.3333333333,
313
+ "r":0.5,
314
+ "f":0.4
315
  },
316
  "ORDINAL":{
317
+ "p":0.5454545455,
318
+ "r":0.5454545455,
319
+ "f":0.5454545455
320
  },
321
+ "PERCENT":{
322
+ "p":1.0,
323
+ "r":1.0,
324
+ "f":1.0
325
+ },
326
+ "WORK_OF_ART":{
327
+ "p":0.6451612903,
328
+ "r":0.487804878,
329
+ "f":0.5555555556
330
+ },
331
+ "FAC":{
332
+ "p":0.125,
333
+ "r":0.05,
334
+ "f":0.0714285714
335
  },
336
  "LANGUAGE":{
337
  "p":0.0,
339
  "f":0.0
340
  },
341
  "TIME":{
342
+ "p":0.7142857143,
343
  "r":0.8333333333,
344
+ "f":0.7692307692
 
 
 
 
 
 
 
 
 
 
345
  },
346
  "EVENT":{
347
+ "p":0.6666666667,
348
+ "r":0.7058823529,
349
+ "f":0.6857142857
350
  },
351
  "PRODUCT":{
352
+ "p":0.5,
353
  "r":0.2,
354
+ "f":0.2857142857
355
+ },
356
+ "LAW":{
357
+ "p":0.0,
358
+ "r":0.0,
359
+ "f":0.0
360
  }
361
  },
362
+ "pos_acc":0.9185325061
363
  },
364
  "sources":[
 
 
 
 
 
 
 
 
 
 
 
 
365
  {
366
  "name":"Macedonian Corpus",
367
  "url":"https://blog.netcetera.com/macedonian-spacy-f3c85484777f",
mk_core_news_sm-any-py3-none-any.whl CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5cfc0083e6d1a00dd6fff2ee677f6c9ebc0b161858b8339978921c2885cf4420
3
- size 18027652
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7c40a97439e598afbb6beb04fbd249d2285e043adf83584d485fc8f1a494c04a
3
+ size 18014625
morphologizer/cfg CHANGED
@@ -15,8 +15,9 @@
15
  "POS=SCONJ":"",
16
  "POS=PART":"",
17
  "POS=SYM":"",
18
- "POS=X":"",
19
  "_":"",
 
 
20
  "POS=INTJ":""
21
  },
22
  "labels_pos":{
@@ -34,8 +35,9 @@
34
  "POS=SCONJ":98,
35
  "POS=PART":94,
36
  "POS=SYM":99,
37
- "POS=X":101,
38
  "_":0,
 
 
39
  "POS=INTJ":91
40
  },
41
  "overwrite":true
15
  "POS=SCONJ":"",
16
  "POS=PART":"",
17
  "POS=SYM":"",
 
18
  "_":"",
19
+ "POS=SPACE":"",
20
+ "POS=X":"",
21
  "POS=INTJ":""
22
  },
23
  "labels_pos":{
35
  "POS=SCONJ":98,
36
  "POS=PART":94,
37
  "POS=SYM":99,
 
38
  "_":0,
39
+ "POS=SPACE":103,
40
+ "POS=X":101,
41
  "POS=INTJ":91
42
  },
43
  "overwrite":true
morphologizer/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7e2678c9970f41bfa572aa5f636bf62f890f582357c255cb5d1342c6f218cae4
3
- size 6146599
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:48e63a5e03882890eb73594cdc2eb59a1a116df93d339e27c36f054590e3ad91
3
+ size 6146987
ner/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:98ad79defba750984d37534214d83a66fec734d43c7fe7e463d6e870de60a651
3
  size 6284763
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:93532561883a73afdc614ab5c26390352af76542e1d0fcd9aa808ffb40f4dfd2
3
  size 6284763
ner/moves CHANGED
@@ -1 +1 @@
1
- ��moves��{"0":{},"1":{"GPE":3855,"PERSON":2039,"DATE":1865,"ORG":1187,"NORP":1025,"WORK_OF_ART":983,"CARDINAL":641,"LOC":600,"EVENT":476,"FAC":418,"QUANTITY":284,"LAW":141,"PERCENT":136,"TIME":125,"ORDINAL":118,"LANGUAGE":109,"PRODUCT":57,"MONEY":43},"2":{"GPE":3855,"PERSON":2039,"DATE":1865,"ORG":1187,"NORP":1025,"WORK_OF_ART":983,"CARDINAL":641,"LOC":600,"EVENT":476,"FAC":418,"QUANTITY":284,"LAW":141,"PERCENT":136,"TIME":125,"ORDINAL":118,"LANGUAGE":109,"PRODUCT":57,"MONEY":43},"3":{"GPE":3855,"PERSON":2039,"DATE":1865,"ORG":1187,"NORP":1025,"WORK_OF_ART":983,"CARDINAL":641,"LOC":600,"EVENT":476,"FAC":418,"QUANTITY":284,"LAW":141,"PERCENT":136,"TIME":125,"ORDINAL":118,"LANGUAGE":109,"PRODUCT":57,"MONEY":43},"4":{"GPE":3855,"PERSON":2039,"DATE":1865,"ORG":1187,"NORP":1025,"WORK_OF_ART":983,"CARDINAL":641,"LOC":600,"EVENT":476,"FAC":418,"QUANTITY":284,"LAW":141,"PERCENT":136,"TIME":125,"ORDINAL":118,"LANGUAGE":109,"PRODUCT":57,"MONEY":43,"":1},"5":{"":1}}�cfg��neg_key�
1
+ ��moves��{"0":{},"1":{"GPE":3857,"PERSON":2043,"DATE":1873,"ORG":1192,"NORP":1028,"WORK_OF_ART":983,"CARDINAL":641,"LOC":603,"EVENT":481,"FAC":418,"QUANTITY":286,"LAW":141,"PERCENT":136,"TIME":125,"ORDINAL":118,"LANGUAGE":110,"PRODUCT":57,"MONEY":43},"2":{"GPE":3857,"PERSON":2043,"DATE":1873,"ORG":1192,"NORP":1028,"WORK_OF_ART":983,"CARDINAL":641,"LOC":603,"EVENT":481,"FAC":418,"QUANTITY":286,"LAW":141,"PERCENT":136,"TIME":125,"ORDINAL":118,"LANGUAGE":110,"PRODUCT":57,"MONEY":43},"3":{"GPE":3857,"PERSON":2043,"DATE":1873,"ORG":1192,"NORP":1028,"WORK_OF_ART":983,"CARDINAL":641,"LOC":603,"EVENT":481,"FAC":418,"QUANTITY":286,"LAW":141,"PERCENT":136,"TIME":125,"ORDINAL":118,"LANGUAGE":110,"PRODUCT":57,"MONEY":43},"4":{"GPE":3857,"PERSON":2043,"DATE":1873,"ORG":1192,"NORP":1028,"WORK_OF_ART":983,"CARDINAL":641,"LOC":603,"EVENT":481,"FAC":418,"QUANTITY":286,"LAW":141,"PERCENT":136,"TIME":125,"ORDINAL":118,"LANGUAGE":110,"PRODUCT":57,"MONEY":43,"":1},"5":{"":1}}�cfg��neg_key�
parser/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ffc12d254fd79c0911d0b365e4c322492826b1755b2d6072d8a9ee7ca389af19
3
  size 6438942
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2a4ff25bf57ed12013175333a660ada5b49b8c90de69fd6f359370ec44077582
3
  size 6438942
parser/moves CHANGED
@@ -1 +1 @@
1
- ��moves�2{"0":{"":1190},"1":{"":2140},"2":{"nsubj":278,"aux":187,"att":180,"neg":67,"prep":56,"poss":42,"pozv":37,"advmod":36,"ppdo||dobj":35,"dep":0},"3":{"punct":550,"dobj":316,"prep":291,"relcl":148,"pobj":141,"aux":87,"cc":70,"iobj":63,"att":56,"nsubj":51,"pozv":44,"pozm":40,"det":31,"dep":0},"4":{"ROOT":500}}�cfg��neg_key�
1
+ ��moves�3{"0":{"":1190},"1":{"":2177},"2":{"nsubj":278,"aux":187,"att":180,"neg":67,"prep":56,"poss":42,"pozv":37,"advmod":36,"ppdo||dobj":35,"dep":0},"3":{"punct":550,"dobj":316,"prep":291,"relcl":148,"pobj":141,"aux":87,"cc":70,"iobj":63,"dep":60,"att":56,"nsubj":51,"pozv":44,"pozm":40,"det":31},"4":{"ROOT":500}}�cfg��neg_key�
senter/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:04d8fd67eec456f2d5e5d77fad8d6b99fb7b2bd8b553b8b96ebe650e466d15e9
3
  size 197089
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3c70d5f55e6a2ed187321d5aac0013f209b21b062746b6f2581570ae58017923
3
  size 197089
vocab/strings.json CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a02c1ec501f3603d94a56c06b742a0450889b1b886af7a92d2b14e29310ef9e2
3
- size 1409419
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d5e26a31138f74688e4411f555704714bddf1ef633a535826c6db154cbdf94d2
3
+ size 1410212