adrianeboyd commited on
Commit
f51487a
1 Parent(s): bcdcb27

Update spaCy pipeline

Browse files
.gitattributes CHANGED
@@ -19,3 +19,4 @@
19
  *strings.json filter=lfs diff=lfs merge=lfs -text
20
  vectors filter=lfs diff=lfs merge=lfs -text
21
  model filter=lfs diff=lfs merge=lfs -text
 
19
  *strings.json filter=lfs diff=lfs merge=lfs -text
20
  vectors filter=lfs diff=lfs merge=lfs -text
21
  model filter=lfs diff=lfs merge=lfs -text
22
+ vocab/key2row filter=lfs diff=lfs merge=lfs -text
LICENSES_SOURCES CHANGED
@@ -438,886 +438,6 @@ Creative Commons may be contacted at creativecommons.org.
438
 
439
 
440
 
441
- # Macedonian Corpus
442
-
443
- * Author: Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska
444
- * URL: https://blog.netcetera.com/macedonian-spacy-f3c85484777f
445
- * License: CC BY-SA 4.0
446
-
447
- ```
448
- Attribution-ShareAlike 4.0 International
449
-
450
- =======================================================================
451
-
452
- Creative Commons Corporation ("Creative Commons") is not a law firm and
453
- does not provide legal services or legal advice. Distribution of
454
- Creative Commons public licenses does not create a lawyer-client or
455
- other relationship. Creative Commons makes its licenses and related
456
- information available on an "as-is" basis. Creative Commons gives no
457
- warranties regarding its licenses, any material licensed under their
458
- terms and conditions, or any related information. Creative Commons
459
- disclaims all liability for damages resulting from their use to the
460
- fullest extent possible.
461
-
462
- Using Creative Commons Public Licenses
463
-
464
- Creative Commons public licenses provide a standard set of terms and
465
- conditions that creators and other rights holders may use to share
466
- original works of authorship and other material subject to copyright
467
- and certain other rights specified in the public license below. The
468
- following considerations are for informational purposes only, are not
469
- exhaustive, and do not form part of our licenses.
470
-
471
- Considerations for licensors: Our public licenses are
472
- intended for use by those authorized to give the public
473
- permission to use material in ways otherwise restricted by
474
- copyright and certain other rights. Our licenses are
475
- irrevocable. Licensors should read and understand the terms
476
- and conditions of the license they choose before applying it.
477
- Licensors should also secure all rights necessary before
478
- applying our licenses so that the public can reuse the
479
- material as expected. Licensors should clearly mark any
480
- material not subject to the license. This includes other CC-
481
- licensed material, or material used under an exception or
482
- limitation to copyright. More considerations for licensors:
483
- wiki.creativecommons.org/Considerations_for_licensors
484
-
485
- Considerations for the public: By using one of our public
486
- licenses, a licensor grants the public permission to use the
487
- licensed material under specified terms and conditions. If
488
- the licensor's permission is not necessary for any reason--for
489
- example, because of any applicable exception or limitation to
490
- copyright--then that use is not regulated by the license. Our
491
- licenses grant only permissions under copyright and certain
492
- other rights that a licensor has authority to grant. Use of
493
- the licensed material may still be restricted for other
494
- reasons, including because others have copyright or other
495
- rights in the material. A licensor may make special requests,
496
- such as asking that all changes be marked or described.
497
- Although not required by our licenses, you are encouraged to
498
- respect those requests where reasonable. More considerations
499
- for the public:
500
- wiki.creativecommons.org/Considerations_for_licensees
501
-
502
- =======================================================================
503
-
504
- Creative Commons Attribution-ShareAlike 4.0 International Public
505
- License
506
-
507
- By exercising the Licensed Rights (defined below), You accept and agree
508
- to be bound by the terms and conditions of this Creative Commons
509
- Attribution-ShareAlike 4.0 International Public License ("Public
510
- License"). To the extent this Public License may be interpreted as a
511
- contract, You are granted the Licensed Rights in consideration of Your
512
- acceptance of these terms and conditions, and the Licensor grants You
513
- such rights in consideration of benefits the Licensor receives from
514
- making the Licensed Material available under these terms and
515
- conditions.
516
-
517
-
518
- Section 1 -- Definitions.
519
-
520
- a. Adapted Material means material subject to Copyright and Similar
521
- Rights that is derived from or based upon the Licensed Material
522
- and in which the Licensed Material is translated, altered,
523
- arranged, transformed, or otherwise modified in a manner requiring
524
- permission under the Copyright and Similar Rights held by the
525
- Licensor. For purposes of this Public License, where the Licensed
526
- Material is a musical work, performance, or sound recording,
527
- Adapted Material is always produced where the Licensed Material is
528
- synched in timed relation with a moving image.
529
-
530
- b. Adapter's License means the license You apply to Your Copyright
531
- and Similar Rights in Your contributions to Adapted Material in
532
- accordance with the terms and conditions of this Public License.
533
-
534
- c. BY-SA Compatible License means a license listed at
535
- creativecommons.org/compatiblelicenses, approved by Creative
536
- Commons as essentially the equivalent of this Public License.
537
-
538
- d. Copyright and Similar Rights means copyright and/or similar rights
539
- closely related to copyright including, without limitation,
540
- performance, broadcast, sound recording, and Sui Generis Database
541
- Rights, without regard to how the rights are labeled or
542
- categorized. For purposes of this Public License, the rights
543
- specified in Section 2(b)(1)-(2) are not Copyright and Similar
544
- Rights.
545
-
546
- e. Effective Technological Measures means those measures that, in the
547
- absence of proper authority, may not be circumvented under laws
548
- fulfilling obligations under Article 11 of the WIPO Copyright
549
- Treaty adopted on December 20, 1996, and/or similar international
550
- agreements.
551
-
552
- f. Exceptions and Limitations means fair use, fair dealing, and/or
553
- any other exception or limitation to Copyright and Similar Rights
554
- that applies to Your use of the Licensed Material.
555
-
556
- g. License Elements means the license attributes listed in the name
557
- of a Creative Commons Public License. The License Elements of this
558
- Public License are Attribution and ShareAlike.
559
-
560
- h. Licensed Material means the artistic or literary work, database,
561
- or other material to which the Licensor applied this Public
562
- License.
563
-
564
- i. Licensed Rights means the rights granted to You subject to the
565
- terms and conditions of this Public License, which are limited to
566
- all Copyright and Similar Rights that apply to Your use of the
567
- Licensed Material and that the Licensor has authority to license.
568
-
569
- j. Licensor means the individual(s) or entity(ies) granting rights
570
- under this Public License.
571
-
572
- k. Share means to provide material to the public by any means or
573
- process that requires permission under the Licensed Rights, such
574
- as reproduction, public display, public performance, distribution,
575
- dissemination, communication, or importation, and to make material
576
- available to the public including in ways that members of the
577
- public may access the material from a place and at a time
578
- individually chosen by them.
579
-
580
- l. Sui Generis Database Rights means rights other than copyright
581
- resulting from Directive 96/9/EC of the European Parliament and of
582
- the Council of 11 March 1996 on the legal protection of databases,
583
- as amended and/or succeeded, as well as other essentially
584
- equivalent rights anywhere in the world.
585
-
586
- m. You means the individual or entity exercising the Licensed Rights
587
- under this Public License. Your has a corresponding meaning.
588
-
589
-
590
- Section 2 -- Scope.
591
-
592
- a. License grant.
593
-
594
- 1. Subject to the terms and conditions of this Public License,
595
- the Licensor hereby grants You a worldwide, royalty-free,
596
- non-sublicensable, non-exclusive, irrevocable license to
597
- exercise the Licensed Rights in the Licensed Material to:
598
-
599
- a. reproduce and Share the Licensed Material, in whole or
600
- in part; and
601
-
602
- b. produce, reproduce, and Share Adapted Material.
603
-
604
- 2. Exceptions and Limitations. For the avoidance of doubt, where
605
- Exceptions and Limitations apply to Your use, this Public
606
- License does not apply, and You do not need to comply with
607
- its terms and conditions.
608
-
609
- 3. Term. The term of this Public License is specified in Section
610
- 6(a).
611
-
612
- 4. Media and formats; technical modifications allowed. The
613
- Licensor authorizes You to exercise the Licensed Rights in
614
- all media and formats whether now known or hereafter created,
615
- and to make technical modifications necessary to do so. The
616
- Licensor waives and/or agrees not to assert any right or
617
- authority to forbid You from making technical modifications
618
- necessary to exercise the Licensed Rights, including
619
- technical modifications necessary to circumvent Effective
620
- Technological Measures. For purposes of this Public License,
621
- simply making modifications authorized by this Section 2(a)
622
- (4) never produces Adapted Material.
623
-
624
- 5. Downstream recipients.
625
-
626
- a. Offer from the Licensor -- Licensed Material. Every
627
- recipient of the Licensed Material automatically
628
- receives an offer from the Licensor to exercise the
629
- Licensed Rights under the terms and conditions of this
630
- Public License.
631
-
632
- b. Additional offer from the Licensor -- Adapted Material.
633
- Every recipient of Adapted Material from You
634
- automatically receives an offer from the Licensor to
635
- exercise the Licensed Rights in the Adapted Material
636
- under the conditions of the Adapter's License You apply.
637
-
638
- c. No downstream restrictions. You may not offer or impose
639
- any additional or different terms or conditions on, or
640
- apply any Effective Technological Measures to, the
641
- Licensed Material if doing so restricts exercise of the
642
- Licensed Rights by any recipient of the Licensed
643
- Material.
644
-
645
- 6. No endorsement. Nothing in this Public License constitutes or
646
- may be construed as permission to assert or imply that You
647
- are, or that Your use of the Licensed Material is, connected
648
- with, or sponsored, endorsed, or granted official status by,
649
- the Licensor or others designated to receive attribution as
650
- provided in Section 3(a)(1)(A)(i).
651
-
652
- b. Other rights.
653
-
654
- 1. Moral rights, such as the right of integrity, are not
655
- licensed under this Public License, nor are publicity,
656
- privacy, and/or other similar personality rights; however, to
657
- the extent possible, the Licensor waives and/or agrees not to
658
- assert any such rights held by the Licensor to the limited
659
- extent necessary to allow You to exercise the Licensed
660
- Rights, but not otherwise.
661
-
662
- 2. Patent and trademark rights are not licensed under this
663
- Public License.
664
-
665
- 3. To the extent possible, the Licensor waives any right to
666
- collect royalties from You for the exercise of the Licensed
667
- Rights, whether directly or through a collecting society
668
- under any voluntary or waivable statutory or compulsory
669
- licensing scheme. In all other cases the Licensor expressly
670
- reserves any right to collect such royalties.
671
-
672
-
673
- Section 3 -- License Conditions.
674
-
675
- Your exercise of the Licensed Rights is expressly made subject to the
676
- following conditions.
677
-
678
- a. Attribution.
679
-
680
- 1. If You Share the Licensed Material (including in modified
681
- form), You must:
682
-
683
- a. retain the following if it is supplied by the Licensor
684
- with the Licensed Material:
685
-
686
- i. identification of the creator(s) of the Licensed
687
- Material and any others designated to receive
688
- attribution, in any reasonable manner requested by
689
- the Licensor (including by pseudonym if
690
- designated);
691
-
692
- ii. a copyright notice;
693
-
694
- iii. a notice that refers to this Public License;
695
-
696
- iv. a notice that refers to the disclaimer of
697
- warranties;
698
-
699
- v. a URI or hyperlink to the Licensed Material to the
700
- extent reasonably practicable;
701
-
702
- b. indicate if You modified the Licensed Material and
703
- retain an indication of any previous modifications; and
704
-
705
- c. indicate the Licensed Material is licensed under this
706
- Public License, and include the text of, or the URI or
707
- hyperlink to, this Public License.
708
-
709
- 2. You may satisfy the conditions in Section 3(a)(1) in any
710
- reasonable manner based on the medium, means, and context in
711
- which You Share the Licensed Material. For example, it may be
712
- reasonable to satisfy the conditions by providing a URI or
713
- hyperlink to a resource that includes the required
714
- information.
715
-
716
- 3. If requested by the Licensor, You must remove any of the
717
- information required by Section 3(a)(1)(A) to the extent
718
- reasonably practicable.
719
-
720
- b. ShareAlike.
721
-
722
- In addition to the conditions in Section 3(a), if You Share
723
- Adapted Material You produce, the following conditions also apply.
724
-
725
- 1. The Adapter's License You apply must be a Creative Commons
726
- license with the same License Elements, this version or
727
- later, or a BY-SA Compatible License.
728
-
729
- 2. You must include the text of, or the URI or hyperlink to, the
730
- Adapter's License You apply. You may satisfy this condition
731
- in any reasonable manner based on the medium, means, and
732
- context in which You Share Adapted Material.
733
-
734
- 3. You may not offer or impose any additional or different terms
735
- or conditions on, or apply any Effective Technological
736
- Measures to, Adapted Material that restrict exercise of the
737
- rights granted under the Adapter's License You apply.
738
-
739
-
740
- Section 4 -- Sui Generis Database Rights.
741
-
742
- Where the Licensed Rights include Sui Generis Database Rights that
743
- apply to Your use of the Licensed Material:
744
-
745
- a. for the avoidance of doubt, Section 2(a)(1) grants You the right
746
- to extract, reuse, reproduce, and Share all or a substantial
747
- portion of the contents of the database;
748
-
749
- b. if You include all or a substantial portion of the database
750
- contents in a database in which You have Sui Generis Database
751
- Rights, then the database in which You have Sui Generis Database
752
- Rights (but not its individual contents) is Adapted Material,
753
-
754
- including for purposes of Section 3(b); and
755
- c. You must comply with the conditions in Section 3(a) if You Share
756
- all or a substantial portion of the contents of the database.
757
-
758
- For the avoidance of doubt, this Section 4 supplements and does not
759
- replace Your obligations under this Public License where the Licensed
760
- Rights include other Copyright and Similar Rights.
761
-
762
-
763
- Section 5 -- Disclaimer of Warranties and Limitation of Liability.
764
-
765
- a. UNLESS OTHERWISE SEPARATELY UNDERTAKEN BY THE LICENSOR, TO THE
766
- EXTENT POSSIBLE, THE LICENSOR OFFERS THE LICENSED MATERIAL AS-IS
767
- AND AS-AVAILABLE, AND MAKES NO REPRESENTATIONS OR WARRANTIES OF
768
- ANY KIND CONCERNING THE LICENSED MATERIAL, WHETHER EXPRESS,
769
- IMPLIED, STATUTORY, OR OTHER. THIS INCLUDES, WITHOUT LIMITATION,
770
- WARRANTIES OF TITLE, MERCHANTABILITY, FITNESS FOR A PARTICULAR
771
- PURPOSE, NON-INFRINGEMENT, ABSENCE OF LATENT OR OTHER DEFECTS,
772
- ACCURACY, OR THE PRESENCE OR ABSENCE OF ERRORS, WHETHER OR NOT
773
- KNOWN OR DISCOVERABLE. WHERE DISCLAIMERS OF WARRANTIES ARE NOT
774
- ALLOWED IN FULL OR IN PART, THIS DISCLAIMER MAY NOT APPLY TO YOU.
775
-
776
- b. TO THE EXTENT POSSIBLE, IN NO EVENT WILL THE LICENSOR BE LIABLE
777
- TO YOU ON ANY LEGAL THEORY (INCLUDING, WITHOUT LIMITATION,
778
- NEGLIGENCE) OR OTHERWISE FOR ANY DIRECT, SPECIAL, INDIRECT,
779
- INCIDENTAL, CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES,
780
- COSTS, EXPENSES, OR DAMAGES ARISING OUT OF THIS PUBLIC LICENSE OR
781
- USE OF THE LICENSED MATERIAL, EVEN IF THE LICENSOR HAS BEEN
782
- ADVISED OF THE POSSIBILITY OF SUCH LOSSES, COSTS, EXPENSES, OR
783
- DAMAGES. WHERE A LIMITATION OF LIABILITY IS NOT ALLOWED IN FULL OR
784
- IN PART, THIS LIMITATION MAY NOT APPLY TO YOU.
785
-
786
- c. The disclaimer of warranties and limitation of liability provided
787
- above shall be interpreted in a manner that, to the extent
788
- possible, most closely approximates an absolute disclaimer and
789
- waiver of all liability.
790
-
791
-
792
- Section 6 -- Term and Termination.
793
-
794
- a. This Public License applies for the term of the Copyright and
795
- Similar Rights licensed here. However, if You fail to comply with
796
- this Public License, then Your rights under this Public License
797
- terminate automatically.
798
-
799
- b. Where Your right to use the Licensed Material has terminated under
800
- Section 6(a), it reinstates:
801
-
802
- 1. automatically as of the date the violation is cured, provided
803
- it is cured within 30 days of Your discovery of the
804
- violation; or
805
-
806
- 2. upon express reinstatement by the Licensor.
807
-
808
- For the avoidance of doubt, this Section 6(b) does not affect any
809
- right the Licensor may have to seek remedies for Your violations
810
- of this Public License.
811
-
812
- c. For the avoidance of doubt, the Licensor may also offer the
813
- Licensed Material under separate terms or conditions or stop
814
- distributing the Licensed Material at any time; however, doing so
815
- will not terminate this Public License.
816
-
817
- d. Sections 1, 5, 6, 7, and 8 survive termination of this Public
818
- License.
819
-
820
-
821
- Section 7 -- Other Terms and Conditions.
822
-
823
- a. The Licensor shall not be bound by any additional or different
824
- terms or conditions communicated by You unless expressly agreed.
825
-
826
- b. Any arrangements, understandings, or agreements regarding the
827
- Licensed Material not stated herein are separate from and
828
- independent of the terms and conditions of this Public License.
829
-
830
-
831
- Section 8 -- Interpretation.
832
-
833
- a. For the avoidance of doubt, this Public License does not, and
834
- shall not be interpreted to, reduce, limit, restrict, or impose
835
- conditions on any use of the Licensed Material that could lawfully
836
- be made without permission under this Public License.
837
-
838
- b. To the extent possible, if any provision of this Public License is
839
- deemed unenforceable, it shall be automatically reformed to the
840
- minimum extent necessary to make it enforceable. If the provision
841
- cannot be reformed, it shall be severed from this Public License
842
- without affecting the enforceability of the remaining terms and
843
- conditions.
844
-
845
- c. No term or condition of this Public License will be waived and no
846
- failure to comply consented to unless expressly agreed to by the
847
- Licensor.
848
-
849
- d. Nothing in this Public License constitutes or may be interpreted
850
- as a limitation upon, or waiver of, any privileges and immunities
851
- that apply to the Licensor or You, including from the legal
852
- processes of any jurisdiction or authority.
853
-
854
-
855
- =======================================================================
856
-
857
- Creative Commons is not a party to its public
858
- licenses. Notwithstanding, Creative Commons may elect to apply one of
859
- its public licenses to material it publishes and in those instances
860
- will be considered the “Licensor.” The text of the Creative Commons
861
- public licenses is dedicated to the public domain under the CC0 Public
862
- Domain Dedication. Except for the limited purpose of indicating that
863
- material is shared under a Creative Commons public license or as
864
- otherwise permitted by the Creative Commons policies published at
865
- creativecommons.org/policies, Creative Commons does not authorize the
866
- use of the trademark "Creative Commons" or any other trademark or logo
867
- of Creative Commons without its prior written consent including,
868
- without limitation, in connection with any unauthorized modifications
869
- to any of its public licenses or any other arrangements,
870
- understandings, or agreements concerning use of licensed material. For
871
- the avoidance of doubt, this paragraph does not form part of the
872
- public licenses.
873
-
874
- Creative Commons may be contacted at creativecommons.org.
875
-
876
- ```
877
-
878
-
879
-
880
-
881
- # Macedonian Corpus
882
-
883
- * Author: Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska
884
- * URL: https://blog.netcetera.com/macedonian-spacy-f3c85484777f
885
- * License: CC BY-SA 4.0
886
-
887
- ```
888
- Attribution-ShareAlike 4.0 International
889
-
890
- =======================================================================
891
-
892
- Creative Commons Corporation ("Creative Commons") is not a law firm and
893
- does not provide legal services or legal advice. Distribution of
894
- Creative Commons public licenses does not create a lawyer-client or
895
- other relationship. Creative Commons makes its licenses and related
896
- information available on an "as-is" basis. Creative Commons gives no
897
- warranties regarding its licenses, any material licensed under their
898
- terms and conditions, or any related information. Creative Commons
899
- disclaims all liability for damages resulting from their use to the
900
- fullest extent possible.
901
-
902
- Using Creative Commons Public Licenses
903
-
904
- Creative Commons public licenses provide a standard set of terms and
905
- conditions that creators and other rights holders may use to share
906
- original works of authorship and other material subject to copyright
907
- and certain other rights specified in the public license below. The
908
- following considerations are for informational purposes only, are not
909
- exhaustive, and do not form part of our licenses.
910
-
911
- Considerations for licensors: Our public licenses are
912
- intended for use by those authorized to give the public
913
- permission to use material in ways otherwise restricted by
914
- copyright and certain other rights. Our licenses are
915
- irrevocable. Licensors should read and understand the terms
916
- and conditions of the license they choose before applying it.
917
- Licensors should also secure all rights necessary before
918
- applying our licenses so that the public can reuse the
919
- material as expected. Licensors should clearly mark any
920
- material not subject to the license. This includes other CC-
921
- licensed material, or material used under an exception or
922
- limitation to copyright. More considerations for licensors:
923
- wiki.creativecommons.org/Considerations_for_licensors
924
-
925
- Considerations for the public: By using one of our public
926
- licenses, a licensor grants the public permission to use the
927
- licensed material under specified terms and conditions. If
928
- the licensor's permission is not necessary for any reason--for
929
- example, because of any applicable exception or limitation to
930
- copyright--then that use is not regulated by the license. Our
931
- licenses grant only permissions under copyright and certain
932
- other rights that a licensor has authority to grant. Use of
933
- the licensed material may still be restricted for other
934
- reasons, including because others have copyright or other
935
- rights in the material. A licensor may make special requests,
936
- such as asking that all changes be marked or described.
937
- Although not required by our licenses, you are encouraged to
938
- respect those requests where reasonable. More considerations
939
- for the public:
940
- wiki.creativecommons.org/Considerations_for_licensees
941
-
942
- =======================================================================
943
-
944
- Creative Commons Attribution-ShareAlike 4.0 International Public
945
- License
946
-
947
- By exercising the Licensed Rights (defined below), You accept and agree
948
- to be bound by the terms and conditions of this Creative Commons
949
- Attribution-ShareAlike 4.0 International Public License ("Public
950
- License"). To the extent this Public License may be interpreted as a
951
- contract, You are granted the Licensed Rights in consideration of Your
952
- acceptance of these terms and conditions, and the Licensor grants You
953
- such rights in consideration of benefits the Licensor receives from
954
- making the Licensed Material available under these terms and
955
- conditions.
956
-
957
-
958
- Section 1 -- Definitions.
959
-
960
- a. Adapted Material means material subject to Copyright and Similar
961
- Rights that is derived from or based upon the Licensed Material
962
- and in which the Licensed Material is translated, altered,
963
- arranged, transformed, or otherwise modified in a manner requiring
964
- permission under the Copyright and Similar Rights held by the
965
- Licensor. For purposes of this Public License, where the Licensed
966
- Material is a musical work, performance, or sound recording,
967
- Adapted Material is always produced where the Licensed Material is
968
- synched in timed relation with a moving image.
969
-
970
- b. Adapter's License means the license You apply to Your Copyright
971
- and Similar Rights in Your contributions to Adapted Material in
972
- accordance with the terms and conditions of this Public License.
973
-
974
- c. BY-SA Compatible License means a license listed at
975
- creativecommons.org/compatiblelicenses, approved by Creative
976
- Commons as essentially the equivalent of this Public License.
977
-
978
- d. Copyright and Similar Rights means copyright and/or similar rights
979
- closely related to copyright including, without limitation,
980
- performance, broadcast, sound recording, and Sui Generis Database
981
- Rights, without regard to how the rights are labeled or
982
- categorized. For purposes of this Public License, the rights
983
- specified in Section 2(b)(1)-(2) are not Copyright and Similar
984
- Rights.
985
-
986
- e. Effective Technological Measures means those measures that, in the
987
- absence of proper authority, may not be circumvented under laws
988
- fulfilling obligations under Article 11 of the WIPO Copyright
989
- Treaty adopted on December 20, 1996, and/or similar international
990
- agreements.
991
-
992
- f. Exceptions and Limitations means fair use, fair dealing, and/or
993
- any other exception or limitation to Copyright and Similar Rights
994
- that applies to Your use of the Licensed Material.
995
-
996
- g. License Elements means the license attributes listed in the name
997
- of a Creative Commons Public License. The License Elements of this
998
- Public License are Attribution and ShareAlike.
999
-
1000
- h. Licensed Material means the artistic or literary work, database,
1001
- or other material to which the Licensor applied this Public
1002
- License.
1003
-
1004
- i. Licensed Rights means the rights granted to You subject to the
1005
- terms and conditions of this Public License, which are limited to
1006
- all Copyright and Similar Rights that apply to Your use of the
1007
- Licensed Material and that the Licensor has authority to license.
1008
-
1009
- j. Licensor means the individual(s) or entity(ies) granting rights
1010
- under this Public License.
1011
-
1012
- k. Share means to provide material to the public by any means or
1013
- process that requires permission under the Licensed Rights, such
1014
- as reproduction, public display, public performance, distribution,
1015
- dissemination, communication, or importation, and to make material
1016
- available to the public including in ways that members of the
1017
- public may access the material from a place and at a time
1018
- individually chosen by them.
1019
-
1020
- l. Sui Generis Database Rights means rights other than copyright
1021
- resulting from Directive 96/9/EC of the European Parliament and of
1022
- the Council of 11 March 1996 on the legal protection of databases,
1023
- as amended and/or succeeded, as well as other essentially
1024
- equivalent rights anywhere in the world.
1025
-
1026
- m. You means the individual or entity exercising the Licensed Rights
1027
- under this Public License. Your has a corresponding meaning.
1028
-
1029
-
1030
- Section 2 -- Scope.
1031
-
1032
- a. License grant.
1033
-
1034
- 1. Subject to the terms and conditions of this Public License,
1035
- the Licensor hereby grants You a worldwide, royalty-free,
1036
- non-sublicensable, non-exclusive, irrevocable license to
1037
- exercise the Licensed Rights in the Licensed Material to:
1038
-
1039
- a. reproduce and Share the Licensed Material, in whole or
1040
- in part; and
1041
-
1042
- b. produce, reproduce, and Share Adapted Material.
1043
-
1044
- 2. Exceptions and Limitations. For the avoidance of doubt, where
1045
- Exceptions and Limitations apply to Your use, this Public
1046
- License does not apply, and You do not need to comply with
1047
- its terms and conditions.
1048
-
1049
- 3. Term. The term of this Public License is specified in Section
1050
- 6(a).
1051
-
1052
- 4. Media and formats; technical modifications allowed. The
1053
- Licensor authorizes You to exercise the Licensed Rights in
1054
- all media and formats whether now known or hereafter created,
1055
- and to make technical modifications necessary to do so. The
1056
- Licensor waives and/or agrees not to assert any right or
1057
- authority to forbid You from making technical modifications
1058
- necessary to exercise the Licensed Rights, including
1059
- technical modifications necessary to circumvent Effective
1060
- Technological Measures. For purposes of this Public License,
1061
- simply making modifications authorized by this Section 2(a)
1062
- (4) never produces Adapted Material.
1063
-
1064
- 5. Downstream recipients.
1065
-
1066
- a. Offer from the Licensor -- Licensed Material. Every
1067
- recipient of the Licensed Material automatically
1068
- receives an offer from the Licensor to exercise the
1069
- Licensed Rights under the terms and conditions of this
1070
- Public License.
1071
-
1072
- b. Additional offer from the Licensor -- Adapted Material.
1073
- Every recipient of Adapted Material from You
1074
- automatically receives an offer from the Licensor to
1075
- exercise the Licensed Rights in the Adapted Material
1076
- under the conditions of the Adapter's License You apply.
1077
-
1078
- c. No downstream restrictions. You may not offer or impose
1079
- any additional or different terms or conditions on, or
1080
- apply any Effective Technological Measures to, the
1081
- Licensed Material if doing so restricts exercise of the
1082
- Licensed Rights by any recipient of the Licensed
1083
- Material.
1084
-
1085
- 6. No endorsement. Nothing in this Public License constitutes or
1086
- may be construed as permission to assert or imply that You
1087
- are, or that Your use of the Licensed Material is, connected
1088
- with, or sponsored, endorsed, or granted official status by,
1089
- the Licensor or others designated to receive attribution as
1090
- provided in Section 3(a)(1)(A)(i).
1091
-
1092
- b. Other rights.
1093
-
1094
- 1. Moral rights, such as the right of integrity, are not
1095
- licensed under this Public License, nor are publicity,
1096
- privacy, and/or other similar personality rights; however, to
1097
- the extent possible, the Licensor waives and/or agrees not to
1098
- assert any such rights held by the Licensor to the limited
1099
- extent necessary to allow You to exercise the Licensed
1100
- Rights, but not otherwise.
1101
-
1102
- 2. Patent and trademark rights are not licensed under this
1103
- Public License.
1104
-
1105
- 3. To the extent possible, the Licensor waives any right to
1106
- collect royalties from You for the exercise of the Licensed
1107
- Rights, whether directly or through a collecting society
1108
- under any voluntary or waivable statutory or compulsory
1109
- licensing scheme. In all other cases the Licensor expressly
1110
- reserves any right to collect such royalties.
1111
-
1112
-
1113
- Section 3 -- License Conditions.
1114
-
1115
- Your exercise of the Licensed Rights is expressly made subject to the
1116
- following conditions.
1117
-
1118
- a. Attribution.
1119
-
1120
- 1. If You Share the Licensed Material (including in modified
1121
- form), You must:
1122
-
1123
- a. retain the following if it is supplied by the Licensor
1124
- with the Licensed Material:
1125
-
1126
- i. identification of the creator(s) of the Licensed
1127
- Material and any others designated to receive
1128
- attribution, in any reasonable manner requested by
1129
- the Licensor (including by pseudonym if
1130
- designated);
1131
-
1132
- ii. a copyright notice;
1133
-
1134
- iii. a notice that refers to this Public License;
1135
-
1136
- iv. a notice that refers to the disclaimer of
1137
- warranties;
1138
-
1139
- v. a URI or hyperlink to the Licensed Material to the
1140
- extent reasonably practicable;
1141
-
1142
- b. indicate if You modified the Licensed Material and
1143
- retain an indication of any previous modifications; and
1144
-
1145
- c. indicate the Licensed Material is licensed under this
1146
- Public License, and include the text of, or the URI or
1147
- hyperlink to, this Public License.
1148
-
1149
- 2. You may satisfy the conditions in Section 3(a)(1) in any
1150
- reasonable manner based on the medium, means, and context in
1151
- which You Share the Licensed Material. For example, it may be
1152
- reasonable to satisfy the conditions by providing a URI or
1153
- hyperlink to a resource that includes the required
1154
- information.
1155
-
1156
- 3. If requested by the Licensor, You must remove any of the
1157
- information required by Section 3(a)(1)(A) to the extent
1158
- reasonably practicable.
1159
-
1160
- b. ShareAlike.
1161
-
1162
- In addition to the conditions in Section 3(a), if You Share
1163
- Adapted Material You produce, the following conditions also apply.
1164
-
1165
- 1. The Adapter's License You apply must be a Creative Commons
1166
- license with the same License Elements, this version or
1167
- later, or a BY-SA Compatible License.
1168
-
1169
- 2. You must include the text of, or the URI or hyperlink to, the
1170
- Adapter's License You apply. You may satisfy this condition
1171
- in any reasonable manner based on the medium, means, and
1172
- context in which You Share Adapted Material.
1173
-
1174
- 3. You may not offer or impose any additional or different terms
1175
- or conditions on, or apply any Effective Technological
1176
- Measures to, Adapted Material that restrict exercise of the
1177
- rights granted under the Adapter's License You apply.
1178
-
1179
-
1180
- Section 4 -- Sui Generis Database Rights.
1181
-
1182
- Where the Licensed Rights include Sui Generis Database Rights that
1183
- apply to Your use of the Licensed Material:
1184
-
1185
- a. for the avoidance of doubt, Section 2(a)(1) grants You the right
1186
- to extract, reuse, reproduce, and Share all or a substantial
1187
- portion of the contents of the database;
1188
-
1189
- b. if You include all or a substantial portion of the database
1190
- contents in a database in which You have Sui Generis Database
1191
- Rights, then the database in which You have Sui Generis Database
1192
- Rights (but not its individual contents) is Adapted Material,
1193
-
1194
- including for purposes of Section 3(b); and
1195
- c. You must comply with the conditions in Section 3(a) if You Share
1196
- all or a substantial portion of the contents of the database.
1197
-
1198
- For the avoidance of doubt, this Section 4 supplements and does not
1199
- replace Your obligations under this Public License where the Licensed
1200
- Rights include other Copyright and Similar Rights.
1201
-
1202
-
1203
- Section 5 -- Disclaimer of Warranties and Limitation of Liability.
1204
-
1205
- a. UNLESS OTHERWISE SEPARATELY UNDERTAKEN BY THE LICENSOR, TO THE
1206
- EXTENT POSSIBLE, THE LICENSOR OFFERS THE LICENSED MATERIAL AS-IS
1207
- AND AS-AVAILABLE, AND MAKES NO REPRESENTATIONS OR WARRANTIES OF
1208
- ANY KIND CONCERNING THE LICENSED MATERIAL, WHETHER EXPRESS,
1209
- IMPLIED, STATUTORY, OR OTHER. THIS INCLUDES, WITHOUT LIMITATION,
1210
- WARRANTIES OF TITLE, MERCHANTABILITY, FITNESS FOR A PARTICULAR
1211
- PURPOSE, NON-INFRINGEMENT, ABSENCE OF LATENT OR OTHER DEFECTS,
1212
- ACCURACY, OR THE PRESENCE OR ABSENCE OF ERRORS, WHETHER OR NOT
1213
- KNOWN OR DISCOVERABLE. WHERE DISCLAIMERS OF WARRANTIES ARE NOT
1214
- ALLOWED IN FULL OR IN PART, THIS DISCLAIMER MAY NOT APPLY TO YOU.
1215
-
1216
- b. TO THE EXTENT POSSIBLE, IN NO EVENT WILL THE LICENSOR BE LIABLE
1217
- TO YOU ON ANY LEGAL THEORY (INCLUDING, WITHOUT LIMITATION,
1218
- NEGLIGENCE) OR OTHERWISE FOR ANY DIRECT, SPECIAL, INDIRECT,
1219
- INCIDENTAL, CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES,
1220
- COSTS, EXPENSES, OR DAMAGES ARISING OUT OF THIS PUBLIC LICENSE OR
1221
- USE OF THE LICENSED MATERIAL, EVEN IF THE LICENSOR HAS BEEN
1222
- ADVISED OF THE POSSIBILITY OF SUCH LOSSES, COSTS, EXPENSES, OR
1223
- DAMAGES. WHERE A LIMITATION OF LIABILITY IS NOT ALLOWED IN FULL OR
1224
- IN PART, THIS LIMITATION MAY NOT APPLY TO YOU.
1225
-
1226
- c. The disclaimer of warranties and limitation of liability provided
1227
- above shall be interpreted in a manner that, to the extent
1228
- possible, most closely approximates an absolute disclaimer and
1229
- waiver of all liability.
1230
-
1231
-
1232
- Section 6 -- Term and Termination.
1233
-
1234
- a. This Public License applies for the term of the Copyright and
1235
- Similar Rights licensed here. However, if You fail to comply with
1236
- this Public License, then Your rights under this Public License
1237
- terminate automatically.
1238
-
1239
- b. Where Your right to use the Licensed Material has terminated under
1240
- Section 6(a), it reinstates:
1241
-
1242
- 1. automatically as of the date the violation is cured, provided
1243
- it is cured within 30 days of Your discovery of the
1244
- violation; or
1245
-
1246
- 2. upon express reinstatement by the Licensor.
1247
-
1248
- For the avoidance of doubt, this Section 6(b) does not affect any
1249
- right the Licensor may have to seek remedies for Your violations
1250
- of this Public License.
1251
-
1252
- c. For the avoidance of doubt, the Licensor may also offer the
1253
- Licensed Material under separate terms or conditions or stop
1254
- distributing the Licensed Material at any time; however, doing so
1255
- will not terminate this Public License.
1256
-
1257
- d. Sections 1, 5, 6, 7, and 8 survive termination of this Public
1258
- License.
1259
-
1260
-
1261
- Section 7 -- Other Terms and Conditions.
1262
-
1263
- a. The Licensor shall not be bound by any additional or different
1264
- terms or conditions communicated by You unless expressly agreed.
1265
-
1266
- b. Any arrangements, understandings, or agreements regarding the
1267
- Licensed Material not stated herein are separate from and
1268
- independent of the terms and conditions of this Public License.
1269
-
1270
-
1271
- Section 8 -- Interpretation.
1272
-
1273
- a. For the avoidance of doubt, this Public License does not, and
1274
- shall not be interpreted to, reduce, limit, restrict, or impose
1275
- conditions on any use of the Licensed Material that could lawfully
1276
- be made without permission under this Public License.
1277
-
1278
- b. To the extent possible, if any provision of this Public License is
1279
- deemed unenforceable, it shall be automatically reformed to the
1280
- minimum extent necessary to make it enforceable. If the provision
1281
- cannot be reformed, it shall be severed from this Public License
1282
- without affecting the enforceability of the remaining terms and
1283
- conditions.
1284
-
1285
- c. No term or condition of this Public License will be waived and no
1286
- failure to comply consented to unless expressly agreed to by the
1287
- Licensor.
1288
-
1289
- d. Nothing in this Public License constitutes or may be interpreted
1290
- as a limitation upon, or waiver of, any privileges and immunities
1291
- that apply to the Licensor or You, including from the legal
1292
- processes of any jurisdiction or authority.
1293
-
1294
-
1295
- =======================================================================
1296
-
1297
- Creative Commons is not a party to its public
1298
- licenses. Notwithstanding, Creative Commons may elect to apply one of
1299
- its public licenses to material it publishes and in those instances
1300
- will be considered the “Licensor.” The text of the Creative Commons
1301
- public licenses is dedicated to the public domain under the CC0 Public
1302
- Domain Dedication. Except for the limited purpose of indicating that
1303
- material is shared under a Creative Commons public license or as
1304
- otherwise permitted by the Creative Commons policies published at
1305
- creativecommons.org/policies, Creative Commons does not authorize the
1306
- use of the trademark "Creative Commons" or any other trademark or logo
1307
- of Creative Commons without its prior written consent including,
1308
- without limitation, in connection with any unauthorized modifications
1309
- to any of its public licenses or any other arrangements,
1310
- understandings, or agreements concerning use of licensed material. For
1311
- the avoidance of doubt, this paragraph does not form part of the
1312
- public licenses.
1313
-
1314
- Creative Commons may be contacted at creativecommons.org.
1315
-
1316
- ```
1317
-
1318
-
1319
-
1320
-
1321
  # spaCy lookups data
1322
 
1323
  * Author: Explosion
438
 
439
 
440
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
441
  # spaCy lookups data
442
 
443
  * Author: Explosion
README.md CHANGED
@@ -14,41 +14,41 @@ model-index:
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
- value: 0.7543402778
18
  - name: NER Recall
19
  type: recall
20
- value: 0.7395744681
21
  - name: NER F Score
22
  type: f_score
23
- value: 0.7468844005
24
  - task:
25
  name: POS
26
  type: token-classification
27
  metrics:
28
  - name: POS (UPOS) Accuracy
29
  type: accuracy
30
- value: 0.9309414621
31
  - task:
32
  name: UNLABELED_DEPENDENCIES
33
  type: token-classification
34
  metrics:
35
  - name: Unlabeled Attachment Score (UAS)
36
  type: f_score
37
- value: 0.6916256158
38
  - task:
39
  name: LABELED_DEPENDENCIES
40
  type: token-classification
41
  metrics:
42
  - name: Labeled Attachment Score (LAS)
43
  type: f_score
44
- value: 0.5300492611
45
  - task:
46
  name: SENTS
47
  type: token-classification
48
  metrics:
49
  - name: Sentences F-Score
50
  type: f_score
51
- value: 0.7123287671
52
  ---
53
  ### Details: https://spacy.io/models/mk#mk_core_news_md
54
 
@@ -57,12 +57,12 @@ Macedonian pipeline optimized for CPU. Components: tok2vec, morphologizer, parse
57
  | Feature | Description |
58
  | --- | --- |
59
  | **Name** | `mk_core_news_md` |
60
- | **Version** | `3.3.0` |
61
- | **spaCy** | `>=3.3.0.dev0,<3.4.0` |
62
  | **Default Pipeline** | `morphologizer`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
63
  | **Components** | `morphologizer`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
64
  | **Vectors** | 274587 keys, 20000 unique vectors (300 dimensions) |
65
- | **Sources** | [Macedonian Corpus](https://blog.netcetera.com/macedonian-spacy-f3c85484777f) (Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska)<br />[Macedonian Corpus](https://blog.netcetera.com/macedonian-spacy-f3c85484777f) (Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska)<br />[Macedonian Corpus](https://blog.netcetera.com/macedonian-spacy-f3c85484777f) (Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska)<br />[spaCy lookups data](https://github.com/explosion/spacy-lookups-data) (Explosion)<br />[Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)](https://spacy.io) (Explosion) |
66
  | **License** | `CC BY-SA 4.0` |
67
  | **Author** | [Explosion](https://explosion.ai) |
68
 
@@ -70,11 +70,11 @@ Macedonian pipeline optimized for CPU. Components: tok2vec, morphologizer, parse
70
 
71
  <details>
72
 
73
- <summary>View label scheme (53 labels for 3 components)</summary>
74
 
75
  | Component | Labels |
76
  | --- | --- |
77
- | **`morphologizer`** | `POS=PROPN`, `POS=AUX`, `POS=ADJ`, `POS=NOUN`, `POS=ADP`, `POS=PUNCT`, `POS=CONJ`, `POS=NUM`, `POS=VERB`, `POS=PRON`, `POS=ADV`, `POS=SCONJ`, `POS=PART`, `POS=SYM`, `POS=X`, `_`, `POS=INTJ` |
78
  | **`parser`** | `ROOT`, `advmod`, `att`, `aux`, `cc`, `dep`, `det`, `dobj`, `iobj`, `neg`, `nsubj`, `pobj`, `poss`, `pozm`, `pozv`, `prep`, `punct`, `relcl` |
79
  | **`ner`** | `CARDINAL`, `DATE`, `EVENT`, `FAC`, `GPE`, `LANGUAGE`, `LAW`, `LOC`, `MONEY`, `NORP`, `ORDINAL`, `ORG`, `PERCENT`, `PERSON`, `PRODUCT`, `QUANTITY`, `TIME`, `WORK_OF_ART` |
80
 
@@ -88,12 +88,12 @@ Macedonian pipeline optimized for CPU. Components: tok2vec, morphologizer, parse
88
  | `TOKEN_P` | 100.00 |
89
  | `TOKEN_R` | 100.00 |
90
  | `TOKEN_F` | 100.00 |
91
- | `SENTS_P` | 75.36 |
92
- | `SENTS_R` | 67.53 |
93
- | `SENTS_F` | 71.23 |
94
- | `DEP_UAS` | 69.16 |
95
- | `DEP_LAS` | 53.00 |
96
- | `ENTS_P` | 75.43 |
97
- | `ENTS_R` | 73.96 |
98
- | `ENTS_F` | 74.69 |
99
- | `POS_ACC` | 93.09 |
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
+ value: 0.7373737374
18
  - name: NER Recall
19
  type: recall
20
+ value: 0.7455319149
21
  - name: NER F Score
22
  type: f_score
23
+ value: 0.7414303851
24
  - task:
25
  name: POS
26
  type: token-classification
27
  metrics:
28
  - name: POS (UPOS) Accuracy
29
  type: accuracy
30
+ value: 0.9314809819
31
  - task:
32
  name: UNLABELED_DEPENDENCIES
33
  type: token-classification
34
  metrics:
35
  - name: Unlabeled Attachment Score (UAS)
36
  type: f_score
37
+ value: 0.6836434868
38
  - task:
39
  name: LABELED_DEPENDENCIES
40
  type: token-classification
41
  metrics:
42
  - name: Labeled Attachment Score (LAS)
43
  type: f_score
44
+ value: 0.5190989226
45
  - task:
46
  name: SENTS
47
  type: token-classification
48
  metrics:
49
  - name: Sentences F-Score
50
  type: f_score
51
+ value: 0.6578947368
52
  ---
53
  ### Details: https://spacy.io/models/mk#mk_core_news_md
54
 
57
  | Feature | Description |
58
  | --- | --- |
59
  | **Name** | `mk_core_news_md` |
60
+ | **Version** | `3.4.0` |
61
+ | **spaCy** | `>=3.4.0,<3.5.0` |
62
  | **Default Pipeline** | `morphologizer`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
63
  | **Components** | `morphologizer`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
64
  | **Vectors** | 274587 keys, 20000 unique vectors (300 dimensions) |
65
+ | **Sources** | [Macedonian Corpus](https://blog.netcetera.com/macedonian-spacy-f3c85484777f) (Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska)<br />[spaCy lookups data](https://github.com/explosion/spacy-lookups-data) (Explosion)<br />[Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)](https://spacy.io) (Explosion) |
66
  | **License** | `CC BY-SA 4.0` |
67
  | **Author** | [Explosion](https://explosion.ai) |
68
 
70
 
71
  <details>
72
 
73
+ <summary>View label scheme (54 labels for 3 components)</summary>
74
 
75
  | Component | Labels |
76
  | --- | --- |
77
+ | **`morphologizer`** | `POS=PROPN`, `POS=AUX`, `POS=ADJ`, `POS=NOUN`, `POS=ADP`, `POS=PUNCT`, `POS=CONJ`, `POS=NUM`, `POS=VERB`, `POS=PRON`, `POS=ADV`, `POS=SCONJ`, `POS=PART`, `POS=SYM`, `_`, `POS=SPACE`, `POS=X`, `POS=INTJ` |
78
  | **`parser`** | `ROOT`, `advmod`, `att`, `aux`, `cc`, `dep`, `det`, `dobj`, `iobj`, `neg`, `nsubj`, `pobj`, `poss`, `pozm`, `pozv`, `prep`, `punct`, `relcl` |
79
  | **`ner`** | `CARDINAL`, `DATE`, `EVENT`, `FAC`, `GPE`, `LANGUAGE`, `LAW`, `LOC`, `MONEY`, `NORP`, `ORDINAL`, `ORG`, `PERCENT`, `PERSON`, `PRODUCT`, `QUANTITY`, `TIME`, `WORK_OF_ART` |
80
 
88
  | `TOKEN_P` | 100.00 |
89
  | `TOKEN_R` | 100.00 |
90
  | `TOKEN_F` | 100.00 |
91
+ | `SENTS_P` | 66.67 |
92
+ | `SENTS_R` | 64.94 |
93
+ | `SENTS_F` | 65.79 |
94
+ | `DEP_UAS` | 68.36 |
95
+ | `DEP_LAS` | 51.91 |
96
+ | `ENTS_P` | 73.74 |
97
+ | `ENTS_R` | 74.55 |
98
+ | `ENTS_F` | 74.14 |
99
+ | `POS_ACC` | 93.15 |
accuracy.json CHANGED
@@ -3,36 +3,36 @@
3
  "token_p": 1.0,
4
  "token_r": 1.0,
5
  "token_f": 1.0,
6
- "sents_p": 0.7536231884,
7
- "sents_r": 0.6753246753,
8
- "sents_f": 0.7123287671,
9
- "dep_uas": 0.6916256158,
10
- "dep_las": 0.5300492611,
11
  "dep_las_per_type": {
12
  "nsubj": {
13
- "p": 0.6046511628,
14
- "r": 0.6842105263,
15
- "f": 0.6419753086
16
  },
17
  "root": {
18
- "p": 0.7971014493,
19
- "r": 0.7857142857,
20
- "f": 0.7913669065
21
  },
22
  "cc": {
23
- "p": 0.875,
24
- "r": 0.5,
25
- "f": 0.6363636364
26
  },
27
  "relcl": {
28
- "p": 0.4444444444,
29
- "r": 0.4615384615,
30
- "f": 0.4528301887
31
  },
32
  "pozm": {
33
- "p": 1.0,
34
- "r": 0.3636363636,
35
- "f": 0.5333333333
36
  },
37
  "poss": {
38
  "p": 0.0,
@@ -40,14 +40,14 @@
40
  "f": 0.0
41
  },
42
  "aux": {
43
- "p": 0.6,
44
- "r": 0.6363636364,
45
- "f": 0.6176470588
46
  },
47
  "prep": {
48
- "p": 0.7142857143,
49
- "r": 0.75,
50
- "f": 0.7317073171
51
  },
52
  "iobj": {
53
  "p": 0.0,
@@ -55,9 +55,9 @@
55
  "f": 0.0
56
  },
57
  "pozv": {
58
- "p": 0.1428571429,
59
- "r": 0.0666666667,
60
- "f": 0.0909090909
61
  },
62
  "quantmod": {
63
  "p": 0.0,
@@ -65,9 +65,9 @@
65
  "f": 0.0
66
  },
67
  "att": {
68
- "p": 0.7872340426,
69
  "r": 0.7115384615,
70
- "f": 0.7474747475
71
  },
72
  "det": {
73
  "p": 0.0,
@@ -80,19 +80,19 @@
80
  "f": 0.0
81
  },
82
  "dep": {
83
- "p": 0.0153846154,
84
  "r": 0.3333333333,
85
- "f": 0.0294117647
86
  },
87
  "dobj": {
88
- "p": 0.4576271186,
89
- "r": 0.45,
90
- "f": 0.4537815126
91
  },
92
  "ppdo": {
93
- "p": 0.5714285714,
94
- "r": 0.2666666667,
95
- "f": 0.3636363636
96
  },
97
  "neg": {
98
  "p": 0.5555555556,
@@ -100,9 +100,9 @@
100
  "f": 0.5
101
  },
102
  "pobj": {
103
- "p": 0.4210526316,
104
  "r": 0.5,
105
- "f": 0.4571428571
106
  },
107
  "mwe": {
108
  "p": 0.0,
@@ -114,16 +114,16 @@
114
  "r": 0.0,
115
  "f": 0.0
116
  },
117
- "advmod": {
118
- "p": 0.3333333333,
119
- "r": 0.5,
120
- "f": 0.4
121
- },
122
  "appos": {
123
  "p": 0.0,
124
  "r": 0.0,
125
  "f": 0.0
126
  },
 
 
 
 
 
127
  "advcl": {
128
  "p": 0.0,
129
  "r": 0.0,
@@ -160,101 +160,101 @@
160
  "f": 0.0
161
  }
162
  },
163
- "speed": 2181.9209832256,
164
- "ents_p": 0.7543402778,
165
- "ents_r": 0.7395744681,
166
- "ents_f": 0.7468844005,
167
  "ents_per_type": {
168
- "GPE": {
169
- "p": 0.8643678161,
170
- "r": 0.8867924528,
171
- "f": 0.8754365541
172
- },
173
  "LOC": {
174
- "p": 0.7246376812,
175
  "r": 0.5747126437,
176
- "f": 0.641025641
177
  },
178
- "QUANTITY": {
179
- "p": 0.7435897436,
180
- "r": 0.7073170732,
181
- "f": 0.725
182
  },
183
- "DATE": {
184
- "p": 0.7142857143,
185
- "r": 0.7042253521,
186
- "f": 0.7092198582
187
  },
188
  "CARDINAL": {
189
- "p": 0.6930693069,
190
- "r": 0.7446808511,
191
- "f": 0.7179487179
192
  },
193
- "NORP": {
194
- "p": 0.5192307692,
195
- "r": 0.4153846154,
196
- "f": 0.4615384615
197
  },
198
- "PERSON": {
199
- "p": 0.7756410256,
200
- "r": 0.8066666667,
201
- "f": 0.7908496732
202
  },
203
- "ORG": {
204
- "p": 0.5666666667,
205
- "r": 0.693877551,
206
- "f": 0.623853211
207
  },
208
  "MONEY": {
209
- "p": 1.0,
210
  "r": 0.5,
211
- "f": 0.6666666667
212
  },
213
  "ORDINAL": {
214
- "p": 0.5384615385,
215
- "r": 0.6363636364,
216
- "f": 0.5833333333
217
  },
218
  "PERCENT": {
219
- "p": 0.9411764706,
220
  "r": 1.0,
221
- "f": 0.9696969697
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
222
  },
223
  "WORK_OF_ART": {
224
- "p": 0.7,
225
- "r": 0.512195122,
226
- "f": 0.5915492958
227
  },
228
  "LANGUAGE": {
229
  "p": 0.0,
230
  "r": 0.0,
231
  "f": 0.0
232
  },
233
- "FAC": {
234
- "p": 0.0833333333,
235
- "r": 0.05,
236
- "f": 0.0625
237
- },
238
  "TIME": {
239
- "p": 1.0,
240
  "r": 0.8333333333,
241
- "f": 0.9090909091
242
- },
243
- "EVENT": {
244
- "p": 0.6111111111,
245
- "r": 0.6470588235,
246
- "f": 0.6285714286
247
  },
248
  "LAW": {
249
  "p": 0.0,
250
  "r": 0.0,
251
  "f": 0.0
252
- },
253
- "PRODUCT": {
254
- "p": 0.0,
255
- "r": 0.0,
256
- "f": 0.0
257
  }
258
  },
259
- "pos_acc": 0.9309414621
260
  }
3
  "token_p": 1.0,
4
  "token_r": 1.0,
5
  "token_f": 1.0,
6
+ "sents_p": 0.6666666667,
7
+ "sents_r": 0.6493506494,
8
+ "sents_f": 0.6578947368,
9
+ "dep_uas": 0.6836434868,
10
+ "dep_las": 0.5190989226,
11
  "dep_las_per_type": {
12
  "nsubj": {
13
+ "p": 0.6857142857,
14
+ "r": 0.6315789474,
15
+ "f": 0.6575342466
16
  },
17
  "root": {
18
+ "p": 0.76,
19
+ "r": 0.8142857143,
20
+ "f": 0.7862068966
21
  },
22
  "cc": {
23
+ "p": 0.8888888889,
24
+ "r": 0.5714285714,
25
+ "f": 0.6956521739
26
  },
27
  "relcl": {
28
+ "p": 0.4210526316,
29
+ "r": 0.3076923077,
30
+ "f": 0.3555555556
31
  },
32
  "pozm": {
33
+ "p": 0.8333333333,
34
+ "r": 0.4545454545,
35
+ "f": 0.5882352941
36
  },
37
  "poss": {
38
  "p": 0.0,
40
  "f": 0.0
41
  },
42
  "aux": {
43
+ "p": 0.5882352941,
44
+ "r": 0.6060606061,
45
+ "f": 0.5970149254
46
  },
47
  "prep": {
48
+ "p": 0.6825396825,
49
+ "r": 0.7166666667,
50
+ "f": 0.6991869919
51
  },
52
  "iobj": {
53
  "p": 0.0,
55
  "f": 0.0
56
  },
57
  "pozv": {
58
+ "p": 0.1818181818,
59
+ "r": 0.1333333333,
60
+ "f": 0.1538461538
61
  },
62
  "quantmod": {
63
  "p": 0.0,
65
  "f": 0.0
66
  },
67
  "att": {
68
+ "p": 0.7254901961,
69
  "r": 0.7115384615,
70
+ "f": 0.7184466019
71
  },
72
  "det": {
73
  "p": 0.0,
80
  "f": 0.0
81
  },
82
  "dep": {
83
+ "p": 0.0138888889,
84
  "r": 0.3333333333,
85
+ "f": 0.0266666667
86
  },
87
  "dobj": {
88
+ "p": 0.4126984127,
89
+ "r": 0.4333333333,
90
+ "f": 0.4227642276
91
  },
92
  "ppdo": {
93
+ "p": 0.7142857143,
94
+ "r": 0.3333333333,
95
+ "f": 0.4545454545
96
  },
97
  "neg": {
98
  "p": 0.5555555556,
100
  "f": 0.5
101
  },
102
  "pobj": {
103
+ "p": 0.5,
104
  "r": 0.5,
105
+ "f": 0.5
106
  },
107
  "mwe": {
108
  "p": 0.0,
114
  "r": 0.0,
115
  "f": 0.0
116
  },
 
 
 
 
 
117
  "appos": {
118
  "p": 0.0,
119
  "r": 0.0,
120
  "f": 0.0
121
  },
122
+ "advmod": {
123
+ "p": 0.0,
124
+ "r": 0.0,
125
+ "f": 0.0
126
+ },
127
  "advcl": {
128
  "p": 0.0,
129
  "r": 0.0,
160
  "f": 0.0
161
  }
162
  },
163
+ "speed": 2245.2550240296,
164
+ "ents_p": 0.7373737374,
165
+ "ents_r": 0.7455319149,
166
+ "ents_f": 0.7414303851,
167
  "ents_per_type": {
 
 
 
 
 
168
  "LOC": {
169
+ "p": 0.6666666667,
170
  "r": 0.5747126437,
171
+ "f": 0.6172839506
172
  },
173
+ "GPE": {
174
+ "p": 0.8646788991,
175
+ "r": 0.8891509434,
176
+ "f": 0.876744186
177
  },
178
+ "QUANTITY": {
179
+ "p": 0.6511627907,
180
+ "r": 0.6829268293,
181
+ "f": 0.6666666667
182
  },
183
  "CARDINAL": {
184
+ "p": 0.7052631579,
185
+ "r": 0.7127659574,
186
+ "f": 0.708994709
187
  },
188
+ "DATE": {
189
+ "p": 0.7465753425,
190
+ "r": 0.7676056338,
191
+ "f": 0.7569444444
192
  },
193
+ "PRODUCT": {
194
+ "p": 0.0,
195
+ "r": 0.0,
196
+ "f": 0.0
197
  },
198
+ "NORP": {
199
+ "p": 0.4202898551,
200
+ "r": 0.4461538462,
201
+ "f": 0.4328358209
202
  },
203
  "MONEY": {
204
+ "p": 0.5,
205
  "r": 0.5,
206
+ "f": 0.5
207
  },
208
  "ORDINAL": {
209
+ "p": 0.5,
210
+ "r": 0.7272727273,
211
+ "f": 0.5925925926
212
  },
213
  "PERCENT": {
214
+ "p": 1.0,
215
  "r": 1.0,
216
+ "f": 1.0
217
+ },
218
+ "PERSON": {
219
+ "p": 0.7843137255,
220
+ "r": 0.8,
221
+ "f": 0.7920792079
222
+ },
223
+ "FAC": {
224
+ "p": 0.3,
225
+ "r": 0.15,
226
+ "f": 0.2
227
+ },
228
+ "ORG": {
229
+ "p": 0.5454545455,
230
+ "r": 0.7346938776,
231
+ "f": 0.6260869565
232
+ },
233
+ "EVENT": {
234
+ "p": 0.4705882353,
235
+ "r": 0.4705882353,
236
+ "f": 0.4705882353
237
  },
238
  "WORK_OF_ART": {
239
+ "p": 0.5757575758,
240
+ "r": 0.4634146341,
241
+ "f": 0.5135135135
242
  },
243
  "LANGUAGE": {
244
  "p": 0.0,
245
  "r": 0.0,
246
  "f": 0.0
247
  },
 
 
 
 
 
248
  "TIME": {
249
+ "p": 0.8333333333,
250
  "r": 0.8333333333,
251
+ "f": 0.8333333333
 
 
 
 
 
252
  },
253
  "LAW": {
254
  "p": 0.0,
255
  "r": 0.0,
256
  "f": 0.0
 
 
 
 
 
257
  }
258
  },
259
+ "pos_acc": 0.9314809819
260
  }
meta.json CHANGED
@@ -1,14 +1,14 @@
1
  {
2
  "lang":"mk",
3
  "name":"core_news_md",
4
- "version":"3.3.0",
5
  "description":"Macedonian pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"CC BY-SA 4.0",
10
- "spacy_version":">=3.3.0.dev0,<3.4.0",
11
- "spacy_git_version":"849bef2de",
12
  "vectors":{
13
  "width":300,
14
  "vectors":20000,
@@ -31,8 +31,9 @@
31
  "POS=SCONJ",
32
  "POS=PART",
33
  "POS=SYM",
34
- "POS=X",
35
  "_",
 
 
36
  "POS=INTJ"
37
  ],
38
  "parser":[
@@ -105,36 +106,36 @@
105
  "token_p":1.0,
106
  "token_r":1.0,
107
  "token_f":1.0,
108
- "sents_p":0.7536231884,
109
- "sents_r":0.6753246753,
110
- "sents_f":0.7123287671,
111
- "dep_uas":0.6916256158,
112
- "dep_las":0.5300492611,
113
  "dep_las_per_type":{
114
  "nsubj":{
115
- "p":0.6046511628,
116
- "r":0.6842105263,
117
- "f":0.6419753086
118
  },
119
  "root":{
120
- "p":0.7971014493,
121
- "r":0.7857142857,
122
- "f":0.7913669065
123
  },
124
  "cc":{
125
- "p":0.875,
126
- "r":0.5,
127
- "f":0.6363636364
128
  },
129
  "relcl":{
130
- "p":0.4444444444,
131
- "r":0.4615384615,
132
- "f":0.4528301887
133
  },
134
  "pozm":{
135
- "p":1.0,
136
- "r":0.3636363636,
137
- "f":0.5333333333
138
  },
139
  "poss":{
140
  "p":0.0,
@@ -142,14 +143,14 @@
142
  "f":0.0
143
  },
144
  "aux":{
145
- "p":0.6,
146
- "r":0.6363636364,
147
- "f":0.6176470588
148
  },
149
  "prep":{
150
- "p":0.7142857143,
151
- "r":0.75,
152
- "f":0.7317073171
153
  },
154
  "iobj":{
155
  "p":0.0,
@@ -157,9 +158,9 @@
157
  "f":0.0
158
  },
159
  "pozv":{
160
- "p":0.1428571429,
161
- "r":0.0666666667,
162
- "f":0.0909090909
163
  },
164
  "quantmod":{
165
  "p":0.0,
@@ -167,9 +168,9 @@
167
  "f":0.0
168
  },
169
  "att":{
170
- "p":0.7872340426,
171
  "r":0.7115384615,
172
- "f":0.7474747475
173
  },
174
  "det":{
175
  "p":0.0,
@@ -182,19 +183,19 @@
182
  "f":0.0
183
  },
184
  "dep":{
185
- "p":0.0153846154,
186
  "r":0.3333333333,
187
- "f":0.0294117647
188
  },
189
  "dobj":{
190
- "p":0.4576271186,
191
- "r":0.45,
192
- "f":0.4537815126
193
  },
194
  "ppdo":{
195
- "p":0.5714285714,
196
- "r":0.2666666667,
197
- "f":0.3636363636
198
  },
199
  "neg":{
200
  "p":0.5555555556,
@@ -202,9 +203,9 @@
202
  "f":0.5
203
  },
204
  "pobj":{
205
- "p":0.4210526316,
206
  "r":0.5,
207
- "f":0.4571428571
208
  },
209
  "mwe":{
210
  "p":0.0,
@@ -216,16 +217,16 @@
216
  "r":0.0,
217
  "f":0.0
218
  },
219
- "advmod":{
220
- "p":0.3333333333,
221
- "r":0.5,
222
- "f":0.4
223
- },
224
  "appos":{
225
  "p":0.0,
226
  "r":0.0,
227
  "f":0.0
228
  },
 
 
 
 
 
229
  "advcl":{
230
  "p":0.0,
231
  "r":0.0,
@@ -262,117 +263,105 @@
262
  "f":0.0
263
  }
264
  },
265
- "speed":2181.9209832256,
266
- "ents_p":0.7543402778,
267
- "ents_r":0.7395744681,
268
- "ents_f":0.7468844005,
269
  "ents_per_type":{
270
- "GPE":{
271
- "p":0.8643678161,
272
- "r":0.8867924528,
273
- "f":0.8754365541
274
- },
275
  "LOC":{
276
- "p":0.7246376812,
277
  "r":0.5747126437,
278
- "f":0.641025641
279
  },
280
- "QUANTITY":{
281
- "p":0.7435897436,
282
- "r":0.7073170732,
283
- "f":0.725
284
  },
285
- "DATE":{
286
- "p":0.7142857143,
287
- "r":0.7042253521,
288
- "f":0.7092198582
289
  },
290
  "CARDINAL":{
291
- "p":0.6930693069,
292
- "r":0.7446808511,
293
- "f":0.7179487179
294
  },
295
- "NORP":{
296
- "p":0.5192307692,
297
- "r":0.4153846154,
298
- "f":0.4615384615
299
  },
300
- "PERSON":{
301
- "p":0.7756410256,
302
- "r":0.8066666667,
303
- "f":0.7908496732
304
  },
305
- "ORG":{
306
- "p":0.5666666667,
307
- "r":0.693877551,
308
- "f":0.623853211
309
  },
310
  "MONEY":{
311
- "p":1.0,
312
  "r":0.5,
313
- "f":0.6666666667
314
  },
315
  "ORDINAL":{
316
- "p":0.5384615385,
317
- "r":0.6363636364,
318
- "f":0.5833333333
319
  },
320
  "PERCENT":{
321
- "p":0.9411764706,
322
  "r":1.0,
323
- "f":0.9696969697
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
324
  },
325
  "WORK_OF_ART":{
326
- "p":0.7,
327
- "r":0.512195122,
328
- "f":0.5915492958
329
  },
330
  "LANGUAGE":{
331
  "p":0.0,
332
  "r":0.0,
333
  "f":0.0
334
  },
335
- "FAC":{
336
- "p":0.0833333333,
337
- "r":0.05,
338
- "f":0.0625
339
- },
340
  "TIME":{
341
- "p":1.0,
342
  "r":0.8333333333,
343
- "f":0.9090909091
344
- },
345
- "EVENT":{
346
- "p":0.6111111111,
347
- "r":0.6470588235,
348
- "f":0.6285714286
349
  },
350
  "LAW":{
351
  "p":0.0,
352
  "r":0.0,
353
  "f":0.0
354
- },
355
- "PRODUCT":{
356
- "p":0.0,
357
- "r":0.0,
358
- "f":0.0
359
  }
360
  },
361
- "pos_acc":0.9309414621
362
  },
363
  "sources":[
364
- {
365
- "name":"Macedonian Corpus",
366
- "url":"https://blog.netcetera.com/macedonian-spacy-f3c85484777f",
367
- "license":"CC BY-SA 4.0",
368
- "author":"Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska"
369
- },
370
- {
371
- "name":"Macedonian Corpus",
372
- "url":"https://blog.netcetera.com/macedonian-spacy-f3c85484777f",
373
- "license":"CC BY-SA 4.0",
374
- "author":"Damjan Zlatinov, Melanija Gerasimovska, Borijan Georgievski, Marija Todosovska"
375
- },
376
  {
377
  "name":"Macedonian Corpus",
378
  "url":"https://blog.netcetera.com/macedonian-spacy-f3c85484777f",
1
  {
2
  "lang":"mk",
3
  "name":"core_news_md",
4
+ "version":"3.4.0",
5
  "description":"Macedonian pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"CC BY-SA 4.0",
10
+ "spacy_version":">=3.4.0,<3.5.0",
11
+ "spacy_git_version":"dd038b536",
12
  "vectors":{
13
  "width":300,
14
  "vectors":20000,
31
  "POS=SCONJ",
32
  "POS=PART",
33
  "POS=SYM",
 
34
  "_",
35
+ "POS=SPACE",
36
+ "POS=X",
37
  "POS=INTJ"
38
  ],
39
  "parser":[
106
  "token_p":1.0,
107
  "token_r":1.0,
108
  "token_f":1.0,
109
+ "sents_p":0.6666666667,
110
+ "sents_r":0.6493506494,
111
+ "sents_f":0.6578947368,
112
+ "dep_uas":0.6836434868,
113
+ "dep_las":0.5190989226,
114
  "dep_las_per_type":{
115
  "nsubj":{
116
+ "p":0.6857142857,
117
+ "r":0.6315789474,
118
+ "f":0.6575342466
119
  },
120
  "root":{
121
+ "p":0.76,
122
+ "r":0.8142857143,
123
+ "f":0.7862068966
124
  },
125
  "cc":{
126
+ "p":0.8888888889,
127
+ "r":0.5714285714,
128
+ "f":0.6956521739
129
  },
130
  "relcl":{
131
+ "p":0.4210526316,
132
+ "r":0.3076923077,
133
+ "f":0.3555555556
134
  },
135
  "pozm":{
136
+ "p":0.8333333333,
137
+ "r":0.4545454545,
138
+ "f":0.5882352941
139
  },
140
  "poss":{
141
  "p":0.0,
143
  "f":0.0
144
  },
145
  "aux":{
146
+ "p":0.5882352941,
147
+ "r":0.6060606061,
148
+ "f":0.5970149254
149
  },
150
  "prep":{
151
+ "p":0.6825396825,
152
+ "r":0.7166666667,
153
+ "f":0.6991869919
154
  },
155
  "iobj":{
156
  "p":0.0,
158
  "f":0.0
159
  },
160
  "pozv":{
161
+ "p":0.1818181818,
162
+ "r":0.1333333333,
163
+ "f":0.1538461538
164
  },
165
  "quantmod":{
166
  "p":0.0,
168
  "f":0.0
169
  },
170
  "att":{
171
+ "p":0.7254901961,
172
  "r":0.7115384615,
173
+ "f":0.7184466019
174
  },
175
  "det":{
176
  "p":0.0,
183
  "f":0.0
184
  },
185
  "dep":{
186
+ "p":0.0138888889,
187
  "r":0.3333333333,
188
+ "f":0.0266666667
189
  },
190
  "dobj":{
191
+ "p":0.4126984127,
192
+ "r":0.4333333333,
193
+ "f":0.4227642276
194
  },
195
  "ppdo":{
196
+ "p":0.7142857143,
197
+ "r":0.3333333333,
198
+ "f":0.4545454545
199
  },
200
  "neg":{
201
  "p":0.5555555556,
203
  "f":0.5
204
  },
205
  "pobj":{
206
+ "p":0.5,
207
  "r":0.5,
208
+ "f":0.5
209
  },
210
  "mwe":{
211
  "p":0.0,
217
  "r":0.0,
218
  "f":0.0
219
  },
 
 
 
 
 
220
  "appos":{
221
  "p":0.0,
222
  "r":0.0,
223
  "f":0.0
224
  },
225
+ "advmod":{
226
+ "p":0.0,
227
+ "r":0.0,
228
+ "f":0.0
229
+ },
230
  "advcl":{
231
  "p":0.0,
232
  "r":0.0,
263
  "f":0.0
264
  }
265
  },
266
+ "speed":2245.2550240296,
267
+ "ents_p":0.7373737374,
268
+ "ents_r":0.7455319149,
269
+ "ents_f":0.7414303851,
270
  "ents_per_type":{
 
 
 
 
 
271
  "LOC":{
272
+ "p":0.6666666667,
273
  "r":0.5747126437,
274
+ "f":0.6172839506
275
  },
276
+ "GPE":{
277
+ "p":0.8646788991,
278
+ "r":0.8891509434,
279
+ "f":0.876744186
280
  },
281
+ "QUANTITY":{
282
+ "p":0.6511627907,
283
+ "r":0.6829268293,
284
+ "f":0.6666666667
285
  },
286
  "CARDINAL":{
287
+ "p":0.7052631579,
288
+ "r":0.7127659574,
289
+ "f":0.708994709
290
  },
291
+ "DATE":{
292
+ "p":0.7465753425,
293
+ "r":0.7676056338,
294
+ "f":0.7569444444
295
  },
296
+ "PRODUCT":{
297
+ "p":0.0,
298
+ "r":0.0,
299
+ "f":0.0
300
  },
301
+ "NORP":{
302
+ "p":0.4202898551,
303
+ "r":0.4461538462,
304
+ "f":0.4328358209
305
  },
306
  "MONEY":{
307
+ "p":0.5,
308
  "r":0.5,
309
+ "f":0.5
310
  },
311
  "ORDINAL":{
312
+ "p":0.5,
313
+ "r":0.7272727273,
314
+ "f":0.5925925926
315
  },
316
  "PERCENT":{
317
+ "p":1.0,
318
  "r":1.0,
319
+ "f":1.0
320
+ },
321
+ "PERSON":{
322
+ "p":0.7843137255,
323
+ "r":0.8,
324
+ "f":0.7920792079
325
+ },
326
+ "FAC":{
327
+ "p":0.3,
328
+ "r":0.15,
329
+ "f":0.2
330
+ },
331
+ "ORG":{
332
+ "p":0.5454545455,
333
+ "r":0.7346938776,
334
+ "f":0.6260869565
335
+ },
336
+ "EVENT":{
337
+ "p":0.4705882353,
338
+ "r":0.4705882353,
339
+ "f":0.4705882353
340
  },
341
  "WORK_OF_ART":{
342
+ "p":0.5757575758,
343
+ "r":0.4634146341,
344
+ "f":0.5135135135
345
  },
346
  "LANGUAGE":{
347
  "p":0.0,
348
  "r":0.0,
349
  "f":0.0
350
  },
 
 
 
 
 
351
  "TIME":{
352
+ "p":0.8333333333,
353
  "r":0.8333333333,
354
+ "f":0.8333333333
 
 
 
 
 
355
  },
356
  "LAW":{
357
  "p":0.0,
358
  "r":0.0,
359
  "f":0.0
 
 
 
 
 
360
  }
361
  },
362
+ "pos_acc":0.9314809819
363
  },
364
  "sources":[
 
 
 
 
 
 
 
 
 
 
 
 
365
  {
366
  "name":"Macedonian Corpus",
367
  "url":"https://blog.netcetera.com/macedonian-spacy-f3c85484777f",
mk_core_news_md-any-py3-none-any.whl CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e38ac11e399e80457d1dd8eda60796be73053b3ef77e70ab2fedb7a72c1c97a4
3
- size 44860875
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5b1954cbeb70545a3e9019f99ca8e3d4fb3ed00f036ad96a70e764aabf4b0456
3
+ size 44832743
morphologizer/cfg CHANGED
@@ -15,8 +15,9 @@
15
  "POS=SCONJ":"",
16
  "POS=PART":"",
17
  "POS=SYM":"",
18
- "POS=X":"",
19
  "_":"",
 
 
20
  "POS=INTJ":""
21
  },
22
  "labels_pos":{
@@ -34,8 +35,9 @@
34
  "POS=SCONJ":98,
35
  "POS=PART":94,
36
  "POS=SYM":99,
37
- "POS=X":101,
38
  "_":0,
 
 
39
  "POS=INTJ":91
40
  },
41
  "overwrite":true
15
  "POS=SCONJ":"",
16
  "POS=PART":"",
17
  "POS=SYM":"",
 
18
  "_":"",
19
+ "POS=SPACE":"",
20
+ "POS=X":"",
21
  "POS=INTJ":""
22
  },
23
  "labels_pos":{
35
  "POS=SCONJ":98,
36
  "POS=PART":94,
37
  "POS=SYM":99,
 
38
  "_":0,
39
+ "POS=SPACE":103,
40
+ "POS=X":101,
41
  "POS=INTJ":91
42
  },
43
  "overwrite":true
morphologizer/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:028a9a832770195b9dc6f3c01547ea8ab44b4f129e4a9a1682f09974dc1fe3be
3
- size 6372989
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:23dff8f7495983eb4574d0f3b2f3e551c4cd51ffd6a2a36afef55ba9d6c7c3b6
3
+ size 6373377
ner/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2b907d5b7f923c0a08219e878372210605aa1a9553e82e5278c7c835f64c7c14
3
  size 6511153
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:50ff75a59d62f6e1004b06bc73060dcd86a71650ee04297a8830a0d7646c58d4
3
  size 6511153
ner/moves CHANGED
@@ -1 +1 @@
1
- ��moves��{"0":{},"1":{"GPE":3855,"PERSON":2039,"DATE":1865,"ORG":1187,"NORP":1025,"WORK_OF_ART":983,"CARDINAL":641,"LOC":600,"EVENT":476,"FAC":418,"QUANTITY":284,"LAW":141,"PERCENT":136,"TIME":125,"ORDINAL":118,"LANGUAGE":109,"PRODUCT":57,"MONEY":43},"2":{"GPE":3855,"PERSON":2039,"DATE":1865,"ORG":1187,"NORP":1025,"WORK_OF_ART":983,"CARDINAL":641,"LOC":600,"EVENT":476,"FAC":418,"QUANTITY":284,"LAW":141,"PERCENT":136,"TIME":125,"ORDINAL":118,"LANGUAGE":109,"PRODUCT":57,"MONEY":43},"3":{"GPE":3855,"PERSON":2039,"DATE":1865,"ORG":1187,"NORP":1025,"WORK_OF_ART":983,"CARDINAL":641,"LOC":600,"EVENT":476,"FAC":418,"QUANTITY":284,"LAW":141,"PERCENT":136,"TIME":125,"ORDINAL":118,"LANGUAGE":109,"PRODUCT":57,"MONEY":43},"4":{"GPE":3855,"PERSON":2039,"DATE":1865,"ORG":1187,"NORP":1025,"WORK_OF_ART":983,"CARDINAL":641,"LOC":600,"EVENT":476,"FAC":418,"QUANTITY":284,"LAW":141,"PERCENT":136,"TIME":125,"ORDINAL":118,"LANGUAGE":109,"PRODUCT":57,"MONEY":43,"":1},"5":{"":1}}�cfg��neg_key�
1
+ ��moves��{"0":{},"1":{"GPE":3857,"PERSON":2043,"DATE":1873,"ORG":1192,"NORP":1028,"WORK_OF_ART":983,"CARDINAL":641,"LOC":603,"EVENT":481,"FAC":418,"QUANTITY":286,"LAW":141,"PERCENT":136,"TIME":125,"ORDINAL":118,"LANGUAGE":110,"PRODUCT":57,"MONEY":43},"2":{"GPE":3857,"PERSON":2043,"DATE":1873,"ORG":1192,"NORP":1028,"WORK_OF_ART":983,"CARDINAL":641,"LOC":603,"EVENT":481,"FAC":418,"QUANTITY":286,"LAW":141,"PERCENT":136,"TIME":125,"ORDINAL":118,"LANGUAGE":110,"PRODUCT":57,"MONEY":43},"3":{"GPE":3857,"PERSON":2043,"DATE":1873,"ORG":1192,"NORP":1028,"WORK_OF_ART":983,"CARDINAL":641,"LOC":603,"EVENT":481,"FAC":418,"QUANTITY":286,"LAW":141,"PERCENT":136,"TIME":125,"ORDINAL":118,"LANGUAGE":110,"PRODUCT":57,"MONEY":43},"4":{"GPE":3857,"PERSON":2043,"DATE":1873,"ORG":1192,"NORP":1028,"WORK_OF_ART":983,"CARDINAL":641,"LOC":603,"EVENT":481,"FAC":418,"QUANTITY":286,"LAW":141,"PERCENT":136,"TIME":125,"ORDINAL":118,"LANGUAGE":110,"PRODUCT":57,"MONEY":43,"":1},"5":{"":1}}�cfg��neg_key�
parser/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2b70e3edd243084ea1610f52d3960af782f4ed606e5ad442f2bbc6921467383c
3
  size 6665332
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a71589eb0106c54d571809b4fad15bb2c4a0e59262082671d9f5ff89476272a9
3
  size 6665332
parser/moves CHANGED
@@ -1 +1 @@
1
- ��moves�2{"0":{"":1190},"1":{"":2140},"2":{"nsubj":278,"aux":187,"att":180,"neg":67,"prep":56,"poss":42,"pozv":37,"advmod":36,"ppdo||dobj":35,"dep":0},"3":{"punct":550,"dobj":316,"prep":291,"relcl":148,"pobj":141,"aux":87,"cc":70,"iobj":63,"att":56,"nsubj":51,"pozv":44,"pozm":40,"det":31,"dep":0},"4":{"ROOT":500}}�cfg��neg_key�
1
+ ��moves�3{"0":{"":1190},"1":{"":2177},"2":{"nsubj":278,"aux":187,"att":180,"neg":67,"prep":56,"poss":42,"pozv":37,"advmod":36,"ppdo||dobj":35,"dep":0},"3":{"punct":550,"dobj":316,"prep":291,"relcl":148,"pobj":141,"aux":87,"cc":70,"iobj":63,"dep":60,"att":56,"nsubj":51,"pozv":44,"pozm":40,"det":31},"4":{"ROOT":500}}�cfg��neg_key�
senter/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b28e6228aa68ed4566dc2e2d8977435cdfb8d78a917d4ce851ec08d407d63bac
3
  size 219953
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bc49a968935df68512ef8573888c05fc97e3aff994db7834b42540dd23d40fb2
3
  size 219953
vocab/key2row CHANGED
Binary files a/vocab/key2row and b/vocab/key2row differ
vocab/strings.json CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2e9527bcfe770697345b06ba498cb76a27fc32e3d5fddb465cc09dcb6223a6e2
3
- size 16436509
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3747ce536a12454d77ae50c3adba83c1f1212b2c316c5152cfe3f991d210f266
3
+ size 16437251