adrianeboyd commited on
Commit
a56a017
1 Parent(s): 2f5c7b2

Update spaCy pipeline

Browse files
.gitattributes CHANGED
@@ -19,3 +19,4 @@
19
  *strings.json filter=lfs diff=lfs merge=lfs -text
20
  vectors filter=lfs diff=lfs merge=lfs -text
21
  model filter=lfs diff=lfs merge=lfs -text
 
19
  *strings.json filter=lfs diff=lfs merge=lfs -text
20
  vectors filter=lfs diff=lfs merge=lfs -text
21
  model filter=lfs diff=lfs merge=lfs -text
22
+ vocab/key2row filter=lfs diff=lfs merge=lfs -text
LICENSES_SOURCES CHANGED
@@ -878,446 +878,6 @@ Creative Commons may be contacted at creativecommons.org.
878
 
879
 
880
 
881
- # UD Dutch LassySmall v2.8
882
-
883
- * Author: Bouma, Gosse; van Noord, Gertjan
884
- * URL: https://github.com/UniversalDependencies/UD_Dutch-LassySmall
885
- * License: CC BY-SA 4.0
886
-
887
- ```
888
- Attribution-ShareAlike 4.0 International
889
-
890
- =======================================================================
891
-
892
- Creative Commons Corporation ("Creative Commons") is not a law firm and
893
- does not provide legal services or legal advice. Distribution of
894
- Creative Commons public licenses does not create a lawyer-client or
895
- other relationship. Creative Commons makes its licenses and related
896
- information available on an "as-is" basis. Creative Commons gives no
897
- warranties regarding its licenses, any material licensed under their
898
- terms and conditions, or any related information. Creative Commons
899
- disclaims all liability for damages resulting from their use to the
900
- fullest extent possible.
901
-
902
- Using Creative Commons Public Licenses
903
-
904
- Creative Commons public licenses provide a standard set of terms and
905
- conditions that creators and other rights holders may use to share
906
- original works of authorship and other material subject to copyright
907
- and certain other rights specified in the public license below. The
908
- following considerations are for informational purposes only, are not
909
- exhaustive, and do not form part of our licenses.
910
-
911
- Considerations for licensors: Our public licenses are
912
- intended for use by those authorized to give the public
913
- permission to use material in ways otherwise restricted by
914
- copyright and certain other rights. Our licenses are
915
- irrevocable. Licensors should read and understand the terms
916
- and conditions of the license they choose before applying it.
917
- Licensors should also secure all rights necessary before
918
- applying our licenses so that the public can reuse the
919
- material as expected. Licensors should clearly mark any
920
- material not subject to the license. This includes other CC-
921
- licensed material, or material used under an exception or
922
- limitation to copyright. More considerations for licensors:
923
- wiki.creativecommons.org/Considerations_for_licensors
924
-
925
- Considerations for the public: By using one of our public
926
- licenses, a licensor grants the public permission to use the
927
- licensed material under specified terms and conditions. If
928
- the licensor's permission is not necessary for any reason--for
929
- example, because of any applicable exception or limitation to
930
- copyright--then that use is not regulated by the license. Our
931
- licenses grant only permissions under copyright and certain
932
- other rights that a licensor has authority to grant. Use of
933
- the licensed material may still be restricted for other
934
- reasons, including because others have copyright or other
935
- rights in the material. A licensor may make special requests,
936
- such as asking that all changes be marked or described.
937
- Although not required by our licenses, you are encouraged to
938
- respect those requests where reasonable. More considerations
939
- for the public:
940
- wiki.creativecommons.org/Considerations_for_licensees
941
-
942
- =======================================================================
943
-
944
- Creative Commons Attribution-ShareAlike 4.0 International Public
945
- License
946
-
947
- By exercising the Licensed Rights (defined below), You accept and agree
948
- to be bound by the terms and conditions of this Creative Commons
949
- Attribution-ShareAlike 4.0 International Public License ("Public
950
- License"). To the extent this Public License may be interpreted as a
951
- contract, You are granted the Licensed Rights in consideration of Your
952
- acceptance of these terms and conditions, and the Licensor grants You
953
- such rights in consideration of benefits the Licensor receives from
954
- making the Licensed Material available under these terms and
955
- conditions.
956
-
957
-
958
- Section 1 -- Definitions.
959
-
960
- a. Adapted Material means material subject to Copyright and Similar
961
- Rights that is derived from or based upon the Licensed Material
962
- and in which the Licensed Material is translated, altered,
963
- arranged, transformed, or otherwise modified in a manner requiring
964
- permission under the Copyright and Similar Rights held by the
965
- Licensor. For purposes of this Public License, where the Licensed
966
- Material is a musical work, performance, or sound recording,
967
- Adapted Material is always produced where the Licensed Material is
968
- synched in timed relation with a moving image.
969
-
970
- b. Adapter's License means the license You apply to Your Copyright
971
- and Similar Rights in Your contributions to Adapted Material in
972
- accordance with the terms and conditions of this Public License.
973
-
974
- c. BY-SA Compatible License means a license listed at
975
- creativecommons.org/compatiblelicenses, approved by Creative
976
- Commons as essentially the equivalent of this Public License.
977
-
978
- d. Copyright and Similar Rights means copyright and/or similar rights
979
- closely related to copyright including, without limitation,
980
- performance, broadcast, sound recording, and Sui Generis Database
981
- Rights, without regard to how the rights are labeled or
982
- categorized. For purposes of this Public License, the rights
983
- specified in Section 2(b)(1)-(2) are not Copyright and Similar
984
- Rights.
985
-
986
- e. Effective Technological Measures means those measures that, in the
987
- absence of proper authority, may not be circumvented under laws
988
- fulfilling obligations under Article 11 of the WIPO Copyright
989
- Treaty adopted on December 20, 1996, and/or similar international
990
- agreements.
991
-
992
- f. Exceptions and Limitations means fair use, fair dealing, and/or
993
- any other exception or limitation to Copyright and Similar Rights
994
- that applies to Your use of the Licensed Material.
995
-
996
- g. License Elements means the license attributes listed in the name
997
- of a Creative Commons Public License. The License Elements of this
998
- Public License are Attribution and ShareAlike.
999
-
1000
- h. Licensed Material means the artistic or literary work, database,
1001
- or other material to which the Licensor applied this Public
1002
- License.
1003
-
1004
- i. Licensed Rights means the rights granted to You subject to the
1005
- terms and conditions of this Public License, which are limited to
1006
- all Copyright and Similar Rights that apply to Your use of the
1007
- Licensed Material and that the Licensor has authority to license.
1008
-
1009
- j. Licensor means the individual(s) or entity(ies) granting rights
1010
- under this Public License.
1011
-
1012
- k. Share means to provide material to the public by any means or
1013
- process that requires permission under the Licensed Rights, such
1014
- as reproduction, public display, public performance, distribution,
1015
- dissemination, communication, or importation, and to make material
1016
- available to the public including in ways that members of the
1017
- public may access the material from a place and at a time
1018
- individually chosen by them.
1019
-
1020
- l. Sui Generis Database Rights means rights other than copyright
1021
- resulting from Directive 96/9/EC of the European Parliament and of
1022
- the Council of 11 March 1996 on the legal protection of databases,
1023
- as amended and/or succeeded, as well as other essentially
1024
- equivalent rights anywhere in the world.
1025
-
1026
- m. You means the individual or entity exercising the Licensed Rights
1027
- under this Public License. Your has a corresponding meaning.
1028
-
1029
-
1030
- Section 2 -- Scope.
1031
-
1032
- a. License grant.
1033
-
1034
- 1. Subject to the terms and conditions of this Public License,
1035
- the Licensor hereby grants You a worldwide, royalty-free,
1036
- non-sublicensable, non-exclusive, irrevocable license to
1037
- exercise the Licensed Rights in the Licensed Material to:
1038
-
1039
- a. reproduce and Share the Licensed Material, in whole or
1040
- in part; and
1041
-
1042
- b. produce, reproduce, and Share Adapted Material.
1043
-
1044
- 2. Exceptions and Limitations. For the avoidance of doubt, where
1045
- Exceptions and Limitations apply to Your use, this Public
1046
- License does not apply, and You do not need to comply with
1047
- its terms and conditions.
1048
-
1049
- 3. Term. The term of this Public License is specified in Section
1050
- 6(a).
1051
-
1052
- 4. Media and formats; technical modifications allowed. The
1053
- Licensor authorizes You to exercise the Licensed Rights in
1054
- all media and formats whether now known or hereafter created,
1055
- and to make technical modifications necessary to do so. The
1056
- Licensor waives and/or agrees not to assert any right or
1057
- authority to forbid You from making technical modifications
1058
- necessary to exercise the Licensed Rights, including
1059
- technical modifications necessary to circumvent Effective
1060
- Technological Measures. For purposes of this Public License,
1061
- simply making modifications authorized by this Section 2(a)
1062
- (4) never produces Adapted Material.
1063
-
1064
- 5. Downstream recipients.
1065
-
1066
- a. Offer from the Licensor -- Licensed Material. Every
1067
- recipient of the Licensed Material automatically
1068
- receives an offer from the Licensor to exercise the
1069
- Licensed Rights under the terms and conditions of this
1070
- Public License.
1071
-
1072
- b. Additional offer from the Licensor -- Adapted Material.
1073
- Every recipient of Adapted Material from You
1074
- automatically receives an offer from the Licensor to
1075
- exercise the Licensed Rights in the Adapted Material
1076
- under the conditions of the Adapter's License You apply.
1077
-
1078
- c. No downstream restrictions. You may not offer or impose
1079
- any additional or different terms or conditions on, or
1080
- apply any Effective Technological Measures to, the
1081
- Licensed Material if doing so restricts exercise of the
1082
- Licensed Rights by any recipient of the Licensed
1083
- Material.
1084
-
1085
- 6. No endorsement. Nothing in this Public License constitutes or
1086
- may be construed as permission to assert or imply that You
1087
- are, or that Your use of the Licensed Material is, connected
1088
- with, or sponsored, endorsed, or granted official status by,
1089
- the Licensor or others designated to receive attribution as
1090
- provided in Section 3(a)(1)(A)(i).
1091
-
1092
- b. Other rights.
1093
-
1094
- 1. Moral rights, such as the right of integrity, are not
1095
- licensed under this Public License, nor are publicity,
1096
- privacy, and/or other similar personality rights; however, to
1097
- the extent possible, the Licensor waives and/or agrees not to
1098
- assert any such rights held by the Licensor to the limited
1099
- extent necessary to allow You to exercise the Licensed
1100
- Rights, but not otherwise.
1101
-
1102
- 2. Patent and trademark rights are not licensed under this
1103
- Public License.
1104
-
1105
- 3. To the extent possible, the Licensor waives any right to
1106
- collect royalties from You for the exercise of the Licensed
1107
- Rights, whether directly or through a collecting society
1108
- under any voluntary or waivable statutory or compulsory
1109
- licensing scheme. In all other cases the Licensor expressly
1110
- reserves any right to collect such royalties.
1111
-
1112
-
1113
- Section 3 -- License Conditions.
1114
-
1115
- Your exercise of the Licensed Rights is expressly made subject to the
1116
- following conditions.
1117
-
1118
- a. Attribution.
1119
-
1120
- 1. If You Share the Licensed Material (including in modified
1121
- form), You must:
1122
-
1123
- a. retain the following if it is supplied by the Licensor
1124
- with the Licensed Material:
1125
-
1126
- i. identification of the creator(s) of the Licensed
1127
- Material and any others designated to receive
1128
- attribution, in any reasonable manner requested by
1129
- the Licensor (including by pseudonym if
1130
- designated);
1131
-
1132
- ii. a copyright notice;
1133
-
1134
- iii. a notice that refers to this Public License;
1135
-
1136
- iv. a notice that refers to the disclaimer of
1137
- warranties;
1138
-
1139
- v. a URI or hyperlink to the Licensed Material to the
1140
- extent reasonably practicable;
1141
-
1142
- b. indicate if You modified the Licensed Material and
1143
- retain an indication of any previous modifications; and
1144
-
1145
- c. indicate the Licensed Material is licensed under this
1146
- Public License, and include the text of, or the URI or
1147
- hyperlink to, this Public License.
1148
-
1149
- 2. You may satisfy the conditions in Section 3(a)(1) in any
1150
- reasonable manner based on the medium, means, and context in
1151
- which You Share the Licensed Material. For example, it may be
1152
- reasonable to satisfy the conditions by providing a URI or
1153
- hyperlink to a resource that includes the required
1154
- information.
1155
-
1156
- 3. If requested by the Licensor, You must remove any of the
1157
- information required by Section 3(a)(1)(A) to the extent
1158
- reasonably practicable.
1159
-
1160
- b. ShareAlike.
1161
-
1162
- In addition to the conditions in Section 3(a), if You Share
1163
- Adapted Material You produce, the following conditions also apply.
1164
-
1165
- 1. The Adapter's License You apply must be a Creative Commons
1166
- license with the same License Elements, this version or
1167
- later, or a BY-SA Compatible License.
1168
-
1169
- 2. You must include the text of, or the URI or hyperlink to, the
1170
- Adapter's License You apply. You may satisfy this condition
1171
- in any reasonable manner based on the medium, means, and
1172
- context in which You Share Adapted Material.
1173
-
1174
- 3. You may not offer or impose any additional or different terms
1175
- or conditions on, or apply any Effective Technological
1176
- Measures to, Adapted Material that restrict exercise of the
1177
- rights granted under the Adapter's License You apply.
1178
-
1179
-
1180
- Section 4 -- Sui Generis Database Rights.
1181
-
1182
- Where the Licensed Rights include Sui Generis Database Rights that
1183
- apply to Your use of the Licensed Material:
1184
-
1185
- a. for the avoidance of doubt, Section 2(a)(1) grants You the right
1186
- to extract, reuse, reproduce, and Share all or a substantial
1187
- portion of the contents of the database;
1188
-
1189
- b. if You include all or a substantial portion of the database
1190
- contents in a database in which You have Sui Generis Database
1191
- Rights, then the database in which You have Sui Generis Database
1192
- Rights (but not its individual contents) is Adapted Material,
1193
-
1194
- including for purposes of Section 3(b); and
1195
- c. You must comply with the conditions in Section 3(a) if You Share
1196
- all or a substantial portion of the contents of the database.
1197
-
1198
- For the avoidance of doubt, this Section 4 supplements and does not
1199
- replace Your obligations under this Public License where the Licensed
1200
- Rights include other Copyright and Similar Rights.
1201
-
1202
-
1203
- Section 5 -- Disclaimer of Warranties and Limitation of Liability.
1204
-
1205
- a. UNLESS OTHERWISE SEPARATELY UNDERTAKEN BY THE LICENSOR, TO THE
1206
- EXTENT POSSIBLE, THE LICENSOR OFFERS THE LICENSED MATERIAL AS-IS
1207
- AND AS-AVAILABLE, AND MAKES NO REPRESENTATIONS OR WARRANTIES OF
1208
- ANY KIND CONCERNING THE LICENSED MATERIAL, WHETHER EXPRESS,
1209
- IMPLIED, STATUTORY, OR OTHER. THIS INCLUDES, WITHOUT LIMITATION,
1210
- WARRANTIES OF TITLE, MERCHANTABILITY, FITNESS FOR A PARTICULAR
1211
- PURPOSE, NON-INFRINGEMENT, ABSENCE OF LATENT OR OTHER DEFECTS,
1212
- ACCURACY, OR THE PRESENCE OR ABSENCE OF ERRORS, WHETHER OR NOT
1213
- KNOWN OR DISCOVERABLE. WHERE DISCLAIMERS OF WARRANTIES ARE NOT
1214
- ALLOWED IN FULL OR IN PART, THIS DISCLAIMER MAY NOT APPLY TO YOU.
1215
-
1216
- b. TO THE EXTENT POSSIBLE, IN NO EVENT WILL THE LICENSOR BE LIABLE
1217
- TO YOU ON ANY LEGAL THEORY (INCLUDING, WITHOUT LIMITATION,
1218
- NEGLIGENCE) OR OTHERWISE FOR ANY DIRECT, SPECIAL, INDIRECT,
1219
- INCIDENTAL, CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES,
1220
- COSTS, EXPENSES, OR DAMAGES ARISING OUT OF THIS PUBLIC LICENSE OR
1221
- USE OF THE LICENSED MATERIAL, EVEN IF THE LICENSOR HAS BEEN
1222
- ADVISED OF THE POSSIBILITY OF SUCH LOSSES, COSTS, EXPENSES, OR
1223
- DAMAGES. WHERE A LIMITATION OF LIABILITY IS NOT ALLOWED IN FULL OR
1224
- IN PART, THIS LIMITATION MAY NOT APPLY TO YOU.
1225
-
1226
- c. The disclaimer of warranties and limitation of liability provided
1227
- above shall be interpreted in a manner that, to the extent
1228
- possible, most closely approximates an absolute disclaimer and
1229
- waiver of all liability.
1230
-
1231
-
1232
- Section 6 -- Term and Termination.
1233
-
1234
- a. This Public License applies for the term of the Copyright and
1235
- Similar Rights licensed here. However, if You fail to comply with
1236
- this Public License, then Your rights under this Public License
1237
- terminate automatically.
1238
-
1239
- b. Where Your right to use the Licensed Material has terminated under
1240
- Section 6(a), it reinstates:
1241
-
1242
- 1. automatically as of the date the violation is cured, provided
1243
- it is cured within 30 days of Your discovery of the
1244
- violation; or
1245
-
1246
- 2. upon express reinstatement by the Licensor.
1247
-
1248
- For the avoidance of doubt, this Section 6(b) does not affect any
1249
- right the Licensor may have to seek remedies for Your violations
1250
- of this Public License.
1251
-
1252
- c. For the avoidance of doubt, the Licensor may also offer the
1253
- Licensed Material under separate terms or conditions or stop
1254
- distributing the Licensed Material at any time; however, doing so
1255
- will not terminate this Public License.
1256
-
1257
- d. Sections 1, 5, 6, 7, and 8 survive termination of this Public
1258
- License.
1259
-
1260
-
1261
- Section 7 -- Other Terms and Conditions.
1262
-
1263
- a. The Licensor shall not be bound by any additional or different
1264
- terms or conditions communicated by You unless expressly agreed.
1265
-
1266
- b. Any arrangements, understandings, or agreements regarding the
1267
- Licensed Material not stated herein are separate from and
1268
- independent of the terms and conditions of this Public License.
1269
-
1270
-
1271
- Section 8 -- Interpretation.
1272
-
1273
- a. For the avoidance of doubt, this Public License does not, and
1274
- shall not be interpreted to, reduce, limit, restrict, or impose
1275
- conditions on any use of the Licensed Material that could lawfully
1276
- be made without permission under this Public License.
1277
-
1278
- b. To the extent possible, if any provision of this Public License is
1279
- deemed unenforceable, it shall be automatically reformed to the
1280
- minimum extent necessary to make it enforceable. If the provision
1281
- cannot be reformed, it shall be severed from this Public License
1282
- without affecting the enforceability of the remaining terms and
1283
- conditions.
1284
-
1285
- c. No term or condition of this Public License will be waived and no
1286
- failure to comply consented to unless expressly agreed to by the
1287
- Licensor.
1288
-
1289
- d. Nothing in this Public License constitutes or may be interpreted
1290
- as a limitation upon, or waiver of, any privileges and immunities
1291
- that apply to the Licensor or You, including from the legal
1292
- processes of any jurisdiction or authority.
1293
-
1294
-
1295
- =======================================================================
1296
-
1297
- Creative Commons is not a party to its public
1298
- licenses. Notwithstanding, Creative Commons may elect to apply one of
1299
- its public licenses to material it publishes and in those instances
1300
- will be considered the “Licensor.” The text of the Creative Commons
1301
- public licenses is dedicated to the public domain under the CC0 Public
1302
- Domain Dedication. Except for the limited purpose of indicating that
1303
- material is shared under a Creative Commons public license or as
1304
- otherwise permitted by the Creative Commons policies published at
1305
- creativecommons.org/policies, Creative Commons does not authorize the
1306
- use of the trademark "Creative Commons" or any other trademark or logo
1307
- of Creative Commons without its prior written consent including,
1308
- without limitation, in connection with any unauthorized modifications
1309
- to any of its public licenses or any other arrangements,
1310
- understandings, or agreements concerning use of licensed material. For
1311
- the avoidance of doubt, this paragraph does not form part of the
1312
- public licenses.
1313
-
1314
- Creative Commons may be contacted at creativecommons.org.
1315
-
1316
- ```
1317
-
1318
-
1319
-
1320
-
1321
  # UD Dutch Alpino v2.8
1322
 
1323
  * Author: Zeman, Daniel; Žabokrtský, Zdeněk; Bouma, Gosse; van Noord, Gertjan
878
 
879
 
880
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
881
  # UD Dutch Alpino v2.8
882
 
883
  * Author: Zeman, Daniel; Žabokrtský, Zdeněk; Bouma, Gosse; van Noord, Gertjan
README.md CHANGED
@@ -14,62 +14,62 @@ model-index:
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
- value: 0.7652916074
18
  - name: NER Recall
19
  type: recall
20
- value: 0.7441217151
21
  - name: NER F Score
22
  type: f_score
23
- value: 0.7545582048
24
  - task:
25
  name: TAG
26
  type: token-classification
27
  metrics:
28
  - name: TAG (XPOS) Accuracy
29
  type: accuracy
30
- value: 0.9534133043
31
  - task:
32
  name: POS
33
  type: token-classification
34
  metrics:
35
  - name: POS (UPOS) Accuracy
36
  type: accuracy
37
- value: 0.9661941112
38
  - task:
39
  name: MORPH
40
  type: token-classification
41
  metrics:
42
  - name: Morph (UFeats) Accuracy
43
  type: accuracy
44
- value: 0.9635947213
45
  - task:
46
  name: LEMMA
47
  type: token-classification
48
  metrics:
49
  - name: Lemma Accuracy
50
  type: accuracy
51
- value: 0.9417537126
52
  - task:
53
  name: UNLABELED_DEPENDENCIES
54
  type: token-classification
55
  metrics:
56
  - name: Unlabeled Attachment Score (UAS)
57
  type: f_score
58
- value: 0.8698053923
59
  - task:
60
  name: LABELED_DEPENDENCIES
61
  type: token-classification
62
  metrics:
63
  - name: Labeled Attachment Score (LAS)
64
  type: f_score
65
- value: 0.8235860531
66
  - task:
67
  name: SENTS
68
  type: token-classification
69
  metrics:
70
  - name: Sentences F-Score
71
  type: f_score
72
- value: 0.8749559704
73
  ---
74
  ### Details: https://spacy.io/models/nl#nl_core_news_lg
75
 
@@ -78,12 +78,12 @@ Dutch pipeline optimized for CPU. Components: tok2vec, morphologizer, tagger, pa
78
  | Feature | Description |
79
  | --- | --- |
80
  | **Name** | `nl_core_news_lg` |
81
- | **Version** | `3.3.0` |
82
- | **spaCy** | `>=3.3.0.dev0,<3.4.0` |
83
  | **Default Pipeline** | `tok2vec`, `morphologizer`, `tagger`, `parser`, `lemmatizer`, `attribute_ruler`, `ner` |
84
  | **Components** | `tok2vec`, `morphologizer`, `tagger`, `parser`, `lemmatizer`, `senter`, `attribute_ruler`, `ner` |
85
  | **Vectors** | 500000 keys, 500000 unique vectors (300 dimensions) |
86
- | **Sources** | [UD Dutch LassySmall v2.8](https://github.com/UniversalDependencies/UD_Dutch-LassySmall) (Bouma, Gosse; van Noord, Gertjan)<br />[Dutch NER Annotations for UD LassySmall](https://nlp.town) (NLP Town)<br />[UD Dutch LassySmall v2.8](https://github.com/UniversalDependencies/UD_Dutch-LassySmall) (Bouma, Gosse; van Noord, Gertjan)<br />[UD Dutch Alpino v2.8](https://github.com/UniversalDependencies/UD_Dutch-Alpino) (Zeman, Daniel; Žabokrtský, Zdeněk; Bouma, Gosse; van Noord, Gertjan)<br />[Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)](https://spacy.io) (Explosion) |
87
  | **License** | `CC BY-SA 4.0` |
88
  | **Author** | [Explosion](https://explosion.ai) |
89
 
@@ -91,12 +91,12 @@ Dutch pipeline optimized for CPU. Components: tok2vec, morphologizer, tagger, pa
91
 
92
  <details>
93
 
94
- <summary>View label scheme (321 labels for 4 components)</summary>
95
 
96
  | Component | Labels |
97
  | --- | --- |
98
- | **`morphologizer`** | `POS=PRON\|Person=3\|PronType=Dem`, `Number=Sing\|POS=AUX\|Tense=Pres\|VerbForm=Fin`, `POS=ADV`, `POS=VERB\|VerbForm=Part`, `POS=PUNCT`, `Number=Sing\|POS=AUX\|Tense=Past\|VerbForm=Fin`, `POS=ADP`, `POS=NUM`, `Number=Plur\|POS=NOUN`, `POS=VERB\|VerbForm=Inf`, `POS=SCONJ`, `Definite=Def\|POS=DET`, `Gender=Com\|Number=Sing\|POS=NOUN`, `Number=Sing\|POS=VERB\|Tense=Pres\|VerbForm=Fin`, `Degree=Pos\|POS=ADJ`, `Gender=Neut\|Number=Sing\|POS=PROPN`, `Gender=Com\|Number=Sing\|POS=PROPN`, `POS=AUX\|VerbForm=Inf`, `Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Fin`, `POS=DET`, `Gender=Neut\|Number=Sing\|POS=NOUN`, `POS=PRON\|Person=3\|PronType=Prs`, `POS=CCONJ`, `Number=Plur\|POS=VERB\|Tense=Pres\|VerbForm=Fin`, `POS=PRON\|Person=3\|PronType=Ind`, `Degree=Cmp\|POS=ADJ`, `Case=Nom\|POS=PRON\|Person=1\|PronType=Prs`, `Definite=Ind\|POS=DET`, `Case=Nom\|POS=PRON\|Person=3\|PronType=Prs`, `POS=PRON\|Person=3\|Poss=Yes\|PronType=Prs`, `Number=Plur\|POS=AUX\|Tense=Pres\|VerbForm=Fin`, `POS=PRON\|PronType=Rel`, `Case=Acc\|POS=PRON\|Person=1\|PronType=Prs`, `Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Fin`, `Gender=Com,Neut\|Number=Sing\|POS=NOUN`, `Case=Acc\|POS=PRON\|Person=3\|PronType=Prs\|Reflex=Yes`, `Case=Acc\|POS=PRON\|Person=3\|PronType=Prs`, `POS=PROPN`, `POS=PRON\|PronType=Ind`, `POS=PRON\|Person=3\|PronType=Int`, `Case=Acc\|POS=PRON\|PronType=Rcp`, `Number=Plur\|POS=AUX\|Tense=Past\|VerbForm=Fin`, `Number=Sing\|POS=NOUN`, `POS=PRON\|Person=1\|Poss=Yes\|PronType=Prs`, `POS=SYM`, `Abbr=Yes\|POS=X`, `Gender=Com,Neut\|Number=Sing\|POS=PROPN`, `Degree=Sup\|POS=ADJ`, `POS=ADJ`, `Number=Sing\|POS=PROPN`, `POS=PRON\|PronType=Dem`, `POS=AUX\|VerbForm=Part`, `POS=PRON\|Person=3\|PronType=Rel`, `Number=Plur\|POS=PROPN`, `POS=PRON\|Person=2\|Poss=Yes\|PronType=Prs`, `Case=Dat\|POS=PRON\|PronType=Dem`, `Case=Nom\|POS=PRON\|Person=2\|PronType=Prs`, `POS=INTJ`, `Case=Acc\|POS=PRON\|Person=2\|PronType=Prs`, `Case=Gen\|POS=PRON\|Person=3\|Poss=Yes\|PronType=Prs`, `POS=PRON\|PronType=Int`, `POS=PRON\|Person=2\|PronType=Prs`, `POS=PRON\|Person=3`, `Case=Gen\|POS=PRON\|Person=2\|PronType=Prs`, `POS=X` |
99
- | **`tagger`** | `ADJ\|nom\|basis\|met-e\|mv-n`, `ADJ\|nom\|basis\|met-e\|zonder-n\|bijz`, `ADJ\|nom\|basis\|met-e\|zonder-n\|stan`, `ADJ\|nom\|basis\|zonder\|mv-n`, `ADJ\|nom\|basis\|zonder\|zonder-n`, `ADJ\|nom\|comp\|met-e\|mv-n`, `ADJ\|nom\|comp\|met-e\|zonder-n\|stan`, `ADJ\|nom\|sup\|met-e\|mv-n`, `ADJ\|nom\|sup\|met-e\|zonder-n\|bijz`, `ADJ\|nom\|sup\|met-e\|zonder-n\|stan`, `ADJ\|nom\|sup\|zonder\|zonder-n`, `ADJ\|postnom\|basis\|met-s`, `ADJ\|postnom\|basis\|zonder`, `ADJ\|postnom\|comp\|met-s`, `ADJ\|prenom\|basis\|met-e\|bijz`, `ADJ\|prenom\|basis\|met-e\|stan`, `ADJ\|prenom\|basis\|zonder`, `ADJ\|prenom\|comp\|met-e\|stan`, `ADJ\|prenom\|comp\|zonder`, `ADJ\|prenom\|sup\|met-e\|stan`, `ADJ\|prenom\|sup\|zonder`, `ADJ\|vrij\|basis\|zonder`, `ADJ\|vrij\|comp\|zonder`, `ADJ\|vrij\|dim\|zonder`, `ADJ\|vrij\|sup\|zonder`, `BW`, `LET`, `LID\|bep\|dat\|evmo`, `LID\|bep\|gen\|evmo`, `LID\|bep\|gen\|rest3`, `LID\|bep\|stan\|evon`, `LID\|bep\|stan\|rest`, `LID\|onbep\|stan\|agr`, `N\|eigen\|ev\|basis\|gen`, `N\|eigen\|ev\|basis\|genus\|stan`, `N\|eigen\|ev\|basis\|onz\|stan`, `N\|eigen\|ev\|basis\|zijd\|stan`, `N\|eigen\|ev\|dim\|onz\|stan`, `N\|eigen\|mv\|basis`, `N\|soort\|ev\|basis\|dat`, `N\|soort\|ev\|basis\|gen`, `N\|soort\|ev\|basis\|genus\|stan`, `N\|soort\|ev\|basis\|onz\|stan`, `N\|soort\|ev\|basis\|zijd\|stan`, `N\|soort\|ev\|dim\|onz\|stan`, `N\|soort\|mv\|basis`, `N\|soort\|mv\|dim`, `SPEC\|afgebr`, `SPEC\|afk`, `SPEC\|deeleigen`, `SPEC\|enof`, `SPEC\|meta`, `SPEC\|symb`, `SPEC\|vreemd`, `TSW`, `TW\|hoofd\|nom\|mv-n\|basis`, `TW\|hoofd\|nom\|mv-n\|dim`, `TW\|hoofd\|nom\|zonder-n\|basis`, `TW\|hoofd\|nom\|zonder-n\|dim`, `TW\|hoofd\|prenom\|stan`, `TW\|hoofd\|vrij`, `TW\|rang\|nom\|mv-n`, `TW\|rang\|nom\|zonder-n`, `TW\|rang\|prenom\|stan`, `VG\|neven`, `VG\|onder`, `VNW\|aanw\|adv-pron\|obl\|vol\|3o\|getal`, `VNW\|aanw\|adv-pron\|stan\|red\|3\|getal`, `VNW\|aanw\|det\|dat\|nom\|met-e\|zonder-n`, `VNW\|aanw\|det\|dat\|prenom\|met-e\|evmo`, `VNW\|aanw\|det\|gen\|prenom\|met-e\|rest3`, `VNW\|aanw\|det\|stan\|nom\|met-e\|mv-n`, `VNW\|aanw\|det\|stan\|nom\|met-e\|zonder-n`, `VNW\|aanw\|det\|stan\|prenom\|met-e\|rest`, `VNW\|aanw\|det\|stan\|prenom\|zonder\|agr`, `VNW\|aanw\|det\|stan\|prenom\|zonder\|evon`, `VNW\|aanw\|det\|stan\|prenom\|zonder\|rest`, `VNW\|aanw\|det\|stan\|vrij\|zonder`, `VNW\|aanw\|pron\|gen\|vol\|3m\|ev`, `VNW\|aanw\|pron\|stan\|vol\|3o\|ev`, `VNW\|aanw\|pron\|stan\|vol\|3\|getal`, `VNW\|betr\|det\|stan\|nom\|met-e\|zonder-n`, `VNW\|betr\|det\|stan\|nom\|zonder\|zonder-n`, `VNW\|betr\|pron\|stan\|vol\|3\|ev`, `VNW\|betr\|pron\|stan\|vol\|persoon\|getal`, `VNW\|bez\|det\|gen\|vol\|3\|ev\|prenom\|met-e\|rest3`, `VNW\|bez\|det\|stan\|nadr\|2v\|mv\|prenom\|zonder\|agr`, `VNW\|bez\|det\|stan\|red\|1\|ev\|prenom\|zonder\|agr`, `VNW\|bez\|det\|stan\|red\|2v\|ev\|prenom\|zonder\|agr`, `VNW\|bez\|det\|stan\|red\|3\|ev\|prenom\|zonder\|agr`, `VNW\|bez\|det\|stan\|vol\|1\|ev\|prenom\|met-e\|rest`, `VNW\|bez\|det\|stan\|vol\|1\|ev\|prenom\|zonder\|agr`, `VNW\|bez\|det\|stan\|vol\|1\|mv\|prenom\|met-e\|rest`, `VNW\|bez\|det\|stan\|vol\|1\|mv\|prenom\|zonder\|evon`, `VNW\|bez\|det\|stan\|vol\|2v\|ev\|prenom\|zonder\|agr`, `VNW\|bez\|det\|stan\|vol\|2\|getal\|prenom\|zonder\|agr`, `VNW\|bez\|det\|stan\|vol\|3m\|ev\|nom\|met-e\|zonder-n`, `VNW\|bez\|det\|stan\|vol\|3m\|ev\|prenom\|met-e\|rest`, `VNW\|bez\|det\|stan\|vol\|3p\|mv\|prenom\|met-e\|rest`, `VNW\|bez\|det\|stan\|vol\|3v\|ev\|nom\|met-e\|zonder-n`, `VNW\|bez\|det\|stan\|vol\|3v\|ev\|prenom\|met-e\|rest`, `VNW\|bez\|det\|stan\|vol\|3\|ev\|prenom\|zonder\|agr`, `VNW\|bez\|det\|stan\|vol\|3\|mv\|prenom\|zonder\|agr`, `VNW\|excl\|pron\|stan\|vol\|3\|getal`, `VNW\|onbep\|adv-pron\|gen\|red\|3\|getal`, `VNW\|onbep\|adv-pron\|obl\|vol\|3o\|getal`, `VNW\|onbep\|det\|stan\|nom\|met-e\|mv-n`, `VNW\|onbep\|det\|stan\|nom\|met-e\|zonder-n`, `VNW\|onbep\|det\|stan\|nom\|zonder\|zonder-n`, `VNW\|onbep\|det\|stan\|prenom\|met-e\|agr`, `VNW\|onbep\|det\|stan\|prenom\|met-e\|evz`, `VNW\|onbep\|det\|stan\|prenom\|met-e\|mv`, `VNW\|onbep\|det\|stan\|prenom\|met-e\|rest`, `VNW\|onbep\|det\|stan\|prenom\|zonder\|agr`, `VNW\|onbep\|det\|stan\|prenom\|zonder\|evon`, `VNW\|onbep\|det\|stan\|vrij\|zonder`, `VNW\|onbep\|grad\|gen\|nom\|met-e\|mv-n\|basis`, `VNW\|onbep\|grad\|stan\|nom\|met-e\|mv-n\|basis`, `VNW\|onbep\|grad\|stan\|nom\|met-e\|mv-n\|sup`, `VNW\|onbep\|grad\|stan\|nom\|met-e\|zonder-n\|basis`, `VNW\|onbep\|grad\|stan\|nom\|met-e\|zonder-n\|sup`, `VNW\|onbep\|grad\|stan\|prenom\|met-e\|agr\|basis`, `VNW\|onbep\|grad\|stan\|prenom\|met-e\|agr\|comp`, `VNW\|onbep\|grad\|stan\|prenom\|met-e\|agr\|sup`, `VNW\|onbep\|grad\|stan\|prenom\|met-e\|mv\|basis`, `VNW\|onbep\|grad\|stan\|prenom\|zonder\|agr\|basis`, `VNW\|onbep\|grad\|stan\|prenom\|zonder\|agr\|comp`, `VNW\|onbep\|grad\|stan\|vrij\|zonder\|basis`, `VNW\|onbep\|grad\|stan\|vrij\|zonder\|comp`, `VNW\|onbep\|grad\|stan\|vrij\|zonder\|sup`, `VNW\|onbep\|pron\|gen\|vol\|3p\|ev`, `VNW\|onbep\|pron\|stan\|vol\|3o\|ev`, `VNW\|onbep\|pron\|stan\|vol\|3p\|ev`, `VNW\|pers\|pron\|gen\|vol\|2\|getal`, `VNW\|pers\|pron\|nomin\|nadr\|3m\|ev\|masc`, `VNW\|pers\|pron\|nomin\|nadr\|3v\|ev\|fem`, `VNW\|pers\|pron\|nomin\|red\|1\|mv`, `VNW\|pers\|pron\|nomin\|red\|2v\|ev`, `VNW\|pers\|pron\|nomin\|red\|2\|getal`, `VNW\|pers\|pron\|nomin\|red\|3p\|ev\|masc`, `VNW\|pers\|pron\|nomin\|red\|3\|ev\|masc`, `VNW\|pers\|pron\|nomin\|vol\|1\|ev`, `VNW\|pers\|pron\|nomin\|vol\|1\|mv`, `VNW\|pers\|pron\|nomin\|vol\|2b\|getal`, `VNW\|pers\|pron\|nomin\|vol\|2v\|ev`, `VNW\|pers\|pron\|nomin\|vol\|2\|getal`, `VNW\|pers\|pron\|nomin\|vol\|3p\|mv`, `VNW\|pers\|pron\|nomin\|vol\|3v\|ev\|fem`, `VNW\|pers\|pron\|nomin\|vol\|3\|ev\|masc`, `VNW\|pers\|pron\|obl\|nadr\|3m\|ev\|masc`, `VNW\|pers\|pron\|obl\|red\|3\|ev\|masc`, `VNW\|pers\|pron\|obl\|vol\|2v\|ev`, `VNW\|pers\|pron\|obl\|vol\|3p\|mv`, `VNW\|pers\|pron\|obl\|vol\|3\|ev\|masc`, `VNW\|pers\|pron\|obl\|vol\|3\|getal\|fem`, `VNW\|pers\|pron\|stan\|nadr\|2v\|mv`, `VNW\|pers\|pron\|stan\|red\|3\|ev\|fem`, `VNW\|pers\|pron\|stan\|red\|3\|ev\|onz`, `VNW\|pers\|pron\|stan\|red\|3\|mv`, `VNW\|pr\|pron\|obl\|nadr\|1\|ev`, `VNW\|pr\|pron\|obl\|nadr\|2v\|getal`, `VNW\|pr\|pron\|obl\|nadr\|2\|getal`, `VNW\|pr\|pron\|obl\|red\|1\|ev`, `VNW\|pr\|pron\|obl\|red\|2v\|getal`, `VNW\|pr\|pron\|obl\|vol\|1\|ev`, `VNW\|pr\|pron\|obl\|vol\|1\|mv`, `VNW\|pr\|pron\|obl\|vol\|2\|getal`, `VNW\|recip\|pron\|gen\|vol\|persoon\|mv`, `VNW\|recip\|pron\|obl\|vol\|persoon\|mv`, `VNW\|refl\|pron\|obl\|nadr\|3\|getal`, `VNW\|refl\|pron\|obl\|red\|3\|getal`, `VNW\|vb\|adv-pron\|obl\|vol\|3o\|getal`, `VNW\|vb\|det\|stan\|nom\|met-e\|zonder-n`, `VNW\|vb\|det\|stan\|prenom\|met-e\|rest`, `VNW\|vb\|det\|stan\|prenom\|zonder\|evon`, `VNW\|vb\|pron\|gen\|vol\|3m\|ev`, `VNW\|vb\|pron\|gen\|vol\|3p\|mv`, `VNW\|vb\|pron\|gen\|vol\|3v\|ev`, `VNW\|vb\|pron\|stan\|vol\|3o\|ev`, `VNW\|vb\|pron\|stan\|vol\|3p\|getal`, `VZ\|fin`, `VZ\|init`, `VZ\|versm`, `WW\|inf\|nom\|zonder\|zonder-n`, `WW\|inf\|prenom\|met-e`, `WW\|inf\|vrij\|zonder`, `WW\|od\|nom\|met-e\|mv-n`, `WW\|od\|nom\|met-e\|zonder-n`, `WW\|od\|prenom\|met-e`, `WW\|od\|prenom\|zonder`, `WW\|od\|vrij\|zonder`, `WW\|pv\|conj\|ev`, `WW\|pv\|tgw\|ev`, `WW\|pv\|tgw\|met-t`, `WW\|pv\|tgw\|mv`, `WW\|pv\|verl\|ev`, `WW\|pv\|verl\|mv`, `WW\|vd\|nom\|met-e\|mv-n`, `WW\|vd\|nom\|met-e\|zonder-n`, `WW\|vd\|prenom\|met-e`, `WW\|vd\|prenom\|zonder`, `WW\|vd\|vrij\|zonder` |
100
  | **`parser`** | `ROOT`, `acl`, `acl:relcl`, `advcl`, `advmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `cc`, `ccomp`, `compound:prt`, `conj`, `cop`, `csubj`, `dep`, `det`, `expl`, `expl:pv`, `fixed`, `flat`, `iobj`, `mark`, `nmod`, `nmod:poss`, `nsubj`, `nsubj:pass`, `nummod`, `obj`, `obl`, `obl:agent`, `orphan`, `parataxis`, `punct`, `xcomp` |
101
  | **`ner`** | `CARDINAL`, `DATE`, `EVENT`, `FAC`, `GPE`, `LANGUAGE`, `LAW`, `LOC`, `MONEY`, `NORP`, `ORDINAL`, `ORG`, `PERCENT`, `PERSON`, `PRODUCT`, `QUANTITY`, `TIME`, `WORK_OF_ART` |
102
 
@@ -106,22 +106,22 @@ Dutch pipeline optimized for CPU. Components: tok2vec, morphologizer, tagger, pa
106
 
107
  | Type | Score |
108
  | --- | --- |
109
- | `TAG_ACC` | 95.34 |
110
- | `SENTS_P` | 85.95 |
111
- | `SENTS_R` | 89.10 |
112
- | `SENTS_F` | 87.50 |
113
- | `DEP_UAS` | 86.98 |
114
- | `DEP_LAS` | 82.36 |
115
- | `ENTS_P` | 76.53 |
116
- | `ENTS_R` | 74.41 |
117
- | `ENTS_F` | 75.46 |
118
  | `TOKEN_ACC` | 99.97 |
119
  | `TOKEN_P` | 99.74 |
120
  | `TOKEN_R` | 99.76 |
121
  | `TOKEN_F` | 99.75 |
122
- | `POS_ACC` | 96.62 |
123
- | `MORPH_ACC` | 96.36 |
124
- | `MORPH_MICRO_P` | 97.22 |
125
- | `MORPH_MICRO_R` | 95.52 |
126
- | `MORPH_MICRO_F` | 96.36 |
127
- | `LEMMA_ACC` | 94.18 |
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
+ value: 0.7845934379
18
  - name: NER Recall
19
  type: recall
20
+ value: 0.7607192254
21
  - name: NER F Score
22
  type: f_score
23
+ value: 0.7724719101
24
  - task:
25
  name: TAG
26
  type: token-classification
27
  metrics:
28
  - name: TAG (XPOS) Accuracy
29
  type: accuracy
30
+ value: 0.9538495093
31
  - task:
32
  name: POS
33
  type: token-classification
34
  metrics:
35
  - name: POS (UPOS) Accuracy
36
  type: accuracy
37
+ value: 0.9666739368
38
  - task:
39
  name: MORPH
40
  type: token-classification
41
  metrics:
42
  - name: Morph (UFeats) Accuracy
43
  type: accuracy
44
+ value: 0.9639000982
45
  - task:
46
  name: LEMMA
47
  type: token-classification
48
  metrics:
49
  - name: Lemma Accuracy
50
  type: accuracy
51
+ value: 0.9568877162
52
  - task:
53
  name: UNLABELED_DEPENDENCIES
54
  type: token-classification
55
  metrics:
56
  - name: Unlabeled Attachment Score (UAS)
57
  type: f_score
58
+ value: 0.8682669641
59
  - task:
60
  name: LABELED_DEPENDENCIES
61
  type: token-classification
62
  metrics:
63
  - name: Labeled Attachment Score (LAS)
64
  type: f_score
65
+ value: 0.8246685762
66
  - task:
67
  name: SENTS
68
  type: token-classification
69
  metrics:
70
  - name: Sentences F-Score
71
  type: f_score
72
+ value: 0.8765867419
73
  ---
74
  ### Details: https://spacy.io/models/nl#nl_core_news_lg
75
 
78
  | Feature | Description |
79
  | --- | --- |
80
  | **Name** | `nl_core_news_lg` |
81
+ | **Version** | `3.4.0` |
82
+ | **spaCy** | `>=3.4.0,<3.5.0` |
83
  | **Default Pipeline** | `tok2vec`, `morphologizer`, `tagger`, `parser`, `lemmatizer`, `attribute_ruler`, `ner` |
84
  | **Components** | `tok2vec`, `morphologizer`, `tagger`, `parser`, `lemmatizer`, `senter`, `attribute_ruler`, `ner` |
85
  | **Vectors** | 500000 keys, 500000 unique vectors (300 dimensions) |
86
+ | **Sources** | [UD Dutch LassySmall v2.8](https://github.com/UniversalDependencies/UD_Dutch-LassySmall) (Bouma, Gosse; van Noord, Gertjan)<br />[Dutch NER Annotations for UD LassySmall](https://nlp.town) (NLP Town)<br />[UD Dutch Alpino v2.8](https://github.com/UniversalDependencies/UD_Dutch-Alpino) (Zeman, Daniel; Žabokrtský, Zdeněk; Bouma, Gosse; van Noord, Gertjan)<br />[Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)](https://spacy.io) (Explosion) |
87
  | **License** | `CC BY-SA 4.0` |
88
  | **Author** | [Explosion](https://explosion.ai) |
89
 
91
 
92
  <details>
93
 
94
+ <summary>View label scheme (323 labels for 4 components)</summary>
95
 
96
  | Component | Labels |
97
  | --- | --- |
98
+ | **`morphologizer`** | `POS=PRON\|Person=3\|PronType=Dem`, `Number=Sing\|POS=AUX\|Tense=Pres\|VerbForm=Fin`, `POS=ADV`, `POS=VERB\|VerbForm=Part`, `POS=PUNCT`, `Number=Sing\|POS=AUX\|Tense=Past\|VerbForm=Fin`, `POS=ADP`, `POS=NUM`, `Number=Plur\|POS=NOUN`, `POS=VERB\|VerbForm=Inf`, `POS=SCONJ`, `Definite=Def\|POS=DET`, `Gender=Com\|Number=Sing\|POS=NOUN`, `Number=Sing\|POS=VERB\|Tense=Pres\|VerbForm=Fin`, `Degree=Pos\|POS=ADJ`, `Gender=Neut\|Number=Sing\|POS=PROPN`, `Gender=Com\|Number=Sing\|POS=PROPN`, `POS=AUX\|VerbForm=Inf`, `Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Fin`, `POS=DET`, `Gender=Neut\|Number=Sing\|POS=NOUN`, `POS=PRON\|Person=3\|PronType=Prs`, `POS=CCONJ`, `Number=Plur\|POS=VERB\|Tense=Pres\|VerbForm=Fin`, `POS=PRON\|Person=3\|PronType=Ind`, `Degree=Cmp\|POS=ADJ`, `Case=Nom\|POS=PRON\|Person=1\|PronType=Prs`, `Definite=Ind\|POS=DET`, `Case=Nom\|POS=PRON\|Person=3\|PronType=Prs`, `POS=PRON\|Person=3\|Poss=Yes\|PronType=Prs`, `Number=Plur\|POS=AUX\|Tense=Pres\|VerbForm=Fin`, `POS=PRON\|PronType=Rel`, `Case=Acc\|POS=PRON\|Person=1\|PronType=Prs`, `Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Fin`, `Gender=Com,Neut\|Number=Sing\|POS=NOUN`, `Case=Acc\|POS=PRON\|Person=3\|PronType=Prs\|Reflex=Yes`, `Case=Acc\|POS=PRON\|Person=3\|PronType=Prs`, `POS=PROPN`, `POS=PRON\|PronType=Ind`, `POS=PRON\|Person=3\|PronType=Int`, `Case=Acc\|POS=PRON\|PronType=Rcp`, `Number=Plur\|POS=AUX\|Tense=Past\|VerbForm=Fin`, `Number=Sing\|POS=NOUN`, `POS=PRON\|Person=1\|Poss=Yes\|PronType=Prs`, `POS=SYM`, `Abbr=Yes\|POS=X`, `Gender=Com,Neut\|Number=Sing\|POS=PROPN`, `Degree=Sup\|POS=ADJ`, `POS=ADJ`, `Number=Sing\|POS=PROPN`, `POS=PRON\|PronType=Dem`, `POS=AUX\|VerbForm=Part`, `POS=SPACE`, `POS=PRON\|Person=3\|PronType=Rel`, `Number=Plur\|POS=PROPN`, `POS=PRON\|Person=2\|Poss=Yes\|PronType=Prs`, `Case=Dat\|POS=PRON\|PronType=Dem`, `Case=Nom\|POS=PRON\|Person=2\|PronType=Prs`, `POS=INTJ`, `Case=Acc\|POS=PRON\|Person=2\|PronType=Prs`, `Case=Gen\|POS=PRON\|Person=3\|Poss=Yes\|PronType=Prs`, `POS=PRON\|PronType=Int`, `POS=PRON\|Person=2\|PronType=Prs`, `POS=PRON\|Person=3`, `Case=Gen\|POS=PRON\|Person=2\|PronType=Prs`, `POS=X` |
99
+ | **`tagger`** | `ADJ\|nom\|basis\|met-e\|mv-n`, `ADJ\|nom\|basis\|met-e\|zonder-n\|bijz`, `ADJ\|nom\|basis\|met-e\|zonder-n\|stan`, `ADJ\|nom\|basis\|zonder\|mv-n`, `ADJ\|nom\|basis\|zonder\|zonder-n`, `ADJ\|nom\|comp\|met-e\|mv-n`, `ADJ\|nom\|comp\|met-e\|zonder-n\|stan`, `ADJ\|nom\|sup\|met-e\|mv-n`, `ADJ\|nom\|sup\|met-e\|zonder-n\|bijz`, `ADJ\|nom\|sup\|met-e\|zonder-n\|stan`, `ADJ\|nom\|sup\|zonder\|zonder-n`, `ADJ\|postnom\|basis\|met-s`, `ADJ\|postnom\|basis\|zonder`, `ADJ\|postnom\|comp\|met-s`, `ADJ\|prenom\|basis\|met-e\|bijz`, `ADJ\|prenom\|basis\|met-e\|stan`, `ADJ\|prenom\|basis\|zonder`, `ADJ\|prenom\|comp\|met-e\|stan`, `ADJ\|prenom\|comp\|zonder`, `ADJ\|prenom\|sup\|met-e\|stan`, `ADJ\|prenom\|sup\|zonder`, `ADJ\|vrij\|basis\|zonder`, `ADJ\|vrij\|comp\|zonder`, `ADJ\|vrij\|dim\|zonder`, `ADJ\|vrij\|sup\|zonder`, `BW`, `LET`, `LID\|bep\|dat\|evmo`, `LID\|bep\|gen\|evmo`, `LID\|bep\|gen\|rest3`, `LID\|bep\|stan\|evon`, `LID\|bep\|stan\|rest`, `LID\|onbep\|stan\|agr`, `N\|eigen\|ev\|basis\|gen`, `N\|eigen\|ev\|basis\|genus\|stan`, `N\|eigen\|ev\|basis\|onz\|stan`, `N\|eigen\|ev\|basis\|zijd\|stan`, `N\|eigen\|ev\|dim\|onz\|stan`, `N\|eigen\|mv\|basis`, `N\|soort\|ev\|basis\|dat`, `N\|soort\|ev\|basis\|gen`, `N\|soort\|ev\|basis\|genus\|stan`, `N\|soort\|ev\|basis\|onz\|stan`, `N\|soort\|ev\|basis\|zijd\|stan`, `N\|soort\|ev\|dim\|onz\|stan`, `N\|soort\|mv\|basis`, `N\|soort\|mv\|dim`, `SPEC\|afgebr`, `SPEC\|afk`, `SPEC\|deeleigen`, `SPEC\|enof`, `SPEC\|meta`, `SPEC\|symb`, `SPEC\|vreemd`, `TSW`, `TW\|hoofd\|nom\|mv-n\|basis`, `TW\|hoofd\|nom\|mv-n\|dim`, `TW\|hoofd\|nom\|zonder-n\|basis`, `TW\|hoofd\|nom\|zonder-n\|dim`, `TW\|hoofd\|prenom\|stan`, `TW\|hoofd\|vrij`, `TW\|rang\|nom\|mv-n`, `TW\|rang\|nom\|zonder-n`, `TW\|rang\|prenom\|stan`, `VG\|neven`, `VG\|onder`, `VNW\|aanw\|adv-pron\|obl\|vol\|3o\|getal`, `VNW\|aanw\|adv-pron\|stan\|red\|3\|getal`, `VNW\|aanw\|det\|dat\|nom\|met-e\|zonder-n`, `VNW\|aanw\|det\|dat\|prenom\|met-e\|evmo`, `VNW\|aanw\|det\|gen\|prenom\|met-e\|rest3`, `VNW\|aanw\|det\|stan\|nom\|met-e\|mv-n`, `VNW\|aanw\|det\|stan\|nom\|met-e\|zonder-n`, `VNW\|aanw\|det\|stan\|prenom\|met-e\|rest`, `VNW\|aanw\|det\|stan\|prenom\|zonder\|agr`, `VNW\|aanw\|det\|stan\|prenom\|zonder\|evon`, `VNW\|aanw\|det\|stan\|prenom\|zonder\|rest`, `VNW\|aanw\|det\|stan\|vrij\|zonder`, `VNW\|aanw\|pron\|gen\|vol\|3m\|ev`, `VNW\|aanw\|pron\|stan\|vol\|3o\|ev`, `VNW\|aanw\|pron\|stan\|vol\|3\|getal`, `VNW\|betr\|det\|stan\|nom\|met-e\|zonder-n`, `VNW\|betr\|det\|stan\|nom\|zonder\|zonder-n`, `VNW\|betr\|pron\|stan\|vol\|3\|ev`, `VNW\|betr\|pron\|stan\|vol\|persoon\|getal`, `VNW\|bez\|det\|gen\|vol\|3\|ev\|prenom\|met-e\|rest3`, `VNW\|bez\|det\|stan\|nadr\|2v\|mv\|prenom\|zonder\|agr`, `VNW\|bez\|det\|stan\|red\|1\|ev\|prenom\|zonder\|agr`, `VNW\|bez\|det\|stan\|red\|2v\|ev\|prenom\|zonder\|agr`, `VNW\|bez\|det\|stan\|red\|3\|ev\|prenom\|zonder\|agr`, `VNW\|bez\|det\|stan\|vol\|1\|ev\|prenom\|met-e\|rest`, `VNW\|bez\|det\|stan\|vol\|1\|ev\|prenom\|zonder\|agr`, `VNW\|bez\|det\|stan\|vol\|1\|mv\|prenom\|met-e\|rest`, `VNW\|bez\|det\|stan\|vol\|1\|mv\|prenom\|zonder\|evon`, `VNW\|bez\|det\|stan\|vol\|2v\|ev\|prenom\|zonder\|agr`, `VNW\|bez\|det\|stan\|vol\|2\|getal\|prenom\|zonder\|agr`, `VNW\|bez\|det\|stan\|vol\|3m\|ev\|nom\|met-e\|zonder-n`, `VNW\|bez\|det\|stan\|vol\|3m\|ev\|prenom\|met-e\|rest`, `VNW\|bez\|det\|stan\|vol\|3p\|mv\|prenom\|met-e\|rest`, `VNW\|bez\|det\|stan\|vol\|3v\|ev\|nom\|met-e\|zonder-n`, `VNW\|bez\|det\|stan\|vol\|3v\|ev\|prenom\|met-e\|rest`, `VNW\|bez\|det\|stan\|vol\|3\|ev\|prenom\|zonder\|agr`, `VNW\|bez\|det\|stan\|vol\|3\|mv\|prenom\|zonder\|agr`, `VNW\|excl\|pron\|stan\|vol\|3\|getal`, `VNW\|onbep\|adv-pron\|gen\|red\|3\|getal`, `VNW\|onbep\|adv-pron\|obl\|vol\|3o\|getal`, `VNW\|onbep\|det\|stan\|nom\|met-e\|mv-n`, `VNW\|onbep\|det\|stan\|nom\|met-e\|zonder-n`, `VNW\|onbep\|det\|stan\|nom\|zonder\|zonder-n`, `VNW\|onbep\|det\|stan\|prenom\|met-e\|agr`, `VNW\|onbep\|det\|stan\|prenom\|met-e\|evz`, `VNW\|onbep\|det\|stan\|prenom\|met-e\|mv`, `VNW\|onbep\|det\|stan\|prenom\|met-e\|rest`, `VNW\|onbep\|det\|stan\|prenom\|zonder\|agr`, `VNW\|onbep\|det\|stan\|prenom\|zonder\|evon`, `VNW\|onbep\|det\|stan\|vrij\|zonder`, `VNW\|onbep\|grad\|gen\|nom\|met-e\|mv-n\|basis`, `VNW\|onbep\|grad\|stan\|nom\|met-e\|mv-n\|basis`, `VNW\|onbep\|grad\|stan\|nom\|met-e\|mv-n\|sup`, `VNW\|onbep\|grad\|stan\|nom\|met-e\|zonder-n\|basis`, `VNW\|onbep\|grad\|stan\|nom\|met-e\|zonder-n\|sup`, `VNW\|onbep\|grad\|stan\|prenom\|met-e\|agr\|basis`, `VNW\|onbep\|grad\|stan\|prenom\|met-e\|agr\|comp`, `VNW\|onbep\|grad\|stan\|prenom\|met-e\|agr\|sup`, `VNW\|onbep\|grad\|stan\|prenom\|met-e\|mv\|basis`, `VNW\|onbep\|grad\|stan\|prenom\|zonder\|agr\|basis`, `VNW\|onbep\|grad\|stan\|prenom\|zonder\|agr\|comp`, `VNW\|onbep\|grad\|stan\|vrij\|zonder\|basis`, `VNW\|onbep\|grad\|stan\|vrij\|zonder\|comp`, `VNW\|onbep\|grad\|stan\|vrij\|zonder\|sup`, `VNW\|onbep\|pron\|gen\|vol\|3p\|ev`, `VNW\|onbep\|pron\|stan\|vol\|3o\|ev`, `VNW\|onbep\|pron\|stan\|vol\|3p\|ev`, `VNW\|pers\|pron\|gen\|vol\|2\|getal`, `VNW\|pers\|pron\|nomin\|nadr\|3m\|ev\|masc`, `VNW\|pers\|pron\|nomin\|nadr\|3v\|ev\|fem`, `VNW\|pers\|pron\|nomin\|red\|1\|mv`, `VNW\|pers\|pron\|nomin\|red\|2v\|ev`, `VNW\|pers\|pron\|nomin\|red\|2\|getal`, `VNW\|pers\|pron\|nomin\|red\|3p\|ev\|masc`, `VNW\|pers\|pron\|nomin\|red\|3\|ev\|masc`, `VNW\|pers\|pron\|nomin\|vol\|1\|ev`, `VNW\|pers\|pron\|nomin\|vol\|1\|mv`, `VNW\|pers\|pron\|nomin\|vol\|2b\|getal`, `VNW\|pers\|pron\|nomin\|vol\|2v\|ev`, `VNW\|pers\|pron\|nomin\|vol\|2\|getal`, `VNW\|pers\|pron\|nomin\|vol\|3p\|mv`, `VNW\|pers\|pron\|nomin\|vol\|3v\|ev\|fem`, `VNW\|pers\|pron\|nomin\|vol\|3\|ev\|masc`, `VNW\|pers\|pron\|obl\|nadr\|3m\|ev\|masc`, `VNW\|pers\|pron\|obl\|red\|3\|ev\|masc`, `VNW\|pers\|pron\|obl\|vol\|2v\|ev`, `VNW\|pers\|pron\|obl\|vol\|3p\|mv`, `VNW\|pers\|pron\|obl\|vol\|3\|ev\|masc`, `VNW\|pers\|pron\|obl\|vol\|3\|getal\|fem`, `VNW\|pers\|pron\|stan\|nadr\|2v\|mv`, `VNW\|pers\|pron\|stan\|red\|3\|ev\|fem`, `VNW\|pers\|pron\|stan\|red\|3\|ev\|onz`, `VNW\|pers\|pron\|stan\|red\|3\|mv`, `VNW\|pr\|pron\|obl\|nadr\|1\|ev`, `VNW\|pr\|pron\|obl\|nadr\|2v\|getal`, `VNW\|pr\|pron\|obl\|nadr\|2\|getal`, `VNW\|pr\|pron\|obl\|red\|1\|ev`, `VNW\|pr\|pron\|obl\|red\|2v\|getal`, `VNW\|pr\|pron\|obl\|vol\|1\|ev`, `VNW\|pr\|pron\|obl\|vol\|1\|mv`, `VNW\|pr\|pron\|obl\|vol\|2\|getal`, `VNW\|recip\|pron\|gen\|vol\|persoon\|mv`, `VNW\|recip\|pron\|obl\|vol\|persoon\|mv`, `VNW\|refl\|pron\|obl\|nadr\|3\|getal`, `VNW\|refl\|pron\|obl\|red\|3\|getal`, `VNW\|vb\|adv-pron\|obl\|vol\|3o\|getal`, `VNW\|vb\|det\|stan\|nom\|met-e\|zonder-n`, `VNW\|vb\|det\|stan\|prenom\|met-e\|rest`, `VNW\|vb\|det\|stan\|prenom\|zonder\|evon`, `VNW\|vb\|pron\|gen\|vol\|3m\|ev`, `VNW\|vb\|pron\|gen\|vol\|3p\|mv`, `VNW\|vb\|pron\|gen\|vol\|3v\|ev`, `VNW\|vb\|pron\|stan\|vol\|3o\|ev`, `VNW\|vb\|pron\|stan\|vol\|3p\|getal`, `VZ\|fin`, `VZ\|init`, `VZ\|versm`, `WW\|inf\|nom\|zonder\|zonder-n`, `WW\|inf\|prenom\|met-e`, `WW\|inf\|vrij\|zonder`, `WW\|od\|nom\|met-e\|mv-n`, `WW\|od\|nom\|met-e\|zonder-n`, `WW\|od\|prenom\|met-e`, `WW\|od\|prenom\|zonder`, `WW\|od\|vrij\|zonder`, `WW\|pv\|conj\|ev`, `WW\|pv\|tgw\|ev`, `WW\|pv\|tgw\|met-t`, `WW\|pv\|tgw\|mv`, `WW\|pv\|verl\|ev`, `WW\|pv\|verl\|mv`, `WW\|vd\|nom\|met-e\|mv-n`, `WW\|vd\|nom\|met-e\|zonder-n`, `WW\|vd\|prenom\|met-e`, `WW\|vd\|prenom\|zonder`, `WW\|vd\|vrij\|zonder`, `_SP` |
100
  | **`parser`** | `ROOT`, `acl`, `acl:relcl`, `advcl`, `advmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `cc`, `ccomp`, `compound:prt`, `conj`, `cop`, `csubj`, `dep`, `det`, `expl`, `expl:pv`, `fixed`, `flat`, `iobj`, `mark`, `nmod`, `nmod:poss`, `nsubj`, `nsubj:pass`, `nummod`, `obj`, `obl`, `obl:agent`, `orphan`, `parataxis`, `punct`, `xcomp` |
101
  | **`ner`** | `CARDINAL`, `DATE`, `EVENT`, `FAC`, `GPE`, `LANGUAGE`, `LAW`, `LOC`, `MONEY`, `NORP`, `ORDINAL`, `ORG`, `PERCENT`, `PERSON`, `PRODUCT`, `QUANTITY`, `TIME`, `WORK_OF_ART` |
102
 
106
 
107
  | Type | Score |
108
  | --- | --- |
109
+ | `TAG_ACC` | 95.38 |
110
+ | `SENTS_P` | 86.20 |
111
+ | `SENTS_R` | 89.17 |
112
+ | `SENTS_F` | 87.66 |
113
+ | `DEP_UAS` | 86.83 |
114
+ | `DEP_LAS` | 82.47 |
115
+ | `ENTS_P` | 78.46 |
116
+ | `ENTS_R` | 76.07 |
117
+ | `ENTS_F` | 77.25 |
118
  | `TOKEN_ACC` | 99.97 |
119
  | `TOKEN_P` | 99.74 |
120
  | `TOKEN_R` | 99.76 |
121
  | `TOKEN_F` | 99.75 |
122
+ | `POS_ACC` | 96.67 |
123
+ | `MORPH_ACC` | 96.39 |
124
+ | `MORPH_MICRO_P` | 97.41 |
125
+ | `MORPH_MICRO_R` | 95.71 |
126
+ | `MORPH_MICRO_F` | 96.56 |
127
+ | `LEMMA_ACC` | 95.69 |
accuracy.json CHANGED
@@ -1,175 +1,175 @@
1
  {
2
- "tag_acc": 0.9534133043,
3
- "sents_p": 0.8595155709,
4
- "sents_r": 0.8909612626,
5
- "sents_f": 0.8749559704,
6
- "dep_uas": 0.8698053923,
7
- "dep_las": 0.8235860531,
8
  "dep_las_per_type": {
9
  "nmod:poss": {
10
- "p": 0.9456521739,
11
- "r": 0.9525547445,
12
- "f": 0.9490909091
13
  },
14
  "nsubj": {
15
- "p": 0.8466494845,
16
- "r": 0.8639053254,
17
- "f": 0.8551903677
18
  },
19
  "aux": {
20
- "p": 0.9102990033,
21
- "r": 0.9013157895,
22
- "f": 0.905785124
23
  },
24
  "advmod": {
25
- "p": 0.7967269595,
26
- "r": 0.8258928571,
27
- "f": 0.8110477861
28
  },
29
  "root": {
30
- "p": 0.8636678201,
31
- "r": 0.8952654232,
32
- "f": 0.8791828108
33
  },
34
  "det": {
35
- "p": 0.946263644,
36
- "r": 0.9753353527,
37
- "f": 0.9605795866
38
  },
39
  "amod": {
40
- "p": 0.8720420684,
41
- "r": 0.8915770609,
42
- "f": 0.8817013735
43
  },
44
  "obl": {
45
- "p": 0.7456021651,
46
- "r": 0.7461069736,
47
- "f": 0.7458544839
48
  },
49
  "mark": {
50
- "p": 0.8880866426,
51
- "r": 0.8945454545,
52
- "f": 0.8913043478
53
  },
54
  "ccomp": {
55
- "p": 0.6727272727,
56
- "r": 0.691588785,
57
- "f": 0.6820276498
58
  },
59
  "case": {
60
- "p": 0.9379652605,
61
- "r": 0.9606099111,
62
- "f": 0.9491525424
63
  },
64
  "appos": {
65
- "p": 0.7060702875,
66
- "r": 0.6696969697,
67
- "f": 0.6874027994
68
  },
69
  "obj": {
70
- "p": 0.7875968992,
71
- "r": 0.7755725191,
72
- "f": 0.7815384615
73
  },
74
  "compound:prt": {
75
- "p": 0.7755102041,
76
- "r": 0.7136150235,
77
- "f": 0.7432762836
78
  },
79
  "xcomp": {
80
- "p": 0.6765799257,
81
- "r": 0.6618181818,
82
- "f": 0.6691176471
83
  },
84
  "flat": {
85
- "p": 0.8115124153,
86
- "r": 0.7624602333,
87
- "f": 0.7862219792
88
  },
89
  "expl:pv": {
90
- "p": 0.7674418605,
91
- "r": 0.75,
92
- "f": 0.7586206897
93
  },
94
  "acl": {
95
- "p": 0.4615384615,
96
- "r": 0.3673469388,
97
- "f": 0.4090909091
98
  },
99
  "advcl": {
100
- "p": 0.5024875622,
101
- "r": 0.454954955,
102
- "f": 0.4775413712
103
  },
104
  "nummod": {
105
- "p": 0.8141025641,
106
- "r": 0.8466666667,
107
- "f": 0.8300653595
108
  },
109
  "nmod": {
110
- "p": 0.7278742763,
111
- "r": 0.7652173913,
112
- "f": 0.746078847
113
  },
114
  "cc": {
115
- "p": 0.8544776119,
116
- "r": 0.8674242424,
117
- "f": 0.8609022556
118
  },
119
  "conj": {
120
- "p": 0.6463245492,
121
- "r": 0.6331521739,
122
- "f": 0.6396705559
123
  },
124
  "nsubj:pass": {
125
- "p": 0.8083832335,
126
- "r": 0.8490566038,
127
- "f": 0.8282208589
128
  },
129
  "aux:pass": {
130
- "p": 0.8871794872,
131
- "r": 0.9611111111,
132
- "f": 0.9226666667
133
- },
134
- "iobj": {
135
- "p": 0.5652173913,
136
- "r": 0.3939393939,
137
- "f": 0.4642857143
138
  },
139
  "cop": {
140
- "p": 0.7789473684,
141
  "r": 0.8131868132,
142
- "f": 0.7956989247
143
  },
144
  "parataxis": {
145
- "p": 0.3663366337,
146
- "r": 0.268115942,
147
- "f": 0.309623431
148
  },
149
  "acl:relcl": {
150
- "p": 0.6956521739,
151
- "r": 0.7044025157,
152
- "f": 0.7
 
 
 
 
 
 
 
 
 
 
153
  },
154
  "fixed": {
155
- "p": 0.721448468,
156
- "r": 0.4683544304,
157
- "f": 0.5679824561
158
  },
159
  "obl:agent": {
160
- "p": 0.9615384615,
161
- "r": 0.8620689655,
162
- "f": 0.9090909091
163
- },
164
- "expl": {
165
- "p": 0.4,
166
- "r": 0.4761904762,
167
- "f": 0.4347826087
168
  },
169
  "csubj": {
170
- "p": 0.6111111111,
171
- "r": 0.55,
172
- "f": 0.5789473684
173
  },
174
  "dep": {
175
  "p": 0.0,
@@ -182,16 +182,16 @@
182
  "f": 0.0
183
  }
184
  },
185
- "ents_p": 0.7652916074,
186
- "ents_r": 0.7441217151,
187
- "ents_f": 0.7545582048,
188
  "ents_per_type": {
189
- "ORG": {
190
  "p": 0.0,
191
  "r": 0.0,
192
  "f": 0.0
193
  },
194
- "PERSON": {
195
  "p": 0.0,
196
  "r": 0.0,
197
  "f": 0.0
@@ -206,12 +206,17 @@
206
  "r": 0.0,
207
  "f": 0.0
208
  },
209
- "NORP": {
210
  "p": 0.0,
211
  "r": 0.0,
212
  "f": 0.0
213
  },
214
- "DATE": {
 
 
 
 
 
215
  "p": 0.0,
216
  "r": 0.0,
217
  "f": 0.0
@@ -221,12 +226,12 @@
221
  "r": 0.0,
222
  "f": 0.0
223
  },
224
- "PRODUCT": {
225
  "p": 0.0,
226
  "r": 0.0,
227
  "f": 0.0
228
  },
229
- "GPE": {
230
  "p": 0.0,
231
  "r": 0.0,
232
  "f": 0.0
@@ -246,12 +251,12 @@
246
  "r": 0.0,
247
  "f": 0.0
248
  },
249
- "LOC": {
250
  "p": 0.0,
251
  "r": 0.0,
252
  "f": 0.0
253
  },
254
- "WORK_OF_ART": {
255
  "p": 0.0,
256
  "r": 0.0,
257
  "f": 0.0
@@ -266,77 +271,72 @@
266
  "r": 0.0,
267
  "f": 0.0
268
  },
269
- "PERCENT": {
270
- "p": 0.0,
271
- "r": 0.0,
272
- "f": 0.0
273
- },
274
  "LAW": {
275
  "p": 0.0,
276
  "r": 0.0,
277
  "f": 0.0
278
  }
279
  },
280
- "speed": 10256.4374458498,
281
  "token_acc": 0.9997165842,
282
  "token_p": 0.9974281853,
283
  "token_r": 0.9975586363,
284
  "token_f": 0.9974934066,
285
- "pos_acc": 0.9661941112,
286
- "morph_acc": 0.9635947213,
287
- "morph_micro_p": 0.9722389581,
288
- "morph_micro_r": 0.9551518463,
289
- "morph_micro_f": 0.9636196601,
290
  "morph_per_feat": {
291
  "Person": {
292
- "p": 0.9892891918,
293
- "r": 0.9713193117,
294
- "f": 0.9802219006
295
  },
296
  "Poss": {
297
- "p": 0.9811320755,
298
- "r": 0.9961685824,
299
- "f": 0.9885931559
300
  },
301
  "PronType": {
302
- "p": 0.9880647911,
303
- "r": 0.9634247714,
304
- "f": 0.9755892256
305
  },
306
  "Gender": {
307
- "p": 0.9345253747,
308
- "r": 0.90409565,
309
- "f": 0.9190587018
310
  },
311
  "Number": {
312
- "p": 0.9816793893,
313
- "r": 0.9624307738,
314
- "f": 0.9719597914
315
  },
316
  "Tense": {
317
- "p": 0.9805339266,
318
- "r": 0.9692138538,
319
- "f": 0.9748410285
320
  },
321
  "VerbForm": {
322
- "p": 0.9623188406,
323
- "r": 0.9557394746,
324
- "f": 0.9590178733
325
  },
326
  "Degree": {
327
- "p": 0.9655677656,
328
- "r": 0.9468390805,
329
- "f": 0.9561117156
330
  },
331
  "Definite": {
332
- "p": 0.9942680776,
333
- "r": 0.9925176056,
334
- "f": 0.9933920705
335
  },
336
  "Case": {
337
- "p": 0.998,
338
- "r": 0.9940239044,
339
- "f": 0.996007984
340
  },
341
  "Reflex": {
342
  "p": 1.0,
@@ -345,9 +345,9 @@
345
  },
346
  "Abbr": {
347
  "p": 1.0,
348
- "r": 0.5,
349
- "f": 0.6666666667
350
  }
351
  },
352
- "lemma_acc": 0.9417537126
353
  }
1
  {
2
+ "tag_acc": 0.9538495093,
3
+ "sents_p": 0.8619972261,
4
+ "sents_r": 0.8916786227,
5
+ "sents_f": 0.8765867419,
6
+ "dep_uas": 0.8682669641,
7
+ "dep_las": 0.8246685762,
8
  "dep_las_per_type": {
9
  "nmod:poss": {
10
+ "p": 0.9522058824,
11
+ "r": 0.9452554745,
12
+ "f": 0.9487179487
13
  },
14
  "nsubj": {
15
+ "p": 0.8483072917,
16
+ "r": 0.8566732413,
17
+ "f": 0.8524697416
18
  },
19
  "aux": {
20
+ "p": 0.9150326797,
21
+ "r": 0.9210526316,
22
+ "f": 0.9180327869
23
  },
24
  "advmod": {
25
+ "p": 0.7968337731,
26
+ "r": 0.8089285714,
27
+ "f": 0.8028356225
28
  },
29
  "root": {
30
+ "p": 0.8689320388,
31
+ "r": 0.8988522238,
32
+ "f": 0.8836389281
33
  },
34
  "det": {
35
+ "p": 0.9430724152,
36
+ "r": 0.9749026395,
37
+ "f": 0.9587234043
38
  },
39
  "amod": {
40
+ "p": 0.8828193833,
41
+ "r": 0.8978494624,
42
+ "f": 0.8902709907
43
  },
44
  "obl": {
45
+ "p": 0.7610619469,
46
+ "r": 0.7569397427,
47
+ "f": 0.7589952478
48
  },
49
  "mark": {
50
+ "p": 0.8918918919,
51
+ "r": 0.9,
52
+ "f": 0.8959276018
53
  },
54
  "ccomp": {
55
+ "p": 0.6979166667,
56
+ "r": 0.6261682243,
57
+ "f": 0.6600985222
58
  },
59
  "case": {
60
+ "p": 0.9368811881,
61
+ "r": 0.9618805591,
62
+ "f": 0.9492163009
63
  },
64
  "appos": {
65
+ "p": 0.707165109,
66
+ "r": 0.6878787879,
67
+ "f": 0.6973886329
68
  },
69
  "obj": {
70
+ "p": 0.7573529412,
71
+ "r": 0.786259542,
72
+ "f": 0.7715355805
73
  },
74
  "compound:prt": {
75
+ "p": 0.7692307692,
76
+ "r": 0.7042253521,
77
+ "f": 0.7352941176
78
  },
79
  "xcomp": {
80
+ "p": 0.6642599278,
81
+ "r": 0.6690909091,
82
+ "f": 0.6666666667
83
  },
84
  "flat": {
85
+ "p": 0.8215527231,
86
+ "r": 0.7518557794,
87
+ "f": 0.7851605759
88
  },
89
  "expl:pv": {
90
+ "p": 0.7619047619,
91
+ "r": 0.7272727273,
92
+ "f": 0.7441860465
93
  },
94
  "acl": {
95
+ "p": 0.4404761905,
96
+ "r": 0.3775510204,
97
+ "f": 0.4065934066
98
  },
99
  "advcl": {
100
+ "p": 0.5577889447,
101
+ "r": 0.5,
102
+ "f": 0.5273159145
103
  },
104
  "nummod": {
105
+ "p": 0.8291139241,
106
+ "r": 0.8733333333,
107
+ "f": 0.8506493506
108
  },
109
  "nmod": {
110
+ "p": 0.7201986755,
111
+ "r": 0.7565217391,
112
+ "f": 0.737913486
113
  },
114
  "cc": {
115
+ "p": 0.8536585366,
116
+ "r": 0.8617424242,
117
+ "f": 0.8576814326
118
  },
119
  "conj": {
120
+ "p": 0.6527777778,
121
+ "r": 0.6385869565,
122
+ "f": 0.6456043956
123
  },
124
  "nsubj:pass": {
125
+ "p": 0.8128654971,
126
+ "r": 0.8742138365,
127
+ "f": 0.8424242424
128
  },
129
  "aux:pass": {
130
+ "p": 0.9095744681,
131
+ "r": 0.95,
132
+ "f": 0.9293478261
 
 
 
 
 
133
  },
134
  "cop": {
135
+ "p": 0.7900355872,
136
  "r": 0.8131868132,
137
+ "f": 0.8014440433
138
  },
139
  "parataxis": {
140
+ "p": 0.3349282297,
141
+ "r": 0.2536231884,
142
+ "f": 0.2886597938
143
  },
144
  "acl:relcl": {
145
+ "p": 0.701863354,
146
+ "r": 0.7106918239,
147
+ "f": 0.70625
148
+ },
149
+ "expl": {
150
+ "p": 0.4166666667,
151
+ "r": 0.4761904762,
152
+ "f": 0.4444444444
153
+ },
154
+ "iobj": {
155
+ "p": 0.6470588235,
156
+ "r": 0.3333333333,
157
+ "f": 0.44
158
  },
159
  "fixed": {
160
+ "p": 0.7255434783,
161
+ "r": 0.4828209765,
162
+ "f": 0.5798045603
163
  },
164
  "obl:agent": {
165
+ "p": 0.8387096774,
166
+ "r": 0.8965517241,
167
+ "f": 0.8666666667
 
 
 
 
 
168
  },
169
  "csubj": {
170
+ "p": 0.45,
171
+ "r": 0.45,
172
+ "f": 0.45
173
  },
174
  "dep": {
175
  "p": 0.0,
182
  "f": 0.0
183
  }
184
  },
185
+ "ents_p": 0.7845934379,
186
+ "ents_r": 0.7607192254,
187
+ "ents_f": 0.7724719101,
188
  "ents_per_type": {
189
+ "PERSON": {
190
  "p": 0.0,
191
  "r": 0.0,
192
  "f": 0.0
193
  },
194
+ "GPE": {
195
  "p": 0.0,
196
  "r": 0.0,
197
  "f": 0.0
206
  "r": 0.0,
207
  "f": 0.0
208
  },
209
+ "ORG": {
210
  "p": 0.0,
211
  "r": 0.0,
212
  "f": 0.0
213
  },
214
+ "WORK_OF_ART": {
215
+ "p": 0.0,
216
+ "r": 0.0,
217
+ "f": 0.0
218
+ },
219
+ "PRODUCT": {
220
  "p": 0.0,
221
  "r": 0.0,
222
  "f": 0.0
226
  "r": 0.0,
227
  "f": 0.0
228
  },
229
+ "NORP": {
230
  "p": 0.0,
231
  "r": 0.0,
232
  "f": 0.0
233
  },
234
+ "DATE": {
235
  "p": 0.0,
236
  "r": 0.0,
237
  "f": 0.0
251
  "r": 0.0,
252
  "f": 0.0
253
  },
254
+ "PERCENT": {
255
  "p": 0.0,
256
  "r": 0.0,
257
  "f": 0.0
258
  },
259
+ "LOC": {
260
  "p": 0.0,
261
  "r": 0.0,
262
  "f": 0.0
271
  "r": 0.0,
272
  "f": 0.0
273
  },
 
 
 
 
 
274
  "LAW": {
275
  "p": 0.0,
276
  "r": 0.0,
277
  "f": 0.0
278
  }
279
  },
280
+ "speed": 10089.2857654723,
281
  "token_acc": 0.9997165842,
282
  "token_p": 0.9974281853,
283
  "token_r": 0.9975586363,
284
  "token_f": 0.9974934066,
285
+ "pos_acc": 0.9666739368,
286
+ "morph_acc": 0.9639000982,
287
+ "morph_micro_p": 0.9741415265,
288
+ "morph_micro_r": 0.9571096845,
289
+ "morph_micro_f": 0.965550503,
290
  "morph_per_feat": {
291
  "Person": {
292
+ "p": 0.994157741,
293
+ "r": 0.9760994264,
294
+ "f": 0.9850458273
295
  },
296
  "Poss": {
297
+ "p": 0.9886363636,
298
+ "r": 1.0,
299
+ "f": 0.9942857143
300
  },
301
  "PronType": {
302
+ "p": 0.9914965986,
303
+ "r": 0.9692435578,
304
+ "f": 0.9802437999
305
  },
306
  "Gender": {
307
+ "p": 0.9321944809,
308
+ "r": 0.9023149326,
309
+ "f": 0.9170113754
310
  },
311
  "Number": {
312
+ "p": 0.9844227245,
313
+ "r": 0.9648256249,
314
+ "f": 0.9745256633
315
  },
316
  "Tense": {
317
+ "p": 0.9838530067,
318
+ "r": 0.9714128642,
319
+ "f": 0.977593361
320
  },
321
  "VerbForm": {
322
+ "p": 0.9687726943,
323
+ "r": 0.9600575747,
324
+ "f": 0.9643954455
325
  },
326
  "Degree": {
327
+ "p": 0.9598246896,
328
+ "r": 0.9439655172,
329
+ "f": 0.9518290474
330
  },
331
  "Definite": {
332
+ "p": 0.9964757709,
333
+ "r": 0.9955985915,
334
+ "f": 0.9960369881
335
  },
336
  "Case": {
337
+ "p": 0.998003992,
338
+ "r": 0.9960159363,
339
+ "f": 0.9970089731
340
  },
341
  "Reflex": {
342
  "p": 1.0,
345
  },
346
  "Abbr": {
347
  "p": 1.0,
348
+ "r": 0.5555555556,
349
+ "f": 0.7142857143
350
  }
351
  },
352
+ "lemma_acc": 0.9568877162
353
  }
lemmatizer/cfg CHANGED
@@ -6,988 +6,1028 @@
6
  10,
7
  12,
8
  14,
9
- 18,
10
- 22,
11
- 26,
12
- 27,
 
13
  30,
14
  32,
15
  34,
16
- 36,
17
- 39,
18
  43,
19
  45,
20
- 47,
21
- 51,
22
- 53,
 
23
  55,
24
- 57,
25
  58,
 
26
  61,
27
- 63,
28
  64,
 
29
  66,
30
- 67,
31
- 69,
32
  71,
 
33
  75,
34
  77,
35
  79,
36
  81,
37
  83,
38
  85,
39
- 87,
40
  89,
41
  91,
42
- 95,
43
- 97,
44
  98,
45
- 101,
46
- 103,
47
- 104,
48
- 106,
49
  109,
50
  110,
51
- 112,
52
- 113,
53
- 117,
54
- 120,
55
- 122,
56
- 125,
57
- 127,
58
- 130,
59
- 135,
60
- 136,
61
  138,
62
- 139,
63
  140,
64
- 142,
65
  143,
66
- 145,
67
  147,
 
 
68
  151,
69
- 154,
70
- 155,
71
- 156,
72
- 158,
73
- 160,
74
- 164,
75
  165,
76
- 167,
77
  168,
78
- 170,
79
- 172,
80
  173,
81
- 174,
82
  176,
83
- 179,
84
- 181,
85
- 184,
86
- 188,
 
87
  190,
88
- 193,
89
- 195,
90
  197,
91
  199,
92
- 119,
93
- 202,
94
  203,
95
- 205,
96
- 208,
97
  211,
98
  213,
99
- 216,
100
  220,
101
- 221,
102
- 223,
103
  225,
104
- 227,
105
- 231,
106
  234,
107
  236,
 
108
  239,
 
109
  243,
110
- 246,
111
- 248,
112
- 250,
 
113
  252,
114
  254,
115
- 255,
116
  258,
117
- 73,
118
  260,
119
  262,
120
  264,
121
  266,
122
- 269,
123
- 271,
124
- 273,
125
  276,
126
  278,
127
- 280,
128
  281,
129
  283,
130
  285,
 
131
  287,
132
- 289,
133
- 291,
134
- 293,
135
  294,
 
136
  296,
137
- 298,
138
- 300,
139
  301,
140
- 303,
 
 
141
  307,
142
  309,
143
- 312,
144
- 313,
145
- 315,
146
- 317,
147
- 318,
148
- 320,
149
- 321,
150
  325,
151
- 327,
152
- 329,
153
  330,
154
- 332,
155
  335,
156
  337,
157
- 338,
158
  339,
159
  341,
160
  343,
161
- 348,
 
 
162
  350,
163
- 352,
164
  355,
165
- 357,
166
- 360,
167
  362,
168
- 363,
169
- 364,
170
- 366,
171
  368,
172
- 369,
173
  373,
174
- 374,
175
  376,
176
  378,
177
- 380,
178
  382,
179
- 384,
180
- 386,
181
  388,
182
  389,
183
- 392,
 
 
184
  394,
 
185
  398,
186
  400,
187
- 401,
188
  404,
189
- 407,
190
- 409,
 
 
191
  412,
192
  414,
193
- 415,
194
- 417,
195
  418,
196
- 422,
 
 
197
  425,
198
- 428,
199
  429,
200
  430,
201
- 431,
202
- 432,
203
  434,
204
- 435,
205
  436,
206
  438,
207
- 441,
208
- 444,
 
 
209
  446,
210
  448,
211
  449,
212
- 450,
213
  452,
214
  454,
215
- 456,
216
  459,
217
- 461,
218
  463,
219
- 465,
220
  466,
221
- 468,
222
- 471,
223
  473,
224
  475,
225
- 476,
226
  480,
227
  482,
228
- 484,
229
  486,
230
- 489,
231
  491,
 
 
232
  494,
233
- 495,
234
- 497,
235
- 498,
236
  500,
237
- 501,
238
- 502,
239
- 504,
240
  505,
241
- 506,
242
- 509,
243
- 512,
 
244
  516,
245
- 517,
246
- 519,
247
  522,
 
248
  526,
249
  527,
 
250
  529,
251
  530,
252
- 531,
253
  533,
254
  535,
255
- 538,
256
  540,
257
- 542,
258
- 546,
259
- 547,
260
  548,
261
- 552,
262
- 553,
263
  554,
264
  555,
265
  557,
 
 
266
  559,
 
267
  563,
268
- 566,
269
  568,
 
270
  570,
271
  572,
272
- 575,
273
- 576,
274
  578,
275
- 581,
276
- 583,
277
- 586,
278
  587,
 
279
  589,
280
- 592,
281
  593,
282
- 594,
283
  595,
 
284
  596,
 
285
  599,
286
- 601,
 
287
  603,
 
288
  606,
289
- 607,
290
  609,
291
  610,
292
  613,
 
293
  615,
294
- 616,
295
  619,
296
- 620,
297
- 622,
298
  623,
299
- 625,
300
  626,
301
- 627,
302
- 629,
303
  631,
304
  634,
305
- 635,
 
306
  638,
307
- 640,
308
- 643,
309
  644,
310
- 645,
311
  647,
312
- 649,
313
- 653,
 
 
 
314
  657,
315
- 659,
316
  660,
317
- 661,
318
  662,
319
  663,
320
- 665,
321
  666,
322
- 667,
323
- 671,
324
- 673,
325
  674,
326
- 675,
327
  677,
328
  678,
329
  679,
330
  681,
331
  683,
332
- 684,
333
  686,
334
- 688,
335
  689,
 
336
  691,
337
- 692,
338
- 693,
339
- 696,
340
  697,
341
- 698,
342
- 700,
343
- 702,
344
  704,
 
345
  707,
346
  709,
347
- 710,
348
  712,
349
  713,
350
- 715,
351
  717,
352
- 718,
353
  719,
354
- 720,
355
- 722,
356
- 724,
357
- 727,
358
  729,
359
  730,
360
- 731,
361
  735,
362
- 736,
363
  737,
364
- 738,
365
- 739,
366
  740,
367
  741,
368
- 742,
369
  744,
370
- 746,
371
  747,
372
  748,
 
 
373
  751,
374
- 754,
 
375
  756,
 
376
  758,
377
  759,
378
  760,
 
379
  762,
380
- 764,
381
  765,
382
- 767,
383
  768,
 
384
  771,
 
385
  772,
386
- 775,
387
  777,
388
  778,
389
- 779,
390
  782,
391
  783,
392
  785,
393
  787,
 
394
  789,
 
395
  792,
396
- 794,
397
  795,
398
- 797,
 
399
  799,
400
  802,
401
  803,
 
402
  806,
403
- 809,
 
404
  811,
405
- 813,
406
  815,
407
- 35,
408
- 818,
409
  819,
410
  821,
411
- 824,
412
  825,
413
- 828,
 
 
414
  830,
415
  832,
 
 
416
  835,
417
  836,
418
- 838,
419
- 841,
 
 
420
  844,
421
  845,
 
422
  847,
423
- 848,
424
  849,
425
  850,
426
  851,
427
  853,
428
- 855,
429
  856,
430
- 857,
431
  858,
432
- 860,
433
  861,
434
- 864,
 
435
  867,
436
- 869,
437
- 872,
438
  874,
439
  875,
440
  877,
 
441
  880,
442
- 881,
443
  883,
444
- 884,
445
- 886,
446
- 888,
447
  889,
448
  890,
449
- 893,
450
  894,
451
  897,
452
  898,
 
453
  900,
454
- 902,
455
  903,
456
  906,
457
- 907,
458
  909,
459
- 910,
460
- 912,
461
  913,
462
- 914,
463
- 915,
464
- 916,
465
- 918,
466
  919,
467
  920,
468
- 922,
469
  925,
470
- 929,
471
- 933,
 
472
  934,
473
  935,
 
474
  937,
475
  938,
476
- 940,
477
  941,
478
  943,
479
- 944,
480
- 945,
481
  946,
 
482
  949,
483
- 950,
484
  952,
485
- 955,
 
486
  956,
487
  957,
488
- 958,
489
  959,
 
490
  961,
491
- 962,
492
  963,
 
493
  965,
494
- 968,
495
- 969,
496
- 971,
497
  973,
498
  976,
 
499
  979,
500
  980,
501
- 981,
502
  983,
 
503
  986,
504
- 987,
505
  990,
506
  992,
507
- 995,
508
- 997,
509
  999,
510
  1001,
 
511
  1004,
512
  1005,
513
- 1007,
514
- 1009,
515
- 1011,
516
- 1012,
517
- 1015,
 
518
  1018,
 
519
  1019,
520
- 1020,
521
- 1021,
522
  1024,
523
- 1027,
524
- 1029,
525
  1030,
526
- 1031,
527
  1033,
528
  1035,
529
- 1039,
530
- 1041,
 
531
  1042,
532
  1043,
533
- 1047,
534
- 1048,
 
535
  1053,
 
536
  1055,
537
- 1056,
538
  1057,
539
  1058,
540
- 1059,
541
  1060,
 
542
  1062,
 
543
  1064,
 
 
544
  1067,
545
  1069,
 
546
  1072,
 
 
547
  1075,
548
  1076,
549
- 1078,
550
  1079,
551
  1081,
552
- 1084,
553
  1085,
554
- 1087,
555
- 1089,
556
- 1091,
557
  1092,
558
  1093,
559
  1094,
560
- 1095,
561
  1096,
562
- 1097,
563
- 1102,
564
  1103,
565
- 1106,
566
- 1107,
567
  1108,
568
- 1110,
569
  1111,
570
  1112,
571
  1113,
 
572
  1116,
 
573
  1118,
574
- 1120,
 
575
  1123,
576
  1124,
577
- 1126,
578
  1128,
579
- 1130,
580
- 1131,
581
  1132,
582
  1133,
583
- 1136,
584
  1137,
585
- 1138,
586
  1140,
587
  1142,
588
- 1144,
589
  1146,
590
- 1147,
591
- 1148,
592
- 1152,
593
- 1153,
594
- 1154,
595
  1155,
 
596
  1158,
597
  1159,
598
  1160,
599
- 1162,
 
600
  1164,
601
  1165,
602
- 88,
603
  1166,
604
- 1169,
605
  1171,
606
- 1172,
 
607
  1175,
 
608
  1177,
609
  1179,
610
- 1181,
611
- 1183,
612
  1185,
613
  1186,
614
  1188,
615
- 1191,
616
  1192,
617
- 1193,
618
- 1194,
619
  1195,
620
  1196,
621
- 1200,
 
 
 
 
622
  1204,
623
  1205,
624
  1206,
625
- 1207,
626
- 1210,
627
- 1212,
628
- 311,
629
- 1213,
630
  1215,
631
- 1217,
632
  1218,
 
633
  1221,
634
  1222,
635
- 1224,
636
  1225,
637
  1226,
638
  1227,
639
  1228,
640
- 1229,
641
- 1230,
642
  1231,
643
- 1232,
644
  1233,
645
  1235,
646
- 1236,
 
647
  1239,
648
  1241,
649
  1242,
 
650
  1244,
651
  1245,
652
- 1247,
653
  1249,
654
- 1251,
655
  1252,
656
  1253,
 
 
 
657
  1257,
658
- 1258,
 
 
659
  1262,
660
- 1263,
661
  1265,
662
  1266,
663
  1267,
664
- 1270,
 
 
665
  1272,
 
666
  1274,
667
  1275,
668
- 1276,
669
  1277,
670
- 1280,
 
 
671
  1282,
672
- 1283,
673
  1286,
674
- 1287,
 
675
  1290,
 
676
  1292,
677
  1293,
678
- 1295,
 
679
  1299,
680
  1300,
681
  1302,
682
- 1303,
683
  1306,
 
684
  1308,
685
  1309,
 
686
  1311,
687
- 1314,
 
 
 
688
  1318,
689
- 1319,
690
- 1320,
691
  1324,
692
  1325,
693
- 1327,
694
  1328,
695
  1329,
696
  1330,
697
- 1331,
698
- 1332,
699
  1333,
700
  1334,
701
- 1335,
702
  1337,
703
  1338,
704
  1341,
 
705
  1343,
706
  1344,
707
  1345,
708
- 1346,
709
  1347,
710
  1348,
711
- 1350,
712
  1351,
713
- 1355,
 
714
  1356,
 
715
  1358,
716
- 1359,
717
- 1361,
718
  1362,
719
- 1365,
 
 
720
  1368,
721
  1369,
 
722
  1371,
 
723
  1373,
 
724
  1375,
725
- 1377,
726
- 1379,
727
- 1381,
728
  1382,
729
- 1383,
730
  1384,
731
- 1385,
732
  1387,
733
  1390,
734
- 90,
735
  1391,
 
736
  1393,
737
  1394,
738
  1395,
 
739
  1397,
740
  1398,
741
- 1399,
742
- 1400,
743
- 1401,
744
  1402,
 
 
745
  1405,
746
- 1407,
747
- 1409,
748
- 1411,
749
  1412,
750
- 1413,
751
  1415,
752
  1416,
753
  1417,
754
- 1418,
755
  1419,
 
 
756
  1423,
757
- 1425,
758
  1426,
759
  1427,
760
  1428,
761
  1429,
762
  1431,
763
- 1432,
764
  1433,
 
765
  1436,
766
  1437,
767
  1438,
768
  1439,
769
- 1441,
 
770
  1443,
771
  1444,
772
  1445,
773
- 1446,
774
  1447,
775
- 1449,
776
- 1450,
777
  1451,
778
  1452,
779
- 1453,
780
  1454,
781
- 1456,
782
- 1460,
783
- 1461,
 
 
784
  1464,
785
  1465,
786
  1466,
787
  1467,
788
  1468,
789
  1469,
790
- 1470,
791
  1471,
792
  1472,
793
  1473,
 
794
  1476,
795
  1477,
796
  1479,
797
- 1481,
798
  1483,
799
- 1486,
 
800
  1488,
801
- 1489,
802
  1490,
803
  1492,
804
- 1493,
805
  1494,
806
- 1496,
807
  1497,
808
  1498,
809
  1499,
810
- 1502,
811
- 1503,
812
  1504,
813
- 1506,
814
  1508,
815
  1509,
 
 
816
  1512,
817
- 1514,
818
- 1517,
819
  1518,
820
  1519,
821
  1521,
 
822
  1523,
823
- 1524,
824
  1525,
 
 
825
  1528,
826
- 1529,
827
- 1531,
828
  1532,
 
829
  1534,
830
- 1535,
 
831
  1538,
 
832
  1540,
833
  1541,
834
  1542,
835
- 1543,
836
  1544,
837
- 1545,
838
  1547,
839
- 1548,
840
- 1549,
841
  1551,
842
- 1552,
843
  1553,
844
- 1554,
845
  1555,
846
  1556,
847
  1558,
848
- 1559,
849
  1561,
850
  1562,
851
- 1564,
852
  1565,
853
- 1566,
854
- 1570,
855
  1571,
856
  1574,
857
  1575,
858
  1576,
859
- 1577,
860
- 1579,
861
  1580,
862
  1581,
863
- 1583,
864
  1584,
865
- 1588,
 
 
 
 
866
  1590,
867
- 1592,
868
- 1594,
869
  1595,
870
  1596,
871
- 1598,
872
  1599,
873
  1600,
874
  1602,
875
- 1603,
876
- 1604,
877
  1606,
878
  1607,
879
  1608,
880
- 1612,
 
 
881
  1614,
882
  1615,
 
883
  1618,
884
- 1619,
885
  1621,
 
 
886
  1624,
887
  1625,
888
- 1628,
 
889
  1629,
 
890
  1632,
891
  1633,
892
- 1635,
893
  1636,
894
- 1637,
895
  1638,
 
896
  1640,
897
- 1641,
898
  1644,
899
  1645,
900
  1646,
901
  1647,
902
- 1649,
 
903
  1651,
 
 
904
  1653,
 
 
905
  1656,
906
  1658,
907
- 1659,
908
  1660,
909
  1662,
910
  1663,
911
  1664,
912
- 1665,
913
  1666,
914
  1668,
915
- 1669,
916
  1671,
917
  1672,
918
- 1673,
 
919
  1676,
920
- 1677,
921
  1678,
 
922
  1680,
923
  1681,
 
924
  1684,
925
- 1685,
926
  1687,
927
- 1688,
928
  1690,
929
  1692,
930
  1694,
931
  1697,
932
  1698,
933
- 1699,
934
- 1700,
935
  1701,
936
- 1702,
937
  1703,
938
  1704,
939
  1706,
940
  1707,
941
- 1709,
942
  1710,
943
  1711,
944
- 1712,
 
945
  1716,
946
- 1718,
947
  1719,
948
  1720,
949
- 1721,
950
  1722,
 
 
951
  1725,
952
  1726,
953
- 1727,
954
- 1729,
955
  1730,
 
956
  1732,
957
- 1734,
958
- 1736,
959
  1738,
960
  1739,
961
  1740,
962
  1742,
963
  1743,
964
  1744,
 
965
  1745,
966
  1746,
 
 
967
  1749,
 
968
  1752,
969
- 1753,
970
  1754,
971
  1755,
972
- 1756,
973
- 1758,
974
  1761,
975
  1762,
 
976
  1763,
977
  1765,
978
  1766,
 
979
  1768,
980
  1769,
981
- 1771,
982
  1772,
983
  1773,
984
- 1774,
985
- 1778,
986
  1779,
987
- 1780,
988
- 1782,
989
  1783,
990
- 1784,
991
  1785,
992
  1786,
993
  1787,
@@ -996,203 +1036,21 @@
996
  1792,
997
  1793,
998
  1795,
999
- 1796,
1000
  1799,
1001
- 1801,
1002
- 1802,
1003
  1804,
1004
- 1806,
 
1005
  1808,
1006
- 1809,
1007
  1810,
1008
- 1811,
1009
- 1814,
 
1010
  1816,
1011
- 1817,
 
1012
  1820,
1013
- 1823,
1014
- 1824,
1015
- 1825,
1016
- 1827,
1017
- 1830,
1018
- 1831,
1019
- 1833,
1020
- 1834,
1021
- 1836,
1022
- 1837,
1023
- 1838,
1024
- 1840,
1025
- 1841,
1026
- 1842,
1027
- 1844,
1028
- 1846,
1029
- 1849,
1030
- 1850,
1031
- 1852,
1032
- 1853,
1033
- 1855,
1034
- 1857,
1035
- 1858,
1036
- 1859,
1037
- 1860,
1038
- 1861,
1039
- 1862,
1040
- 1863,
1041
- 1864,
1042
- 1867,
1043
- 1868,
1044
- 1870,
1045
- 1871,
1046
- 1872,
1047
- 1873,
1048
- 1874,
1049
- 1875,
1050
- 1876,
1051
- 1878,
1052
- 1880,
1053
- 1882,
1054
- 1883,
1055
- 1884,
1056
- 1886,
1057
- 1887,
1058
- 1889,
1059
- 1893,
1060
- 1894,
1061
- 1896,
1062
- 1899,
1063
- 1900,
1064
- 1902,
1065
- 1903,
1066
- 1904,
1067
- 1905,
1068
- 1906,
1069
- 1907,
1070
- 1909,
1071
- 1910,
1072
- 1911,
1073
- 1912,
1074
- 1913,
1075
- 1916,
1076
- 295,
1077
- 1917,
1078
- 1918,
1079
- 1919,
1080
- 1921,
1081
- 1923,
1082
- 1925,
1083
- 1926,
1084
- 1929,
1085
- 1931,
1086
- 1932,
1087
- 1933,
1088
- 1935,
1089
- 1936,
1090
- 1937,
1091
- 1939,
1092
- 1940,
1093
- 1941,
1094
- 1942,
1095
- 1943,
1096
- 1944,
1097
- 1946,
1098
- 1948,
1099
- 1950,
1100
- 1951,
1101
- 1952,
1102
- 1954,
1103
- 1956,
1104
- 1957,
1105
- 1960,
1106
- 1961,
1107
- 1964,
1108
- 1966,
1109
- 1967,
1110
- 1968,
1111
- 1970,
1112
- 1971,
1113
- 1972,
1114
- 1975,
1115
- 1978,
1116
- 1981,
1117
- 1982,
1118
- 1983,
1119
- 1984,
1120
- 1987,
1121
- 1988,
1122
- 44,
1123
- 1990,
1124
- 1991,
1125
- 1994,
1126
- 1995,
1127
- 1996,
1128
- 2000,
1129
- 2001,
1130
- 2002,
1131
- 2005,
1132
- 2007,
1133
- 2008,
1134
- 2009,
1135
- 2011,
1136
- 2012,
1137
- 2013,
1138
- 2014,
1139
- 2015,
1140
- 2016,
1141
- 2017,
1142
- 2019,
1143
- 2020,
1144
- 2021,
1145
- 2023,
1146
- 2025,
1147
- 2028,
1148
- 2029,
1149
- 2030,
1150
- 2031,
1151
- 2032,
1152
- 2034,
1153
- 2035,
1154
- 2036,
1155
- 2037,
1156
- 2038,
1157
- 2040,
1158
- 2042,
1159
- 2045,
1160
- 2046,
1161
- 2047,
1162
- 2048,
1163
- 2050,
1164
- 2054,
1165
- 2056,
1166
- 2057,
1167
- 2058,
1168
- 2060,
1169
- 2061,
1170
- 2062,
1171
- 2063,
1172
- 2064,
1173
- 2066,
1174
- 2067,
1175
- 2068,
1176
- 2070,
1177
- 2072,
1178
- 2074,
1179
- 2075,
1180
- 2077,
1181
- 2081,
1182
- 852,
1183
- 2082,
1184
- 2083,
1185
- 2084,
1186
- 2086,
1187
- 2087,
1188
- 2089,
1189
- 2091,
1190
- 2092,
1191
- 2094,
1192
- 2096,
1193
- 2097,
1194
- 2098,
1195
- 2099,
1196
- 2101
1197
  ]
1198
  }
6
  10,
7
  12,
8
  14,
9
+ 16,
10
+ 20,
11
+ 24,
12
+ 25,
13
+ 28,
14
  30,
15
  32,
16
  34,
17
+ 37,
18
+ 41,
19
  43,
20
  45,
21
+ 48,
22
+ 50,
23
+ 52,
24
+ 54,
25
  55,
 
26
  58,
27
+ 59,
28
  61,
29
+ 62,
30
  64,
31
+ 65,
32
  66,
33
+ 68,
 
34
  71,
35
+ 73,
36
  75,
37
  77,
38
  79,
39
  81,
40
  83,
41
  85,
 
42
  89,
43
  91,
44
+ 93,
45
+ 96,
46
  98,
47
+ 100,
48
+ 102,
49
+ 105,
50
+ 107,
51
  109,
52
  110,
53
+ 114,
54
+ 116,
55
+ 118,
56
+ 121,
57
+ 123,
58
+ 126,
59
+ 128,
60
+ 133,
61
+ 134,
62
+ 137,
63
  138,
 
64
  140,
65
+ 141,
66
  143,
 
67
  147,
68
+ 148,
69
+ 149,
70
  151,
71
+ 153,
72
+ 157,
73
+ 159,
74
+ 161,
75
+ 163,
 
76
  165,
77
+ 166,
78
  168,
79
+ 171,
 
80
  173,
 
81
  176,
82
+ 180,
83
+ 182,
84
+ 185,
85
+ 186,
86
+ 189,
87
  190,
88
+ 192,
89
+ 194,
90
  197,
91
  199,
92
+ 200,
 
93
  203,
94
+ 207,
95
+ 209,
96
  211,
97
  213,
98
+ 217,
99
  220,
100
+ 222,
 
101
  225,
102
+ 229,
103
+ 232,
104
  234,
105
  236,
106
+ 237,
107
  239,
108
+ 240,
109
  243,
110
+ 70,
111
+ 245,
112
+ 247,
113
+ 249,
114
  252,
115
  254,
116
+ 257,
117
  258,
 
118
  260,
119
  262,
120
  264,
121
  266,
122
+ 268,
123
+ 270,
124
+ 272,
125
  276,
126
  278,
 
127
  281,
128
  283,
129
  285,
130
+ 284,
131
  287,
132
+ 288,
133
+ 292,
 
134
  294,
135
+ 295,
136
  296,
137
+ 299,
 
138
  301,
139
+ 302,
140
+ 304,
141
+ 305,
142
  307,
143
  309,
144
+ 314,
145
+ 316,
146
+ 319,
147
+ 322,
148
+ 324,
 
 
149
  325,
150
+ 326,
151
+ 328,
152
  330,
153
+ 334,
154
  335,
155
  337,
 
156
  339,
157
  341,
158
  343,
159
+ 345,
160
+ 347,
161
+ 349,
162
  350,
163
+ 353,
164
  355,
165
+ 359,
166
+ 361,
167
  362,
168
+ 365,
 
 
169
  368,
170
+ 370,
171
  373,
172
+ 375,
173
  376,
174
  378,
 
175
  382,
176
+ 385,
 
177
  388,
178
  389,
179
+ 390,
180
+ 391,
181
+ 393,
182
  394,
183
+ 395,
184
  398,
185
  400,
186
+ 402,
187
  404,
188
+ 405,
189
+ 406,
190
+ 408,
191
+ 410,
192
  412,
193
  414,
194
+ 416,
 
195
  418,
196
+ 420,
197
+ 421,
198
+ 423,
199
  425,
200
+ 427,
201
  429,
202
  430,
 
 
203
  434,
 
204
  436,
205
  438,
206
+ 440,
207
+ 9,
208
+ 442,
209
+ 445,
210
  446,
211
  448,
212
  449,
213
+ 451,
214
  452,
215
  454,
216
+ 457,
217
  459,
 
218
  463,
219
+ 464,
220
  466,
221
+ 469,
 
222
  473,
223
  475,
224
+ 478,
225
  480,
226
  482,
227
+ 483,
228
  486,
229
+ 487,
230
  491,
231
+ 492,
232
+ 493,
233
  494,
234
+ 496,
 
 
235
  500,
236
+ 503,
 
 
237
  505,
238
+ 507,
239
+ 510,
240
+ 511,
241
+ 513,
242
  516,
243
+ 518,
244
+ 521,
245
  522,
246
+ 524,
247
  526,
248
  527,
249
+ 528,
250
  529,
251
  530,
 
252
  533,
253
  535,
254
+ 537,
255
  540,
256
+ 541,
257
+ 543,
258
+ 545,
259
  548,
260
+ 550,
261
+ 551,
262
  554,
263
  555,
264
  557,
265
+ 558,
266
+ 333,
267
  559,
268
+ 561,
269
  563,
270
+ 565,
271
  568,
272
+ 569,
273
  570,
274
  572,
275
+ 574,
 
276
  578,
277
+ 582,
278
+ 584,
279
+ 585,
280
  587,
281
+ 588,
282
  589,
 
283
  593,
 
284
  595,
285
+ 564,
286
  596,
287
+ 598,
288
  599,
289
+ 600,
290
+ 602,
291
  603,
292
+ 604,
293
  606,
294
+ 608,
295
  609,
296
  610,
297
  613,
298
+ 614,
299
  615,
300
+ 617,
301
  619,
302
+ 621,
 
303
  623,
 
304
  626,
305
+ 628,
306
+ 630,
307
  631,
308
  634,
309
+ 636,
310
+ 637,
311
  638,
312
+ 639,
313
+ 641,
314
  644,
315
+ 646,
316
  647,
317
+ 648,
318
+ 652,
319
+ 654,
320
+ 655,
321
+ 656,
322
  657,
323
+ 658,
324
  660,
 
325
  662,
326
  663,
 
327
  666,
328
+ 669,
329
+ 672,
 
330
  674,
331
+ 676,
332
  677,
333
  678,
334
  679,
335
  681,
336
  683,
 
337
  686,
 
338
  689,
339
+ 690,
340
  691,
341
+ 694,
342
+ 695,
 
343
  697,
344
+ 699,
345
+ 701,
 
346
  704,
347
+ 706,
348
  707,
349
  709,
 
350
  712,
351
  713,
352
+ 714,
353
  717,
 
354
  719,
355
+ 721,
356
+ 723,
357
+ 33,
358
+ 726,
359
  729,
360
  730,
361
+ 733,
362
  735,
 
363
  737,
 
 
364
  740,
365
  741,
366
+ 743,
367
  744,
 
368
  747,
369
  748,
370
+ 749,
371
+ 750,
372
  751,
373
+ 752,
374
+ 753,
375
  756,
376
+ 757,
377
  758,
378
  759,
379
  760,
380
+ 761,
381
  762,
 
382
  765,
 
383
  768,
384
+ 770,
385
  771,
386
+ 728,
387
  772,
388
+ 774,
389
  777,
390
  778,
391
+ 780,
392
  782,
393
  783,
394
  785,
395
  787,
396
+ 788,
397
  789,
398
+ 791,
399
  792,
 
400
  795,
401
+ 796,
402
+ 798,
403
  799,
404
  802,
405
  803,
406
+ 804,
407
  806,
408
+ 807,
409
+ 808,
410
  811,
 
411
  815,
 
 
412
  819,
413
  821,
414
+ 823,
415
  825,
416
+ 826,
417
+ 827,
418
+ 829,
419
  830,
420
  832,
421
+ 833,
422
+ 834,
423
  835,
424
  836,
425
+ 839,
426
+ 840,
427
+ 842,
428
+ 843,
429
  844,
430
  845,
431
+ 846,
432
  847,
 
433
  849,
434
  850,
435
  851,
436
  853,
 
437
  856,
438
+ 742,
439
  858,
 
440
  861,
441
+ 863,
442
+ 866,
443
  867,
444
+ 870,
445
+ 873,
446
  874,
447
  875,
448
  877,
449
+ 879,
450
  880,
451
+ 882,
452
  883,
453
+ 885,
454
+ 887,
 
455
  889,
456
  890,
457
+ 891,
458
  894,
459
  897,
460
  898,
461
+ 899,
462
  900,
 
463
  903,
464
  906,
465
+ 908,
466
  909,
467
+ 911,
 
468
  913,
469
+ 917,
 
 
 
470
  919,
471
  920,
472
+ 921,
473
  925,
474
+ 926,
475
+ 927,
476
+ 932,
477
  934,
478
  935,
479
+ 936,
480
  937,
481
  938,
482
+ 939,
483
  941,
484
  943,
 
 
485
  946,
486
+ 948,
487
  949,
 
488
  952,
489
+ 953,
490
+ 954,
491
  956,
492
  957,
 
493
  959,
494
+ 716,
495
  961,
 
496
  963,
497
+ 964,
498
  965,
499
+ 966,
500
+ 967,
501
+ 972,
502
  973,
503
  976,
504
+ 978,
505
  979,
506
  980,
 
507
  983,
508
+ 985,
509
  986,
510
+ 988,
511
  990,
512
  992,
513
+ 993,
514
+ 996,
515
  999,
516
  1001,
517
+ 1002,
518
  1004,
519
  1005,
520
+ 1006,
521
+ 1010,
522
+ 1013,
523
+ 1014,
524
+ 1016,
525
+ 1017,
526
  1018,
527
+ 82,
528
  1019,
529
+ 1022,
 
530
  1024,
531
+ 1025,
532
+ 1028,
533
  1030,
534
+ 1032,
535
  1033,
536
  1035,
537
+ 1037,
538
+ 1038,
539
+ 1040,
540
  1042,
541
  1043,
542
+ 1044,
543
+ 1045,
544
+ 1049,
545
  1053,
546
+ 1054,
547
  1055,
 
548
  1057,
549
  1058,
 
550
  1060,
551
+ 1061,
552
  1062,
553
+ 313,
554
  1064,
555
+ 280,
556
+ 1065,
557
  1067,
558
  1069,
559
+ 1070,
560
  1072,
561
+ 1073,
562
+ 1074,
563
  1075,
564
  1076,
565
+ 1077,
566
  1079,
567
  1081,
568
+ 1082,
569
  1085,
570
+ 1086,
571
+ 1088,
572
+ 1090,
573
  1092,
574
  1093,
575
  1094,
 
576
  1096,
577
+ 1098,
578
+ 1099,
579
  1103,
580
+ 1104,
 
581
  1108,
582
+ 1109,
583
  1111,
584
  1112,
585
  1113,
586
+ 1115,
587
  1116,
588
+ 1117,
589
  1118,
590
+ 1119,
591
+ 1122,
592
  1123,
593
  1124,
594
+ 1125,
595
  1128,
596
+ 1129,
 
597
  1132,
598
  1133,
599
+ 1134,
600
  1137,
601
+ 1139,
602
  1140,
603
  1142,
604
+ 1145,
605
  1146,
606
+ 1150,
607
+ 1151,
 
 
 
608
  1155,
609
+ 1156,
610
  1158,
611
  1159,
612
  1160,
613
+ 1161,
614
+ 1163,
615
  1164,
616
  1165,
 
617
  1166,
618
+ 1168,
619
  1171,
620
+ 1173,
621
+ 1174,
622
  1175,
623
+ 1176,
624
  1177,
625
  1179,
626
+ 1180,
627
+ 1184,
628
  1185,
629
  1186,
630
  1188,
631
+ 1189,
632
  1192,
 
 
633
  1195,
634
  1196,
635
+ 1198,
636
+ 1199,
637
+ 1201,
638
+ 1202,
639
+ 1203,
640
  1204,
641
  1205,
642
  1206,
643
+ 1208,
644
+ 1209,
645
+ 1211,
646
+ 1214,
 
647
  1215,
 
648
  1218,
649
+ 1219,
650
  1221,
651
  1222,
652
+ 1223,
653
  1225,
654
  1226,
655
  1227,
656
  1228,
 
 
657
  1231,
 
658
  1233,
659
  1235,
660
+ 1237,
661
+ 1238,
662
  1239,
663
  1241,
664
  1242,
665
+ 1243,
666
  1244,
667
  1245,
 
668
  1249,
 
669
  1252,
670
  1253,
671
+ 1254,
672
+ 1255,
673
+ 1256,
674
  1257,
675
+ 1260,
676
+ 1041,
677
+ 1261,
678
  1262,
679
+ 1264,
680
  1265,
681
  1266,
682
  1267,
683
+ 1268,
684
+ 1269,
685
+ 1271,
686
  1272,
687
+ 1273,
688
  1274,
689
  1275,
 
690
  1277,
691
+ 1278,
692
+ 1279,
693
+ 1281,
694
  1282,
695
+ 1285,
696
  1286,
697
+ 1288,
698
+ 1289,
699
  1290,
700
+ 1291,
701
  1292,
702
  1293,
703
+ 1296,
704
+ 1297,
705
  1299,
706
  1300,
707
  1302,
708
+ 1305,
709
  1306,
710
+ 1307,
711
  1308,
712
  1309,
713
+ 1310,
714
  1311,
715
+ 1313,
716
+ 1315,
717
+ 1316,
718
+ 1317,
719
  1318,
720
+ 1321,
 
721
  1324,
722
  1325,
723
+ 1326,
724
  1328,
725
  1329,
726
  1330,
 
 
727
  1333,
728
  1334,
729
+ 1336,
730
  1337,
731
  1338,
732
  1341,
733
+ 1342,
734
  1343,
735
  1344,
736
  1345,
 
737
  1347,
738
  1348,
739
+ 1349,
740
  1351,
741
+ 1352,
742
+ 1354,
743
  1356,
744
+ 1357,
745
  1358,
 
 
746
  1362,
747
+ 1363,
748
+ 1366,
749
+ 1367,
750
  1368,
751
  1369,
752
+ 1370,
753
  1371,
754
+ 1372,
755
  1373,
756
+ 1374,
757
  1375,
758
+ 1378,
 
 
759
  1382,
 
760
  1384,
761
+ 1386,
762
  1387,
763
  1390,
 
764
  1391,
765
+ 1392,
766
  1393,
767
  1394,
768
  1395,
769
+ 1396,
770
  1397,
771
  1398,
 
 
 
772
  1402,
773
+ 1403,
774
+ 1404,
775
  1405,
776
+ 1408,
777
+ 1410,
 
778
  1412,
 
779
  1415,
780
  1416,
781
  1417,
 
782
  1419,
783
+ 1421,
784
+ 1422,
785
  1423,
786
+ 1424,
787
  1426,
788
  1427,
789
  1428,
790
  1429,
791
  1431,
 
792
  1433,
793
+ 1434,
794
  1436,
795
  1437,
796
  1438,
797
  1439,
798
+ 1440,
799
+ 1442,
800
  1443,
801
  1444,
802
  1445,
 
803
  1447,
804
+ 1448,
 
805
  1451,
806
  1452,
 
807
  1454,
808
+ 1455,
809
+ 1457,
810
+ 1459,
811
+ 1462,
812
+ 1463,
813
  1464,
814
  1465,
815
  1466,
816
  1467,
817
  1468,
818
  1469,
 
819
  1471,
820
  1472,
821
  1473,
822
+ 1474,
823
  1476,
824
  1477,
825
  1479,
826
+ 1480,
827
  1483,
828
+ 1484,
829
+ 1485,
830
  1488,
 
831
  1490,
832
  1492,
 
833
  1494,
 
834
  1497,
835
  1498,
836
  1499,
837
+ 1500,
838
+ 1501,
839
  1504,
840
+ 1507,
841
  1508,
842
  1509,
843
+ 1510,
844
+ 1511,
845
  1512,
846
+ 1513,
847
+ 1515,
848
  1518,
849
  1519,
850
  1521,
851
+ 1522,
852
  1523,
 
853
  1525,
854
+ 1526,
855
+ 1527,
856
  1528,
 
 
857
  1532,
858
+ 1533,
859
  1534,
860
+ 1536,
861
+ 1537,
862
  1538,
863
+ 1539,
864
  1540,
865
  1541,
866
  1542,
 
867
  1544,
868
+ 1546,
869
  1547,
870
+ 1550,
 
871
  1551,
 
872
  1553,
 
873
  1555,
874
  1556,
875
  1558,
876
+ 1560,
877
  1561,
878
  1562,
 
879
  1565,
880
+ 1567,
881
+ 1568,
882
  1571,
883
  1574,
884
  1575,
885
  1576,
886
+ 1578,
 
887
  1580,
888
  1581,
889
+ 1582,
890
  1584,
891
+ 1585,
892
+ 1586,
893
+ 1587,
894
+ 1276,
895
+ 1589,
896
  1590,
897
+ 1591,
898
+ 1593,
899
  1595,
900
  1596,
901
+ 1597,
902
  1599,
903
  1600,
904
  1602,
905
+ 1605,
 
906
  1606,
907
  1607,
908
  1608,
909
+ 1609,
910
+ 1610,
911
+ 1613,
912
  1614,
913
  1615,
914
+ 1617,
915
  1618,
916
+ 1620,
917
  1621,
918
+ 1622,
919
+ 1623,
920
  1624,
921
  1625,
922
+ 1594,
923
+ 1627,
924
  1629,
925
+ 1631,
926
  1632,
927
  1633,
928
+ 1634,
929
  1636,
 
930
  1638,
931
+ 1639,
932
  1640,
933
+ 1643,
934
  1644,
935
  1645,
936
  1646,
937
  1647,
938
+ 1648,
939
+ 1650,
940
  1651,
941
+ 1652,
942
+ 265,
943
  1653,
944
+ 1654,
945
+ 1655,
946
  1656,
947
  1658,
 
948
  1660,
949
  1662,
950
  1663,
951
  1664,
 
952
  1666,
953
  1668,
954
+ 1670,
955
  1671,
956
  1672,
957
+ 1674,
958
+ 1675,
959
  1676,
 
960
  1678,
961
+ 1679,
962
  1680,
963
  1681,
964
+ 1682,
965
  1684,
966
+ 1686,
967
  1687,
968
+ 1689,
969
  1690,
970
  1692,
971
  1694,
972
  1697,
973
  1698,
 
 
974
  1701,
 
975
  1703,
976
  1704,
977
  1706,
978
  1707,
 
979
  1710,
980
  1711,
981
+ 1714,
982
+ 1715,
983
  1716,
984
+ 1717,
985
  1719,
986
  1720,
987
+ 42,
988
  1722,
989
+ 1723,
990
+ 1724,
991
  1725,
992
  1726,
 
 
993
  1730,
994
+ 1731,
995
  1732,
996
+ 1735,
 
997
  1738,
998
  1739,
999
  1740,
1000
  1742,
1001
  1743,
1002
  1744,
1003
+ 1089,
1004
  1745,
1005
  1746,
1006
+ 1747,
1007
+ 1748,
1008
  1749,
1009
+ 1751,
1010
  1752,
 
1011
  1754,
1012
  1755,
1013
+ 1757,
1014
+ 1760,
1015
  1761,
1016
  1762,
1017
+ 1470,
1018
  1763,
1019
  1765,
1020
  1766,
1021
+ 1767,
1022
  1768,
1023
  1769,
1024
+ 1770,
1025
  1772,
1026
  1773,
1027
+ 1775,
1028
+ 1777,
1029
  1779,
 
 
1030
  1783,
 
1031
  1785,
1032
  1786,
1033
  1787,
1036
  1792,
1037
  1793,
1038
  1795,
1039
+ 1797,
1040
  1799,
1041
+ 1800,
 
1042
  1804,
1043
+ 1805,
1044
+ 1807,
1045
  1808,
 
1046
  1810,
1047
+ 1812,
1048
+ 1813,
1049
+ 1815,
1050
  1816,
1051
+ 1818,
1052
+ 1819,
1053
  1820,
1054
+ 1822
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1055
  ]
1056
  }
lemmatizer/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a930ffab83ea245ace02a91e5b332dff291f73bdf2833e42950d3be6b47e65db
3
- size 463714
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a1187f468c84c03816a17d6c97a314c5cbaaa30b602717f50a7b7184431b285b
3
+ size 408618
lemmatizer/trees CHANGED
Binary files a/lemmatizer/trees and b/lemmatizer/trees differ
meta.json CHANGED
@@ -1,14 +1,14 @@
1
  {
2
  "lang":"nl",
3
  "name":"core_news_lg",
4
- "version":"3.3.0",
5
  "description":"Dutch pipeline optimized for CPU. Components: tok2vec, morphologizer, tagger, parser, lemmatizer (trainable_lemmatizer), senter, ner.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"CC BY-SA 4.0",
10
- "spacy_version":">=3.3.0.dev0,<3.4.0",
11
- "spacy_git_version":"849bef2de",
12
  "vectors":{
13
  "width":300,
14
  "vectors":500000,
@@ -72,6 +72,7 @@
72
  "Number=Sing|POS=PROPN",
73
  "POS=PRON|PronType=Dem",
74
  "POS=AUX|VerbForm=Part",
 
75
  "POS=PRON|Person=3|PronType=Rel",
76
  "Number=Plur|POS=PROPN",
77
  "POS=PRON|Person=2|Poss=Yes|PronType=Prs",
@@ -288,7 +289,8 @@
288
  "WW|vd|nom|met-e|zonder-n",
289
  "WW|vd|prenom|met-e",
290
  "WW|vd|prenom|zonder",
291
- "WW|vd|vrij|zonder"
 
292
  ],
293
  "parser":[
294
  "ROOT",
@@ -375,177 +377,177 @@
375
  "senter"
376
  ],
377
  "performance":{
378
- "tag_acc":0.9534133043,
379
- "sents_p":0.8595155709,
380
- "sents_r":0.8909612626,
381
- "sents_f":0.8749559704,
382
- "dep_uas":0.8698053923,
383
- "dep_las":0.8235860531,
384
  "dep_las_per_type":{
385
  "nmod:poss":{
386
- "p":0.9456521739,
387
- "r":0.9525547445,
388
- "f":0.9490909091
389
  },
390
  "nsubj":{
391
- "p":0.8466494845,
392
- "r":0.8639053254,
393
- "f":0.8551903677
394
  },
395
  "aux":{
396
- "p":0.9102990033,
397
- "r":0.9013157895,
398
- "f":0.905785124
399
  },
400
  "advmod":{
401
- "p":0.7967269595,
402
- "r":0.8258928571,
403
- "f":0.8110477861
404
  },
405
  "root":{
406
- "p":0.8636678201,
407
- "r":0.8952654232,
408
- "f":0.8791828108
409
  },
410
  "det":{
411
- "p":0.946263644,
412
- "r":0.9753353527,
413
- "f":0.9605795866
414
  },
415
  "amod":{
416
- "p":0.8720420684,
417
- "r":0.8915770609,
418
- "f":0.8817013735
419
  },
420
  "obl":{
421
- "p":0.7456021651,
422
- "r":0.7461069736,
423
- "f":0.7458544839
424
  },
425
  "mark":{
426
- "p":0.8880866426,
427
- "r":0.8945454545,
428
- "f":0.8913043478
429
  },
430
  "ccomp":{
431
- "p":0.6727272727,
432
- "r":0.691588785,
433
- "f":0.6820276498
434
  },
435
  "case":{
436
- "p":0.9379652605,
437
- "r":0.9606099111,
438
- "f":0.9491525424
439
  },
440
  "appos":{
441
- "p":0.7060702875,
442
- "r":0.6696969697,
443
- "f":0.6874027994
444
  },
445
  "obj":{
446
- "p":0.7875968992,
447
- "r":0.7755725191,
448
- "f":0.7815384615
449
  },
450
  "compound:prt":{
451
- "p":0.7755102041,
452
- "r":0.7136150235,
453
- "f":0.7432762836
454
  },
455
  "xcomp":{
456
- "p":0.6765799257,
457
- "r":0.6618181818,
458
- "f":0.6691176471
459
  },
460
  "flat":{
461
- "p":0.8115124153,
462
- "r":0.7624602333,
463
- "f":0.7862219792
464
  },
465
  "expl:pv":{
466
- "p":0.7674418605,
467
- "r":0.75,
468
- "f":0.7586206897
469
  },
470
  "acl":{
471
- "p":0.4615384615,
472
- "r":0.3673469388,
473
- "f":0.4090909091
474
  },
475
  "advcl":{
476
- "p":0.5024875622,
477
- "r":0.454954955,
478
- "f":0.4775413712
479
  },
480
  "nummod":{
481
- "p":0.8141025641,
482
- "r":0.8466666667,
483
- "f":0.8300653595
484
  },
485
  "nmod":{
486
- "p":0.7278742763,
487
- "r":0.7652173913,
488
- "f":0.746078847
489
  },
490
  "cc":{
491
- "p":0.8544776119,
492
- "r":0.8674242424,
493
- "f":0.8609022556
494
  },
495
  "conj":{
496
- "p":0.6463245492,
497
- "r":0.6331521739,
498
- "f":0.6396705559
499
  },
500
  "nsubj:pass":{
501
- "p":0.8083832335,
502
- "r":0.8490566038,
503
- "f":0.8282208589
504
  },
505
  "aux:pass":{
506
- "p":0.8871794872,
507
- "r":0.9611111111,
508
- "f":0.9226666667
509
- },
510
- "iobj":{
511
- "p":0.5652173913,
512
- "r":0.3939393939,
513
- "f":0.4642857143
514
  },
515
  "cop":{
516
- "p":0.7789473684,
517
  "r":0.8131868132,
518
- "f":0.7956989247
519
  },
520
  "parataxis":{
521
- "p":0.3663366337,
522
- "r":0.268115942,
523
- "f":0.309623431
524
  },
525
  "acl:relcl":{
526
- "p":0.6956521739,
527
- "r":0.7044025157,
528
- "f":0.7
 
 
 
 
 
 
 
 
 
 
529
  },
530
  "fixed":{
531
- "p":0.721448468,
532
- "r":0.4683544304,
533
- "f":0.5679824561
534
  },
535
  "obl:agent":{
536
- "p":0.9615384615,
537
- "r":0.8620689655,
538
- "f":0.9090909091
539
- },
540
- "expl":{
541
- "p":0.4,
542
- "r":0.4761904762,
543
- "f":0.4347826087
544
  },
545
  "csubj":{
546
- "p":0.6111111111,
547
- "r":0.55,
548
- "f":0.5789473684
549
  },
550
  "dep":{
551
  "p":0.0,
@@ -558,16 +560,16 @@
558
  "f":0.0
559
  }
560
  },
561
- "ents_p":0.7652916074,
562
- "ents_r":0.7441217151,
563
- "ents_f":0.7545582048,
564
  "ents_per_type":{
565
- "ORG":{
566
  "p":0.0,
567
  "r":0.0,
568
  "f":0.0
569
  },
570
- "PERSON":{
571
  "p":0.0,
572
  "r":0.0,
573
  "f":0.0
@@ -582,12 +584,17 @@
582
  "r":0.0,
583
  "f":0.0
584
  },
585
- "NORP":{
586
  "p":0.0,
587
  "r":0.0,
588
  "f":0.0
589
  },
590
- "DATE":{
 
 
 
 
 
591
  "p":0.0,
592
  "r":0.0,
593
  "f":0.0
@@ -597,12 +604,12 @@
597
  "r":0.0,
598
  "f":0.0
599
  },
600
- "PRODUCT":{
601
  "p":0.0,
602
  "r":0.0,
603
  "f":0.0
604
  },
605
- "GPE":{
606
  "p":0.0,
607
  "r":0.0,
608
  "f":0.0
@@ -622,12 +629,12 @@
622
  "r":0.0,
623
  "f":0.0
624
  },
625
- "LOC":{
626
  "p":0.0,
627
  "r":0.0,
628
  "f":0.0
629
  },
630
- "WORK_OF_ART":{
631
  "p":0.0,
632
  "r":0.0,
633
  "f":0.0
@@ -642,77 +649,72 @@
642
  "r":0.0,
643
  "f":0.0
644
  },
645
- "PERCENT":{
646
- "p":0.0,
647
- "r":0.0,
648
- "f":0.0
649
- },
650
  "LAW":{
651
  "p":0.0,
652
  "r":0.0,
653
  "f":0.0
654
  }
655
  },
656
- "speed":10256.4374458498,
657
  "token_acc":0.9997165842,
658
  "token_p":0.9974281853,
659
  "token_r":0.9975586363,
660
  "token_f":0.9974934066,
661
- "pos_acc":0.9661941112,
662
- "morph_acc":0.9635947213,
663
- "morph_micro_p":0.9722389581,
664
- "morph_micro_r":0.9551518463,
665
- "morph_micro_f":0.9636196601,
666
  "morph_per_feat":{
667
  "Person":{
668
- "p":0.9892891918,
669
- "r":0.9713193117,
670
- "f":0.9802219006
671
  },
672
  "Poss":{
673
- "p":0.9811320755,
674
- "r":0.9961685824,
675
- "f":0.9885931559
676
  },
677
  "PronType":{
678
- "p":0.9880647911,
679
- "r":0.9634247714,
680
- "f":0.9755892256
681
  },
682
  "Gender":{
683
- "p":0.9345253747,
684
- "r":0.90409565,
685
- "f":0.9190587018
686
  },
687
  "Number":{
688
- "p":0.9816793893,
689
- "r":0.9624307738,
690
- "f":0.9719597914
691
  },
692
  "Tense":{
693
- "p":0.9805339266,
694
- "r":0.9692138538,
695
- "f":0.9748410285
696
  },
697
  "VerbForm":{
698
- "p":0.9623188406,
699
- "r":0.9557394746,
700
- "f":0.9590178733
701
  },
702
  "Degree":{
703
- "p":0.9655677656,
704
- "r":0.9468390805,
705
- "f":0.9561117156
706
  },
707
  "Definite":{
708
- "p":0.9942680776,
709
- "r":0.9925176056,
710
- "f":0.9933920705
711
  },
712
  "Case":{
713
- "p":0.998,
714
- "r":0.9940239044,
715
- "f":0.996007984
716
  },
717
  "Reflex":{
718
  "p":1.0,
@@ -721,11 +723,11 @@
721
  },
722
  "Abbr":{
723
  "p":1.0,
724
- "r":0.5,
725
- "f":0.6666666667
726
  }
727
  },
728
- "lemma_acc":0.9417537126
729
  },
730
  "sources":[
731
  {
@@ -740,12 +742,6 @@
740
  "license":"CC BY-SA 4.0",
741
  "author":"NLP Town"
742
  },
743
- {
744
- "name":"UD Dutch LassySmall v2.8",
745
- "url":"https://github.com/UniversalDependencies/UD_Dutch-LassySmall",
746
- "license":"CC BY-SA 4.0",
747
- "author":"Bouma, Gosse; van Noord, Gertjan"
748
- },
749
  {
750
  "name":"UD Dutch Alpino v2.8",
751
  "url":"https://github.com/UniversalDependencies/UD_Dutch-Alpino",
1
  {
2
  "lang":"nl",
3
  "name":"core_news_lg",
4
+ "version":"3.4.0",
5
  "description":"Dutch pipeline optimized for CPU. Components: tok2vec, morphologizer, tagger, parser, lemmatizer (trainable_lemmatizer), senter, ner.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"CC BY-SA 4.0",
10
+ "spacy_version":">=3.4.0,<3.5.0",
11
+ "spacy_git_version":"dd038b536",
12
  "vectors":{
13
  "width":300,
14
  "vectors":500000,
72
  "Number=Sing|POS=PROPN",
73
  "POS=PRON|PronType=Dem",
74
  "POS=AUX|VerbForm=Part",
75
+ "POS=SPACE",
76
  "POS=PRON|Person=3|PronType=Rel",
77
  "Number=Plur|POS=PROPN",
78
  "POS=PRON|Person=2|Poss=Yes|PronType=Prs",
289
  "WW|vd|nom|met-e|zonder-n",
290
  "WW|vd|prenom|met-e",
291
  "WW|vd|prenom|zonder",
292
+ "WW|vd|vrij|zonder",
293
+ "_SP"
294
  ],
295
  "parser":[
296
  "ROOT",
377
  "senter"
378
  ],
379
  "performance":{
380
+ "tag_acc":0.9538495093,
381
+ "sents_p":0.8619972261,
382
+ "sents_r":0.8916786227,
383
+ "sents_f":0.8765867419,
384
+ "dep_uas":0.8682669641,
385
+ "dep_las":0.8246685762,
386
  "dep_las_per_type":{
387
  "nmod:poss":{
388
+ "p":0.9522058824,
389
+ "r":0.9452554745,
390
+ "f":0.9487179487
391
  },
392
  "nsubj":{
393
+ "p":0.8483072917,
394
+ "r":0.8566732413,
395
+ "f":0.8524697416
396
  },
397
  "aux":{
398
+ "p":0.9150326797,
399
+ "r":0.9210526316,
400
+ "f":0.9180327869
401
  },
402
  "advmod":{
403
+ "p":0.7968337731,
404
+ "r":0.8089285714,
405
+ "f":0.8028356225
406
  },
407
  "root":{
408
+ "p":0.8689320388,
409
+ "r":0.8988522238,
410
+ "f":0.8836389281
411
  },
412
  "det":{
413
+ "p":0.9430724152,
414
+ "r":0.9749026395,
415
+ "f":0.9587234043
416
  },
417
  "amod":{
418
+ "p":0.8828193833,
419
+ "r":0.8978494624,
420
+ "f":0.8902709907
421
  },
422
  "obl":{
423
+ "p":0.7610619469,
424
+ "r":0.7569397427,
425
+ "f":0.7589952478
426
  },
427
  "mark":{
428
+ "p":0.8918918919,
429
+ "r":0.9,
430
+ "f":0.8959276018
431
  },
432
  "ccomp":{
433
+ "p":0.6979166667,
434
+ "r":0.6261682243,
435
+ "f":0.6600985222
436
  },
437
  "case":{
438
+ "p":0.9368811881,
439
+ "r":0.9618805591,
440
+ "f":0.9492163009
441
  },
442
  "appos":{
443
+ "p":0.707165109,
444
+ "r":0.6878787879,
445
+ "f":0.6973886329
446
  },
447
  "obj":{
448
+ "p":0.7573529412,
449
+ "r":0.786259542,
450
+ "f":0.7715355805
451
  },
452
  "compound:prt":{
453
+ "p":0.7692307692,
454
+ "r":0.7042253521,
455
+ "f":0.7352941176
456
  },
457
  "xcomp":{
458
+ "p":0.6642599278,
459
+ "r":0.6690909091,
460
+ "f":0.6666666667
461
  },
462
  "flat":{
463
+ "p":0.8215527231,
464
+ "r":0.7518557794,
465
+ "f":0.7851605759
466
  },
467
  "expl:pv":{
468
+ "p":0.7619047619,
469
+ "r":0.7272727273,
470
+ "f":0.7441860465
471
  },
472
  "acl":{
473
+ "p":0.4404761905,
474
+ "r":0.3775510204,
475
+ "f":0.4065934066
476
  },
477
  "advcl":{
478
+ "p":0.5577889447,
479
+ "r":0.5,
480
+ "f":0.5273159145
481
  },
482
  "nummod":{
483
+ "p":0.8291139241,
484
+ "r":0.8733333333,
485
+ "f":0.8506493506
486
  },
487
  "nmod":{
488
+ "p":0.7201986755,
489
+ "r":0.7565217391,
490
+ "f":0.737913486
491
  },
492
  "cc":{
493
+ "p":0.8536585366,
494
+ "r":0.8617424242,
495
+ "f":0.8576814326
496
  },
497
  "conj":{
498
+ "p":0.6527777778,
499
+ "r":0.6385869565,
500
+ "f":0.6456043956
501
  },
502
  "nsubj:pass":{
503
+ "p":0.8128654971,
504
+ "r":0.8742138365,
505
+ "f":0.8424242424
506
  },
507
  "aux:pass":{
508
+ "p":0.9095744681,
509
+ "r":0.95,
510
+ "f":0.9293478261
 
 
 
 
 
511
  },
512
  "cop":{
513
+ "p":0.7900355872,
514
  "r":0.8131868132,
515
+ "f":0.8014440433
516
  },
517
  "parataxis":{
518
+ "p":0.3349282297,
519
+ "r":0.2536231884,
520
+ "f":0.2886597938
521
  },
522
  "acl:relcl":{
523
+ "p":0.701863354,
524
+ "r":0.7106918239,
525
+ "f":0.70625
526
+ },
527
+ "expl":{
528
+ "p":0.4166666667,
529
+ "r":0.4761904762,
530
+ "f":0.4444444444
531
+ },
532
+ "iobj":{
533
+ "p":0.6470588235,
534
+ "r":0.3333333333,
535
+ "f":0.44
536
  },
537
  "fixed":{
538
+ "p":0.7255434783,
539
+ "r":0.4828209765,
540
+ "f":0.5798045603
541
  },
542
  "obl:agent":{
543
+ "p":0.8387096774,
544
+ "r":0.8965517241,
545
+ "f":0.8666666667
 
 
 
 
 
546
  },
547
  "csubj":{
548
+ "p":0.45,
549
+ "r":0.45,
550
+ "f":0.45
551
  },
552
  "dep":{
553
  "p":0.0,
560
  "f":0.0
561
  }
562
  },
563
+ "ents_p":0.7845934379,
564
+ "ents_r":0.7607192254,
565
+ "ents_f":0.7724719101,
566
  "ents_per_type":{
567
+ "PERSON":{
568
  "p":0.0,
569
  "r":0.0,
570
  "f":0.0
571
  },
572
+ "GPE":{
573
  "p":0.0,
574
  "r":0.0,
575
  "f":0.0
584
  "r":0.0,
585
  "f":0.0
586
  },
587
+ "ORG":{
588
  "p":0.0,
589
  "r":0.0,
590
  "f":0.0
591
  },
592
+ "WORK_OF_ART":{
593
+ "p":0.0,
594
+ "r":0.0,
595
+ "f":0.0
596
+ },
597
+ "PRODUCT":{
598
  "p":0.0,
599
  "r":0.0,
600
  "f":0.0
604
  "r":0.0,
605
  "f":0.0
606
  },
607
+ "NORP":{
608
  "p":0.0,
609
  "r":0.0,
610
  "f":0.0
611
  },
612
+ "DATE":{
613
  "p":0.0,
614
  "r":0.0,
615
  "f":0.0
629
  "r":0.0,
630
  "f":0.0
631
  },
632
+ "PERCENT":{
633
  "p":0.0,
634
  "r":0.0,
635
  "f":0.0
636
  },
637
+ "LOC":{
638
  "p":0.0,
639
  "r":0.0,
640
  "f":0.0
649
  "r":0.0,
650
  "f":0.0
651
  },
 
 
 
 
 
652
  "LAW":{
653
  "p":0.0,
654
  "r":0.0,
655
  "f":0.0
656
  }
657
  },
658
+ "speed":10089.2857654723,
659
  "token_acc":0.9997165842,
660
  "token_p":0.9974281853,
661
  "token_r":0.9975586363,
662
  "token_f":0.9974934066,
663
+ "pos_acc":0.9666739368,
664
+ "morph_acc":0.9639000982,
665
+ "morph_micro_p":0.9741415265,
666
+ "morph_micro_r":0.9571096845,
667
+ "morph_micro_f":0.965550503,
668
  "morph_per_feat":{
669
  "Person":{
670
+ "p":0.994157741,
671
+ "r":0.9760994264,
672
+ "f":0.9850458273
673
  },
674
  "Poss":{
675
+ "p":0.9886363636,
676
+ "r":1.0,
677
+ "f":0.9942857143
678
  },
679
  "PronType":{
680
+ "p":0.9914965986,
681
+ "r":0.9692435578,
682
+ "f":0.9802437999
683
  },
684
  "Gender":{
685
+ "p":0.9321944809,
686
+ "r":0.9023149326,
687
+ "f":0.9170113754
688
  },
689
  "Number":{
690
+ "p":0.9844227245,
691
+ "r":0.9648256249,
692
+ "f":0.9745256633
693
  },
694
  "Tense":{
695
+ "p":0.9838530067,
696
+ "r":0.9714128642,
697
+ "f":0.977593361
698
  },
699
  "VerbForm":{
700
+ "p":0.9687726943,
701
+ "r":0.9600575747,
702
+ "f":0.9643954455
703
  },
704
  "Degree":{
705
+ "p":0.9598246896,
706
+ "r":0.9439655172,
707
+ "f":0.9518290474
708
  },
709
  "Definite":{
710
+ "p":0.9964757709,
711
+ "r":0.9955985915,
712
+ "f":0.9960369881
713
  },
714
  "Case":{
715
+ "p":0.998003992,
716
+ "r":0.9960159363,
717
+ "f":0.9970089731
718
  },
719
  "Reflex":{
720
  "p":1.0,
723
  },
724
  "Abbr":{
725
  "p":1.0,
726
+ "r":0.5555555556,
727
+ "f":0.7142857143
728
  }
729
  },
730
+ "lemma_acc":0.9568877162
731
  },
732
  "sources":[
733
  {
742
  "license":"CC BY-SA 4.0",
743
  "author":"NLP Town"
744
  },
 
 
 
 
 
 
745
  {
746
  "name":"UD Dutch Alpino v2.8",
747
  "url":"https://github.com/UniversalDependencies/UD_Dutch-Alpino",
morphologizer/cfg CHANGED
@@ -53,6 +53,7 @@
53
  "Number=Sing|POS=PROPN":"Number=Sing",
54
  "POS=PRON|PronType=Dem":"PronType=Dem",
55
  "POS=AUX|VerbForm=Part":"VerbForm=Part",
 
56
  "POS=PRON|Person=3|PronType=Rel":"Person=3|PronType=Rel",
57
  "Number=Plur|POS=PROPN":"Number=Plur",
58
  "POS=PRON|Person=2|Poss=Yes|PronType=Prs":"Person=2|Poss=Yes|PronType=Prs",
@@ -120,6 +121,7 @@
120
  "Number=Sing|POS=PROPN":96,
121
  "POS=PRON|PronType=Dem":95,
122
  "POS=AUX|VerbForm=Part":87,
 
123
  "POS=PRON|Person=3|PronType=Rel":95,
124
  "Number=Plur|POS=PROPN":96,
125
  "POS=PRON|Person=2|Poss=Yes|PronType=Prs":95,
53
  "Number=Sing|POS=PROPN":"Number=Sing",
54
  "POS=PRON|PronType=Dem":"PronType=Dem",
55
  "POS=AUX|VerbForm=Part":"VerbForm=Part",
56
+ "POS=SPACE":"",
57
  "POS=PRON|Person=3|PronType=Rel":"Person=3|PronType=Rel",
58
  "Number=Plur|POS=PROPN":"Number=Plur",
59
  "POS=PRON|Person=2|Poss=Yes|PronType=Prs":"Person=2|Poss=Yes|PronType=Prs",
121
  "Number=Sing|POS=PROPN":96,
122
  "POS=PRON|PronType=Dem":95,
123
  "POS=AUX|VerbForm=Part":87,
124
+ "POS=SPACE":103,
125
  "POS=PRON|Person=3|PronType=Rel":95,
126
  "Number=Plur|POS=PROPN":96,
127
  "POS=PRON|Person=2|Poss=Yes|PronType=Prs":95,
morphologizer/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:002fb75f4ff6e09c75716386d3f603669f6b229fad7ba1f7b6a26620e998b0d5
3
- size 25650
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:85008130e8c322f8f5cec41d1470689ab18b81a65681341b3a5b12b5e838e89c
3
+ size 26038
ner/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:377530352be09f8af817499145d3dd8a3abc32bd7d952084a390046ee59660f8
3
  size 6511153
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:13100cddcae2839fa86aa0979ef9eba092e9736a94459f92ef6309cb52071d35
3
  size 6511153
ner/moves CHANGED
@@ -1 +1 @@
1
- ��moves��{"0":{},"1":{"PERSON":4544,"DATE":2681,"GPE":2521,"ORG":2170,"WORK_OF_ART":1437,"NORP":1021,"CARDINAL":761,"EVENT":604,"FAC":527,"LOC":301,"ORDINAL":233,"LANGUAGE":146,"PERCENT":100,"QUANTITY":90,"LAW":65,"TIME":54,"PRODUCT":37,"MONEY":34},"2":{"PERSON":4544,"DATE":2681,"GPE":2521,"ORG":2170,"WORK_OF_ART":1437,"NORP":1021,"CARDINAL":761,"EVENT":604,"FAC":527,"LOC":301,"ORDINAL":233,"LANGUAGE":146,"PERCENT":100,"QUANTITY":90,"LAW":65,"TIME":54,"PRODUCT":37,"MONEY":34},"3":{"PERSON":4544,"DATE":2681,"GPE":2521,"ORG":2170,"WORK_OF_ART":1437,"NORP":1021,"CARDINAL":761,"EVENT":604,"FAC":527,"LOC":301,"ORDINAL":233,"LANGUAGE":146,"PERCENT":100,"QUANTITY":90,"LAW":65,"TIME":54,"PRODUCT":37,"MONEY":34},"4":{"PERSON":4544,"DATE":2681,"GPE":2521,"ORG":2170,"WORK_OF_ART":1437,"NORP":1021,"CARDINAL":761,"EVENT":604,"FAC":527,"LOC":301,"ORDINAL":233,"LANGUAGE":146,"PERCENT":100,"QUANTITY":90,"LAW":65,"TIME":54,"PRODUCT":37,"MONEY":34,"":1},"5":{"":1}}�cfg��neg_key�
1
+ ��moves��{"0":{},"1":{"PERSON":4553,"DATE":2684,"GPE":2525,"ORG":2170,"WORK_OF_ART":1439,"NORP":1021,"CARDINAL":761,"EVENT":605,"FAC":530,"LOC":301,"ORDINAL":233,"LANGUAGE":146,"PERCENT":100,"QUANTITY":90,"LAW":65,"TIME":54,"PRODUCT":37,"MONEY":34},"2":{"PERSON":4553,"DATE":2684,"GPE":2525,"ORG":2170,"WORK_OF_ART":1439,"NORP":1021,"CARDINAL":761,"EVENT":605,"FAC":530,"LOC":301,"ORDINAL":233,"LANGUAGE":146,"PERCENT":100,"QUANTITY":90,"LAW":65,"TIME":54,"PRODUCT":37,"MONEY":34},"3":{"PERSON":4553,"DATE":2684,"GPE":2525,"ORG":2170,"WORK_OF_ART":1439,"NORP":1021,"CARDINAL":761,"EVENT":605,"FAC":530,"LOC":301,"ORDINAL":233,"LANGUAGE":146,"PERCENT":100,"QUANTITY":90,"LAW":65,"TIME":54,"PRODUCT":37,"MONEY":34},"4":{"PERSON":4553,"DATE":2684,"GPE":2525,"ORG":2170,"WORK_OF_ART":1439,"NORP":1021,"CARDINAL":761,"EVENT":605,"FAC":530,"LOC":301,"ORDINAL":233,"LANGUAGE":146,"PERCENT":100,"QUANTITY":90,"LAW":65,"TIME":54,"PRODUCT":37,"MONEY":34,"":1},"5":{"":1}}�cfg��neg_key�
nl_core_news_lg-any-py3-none-any.whl CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1f2d97ecb9d50259b0b25ebc57a3e2f79a7b6260a9e238c26792ed1854d9830d
3
- size 568149198
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4f8ab23ba250ffeaf98a70822fcbf08ea1d6aa59774faacae113dfc8559fc4f4
3
+ size 568072659
parser/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8a641079ec81300f2485528a68139797e6bccea9122ae8c014303aa65a075bfc
3
  size 315229
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:371ca1381fc2478487abd07478b549eebbc3a2e149f3127ff07dde2ced4ace73
3
  size 315229
parser/moves CHANGED
@@ -1 +1 @@
1
- ��moves��{"0":{"":151638},"1":{"":91446},"2":{"det":29834,"case":26244,"nsubj":13574,"amod":12930,"punct":11769,"advmod":9701,"obl":8122,"mark":6683,"cc":5442,"obj":4514,"aux":4218,"nsubj:pass":2513,"aux:pass":2468,"cop":2077,"nummod":2054,"nmod:poss":2019,"nmod":1249,"xcomp":1160,"compound:prt":839,"advcl":643,"acl":507,"parataxis":413,"iobj":307,"expl":273,"advmod||xcomp":266,"expl:pv":261,"obl||xcomp":259,"obl:agent":227,"obj||xcomp":200,"case||obl":162,"ccomp":108,"expl||advcl":60,"case||advcl":51,"obl||ccomp":50,"csubj":50,"advmod||ccomp":49,"obj||ccomp":47,"obl||obj":42,"advcl||xcomp":31,"dep":0},"3":{"punct":19558,"nmod":12995,"flat":9165,"conj":7139,"obl":6798,"fixed":4636,"nsubj":4269,"appos":3301,"obj":3144,"advmod":3092,"parataxis":2279,"xcomp":2098,"acl:relcl":2035,"advcl":1596,"compound:prt":1376,"cop":1282,"ccomp":1231,"acl":775,"amod":490,"aux:pass":398,"csubj":396,"nummod":365,"aux":355,"iobj":230,"expl:pv":225,"obl:agent":221,"nmod||obj":178,"advcl||advmod":152,"case":147,"acl:relcl||obj":135,"case||obl":132,"acl:relcl||nsubj":98,"acl||obj":88,"expl":83,"mark":69,"orphan":68,"acl:relcl||nsubj:pass":55,"obl||xcomp":53,"expl||advcl":47,"cc":35,"advcl||amod":35,"advcl||nmod":34,"obl||obj":32,"nmod||nsubj":31,"dep":0},"4":{"ROOT":18078}}�cfg��neg_key�
1
+ ��moves��{"0":{"":151646},"1":{"":92888},"2":{"det":29834,"case":26244,"nsubj":13574,"amod":12930,"punct":11769,"advmod":9701,"obl":8122,"mark":6683,"cc":5442,"obj":4514,"aux":4218,"nsubj:pass":2513,"aux:pass":2468,"cop":2077,"nummod":2054,"nmod:poss":2019,"nmod":1249,"xcomp":1160,"compound:prt":839,"advcl":643,"acl":507,"parataxis":413,"iobj":307,"expl":273,"advmod||xcomp":266,"expl:pv":261,"obl||xcomp":259,"obl:agent":227,"obj||xcomp":200,"case||obl":162,"ccomp":108,"expl||advcl":60,"case||advcl":51,"obl||ccomp":50,"csubj":50,"advmod||ccomp":49,"obj||ccomp":47,"obl||obj":42,"advcl||xcomp":31,"dep":0},"3":{"punct":19558,"nmod":12995,"flat":9165,"conj":7139,"obl":6798,"fixed":4636,"nsubj":4269,"appos":3301,"obj":3144,"advmod":3092,"parataxis":2279,"xcomp":2098,"acl:relcl":2035,"advcl":1596,"dep":1442,"compound:prt":1376,"cop":1282,"ccomp":1231,"acl":775,"amod":490,"aux:pass":398,"csubj":396,"nummod":365,"aux":355,"iobj":230,"expl:pv":225,"obl:agent":221,"nmod||obj":178,"advcl||advmod":152,"case":147,"acl:relcl||obj":135,"case||obl":132,"acl:relcl||nsubj":98,"acl||obj":88,"expl":83,"mark":69,"orphan":68,"acl:relcl||nsubj:pass":55,"obl||xcomp":53,"expl||advcl":47,"cc":35,"advcl||amod":35,"advcl||nmod":34,"obl||obj":32,"nmod||nsubj":31},"4":{"ROOT":18078}}�cfg��neg_key�
senter/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9b950cda3c8d2d290ae31b1e6c39157efa215b65c09f0f4560fb0b5e773bb2e3
3
  size 219953
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cb9fbbd1012c9276d1e34185cfe49de343ff355dc23b95a9e1800f2b828d942e
3
  size 219953
tagger/cfg CHANGED
@@ -201,7 +201,8 @@
201
  "WW|vd|nom|met-e|zonder-n",
202
  "WW|vd|prenom|met-e",
203
  "WW|vd|prenom|zonder",
204
- "WW|vd|vrij|zonder"
 
205
  ],
206
  "neg_prefix":"!",
207
  "overwrite":false
201
  "WW|vd|nom|met-e|zonder-n",
202
  "WW|vd|prenom|met-e",
203
  "WW|vd|prenom|zonder",
204
+ "WW|vd|vrij|zonder",
205
+ "_SP"
206
  ],
207
  "neg_prefix":"!",
208
  "overwrite":false
tagger/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c241bc6b1c22b0c4a305e5f647a903b6bb1f79520bc8ad0ea1917ed7484fa5e4
3
- size 78813
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:506d4e1c7eee81e758cef57ed23e996af393cc74b1ab0d5001f84798f7297dcb
3
+ size 79201
tok2vec/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a746887c36b1681fcca5f0cd12c159c61ab1ea33aac78377e59672be1ee74d19
3
  size 6365604
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c3136d0e6227a5efb09ef5e5ad159e3d22a41af10f57aee3344d782973405ee0
3
  size 6365604
vocab/key2row CHANGED
Binary files a/vocab/key2row and b/vocab/key2row differ
vocab/strings.json CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d1a5fc70ba2fd5c9823bd960f3b3f6c9b84ef776f0b4668f997cb871be8c859c
3
- size 10172588
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:be0c25c54764ddfd5c4c89c2c416d5fa7e558bc4acd526b5aa730db3c0af2bb3
3
+ size 10075413