osanseviero HF staff commited on
Commit
c79eab0
1 Parent(s): c50b7dc

Update spaCy pipeline

Browse files
LICENSES_SOURCES CHANGED
@@ -1,4 +1,4 @@
1
- # UD Danish DDT v2.5
2
 
3
  * Author: Johannsen, Anders; Martínez Alonso, Héctor; Plank, Barbara
4
  * URL: https://github.com/UniversalDependencies/UD_Danish-DDT
@@ -878,553 +878,442 @@ Creative Commons may be contacted at creativecommons.org.
878
 
879
 
880
 
881
- # Lemmatization Lists
882
 
883
- * Author: Michal Měchura
884
- * URL: https://github.com/michmech/lemmatization-lists/
885
- * License: ODbL
886
 
887
  ```
888
- ## ODC Open Database License (ODbL)
889
-
890
- ### Preamble
891
-
892
- The Open Database License (ODbL) is a license agreement intended to
893
- allow users to freely share, modify, and use this Database while
894
- maintaining this same freedom for others. Many databases are covered by
895
- copyright, and therefore this document licenses these rights. Some
896
- jurisdictions, mainly in the European Union, have specific rights that
897
- cover databases, and so the ODbL addresses these rights, too. Finally,
898
- the ODbL is also an agreement in contract for users of this Database to
899
- act in certain ways in return for accessing this Database.
900
-
901
- Databases can contain a wide variety of types of content (images,
902
- audiovisual material, and sounds all in the same database, for example),
903
- and so the ODbL only governs the rights over the Database, and not the
904
- contents of the Database individually. Licensors should use the ODbL
905
- together with another license for the contents, if the contents have a
906
- single set of rights that uniformly covers all of the contents. If the
907
- contents have multiple sets of different rights, Licensors should
908
- describe what rights govern what contents together in the individual
909
- record or in some other way that clarifies what rights apply.
910
-
911
- Sometimes the contents of a database, or the database itself, can be
912
- covered by other rights not addressed here (such as private contracts,
913
- trade mark over the name, or privacy rights / data protection rights
914
- over information in the contents), and so you are advised that you may
915
- have to consult other documents or clear other rights before doing
916
- activities not covered by this License.
917
-
918
- ------
919
-
920
- The Licensor (as defined below)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
921
 
922
- and
923
-
924
- You (as defined below)
925
 
926
- agree as follows:
927
-
928
- ### 1.0 Definitions of Capitalised Words
929
-
930
- "Collective Database" – Means this Database in unmodified form as part
931
- of a collection of independent databases in themselves that together are
932
- assembled into a collective whole. A work that constitutes a Collective
933
- Database will not be considered a Derivative Database.
934
-
935
- "Convey" – As a verb, means Using the Database, a Derivative Database,
936
- or the Database as part of a Collective Database in any way that enables
937
- a Person to make or receive copies of the Database or a Derivative
938
- Database. Conveying does not include interaction with a user through a
939
- computer network, or creating and Using a Produced Work, where no
940
- transfer of a copy of the Database or a Derivative Database occurs.
941
- "Contents" – The contents of this Database, which includes the
942
- information, independent works, or other material collected into the
943
- Database. For example, the contents of the Database could be factual
944
- data or works such as images, audiovisual material, text, or sounds.
945
-
946
- "Database" – A collection of material (the Contents) arranged in a
947
- systematic or methodical way and individually accessible by electronic
948
- or other means offered under the terms of this License.
949
-
950
- "Database Directive" – Means Directive 96/9/EC of the European
951
- Parliament and of the Council of 11 March 1996 on the legal protection
952
- of databases, as amended or succeeded.
953
-
954
- "Database Right" – Means rights resulting from the Chapter III ("sui
955
- generis") rights in the Database Directive (as amended and as transposed
956
- by member states), which includes the Extraction and Re-utilisation of
957
- the whole or a Substantial part of the Contents, as well as any similar
958
- rights available in the relevant jurisdiction under Section 10.4.
959
-
960
- "Derivative Database" – Means a database based upon the Database, and
961
- includes any translation, adaptation, arrangement, modification, or any
962
- other alteration of the Database or of a Substantial part of the
963
- Contents. This includes, but is not limited to, Extracting or
964
- Re-utilising the whole or a Substantial part of the Contents in a new
965
- Database.
966
-
967
- "Extraction" – Means the permanent or temporary transfer of all or a
968
- Substantial part of the Contents to another medium by any means or in
969
- any form.
970
-
971
- "License" – Means this license agreement and is both a license of rights
972
- such as copyright and Database Rights and an agreement in contract.
973
-
974
- "Licensor" – Means the Person that offers the Database under the terms
975
- of this License.
976
-
977
- "Person" – Means a natural or legal person or a body of persons
978
- corporate or incorporate.
979
-
980
- "Produced Work" – a work (such as an image, audiovisual material, text,
981
- or sounds) resulting from using the whole or a Substantial part of the
982
- Contents (via a search or other query) from this Database, a Derivative
983
- Database, or this Database as part of a Collective Database.
984
-
985
- "Publicly" – means to Persons other than You or under Your control by
986
- either more than 50% ownership or by the power to direct their
987
- activities (such as contracting with an independent consultant).
988
-
989
- "Re-utilisation" – means any form of making available to the public all
990
- or a Substantial part of the Contents by the distribution of copies, by
991
- renting, by online or other forms of transmission.
992
-
993
- "Substantial" – Means substantial in terms of quantity or quality or a
994
- combination of both. The repeated and systematic Extraction or
995
- Re-utilisation of insubstantial parts of the Contents may amount to the
996
- Extraction or Re-utilisation of a Substantial part of the Contents.
997
-
998
- "Use" – As a verb, means doing any act that is restricted by copyright
999
- or Database Rights whether in the original medium or any other; and
1000
- includes without limitation distributing, copying, publicly performing,
1001
- publicly displaying, and preparing derivative works of the Database, as
1002
- well as modifying the Database as may be technically necessary to use it
1003
- in a different mode or format.
1004
-
1005
- "You" – Means a Person exercising rights under this License who has not
1006
- previously violated the terms of this License with respect to the
1007
- Database, or who has received express permission from the Licensor to
1008
- exercise rights under this License despite a previous violation.
1009
 
1010
- Words in the singular include the plural and vice versa.
1011
-
1012
- ### 2.0 What this License covers
1013
-
1014
- 2.1. Legal effect of this document. This License is:
1015
 
1016
- a. A license of applicable copyright and neighbouring rights;
 
1017
 
1018
- b. A license of the Database Right; and
1019
-
1020
- c. An agreement in contract between You and the Licensor.
1021
-
1022
- 2.2 Legal rights covered. This License covers the legal rights in the
1023
- Database, including:
1024
-
1025
- a. Copyright. Any copyright or neighbouring rights in the Database.
1026
- The copyright licensed includes any individual elements of the
1027
- Database, but does not cover the copyright over the Contents
1028
- independent of this Database. See Section 2.4 for details. Copyright
1029
- law varies between jurisdictions, but is likely to cover: the Database
1030
- model or schema, which is the structure, arrangement, and organisation
1031
- of the Database, and can also include the Database tables and table
1032
- indexes; the data entry and output sheets; and the Field names of
1033
- Contents stored in the Database;
1034
-
1035
- b. Database Rights. Database Rights only extend to the Extraction and
1036
- Re-utilisation of the whole or a Substantial part of the Contents.
1037
- Database Rights can apply even when there is no copyright over the
1038
- Database. Database Rights can also apply when the Contents are removed
1039
- from the Database and are selected and arranged in a way that would
1040
- not infringe any applicable copyright; and
1041
-
1042
- c. Contract. This is an agreement between You and the Licensor for
1043
- access to the Database. In return you agree to certain conditions of
1044
- use on this access as outlined in this License.
1045
-
1046
- 2.3 Rights not covered.
1047
-
1048
- a. This License does not apply to computer programs used in the making
1049
- or operation of the Database;
1050
-
1051
- b. This License does not cover any patents over the Contents or the
1052
- Database; and
1053
-
1054
- c. This License does not cover any trademarks associated with the
1055
- Database.
1056
-
1057
- 2.4 Relationship to Contents in the Database. The individual items of
1058
- the Contents contained in this Database may be covered by other rights,
1059
- including copyright, patent, data protection, privacy, or personality
1060
- rights, and this License does not cover any rights (other than Database
1061
- Rights or in contract) in individual Contents contained in the Database.
1062
- For example, if used on a Database of images (the Contents), this
1063
- License would not apply to copyright over individual images, which could
1064
- have their own separate licenses, or one single license covering all of
1065
- the rights over the images.
1066
-
1067
- ### 3.0 Rights granted
1068
-
1069
- 3.1 Subject to the terms and conditions of this License, the Licensor
1070
- grants to You a worldwide, royalty-free, non-exclusive, terminable (but
1071
- only under Section 9) license to Use the Database for the duration of
1072
- any applicable copyright and Database Rights. These rights explicitly
1073
- include commercial use, and do not exclude any field of endeavour. To
1074
- the extent possible in the relevant jurisdiction, these rights may be
1075
- exercised in all media and formats whether now known or created in the
1076
- future.
1077
-
1078
- The rights granted cover, for example:
1079
-
1080
- a. Extraction and Re-utilisation of the whole or a Substantial part of
1081
- the Contents;
1082
-
1083
- b. Creation of Derivative Databases;
1084
 
1085
- c. Creation of Collective Databases;
1086
-
1087
- d. Creation of temporary or permanent reproductions by any means and
1088
- in any form, in whole or in part, including of any Derivative
1089
- Databases or as a part of Collective Databases; and
1090
-
1091
- e. Distribution, communication, display, lending, making available, or
1092
- performance to the public by any means and in any form, in whole or in
1093
- part, including of any Derivative Database or as a part of Collective
1094
- Databases.
1095
-
1096
- 3.2 Compulsory license schemes. For the avoidance of doubt:
1097
 
1098
- a. Non-waivable compulsory license schemes. In those jurisdictions in
1099
- which the right to collect royalties through any statutory or
1100
- compulsory licensing scheme cannot be waived, the Licensor reserves
1101
- the exclusive right to collect such royalties for any exercise by You
1102
- of the rights granted under this License;
 
 
 
 
 
 
 
 
 
 
1103
 
1104
- b. Waivable compulsory license schemes. In those jurisdictions in
1105
- which the right to collect royalties through any statutory or
1106
- compulsory licensing scheme can be waived, the Licensor waives the
1107
- exclusive right to collect such royalties for any exercise by You of
1108
- the rights granted under this License; and,
1109
 
1110
- c. Voluntary license schemes. The Licensor waives the right to collect
1111
- royalties, whether individually or, in the event that the Licensor is
1112
- a member of a collecting society that administers voluntary licensing
1113
- schemes, via that society, from any exercise by You of the rights
1114
- granted under this License.
1115
 
1116
- 3.3 The right to release the Database under different terms, or to stop
1117
- distributing or making available the Database, is reserved. Note that
1118
- this Database may be multiple-licensed, and so You may have the choice
1119
- of using alternative licenses for this Database. Subject to Section
1120
- 10.4, all other rights not expressly granted by Licensor are reserved.
1121
-
1122
- ### 4.0 Conditions of Use
1123
-
1124
- 4.1 The rights granted in Section 3 above are expressly made subject to
1125
- Your complying with the following conditions of use. These are important
1126
- conditions of this License, and if You fail to follow them, You will be
1127
- in material breach of its terms.
1128
-
1129
- 4.2 Notices. If You Publicly Convey this Database, any Derivative
1130
- Database, or the Database as part of a Collective Database, then You
1131
- must:
1132
-
1133
- a. Do so only under the terms of this License or another license
1134
- permitted under Section 4.4;
1135
-
1136
- b. Include a copy of this License (or, as applicable, a license
1137
- permitted under Section 4.4) or its Uniform Resource Identifier (URI)
1138
- with the Database or Derivative Database, including both in the
1139
- Database or Derivative Database and in any relevant documentation; and
1140
 
1141
- c. Keep intact any copyright or Database Right notices and notices
1142
- that refer to this License.
1143
-
1144
- d. If it is not possible to put the required notices in a particular
1145
- file due to its structure, then You must include the notices in a
1146
- location (such as a relevant directory) where users would be likely to
1147
- look for it.
1148
-
1149
- 4.3 Notice for using output (Contents). Creating and Using a Produced
1150
- Work does not require the notice in Section 4.2. However, if you
1151
- Publicly Use a Produced Work, You must include a notice associated with
1152
- the Produced Work reasonably calculated to make any Person that uses,
1153
- views, accesses, interacts with, or is otherwise exposed to the Produced
1154
- Work aware that Content was obtained from the Database, Derivative
1155
- Database, or the Database as part of a Collective Database, and that it
1156
- is available under this License.
1157
-
1158
- a. Example notice. The following text will satisfy notice under
1159
- Section 4.3:
1160
-
1161
- Contains information from DATABASE NAME, which is made available
1162
- here under the Open Database License (ODbL).
1163
-
1164
- DATABASE NAME should be replaced with the name of the Database and a
1165
- hyperlink to the URI of the Database. "Open Database License" should
1166
- contain a hyperlink to the URI of the text of this License. If
1167
- hyperlinks are not possible, You should include the plain text of the
1168
- required URI's with the above notice.
1169
-
1170
- 4.4 Share alike.
1171
-
1172
- a. Any Derivative Database that You Publicly Use must be only under
1173
- the terms of:
1174
-
1175
- i. This License;
1176
-
1177
- ii. A later version of this License similar in spirit to this
1178
- License; or
1179
-
1180
- iii. A compatible license.
1181
-
1182
- If You license the Derivative Database under one of the licenses
1183
- mentioned in (iii), You must comply with the terms of that license.
1184
-
1185
- b. For the avoidance of doubt, Extraction or Re-utilisation of the
1186
- whole or a Substantial part of the Contents into a new database is a
1187
- Derivative Database and must comply with Section 4.4.
1188
-
1189
- c. Derivative Databases and Produced Works. A Derivative Database is
1190
- Publicly Used and so must comply with Section 4.4. if a Produced Work
1191
- created from the Derivative Database is Publicly Used.
1192
-
1193
- d. Share Alike and additional Contents. For the avoidance of doubt,
1194
- You must not add Contents to Derivative Databases under Section 4.4 a
1195
- that are incompatible with the rights granted under this License.
1196
-
1197
- e. Compatible licenses. Licensors may authorise a proxy to determine
1198
- compatible licenses under Section 4.4 a iii. If they do so, the
1199
- authorised proxy's public statement of acceptance of a compatible
1200
- license grants You permission to use the compatible license.
1201
-
1202
-
1203
- 4.5 Limits of Share Alike. The requirements of Section 4.4 do not apply
1204
- in the following:
1205
-
1206
- a. For the avoidance of doubt, You are not required to license
1207
- Collective Databases under this License if You incorporate this
1208
- Database or a Derivative Database in the collection, but this License
1209
- still applies to this Database or a Derivative Database as a part of
1210
- the Collective Database;
1211
-
1212
- b. Using this Database, a Derivative Database, or this Database as
1213
- part of a Collective Database to create a Produced Work does not
1214
- create a Derivative Database for purposes of Section 4.4; and
1215
-
1216
- c. Use of a Derivative Database internally within an organisation is
1217
- not to the public and therefore does not fall under the requirements
1218
- of Section 4.4.
1219
-
1220
- 4.6 Access to Derivative Databases. If You Publicly Use a Derivative
1221
- Database or a Produced Work from a Derivative Database, You must also
1222
- offer to recipients of the Derivative Database or Produced Work a copy
1223
- in a machine readable form of:
1224
-
1225
- a. The entire Derivative Database; or
1226
-
1227
- b. A file containing all of the alterations made to the Database or
1228
- the method of making the alterations to the Database (such as an
1229
- algorithm), including any additional Contents, that make up all the
1230
- differences between the Database and the Derivative Database.
1231
-
1232
- The Derivative Database (under a.) or alteration file (under b.) must be
1233
- available at no more than a reasonable production cost for physical
1234
- distributions and free of charge if distributed over the internet.
1235
-
1236
- 4.7 Technological measures and additional terms
1237
-
1238
- a. This License does not allow You to impose (except subject to
1239
- Section 4.7 b.) any terms or any technological measures on the
1240
- Database, a Derivative Database, or the whole or a Substantial part of
1241
- the Contents that alter or restrict the terms of this License, or any
1242
- rights granted under it, or have the effect or intent of restricting
1243
- the ability of any person to exercise those rights.
1244
-
1245
- b. Parallel distribution. You may impose terms or technological
1246
- measures on the Database, a Derivative Database, or the whole or a
1247
- Substantial part of the Contents (a "Restricted Database") in
1248
- contravention of Section 4.74 a. only if You also make a copy of the
1249
- Database or a Derivative Database available to the recipient of the
1250
- Restricted Database:
1251
-
1252
- i. That is available without additional fee;
1253
-
1254
- ii. That is available in a medium that does not alter or restrict
1255
- the terms of this License, or any rights granted under it, or have
1256
- the effect or intent of restricting the ability of any person to
1257
- exercise those rights (an "Unrestricted Database"); and
1258
-
1259
- iii. The Unrestricted Database is at least as accessible to the
1260
- recipient as a practical matter as the Restricted Database.
1261
-
1262
- c. For the avoidance of doubt, You may place this Database or a
1263
- Derivative Database in an authenticated environment, behind a
1264
- password, or within a similar access control scheme provided that You
1265
- do not alter or restrict the terms of this License or any rights
1266
- granted under it or have the effect or intent of restricting the
1267
- ability of any person to exercise those rights.
1268
-
1269
- 4.8 Licensing of others. You may not sublicense the Database. Each time
1270
- You communicate the Database, the whole or Substantial part of the
1271
- Contents, or any Derivative Database to anyone else in any way, the
1272
- Licensor offers to the recipient a license to the Database on the same
1273
- terms and conditions as this License. You are not responsible for
1274
- enforcing compliance by third parties with this License, but You may
1275
- enforce any rights that You have over a Derivative Database. You are
1276
- solely responsible for any modifications of a Derivative Database made
1277
- by You or another Person at Your direction. You may not impose any
1278
- further restrictions on the exercise of the rights granted or affirmed
1279
- under this License.
1280
-
1281
- ### 5.0 Moral rights
1282
-
1283
- 5.1 Moral rights. This section covers moral rights, including any rights
1284
- to be identified as the author of the Database or to object to treatment
1285
- that would otherwise prejudice the author's honour and reputation, or
1286
- any other derogatory treatment:
1287
-
1288
- a. For jurisdictions allowing waiver of moral rights, Licensor waives
1289
- all moral rights that Licensor may have in the Database to the fullest
1290
- extent possible by the law of the relevant jurisdiction under Section
1291
- 10.4;
1292
-
1293
- b. If waiver of moral rights under Section 5.1 a in the relevant
1294
- jurisdiction is not possible, Licensor agrees not to assert any moral
1295
- rights over the Database and waives all claims in moral rights to the
1296
- fullest extent possible by the law of the relevant jurisdiction under
1297
- Section 10.4; and
1298
-
1299
- c. For jurisdictions not allowing waiver or an agreement not to assert
1300
- moral rights under Section 5.1 a and b, the author may retain their
1301
- moral rights over certain aspects of the Database.
1302
-
1303
- Please note that some jurisdictions do not allow for the waiver of moral
1304
- rights, and so moral rights may still subsist over the Database in some
1305
- jurisdictions.
1306
-
1307
- ### 6.0 Fair dealing, Database exceptions, and other rights not affected
1308
-
1309
- 6.1 This License does not affect any rights that You or anyone else may
1310
- independently have under any applicable law to make any use of this
1311
- Database, including without limitation:
1312
-
1313
- a. Exceptions to the Database Right including: Extraction of Contents
1314
- from non-electronic Databases for private purposes, Extraction for
1315
- purposes of illustration for teaching or scientific research, and
1316
- Extraction or Re-utilisation for public security or an administrative
1317
- or judicial procedure.
1318
-
1319
- b. Fair dealing, fair use, or any other legally recognised limitation
1320
- or exception to infringement of copyright or other applicable laws.
1321
-
1322
- 6.2 This License does not affect any rights of lawful users to Extract
1323
- and Re-utilise insubstantial parts of the Contents, evaluated
1324
- quantitatively or qualitatively, for any purposes whatsoever, including
1325
- creating a Derivative Database (subject to other rights over the
1326
- Contents, see Section 2.4). The repeated and systematic Extraction or
1327
- Re-utilisation of insubstantial parts of the Contents may however amount
1328
- to the Extraction or Re-utilisation of a Substantial part of the
1329
- Contents.
1330
-
1331
- ### 7.0 Warranties and Disclaimer
1332
-
1333
- 7.1 The Database is licensed by the Licensor "as is" and without any
1334
- warranty of any kind, either express, implied, or arising by statute,
1335
- custom, course of dealing, or trade usage. Licensor specifically
1336
- disclaims any and all implied warranties or conditions of title,
1337
- non-infringement, accuracy or completeness, the presence or absence of
1338
- errors, fitness for a particular purpose, merchantability, or otherwise.
1339
- Some jurisdictions do not allow the exclusion of implied warranties, so
1340
- this exclusion may not apply to You.
1341
-
1342
- ### 8.0 Limitation of liability
1343
-
1344
- 8.1 Subject to any liability that may not be excluded or limited by law,
1345
- the Licensor is not liable for, and expressly excludes, all liability
1346
- for loss or damage however and whenever caused to anyone by any use
1347
- under this License, whether by You or by anyone else, and whether caused
1348
- by any fault on the part of the Licensor or not. This exclusion of
1349
- liability includes, but is not limited to, any special, incidental,
1350
- consequential, punitive, or exemplary damages such as loss of revenue,
1351
- data, anticipated profits, and lost business. This exclusion applies
1352
- even if the Licensor has been advised of the possibility of such
1353
- damages.
1354
-
1355
- 8.2 If liability may not be excluded by law, it is limited to actual and
1356
- direct financial loss to the extent it is caused by proved negligence on
1357
- the part of the Licensor.
1358
-
1359
- ### 9.0 Termination of Your rights under this License
1360
-
1361
- 9.1 Any breach by You of the terms and conditions of this License
1362
- automatically terminates this License with immediate effect and without
1363
- notice to You. For the avoidance of doubt, Persons who have received the
1364
- Database, the whole or a Substantial part of the Contents, Derivative
1365
- Databases, or the Database as part of a Collective Database from You
1366
- under this License will not have their licenses terminated provided
1367
- their use is in full compliance with this License or a license granted
1368
- under Section 4.8 of this License. Sections 1, 2, 7, 8, 9 and 10 will
1369
- survive any termination of this License.
1370
-
1371
- 9.2 If You are not in breach of the terms of this License, the Licensor
1372
- will not terminate Your rights under it.
1373
-
1374
- 9.3 Unless terminated under Section 9.1, this License is granted to You
1375
- for the duration of applicable rights in the Database.
1376
-
1377
- 9.4 Reinstatement of rights. If you cease any breach of the terms and
1378
- conditions of this License, then your full rights under this License
1379
- will be reinstated:
1380
-
1381
- a. Provisionally and subject to permanent termination until the 60th
1382
- day after cessation of breach;
1383
-
1384
- b. Permanently on the 60th day after cessation of breach unless
1385
- otherwise reasonably notified by the Licensor; or
1386
-
1387
- c. Permanently if reasonably notified by the Licensor of the
1388
- violation, this is the first time You have received notice of
1389
- violation of this License from the Licensor, and You cure the
1390
- violation prior to 30 days after your receipt of the notice.
1391
-
1392
- Persons subject to permanent termination of rights are not eligible to
1393
- be a recipient and receive a license under Section 4.8.
1394
-
1395
- 9.5 Notwithstanding the above, Licensor reserves the right to release
1396
- the Database under different license terms or to stop distributing or
1397
- making available the Database. Releasing the Database under different
1398
- license terms or stopping the distribution of the Database will not
1399
- withdraw this License (or any other license that has been, or is
1400
- required to be, granted under the terms of this License), and this
1401
- License will continue in full force and effect unless terminated as
1402
- stated above.
1403
-
1404
- ### 10.0 General
1405
-
1406
- 10.1 If any provision of this License is held to be invalid or
1407
- unenforceable, that must not affect the validity or enforceability of
1408
- the remainder of the terms and conditions of this License and each
1409
- remaining provision of this License shall be valid and enforced to the
1410
- fullest extent permitted by law.
1411
-
1412
- 10.2 This License is the entire agreement between the parties with
1413
- respect to the rights granted here over the Database. It replaces any
1414
- earlier understandings, agreements or representations with respect to
1415
- the Database.
1416
-
1417
- 10.3 If You are in breach of the terms of this License, You will not be
1418
- entitled to rely on the terms of this License or to complain of any
1419
- breach by the Licensor.
1420
-
1421
- 10.4 Choice of law. This License takes effect in and will be governed by
1422
- the laws of the relevant jurisdiction in which the License terms are
1423
- sought to be enforced. If the standard suite of rights granted under
1424
- applicable copyright law and Database Rights in the relevant
1425
- jurisdiction includes additional rights not granted under this License,
1426
- these additional rights are granted in this License in order to meet the
1427
- terms of this License.```
1428
 
1429
 
1430
 
 
1
+ # UD Danish DDT v2.8
2
 
3
  * Author: Johannsen, Anders; Martínez Alonso, Héctor; Plank, Barbara
4
  * URL: https://github.com/UniversalDependencies/UD_Danish-DDT
 
878
 
879
 
880
 
881
+ # Sprogteknologisk orddatabase over det danske sprog
882
 
883
+ * Author: Center for Language Technology, University of Copenhagen
884
+ * URL: https://cst.ku.dk/sto_ordbase/
885
+ * License: CC BY-SA 4.0
886
 
887
  ```
888
+ Attribution-ShareAlike 4.0 International
889
+
890
+ =======================================================================
891
+
892
+ Creative Commons Corporation ("Creative Commons") is not a law firm and
893
+ does not provide legal services or legal advice. Distribution of
894
+ Creative Commons public licenses does not create a lawyer-client or
895
+ other relationship. Creative Commons makes its licenses and related
896
+ information available on an "as-is" basis. Creative Commons gives no
897
+ warranties regarding its licenses, any material licensed under their
898
+ terms and conditions, or any related information. Creative Commons
899
+ disclaims all liability for damages resulting from their use to the
900
+ fullest extent possible.
901
+
902
+ Using Creative Commons Public Licenses
903
+
904
+ Creative Commons public licenses provide a standard set of terms and
905
+ conditions that creators and other rights holders may use to share
906
+ original works of authorship and other material subject to copyright
907
+ and certain other rights specified in the public license below. The
908
+ following considerations are for informational purposes only, are not
909
+ exhaustive, and do not form part of our licenses.
910
+
911
+ Considerations for licensors: Our public licenses are
912
+ intended for use by those authorized to give the public
913
+ permission to use material in ways otherwise restricted by
914
+ copyright and certain other rights. Our licenses are
915
+ irrevocable. Licensors should read and understand the terms
916
+ and conditions of the license they choose before applying it.
917
+ Licensors should also secure all rights necessary before
918
+ applying our licenses so that the public can reuse the
919
+ material as expected. Licensors should clearly mark any
920
+ material not subject to the license. This includes other CC-
921
+ licensed material, or material used under an exception or
922
+ limitation to copyright. More considerations for licensors:
923
+ wiki.creativecommons.org/Considerations_for_licensors
924
+
925
+ Considerations for the public: By using one of our public
926
+ licenses, a licensor grants the public permission to use the
927
+ licensed material under specified terms and conditions. If
928
+ the licensor's permission is not necessary for any reason--for
929
+ example, because of any applicable exception or limitation to
930
+ copyright--then that use is not regulated by the license. Our
931
+ licenses grant only permissions under copyright and certain
932
+ other rights that a licensor has authority to grant. Use of
933
+ the licensed material may still be restricted for other
934
+ reasons, including because others have copyright or other
935
+ rights in the material. A licensor may make special requests,
936
+ such as asking that all changes be marked or described.
937
+ Although not required by our licenses, you are encouraged to
938
+ respect those requests where reasonable. More considerations
939
+ for the public:
940
+ wiki.creativecommons.org/Considerations_for_licensees
941
+
942
+ =======================================================================
943
+
944
+ Creative Commons Attribution-ShareAlike 4.0 International Public
945
+ License
946
+
947
+ By exercising the Licensed Rights (defined below), You accept and agree
948
+ to be bound by the terms and conditions of this Creative Commons
949
+ Attribution-ShareAlike 4.0 International Public License ("Public
950
+ License"). To the extent this Public License may be interpreted as a
951
+ contract, You are granted the Licensed Rights in consideration of Your
952
+ acceptance of these terms and conditions, and the Licensor grants You
953
+ such rights in consideration of benefits the Licensor receives from
954
+ making the Licensed Material available under these terms and
955
+ conditions.
956
+
957
+
958
+ Section 1 -- Definitions.
959
+
960
+ a. Adapted Material means material subject to Copyright and Similar
961
+ Rights that is derived from or based upon the Licensed Material
962
+ and in which the Licensed Material is translated, altered,
963
+ arranged, transformed, or otherwise modified in a manner requiring
964
+ permission under the Copyright and Similar Rights held by the
965
+ Licensor. For purposes of this Public License, where the Licensed
966
+ Material is a musical work, performance, or sound recording,
967
+ Adapted Material is always produced where the Licensed Material is
968
+ synched in timed relation with a moving image.
969
+
970
+ b. Adapter's License means the license You apply to Your Copyright
971
+ and Similar Rights in Your contributions to Adapted Material in
972
+ accordance with the terms and conditions of this Public License.
973
+
974
+ c. BY-SA Compatible License means a license listed at
975
+ creativecommons.org/compatiblelicenses, approved by Creative
976
+ Commons as essentially the equivalent of this Public License.
977
+
978
+ d. Copyright and Similar Rights means copyright and/or similar rights
979
+ closely related to copyright including, without limitation,
980
+ performance, broadcast, sound recording, and Sui Generis Database
981
+ Rights, without regard to how the rights are labeled or
982
+ categorized. For purposes of this Public License, the rights
983
+ specified in Section 2(b)(1)-(2) are not Copyright and Similar
984
+ Rights.
985
+
986
+ e. Effective Technological Measures means those measures that, in the
987
+ absence of proper authority, may not be circumvented under laws
988
+ fulfilling obligations under Article 11 of the WIPO Copyright
989
+ Treaty adopted on December 20, 1996, and/or similar international
990
+ agreements.
991
+
992
+ f. Exceptions and Limitations means fair use, fair dealing, and/or
993
+ any other exception or limitation to Copyright and Similar Rights
994
+ that applies to Your use of the Licensed Material.
995
 
996
+ g. License Elements means the license attributes listed in the name
997
+ of a Creative Commons Public License. The License Elements of this
998
+ Public License are Attribution and ShareAlike.
999
 
1000
+ h. Licensed Material means the artistic or literary work, database,
1001
+ or other material to which the Licensor applied this Public
1002
+ License.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1003
 
1004
+ i. Licensed Rights means the rights granted to You subject to the
1005
+ terms and conditions of this Public License, which are limited to
1006
+ all Copyright and Similar Rights that apply to Your use of the
1007
+ Licensed Material and that the Licensor has authority to license.
 
1008
 
1009
+ j. Licensor means the individual(s) or entity(ies) granting rights
1010
+ under this Public License.
1011
 
1012
+ k. Share means to provide material to the public by any means or
1013
+ process that requires permission under the Licensed Rights, such
1014
+ as reproduction, public display, public performance, distribution,
1015
+ dissemination, communication, or importation, and to make material
1016
+ available to the public including in ways that members of the
1017
+ public may access the material from a place and at a time
1018
+ individually chosen by them.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1019
 
1020
+ l. Sui Generis Database Rights means rights other than copyright
1021
+ resulting from Directive 96/9/EC of the European Parliament and of
1022
+ the Council of 11 March 1996 on the legal protection of databases,
1023
+ as amended and/or succeeded, as well as other essentially
1024
+ equivalent rights anywhere in the world.
 
 
 
 
 
 
 
1025
 
1026
+ m. You means the individual or entity exercising the Licensed Rights
1027
+ under this Public License. Your has a corresponding meaning.
1028
+
1029
+
1030
+ Section 2 -- Scope.
1031
+
1032
+ a. License grant.
1033
+
1034
+ 1. Subject to the terms and conditions of this Public License,
1035
+ the Licensor hereby grants You a worldwide, royalty-free,
1036
+ non-sublicensable, non-exclusive, irrevocable license to
1037
+ exercise the Licensed Rights in the Licensed Material to:
1038
+
1039
+ a. reproduce and Share the Licensed Material, in whole or
1040
+ in part; and
1041
 
1042
+ b. produce, reproduce, and Share Adapted Material.
 
 
 
 
1043
 
1044
+ 2. Exceptions and Limitations. For the avoidance of doubt, where
1045
+ Exceptions and Limitations apply to Your use, this Public
1046
+ License does not apply, and You do not need to comply with
1047
+ its terms and conditions.
 
1048
 
1049
+ 3. Term. The term of this Public License is specified in Section
1050
+ 6(a).
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1051
 
1052
+ 4. Media and formats; technical modifications allowed. The
1053
+ Licensor authorizes You to exercise the Licensed Rights in
1054
+ all media and formats whether now known or hereafter created,
1055
+ and to make technical modifications necessary to do so. The
1056
+ Licensor waives and/or agrees not to assert any right or
1057
+ authority to forbid You from making technical modifications
1058
+ necessary to exercise the Licensed Rights, including
1059
+ technical modifications necessary to circumvent Effective
1060
+ Technological Measures. For purposes of this Public License,
1061
+ simply making modifications authorized by this Section 2(a)
1062
+ (4) never produces Adapted Material.
1063
+
1064
+ 5. Downstream recipients.
1065
+
1066
+ a. Offer from the Licensor -- Licensed Material. Every
1067
+ recipient of the Licensed Material automatically
1068
+ receives an offer from the Licensor to exercise the
1069
+ Licensed Rights under the terms and conditions of this
1070
+ Public License.
1071
+
1072
+ b. Additional offer from the Licensor -- Adapted Material.
1073
+ Every recipient of Adapted Material from You
1074
+ automatically receives an offer from the Licensor to
1075
+ exercise the Licensed Rights in the Adapted Material
1076
+ under the conditions of the Adapter's License You apply.
1077
+
1078
+ c. No downstream restrictions. You may not offer or impose
1079
+ any additional or different terms or conditions on, or
1080
+ apply any Effective Technological Measures to, the
1081
+ Licensed Material if doing so restricts exercise of the
1082
+ Licensed Rights by any recipient of the Licensed
1083
+ Material.
1084
+
1085
+ 6. No endorsement. Nothing in this Public License constitutes or
1086
+ may be construed as permission to assert or imply that You
1087
+ are, or that Your use of the Licensed Material is, connected
1088
+ with, or sponsored, endorsed, or granted official status by,
1089
+ the Licensor or others designated to receive attribution as
1090
+ provided in Section 3(a)(1)(A)(i).
1091
+
1092
+ b. Other rights.
1093
+
1094
+ 1. Moral rights, such as the right of integrity, are not
1095
+ licensed under this Public License, nor are publicity,
1096
+ privacy, and/or other similar personality rights; however, to
1097
+ the extent possible, the Licensor waives and/or agrees not to
1098
+ assert any such rights held by the Licensor to the limited
1099
+ extent necessary to allow You to exercise the Licensed
1100
+ Rights, but not otherwise.
1101
+
1102
+ 2. Patent and trademark rights are not licensed under this
1103
+ Public License.
1104
+
1105
+ 3. To the extent possible, the Licensor waives any right to
1106
+ collect royalties from You for the exercise of the Licensed
1107
+ Rights, whether directly or through a collecting society
1108
+ under any voluntary or waivable statutory or compulsory
1109
+ licensing scheme. In all other cases the Licensor expressly
1110
+ reserves any right to collect such royalties.
1111
+
1112
+
1113
+ Section 3 -- License Conditions.
1114
+
1115
+ Your exercise of the Licensed Rights is expressly made subject to the
1116
+ following conditions.
1117
+
1118
+ a. Attribution.
1119
+
1120
+ 1. If You Share the Licensed Material (including in modified
1121
+ form), You must:
1122
+
1123
+ a. retain the following if it is supplied by the Licensor
1124
+ with the Licensed Material:
1125
+
1126
+ i. identification of the creator(s) of the Licensed
1127
+ Material and any others designated to receive
1128
+ attribution, in any reasonable manner requested by
1129
+ the Licensor (including by pseudonym if
1130
+ designated);
1131
+
1132
+ ii. a copyright notice;
1133
+
1134
+ iii. a notice that refers to this Public License;
1135
+
1136
+ iv. a notice that refers to the disclaimer of
1137
+ warranties;
1138
+
1139
+ v. a URI or hyperlink to the Licensed Material to the
1140
+ extent reasonably practicable;
1141
+
1142
+ b. indicate if You modified the Licensed Material and
1143
+ retain an indication of any previous modifications; and
1144
+
1145
+ c. indicate the Licensed Material is licensed under this
1146
+ Public License, and include the text of, or the URI or
1147
+ hyperlink to, this Public License.
1148
+
1149
+ 2. You may satisfy the conditions in Section 3(a)(1) in any
1150
+ reasonable manner based on the medium, means, and context in
1151
+ which You Share the Licensed Material. For example, it may be
1152
+ reasonable to satisfy the conditions by providing a URI or
1153
+ hyperlink to a resource that includes the required
1154
+ information.
1155
+
1156
+ 3. If requested by the Licensor, You must remove any of the
1157
+ information required by Section 3(a)(1)(A) to the extent
1158
+ reasonably practicable.
1159
+
1160
+ b. ShareAlike.
1161
+
1162
+ In addition to the conditions in Section 3(a), if You Share
1163
+ Adapted Material You produce, the following conditions also apply.
1164
+
1165
+ 1. The Adapter's License You apply must be a Creative Commons
1166
+ license with the same License Elements, this version or
1167
+ later, or a BY-SA Compatible License.
1168
+
1169
+ 2. You must include the text of, or the URI or hyperlink to, the
1170
+ Adapter's License You apply. You may satisfy this condition
1171
+ in any reasonable manner based on the medium, means, and
1172
+ context in which You Share Adapted Material.
1173
+
1174
+ 3. You may not offer or impose any additional or different terms
1175
+ or conditions on, or apply any Effective Technological
1176
+ Measures to, Adapted Material that restrict exercise of the
1177
+ rights granted under the Adapter's License You apply.
1178
+
1179
+
1180
+ Section 4 -- Sui Generis Database Rights.
1181
+
1182
+ Where the Licensed Rights include Sui Generis Database Rights that
1183
+ apply to Your use of the Licensed Material:
1184
+
1185
+ a. for the avoidance of doubt, Section 2(a)(1) grants You the right
1186
+ to extract, reuse, reproduce, and Share all or a substantial
1187
+ portion of the contents of the database;
1188
+
1189
+ b. if You include all or a substantial portion of the database
1190
+ contents in a database in which You have Sui Generis Database
1191
+ Rights, then the database in which You have Sui Generis Database
1192
+ Rights (but not its individual contents) is Adapted Material,
1193
+
1194
+ including for purposes of Section 3(b); and
1195
+ c. You must comply with the conditions in Section 3(a) if You Share
1196
+ all or a substantial portion of the contents of the database.
1197
+
1198
+ For the avoidance of doubt, this Section 4 supplements and does not
1199
+ replace Your obligations under this Public License where the Licensed
1200
+ Rights include other Copyright and Similar Rights.
1201
+
1202
+
1203
+ Section 5 -- Disclaimer of Warranties and Limitation of Liability.
1204
+
1205
+ a. UNLESS OTHERWISE SEPARATELY UNDERTAKEN BY THE LICENSOR, TO THE
1206
+ EXTENT POSSIBLE, THE LICENSOR OFFERS THE LICENSED MATERIAL AS-IS
1207
+ AND AS-AVAILABLE, AND MAKES NO REPRESENTATIONS OR WARRANTIES OF
1208
+ ANY KIND CONCERNING THE LICENSED MATERIAL, WHETHER EXPRESS,
1209
+ IMPLIED, STATUTORY, OR OTHER. THIS INCLUDES, WITHOUT LIMITATION,
1210
+ WARRANTIES OF TITLE, MERCHANTABILITY, FITNESS FOR A PARTICULAR
1211
+ PURPOSE, NON-INFRINGEMENT, ABSENCE OF LATENT OR OTHER DEFECTS,
1212
+ ACCURACY, OR THE PRESENCE OR ABSENCE OF ERRORS, WHETHER OR NOT
1213
+ KNOWN OR DISCOVERABLE. WHERE DISCLAIMERS OF WARRANTIES ARE NOT
1214
+ ALLOWED IN FULL OR IN PART, THIS DISCLAIMER MAY NOT APPLY TO YOU.
1215
+
1216
+ b. TO THE EXTENT POSSIBLE, IN NO EVENT WILL THE LICENSOR BE LIABLE
1217
+ TO YOU ON ANY LEGAL THEORY (INCLUDING, WITHOUT LIMITATION,
1218
+ NEGLIGENCE) OR OTHERWISE FOR ANY DIRECT, SPECIAL, INDIRECT,
1219
+ INCIDENTAL, CONSEQUENTIAL, PUNITIVE, EXEMPLARY, OR OTHER LOSSES,
1220
+ COSTS, EXPENSES, OR DAMAGES ARISING OUT OF THIS PUBLIC LICENSE OR
1221
+ USE OF THE LICENSED MATERIAL, EVEN IF THE LICENSOR HAS BEEN
1222
+ ADVISED OF THE POSSIBILITY OF SUCH LOSSES, COSTS, EXPENSES, OR
1223
+ DAMAGES. WHERE A LIMITATION OF LIABILITY IS NOT ALLOWED IN FULL OR
1224
+ IN PART, THIS LIMITATION MAY NOT APPLY TO YOU.
1225
+
1226
+ c. The disclaimer of warranties and limitation of liability provided
1227
+ above shall be interpreted in a manner that, to the extent
1228
+ possible, most closely approximates an absolute disclaimer and
1229
+ waiver of all liability.
1230
+
1231
+
1232
+ Section 6 -- Term and Termination.
1233
+
1234
+ a. This Public License applies for the term of the Copyright and
1235
+ Similar Rights licensed here. However, if You fail to comply with
1236
+ this Public License, then Your rights under this Public License
1237
+ terminate automatically.
1238
+
1239
+ b. Where Your right to use the Licensed Material has terminated under
1240
+ Section 6(a), it reinstates:
1241
+
1242
+ 1. automatically as of the date the violation is cured, provided
1243
+ it is cured within 30 days of Your discovery of the
1244
+ violation; or
1245
+
1246
+ 2. upon express reinstatement by the Licensor.
1247
+
1248
+ For the avoidance of doubt, this Section 6(b) does not affect any
1249
+ right the Licensor may have to seek remedies for Your violations
1250
+ of this Public License.
1251
+
1252
+ c. For the avoidance of doubt, the Licensor may also offer the
1253
+ Licensed Material under separate terms or conditions or stop
1254
+ distributing the Licensed Material at any time; however, doing so
1255
+ will not terminate this Public License.
1256
+
1257
+ d. Sections 1, 5, 6, 7, and 8 survive termination of this Public
1258
+ License.
1259
+
1260
+
1261
+ Section 7 -- Other Terms and Conditions.
1262
+
1263
+ a. The Licensor shall not be bound by any additional or different
1264
+ terms or conditions communicated by You unless expressly agreed.
1265
+
1266
+ b. Any arrangements, understandings, or agreements regarding the
1267
+ Licensed Material not stated herein are separate from and
1268
+ independent of the terms and conditions of this Public License.
1269
+
1270
+
1271
+ Section 8 -- Interpretation.
1272
+
1273
+ a. For the avoidance of doubt, this Public License does not, and
1274
+ shall not be interpreted to, reduce, limit, restrict, or impose
1275
+ conditions on any use of the Licensed Material that could lawfully
1276
+ be made without permission under this Public License.
1277
+
1278
+ b. To the extent possible, if any provision of this Public License is
1279
+ deemed unenforceable, it shall be automatically reformed to the
1280
+ minimum extent necessary to make it enforceable. If the provision
1281
+ cannot be reformed, it shall be severed from this Public License
1282
+ without affecting the enforceability of the remaining terms and
1283
+ conditions.
1284
+
1285
+ c. No term or condition of this Public License will be waived and no
1286
+ failure to comply consented to unless expressly agreed to by the
1287
+ Licensor.
1288
+
1289
+ d. Nothing in this Public License constitutes or may be interpreted
1290
+ as a limitation upon, or waiver of, any privileges and immunities
1291
+ that apply to the Licensor or You, including from the legal
1292
+ processes of any jurisdiction or authority.
1293
+
1294
+
1295
+ =======================================================================
1296
+
1297
+ Creative Commons is not a party to its public
1298
+ licenses. Notwithstanding, Creative Commons may elect to apply one of
1299
+ its public licenses to material it publishes and in those instances
1300
+ will be considered the Licensor.” The text of the Creative Commons
1301
+ public licenses is dedicated to the public domain under the CC0 Public
1302
+ Domain Dedication. Except for the limited purpose of indicating that
1303
+ material is shared under a Creative Commons public license or as
1304
+ otherwise permitted by the Creative Commons policies published at
1305
+ creativecommons.org/policies, Creative Commons does not authorize the
1306
+ use of the trademark "Creative Commons" or any other trademark or logo
1307
+ of Creative Commons without its prior written consent including,
1308
+ without limitation, in connection with any unauthorized modifications
1309
+ to any of its public licenses or any other arrangements,
1310
+ understandings, or agreements concerning use of licensed material. For
1311
+ the avoidance of doubt, this paragraph does not form part of the
1312
+ public licenses.
1313
+
1314
+ Creative Commons may be contacted at creativecommons.org.
1315
+
1316
+ ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1317
 
1318
 
1319
 
README.md CHANGED
@@ -4,7 +4,7 @@ tags:
4
  - token-classification
5
  language:
6
  - da
7
- license: CC-BY-SA-4.0
8
  model-index:
9
  - name: da_core_news_trf
10
  results:
@@ -14,47 +14,47 @@ model-index:
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
- value: 0.8204081633
18
  - name: NER Recall
19
  type: recall
20
- value: 0.8375
21
  - name: NER F Score
22
  type: f_score
23
- value: 0.8288659794
24
  - task:
25
  name: POS
26
  type: token-classification
27
  metrics:
28
  - name: POS Accuracy
29
  type: accuracy
30
- value: 0.9797559086
31
  - task:
32
  name: SENTER
33
  type: token-classification
34
  metrics:
35
  - name: SENTER Precision
36
  type: precision
37
- value: 0.8653846154
38
  - name: SENTER Recall
39
  type: recall
40
- value: 0.8776595745
41
  - name: SENTER F Score
42
  type: f_score
43
- value: 0.8714788732
44
  - task:
45
  name: UNLABELED_DEPENDENCIES
46
  type: token-classification
47
  metrics:
48
  - name: Unlabeled Dependencies Accuracy
49
  type: accuracy
50
- value: 0.8764860189
51
  - task:
52
  name: LABELED_DEPENDENCIES
53
  type: token-classification
54
  metrics:
55
  - name: Labeled Dependencies Accuracy
56
  type: accuracy
57
- value: 0.8764860189
58
  ---
59
  ### Details: https://spacy.io/models/da#da_core_news_trf
60
 
@@ -63,12 +63,12 @@ Danish transformer pipeline (Maltehb/danish-bert-botxo). Components: transformer
63
  | Feature | Description |
64
  | --- | --- |
65
  | **Name** | `da_core_news_trf` |
66
- | **Version** | `3.1.0` |
67
- | **spaCy** | `>=3.1.0,<3.2.0` |
68
  | **Default Pipeline** | `transformer`, `morphologizer`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
69
  | **Components** | `transformer`, `morphologizer`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
70
  | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
71
- | **Sources** | [UD Danish DDT v2.5](https://github.com/UniversalDependencies/UD_Danish-DDT) (Johannsen, Anders; Martínez Alonso, Héctor; Plank, Barbara)<br />[DaNE](https://github.com/alexandrainst/danlp/blob/master/docs/datasets.md#danish-dependency-treebank-dane) (Rasmus Hvingelby, Amalie B. Pauli, Maria Barrett, Christina Rosted, Lasse M. Lidegaard, Anders Søgaard)<br />[Lemmatization Lists](https://github.com/michmech/lemmatization-lists/) (Michal Měchura)<br />[Maltehb/danish-bert-botxo](https://huggingface.co/Maltehb/danish-bert-botxo) (BotXO.ai) |
72
  | **License** | `CC BY-SA 4.0` |
73
  | **Author** | [Explosion](https://explosion.ai) |
74
 
@@ -76,12 +76,12 @@ Danish transformer pipeline (Maltehb/danish-bert-botxo). Components: transformer
76
 
77
  <details>
78
 
79
- <summary>View label scheme (192 labels for 3 components)</summary>
80
 
81
  | Component | Labels |
82
  | --- | --- |
83
  | **`morphologizer`** | `AdpType=Prep\|POS=ADP`, `Definite=Ind\|Gender=Com\|Number=Sing\|POS=NOUN`, `Mood=Ind\|POS=AUX\|Tense=Pres\|VerbForm=Fin\|Voice=Act`, `POS=PROPN`, `Definite=Ind\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Definite=Def\|Gender=Neut\|Number=Sing\|POS=NOUN`, `POS=SCONJ`, `Definite=Def\|Gender=Com\|Number=Sing\|POS=NOUN`, `Mood=Ind\|POS=VERB\|Tense=Pres\|VerbForm=Fin\|Voice=Act`, `POS=ADV`, `Number=Plur\|POS=DET\|PronType=Dem`, `Degree=Pos\|Number=Plur\|POS=ADJ`, `Definite=Ind\|Gender=Com\|Number=Plur\|POS=NOUN`, `POS=PUNCT`, `POS=CCONJ`, `Definite=Ind\|Degree=Cmp\|Number=Sing\|POS=ADJ`, `Degree=Cmp\|POS=ADJ`, `POS=PRON\|PartType=Inf`, `Gender=Com\|Number=Sing\|POS=DET\|PronType=Ind`, `Definite=Ind\|Degree=Pos\|Number=Sing\|POS=ADJ`, `Case=Acc\|Gender=Neut\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Definite=Ind\|Gender=Neut\|Number=Plur\|POS=NOUN`, `Definite=Def\|Degree=Pos\|Number=Sing\|POS=ADJ`, `Gender=Neut\|Number=Sing\|POS=DET\|PronType=Dem`, `Degree=Pos\|POS=ADV`, `Definite=Def\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Definite=Ind\|Gender=Neut\|Number=Sing\|POS=NOUN`, `POS=PRON\|PronType=Dem`, `NumType=Card\|POS=NUM`, `Definite=Ind\|Degree=Pos\|Gender=Neut\|Number=Sing\|POS=ADJ`, `Case=Acc\|Gender=Com\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Degree=Pos\|Gender=Com\|Number=Sing\|POS=ADJ`, `Case=Nom\|Gender=Com\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `NumType=Ord\|POS=ADJ`, `Gender=Com\|Number=Sing\|Number[psor]=Sing\|POS=DET\|Person=3\|Poss=Yes\|PronType=Prs\|Reflex=Yes`, `Mood=Ind\|POS=AUX\|Tense=Past\|VerbForm=Fin\|Voice=Act`, `POS=VERB\|VerbForm=Inf\|Voice=Act`, `Mood=Ind\|POS=VERB\|Tense=Past\|VerbForm=Fin\|Voice=Act`, `POS=NOUN`, `Mood=Ind\|POS=VERB\|Tense=Pres\|VerbForm=Fin\|Voice=Pass`, `POS=ADP\|PartType=Inf`, `Degree=Pos\|POS=ADJ`, `Definite=Def\|Gender=Com\|Number=Plur\|POS=NOUN`, `Number[psor]=Sing\|POS=DET\|Person=3\|Poss=Yes\|PronType=Prs`, `Case=Gen\|Definite=Def\|Gender=Com\|Number=Sing\|POS=NOUN`, `POS=AUX\|VerbForm=Inf\|Voice=Act`, `Definite=Ind\|Degree=Pos\|Gender=Com\|Number=Sing\|POS=ADJ`, `Gender=Com\|Number=Sing\|POS=DET\|PronType=Dem`, `Number=Plur\|POS=DET\|PronType=Ind`, `Gender=Com\|Number=Sing\|POS=PRON\|PronType=Ind`, `Case=Acc\|POS=PRON\|Person=3\|PronType=Prs\|Reflex=Yes`, `POS=PART\|PartType=Inf`, `Gender=Neut\|Number=Sing\|POS=DET\|PronType=Ind`, `Case=Acc\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Case=Gen\|Definite=Def\|Gender=Neut\|Number=Sing\|POS=NOUN`, `Case=Nom\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Case=Nom\|Gender=Com\|Number=Sing\|POS=PRON\|Person=1\|PronType=Prs`, `Case=Nom\|Gender=Com\|POS=PRON\|PronType=Ind`, `Gender=Neut\|Number=Sing\|POS=PRON\|PronType=Ind`, `Mood=Imp\|POS=VERB`, `Gender=Com\|Number=Sing\|Number[psor]=Sing\|POS=DET\|Person=1\|Poss=Yes\|PronType=Prs`, `Definite=Ind\|Number=Sing\|POS=AUX\|Tense=Past\|VerbForm=Part`, `POS=X`, `Case=Nom\|Gender=Com\|Number=Plur\|POS=PRON\|Person=1\|PronType=Prs`, `Case=Gen\|Definite=Def\|Gender=Com\|Number=Plur\|POS=NOUN`, `POS=VERB\|Tense=Pres\|VerbForm=Part`, `Number=Plur\|POS=PRON\|PronType=Int,Rel`, `POS=VERB\|VerbForm=Inf\|Voice=Pass`, `Case=Gen\|Definite=Ind\|Gender=Com\|Number=Sing\|POS=NOUN`, `Degree=Cmp\|POS=ADV`, `POS=ADV\|PartType=Inf`, `Degree=Sup\|POS=ADV`, `Number=Plur\|POS=PRON\|PronType=Dem`, `Number=Plur\|POS=PRON\|PronType=Ind`, `Definite=Def\|Gender=Neut\|Number=Plur\|POS=NOUN`, `Case=Acc\|Gender=Com\|Number=Sing\|POS=PRON\|Person=1\|PronType=Prs`, `Case=Gen\|POS=PROPN`, `POS=ADP`, `Degree=Cmp\|Number=Plur\|POS=ADJ`, `Definite=Def\|Degree=Sup\|POS=ADJ`, `Gender=Neut\|Number=Sing\|Number[psor]=Sing\|POS=DET\|Person=1\|Poss=Yes\|PronType=Prs`, `Degree=Pos\|Number=Sing\|POS=ADJ`, `Number=Plur\|Number[psor]=Sing\|POS=DET\|Person=3\|Poss=Yes\|PronType=Prs\|Reflex=Yes`, `Gender=Com\|Number=Sing\|Number[psor]=Plur\|POS=DET\|Person=1\|Poss=Yes\|PronType=Prs\|Style=Form`, `Number=Plur\|POS=PRON\|PronType=Rcp`, `Case=Gen\|Degree=Cmp\|POS=ADJ`, `Case=Gen\|Definite=Def\|Gender=Neut\|Number=Plur\|POS=NOUN`, `Number[psor]=Plur\|POS=DET\|Person=3\|Poss=Yes\|PronType=Prs`, `POS=INTJ`, `Number=Plur\|Number[psor]=Sing\|POS=DET\|Person=1\|Poss=Yes\|PronType=Prs`, `Degree=Pos\|Gender=Neut\|Number=Sing\|POS=ADJ`, `Gender=Neut\|Number=Sing\|Number[psor]=Plur\|POS=DET\|Person=1\|Poss=Yes\|PronType=Prs\|Style=Form`, `Case=Acc\|Gender=Com\|Number=Sing\|POS=PRON\|Person=2\|PronType=Prs`, `Gender=Com\|Number=Sing\|Number[psor]=Sing\|POS=DET\|Person=2\|Poss=Yes\|PronType=Prs`, `Case=Gen\|Definite=Ind\|Gender=Neut\|Number=Plur\|POS=NOUN`, `Number=Sing\|POS=PRON\|PronType=Int,Rel`, `Number=Plur\|Number[psor]=Plur\|POS=DET\|Person=1\|Poss=Yes\|PronType=Prs\|Style=Form`, `Gender=Neut\|Number=Sing\|POS=PRON\|PronType=Int,Rel`, `Definite=Def\|Degree=Sup\|Number=Plur\|POS=ADJ`, `Case=Nom\|Gender=Com\|Number=Sing\|POS=PRON\|Person=2\|PronType=Prs`, `Gender=Neut\|Number=Sing\|Number[psor]=Sing\|POS=DET\|Person=3\|Poss=Yes\|PronType=Prs\|Reflex=Yes`, `Definite=Ind\|Number=Sing\|POS=NOUN`, `Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Number=Plur\|Number[psor]=Sing\|POS=PRON\|Person=3\|Poss=Yes\|PronType=Prs\|Reflex=Yes`, `POS=SYM`, `Case=Nom\|Gender=Com\|POS=PRON\|Person=2\|Polite=Form\|PronType=Prs`, `Degree=Sup\|POS=ADJ`, `Number=Plur\|POS=DET\|PronType=Ind\|Style=Arch`, `Case=Gen\|Gender=Com\|Number=Sing\|POS=DET\|PronType=Dem`, `Foreign=Yes\|POS=X`, `POS=DET\|Person=2\|Polite=Form\|Poss=Yes\|PronType=Prs`, `Gender=Neut\|Number=Sing\|POS=PRON\|PronType=Dem`, `Case=Acc\|Gender=Com\|Number=Plur\|POS=PRON\|Person=1\|PronType=Prs`, `Case=Gen\|Definite=Ind\|Gender=Neut\|Number=Sing\|POS=NOUN`, `Case=Gen\|POS=PRON\|PronType=Int,Rel`, `Gender=Com\|Number=Sing\|POS=PRON\|PronType=Dem`, `Abbr=Yes\|POS=X`, `Case=Gen\|Definite=Ind\|Gender=Com\|Number=Plur\|POS=NOUN`, `Definite=Def\|Degree=Abs\|POS=ADJ`, `Definite=Ind\|Degree=Sup\|Number=Sing\|POS=ADJ`, `Definite=Ind\|POS=NOUN`, `Gender=Com\|Number=Plur\|POS=NOUN`, `Number[psor]=Plur\|POS=DET\|Person=1\|Poss=Yes\|PronType=Prs`, `Gender=Com\|POS=PRON\|PronType=Int,Rel`, `Case=Nom\|Gender=Com\|Number=Plur\|POS=PRON\|Person=2\|PronType=Prs`, `Degree=Abs\|POS=ADV`, `POS=VERB\|VerbForm=Ger`, `POS=VERB\|Tense=Past\|VerbForm=Part`, `Definite=Def\|Degree=Sup\|Number=Sing\|POS=ADJ`, `Number=Plur\|Number[psor]=Plur\|POS=PRON\|Person=1\|Poss=Yes\|PronType=Prs\|Style=Form`, `Case=Gen\|Definite=Def\|Degree=Pos\|Number=Sing\|POS=ADJ`, `Case=Gen\|Degree=Pos\|Number=Plur\|POS=ADJ`, `Case=Acc\|Gender=Com\|POS=PRON\|Person=2\|Polite=Form\|PronType=Prs`, `Gender=Com\|Number=Sing\|POS=PRON\|PronType=Int,Rel`, `POS=VERB\|Tense=Pres`, `Case=Gen\|Number=Plur\|POS=DET\|PronType=Ind`, `Number[psor]=Plur\|POS=DET\|Person=2\|Poss=Yes\|PronType=Prs`, `POS=PRON\|Person=2\|Polite=Form\|Poss=Yes\|PronType=Prs`, `Gender=Neut\|Number=Sing\|Number[psor]=Sing\|POS=DET\|Person=2\|Poss=Yes\|PronType=Prs`, `POS=AUX\|Tense=Pres\|VerbForm=Part`, `Mood=Ind\|POS=VERB\|Tense=Past\|VerbForm=Fin\|Voice=Pass`, `Gender=Com\|Number=Sing\|Number[psor]=Sing\|POS=PRON\|Person=3\|Poss=Yes\|PronType=Prs\|Reflex=Yes`, `Degree=Sup\|Number=Plur\|POS=ADJ`, `Case=Acc\|Gender=Com\|Number=Plur\|POS=PRON\|Person=2\|PronType=Prs`, `Gender=Neut\|Number=Sing\|Number[psor]=Sing\|POS=PRON\|Person=3\|Poss=Yes\|PronType=Prs\|Reflex=Yes`, `Definite=Ind\|Number=Plur\|POS=NOUN`, `Case=Gen\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Mood=Imp\|POS=AUX`, `Gender=Com\|Number=Sing\|Number[psor]=Sing\|POS=PRON\|Person=1\|Poss=Yes\|PronType=Prs`, `Number[psor]=Sing\|POS=PRON\|Person=3\|Poss=Yes\|PronType=Prs`, `Definite=Def\|Gender=Com\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Number=Plur\|Number[psor]=Sing\|POS=DET\|Person=2\|Poss=Yes\|PronType=Prs`, `Case=Gen\|Gender=Com\|Number=Sing\|POS=DET\|PronType=Ind`, `Case=Gen\|POS=NOUN`, `Number[psor]=Plur\|POS=PRON\|Person=3\|Poss=Yes\|PronType=Prs`, `POS=DET\|PronType=Dem`, `Definite=Def\|Number=Plur\|POS=NOUN` |
84
- | **`parser`** | `ROOT`, `acl:relcl`, `advcl`, `advmod`, `amod`, `appos`, `aux`, `case`, `cc`, `ccomp`, `compound:prt`, `conj`, `cop`, `dep`, `det`, `expl`, `fixed`, `flat`, `iobj`, `list`, `mark`, `nmod`, `nmod:poss`, `nsubj`, `nummod`, `obj`, `obl`, `obl:loc`, `obl:tmod`, `punct`, `xcomp` |
85
  | **`ner`** | `LOC`, `MISC`, `ORG`, `PER` |
86
 
87
  </details>
@@ -91,15 +91,21 @@ Danish transformer pipeline (Maltehb/danish-bert-botxo). Components: transformer
91
  | Type | Score |
92
  | --- | --- |
93
  | `TOKEN_ACC` | 99.95 |
94
- | `TAG_ACC` | 97.98 |
95
- | `POS_ACC` | 97.98 |
96
- | `MORPH_ACC` | 97.74 |
 
 
 
 
 
 
 
 
 
 
 
97
  | `LEMMA_ACC` | 84.91 |
98
- | `DEP_UAS` | 87.65 |
99
- | `DEP_LAS` | 85.04 |
100
- | `ENTS_P` | 82.04 |
101
- | `ENTS_R` | 83.75 |
102
- | `ENTS_F` | 82.89 |
103
- | `SENTS_P` | 86.54 |
104
- | `SENTS_R` | 87.77 |
105
- | `SENTS_F` | 87.15 |
 
4
  - token-classification
5
  language:
6
  - da
7
+ license: cc-by-sa-4.0
8
  model-index:
9
  - name: da_core_news_trf
10
  results:
 
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
+ value: 0.7784313725
18
  - name: NER Recall
19
  type: recall
20
+ value: 0.8270833333
21
  - name: NER F Score
22
  type: f_score
23
+ value: 0.802020202
24
  - task:
25
  name: POS
26
  type: token-classification
27
  metrics:
28
  - name: POS Accuracy
29
  type: accuracy
30
+ value: 0.9734127561
31
  - task:
32
  name: SENTER
33
  type: token-classification
34
  metrics:
35
  - name: SENTER Precision
36
  type: precision
37
+ value: 0.8057921635
38
  - name: SENTER Recall
39
  type: recall
40
+ value: 0.8386524823
41
  - name: SENTER F Score
42
  type: f_score
43
+ value: 0.8218940052
44
  - task:
45
  name: UNLABELED_DEPENDENCIES
46
  type: token-classification
47
  metrics:
48
  - name: Unlabeled Dependencies Accuracy
49
  type: accuracy
50
+ value: 0.8639535533
51
  - task:
52
  name: LABELED_DEPENDENCIES
53
  type: token-classification
54
  metrics:
55
  - name: Labeled Dependencies Accuracy
56
  type: accuracy
57
+ value: 0.8639535533
58
  ---
59
  ### Details: https://spacy.io/models/da#da_core_news_trf
60
 
 
63
  | Feature | Description |
64
  | --- | --- |
65
  | **Name** | `da_core_news_trf` |
66
+ | **Version** | `3.2.0` |
67
+ | **spaCy** | `>=3.2.0,<3.3.0` |
68
  | **Default Pipeline** | `transformer`, `morphologizer`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
69
  | **Components** | `transformer`, `morphologizer`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
70
  | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
71
+ | **Sources** | [UD Danish DDT v2.8](https://github.com/UniversalDependencies/UD_Danish-DDT) (Johannsen, Anders; Martínez Alonso, Héctor; Plank, Barbara)<br />[DaNE](https://github.com/alexandrainst/danlp/blob/master/docs/datasets.md#danish-dependency-treebank-dane) (Rasmus Hvingelby, Amalie B. Pauli, Maria Barrett, Christina Rosted, Lasse M. Lidegaard, Anders Søgaard)<br />[Sprogteknologisk orddatabase over det danske sprog](https://cst.ku.dk/sto_ordbase/) (Center for Language Technology, University of Copenhagen)<br />[Maltehb/danish-bert-botxo](https://huggingface.co/Maltehb/danish-bert-botxo) (BotXO.ai) |
72
  | **License** | `CC BY-SA 4.0` |
73
  | **Author** | [Explosion](https://explosion.ai) |
74
 
 
76
 
77
  <details>
78
 
79
+ <summary>View label scheme (193 labels for 3 components)</summary>
80
 
81
  | Component | Labels |
82
  | --- | --- |
83
  | **`morphologizer`** | `AdpType=Prep\|POS=ADP`, `Definite=Ind\|Gender=Com\|Number=Sing\|POS=NOUN`, `Mood=Ind\|POS=AUX\|Tense=Pres\|VerbForm=Fin\|Voice=Act`, `POS=PROPN`, `Definite=Ind\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Definite=Def\|Gender=Neut\|Number=Sing\|POS=NOUN`, `POS=SCONJ`, `Definite=Def\|Gender=Com\|Number=Sing\|POS=NOUN`, `Mood=Ind\|POS=VERB\|Tense=Pres\|VerbForm=Fin\|Voice=Act`, `POS=ADV`, `Number=Plur\|POS=DET\|PronType=Dem`, `Degree=Pos\|Number=Plur\|POS=ADJ`, `Definite=Ind\|Gender=Com\|Number=Plur\|POS=NOUN`, `POS=PUNCT`, `POS=CCONJ`, `Definite=Ind\|Degree=Cmp\|Number=Sing\|POS=ADJ`, `Degree=Cmp\|POS=ADJ`, `POS=PRON\|PartType=Inf`, `Gender=Com\|Number=Sing\|POS=DET\|PronType=Ind`, `Definite=Ind\|Degree=Pos\|Number=Sing\|POS=ADJ`, `Case=Acc\|Gender=Neut\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Definite=Ind\|Gender=Neut\|Number=Plur\|POS=NOUN`, `Definite=Def\|Degree=Pos\|Number=Sing\|POS=ADJ`, `Gender=Neut\|Number=Sing\|POS=DET\|PronType=Dem`, `Degree=Pos\|POS=ADV`, `Definite=Def\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Definite=Ind\|Gender=Neut\|Number=Sing\|POS=NOUN`, `POS=PRON\|PronType=Dem`, `NumType=Card\|POS=NUM`, `Definite=Ind\|Degree=Pos\|Gender=Neut\|Number=Sing\|POS=ADJ`, `Case=Acc\|Gender=Com\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Degree=Pos\|Gender=Com\|Number=Sing\|POS=ADJ`, `Case=Nom\|Gender=Com\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `NumType=Ord\|POS=ADJ`, `Gender=Com\|Number=Sing\|Number[psor]=Sing\|POS=DET\|Person=3\|Poss=Yes\|PronType=Prs\|Reflex=Yes`, `Mood=Ind\|POS=AUX\|Tense=Past\|VerbForm=Fin\|Voice=Act`, `POS=VERB\|VerbForm=Inf\|Voice=Act`, `Mood=Ind\|POS=VERB\|Tense=Past\|VerbForm=Fin\|Voice=Act`, `POS=NOUN`, `Mood=Ind\|POS=VERB\|Tense=Pres\|VerbForm=Fin\|Voice=Pass`, `POS=ADP\|PartType=Inf`, `Degree=Pos\|POS=ADJ`, `Definite=Def\|Gender=Com\|Number=Plur\|POS=NOUN`, `Number[psor]=Sing\|POS=DET\|Person=3\|Poss=Yes\|PronType=Prs`, `Case=Gen\|Definite=Def\|Gender=Com\|Number=Sing\|POS=NOUN`, `POS=AUX\|VerbForm=Inf\|Voice=Act`, `Definite=Ind\|Degree=Pos\|Gender=Com\|Number=Sing\|POS=ADJ`, `Gender=Com\|Number=Sing\|POS=DET\|PronType=Dem`, `Number=Plur\|POS=DET\|PronType=Ind`, `Gender=Com\|Number=Sing\|POS=PRON\|PronType=Ind`, `Case=Acc\|POS=PRON\|Person=3\|PronType=Prs\|Reflex=Yes`, `POS=PART\|PartType=Inf`, `Gender=Neut\|Number=Sing\|POS=DET\|PronType=Ind`, `Case=Acc\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Case=Gen\|Definite=Def\|Gender=Neut\|Number=Sing\|POS=NOUN`, `Case=Nom\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Case=Nom\|Gender=Com\|Number=Sing\|POS=PRON\|Person=1\|PronType=Prs`, `Case=Nom\|Gender=Com\|POS=PRON\|PronType=Ind`, `Gender=Neut\|Number=Sing\|POS=PRON\|PronType=Ind`, `Mood=Imp\|POS=VERB`, `Gender=Com\|Number=Sing\|Number[psor]=Sing\|POS=DET\|Person=1\|Poss=Yes\|PronType=Prs`, `Definite=Ind\|Number=Sing\|POS=AUX\|Tense=Past\|VerbForm=Part`, `POS=X`, `Case=Nom\|Gender=Com\|Number=Plur\|POS=PRON\|Person=1\|PronType=Prs`, `Case=Gen\|Definite=Def\|Gender=Com\|Number=Plur\|POS=NOUN`, `POS=VERB\|Tense=Pres\|VerbForm=Part`, `Number=Plur\|POS=PRON\|PronType=Int,Rel`, `POS=VERB\|VerbForm=Inf\|Voice=Pass`, `Case=Gen\|Definite=Ind\|Gender=Com\|Number=Sing\|POS=NOUN`, `Degree=Cmp\|POS=ADV`, `POS=ADV\|PartType=Inf`, `Degree=Sup\|POS=ADV`, `Number=Plur\|POS=PRON\|PronType=Dem`, `Number=Plur\|POS=PRON\|PronType=Ind`, `Definite=Def\|Gender=Neut\|Number=Plur\|POS=NOUN`, `Case=Acc\|Gender=Com\|Number=Sing\|POS=PRON\|Person=1\|PronType=Prs`, `Case=Gen\|POS=PROPN`, `POS=ADP`, `Degree=Cmp\|Number=Plur\|POS=ADJ`, `Definite=Def\|Degree=Sup\|POS=ADJ`, `Gender=Neut\|Number=Sing\|Number[psor]=Sing\|POS=DET\|Person=1\|Poss=Yes\|PronType=Prs`, `Degree=Pos\|Number=Sing\|POS=ADJ`, `Number=Plur\|Number[psor]=Sing\|POS=DET\|Person=3\|Poss=Yes\|PronType=Prs\|Reflex=Yes`, `Gender=Com\|Number=Sing\|Number[psor]=Plur\|POS=DET\|Person=1\|Poss=Yes\|PronType=Prs\|Style=Form`, `Number=Plur\|POS=PRON\|PronType=Rcp`, `Case=Gen\|Degree=Cmp\|POS=ADJ`, `Case=Gen\|Definite=Def\|Gender=Neut\|Number=Plur\|POS=NOUN`, `Number[psor]=Plur\|POS=DET\|Person=3\|Poss=Yes\|PronType=Prs`, `POS=INTJ`, `Number=Plur\|Number[psor]=Sing\|POS=DET\|Person=1\|Poss=Yes\|PronType=Prs`, `Degree=Pos\|Gender=Neut\|Number=Sing\|POS=ADJ`, `Gender=Neut\|Number=Sing\|Number[psor]=Plur\|POS=DET\|Person=1\|Poss=Yes\|PronType=Prs\|Style=Form`, `Case=Acc\|Gender=Com\|Number=Sing\|POS=PRON\|Person=2\|PronType=Prs`, `Gender=Com\|Number=Sing\|Number[psor]=Sing\|POS=DET\|Person=2\|Poss=Yes\|PronType=Prs`, `Case=Gen\|Definite=Ind\|Gender=Neut\|Number=Plur\|POS=NOUN`, `Number=Sing\|POS=PRON\|PronType=Int,Rel`, `Number=Plur\|Number[psor]=Plur\|POS=DET\|Person=1\|Poss=Yes\|PronType=Prs\|Style=Form`, `Gender=Neut\|Number=Sing\|POS=PRON\|PronType=Int,Rel`, `Definite=Def\|Degree=Sup\|Number=Plur\|POS=ADJ`, `Case=Nom\|Gender=Com\|Number=Sing\|POS=PRON\|Person=2\|PronType=Prs`, `Gender=Neut\|Number=Sing\|Number[psor]=Sing\|POS=DET\|Person=3\|Poss=Yes\|PronType=Prs\|Reflex=Yes`, `Definite=Ind\|Number=Sing\|POS=NOUN`, `Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Number=Plur\|Number[psor]=Sing\|POS=PRON\|Person=3\|Poss=Yes\|PronType=Prs\|Reflex=Yes`, `POS=SYM`, `Case=Nom\|Gender=Com\|POS=PRON\|Person=2\|Polite=Form\|PronType=Prs`, `Degree=Sup\|POS=ADJ`, `Number=Plur\|POS=DET\|PronType=Ind\|Style=Arch`, `Case=Gen\|Gender=Com\|Number=Sing\|POS=DET\|PronType=Dem`, `Foreign=Yes\|POS=X`, `POS=DET\|Person=2\|Polite=Form\|Poss=Yes\|PronType=Prs`, `Gender=Neut\|Number=Sing\|POS=PRON\|PronType=Dem`, `Case=Acc\|Gender=Com\|Number=Plur\|POS=PRON\|Person=1\|PronType=Prs`, `Case=Gen\|Definite=Ind\|Gender=Neut\|Number=Sing\|POS=NOUN`, `Case=Gen\|POS=PRON\|PronType=Int,Rel`, `Gender=Com\|Number=Sing\|POS=PRON\|PronType=Dem`, `Abbr=Yes\|POS=X`, `Case=Gen\|Definite=Ind\|Gender=Com\|Number=Plur\|POS=NOUN`, `Definite=Def\|Degree=Abs\|POS=ADJ`, `Definite=Ind\|Degree=Sup\|Number=Sing\|POS=ADJ`, `Definite=Ind\|POS=NOUN`, `Gender=Com\|Number=Plur\|POS=NOUN`, `Number[psor]=Plur\|POS=DET\|Person=1\|Poss=Yes\|PronType=Prs`, `Gender=Com\|POS=PRON\|PronType=Int,Rel`, `Case=Nom\|Gender=Com\|Number=Plur\|POS=PRON\|Person=2\|PronType=Prs`, `Degree=Abs\|POS=ADV`, `POS=VERB\|VerbForm=Ger`, `POS=VERB\|Tense=Past\|VerbForm=Part`, `Definite=Def\|Degree=Sup\|Number=Sing\|POS=ADJ`, `Number=Plur\|Number[psor]=Plur\|POS=PRON\|Person=1\|Poss=Yes\|PronType=Prs\|Style=Form`, `Case=Gen\|Definite=Def\|Degree=Pos\|Number=Sing\|POS=ADJ`, `Case=Gen\|Degree=Pos\|Number=Plur\|POS=ADJ`, `Case=Acc\|Gender=Com\|POS=PRON\|Person=2\|Polite=Form\|PronType=Prs`, `Gender=Com\|Number=Sing\|POS=PRON\|PronType=Int,Rel`, `POS=VERB\|Tense=Pres`, `Case=Gen\|Number=Plur\|POS=DET\|PronType=Ind`, `Number[psor]=Plur\|POS=DET\|Person=2\|Poss=Yes\|PronType=Prs`, `POS=PRON\|Person=2\|Polite=Form\|Poss=Yes\|PronType=Prs`, `Gender=Neut\|Number=Sing\|Number[psor]=Sing\|POS=DET\|Person=2\|Poss=Yes\|PronType=Prs`, `POS=AUX\|Tense=Pres\|VerbForm=Part`, `Mood=Ind\|POS=VERB\|Tense=Past\|VerbForm=Fin\|Voice=Pass`, `Gender=Com\|Number=Sing\|Number[psor]=Sing\|POS=PRON\|Person=3\|Poss=Yes\|PronType=Prs\|Reflex=Yes`, `Degree=Sup\|Number=Plur\|POS=ADJ`, `Case=Acc\|Gender=Com\|Number=Plur\|POS=PRON\|Person=2\|PronType=Prs`, `Gender=Neut\|Number=Sing\|Number[psor]=Sing\|POS=PRON\|Person=3\|Poss=Yes\|PronType=Prs\|Reflex=Yes`, `Definite=Ind\|Number=Plur\|POS=NOUN`, `Case=Gen\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Mood=Imp\|POS=AUX`, `Gender=Com\|Number=Sing\|Number[psor]=Sing\|POS=PRON\|Person=1\|Poss=Yes\|PronType=Prs`, `Number[psor]=Sing\|POS=PRON\|Person=3\|Poss=Yes\|PronType=Prs`, `Definite=Def\|Gender=Com\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Number=Plur\|Number[psor]=Sing\|POS=DET\|Person=2\|Poss=Yes\|PronType=Prs`, `Case=Gen\|Gender=Com\|Number=Sing\|POS=DET\|PronType=Ind`, `Case=Gen\|POS=NOUN`, `Number[psor]=Plur\|POS=PRON\|Person=3\|Poss=Yes\|PronType=Prs`, `POS=DET\|PronType=Dem`, `Definite=Def\|Number=Plur\|POS=NOUN` |
84
+ | **`parser`** | `ROOT`, `acl:relcl`, `advcl`, `advmod`, `advmod:lmod`, `amod`, `appos`, `aux`, `case`, `cc`, `ccomp`, `compound:prt`, `conj`, `cop`, `dep`, `det`, `expl`, `fixed`, `flat`, `iobj`, `list`, `mark`, `nmod`, `nmod:poss`, `nsubj`, `nummod`, `obj`, `obl`, `obl:lmod`, `obl:tmod`, `punct`, `xcomp` |
85
  | **`ner`** | `LOC`, `MISC`, `ORG`, `PER` |
86
 
87
  </details>
 
91
  | Type | Score |
92
  | --- | --- |
93
  | `TOKEN_ACC` | 99.95 |
94
+ | `TOKEN_P` | 99.78 |
95
+ | `TOKEN_R` | 99.75 |
96
+ | `TOKEN_F` | 99.76 |
97
+ | `POS_ACC` | 97.34 |
98
+ | `MORPH_ACC` | 97.00 |
99
+ | `MORPH_MICRO_P` | 98.74 |
100
+ | `MORPH_MICRO_R` | 97.64 |
101
+ | `MORPH_MICRO_F` | 98.19 |
102
+ | `SENTS_P` | 80.58 |
103
+ | `SENTS_R` | 83.87 |
104
+ | `SENTS_F` | 82.19 |
105
+ | `DEP_UAS` | 86.40 |
106
+ | `DEP_LAS` | 83.27 |
107
+ | `TAG_ACC` | 97.34 |
108
  | `LEMMA_ACC` | 84.91 |
109
+ | `ENTS_P` | 77.84 |
110
+ | `ENTS_R` | 82.71 |
111
+ | `ENTS_F` | 80.20 |
 
 
 
 
 
accuracy.json CHANGED
@@ -1,88 +1,83 @@
1
  {
2
  "token_acc": 0.9994672349,
3
- "tag_acc": 0.9797559086,
4
- "pos_acc": 0.9797559086,
5
- "morph_acc": 0.9774312282,
6
- "lemma_acc": 0.8491041162,
7
- "dep_uas": 0.8764860189,
8
- "dep_las": 0.8503822758,
9
- "ents_p": 0.8204081633,
10
- "ents_r": 0.8375,
11
- "ents_f": 0.8288659794,
12
- "sents_p": 0.8653846154,
13
- "sents_r": 0.8776595745,
14
- "sents_f": 0.8714788732,
15
- "speed": 1719.0691313848,
16
  "morph_per_feat": {
17
  "Mood": {
18
- "p": 0.9933142311,
19
- "r": 0.9914204004,
20
- "f": 0.9923664122
21
  },
22
  "Tense": {
23
- "p": 0.983495874,
24
- "r": 0.9871987952,
25
- "f": 0.9853438557
26
  },
27
  "VerbForm": {
28
- "p": 0.9834862385,
29
- "r": 0.9840881273,
30
- "f": 0.9837870909
31
  },
32
  "Voice": {
33
- "p": 0.994011976,
34
- "r": 0.9925261584,
35
- "f": 0.9932685116
36
  },
37
  "Definite": {
38
- "p": 0.9876592357,
39
- "r": 0.9802449625,
40
- "f": 0.9839381321
41
  },
42
  "Gender": {
43
- "p": 0.983277592,
44
- "r": 0.9770687936,
45
- "f": 0.9801633606
46
  },
47
  "Number": {
48
- "p": 0.9884484117,
49
- "r": 0.9820031299,
50
- "f": 0.9852152296
51
  },
52
  "AdpType": {
53
- "p": 1.0,
54
- "r": 0.9955791335,
55
- "f": 0.9977846699
56
  },
57
  "PartType": {
58
  "p": 1.0,
59
- "r": 0.9902597403,
60
- "f": 0.9951060359
61
  },
62
  "Case": {
63
- "p": 0.9968102073,
64
- "r": 0.9873617694,
65
- "f": 0.9920634921
66
  },
67
  "Person": {
68
- "p": 0.9910714286,
69
- "r": 0.9857904085,
70
- "f": 0.9884238646
71
  },
72
  "PronType": {
73
- "p": 0.9958779885,
74
- "r": 0.9934210526,
75
- "f": 0.9946480033
76
  },
77
  "NumType": {
78
- "p": 0.9863945578,
79
  "r": 0.9602649007,
80
- "f": 0.9731543624
81
  },
82
  "Degree": {
83
- "p": 0.9792176039,
84
- "r": 0.965060241,
85
- "f": 0.9720873786
86
  },
87
  "Reflex": {
88
  "p": 1.0,
@@ -95,24 +90,24 @@
95
  "f": 0.0
96
  },
97
  "Number[psor]": {
98
- "p": 0.9772727273,
99
- "r": 1.0,
100
- "f": 0.9885057471
101
  },
102
  "Poss": {
103
  "p": 1.0,
104
- "r": 1.0,
105
- "f": 1.0
106
  },
107
  "Foreign": {
108
- "p": 1.0,
109
- "r": 0.5,
110
- "f": 0.6666666667
111
  },
112
  "Abbr": {
113
- "p": 0.6666666667,
114
  "r": 0.4,
115
- "f": 0.5
116
  },
117
  "Style": {
118
  "p": 1.0,
@@ -120,136 +115,146 @@
120
  "f": 1.0
121
  }
122
  },
 
 
 
 
 
123
  "dep_las_per_type": {
124
  "advmod": {
125
- "p": 0.8047752809,
126
- "r": 0.8093220339,
127
- "f": 0.8070422535
128
  },
129
  "root": {
130
- "p": 0.8641114983,
131
- "r": 0.8794326241,
132
- "f": 0.8717047452
133
  },
134
  "nsubj": {
135
- "p": 0.9172852598,
136
- "r": 0.9124472574,
137
- "f": 0.9148598625
138
  },
139
  "case": {
140
- "p": 0.9202392822,
141
- "r": 0.912055336,
142
- "f": 0.9161290323
143
  },
144
  "obl": {
145
- "p": 0.7947122862,
146
- "r": 0.7947122862,
147
- "f": 0.7947122862
148
  },
149
  "cc": {
150
- "p": 0.863372093,
151
- "r": 0.863372093,
152
- "f": 0.863372093
153
  },
154
  "conj": {
155
- "p": 0.7780748663,
156
- "r": 0.776,
157
- "f": 0.7770360481
158
  },
159
  "obj": {
160
- "p": 0.8913857678,
161
- "r": 0.9242718447,
162
- "f": 0.9075309819
163
  },
164
  "aux": {
165
- "p": 0.9058823529,
166
- "r": 0.8979591837,
167
- "f": 0.9019033675
168
  },
169
  "acl:relcl": {
170
- "p": 0.7298850575,
171
- "r": 0.6864864865,
172
- "f": 0.7075208914
173
  },
174
- "obl:loc": {
175
- "p": 0.7777777778,
176
- "r": 0.8,
177
- "f": 0.7887323944
178
  },
179
  "det": {
180
- "p": 0.9299674267,
181
- "r": 0.9406919275,
182
- "f": 0.9352989353
183
  },
184
  "amod": {
185
- "p": 0.8752107926,
186
- "r": 0.885665529,
187
- "f": 0.8804071247
188
  },
189
  "nmod:poss": {
190
- "p": 0.7058823529,
191
- "r": 0.7128712871,
192
- "f": 0.7093596059
193
  },
194
  "ccomp": {
195
- "p": 0.7833333333,
196
- "r": 0.7580645161,
197
- "f": 0.7704918033
198
  },
199
  "nummod": {
200
- "p": 0.8083333333,
201
- "r": 0.8083333333,
202
- "f": 0.8083333333
203
  },
204
  "flat": {
205
- "p": 0.8527607362,
206
- "r": 0.9205298013,
207
- "f": 0.8853503185
208
  },
209
  "compound:prt": {
210
- "p": 0.696969697,
211
  "r": 0.5609756098,
212
- "f": 0.6216216216
213
  },
214
  "advcl": {
215
- "p": 0.7213114754,
216
- "r": 0.7586206897,
217
- "f": 0.7394957983
218
  },
219
  "mark": {
220
- "p": 0.9351464435,
221
- "r": 0.9178644764,
222
- "f": 0.9264248705
223
  },
224
  "cop": {
225
- "p": 0.8965517241,
226
- "r": 0.8914285714,
227
- "f": 0.893982808
228
  },
229
  "dep": {
230
- "p": 0.2755102041,
231
- "r": 0.5094339623,
232
- "f": 0.357615894
233
  },
234
  "nmod": {
235
- "p": 0.737804878,
236
- "r": 0.708984375,
237
- "f": 0.7231075697
238
  },
239
  "iobj": {
240
- "p": 0.9230769231,
241
- "r": 0.5454545455,
242
- "f": 0.6857142857
 
 
 
 
 
243
  },
244
  "xcomp": {
245
- "p": 0.7368421053,
246
- "r": 0.4745762712,
247
- "f": 0.5773195876
248
  },
249
  "list": {
250
- "p": 0.4666666667,
251
- "r": 0.3888888889,
252
- "f": 0.4242424242
253
  },
254
  "vocative": {
255
  "p": 0.0,
@@ -257,24 +262,24 @@
257
  "f": 0.0
258
  },
259
  "fixed": {
260
- "p": 0.9705882353,
261
- "r": 0.7857142857,
262
  "f": 0.8684210526
263
  },
264
  "expl": {
265
- "p": 0.9117647059,
266
  "r": 0.9117647059,
267
- "f": 0.9117647059
268
  },
269
  "appos": {
270
- "p": 0.6153846154,
271
- "r": 0.7272727273,
272
- "f": 0.6666666667
273
  },
274
  "obl:tmod": {
275
- "p": 0.9230769231,
276
- "r": 0.6666666667,
277
- "f": 0.7741935484
278
  },
279
  "discourse": {
280
  "p": 0.0,
@@ -282,26 +287,32 @@
282
  "f": 0.0
283
  }
284
  },
 
 
 
 
 
285
  "ents_per_type": {
286
  "PER": {
287
- "p": 0.8947368421,
288
- "r": 0.921686747,
289
- "f": 0.9080118694
290
  },
291
  "ORG": {
292
- "p": 0.7922077922,
293
- "r": 0.6777777778,
294
- "f": 0.7305389222
295
  },
296
  "MISC": {
297
- "p": 0.7154471545,
298
  "r": 0.7787610619,
299
- "f": 0.7457627119
300
  },
301
  "LOC": {
302
- "p": 0.8403361345,
303
- "r": 0.9009009009,
304
- "f": 0.8695652174
305
  }
306
- }
 
307
  }
 
1
  {
2
  "token_acc": 0.9994672349,
3
+ "token_p": 0.9977732598,
4
+ "token_r": 0.9974835463,
5
+ "token_f": 0.997628382,
6
+ "pos_acc": 0.9734127561,
7
+ "morph_acc": 0.9700227614,
8
+ "morph_micro_p": 0.9874419317,
9
+ "morph_micro_r": 0.9763767604,
10
+ "morph_micro_f": 0.9818781726,
 
 
 
 
 
11
  "morph_per_feat": {
12
  "Mood": {
13
+ "p": 0.9923298178,
14
+ "r": 0.9866539561,
15
+ "f": 0.9894837476
16
  },
17
  "Tense": {
18
+ "p": 0.9832953683,
19
+ "r": 0.9751506024,
20
+ "f": 0.9792060491
21
  },
22
  "VerbForm": {
23
+ "p": 0.9833127318,
24
+ "r": 0.9736842105,
25
+ "f": 0.9784747847
26
  },
27
  "Voice": {
28
+ "p": 0.9932381668,
29
+ "r": 0.9880418535,
30
+ "f": 0.990633196
31
  },
32
  "Definite": {
33
+ "p": 0.9899678973,
34
+ "r": 0.974713552,
35
+ "f": 0.9822815051
36
  },
37
  "Gender": {
38
+ "p": 0.9814690027,
39
+ "r": 0.9680957129,
40
+ "f": 0.9747364899
41
  },
42
  "Number": {
43
+ "p": 0.9889006342,
44
+ "r": 0.9760041732,
45
+ "f": 0.9824100814
46
  },
47
  "AdpType": {
48
+ "p": 0.9946428571,
49
+ "r": 0.9849690539,
50
+ "f": 0.989782319
51
  },
52
  "PartType": {
53
  "p": 1.0,
54
+ "r": 0.9967532468,
55
+ "f": 0.9983739837
56
  },
57
  "Case": {
58
+ "p": 0.9935794543,
59
+ "r": 0.9778830964,
60
+ "f": 0.9856687898
61
  },
62
  "Person": {
63
+ "p": 0.9857397504,
64
+ "r": 0.9822380107,
65
+ "f": 0.9839857651
66
  },
67
  "PronType": {
68
+ "p": 0.9950166113,
69
+ "r": 0.9851973684,
70
+ "f": 0.9900826446
71
  },
72
  "NumType": {
73
+ "p": 0.9731543624,
74
  "r": 0.9602649007,
75
+ "f": 0.9666666667
76
  },
77
  "Degree": {
78
+ "p": 0.9696233293,
79
+ "r": 0.9614457831,
80
+ "f": 0.9655172414
81
  },
82
  "Reflex": {
83
  "p": 1.0,
 
90
  "f": 0.0
91
  },
92
  "Number[psor]": {
93
+ "p": 0.9770114943,
94
+ "r": 0.988372093,
95
+ "f": 0.9826589595
96
  },
97
  "Poss": {
98
  "p": 1.0,
99
+ "r": 0.9886363636,
100
+ "f": 0.9942857143
101
  },
102
  "Foreign": {
103
+ "p": 0.875,
104
+ "r": 0.7,
105
+ "f": 0.7777777778
106
  },
107
  "Abbr": {
108
+ "p": 1.0,
109
  "r": 0.4,
110
+ "f": 0.5714285714
111
  },
112
  "Style": {
113
  "p": 1.0,
 
115
  "f": 1.0
116
  }
117
  },
118
+ "sents_p": 0.8057921635,
119
+ "sents_r": 0.8386524823,
120
+ "sents_f": 0.8218940052,
121
+ "dep_uas": 0.8639535533,
122
+ "dep_las": 0.8326913415,
123
  "dep_las_per_type": {
124
  "advmod": {
125
+ "p": 0.7794729542,
126
+ "r": 0.7937853107,
127
+ "f": 0.7865640308
128
  },
129
  "root": {
130
+ "p": 0.8398637138,
131
+ "r": 0.8741134752,
132
+ "f": 0.8566463944
133
  },
134
  "nsubj": {
135
+ "p": 0.9073482428,
136
+ "r": 0.8987341772,
137
+ "f": 0.9030206677
138
  },
139
  "case": {
140
+ "p": 0.9097291876,
141
+ "r": 0.8944773176,
142
+ "f": 0.9020387867
143
  },
144
  "obl": {
145
+ "p": 0.7601809955,
146
+ "r": 0.7826086957,
147
+ "f": 0.7712318286
148
  },
149
  "cc": {
150
+ "p": 0.8397626113,
151
+ "r": 0.8226744186,
152
+ "f": 0.8311306902
153
  },
154
  "conj": {
155
+ "p": 0.7506849315,
156
+ "r": 0.7306666667,
157
+ "f": 0.7405405405
158
  },
159
  "obj": {
160
+ "p": 0.868852459,
161
+ "r": 0.9262135922,
162
+ "f": 0.8966165414
163
  },
164
  "aux": {
165
+ "p": 0.8941176471,
166
+ "r": 0.8862973761,
167
+ "f": 0.8901903367
168
  },
169
  "acl:relcl": {
170
+ "p": 0.7719298246,
171
+ "r": 0.7135135135,
172
+ "f": 0.7415730337
173
  },
174
+ "advmod:lmod": {
175
+ "p": 0.7846153846,
176
+ "r": 0.7611940299,
177
+ "f": 0.7727272727
178
  },
179
  "det": {
180
+ "p": 0.9048387097,
181
+ "r": 0.9242174629,
182
+ "f": 0.9144254279
183
  },
184
  "amod": {
185
+ "p": 0.8640275387,
186
+ "r": 0.8566552901,
187
+ "f": 0.8603256213
188
  },
189
  "nmod:poss": {
190
+ "p": 0.7087378641,
191
+ "r": 0.7227722772,
192
+ "f": 0.7156862745
193
  },
194
  "ccomp": {
195
+ "p": 0.7419354839,
196
+ "r": 0.7419354839,
197
+ "f": 0.7419354839
198
  },
199
  "nummod": {
200
+ "p": 0.8196721311,
201
+ "r": 0.8333333333,
202
+ "f": 0.826446281
203
  },
204
  "flat": {
205
+ "p": 0.8291139241,
206
+ "r": 0.8675496689,
207
+ "f": 0.8478964401
208
  },
209
  "compound:prt": {
210
+ "p": 0.575,
211
  "r": 0.5609756098,
212
+ "f": 0.5679012346
213
  },
214
  "advcl": {
215
+ "p": 0.664,
216
+ "r": 0.7155172414,
217
+ "f": 0.6887966805
218
  },
219
  "mark": {
220
+ "p": 0.9201680672,
221
+ "r": 0.8993839836,
222
+ "f": 0.9096573209
223
  },
224
  "cop": {
225
+ "p": 0.8833333333,
226
+ "r": 0.9085714286,
227
+ "f": 0.8957746479
228
  },
229
  "dep": {
230
+ "p": 0.1844660194,
231
+ "r": 0.358490566,
232
+ "f": 0.2435897436
233
  },
234
  "nmod": {
235
+ "p": 0.7558386412,
236
+ "r": 0.6953125,
237
+ "f": 0.7243133266
238
  },
239
  "iobj": {
240
+ "p": 0.875,
241
+ "r": 0.6363636364,
242
+ "f": 0.7368421053
243
+ },
244
+ "obl:lmod": {
245
+ "p": 0.0,
246
+ "r": 0.0,
247
+ "f": 0.0
248
  },
249
  "xcomp": {
250
+ "p": 0.5714285714,
251
+ "r": 0.3389830508,
252
+ "f": 0.4255319149
253
  },
254
  "list": {
255
+ "p": 0.25,
256
+ "r": 0.1111111111,
257
+ "f": 0.1538461538
258
  },
259
  "vocative": {
260
  "p": 0.0,
 
262
  "f": 0.0
263
  },
264
  "fixed": {
265
+ "p": 0.9428571429,
266
+ "r": 0.8048780488,
267
  "f": 0.8684210526
268
  },
269
  "expl": {
270
+ "p": 0.96875,
271
  "r": 0.9117647059,
272
+ "f": 0.9393939394
273
  },
274
  "appos": {
275
+ "p": 0.6551724138,
276
+ "r": 0.5757575758,
277
+ "f": 0.6129032258
278
  },
279
  "obl:tmod": {
280
+ "p": 0.8181818182,
281
+ "r": 0.5,
282
+ "f": 0.6206896552
283
  },
284
  "discourse": {
285
  "p": 0.0,
 
287
  "f": 0.0
288
  }
289
  },
290
+ "tag_acc": 0.9734127561,
291
+ "lemma_acc": 0.8491041162,
292
+ "ents_p": 0.7784313725,
293
+ "ents_r": 0.8270833333,
294
+ "ents_f": 0.802020202,
295
  "ents_per_type": {
296
  "PER": {
297
+ "p": 0.8982035928,
298
+ "r": 0.9036144578,
299
+ "f": 0.9009009009
300
  },
301
  "ORG": {
302
+ "p": 0.7058823529,
303
+ "r": 0.6666666667,
304
+ "f": 0.6857142857
305
  },
306
  "MISC": {
307
+ "p": 0.6769230769,
308
  "r": 0.7787610619,
309
+ "f": 0.7242798354
310
  },
311
  "LOC": {
312
+ "p": 0.7734375,
313
+ "r": 0.8918918919,
314
+ "f": 0.8284518828
315
  }
316
+ },
317
+ "speed": 5370.7462201589
318
  }
attribute_ruler/patterns CHANGED
Binary files a/attribute_ruler/patterns and b/attribute_ruler/patterns differ
 
config.cfg CHANGED
@@ -1,10 +1,8 @@
1
  [paths]
2
- train = "corpus/da-core-news/train.spacy"
3
- dev = "corpus/da-core-news/dev.spacy"
4
  vectors = null
5
- raw = null
6
  init_tok2vec = null
7
- vocab_data = null
8
 
9
  [system]
10
  gpu_allocator = "pytorch"
@@ -24,6 +22,7 @@ tokenizer = {"@tokenizers":"spacy.Tokenizer.v1"}
24
 
25
  [components.attribute_ruler]
26
  factory = "attribute_ruler"
 
27
  validate = false
28
 
29
  [components.lemmatizer]
@@ -31,9 +30,13 @@ factory = "lemmatizer"
31
  mode = "lookup"
32
  model = null
33
  overwrite = false
 
34
 
35
  [components.morphologizer]
36
  factory = "morphologizer"
 
 
 
37
 
38
  [components.morphologizer.model]
39
  @architectures = "spacy.Tagger.v1"
@@ -49,6 +52,7 @@ pooling = {"@layers":"reduce_mean.v1"}
49
  factory = "ner"
50
  incorrect_spans_key = null
51
  moves = null
 
52
  update_with_oracle_cut_size = 100
53
 
54
  [components.ner.model]
@@ -71,6 +75,7 @@ factory = "parser"
71
  learn_tokens = false
72
  min_action_freq = 30
73
  moves = null
 
74
  update_with_oracle_cut_size = 100
75
 
76
  [components.parser.model]
@@ -94,37 +99,39 @@ max_batch_items = 4096
94
  set_extra_annotations = {"@annotation_setters":"spacy-transformers.null_annotation_setter.v1"}
95
 
96
  [components.transformer.model]
97
- @architectures = "spacy-transformers.TransformerModel.v1"
98
  name = "Maltehb/danish-bert-botxo"
 
99
 
100
  [components.transformer.model.get_spans]
101
  @span_getters = "spacy-transformers.strided_spans.v1"
102
  window = 128
103
  stride = 96
104
 
 
 
105
  [components.transformer.model.tokenizer_config]
106
  use_fast = true
107
 
 
 
108
  [corpora]
109
 
110
  [corpora.dev]
111
  @readers = "spacy.Corpus.v1"
112
- limit = 0
113
- max_length = 0
114
- path = ${paths:dev}
115
  gold_preproc = false
 
 
116
  augmenter = null
117
 
118
  [corpora.train]
119
  @readers = "spacy.Corpus.v1"
120
- path = ${paths:train}
121
- max_length = 500
122
  gold_preproc = false
 
123
  limit = 0
124
-
125
- [corpora.train.augmenter]
126
- @augmenters = "spacy.lower_case.v1"
127
- level = 0.1
128
 
129
  [training]
130
  train_corpus = "corpora.train"
@@ -183,11 +190,12 @@ ents_f = 0.16
183
  ents_p = 0.0
184
  ents_r = 0.0
185
  ents_per_type = null
 
186
 
187
  [pretraining]
188
 
189
  [initialize]
190
- vocab_data = ${paths.vocab_data}
191
  vectors = ${paths.vectors}
192
  init_tok2vec = ${paths.init_tok2vec}
193
  before_init = null
 
1
  [paths]
2
+ train = null
3
+ dev = null
4
  vectors = null
 
5
  init_tok2vec = null
 
6
 
7
  [system]
8
  gpu_allocator = "pytorch"
 
22
 
23
  [components.attribute_ruler]
24
  factory = "attribute_ruler"
25
+ scorer = {"@scorers":"spacy.attribute_ruler_scorer.v1"}
26
  validate = false
27
 
28
  [components.lemmatizer]
 
30
  mode = "lookup"
31
  model = null
32
  overwrite = false
33
+ scorer = {"@scorers":"spacy.lemmatizer_scorer.v1"}
34
 
35
  [components.morphologizer]
36
  factory = "morphologizer"
37
+ extend = false
38
+ overwrite = true
39
+ scorer = {"@scorers":"spacy.morphologizer_scorer.v1"}
40
 
41
  [components.morphologizer.model]
42
  @architectures = "spacy.Tagger.v1"
 
52
  factory = "ner"
53
  incorrect_spans_key = null
54
  moves = null
55
+ scorer = {"@scorers":"spacy.ner_scorer.v1"}
56
  update_with_oracle_cut_size = 100
57
 
58
  [components.ner.model]
 
75
  learn_tokens = false
76
  min_action_freq = 30
77
  moves = null
78
+ scorer = {"@scorers":"spacy.parser_scorer.v1"}
79
  update_with_oracle_cut_size = 100
80
 
81
  [components.parser.model]
 
99
  set_extra_annotations = {"@annotation_setters":"spacy-transformers.null_annotation_setter.v1"}
100
 
101
  [components.transformer.model]
102
+ @architectures = "spacy-transformers.TransformerModel.v3"
103
  name = "Maltehb/danish-bert-botxo"
104
+ mixed_precision = false
105
 
106
  [components.transformer.model.get_spans]
107
  @span_getters = "spacy-transformers.strided_spans.v1"
108
  window = 128
109
  stride = 96
110
 
111
+ [components.transformer.model.grad_scaler_config]
112
+
113
  [components.transformer.model.tokenizer_config]
114
  use_fast = true
115
 
116
+ [components.transformer.model.transformer_config]
117
+
118
  [corpora]
119
 
120
  [corpora.dev]
121
  @readers = "spacy.Corpus.v1"
122
+ path = ${paths.dev}
 
 
123
  gold_preproc = false
124
+ max_length = 0
125
+ limit = 0
126
  augmenter = null
127
 
128
  [corpora.train]
129
  @readers = "spacy.Corpus.v1"
130
+ path = ${paths.train}
 
131
  gold_preproc = false
132
+ max_length = 0
133
  limit = 0
134
+ augmenter = null
 
 
 
135
 
136
  [training]
137
  train_corpus = "corpora.train"
 
190
  ents_p = 0.0
191
  ents_r = 0.0
192
  ents_per_type = null
193
+ speed = 0.0
194
 
195
  [pretraining]
196
 
197
  [initialize]
198
+ vocab_data = null
199
  vectors = ${paths.vectors}
200
  init_tok2vec = ${paths.init_tok2vec}
201
  before_init = null
da_core_news_trf-any-py3-none-any.whl CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8a91a8bc40c3bb22170006bbec93c2aabcc6aa93c14da5282fb97c7a7412385c
3
- size 418030504
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1ac3b39be8adc4426c9d1083986d08767b564b40dec4fcfd3c6f2547083a8efa
3
+ size 418019836
meta.json CHANGED
@@ -1,14 +1,14 @@
1
  {
2
  "lang":"da",
3
  "name":"core_news_trf",
4
- "version":"3.1.0",
5
  "description":"Danish transformer pipeline (Maltehb/danish-bert-botxo). Components: transformer, morphologizer, parser, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"CC BY-SA 4.0",
10
- "spacy_version":">=3.1.0,<3.2.0",
11
- "spacy_git_version":"caba63b74",
12
  "vectors":{
13
  "width":0,
14
  "vectors":0,
@@ -183,6 +183,7 @@
183
  "acl:relcl",
184
  "advcl",
185
  "advmod",
 
186
  "amod",
187
  "appos",
188
  "aux",
@@ -206,7 +207,7 @@
206
  "nummod",
207
  "obj",
208
  "obl",
209
- "obl:loc",
210
  "obl:tmod",
211
  "punct",
212
  "xcomp"
@@ -245,89 +246,84 @@
245
  ],
246
  "performance":{
247
  "token_acc":0.9994672349,
248
- "tag_acc":0.9797559086,
249
- "pos_acc":0.9797559086,
250
- "morph_acc":0.9774312282,
251
- "lemma_acc":0.8491041162,
252
- "dep_uas":0.8764860189,
253
- "dep_las":0.8503822758,
254
- "ents_p":0.8204081633,
255
- "ents_r":0.8375,
256
- "ents_f":0.8288659794,
257
- "sents_p":0.8653846154,
258
- "sents_r":0.8776595745,
259
- "sents_f":0.8714788732,
260
- "speed":1719.0691313848,
261
  "morph_per_feat":{
262
  "Mood":{
263
- "p":0.9933142311,
264
- "r":0.9914204004,
265
- "f":0.9923664122
266
  },
267
  "Tense":{
268
- "p":0.983495874,
269
- "r":0.9871987952,
270
- "f":0.9853438557
271
  },
272
  "VerbForm":{
273
- "p":0.9834862385,
274
- "r":0.9840881273,
275
- "f":0.9837870909
276
  },
277
  "Voice":{
278
- "p":0.994011976,
279
- "r":0.9925261584,
280
- "f":0.9932685116
281
  },
282
  "Definite":{
283
- "p":0.9876592357,
284
- "r":0.9802449625,
285
- "f":0.9839381321
286
  },
287
  "Gender":{
288
- "p":0.983277592,
289
- "r":0.9770687936,
290
- "f":0.9801633606
291
  },
292
  "Number":{
293
- "p":0.9884484117,
294
- "r":0.9820031299,
295
- "f":0.9852152296
296
  },
297
  "AdpType":{
298
- "p":1.0,
299
- "r":0.9955791335,
300
- "f":0.9977846699
301
  },
302
  "PartType":{
303
  "p":1.0,
304
- "r":0.9902597403,
305
- "f":0.9951060359
306
  },
307
  "Case":{
308
- "p":0.9968102073,
309
- "r":0.9873617694,
310
- "f":0.9920634921
311
  },
312
  "Person":{
313
- "p":0.9910714286,
314
- "r":0.9857904085,
315
- "f":0.9884238646
316
  },
317
  "PronType":{
318
- "p":0.9958779885,
319
- "r":0.9934210526,
320
- "f":0.9946480033
321
  },
322
  "NumType":{
323
- "p":0.9863945578,
324
  "r":0.9602649007,
325
- "f":0.9731543624
326
  },
327
  "Degree":{
328
- "p":0.9792176039,
329
- "r":0.965060241,
330
- "f":0.9720873786
331
  },
332
  "Reflex":{
333
  "p":1.0,
@@ -340,24 +336,24 @@
340
  "f":0.0
341
  },
342
  "Number[psor]":{
343
- "p":0.9772727273,
344
- "r":1.0,
345
- "f":0.9885057471
346
  },
347
  "Poss":{
348
  "p":1.0,
349
- "r":1.0,
350
- "f":1.0
351
  },
352
  "Foreign":{
353
- "p":1.0,
354
- "r":0.5,
355
- "f":0.6666666667
356
  },
357
  "Abbr":{
358
- "p":0.6666666667,
359
  "r":0.4,
360
- "f":0.5
361
  },
362
  "Style":{
363
  "p":1.0,
@@ -365,136 +361,146 @@
365
  "f":1.0
366
  }
367
  },
 
 
 
 
 
368
  "dep_las_per_type":{
369
  "advmod":{
370
- "p":0.8047752809,
371
- "r":0.8093220339,
372
- "f":0.8070422535
373
  },
374
  "root":{
375
- "p":0.8641114983,
376
- "r":0.8794326241,
377
- "f":0.8717047452
378
  },
379
  "nsubj":{
380
- "p":0.9172852598,
381
- "r":0.9124472574,
382
- "f":0.9148598625
383
  },
384
  "case":{
385
- "p":0.9202392822,
386
- "r":0.912055336,
387
- "f":0.9161290323
388
  },
389
  "obl":{
390
- "p":0.7947122862,
391
- "r":0.7947122862,
392
- "f":0.7947122862
393
  },
394
  "cc":{
395
- "p":0.863372093,
396
- "r":0.863372093,
397
- "f":0.863372093
398
  },
399
  "conj":{
400
- "p":0.7780748663,
401
- "r":0.776,
402
- "f":0.7770360481
403
  },
404
  "obj":{
405
- "p":0.8913857678,
406
- "r":0.9242718447,
407
- "f":0.9075309819
408
  },
409
  "aux":{
410
- "p":0.9058823529,
411
- "r":0.8979591837,
412
- "f":0.9019033675
413
  },
414
  "acl:relcl":{
415
- "p":0.7298850575,
416
- "r":0.6864864865,
417
- "f":0.7075208914
418
  },
419
- "obl:loc":{
420
- "p":0.7777777778,
421
- "r":0.8,
422
- "f":0.7887323944
423
  },
424
  "det":{
425
- "p":0.9299674267,
426
- "r":0.9406919275,
427
- "f":0.9352989353
428
  },
429
  "amod":{
430
- "p":0.8752107926,
431
- "r":0.885665529,
432
- "f":0.8804071247
433
  },
434
  "nmod:poss":{
435
- "p":0.7058823529,
436
- "r":0.7128712871,
437
- "f":0.7093596059
438
  },
439
  "ccomp":{
440
- "p":0.7833333333,
441
- "r":0.7580645161,
442
- "f":0.7704918033
443
  },
444
  "nummod":{
445
- "p":0.8083333333,
446
- "r":0.8083333333,
447
- "f":0.8083333333
448
  },
449
  "flat":{
450
- "p":0.8527607362,
451
- "r":0.9205298013,
452
- "f":0.8853503185
453
  },
454
  "compound:prt":{
455
- "p":0.696969697,
456
  "r":0.5609756098,
457
- "f":0.6216216216
458
  },
459
  "advcl":{
460
- "p":0.7213114754,
461
- "r":0.7586206897,
462
- "f":0.7394957983
463
  },
464
  "mark":{
465
- "p":0.9351464435,
466
- "r":0.9178644764,
467
- "f":0.9264248705
468
  },
469
  "cop":{
470
- "p":0.8965517241,
471
- "r":0.8914285714,
472
- "f":0.893982808
473
  },
474
  "dep":{
475
- "p":0.2755102041,
476
- "r":0.5094339623,
477
- "f":0.357615894
478
  },
479
  "nmod":{
480
- "p":0.737804878,
481
- "r":0.708984375,
482
- "f":0.7231075697
483
  },
484
  "iobj":{
485
- "p":0.9230769231,
486
- "r":0.5454545455,
487
- "f":0.6857142857
 
 
 
 
 
488
  },
489
  "xcomp":{
490
- "p":0.7368421053,
491
- "r":0.4745762712,
492
- "f":0.5773195876
493
  },
494
  "list":{
495
- "p":0.4666666667,
496
- "r":0.3888888889,
497
- "f":0.4242424242
498
  },
499
  "vocative":{
500
  "p":0.0,
@@ -502,24 +508,24 @@
502
  "f":0.0
503
  },
504
  "fixed":{
505
- "p":0.9705882353,
506
- "r":0.7857142857,
507
  "f":0.8684210526
508
  },
509
  "expl":{
510
- "p":0.9117647059,
511
  "r":0.9117647059,
512
- "f":0.9117647059
513
  },
514
  "appos":{
515
- "p":0.6153846154,
516
- "r":0.7272727273,
517
- "f":0.6666666667
518
  },
519
  "obl:tmod":{
520
- "p":0.9230769231,
521
- "r":0.6666666667,
522
- "f":0.7741935484
523
  },
524
  "discourse":{
525
  "p":0.0,
@@ -527,32 +533,38 @@
527
  "f":0.0
528
  }
529
  },
 
 
 
 
 
530
  "ents_per_type":{
531
  "PER":{
532
- "p":0.8947368421,
533
- "r":0.921686747,
534
- "f":0.9080118694
535
  },
536
  "ORG":{
537
- "p":0.7922077922,
538
- "r":0.6777777778,
539
- "f":0.7305389222
540
  },
541
  "MISC":{
542
- "p":0.7154471545,
543
  "r":0.7787610619,
544
- "f":0.7457627119
545
  },
546
  "LOC":{
547
- "p":0.8403361345,
548
- "r":0.9009009009,
549
- "f":0.8695652174
550
  }
551
- }
 
552
  },
553
  "sources":[
554
  {
555
- "name":"UD Danish DDT v2.5",
556
  "url":"https://github.com/UniversalDependencies/UD_Danish-DDT",
557
  "license":"CC BY-SA 4.0",
558
  "author":"Johannsen, Anders; Mart\u00ednez Alonso, H\u00e9ctor; Plank, Barbara"
@@ -564,10 +576,10 @@
564
  "author":"Rasmus Hvingelby, Amalie B. Pauli, Maria Barrett, Christina Rosted, Lasse M. Lidegaard, Anders S\u00f8gaard"
565
  },
566
  {
567
- "name":"Lemmatization Lists",
568
- "url":"https://github.com/michmech/lemmatization-lists/",
569
- "license":"ODbL",
570
- "author":"Michal M\u011bchura"
571
  },
572
  {
573
  "name":"Maltehb/danish-bert-botxo",
@@ -577,6 +589,6 @@
577
  }
578
  ],
579
  "requirements":[
580
- "spacy-transformers>=1.0.3,<1.1.0"
581
  ]
582
  }
 
1
  {
2
  "lang":"da",
3
  "name":"core_news_trf",
4
+ "version":"3.2.0",
5
  "description":"Danish transformer pipeline (Maltehb/danish-bert-botxo). Components: transformer, morphologizer, parser, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"CC BY-SA 4.0",
10
+ "spacy_version":">=3.2.0,<3.3.0",
11
+ "spacy_git_version":"bb26550e2",
12
  "vectors":{
13
  "width":0,
14
  "vectors":0,
 
183
  "acl:relcl",
184
  "advcl",
185
  "advmod",
186
+ "advmod:lmod",
187
  "amod",
188
  "appos",
189
  "aux",
 
207
  "nummod",
208
  "obj",
209
  "obl",
210
+ "obl:lmod",
211
  "obl:tmod",
212
  "punct",
213
  "xcomp"
 
246
  ],
247
  "performance":{
248
  "token_acc":0.9994672349,
249
+ "token_p":0.9977732598,
250
+ "token_r":0.9974835463,
251
+ "token_f":0.997628382,
252
+ "pos_acc":0.9734127561,
253
+ "morph_acc":0.9700227614,
254
+ "morph_micro_p":0.9874419317,
255
+ "morph_micro_r":0.9763767604,
256
+ "morph_micro_f":0.9818781726,
 
 
 
 
 
257
  "morph_per_feat":{
258
  "Mood":{
259
+ "p":0.9923298178,
260
+ "r":0.9866539561,
261
+ "f":0.9894837476
262
  },
263
  "Tense":{
264
+ "p":0.9832953683,
265
+ "r":0.9751506024,
266
+ "f":0.9792060491
267
  },
268
  "VerbForm":{
269
+ "p":0.9833127318,
270
+ "r":0.9736842105,
271
+ "f":0.9784747847
272
  },
273
  "Voice":{
274
+ "p":0.9932381668,
275
+ "r":0.9880418535,
276
+ "f":0.990633196
277
  },
278
  "Definite":{
279
+ "p":0.9899678973,
280
+ "r":0.974713552,
281
+ "f":0.9822815051
282
  },
283
  "Gender":{
284
+ "p":0.9814690027,
285
+ "r":0.9680957129,
286
+ "f":0.9747364899
287
  },
288
  "Number":{
289
+ "p":0.9889006342,
290
+ "r":0.9760041732,
291
+ "f":0.9824100814
292
  },
293
  "AdpType":{
294
+ "p":0.9946428571,
295
+ "r":0.9849690539,
296
+ "f":0.989782319
297
  },
298
  "PartType":{
299
  "p":1.0,
300
+ "r":0.9967532468,
301
+ "f":0.9983739837
302
  },
303
  "Case":{
304
+ "p":0.9935794543,
305
+ "r":0.9778830964,
306
+ "f":0.9856687898
307
  },
308
  "Person":{
309
+ "p":0.9857397504,
310
+ "r":0.9822380107,
311
+ "f":0.9839857651
312
  },
313
  "PronType":{
314
+ "p":0.9950166113,
315
+ "r":0.9851973684,
316
+ "f":0.9900826446
317
  },
318
  "NumType":{
319
+ "p":0.9731543624,
320
  "r":0.9602649007,
321
+ "f":0.9666666667
322
  },
323
  "Degree":{
324
+ "p":0.9696233293,
325
+ "r":0.9614457831,
326
+ "f":0.9655172414
327
  },
328
  "Reflex":{
329
  "p":1.0,
 
336
  "f":0.0
337
  },
338
  "Number[psor]":{
339
+ "p":0.9770114943,
340
+ "r":0.988372093,
341
+ "f":0.9826589595
342
  },
343
  "Poss":{
344
  "p":1.0,
345
+ "r":0.9886363636,
346
+ "f":0.9942857143
347
  },
348
  "Foreign":{
349
+ "p":0.875,
350
+ "r":0.7,
351
+ "f":0.7777777778
352
  },
353
  "Abbr":{
354
+ "p":1.0,
355
  "r":0.4,
356
+ "f":0.5714285714
357
  },
358
  "Style":{
359
  "p":1.0,
 
361
  "f":1.0
362
  }
363
  },
364
+ "sents_p":0.8057921635,
365
+ "sents_r":0.8386524823,
366
+ "sents_f":0.8218940052,
367
+ "dep_uas":0.8639535533,
368
+ "dep_las":0.8326913415,
369
  "dep_las_per_type":{
370
  "advmod":{
371
+ "p":0.7794729542,
372
+ "r":0.7937853107,
373
+ "f":0.7865640308
374
  },
375
  "root":{
376
+ "p":0.8398637138,
377
+ "r":0.8741134752,
378
+ "f":0.8566463944
379
  },
380
  "nsubj":{
381
+ "p":0.9073482428,
382
+ "r":0.8987341772,
383
+ "f":0.9030206677
384
  },
385
  "case":{
386
+ "p":0.9097291876,
387
+ "r":0.8944773176,
388
+ "f":0.9020387867
389
  },
390
  "obl":{
391
+ "p":0.7601809955,
392
+ "r":0.7826086957,
393
+ "f":0.7712318286
394
  },
395
  "cc":{
396
+ "p":0.8397626113,
397
+ "r":0.8226744186,
398
+ "f":0.8311306902
399
  },
400
  "conj":{
401
+ "p":0.7506849315,
402
+ "r":0.7306666667,
403
+ "f":0.7405405405
404
  },
405
  "obj":{
406
+ "p":0.868852459,
407
+ "r":0.9262135922,
408
+ "f":0.8966165414
409
  },
410
  "aux":{
411
+ "p":0.8941176471,
412
+ "r":0.8862973761,
413
+ "f":0.8901903367
414
  },
415
  "acl:relcl":{
416
+ "p":0.7719298246,
417
+ "r":0.7135135135,
418
+ "f":0.7415730337
419
  },
420
+ "advmod:lmod":{
421
+ "p":0.7846153846,
422
+ "r":0.7611940299,
423
+ "f":0.7727272727
424
  },
425
  "det":{
426
+ "p":0.9048387097,
427
+ "r":0.9242174629,
428
+ "f":0.9144254279
429
  },
430
  "amod":{
431
+ "p":0.8640275387,
432
+ "r":0.8566552901,
433
+ "f":0.8603256213
434
  },
435
  "nmod:poss":{
436
+ "p":0.7087378641,
437
+ "r":0.7227722772,
438
+ "f":0.7156862745
439
  },
440
  "ccomp":{
441
+ "p":0.7419354839,
442
+ "r":0.7419354839,
443
+ "f":0.7419354839
444
  },
445
  "nummod":{
446
+ "p":0.8196721311,
447
+ "r":0.8333333333,
448
+ "f":0.826446281
449
  },
450
  "flat":{
451
+ "p":0.8291139241,
452
+ "r":0.8675496689,
453
+ "f":0.8478964401
454
  },
455
  "compound:prt":{
456
+ "p":0.575,
457
  "r":0.5609756098,
458
+ "f":0.5679012346
459
  },
460
  "advcl":{
461
+ "p":0.664,
462
+ "r":0.7155172414,
463
+ "f":0.6887966805
464
  },
465
  "mark":{
466
+ "p":0.9201680672,
467
+ "r":0.8993839836,
468
+ "f":0.9096573209
469
  },
470
  "cop":{
471
+ "p":0.8833333333,
472
+ "r":0.9085714286,
473
+ "f":0.8957746479
474
  },
475
  "dep":{
476
+ "p":0.1844660194,
477
+ "r":0.358490566,
478
+ "f":0.2435897436
479
  },
480
  "nmod":{
481
+ "p":0.7558386412,
482
+ "r":0.6953125,
483
+ "f":0.7243133266
484
  },
485
  "iobj":{
486
+ "p":0.875,
487
+ "r":0.6363636364,
488
+ "f":0.7368421053
489
+ },
490
+ "obl:lmod":{
491
+ "p":0.0,
492
+ "r":0.0,
493
+ "f":0.0
494
  },
495
  "xcomp":{
496
+ "p":0.5714285714,
497
+ "r":0.3389830508,
498
+ "f":0.4255319149
499
  },
500
  "list":{
501
+ "p":0.25,
502
+ "r":0.1111111111,
503
+ "f":0.1538461538
504
  },
505
  "vocative":{
506
  "p":0.0,
 
508
  "f":0.0
509
  },
510
  "fixed":{
511
+ "p":0.9428571429,
512
+ "r":0.8048780488,
513
  "f":0.8684210526
514
  },
515
  "expl":{
516
+ "p":0.96875,
517
  "r":0.9117647059,
518
+ "f":0.9393939394
519
  },
520
  "appos":{
521
+ "p":0.6551724138,
522
+ "r":0.5757575758,
523
+ "f":0.6129032258
524
  },
525
  "obl:tmod":{
526
+ "p":0.8181818182,
527
+ "r":0.5,
528
+ "f":0.6206896552
529
  },
530
  "discourse":{
531
  "p":0.0,
 
533
  "f":0.0
534
  }
535
  },
536
+ "tag_acc":0.9734127561,
537
+ "lemma_acc":0.8491041162,
538
+ "ents_p":0.7784313725,
539
+ "ents_r":0.8270833333,
540
+ "ents_f":0.802020202,
541
  "ents_per_type":{
542
  "PER":{
543
+ "p":0.8982035928,
544
+ "r":0.9036144578,
545
+ "f":0.9009009009
546
  },
547
  "ORG":{
548
+ "p":0.7058823529,
549
+ "r":0.6666666667,
550
+ "f":0.6857142857
551
  },
552
  "MISC":{
553
+ "p":0.6769230769,
554
  "r":0.7787610619,
555
+ "f":0.7242798354
556
  },
557
  "LOC":{
558
+ "p":0.7734375,
559
+ "r":0.8918918919,
560
+ "f":0.8284518828
561
  }
562
+ },
563
+ "speed":5370.7462201589
564
  },
565
  "sources":[
566
  {
567
+ "name":"UD Danish DDT v2.8",
568
  "url":"https://github.com/UniversalDependencies/UD_Danish-DDT",
569
  "license":"CC BY-SA 4.0",
570
  "author":"Johannsen, Anders; Mart\u00ednez Alonso, H\u00e9ctor; Plank, Barbara"
 
576
  "author":"Rasmus Hvingelby, Amalie B. Pauli, Maria Barrett, Christina Rosted, Lasse M. Lidegaard, Anders S\u00f8gaard"
577
  },
578
  {
579
+ "name":"Sprogteknologisk orddatabase over det danske sprog",
580
+ "url":"https://cst.ku.dk/sto_ordbase/",
581
+ "license":"CC BY-SA 4.0",
582
+ "author":"Center for Language Technology, University of Copenhagen"
583
  },
584
  {
585
  "name":"Maltehb/danish-bert-botxo",
 
589
  }
590
  ],
591
  "requirements":[
592
+ "spacy-transformers>=1.1.2,<1.2.0"
593
  ]
594
  }
morphologizer/cfg CHANGED
@@ -1,4 +1,5 @@
1
  {
 
2
  "labels_morph":{
3
  "AdpType=Prep|POS=ADP":"AdpType=Prep",
4
  "Definite=Ind|Gender=Com|Number=Sing|POS=NOUN":"Definite=Ind|Gender=Com|Number=Sing",
@@ -316,5 +317,6 @@
316
  "Number[psor]=Plur|POS=PRON|Person=3|Poss=Yes|PronType=Prs":95,
317
  "POS=DET|PronType=Dem":90,
318
  "Definite=Def|Number=Plur|POS=NOUN":92
319
- }
 
320
  }
 
1
  {
2
+ "extend":false,
3
  "labels_morph":{
4
  "AdpType=Prep|POS=ADP":"AdpType=Prep",
5
  "Definite=Ind|Gender=Com|Number=Sing|POS=NOUN":"Definite=Ind|Gender=Com|Number=Sing",
 
317
  "Number[psor]=Plur|POS=PRON|Person=3|Poss=Yes|PronType=Prs":95,
318
  "POS=DET|PronType=Dem":90,
319
  "Definite=Def|Number=Plur|POS=NOUN":92
320
+ },
321
+ "overwrite":true
322
  }
morphologizer/model CHANGED
Binary files a/morphologizer/model and b/morphologizer/model differ
 
ner/model CHANGED
Binary files a/ner/model and b/ner/model differ
 
parser/model CHANGED
Binary files a/parser/model and b/parser/model differ
 
parser/moves CHANGED
@@ -1 +1 @@
1
- ��moves�2{"0":{"":41514},"1":{"":34292},"2":{"case":7489,"nsubj":6009,"det":4334,"amod":3968,"advmod":3657,"mark":3529,"aux":2432,"cc":2261,"punct":2182,"cop":1329,"obl":894,"nummod":799,"nmod:poss":651,"nmod":460,"expl":291,"ccomp":202,"obj":195,"xcomp":122,"case||nmod":73,"obl:tmod":53,"dep":49,"acl:relcl":43},"3":{"punct":8600,"obl":3949,"obj":3758,"nmod":3565,"conj":2743,"advmod":2095,"flat":1294,"nsubj":1172,"acl:relcl":1131,"advcl":808,"amod":629,"obl:loc":467,"fixed":390,"dep":322,"xcomp":272,"appos":268,"compound:prt":261,"ccomp":252,"acl:relcl||nsubj":237,"case":202,"nummod":167,"list":161,"nmod:poss":156,"punct||conj":151,"mark":137,"cc":135,"iobj":107,"expl":77,"cop":69,"nmod||case":60,"aux":48,"obl:tmod":45,"cc||case":43,"advcl||advmod":43,"cc||conj":40,"case||obl":38,"punct||case":33},"4":{"ROOT":4367}}�cfg��neg_key�
 
1
+ ��moves�D{"0":{"":41514},"1":{"":34292},"2":{"case":7489,"nsubj":6009,"det":4334,"amod":3968,"advmod":3657,"mark":3529,"aux":2432,"cc":2261,"punct":2182,"cop":1329,"obl":894,"nummod":799,"nmod:poss":651,"nmod":460,"expl":291,"ccomp":202,"obj":195,"xcomp":122,"case||nmod":73,"obl:tmod":53,"dep":49,"acl:relcl":43},"3":{"punct":8600,"obl":3949,"obj":3758,"nmod":3565,"conj":2743,"advmod":2095,"flat":1294,"nsubj":1172,"acl:relcl":1131,"advcl":808,"amod":629,"advmod:lmod":423,"fixed":390,"dep":322,"xcomp":272,"appos":268,"compound:prt":261,"ccomp":252,"acl:relcl||nsubj":237,"case":202,"nummod":167,"list":161,"nmod:poss":156,"punct||conj":151,"mark":137,"cc":135,"iobj":107,"expl":77,"cop":69,"nmod||case":60,"aux":48,"obl:tmod":45,"obl:lmod":44,"cc||case":43,"advcl||advmod":43,"cc||conj":40,"case||obl":38,"punct||case":33},"4":{"ROOT":4367}}�cfg��neg_key�
tokenizer CHANGED
The diff for this file is too large to render. See raw diff
 
transformer/{model/pytorch_model.bin → model} RENAMED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2d68d0fb3b9a0b57bec9bf27371b1baf31ac650ee6ea57d46863b13fe943d4f5
3
- size 442547953
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b5eb70898ea5adf43b82eb7093006b869b200273a1a055518794b0508c828a6e
3
+ size 443301525
transformer/model/config.json DELETED
@@ -1,30 +0,0 @@
1
- {
2
- "_name_or_path": "/mnt/scratch/tmp/da_core_news_trf/dfeacfd6-97e0-4a91-870e-6d37f52ab596/training/core/model-best/transformer/model",
3
- "architectures": [
4
- "BertForPreTraining"
5
- ],
6
- "attention_probs_dropout_prob": 0.1,
7
- "directionality": "bidi",
8
- "gradient_checkpointing": false,
9
- "hidden_act": "gelu",
10
- "hidden_dropout_prob": 0.1,
11
- "hidden_size": 768,
12
- "initializer_range": 0.02,
13
- "intermediate_size": 3072,
14
- "layer_norm_eps": 1e-12,
15
- "max_position_embeddings": 512,
16
- "model_type": "bert",
17
- "num_attention_heads": 12,
18
- "num_hidden_layers": 12,
19
- "pad_token_id": 0,
20
- "pooler_fc_size": 768,
21
- "pooler_num_attention_heads": 12,
22
- "pooler_num_fc_layers": 3,
23
- "pooler_size_per_head": 128,
24
- "pooler_type": "first_token_transform",
25
- "position_embedding_type": "absolute",
26
- "transformers_version": "4.6.1",
27
- "type_vocab_size": 2,
28
- "use_cache": true,
29
- "vocab_size": 32000
30
- }
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
transformer/model/special_tokens_map.json DELETED
@@ -1 +0,0 @@
1
- {"unk_token": "[UNK]", "sep_token": "[SEP]", "pad_token": "[PAD]", "cls_token": "[CLS]", "mask_token": "[MASK]"}
 
 
transformer/model/tokenizer.json DELETED
The diff for this file is too large to render. See raw diff
 
transformer/model/tokenizer_config.json DELETED
@@ -1 +0,0 @@
1
- {"do_lower_case": true, "unk_token": "[UNK]", "sep_token": "[SEP]", "pad_token": "[PAD]", "cls_token": "[CLS]", "mask_token": "[MASK]", "tokenize_chinese_chars": true, "strip_accents": null, "special_tokens_map_file": null, "name_or_path": "/mnt/scratch/tmp/da_core_news_trf/dfeacfd6-97e0-4a91-870e-6d37f52ab596/training/core/model-best/transformer/model", "do_basic_tokenize": true, "never_split": null}
 
 
transformer/model/vocab.txt DELETED
The diff for this file is too large to render. See raw diff
 
vocab/strings.json CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b678d4b9a0b175ec347df6057e55bd49461db4816dc924fc458b49781701233a
3
- size 459590
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5e38be75a720d79baa44576b52d962219b789d7d640e7ca0e0891b4152fb37f6
3
+ size 459712
vocab/vectors.cfg ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ {
2
+ "mode":"default"
3
+ }