oroszgy commited on
Commit
86b0693
1 Parent(s): 0304c2a

Update spacy pipeline to 3.7.0

Browse files
README.md CHANGED
@@ -14,74 +14,74 @@ model-index:
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
- value: 0.8585339943
18
  - name: NER Recall
19
  type: recall
20
- value: 0.8524964838
21
  - name: NER F Score
22
  type: f_score
23
- value: 0.8555045872
24
  - task:
25
  name: TAG
26
  type: token-classification
27
  metrics:
28
  - name: TAG (XPOS) Accuracy
29
  type: accuracy
30
- value: 0.9695664657
31
  - task:
32
  name: POS
33
  type: token-classification
34
  metrics:
35
  - name: POS (UPOS) Accuracy
36
  type: accuracy
37
- value: 0.969328676
38
  - task:
39
  name: MORPH
40
  type: token-classification
41
  metrics:
42
  - name: Morph (UFeats) Accuracy
43
  type: accuracy
44
- value: 0.9461192459
45
  - task:
46
  name: LEMMA
47
  type: token-classification
48
  metrics:
49
  - name: Lemma Accuracy
50
  type: accuracy
51
- value: 0.974834944
52
  - task:
53
  name: UNLABELED_DEPENDENCIES
54
  type: token-classification
55
  metrics:
56
  - name: Unlabeled Attachment Score (UAS)
57
  type: f_score
58
- value: 0.8140300006
59
  - task:
60
  name: LABELED_DEPENDENCIES
61
  type: token-classification
62
  metrics:
63
  - name: Labeled Attachment Score (LAS)
64
  type: f_score
65
- value: 0.7415379468
66
  - task:
67
  name: SENTS
68
  type: token-classification
69
  metrics:
70
  - name: Sentences F-Score
71
  type: f_score
72
- value: 0.9755011136
73
  ---
74
  Core Hungarian model for HuSpaCy. Components: tok2vec, senter, tagger, morphologizer, lemmatizer, parser, ner
75
 
76
  | Feature | Description |
77
  | --- | --- |
78
  | **Name** | `hu_core_news_md` |
79
- | **Version** | `3.6.1` |
80
- | **spaCy** | `>=3.6.0,<3.7.0` |
81
  | **Default Pipeline** | `tok2vec`, `senter`, `tagger`, `morphologizer`, `lookup_lemmatizer`, `trainable_lemmatizer`, `parser`, `ner` |
82
  | **Components** | `tok2vec`, `senter`, `tagger`, `morphologizer`, `lookup_lemmatizer`, `trainable_lemmatizer`, `parser`, `ner` |
83
  | **Vectors** | -1 keys, 200000 unique vectors (100 dimensions) |
84
- | **Sources** | [UD Hungarian Szeged](https://universaldependencies.org/treebanks/hu_szeged/index.html) (Richárd Farkas, Katalin Simkó, Zsolt Szántó, Viktor Varga, Veronika Vincze (MTA-SZTE Research Group on Artificial Intelligence))<br />[NYTK-NerKor Corpus](https://github.com/nytud/NYTK-NerKor) (Eszter Simon, Noémi Vadász (Department of Language Technology and Applied Linguistics))<br />[Szeged NER Corpus](https://rgai.inf.u-szeged.hu/node/130) (György Szarvas, Richárd Farkas, László Felföldi, András Kocsor, János Csirik (MTA-SZTE Research Group on Artificial Intelligence))<br />[Hungarian lg Floret vectors](https://huggingface.co/huspacy/hu_vectors_web_lg) (Szeged AI) |
85
  | **License** | `cc-by-sa-4.0` |
86
  | **Author** | [SzegedAI, MILAB](https://github.com/huspacy/huspacy) |
87
 
@@ -108,18 +108,18 @@ Core Hungarian model for HuSpaCy. Components: tok2vec, senter, tagger, morpholog
108
  | `TOKEN_P` | 99.86 |
109
  | `TOKEN_R` | 99.93 |
110
  | `TOKEN_F` | 99.89 |
111
- | `SENTS_P` | 97.55 |
112
- | `SENTS_R` | 97.55 |
113
- | `SENTS_F` | 97.55 |
114
- | `TAG_ACC` | 96.96 |
115
- | `POS_ACC` | 96.93 |
116
- | `MORPH_ACC` | 94.61 |
117
- | `MORPH_MICRO_P` | 97.48 |
118
- | `MORPH_MICRO_R` | 96.79 |
119
- | `MORPH_MICRO_F` | 97.13 |
120
- | `LEMMA_ACC` | 97.48 |
121
- | `DEP_UAS` | 81.40 |
122
- | `DEP_LAS` | 74.15 |
123
- | `ENTS_P` | 85.85 |
124
- | `ENTS_R` | 85.25 |
125
- | `ENTS_F` | 85.55 |
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
+ value: 0.8459219858
18
  - name: NER Recall
19
  type: recall
20
+ value: 0.8387834037
21
  - name: NER F Score
22
  type: f_score
23
+ value: 0.8423375706
24
  - task:
25
  name: TAG
26
  type: token-classification
27
  metrics:
28
  - name: TAG (XPOS) Accuracy
29
  type: accuracy
30
+ value: 0.9694736842
31
  - task:
32
  name: POS
33
  type: token-classification
34
  metrics:
35
  - name: POS (UPOS) Accuracy
36
  type: accuracy
37
+ value: 0.9686124402
38
  - task:
39
  name: MORPH
40
  type: token-classification
41
  metrics:
42
  - name: Morph (UFeats) Accuracy
43
  type: accuracy
44
+ value: 0.9439180783
45
  - task:
46
  name: LEMMA
47
  type: token-classification
48
  metrics:
49
  - name: Lemma Accuracy
50
  type: accuracy
51
+ value: 0.9745478902
52
  - task:
53
  name: UNLABELED_DEPENDENCIES
54
  type: token-classification
55
  metrics:
56
  - name: Unlabeled Attachment Score (UAS)
57
  type: f_score
58
+ value: 0.8147198216
59
  - task:
60
  name: LABELED_DEPENDENCIES
61
  type: token-classification
62
  metrics:
63
  - name: Labeled Attachment Score (LAS)
64
  type: f_score
65
+ value: 0.743867083
66
  - task:
67
  name: SENTS
68
  type: token-classification
69
  metrics:
70
  - name: Sentences F-Score
71
  type: f_score
72
+ value: 0.9754464286
73
  ---
74
  Core Hungarian model for HuSpaCy. Components: tok2vec, senter, tagger, morphologizer, lemmatizer, parser, ner
75
 
76
  | Feature | Description |
77
  | --- | --- |
78
  | **Name** | `hu_core_news_md` |
79
+ | **Version** | `3.7.0` |
80
+ | **spaCy** | `>=3.7.0,<3.8.0` |
81
  | **Default Pipeline** | `tok2vec`, `senter`, `tagger`, `morphologizer`, `lookup_lemmatizer`, `trainable_lemmatizer`, `parser`, `ner` |
82
  | **Components** | `tok2vec`, `senter`, `tagger`, `morphologizer`, `lookup_lemmatizer`, `trainable_lemmatizer`, `parser`, `ner` |
83
  | **Vectors** | -1 keys, 200000 unique vectors (100 dimensions) |
84
+ | **Sources** | [UD Hungarian Szeged](https://universaldependencies.org/treebanks/hu_szeged/index.html) (Richárd Farkas, Katalin Simkó, Zsolt Szántó, Viktor Varga, Veronika Vincze (MTA-SZTE Research Group on Artificial Intelligence))<br>[NYTK-NerKor Corpus](https://github.com/nytud/NYTK-NerKor) (Eszter Simon, Noémi Vadász (Department of Language Technology and Applied Linguistics))<br>[Szeged NER Corpus](https://rgai.inf.u-szeged.hu/node/130) (György Szarvas, Richárd Farkas, László Felföldi, András Kocsor, János Csirik (MTA-SZTE Research Group on Artificial Intelligence))<br>[Hungarian lg Floret vectors](https://huggingface.co/huspacy/hu_vectors_web_lg) (Szeged AI) |
85
  | **License** | `cc-by-sa-4.0` |
86
  | **Author** | [SzegedAI, MILAB](https://github.com/huspacy/huspacy) |
87
 
108
  | `TOKEN_P` | 99.86 |
109
  | `TOKEN_R` | 99.93 |
110
  | `TOKEN_F` | 99.89 |
111
+ | `SENTS_P` | 97.76 |
112
+ | `SENTS_R` | 97.33 |
113
+ | `SENTS_F` | 97.54 |
114
+ | `TAG_ACC` | 96.95 |
115
+ | `POS_ACC` | 96.86 |
116
+ | `MORPH_ACC` | 94.39 |
117
+ | `MORPH_MICRO_P` | 97.64 |
118
+ | `MORPH_MICRO_R` | 96.75 |
119
+ | `MORPH_MICRO_F` | 97.19 |
120
+ | `LEMMA_ACC` | 97.45 |
121
+ | `DEP_UAS` | 81.47 |
122
+ | `DEP_LAS` | 74.39 |
123
+ | `ENTS_P` | 84.59 |
124
+ | `ENTS_R` | 83.88 |
125
+ | `ENTS_F` | 84.23 |
config.cfg CHANGED
@@ -1,8 +1,8 @@
1
  [paths]
2
- parser_model = "models/hu_core_news_md-parser-3.6.1/model-best"
3
- ner_model = "models/hu_core_news_md-ner-3.6.1/model-best"
4
- lemmatizer_lookups = "models/hu_core_news_md-lookup-lemmatizer-3.6.1"
5
- tagger_model = "models/hu_core_news_md-tagger-3.6.1/model-best"
6
  train = null
7
  dev = null
8
  vectors = null
@@ -21,6 +21,7 @@ before_creation = null
21
  after_creation = null
22
  after_pipeline_creation = null
23
  batch_size = 1000
 
24
 
25
  [components]
26
 
1
  [paths]
2
+ parser_model = "models/hu_core_news_md-parser-3.7.0/model-best"
3
+ ner_model = "models/hu_core_news_md-ner-3.7.0/model-best"
4
+ lemmatizer_lookups = "models/hu_core_news_md-lookup-lemmatizer-3.7.0"
5
+ tagger_model = "models/hu_core_news_md-tagger-3.7.0/model-best"
6
  train = null
7
  dev = null
8
  vectors = null
21
  after_creation = null
22
  after_pipeline_creation = null
23
  batch_size = 1000
24
+ vectors = {"@vectors":"spacy.Vectors.v1"}
25
 
26
  [components]
27
 
hu_core_news_md-any-py3-none-any.whl CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9c04cff940d2a9fc3125ab972ee8f444334a867fe4bc5e188d11851bc5aceaed
3
- size 127014450
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ad3d006acf6cbc45cfb45080d50b54116a881b0992d6dbd90e377ca9ae56fd0d
3
+ size 127001547
meta.json CHANGED
@@ -1,14 +1,14 @@
1
  {
2
  "lang":"hu",
3
  "name":"core_news_md",
4
- "version":"3.6.1",
5
  "description":"Core Hungarian model for HuSpaCy. Components: tok2vec, senter, tagger, morphologizer, lemmatizer, parser, ner",
6
  "author":"SzegedAI, MILAB",
7
  "email":"gyorgy@orosz.link",
8
  "url":"https://github.com/huspacy/huspacy",
9
  "license":"cc-by-sa-4.0",
10
- "spacy_version":">=3.6.0,<3.7.0",
11
- "spacy_git_version":"6fc153a26",
12
  "vectors":{
13
  "width":100,
14
  "vectors":200000,
@@ -1268,90 +1268,85 @@
1268
  "token_p":0.998565417,
1269
  "token_r":0.9993300153,
1270
  "token_f":0.9989475698,
1271
- "sents_p":0.9755011136,
1272
- "sents_r":0.9755011136,
1273
- "sents_f":0.9755011136,
1274
- "tag_acc":0.9695664657,
1275
- "pos_acc":0.969328676,
1276
- "morph_acc":0.9461192459,
1277
- "morph_micro_p":0.9747695504,
1278
- "morph_micro_r":0.9679415557,
1279
- "morph_micro_f":0.9713435539,
1280
  "morph_per_feat":{
1281
  "Definite":{
1282
- "p":0.9739655974,
1283
- "r":0.9776014932,
1284
- "f":0.9757801584
1285
  },
1286
  "PronType":{
1287
- "p":0.971869829,
1288
- "r":0.972406181,
1289
- "f":0.972137931
1290
  },
1291
  "Case":{
1292
- "p":0.9844497608,
1293
- "r":0.9756965027,
1294
- "f":0.9800535874
1295
  },
1296
  "Degree":{
1297
- "p":0.9401785714,
1298
- "r":0.8760399334,
1299
- "f":0.9069767442
1300
  },
1301
  "Number":{
1302
- "p":0.989758227,
1303
- "r":0.987933635,
1304
- "f":0.9888450893
1305
  },
1306
  "Mood":{
1307
- "p":0.9386637459,
1308
- "r":0.9501108647,
1309
- "f":0.9443526171
1310
  },
1311
  "Person":{
1312
- "p":0.9571890145,
1313
- "r":0.9745065789,
1314
- "f":0.9657701711
1315
  },
1316
  "Tense":{
1317
- "p":0.9704271632,
1318
- "r":0.9790055249,
1319
- "f":0.9746974697
1320
  },
1321
  "VerbForm":{
1322
- "p":0.9651656754,
1323
- "r":0.9109863673,
1324
- "f":0.9372937294
1325
  },
1326
  "Voice":{
1327
- "p":0.9625884732,
1328
- "r":0.9734151329,
1329
- "f":0.9679715302
1330
  },
1331
  "Number[psor]":{
1332
- "p":0.9884057971,
1333
- "r":0.9715099715,
1334
- "f":0.9798850575
1335
  },
1336
  "Person[psor]":{
1337
- "p":0.9869565217,
1338
- "r":0.9714693295,
1339
- "f":0.9791516894
1340
  },
1341
  "NumType":{
1342
- "p":0.9427207637,
1343
- "r":0.9634146341,
1344
- "f":0.9529553679
1345
- },
1346
- "Poss":{
1347
- "p":0.6,
1348
- "r":1.0,
1349
- "f":0.75
1350
  },
1351
  "Reflex":{
1352
  "p":1.0,
1353
- "r":0.375,
1354
- "f":0.5454545455
1355
  },
1356
  "Reflexive":{
1357
  "p":0.0,
@@ -1368,120 +1363,125 @@
1368
  "r":0.0,
1369
  "f":0.0
1370
  },
 
 
 
 
 
1371
  "Number[psed]":{
1372
  "p":1.0,
1373
  "r":0.1111111111,
1374
  "f":0.2
1375
  }
1376
  },
1377
- "lemma_acc":0.974834944,
1378
- "dep_uas":0.8140300006,
1379
- "dep_las":0.7415379468,
1380
  "dep_las_per_type":{
1381
  "det":{
1382
- "p":0.86015625,
1383
- "r":0.8765923567,
1384
- "f":0.86829653
1385
  },
1386
  "amod:att":{
1387
- "p":0.8429617575,
1388
- "r":0.8470973017,
1389
- "f":0.8450244698
1390
  },
1391
  "nsubj":{
1392
- "p":0.7375,
1393
- "r":0.7375,
1394
- "f":0.7375
1395
  },
1396
  "advmod:mode":{
1397
- "p":0.5593220339,
1398
- "r":0.5661764706,
1399
- "f":0.56272838
1400
  },
1401
  "nmod:att":{
1402
- "p":0.7669421488,
1403
- "r":0.786440678,
1404
- "f":0.7765690377
1405
  },
1406
  "obl":{
1407
- "p":0.7752707581,
1408
- "r":0.7731773177,
1409
- "f":0.7742226228
1410
  },
1411
  "obj":{
1412
- "p":0.8482142857,
1413
- "r":0.8539325843,
1414
- "f":0.8510638298
1415
  },
1416
  "root":{
1417
- "p":0.8195991091,
1418
- "r":0.8195991091,
1419
- "f":0.8195991091
1420
  },
1421
  "cc":{
1422
- "p":0.6929637527,
1423
- "r":0.6842105263,
1424
- "f":0.688559322
1425
  },
1426
  "conj":{
1427
- "p":0.4356261023,
1428
- "r":0.5145833333,
1429
- "f":0.4718242598
1430
  },
1431
  "advmod":{
1432
- "p":0.806122449,
1433
- "r":0.8315789474,
1434
- "f":0.8186528497
1435
  },
1436
  "flat:name":{
1437
- "p":0.8444444444,
1438
- "r":0.8878504673,
1439
- "f":0.8656036446
1440
  },
1441
  "appos":{
1442
- "p":0.3950617284,
1443
- "r":0.3404255319,
1444
- "f":0.3657142857
1445
  },
1446
  "advcl":{
1447
- "p":0.2592592593,
1448
- "r":0.2142857143,
1449
- "f":0.2346368715
1450
  },
1451
  "advmod:tlocy":{
1452
- "p":0.6138211382,
1453
  "r":0.6565217391,
1454
- "f":0.6344537815
1455
  },
1456
  "ccomp:obj":{
1457
- "p":0.2040816327,
1458
- "r":0.303030303,
1459
- "f":0.243902439
1460
  },
1461
  "mark":{
1462
- "p":0.821656051,
1463
- "r":0.8164556962,
1464
- "f":0.819047619
1465
  },
1466
  "compound:preverb":{
1467
- "p":0.8879310345,
1468
- "r":0.9449541284,
1469
- "f":0.9155555556
1470
  },
1471
  "advmod:locy":{
1472
- "p":0.7894736842,
1473
- "r":0.46875,
1474
- "f":0.5882352941
1475
  },
1476
  "cop":{
1477
- "p":0.8214285714,
1478
- "r":0.5609756098,
1479
- "f":0.6666666667
1480
  },
1481
  "nmod:obl":{
1482
- "p":0.380952381,
1483
- "r":0.2,
1484
- "f":0.262295082
1485
  },
1486
  "advmod:to":{
1487
  "p":0.0,
@@ -1494,86 +1494,96 @@
1494
  "f":0.0
1495
  },
1496
  "ccomp:obl":{
1497
- "p":0.3461538462,
1498
- "r":0.28125,
1499
- "f":0.3103448276
1500
  },
1501
  "iobj":{
1502
- "p":0.2352941176,
1503
  "r":0.2666666667,
1504
- "f":0.25
1505
- },
1506
- "parataxis":{
1507
- "p":0.12,
1508
- "r":0.0410958904,
1509
- "f":0.0612244898
1510
- },
1511
- "dep":{
1512
- "p":0.0,
1513
- "r":0.0,
1514
- "f":0.0
1515
  },
1516
  "case":{
1517
- "p":0.9492385787,
1518
- "r":0.9540816327,
1519
- "f":0.951653944
1520
  },
1521
  "csubj":{
1522
- "p":0.4137931034,
1523
- "r":0.3243243243,
1524
- "f":0.3636363636
 
 
 
 
 
1525
  },
1526
  "xcomp":{
1527
- "p":0.8513513514,
1528
- "r":0.8513513514,
1529
- "f":0.8513513514
1530
  },
1531
  "nummod":{
1532
- "p":0.5145631068,
1533
- "r":0.5698924731,
1534
- "f":0.5408163265
1535
  },
1536
  "acl":{
1537
- "p":0.2931034483,
1538
- "r":0.2361111111,
1539
- "f":0.2615384615
1540
  },
1541
  "advmod:tto":{
1542
- "p":0.6666666667,
1543
  "r":0.2,
1544
- "f":0.3076923077
1545
  },
1546
  "nmod":{
1547
- "p":0.375,
1548
- "r":0.2727272727,
1549
- "f":0.3157894737
 
 
 
 
 
 
 
 
 
 
1550
  },
1551
  "aux":{
1552
- "p":0.9090909091,
1553
- "r":0.8333333333,
1554
- "f":0.8695652174
1555
  },
1556
  "advmod:tfrom":{
1557
  "p":0.0,
1558
  "r":0.0,
1559
  "f":0.0
1560
  },
 
 
 
 
 
1561
  "goeswith":{
1562
  "p":0.0,
1563
  "r":0.0,
1564
  "f":0.0
1565
  },
1566
  "compound":{
1567
- "p":0.95,
1568
- "r":0.95,
1569
- "f":0.95
1570
  },
1571
- "orphan":{
1572
  "p":0.0,
1573
  "r":0.0,
1574
  "f":0.0
1575
  },
1576
- "obl:lvc":{
1577
  "p":0.0,
1578
  "r":0.0,
1579
  "f":0.0
@@ -1583,53 +1593,43 @@
1583
  "r":0.0,
1584
  "f":0.0
1585
  },
1586
- "list":{
1587
- "p":0.0909090909,
1588
- "r":0.1666666667,
1589
- "f":0.1176470588
1590
- },
1591
- "ccomp":{
1592
- "p":0.1428571429,
1593
- "r":0.0769230769,
1594
- "f":0.1
1595
  },
1596
  "ccomp:pred":{
1597
  "p":0.0,
1598
  "r":0.0,
1599
  "f":0.0
1600
- },
1601
- "advmod:que":{
1602
- "p":1.0,
1603
- "r":0.5,
1604
- "f":0.6666666667
1605
  }
1606
  },
1607
- "ents_p":0.8585339943,
1608
- "ents_r":0.8524964838,
1609
- "ents_f":0.8555045872,
1610
  "ents_per_type":{
1611
  "ORG":{
1612
- "p":0.8798206278,
1613
- "r":0.909596662,
1614
- "f":0.8944609072
1615
  },
1616
  "PER":{
1617
- "p":0.893373494,
1618
- "r":0.8859020311,
1619
- "f":0.8896220756
1620
  },
1621
  "LOC":{
1622
- "p":0.8763537906,
1623
  "r":0.8428819444,
1624
- "f":0.8592920354
1625
  },
1626
  "MISC":{
1627
- "p":0.6661538462,
1628
- "r":0.6141843972,
1629
- "f":0.6391143911
1630
  }
1631
  },
1632
- "speed":2610.2533974109
1633
  },
1634
  "sources":[
1635
  {
1
  {
2
  "lang":"hu",
3
  "name":"core_news_md",
4
+ "version":"3.7.0",
5
  "description":"Core Hungarian model for HuSpaCy. Components: tok2vec, senter, tagger, morphologizer, lemmatizer, parser, ner",
6
  "author":"SzegedAI, MILAB",
7
  "email":"gyorgy@orosz.link",
8
  "url":"https://github.com/huspacy/huspacy",
9
  "license":"cc-by-sa-4.0",
10
+ "spacy_version":">=3.7.0,<3.8.0",
11
+ "spacy_git_version":"a89eae928",
12
  "vectors":{
13
  "width":100,
14
  "vectors":200000,
1268
  "token_p":0.998565417,
1269
  "token_r":0.9993300153,
1270
  "token_f":0.9989475698,
1271
+ "sents_p":0.9776286353,
1272
+ "sents_r":0.9732739421,
1273
+ "sents_f":0.9754464286,
1274
+ "tag_acc":0.9694736842,
1275
+ "pos_acc":0.9686124402,
1276
+ "morph_acc":0.9439180783,
1277
+ "morph_micro_p":0.9764073207,
1278
+ "morph_micro_r":0.9675118178,
1279
+ "morph_micro_f":0.971939216,
1280
  "morph_per_feat":{
1281
  "Definite":{
1282
+ "p":0.9722222222,
1283
+ "r":0.979934671,
1284
+ "f":0.9760632117
1285
  },
1286
  "PronType":{
1287
+ "p":0.97353914,
1288
+ "r":0.9746136865,
1289
+ "f":0.9740761169
1290
  },
1291
  "Case":{
1292
+ "p":0.9836523126,
1293
+ "r":0.974906145,
1294
+ "f":0.9792597003
1295
  },
1296
  "Degree":{
1297
+ "p":0.9437896646,
1298
+ "r":0.8660565724,
1299
+ "f":0.9032537961
1300
  },
1301
  "Number":{
1302
+ "p":0.9899006901,
1303
+ "r":0.9855873974,
1304
+ "f":0.9877393349
1305
  },
1306
  "Mood":{
1307
+ "p":0.9458563536,
1308
+ "r":0.9490022173,
1309
+ "f":0.947426674
1310
  },
1311
  "Person":{
1312
+ "p":0.9670239077,
1313
+ "r":0.9646381579,
1314
+ "f":0.9658295595
1315
  },
1316
  "Tense":{
1317
+ "p":0.9812154696,
1318
+ "r":0.9812154696,
1319
+ "f":0.9812154696
1320
  },
1321
  "VerbForm":{
1322
+ "p":0.976450799,
1323
+ "r":0.9310344828,
1324
+ "f":0.9532019704
1325
  },
1326
  "Voice":{
1327
+ "p":0.9714576962,
1328
+ "r":0.9744376278,
1329
+ "f":0.9729453803
1330
  },
1331
  "Number[psor]":{
1332
+ "p":0.9771101574,
1333
+ "r":0.9729344729,
1334
+ "f":0.9750178444
1335
  },
1336
  "Person[psor]":{
1337
+ "p":0.9771101574,
1338
+ "r":0.9743223966,
1339
+ "f":0.9757142857
1340
  },
1341
  "NumType":{
1342
+ "p":0.9484029484,
1343
+ "r":0.9414634146,
1344
+ "f":0.9449204406
 
 
 
 
 
1345
  },
1346
  "Reflex":{
1347
  "p":1.0,
1348
+ "r":0.625,
1349
+ "f":0.7692307692
1350
  },
1351
  "Reflexive":{
1352
  "p":0.0,
1363
  "r":0.0,
1364
  "f":0.0
1365
  },
1366
+ "Poss":{
1367
+ "p":0.6,
1368
+ "r":1.0,
1369
+ "f":0.75
1370
+ },
1371
  "Number[psed]":{
1372
  "p":1.0,
1373
  "r":0.1111111111,
1374
  "f":0.2
1375
  }
1376
  },
1377
+ "lemma_acc":0.9745478902,
1378
+ "dep_uas":0.8147198216,
1379
+ "dep_las":0.743867083,
1380
  "dep_las_per_type":{
1381
  "det":{
1382
+ "p":0.8673946958,
1383
+ "r":0.8853503185,
1384
+ "f":0.8762805359
1385
  },
1386
  "amod:att":{
1387
+ "p":0.8639736191,
1388
+ "r":0.8569092396,
1389
+ "f":0.8604269294
1390
  },
1391
  "nsubj":{
1392
+ "p":0.7293934681,
1393
+ "r":0.7328125,
1394
+ "f":0.7310989867
1395
  },
1396
  "advmod:mode":{
1397
+ "p":0.5444191344,
1398
+ "r":0.5857843137,
1399
+ "f":0.5643447462
1400
  },
1401
  "nmod:att":{
1402
+ "p":0.7882960413,
1403
+ "r":0.7762711864,
1404
+ "f":0.7822374039
1405
  },
1406
  "obl":{
1407
+ "p":0.7491289199,
1408
+ "r":0.7740774077,
1409
+ "f":0.761398849
1410
  },
1411
  "obj":{
1412
+ "p":0.8519362187,
1413
+ "r":0.8404494382,
1414
+ "f":0.8461538462
1415
  },
1416
  "root":{
1417
+ "p":0.8277404922,
1418
+ "r":0.8240534521,
1419
+ "f":0.8258928571
1420
  },
1421
  "cc":{
1422
+ "p":0.6731182796,
1423
+ "r":0.6589473684,
1424
+ "f":0.6659574468
1425
  },
1426
  "conj":{
1427
+ "p":0.4225352113,
1428
+ "r":0.5,
1429
+ "f":0.4580152672
1430
  },
1431
  "advmod":{
1432
+ "p":0.824742268,
1433
+ "r":0.8421052632,
1434
+ "f":0.8333333333
1435
  },
1436
  "flat:name":{
1437
+ "p":0.8240343348,
1438
+ "r":0.8971962617,
1439
+ "f":0.8590604027
1440
  },
1441
  "appos":{
1442
+ "p":0.3975903614,
1443
+ "r":0.3510638298,
1444
+ "f":0.3728813559
1445
  },
1446
  "advcl":{
1447
+ "p":0.2207792208,
1448
+ "r":0.1734693878,
1449
+ "f":0.1942857143
1450
  },
1451
  "advmod:tlocy":{
1452
+ "p":0.6371308017,
1453
  "r":0.6565217391,
1454
+ "f":0.6466809422
1455
  },
1456
  "ccomp:obj":{
1457
+ "p":0.28,
1458
+ "r":0.4242424242,
1459
+ "f":0.3373493976
1460
  },
1461
  "mark":{
1462
+ "p":0.8441558442,
1463
+ "r":0.8227848101,
1464
+ "f":0.8333333333
1465
  },
1466
  "compound:preverb":{
1467
+ "p":0.9189189189,
1468
+ "r":0.9357798165,
1469
+ "f":0.9272727273
1470
  },
1471
  "advmod:locy":{
1472
+ "p":0.8095238095,
1473
+ "r":0.53125,
1474
+ "f":0.641509434
1475
  },
1476
  "cop":{
1477
+ "p":0.8064516129,
1478
+ "r":0.6097560976,
1479
+ "f":0.6944444444
1480
  },
1481
  "nmod:obl":{
1482
+ "p":0.125,
1483
+ "r":0.075,
1484
+ "f":0.09375
1485
  },
1486
  "advmod:to":{
1487
  "p":0.0,
1494
  "f":0.0
1495
  },
1496
  "ccomp:obl":{
1497
+ "p":0.5652173913,
1498
+ "r":0.40625,
1499
+ "f":0.4727272727
1500
  },
1501
  "iobj":{
1502
+ "p":0.3076923077,
1503
  "r":0.2666666667,
1504
+ "f":0.2857142857
 
 
 
 
 
 
 
 
 
 
1505
  },
1506
  "case":{
1507
+ "p":0.9390862944,
1508
+ "r":0.943877551,
1509
+ "f":0.941475827
1510
  },
1511
  "csubj":{
1512
+ "p":0.4782608696,
1513
+ "r":0.2972972973,
1514
+ "f":0.3666666667
1515
+ },
1516
+ "parataxis":{
1517
+ "p":0.2413793103,
1518
+ "r":0.095890411,
1519
+ "f":0.137254902
1520
  },
1521
  "xcomp":{
1522
+ "p":0.9014084507,
1523
+ "r":0.8648648649,
1524
+ "f":0.8827586207
1525
  },
1526
  "nummod":{
1527
+ "p":0.5794392523,
1528
+ "r":0.6666666667,
1529
+ "f":0.62
1530
  },
1531
  "acl":{
1532
+ "p":0.4038461538,
1533
+ "r":0.2916666667,
1534
+ "f":0.3387096774
1535
  },
1536
  "advmod:tto":{
1537
+ "p":0.5,
1538
  "r":0.2,
1539
+ "f":0.2857142857
1540
  },
1541
  "nmod":{
1542
+ "p":0.4,
1543
+ "r":0.1818181818,
1544
+ "f":0.25
1545
+ },
1546
+ "ccomp":{
1547
+ "p":0.1428571429,
1548
+ "r":0.0769230769,
1549
+ "f":0.1
1550
+ },
1551
+ "dep":{
1552
+ "p":0.0,
1553
+ "r":0.0,
1554
+ "f":0.0
1555
  },
1556
  "aux":{
1557
+ "p":0.8461538462,
1558
+ "r":0.9166666667,
1559
+ "f":0.88
1560
  },
1561
  "advmod:tfrom":{
1562
  "p":0.0,
1563
  "r":0.0,
1564
  "f":0.0
1565
  },
1566
+ "list":{
1567
+ "p":0.0769230769,
1568
+ "r":0.1666666667,
1569
+ "f":0.1052631579
1570
+ },
1571
  "goeswith":{
1572
  "p":0.0,
1573
  "r":0.0,
1574
  "f":0.0
1575
  },
1576
  "compound":{
1577
+ "p":0.9285714286,
1578
+ "r":0.975,
1579
+ "f":0.9512195122
1580
  },
1581
+ "obl:lvc":{
1582
  "p":0.0,
1583
  "r":0.0,
1584
  "f":0.0
1585
  },
1586
+ "orphan":{
1587
  "p":0.0,
1588
  "r":0.0,
1589
  "f":0.0
1593
  "r":0.0,
1594
  "f":0.0
1595
  },
1596
+ "advmod:que":{
1597
+ "p":0.5,
1598
+ "r":0.25,
1599
+ "f":0.3333333333
 
 
 
 
 
1600
  },
1601
  "ccomp:pred":{
1602
  "p":0.0,
1603
  "r":0.0,
1604
  "f":0.0
 
 
 
 
 
1605
  }
1606
  },
1607
+ "ents_p":0.8459219858,
1608
+ "ents_r":0.8387834037,
1609
+ "ents_f":0.8423375706,
1610
  "ents_per_type":{
1611
  "ORG":{
1612
+ "p":0.8763686131,
1613
+ "r":0.8905887807,
1614
+ "f":0.8834214762
1615
  },
1616
  "PER":{
1617
+ "p":0.8937998772,
1618
+ "r":0.8697729988,
1619
+ "f":0.8816227672
1620
  },
1621
  "LOC":{
1622
+ "p":0.8631111111,
1623
  "r":0.8428819444,
1624
+ "f":0.852876592
1625
  },
1626
  "MISC":{
1627
+ "p":0.6095100865,
1628
+ "r":0.6,
1629
+ "f":0.6047176555
1630
  }
1631
  },
1632
+ "speed":2571.9543216127
1633
  },
1634
  "sources":[
1635
  {
morphologizer/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b59555ff38051fbf45fc3e5166e33ee0d78b41a030a35d062576a2f0648ee623
3
  size 463022
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d94c79de663f5f97b568c14633ae56d7f5d760997484752d23bea65b0b893cb1
3
  size 463022
ner/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:86aafd5f7fc971801c75f57099b3292f63f42cd253b617cf5682a3de2eb48a60
3
  size 9791307
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d9ac04f677390bc6234834d283863ea0035c2d88fafa9742f0d14d05b52500c5
3
  size 9791307
parser/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a2f0a5847926d84b43cb4ac5291ddf93bce08ea4ef49cea67ba1c4b83deb8ab1
3
  size 25601129
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6c9d8050c89a70230791b10b332f235d9519816c9ff18d7d87cf9aa4c1e40083
3
  size 25601129
senter/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b81540cfd184f400a3ff2dbcfbd2606cd3a65836e0ab348c3457d2d1dc66281a
3
  size 1237
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1d2f4a34e2f4cd9f766eac2476a2929adf82a335ff18611d4fffbc976a62520d
3
  size 1237
tagger/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:20cb552430c4dbb3880081377133931e3c64aea53924763c3d243275873d7e69
3
  size 7297
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e9edc7a5e306bdb61193547851bf6b090fd2731114a69398b171d99045fc91bb
3
  size 7297
tok2vec/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1f1cb7ce2115cc897e4c998494b76a9294ce994a3db586a31f8cbdf331cd4629
3
  size 9659749
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:56d7bd61838b579590581a9a1aadf39e6067c62dba35136fe74f4220f587b2e8
3
  size 9659749
trainable_lemmatizer/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ce0122da2b20fb63b9d9fb629a2ea663af3f1635f19761a23de9ec0ba9f7e3ed
3
  size 11282980
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0f1903089bfb7ac344c3a197e44a9a367fabcaf51c5a6b533937d3976f7d7a76
3
  size 11282980
vocab/strings.json CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9df155e8041b21734b17f711752037219115a343ea0cae24210ede7ed702fe65
3
- size 6390329
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0ef5e125edd7df78f5f07048d5268a62335857da7c834fc9d02d77d88601526b
3
+ size 6389762