oroszgy commited on
Commit
23eaa59
1 Parent(s): aee94a9

Update spacy pipeline to 3.7.0

Browse files
README.md CHANGED
@@ -14,74 +14,74 @@ model-index:
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
- value: 0.8643145515
18
  - name: NER Recall
19
  type: recall
20
- value: 0.8656821378
21
  - name: NER F Score
22
  type: f_score
23
- value: 0.8649978041
24
  - task:
25
  name: TAG
26
  type: token-classification
27
  metrics:
28
  - name: TAG (XPOS) Accuracy
29
  type: accuracy
30
- value: 0.9688008422
31
  - task:
32
  name: POS
33
  type: token-classification
34
  metrics:
35
  - name: POS (UPOS) Accuracy
36
  type: accuracy
37
- value: 0.9683223275
38
  - task:
39
  name: MORPH
40
  type: token-classification
41
  metrics:
42
  - name: Morph (UFeats) Accuracy
43
  type: accuracy
44
- value: 0.9386544167
45
  - task:
46
  name: LEMMA
47
  type: token-classification
48
  metrics:
49
  - name: Lemma Accuracy
50
  type: accuracy
51
- value: 0.976365898
52
  - task:
53
  name: UNLABELED_DEPENDENCIES
54
  type: token-classification
55
  metrics:
56
  - name: Unlabeled Attachment Score (UAS)
57
  type: f_score
58
- value: 0.8291946309
59
  - task:
60
  name: LABELED_DEPENDENCIES
61
  type: token-classification
62
  metrics:
63
  - name: Labeled Attachment Score (LAS)
64
  type: f_score
65
- value: 0.7686371008
66
  - task:
67
  name: SENTS
68
  type: token-classification
69
  metrics:
70
  - name: Sentences F-Score
71
  type: f_score
72
- value: 0.9821029083
73
  ---
74
  Core Hungarian model for HuSpaCy. Components: tok2vec, senter, tagger, morphologizer, lemmatizer, parser, ner
75
 
76
  | Feature | Description |
77
  | --- | --- |
78
  | **Name** | `hu_core_news_lg` |
79
- | **Version** | `3.6.1` |
80
- | **spaCy** | `>=3.6.0,<3.7.0` |
81
  | **Default Pipeline** | `tok2vec`, `senter`, `tagger`, `morphologizer`, `lookup_lemmatizer`, `trainable_lemmatizer`, `parser`, `ner` |
82
  | **Components** | `tok2vec`, `senter`, `tagger`, `morphologizer`, `lookup_lemmatizer`, `trainable_lemmatizer`, `parser`, `ner` |
83
  | **Vectors** | -1 keys, 200000 unique vectors (300 dimensions) |
84
- | **Sources** | [UD Hungarian Szeged](https://universaldependencies.org/treebanks/hu_szeged/index.html) (Richárd Farkas, Katalin Simkó, Zsolt Szántó, Viktor Varga, Veronika Vincze (MTA-SZTE Research Group on Artificial Intelligence))<br />[NYTK-NerKor Corpus](https://github.com/nytud/NYTK-NerKor) (Eszter Simon, Noémi Vadász (Department of Language Technology and Applied Linguistics))<br />[Szeged NER Corpus](https://rgai.inf.u-szeged.hu/node/130) (György Szarvas, Richárd Farkas, László Felföldi, András Kocsor, János Csirik (MTA-SZTE Research Group on Artificial Intelligence))<br />[Hungarian lg Floret vectors](https://huggingface.co/huspacy/hu_vectors_web_lg) (Szeged AI) |
85
  | **License** | `cc-by-sa-4.0` |
86
  | **Author** | [SzegedAI, MILAB](https://github.com/huspacy/huspacy) |
87
 
@@ -108,18 +108,18 @@ Core Hungarian model for HuSpaCy. Components: tok2vec, senter, tagger, morpholog
108
  | `TOKEN_P` | 99.86 |
109
  | `TOKEN_R` | 99.93 |
110
  | `TOKEN_F` | 99.89 |
111
- | `SENTS_P` | 94.34 |
112
- | `SENTS_R` | 96.44 |
113
- | `SENTS_F` | 95.37 |
114
- | `TAG_ACC` | 96.70 |
115
- | `POS_ACC` | 96.78 |
116
- | `MORPH_ACC` | 93.33 |
117
- | `MORPH_MICRO_P` | 97.17 |
118
- | `MORPH_MICRO_R` | 96.36 |
119
- | `MORPH_MICRO_F` | 96.77 |
120
- | `LEMMA_ACC` | 97.57 |
121
- | `DEP_UAS` | 80.23 |
122
- | `DEP_LAS` | 73.02 |
123
- | `ENTS_P` | 86.40 |
124
- | `ENTS_R` | 85.65 |
125
- | `ENTS_F` | 86.02 |
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
+ value: 0.8621839888
18
  - name: NER Recall
19
  type: recall
20
+ value: 0.8633966245
21
  - name: NER F Score
22
  type: f_score
23
+ value: 0.8627898805
24
  - task:
25
  name: TAG
26
  type: token-classification
27
  metrics:
28
  - name: TAG (XPOS) Accuracy
29
  type: accuracy
30
+ value: 0.9696621686
31
  - task:
32
  name: POS
33
  type: token-classification
34
  metrics:
35
  - name: POS (UPOS) Accuracy
36
  type: accuracy
37
+ value: 0.9677481099
38
  - task:
39
  name: MORPH
40
  type: token-classification
41
  metrics:
42
  - name: Morph (UFeats) Accuracy
43
  type: accuracy
44
+ value: 0.9376016844
45
  - task:
46
  name: LEMMA
47
  type: token-classification
48
  metrics:
49
  - name: Lemma Accuracy
50
  type: accuracy
51
+ value: 0.9755047364
52
  - task:
53
  name: UNLABELED_DEPENDENCIES
54
  type: token-classification
55
  metrics:
56
  - name: Unlabeled Attachment Score (UAS)
57
  type: f_score
58
+ value: 0.8266144814
59
  - task:
60
  name: LABELED_DEPENDENCIES
61
  type: token-classification
62
  metrics:
63
  - name: Labeled Attachment Score (LAS)
64
  type: f_score
65
+ value: 0.7624266145
66
  - task:
67
  name: SENTS
68
  type: token-classification
69
  metrics:
70
  - name: Sentences F-Score
71
  type: f_score
72
+ value: 0.9821428571
73
  ---
74
  Core Hungarian model for HuSpaCy. Components: tok2vec, senter, tagger, morphologizer, lemmatizer, parser, ner
75
 
76
  | Feature | Description |
77
  | --- | --- |
78
  | **Name** | `hu_core_news_lg` |
79
+ | **Version** | `3.7.0` |
80
+ | **spaCy** | `>=3.7.0,<3.8.0` |
81
  | **Default Pipeline** | `tok2vec`, `senter`, `tagger`, `morphologizer`, `lookup_lemmatizer`, `trainable_lemmatizer`, `parser`, `ner` |
82
  | **Components** | `tok2vec`, `senter`, `tagger`, `morphologizer`, `lookup_lemmatizer`, `trainable_lemmatizer`, `parser`, `ner` |
83
  | **Vectors** | -1 keys, 200000 unique vectors (300 dimensions) |
84
+ | **Sources** | [UD Hungarian Szeged](https://universaldependencies.org/treebanks/hu_szeged/index.html) (Richárd Farkas, Katalin Simkó, Zsolt Szántó, Viktor Varga, Veronika Vincze (MTA-SZTE Research Group on Artificial Intelligence))<br>[NYTK-NerKor Corpus](https://github.com/nytud/NYTK-NerKor) (Eszter Simon, Noémi Vadász (Department of Language Technology and Applied Linguistics))<br>[Szeged NER Corpus](https://rgai.inf.u-szeged.hu/node/130) (György Szarvas, Richárd Farkas, László Felföldi, András Kocsor, János Csirik (MTA-SZTE Research Group on Artificial Intelligence))<br>[Hungarian lg Floret vectors](https://huggingface.co/huspacy/hu_vectors_web_lg) (Szeged AI) |
85
  | **License** | `cc-by-sa-4.0` |
86
  | **Author** | [SzegedAI, MILAB](https://github.com/huspacy/huspacy) |
87
 
108
  | `TOKEN_P` | 99.86 |
109
  | `TOKEN_R` | 99.93 |
110
  | `TOKEN_F` | 99.89 |
111
+ | `SENTS_P` | 98.43 |
112
+ | `SENTS_R` | 98.00 |
113
+ | `SENTS_F` | 98.21 |
114
+ | `TAG_ACC` | 96.97 |
115
+ | `POS_ACC` | 96.77 |
116
+ | `MORPH_ACC` | 93.76 |
117
+ | `MORPH_MICRO_P` | 97.04 |
118
+ | `MORPH_MICRO_R` | 96.16 |
119
+ | `MORPH_MICRO_F` | 96.60 |
120
+ | `LEMMA_ACC` | 97.55 |
121
+ | `DEP_UAS` | 82.66 |
122
+ | `DEP_LAS` | 76.24 |
123
+ | `ENTS_P` | 86.22 |
124
+ | `ENTS_R` | 86.34 |
125
+ | `ENTS_F` | 86.28 |
config.cfg CHANGED
@@ -1,8 +1,8 @@
1
  [paths]
2
- parser_model = "models/hu_core_news_lg-parser-3.6.1/model-best"
3
- ner_model = "models/hu_core_news_lg-ner-3.6.1/model-best"
4
- lemmatizer_lookups = "models/hu_core_news_lg-lookup-lemmatizer-3.6.1"
5
- tagger_model = "models/hu_core_news_lg-tagger-3.6.1/model-best"
6
  train = null
7
  dev = null
8
  vectors = null
@@ -21,6 +21,7 @@ before_creation = null
21
  after_creation = null
22
  after_pipeline_creation = null
23
  batch_size = 1000
 
24
 
25
  [components]
26
 
1
  [paths]
2
+ parser_model = "models/hu_core_news_lg-parser-3.7.0/model-best"
3
+ ner_model = "models/hu_core_news_lg-ner-3.7.0/model-best"
4
+ lemmatizer_lookups = "models/hu_core_news_lg-lookup-lemmatizer-3.7.0"
5
+ tagger_model = "models/hu_core_news_lg-tagger-3.7.0/model-best"
6
  train = null
7
  dev = null
8
  vectors = null
21
  after_creation = null
22
  after_pipeline_creation = null
23
  batch_size = 1000
24
+ vectors = {"@vectors":"spacy.Vectors.v1"}
25
 
26
  [components]
27
 
hu_core_news_lg-any-py3-none-any.whl CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7e4fa1709ab778dd221228ca54a0479317cd1bf0277295c0e62c9cac10638613
3
- size 401612730
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:073cfc9b7c13d5d49065cc79c955eaa9a42e71ecbe63645f3ed50199287b939c
3
+ size 401409146
meta.json CHANGED
@@ -1,14 +1,14 @@
1
  {
2
  "lang":"hu",
3
  "name":"core_news_lg",
4
- "version":"3.6.1",
5
  "description":"Core Hungarian model for HuSpaCy. Components: tok2vec, senter, tagger, morphologizer, lemmatizer, parser, ner",
6
  "author":"SzegedAI, MILAB",
7
  "email":"gyorgy@orosz.link",
8
  "url":"https://github.com/huspacy/huspacy",
9
  "license":"cc-by-sa-4.0",
10
- "spacy_version":">=3.6.0,<3.7.0",
11
- "spacy_git_version":"6fc153a26",
12
  "vectors":{
13
  "width":300,
14
  "vectors":200000,
@@ -1268,90 +1268,90 @@
1268
  "token_p":0.998565417,
1269
  "token_r":0.9993300153,
1270
  "token_f":0.9989475698,
1271
- "sents_p":0.9865168539,
1272
- "sents_r":0.9777282851,
1273
- "sents_f":0.9821029083,
1274
- "tag_acc":0.9688008422,
1275
- "pos_acc":0.9683223275,
1276
- "morph_acc":0.9386544167,
1277
- "morph_micro_p":0.9702422145,
1278
- "morph_micro_r":0.9639879673,
1279
- "morph_micro_f":0.9671049795,
1280
  "morph_per_feat":{
1281
  "Definite":{
1282
- "p":0.968952734,
1283
- "r":0.975734951,
1284
- "f":0.9723320158
1285
  },
1286
  "PronType":{
1287
- "p":0.9773230088,
1288
- "r":0.9751655629,
1289
- "f":0.9762430939
1290
  },
1291
  "Case":{
1292
- "p":0.9755223881,
1293
- "r":0.9685832839,
1294
- "f":0.9720404521
1295
  },
1296
  "Degree":{
1297
- "p":0.9291958042,
1298
- "r":0.884359401,
1299
- "f":0.9062233589
1300
  },
1301
  "Number":{
1302
- "p":0.9860480753,
1303
- "r":0.9830735713,
1304
- "f":0.9845585767
1305
  },
1306
  "Mood":{
1307
- "p":0.9326086957,
1308
  "r":0.9512195122,
1309
- "f":0.9418221734
1310
  },
1311
  "Person":{
1312
- "p":0.9496753247,
1313
- "r":0.9621710526,
1314
- "f":0.9558823529
1315
  },
1316
  "Tense":{
1317
- "p":0.9641304348,
1318
- "r":0.9801104972,
1319
- "f":0.9720547945
1320
  },
1321
  "VerbForm":{
1322
- "p":0.9659574468,
1323
- "r":0.9101844427,
1324
- "f":0.9372419488
1325
  },
1326
  "Voice":{
1327
- "p":0.9598393574,
1328
- "r":0.9775051125,
1329
- "f":0.968591692
1330
  },
1331
  "Number[psor]":{
1332
- "p":0.9795620438,
1333
- "r":0.9558404558,
1334
- "f":0.967555876
1335
  },
1336
  "Person[psor]":{
1337
- "p":0.9824817518,
1338
- "r":0.9600570613,
1339
- "f":0.9711399711
1340
  },
1341
  "NumType":{
1342
- "p":0.9343065693,
1343
- "r":0.9365853659,
1344
- "f":0.9354445798
1345
  },
1346
  "Poss":{
1347
- "p":0.5,
1348
  "r":1.0,
1349
- "f":0.6666666667
1350
  },
1351
  "Reflex":{
1352
  "p":1.0,
1353
- "r":0.75,
1354
- "f":0.8571428571
1355
  },
1356
  "Aspect":{
1357
  "p":0.0,
@@ -1364,189 +1364,189 @@
1364
  "f":0.0
1365
  }
1366
  },
1367
- "lemma_acc":0.976365898,
1368
- "dep_uas":0.8291946309,
1369
- "dep_las":0.7686371008,
1370
  "dep_las_per_type":{
1371
  "det":{
1372
- "p":0.872488408,
1373
- "r":0.8988853503,
1374
- "f":0.8854901961
1375
  },
1376
  "amod:att":{
1377
- "p":0.8730290456,
1378
- "r":0.8601798855,
1379
- "f":0.8665568369
1380
  },
1381
  "nsubj":{
1382
- "p":0.7379518072,
1383
- "r":0.765625,
1384
- "f":0.7515337423
1385
  },
1386
  "advmod:mode":{
1387
- "p":0.6165048544,
1388
- "r":0.6225490196,
1389
- "f":0.6195121951
1390
  },
1391
  "nmod:att":{
1392
- "p":0.7933333333,
1393
  "r":0.806779661,
1394
- "f":0.8
1395
  },
1396
  "obl":{
1397
- "p":0.7814748201,
1398
- "r":0.7821782178,
1399
- "f":0.7818263608
1400
  },
1401
  "obj":{
1402
- "p":0.8775510204,
1403
- "r":0.8696629213,
1404
- "f":0.8735891648
1405
  },
1406
  "root":{
1407
- "p":0.8471910112,
1408
- "r":0.8396436526,
1409
- "f":0.8434004474
1410
  },
1411
  "cc":{
1412
- "p":0.6946236559,
1413
- "r":0.68,
1414
- "f":0.6872340426
1415
  },
1416
  "conj":{
1417
- "p":0.5336134454,
1418
- "r":0.5291666667,
1419
- "f":0.5313807531
1420
  },
1421
  "advmod":{
1422
- "p":0.8080808081,
1423
- "r":0.8421052632,
1424
- "f":0.824742268
1425
  },
1426
  "flat:name":{
1427
- "p":0.8888888889,
1428
- "r":0.8971962617,
1429
- "f":0.8930232558
1430
  },
1431
  "appos":{
1432
- "p":0.4252873563,
1433
- "r":0.3936170213,
1434
- "f":0.408839779
1435
  },
1436
  "advcl":{
1437
- "p":0.3398058252,
1438
- "r":0.3571428571,
1439
- "f":0.3482587065
1440
  },
1441
  "advmod:tlocy":{
1442
- "p":0.6704980843,
1443
- "r":0.7608695652,
1444
- "f":0.7128309572
1445
  },
1446
  "ccomp:obj":{
1447
- "p":0.3469387755,
1448
- "r":0.5151515152,
1449
- "f":0.4146341463
1450
  },
1451
  "mark":{
1452
- "p":0.7975460123,
1453
  "r":0.8227848101,
1454
- "f":0.8099688474
1455
  },
1456
  "compound:preverb":{
1457
- "p":0.9203539823,
1458
  "r":0.9541284404,
1459
- "f":0.9369369369
1460
  },
1461
  "advmod:locy":{
1462
- "p":0.8260869565,
1463
- "r":0.59375,
1464
- "f":0.6909090909
1465
  },
1466
  "cop":{
1467
- "p":0.8125,
1468
- "r":0.6341463415,
1469
- "f":0.7123287671
1470
  },
1471
  "nmod:obl":{
1472
- "p":0.2727272727,
1473
- "r":0.225,
1474
- "f":0.2465753425
1475
  },
1476
  "advmod:to":{
1477
- "p":0.0,
1478
- "r":0.0,
1479
- "f":0.0
1480
  },
1481
  "obj:lvc":{
1482
- "p":0.5,
1483
- "r":0.1666666667,
1484
- "f":0.25
1485
  },
1486
  "ccomp:obl":{
1487
- "p":0.619047619,
1488
  "r":0.40625,
1489
- "f":0.4905660377
1490
  },
1491
  "iobj":{
1492
- "p":0.2857142857,
1493
- "r":0.2666666667,
1494
- "f":0.275862069
1495
  },
1496
  "case":{
1497
- "p":0.9538461538,
1498
- "r":0.9489795918,
1499
- "f":0.9514066496
1500
  },
1501
  "csubj":{
1502
- "p":0.6153846154,
1503
- "r":0.4324324324,
1504
- "f":0.5079365079
1505
  },
1506
  "parataxis":{
1507
- "p":0.2916666667,
1508
- "r":0.095890411,
1509
- "f":0.1443298969
1510
  },
1511
  "xcomp":{
1512
- "p":0.8441558442,
1513
- "r":0.8783783784,
1514
- "f":0.8609271523
1515
  },
1516
  "nummod":{
1517
- "p":0.547826087,
1518
- "r":0.6774193548,
1519
- "f":0.6057692308
1520
  },
1521
  "acl":{
1522
- "p":0.4166666667,
1523
- "r":0.3472222222,
1524
- "f":0.3787878788
1525
  },
1526
  "advmod:tto":{
1527
- "p":1.0,
1528
- "r":0.1,
1529
- "f":0.1818181818
1530
  },
1531
  "nmod":{
1532
- "p":0.4,
1533
- "r":0.1818181818,
1534
- "f":0.25
1535
  },
1536
- "dep":{
 
 
 
 
 
1537
  "p":0.0,
1538
  "r":0.0,
1539
  "f":0.0
1540
  },
1541
- "aux":{
1542
- "p":0.9,
1543
- "r":0.75,
1544
- "f":0.8181818182
1545
- },
1546
- "advmod:tfrom":{
1547
- "p":1.0,
1548
- "r":0.1666666667,
1549
- "f":0.2857142857
1550
  },
1551
  "goeswith":{
1552
  "p":0.0,
@@ -1554,9 +1554,9 @@
1554
  "f":0.0
1555
  },
1556
  "compound":{
1557
- "p":0.847826087,
1558
  "r":0.975,
1559
- "f":0.9069767442
1560
  },
1561
  "obl:lvc":{
1562
  "p":0.0,
@@ -1569,9 +1569,9 @@
1569
  "f":0.0
1570
  },
1571
  "ccomp":{
1572
- "p":0.2,
1573
- "r":0.0769230769,
1574
- "f":0.1111111111
1575
  },
1576
  "nsubj:lvc":{
1577
  "p":0.0,
@@ -1579,47 +1579,47 @@
1579
  "f":0.0
1580
  },
1581
  "list":{
1582
- "p":0.1666666667,
1583
  "r":0.1666666667,
1584
- "f":0.1666666667
1585
- },
1586
- "advmod:que":{
1587
- "p":1.0,
1588
- "r":0.5,
1589
- "f":0.6666666667
1590
  },
1591
- "ccomp:pred":{
1592
  "p":0.0,
1593
  "r":0.0,
1594
  "f":0.0
 
 
 
 
 
1595
  }
1596
  },
1597
- "ents_p":0.8643145515,
1598
- "ents_r":0.8656821378,
1599
- "ents_f":0.8649978041,
1600
  "ents_per_type":{
1601
  "ORG":{
1602
- "p":0.8920532843,
1603
- "r":0.9003245248,
1604
- "f":0.89616982
1605
  },
1606
  "PER":{
1607
- "p":0.8984741784,
1608
- "r":0.9145758662,
1609
- "f":0.9064535228
1610
  },
1611
  "LOC":{
1612
- "p":0.8717277487,
1613
  "r":0.8671875,
1614
- "f":0.8694516971
1615
  },
1616
  "MISC":{
1617
- "p":0.6746268657,
1618
- "r":0.6411347518,
1619
- "f":0.6574545455
1620
  }
1621
  },
1622
- "speed":881.0252665471
1623
  },
1624
  "sources":[
1625
  {
1
  {
2
  "lang":"hu",
3
  "name":"core_news_lg",
4
+ "version":"3.7.0",
5
  "description":"Core Hungarian model for HuSpaCy. Components: tok2vec, senter, tagger, morphologizer, lemmatizer, parser, ner",
6
  "author":"SzegedAI, MILAB",
7
  "email":"gyorgy@orosz.link",
8
  "url":"https://github.com/huspacy/huspacy",
9
  "license":"cc-by-sa-4.0",
10
+ "spacy_version":">=3.7.0,<3.8.0",
11
+ "spacy_git_version":"a89eae928",
12
  "vectors":{
13
  "width":300,
14
  "vectors":200000,
1268
  "token_p":0.998565417,
1269
  "token_r":0.9993300153,
1270
  "token_f":0.9989475698,
1271
+ "sents_p":0.9843400447,
1272
+ "sents_r":0.9799554566,
1273
+ "sents_f":0.9821428571,
1274
+ "tag_acc":0.9696621686,
1275
+ "pos_acc":0.9677481099,
1276
+ "morph_acc":0.9376016844,
1277
+ "morph_micro_p":0.9703816132,
1278
+ "morph_micro_r":0.9616244091,
1279
+ "morph_micro_f":0.9659831643,
1280
  "morph_per_feat":{
1281
  "Definite":{
1282
+ "p":0.9625399726,
1283
+ "r":0.9832011199,
1284
+ "f":0.9727608495
1285
  },
1286
  "PronType":{
1287
+ "p":0.9747113799,
1288
+ "r":0.9784768212,
1289
+ "f":0.9765904709
1290
  },
1291
  "Case":{
1292
+ "p":0.975795159,
1293
+ "r":0.9638411381,
1294
+ "f":0.9697813121
1295
  },
1296
  "Degree":{
1297
+ "p":0.9310653536,
1298
+ "r":0.8652246256,
1299
+ "f":0.8969383355
1300
  },
1301
  "Number":{
1302
+ "p":0.9890165596,
1303
+ "r":0.9808949221,
1304
+ "f":0.9849389987
1305
  },
1306
  "Mood":{
1307
+ "p":0.9346405229,
1308
  "r":0.9512195122,
1309
+ "f":0.9428571429
1310
  },
1311
  "Person":{
1312
+ "p":0.9553719008,
1313
+ "r":0.9506578947,
1314
+ "f":0.9530090684
1315
  },
1316
  "Tense":{
1317
+ "p":0.962962963,
1318
+ "r":0.9767955801,
1319
+ "f":0.9698299506
1320
  },
1321
  "VerbForm":{
1322
+ "p":0.958643507,
1323
+ "r":0.9294306335,
1324
+ "f":0.9438110749
1325
  },
1326
  "Voice":{
1327
+ "p":0.9567404427,
1328
+ "r":0.972392638,
1329
+ "f":0.9645030426
1330
  },
1331
  "Number[psor]":{
1332
+ "p":0.9822747415,
1333
+ "r":0.9472934473,
1334
+ "f":0.9644670051
1335
  },
1336
  "Person[psor]":{
1337
+ "p":0.9837518464,
1338
+ "r":0.9500713267,
1339
+ "f":0.9666182874
1340
  },
1341
  "NumType":{
1342
+ "p":0.935483871,
1343
+ "r":0.9195121951,
1344
+ "f":0.9274292743
1345
  },
1346
  "Poss":{
1347
+ "p":0.6,
1348
  "r":1.0,
1349
+ "f":0.75
1350
  },
1351
  "Reflex":{
1352
  "p":1.0,
1353
+ "r":0.875,
1354
+ "f":0.9333333333
1355
  },
1356
  "Aspect":{
1357
  "p":0.0,
1364
  "f":0.0
1365
  }
1366
  },
1367
+ "lemma_acc":0.9755047364,
1368
+ "dep_uas":0.8266144814,
1369
+ "dep_las":0.7624266145,
1370
  "dep_las_per_type":{
1371
  "det":{
1372
+ "p":0.8717156105,
1373
+ "r":0.898089172,
1374
+ "f":0.8847058824
1375
  },
1376
  "amod:att":{
1377
+ "p":0.858677686,
1378
+ "r":0.8495502862,
1379
+ "f":0.8540896013
1380
  },
1381
  "nsubj":{
1382
+ "p":0.7348837209,
1383
+ "r":0.740625,
1384
+ "f":0.7377431907
1385
  },
1386
  "advmod:mode":{
1387
+ "p":0.621483376,
1388
+ "r":0.5955882353,
1389
+ "f":0.6082603254
1390
  },
1391
  "nmod:att":{
1392
+ "p":0.7603833866,
1393
  "r":0.806779661,
1394
+ "f":0.7828947368
1395
  },
1396
  "obl":{
1397
+ "p":0.7868561279,
1398
+ "r":0.797479748,
1399
+ "f":0.7921323201
1400
  },
1401
  "obj":{
1402
+ "p":0.8511111111,
1403
+ "r":0.8606741573,
1404
+ "f":0.8558659218
1405
  },
1406
  "root":{
1407
+ "p":0.8299776286,
1408
+ "r":0.8262806236,
1409
+ "f":0.828125
1410
  },
1411
  "cc":{
1412
+ "p":0.6937901499,
1413
+ "r":0.6821052632,
1414
+ "f":0.6878980892
1415
  },
1416
  "conj":{
1417
+ "p":0.5161987041,
1418
+ "r":0.4979166667,
1419
+ "f":0.506892895
1420
  },
1421
  "advmod":{
1422
+ "p":0.8282828283,
1423
+ "r":0.8631578947,
1424
+ "f":0.8453608247
1425
  },
1426
  "flat:name":{
1427
+ "p":0.8620689655,
1428
+ "r":0.9345794393,
1429
+ "f":0.8968609865
1430
  },
1431
  "appos":{
1432
+ "p":0.4852941176,
1433
+ "r":0.3510638298,
1434
+ "f":0.4074074074
1435
  },
1436
  "advcl":{
1437
+ "p":0.3,
1438
+ "r":0.2755102041,
1439
+ "f":0.2872340426
1440
  },
1441
  "advmod:tlocy":{
1442
+ "p":0.6818181818,
1443
+ "r":0.7826086957,
1444
+ "f":0.7287449393
1445
  },
1446
  "ccomp:obj":{
1447
+ "p":0.2711864407,
1448
+ "r":0.4848484848,
1449
+ "f":0.347826087
1450
  },
1451
  "mark":{
1452
+ "p":0.8024691358,
1453
  "r":0.8227848101,
1454
+ "f":0.8125
1455
  },
1456
  "compound:preverb":{
1457
+ "p":0.8888888889,
1458
  "r":0.9541284404,
1459
+ "f":0.9203539823
1460
  },
1461
  "advmod:locy":{
1462
+ "p":0.8,
1463
+ "r":0.625,
1464
+ "f":0.701754386
1465
  },
1466
  "cop":{
1467
+ "p":0.7567567568,
1468
+ "r":0.6829268293,
1469
+ "f":0.7179487179
1470
  },
1471
  "nmod:obl":{
1472
+ "p":0.3181818182,
1473
+ "r":0.175,
1474
+ "f":0.2258064516
1475
  },
1476
  "advmod:to":{
1477
+ "p":0.5,
1478
+ "r":0.125,
1479
+ "f":0.2
1480
  },
1481
  "obj:lvc":{
1482
+ "p":0.25,
1483
+ "r":0.0833333333,
1484
+ "f":0.125
1485
  },
1486
  "ccomp:obl":{
1487
+ "p":0.5,
1488
  "r":0.40625,
1489
+ "f":0.4482758621
1490
  },
1491
  "iobj":{
1492
+ "p":0.2,
1493
+ "r":0.3333333333,
1494
+ "f":0.25
1495
  },
1496
  "case":{
1497
+ "p":0.9479166667,
1498
+ "r":0.9285714286,
1499
+ "f":0.9381443299
1500
  },
1501
  "csubj":{
1502
+ "p":0.40625,
1503
+ "r":0.3513513514,
1504
+ "f":0.3768115942
1505
  },
1506
  "parataxis":{
1507
+ "p":0.2,
1508
+ "r":0.0410958904,
1509
+ "f":0.0681818182
1510
  },
1511
  "xcomp":{
1512
+ "p":0.8873239437,
1513
+ "r":0.8513513514,
1514
+ "f":0.8689655172
1515
  },
1516
  "nummod":{
1517
+ "p":0.5909090909,
1518
+ "r":0.6989247312,
1519
+ "f":0.6403940887
1520
  },
1521
  "acl":{
1522
+ "p":0.406779661,
1523
+ "r":0.3333333333,
1524
+ "f":0.3664122137
1525
  },
1526
  "advmod:tto":{
1527
+ "p":0.6666666667,
1528
+ "r":0.4,
1529
+ "f":0.5
1530
  },
1531
  "nmod":{
1532
+ "p":0.1428571429,
1533
+ "r":0.0909090909,
1534
+ "f":0.1111111111
1535
  },
1536
+ "aux":{
1537
+ "p":0.9090909091,
1538
+ "r":0.8333333333,
1539
+ "f":0.8695652174
1540
+ },
1541
+ "advmod:tfrom":{
1542
  "p":0.0,
1543
  "r":0.0,
1544
  "f":0.0
1545
  },
1546
+ "ccomp:pred":{
1547
+ "p":0.0,
1548
+ "r":0.0,
1549
+ "f":0.0
 
 
 
 
 
1550
  },
1551
  "goeswith":{
1552
  "p":0.0,
1554
  "f":0.0
1555
  },
1556
  "compound":{
1557
+ "p":0.9512195122,
1558
  "r":0.975,
1559
+ "f":0.962962963
1560
  },
1561
  "obl:lvc":{
1562
  "p":0.0,
1569
  "f":0.0
1570
  },
1571
  "ccomp":{
1572
+ "p":0.0,
1573
+ "r":0.0,
1574
+ "f":0.0
1575
  },
1576
  "nsubj:lvc":{
1577
  "p":0.0,
1579
  "f":0.0
1580
  },
1581
  "list":{
1582
+ "p":0.125,
1583
  "r":0.1666666667,
1584
+ "f":0.1428571429
 
 
 
 
 
1585
  },
1586
+ "dep":{
1587
  "p":0.0,
1588
  "r":0.0,
1589
  "f":0.0
1590
+ },
1591
+ "advmod:que":{
1592
+ "p":1.0,
1593
+ "r":0.75,
1594
+ "f":0.8571428571
1595
  }
1596
  },
1597
+ "ents_p":0.8621839888,
1598
+ "ents_r":0.8633966245,
1599
+ "ents_f":0.8627898805,
1600
  "ents_per_type":{
1601
  "ORG":{
1602
+ "p":0.8843537415,
1603
+ "r":0.9040333797,
1604
+ "f":0.894085282
1605
  },
1606
  "PER":{
1607
+ "p":0.9024683925,
1608
+ "r":0.8954599761,
1609
+ "f":0.8989505247
1610
  },
1611
  "LOC":{
1612
+ "p":0.872489083,
1613
  "r":0.8671875,
1614
+ "f":0.8698302133
1615
  },
1616
  "MISC":{
1617
+ "p":0.6759124088,
1618
+ "r":0.6567375887,
1619
+ "f":0.6661870504
1620
  }
1621
  },
1622
+ "speed":725.0647161719
1623
  },
1624
  "sources":[
1625
  {
morphologizer/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1ac63135fc77a2506feda18524ad2ab0899a8af00d82cc12e4be64a86922c23b
3
  size 1379030
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b29df29c80e5a4d6f03d4bef5679b025f5a3f2d309b399371b81b423da1090ca
3
  size 1379030
ner/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:bc7dcfa7ceb9db55d82cc94d695aa13941367cd05449a445ac52b926c880abfe
3
  size 56989063
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:79db4cbc396f6029a8258884fb3b63d2a9eab0558274364e1c603b3a5461d650
3
  size 56989063
parser/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c672c4c39c789ec10bdfae6da4136d0d3e085b215cb6e94585cf5864ac8e64bf
3
  size 26010735
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b8f63e981ee75ee27b5dc5ba658776dd1bf124c6d84b6d398e224c09ff32e34f
3
  size 26010735
senter/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:eeef97df9d91f7b927283a182546c66a53dfdf7287bc6d09af9e728bf28fbe34
3
  size 2845
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3afb9d113f491c285b2ea823c6fd7f50cbd9f7e886807829f82176e7140efda0
3
  size 2845
tagger/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:869824783064eb22aa905fcf4dc464a04e6b8541d83847ab51cc69715e641a3c
3
  size 20905
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ebafb99f7ca67aa5edda2721893a99587fea5d1401349f92766073d5052a346a
3
  size 20905
tok2vec/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b7a77f38f45eb158c9f8e187f195016c33073167ec54984b98185df744ad5f06
3
  size 56806299
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:339d89848c54094ec89b25ae3ea81dd9e7cd5dafcc4ebd98f1c4ce8a798b8626
3
  size 56806299
trainable_lemmatizer/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a6b84f3f0a92eac01e8e8e8918b4d867222d367e86c98044ee6646299d7cd5f5
3
  size 61643136
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4833cb4030d1dff71de1c4a71fe5a0f7c881cd3a1e50350944d3048ca716a44c
3
  size 61643136
vocab/strings.json CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8ca50e0fc0cfa6a17ce62ac7b7dc49a441e9f380d48694184291198e1afecc6f
3
- size 6389432
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4a699484b44f2ff44891c917e9976dc67e11dc6c681ebcbf4775941f67e2dd38
3
+ size 6388769