oroszgy commited on
Commit
b62c7b5
1 Parent(s): 029c58f

Update spacy pipeline to 3.5.3

Browse files
README.md CHANGED
@@ -14,74 +14,74 @@ model-index:
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
- value: 0.909059294
18
  - name: NER Recall
19
  type: recall
20
- value: 0.9191279887
21
  - name: NER F Score
22
  type: f_score
23
- value: 0.9140659149
24
  - task:
25
  name: TAG
26
  type: token-classification
27
  metrics:
28
  - name: TAG (XPOS) Accuracy
29
  type: accuracy
30
- value: 0.9817207388
31
  - task:
32
  name: POS
33
  type: token-classification
34
  metrics:
35
  - name: POS (UPOS) Accuracy
36
  type: accuracy
37
- value: 0.979902383
38
  - task:
39
  name: MORPH
40
  type: token-classification
41
  metrics:
42
  - name: Morph (UFeats) Accuracy
43
  type: accuracy
44
- value: 0.9645403646
45
  - task:
46
  name: LEMMA
47
  type: token-classification
48
  metrics:
49
  - name: Lemma Accuracy
50
  type: accuracy
51
- value: 0.986030045
52
  - task:
53
  name: UNLABELED_DEPENDENCIES
54
  type: token-classification
55
  metrics:
56
  - name: Unlabeled Attachment Score (UAS)
57
  type: f_score
58
- value: 0.9078903297
59
  - task:
60
  name: LABELED_DEPENDENCIES
61
  type: token-classification
62
  metrics:
63
  - name: Labeled Attachment Score (LAS)
64
  type: f_score
65
- value: 0.8674641148
66
  - task:
67
  name: SENTS
68
  type: token-classification
69
  metrics:
70
  - name: Sentences F-Score
71
  type: f_score
72
- value: 0.9966555184
73
  ---
74
  Hungarian transformer pipeline (huBERT) for HuSpaCy. Components: transformer, senter, tagger, morphologizer, lemmatizer, parser, ner
75
 
76
  | Feature | Description |
77
  | --- | --- |
78
  | **Name** | `hu_core_news_trf` |
79
- | **Version** | `3.5.2` |
80
  | **spaCy** | `>=3.5.0,<3.6.0` |
81
  | **Default Pipeline** | `transformer`, `senter`, `tagger`, `morphologizer`, `lookup_lemmatizer`, `trainable_lemmatizer`, `experimental_arc_predicter`, `experimental_arc_labeler`, `ner` |
82
  | **Components** | `transformer`, `senter`, `tagger`, `morphologizer`, `lookup_lemmatizer`, `trainable_lemmatizer`, `experimental_arc_predicter`, `experimental_arc_labeler`, `ner` |
83
  | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
84
- | **Sources** | [UD Hungarian Szeged](https://universaldependencies.org/treebanks/hu_szeged/index.html) (Richárd Farkas, Katalin Simkó, Zsolt Szántó, Viktor Varga, Veronika Vincze (MTA-SZTE Research Group on Artificial Intelligence))<br />[NYTK-NerKor Corpus](https://github.com/nytud/NYTK-NerKor) (Eszter Simon, Noémi Vadász (Department of Language Technology and Applied Linguistics))<br />[hunNERwiki](http://hlt.sztaki.hu/resources/hunnerwiki.html) (Eszter Simon, Dávid Márk Nemeskey (HLT Group, Budapest University of Technology and Economics))<br />[Szeged NER Corpus](https://rgai.inf.u-szeged.hu/node/130) (György Szarvas, Richárd Farkas, László Felföldi, András Kocsor, János Csirik (MTA-SZTE Research Group on Artificial Intelligence))<br />[huBERT base model (cased)](https://huggingface.co/SZTAKI-HLT/hubert-base-cc) (Dávid Márk Nemeskey (SZTAKI-HLT)) |
85
  | **License** | `cc-by-sa-4.0` |
86
  | **Author** | [SzegedAI, MILAB](https://github.com/huspacy/huspacy) |
87
 
@@ -108,20 +108,20 @@ Hungarian transformer pipeline (huBERT) for HuSpaCy. Components: transformer, se
108
  | `TOKEN_P` | 99.86 |
109
  | `TOKEN_R` | 99.93 |
110
  | `TOKEN_F` | 99.89 |
111
- | `SENTS_P` | 99.78 |
112
- | `SENTS_R` | 99.55 |
113
- | `SENTS_F` | 99.67 |
114
- | `TAG_ACC` | 98.17 |
115
- | `POS_ACC` | 97.99 |
116
- | `MORPH_ACC` | 96.45 |
117
- | `MORPH_MICRO_P` | 98.67 |
118
- | `MORPH_MICRO_R` | 98.29 |
119
- | `MORPH_MICRO_F` | 98.48 |
120
- | `LEMMA_ACC` | 98.60 |
121
- | `BOUND_DEP_LAS` | 86.74 |
122
- | `BOUND_DEP_UAS` | 90.77 |
123
- | `DEP_UAS` | 90.79 |
124
- | `DEP_LAS` | 86.75 |
125
- | `ENTS_P` | 90.91 |
126
- | `ENTS_R` | 91.91 |
127
- | `ENTS_F` | 91.41 |
 
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
+ value: 0.9121680264
18
  - name: NER Recall
19
  type: recall
20
+ value: 0.9238748242
21
  - name: NER F Score
22
  type: f_score
23
+ value: 0.9179841034
24
  - task:
25
  name: TAG
26
  type: token-classification
27
  metrics:
28
  - name: TAG (XPOS) Accuracy
29
  type: accuracy
30
+ value: 0.9831092397
31
  - task:
32
  name: POS
33
  type: token-classification
34
  metrics:
35
  - name: POS (UPOS) Accuracy
36
  type: accuracy
37
+ value: 0.9825350495
38
  - task:
39
  name: MORPH
40
  type: token-classification
41
  metrics:
42
  - name: Morph (UFeats) Accuracy
43
  type: accuracy
44
+ value: 0.9674610011
45
  - task:
46
  name: LEMMA
47
  type: token-classification
48
  metrics:
49
  - name: Lemma Accuracy
50
  type: accuracy
51
+ value: 0.9882307913
52
  - task:
53
  name: UNLABELED_DEPENDENCIES
54
  type: token-classification
55
  metrics:
56
  - name: Unlabeled Attachment Score (UAS)
57
  type: f_score
58
+ value: 0.8952581463
59
  - task:
60
  name: LABELED_DEPENDENCIES
61
  type: token-classification
62
  metrics:
63
  - name: Labeled Attachment Score (LAS)
64
  type: f_score
65
+ value: 0.8520574163
66
  - task:
67
  name: SENTS
68
  type: token-classification
69
  metrics:
70
  - name: Sentences F-Score
71
  type: f_score
72
+ value: 0.9933184855
73
  ---
74
  Hungarian transformer pipeline (huBERT) for HuSpaCy. Components: transformer, senter, tagger, morphologizer, lemmatizer, parser, ner
75
 
76
  | Feature | Description |
77
  | --- | --- |
78
  | **Name** | `hu_core_news_trf` |
79
+ | **Version** | `3.5.3` |
80
  | **spaCy** | `>=3.5.0,<3.6.0` |
81
  | **Default Pipeline** | `transformer`, `senter`, `tagger`, `morphologizer`, `lookup_lemmatizer`, `trainable_lemmatizer`, `experimental_arc_predicter`, `experimental_arc_labeler`, `ner` |
82
  | **Components** | `transformer`, `senter`, `tagger`, `morphologizer`, `lookup_lemmatizer`, `trainable_lemmatizer`, `experimental_arc_predicter`, `experimental_arc_labeler`, `ner` |
83
  | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
84
+ | **Sources** | [UD Hungarian Szeged](https://universaldependencies.org/treebanks/hu_szeged/index.html) (Richárd Farkas, Katalin Simkó, Zsolt Szántó, Viktor Varga, Veronika Vincze (MTA-SZTE Research Group on Artificial Intelligence))<br />[NYTK-NerKor Corpus](https://github.com/nytud/NYTK-NerKor) (Eszter Simon, Noémi Vadász (Department of Language Technology and Applied Linguistics))<br />[Szeged NER Corpus](https://rgai.inf.u-szeged.hu/node/130) (György Szarvas, Richárd Farkas, László Felföldi, András Kocsor, János Csirik (MTA-SZTE Research Group on Artificial Intelligence))<br />[huBERT base model (cased)](https://huggingface.co/SZTAKI-HLT/hubert-base-cc) (Dávid Márk Nemeskey (SZTAKI-HLT)) |
85
  | **License** | `cc-by-sa-4.0` |
86
  | **Author** | [SzegedAI, MILAB](https://github.com/huspacy/huspacy) |
87
 
 
108
  | `TOKEN_P` | 99.86 |
109
  | `TOKEN_R` | 99.93 |
110
  | `TOKEN_F` | 99.89 |
111
+ | `SENTS_P` | 99.33 |
112
+ | `SENTS_R` | 99.33 |
113
+ | `SENTS_F` | 99.33 |
114
+ | `TAG_ACC` | 98.31 |
115
+ | `POS_ACC` | 98.25 |
116
+ | `MORPH_ACC` | 96.75 |
117
+ | `MORPH_MICRO_P` | 98.81 |
118
+ | `MORPH_MICRO_R` | 98.58 |
119
+ | `MORPH_MICRO_F` | 98.69 |
120
+ | `LEMMA_ACC` | 98.82 |
121
+ | `BOUND_DEP_LAS` | 85.22 |
122
+ | `BOUND_DEP_UAS` | 89.54 |
123
+ | `DEP_UAS` | 89.53 |
124
+ | `DEP_LAS` | 85.21 |
125
+ | `ENTS_P` | 91.22 |
126
+ | `ENTS_R` | 92.39 |
127
+ | `ENTS_F` | 91.80 |
config.cfg CHANGED
@@ -1,8 +1,8 @@
1
  [paths]
2
- tagger_model = "models/hu_core_news_trf-tagger-3.5.2/model-best"
3
- parser_model = "models/hu_core_news_trf-parser-3.5.2/model-best"
4
- ner_model = "models/hu_core_news_trf-ner-3.5.2/model-best"
5
- lemmatizer_lookups = "models/hu_core_news_trf-lookup-lemmatizer-3.5.2"
6
  train = null
7
  dev = null
8
  vectors = null
 
1
  [paths]
2
+ tagger_model = "models/hu_core_news_trf-tagger-3.5.3/model-best"
3
+ parser_model = "models/hu_core_news_trf-parser-3.5.3/model-best"
4
+ ner_model = "models/hu_core_news_trf-ner-3.5.3/model-best"
5
+ lemmatizer_lookups = "models/hu_core_news_trf-lookup-lemmatizer-3.5.3"
6
  train = null
7
  dev = null
8
  vectors = null
experimental_arc_labeler/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3f48181517669b0533e2304d4837173a99389377a33345adbffc4f87a2e11edd
3
  size 14947179
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fb415ca51404d8e5a100885d6fb66892562d745e3e1f45b99671daa5db833f8b
3
  size 14947179
experimental_arc_predicter/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e2681bfb35605ba7bdf88d320139036210ca9d9458cd7b7c2823b96f188931a4
3
  size 413192
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4b1205f330e7aa342fdb28984d530e7bdf7ba776a995364c21d4d57a8c87b2d0
3
  size 413192
hu_core_news_trf-any-py3-none-any.whl CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0544eaf5c73bdb41432c55407f92978c15485d995812bb17d55c9c03e1e56553
3
- size 1266506371
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8dcef8d1b79a67fdd9185b3062a07b182480d5cc0c4028fb0ee3397607cbb1e8
3
+ size 1266435135
meta.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "lang":"hu",
3
  "name":"core_news_trf",
4
- "version":"3.5.2",
5
  "description":"Hungarian transformer pipeline (huBERT) for HuSpaCy. Components: transformer, senter, tagger, morphologizer, lemmatizer, parser, ner",
6
  "author":"SzegedAI, MILAB",
7
  "email":"gyorgy@orosz.link",
@@ -1281,222 +1281,222 @@
1281
  "token_p":0.998565417,
1282
  "token_r":0.9993300153,
1283
  "token_f":0.9989475698,
1284
- "sents_p":0.9977678571,
1285
- "sents_r":0.995545657,
1286
- "sents_f":0.9966555184,
1287
- "tag_acc":0.9817207388,
1288
- "pos_acc":0.979902383,
1289
- "morph_acc":0.9645403646,
1290
- "morph_micro_p":0.9866706928,
1291
- "morph_micro_r":0.982939407,
1292
- "morph_micro_f":0.9848015155,
1293
  "morph_per_feat":{
1294
  "Definite":{
1295
- "p":0.9815583218,
1296
- "r":0.9934671022,
1297
- "f":0.9874768089
1298
  },
1299
  "PronType":{
1300
- "p":0.9824175824,
1301
- "r":0.9867549669,
1302
- "f":0.9845814978
1303
  },
1304
  "Case":{
1305
- "p":0.9930389817,
1306
- "r":0.9865639202,
1307
- "f":0.9897908613
1308
  },
1309
  "Degree":{
1310
- "p":0.9568593615,
1311
- "r":0.9226289517,
1312
- "f":0.9394324439
1313
  },
1314
  "Number":{
1315
- "p":0.9951178451,
1316
- "r":0.9906150494,
1317
- "f":0.9928613421
1318
  },
1319
  "Mood":{
1320
- "p":0.9834801762,
1321
- "r":0.9900221729,
1322
- "f":0.9867403315
1323
  },
1324
  "Person":{
1325
- "p":0.9819078947,
1326
- "r":0.9819078947,
1327
- "f":0.9819078947
1328
  },
1329
  "Tense":{
1330
- "p":0.9911991199,
1331
- "r":0.9955801105,
1332
- "f":0.993384785
1333
  },
1334
  "VerbForm":{
1335
- "p":0.986852917,
1336
- "r":0.9631114675,
1337
- "f":0.9748376623
1338
  },
1339
  "Voice":{
1340
- "p":0.9806910569,
1341
  "r":0.9867075665,
1342
- "f":0.9836901121
1343
  },
1344
  "Number[psor]":{
1345
- "p":0.9872521246,
1346
- "r":0.9928774929,
1347
- "f":0.9900568182
1348
  },
1349
  "Person[psor]":{
1350
- "p":0.9858356941,
1351
  "r":0.9928673324,
1352
- "f":0.9893390192
 
 
 
 
 
1353
  },
1354
  "NumType":{
1355
- "p":0.941031941,
1356
- "r":0.9341463415,
1357
- "f":0.9375764994
1358
  },
1359
  "Reflex":{
1360
- "p":1.0,
1361
  "r":0.875,
1362
- "f":0.9333333333
1363
  },
1364
  "Aspect":{
1365
  "p":0.0,
1366
  "r":0.0,
1367
  "f":0.0
1368
  },
1369
- "Number[psed]":{
1370
- "p":1.0,
1371
- "r":0.3333333333,
1372
- "f":0.5
1373
- },
1374
  "Poss":{
1375
- "p":1.0,
1376
  "r":1.0,
1377
- "f":1.0
1378
  }
1379
  },
1380
- "lemma_acc":0.986030045,
1381
- "bound_dep_las":0.8673938002,
1382
- "bound_dep_uas":0.9076731726,
1383
- "dep_uas":0.9078903297,
1384
- "dep_las":0.8674641148,
1385
  "dep_las_per_type":{
1386
  "415":{
1387
- "p":0.9336455894,
1388
- "r":0.9522292994,
1389
- "f":0.942845881
1390
  },
1391
  "7411097074813287689":{
1392
- "p":0.9124497992,
1393
- "r":0.9288634505,
1394
- "f":0.9205834684
1395
  },
1396
  "429":{
1397
- "p":0.9141965679,
1398
- "r":0.915625,
1399
- "f":0.9149102264
1400
  },
1401
  "15861261214731031920":{
1402
- "p":0.7392344498,
1403
- "r":0.7573529412,
1404
- "f":0.7481840194
1405
  },
1406
  "991268021520064439":{
1407
- "p":0.8758278146,
1408
- "r":0.8966101695,
1409
- "f":0.8860971524
1410
  },
1411
  "435":{
1412
- "p":0.8942222222,
1413
- "r":0.9054905491,
1414
- "f":0.8998211091
1415
  },
1416
  "434":{
1417
- "p":0.952173913,
1418
- "r":0.9842696629,
1419
- "f":0.9679558011
1420
  },
1421
  "8206900633647566924":{
1422
- "p":0.875,
1423
- "r":0.9510022272,
1424
- "f":0.9114194237
1425
  },
1426
  "407":{
1427
- "p":0.8132780083,
1428
- "r":0.8252631579,
1429
- "f":0.8192267503
1430
  },
1431
  "410":{
1432
- "p":0.7398373984,
1433
- "r":0.7583333333,
1434
- "f":0.7489711934
1435
  },
1436
  "445":{
1437
- "p":0.8738127544,
1438
- "r":0.8708586883,
1439
- "f":0.8723332205
1440
  },
1441
  "400":{
1442
- "p":0.8367346939,
1443
- "r":0.8631578947,
1444
- "f":0.8497409326
1445
  },
1446
  "17772752594865228322":{
1447
- "p":0.9663461538,
1448
- "r":0.9392523364,
1449
- "f":0.9526066351
1450
  },
1451
  "403":{
1452
- "p":0.6811594203,
1453
- "r":0.5,
1454
- "f":0.5766871166
1455
  },
1456
  "399":{
1457
- "p":0.5,
1458
- "r":0.5510204082,
1459
- "f":0.5242718447
1460
  },
1461
  "3143985677199705895":{
1462
- "p":0.8091286307,
1463
- "r":0.847826087,
1464
- "f":0.8280254777
1465
  },
1466
  "9241468201421778905":{
1467
- "p":0.4210526316,
1468
- "r":0.4848484848,
1469
- "f":0.4507042254
1470
  },
1471
  "423":{
1472
- "p":0.9487179487,
1473
- "r":0.9367088608,
1474
- "f":0.9426751592
1475
  },
1476
  "13543738850102096385":{
1477
- "p":0.9444444444,
1478
- "r":0.9357798165,
1479
- "f":0.9400921659
1480
  },
1481
  "10901028881100056900":{
1482
- "p":0.75,
1483
- "r":0.75,
1484
- "f":0.75
1485
  },
1486
  "411":{
1487
- "p":0.8108108108,
1488
  "r":0.7317073171,
1489
- "f":0.7692307692
1490
  },
1491
  "12549387360942434255":{
1492
- "p":0.4864864865,
1493
- "r":0.45,
1494
- "f":0.4675324675
1495
  },
1496
  "303601073839818384":{
1497
- "p":0.5,
1498
- "r":0.125,
1499
- "f":0.2
1500
  },
1501
  "8884235091647096537":{
1502
  "p":0.0,
@@ -1504,74 +1504,64 @@
1504
  "f":0.0
1505
  },
1506
  "2249809950233855422":{
1507
- "p":0.5925925926,
1508
- "r":0.5,
1509
- "f":0.5423728814
1510
  },
1511
  "422":{
1512
- "p":0.4761904762,
1513
- "r":0.6666666667,
1514
- "f":0.5555555556
1515
- },
1516
- "408":{
1517
- "p":0.1333333333,
1518
- "r":0.1538461538,
1519
- "f":0.1428571429
1520
  },
1521
  "8110129090154140942":{
1522
- "p":0.9740932642,
1523
- "r":0.9591836735,
1524
- "f":0.9665809769
1525
  },
1526
  "412":{
1527
- "p":0.7083333333,
1528
- "r":0.4594594595,
1529
- "f":0.5573770492
1530
  },
1531
  "436":{
1532
- "p":0.4117647059,
1533
- "r":0.095890411,
1534
- "f":0.1555555556
1535
  },
1536
  "450":{
1537
- "p":0.9466666667,
1538
- "r":0.9594594595,
1539
- "f":0.9530201342
1540
  },
1541
  "12837356684637874264":{
1542
- "p":0.7466666667,
1543
- "r":0.6021505376,
1544
- "f":0.6666666667
1545
- },
1546
- "3350290345017230236":{
1547
- "p":0.1666666667,
1548
- "r":0.0416666667,
1549
- "f":0.0666666667
1550
  },
1551
  "451":{
1552
- "p":0.5507246377,
1553
- "r":0.5277777778,
1554
- "f":0.5390070922
1555
  },
1556
  "7349492218059511525":{
1557
- "p":0.625,
1558
- "r":1.0,
1559
- "f":0.7692307692
1560
  },
1561
  "426":{
1562
- "p":1.0,
1563
- "r":0.3636363636,
1564
- "f":0.5333333333
1565
  },
1566
  "405":{
1567
- "p":0.9090909091,
1568
- "r":0.8333333333,
1569
- "f":0.8695652174
1570
  },
1571
  "17865338459503383721":{
1572
- "p":1.0,
1573
  "r":0.1666666667,
1574
- "f":0.2857142857
1575
  },
1576
  "17311980334327143026":{
1577
  "p":0.0,
@@ -1588,15 +1578,25 @@
1588
  "r":0.0,
1589
  "f":0.0
1590
  },
 
 
 
 
 
1591
  "10069665988847657778":{
1592
  "p":0.0,
1593
  "r":0.0,
1594
  "f":0.0
1595
  },
1596
  "17473201795025412735":{
1597
- "p":0.2,
1598
  "r":0.1666666667,
1599
- "f":0.1818181818
 
 
 
 
 
1600
  },
1601
  "6522094215780122214":{
1602
  "p":1.0,
@@ -1609,32 +1609,32 @@
1609
  "f":0.0
1610
  }
1611
  },
1612
- "ents_p":0.909059294,
1613
- "ents_r":0.9191279887,
1614
- "ents_f":0.9140659149,
1615
  "ents_per_type":{
1616
  "ORG":{
1617
- "p":0.935604293,
1618
- "r":0.9295317571,
1619
- "f":0.9325581395
1620
  },
1621
  "PER":{
1622
- "p":0.9309551208,
1623
- "r":0.9665471924,
1624
- "f":0.9484173505
1625
  },
1626
  "LOC":{
1627
- "p":0.9361702128,
1628
- "r":0.9166666667,
1629
- "f":0.9263157895
1630
  },
1631
  "MISC":{
1632
- "p":0.7398921833,
1633
  "r":0.7787234043,
1634
- "f":0.7588113338
1635
  }
1636
  },
1637
- "speed":3095.7537591888
1638
  },
1639
  "sources":[
1640
  {
@@ -1649,12 +1649,6 @@
1649
  "license":"CC BY-SA 4.0",
1650
  "author":"Eszter Simon, No\u00e9mi Vad\u00e1sz (Department of Language Technology and Applied Linguistics)"
1651
  },
1652
- {
1653
- "name":"hunNERwiki",
1654
- "url":"http://hlt.sztaki.hu/resources/hunnerwiki.html",
1655
- "license":"CC-BY-SA-3.0",
1656
- "author":"Eszter Simon, D\u00e1vid M\u00e1rk Nemeskey (HLT Group, Budapest University of Technology and Economics)"
1657
- },
1658
  {
1659
  "name":"Szeged NER Corpus",
1660
  "url":"https://rgai.inf.u-szeged.hu/node/130",
 
1
  {
2
  "lang":"hu",
3
  "name":"core_news_trf",
4
+ "version":"3.5.3",
5
  "description":"Hungarian transformer pipeline (huBERT) for HuSpaCy. Components: transformer, senter, tagger, morphologizer, lemmatizer, parser, ner",
6
  "author":"SzegedAI, MILAB",
7
  "email":"gyorgy@orosz.link",
 
1281
  "token_p":0.998565417,
1282
  "token_r":0.9993300153,
1283
  "token_f":0.9989475698,
1284
+ "sents_p":0.9933184855,
1285
+ "sents_r":0.9933184855,
1286
+ "sents_f":0.9933184855,
1287
+ "tag_acc":0.9831092397,
1288
+ "pos_acc":0.9825350495,
1289
+ "morph_acc":0.9674610011,
1290
+ "morph_micro_p":0.9880685734,
1291
+ "morph_micro_r":0.9857756768,
1292
+ "morph_micro_f":0.9869207934,
1293
  "morph_per_feat":{
1294
  "Definite":{
1295
+ "p":0.9920449228,
1296
+ "r":0.9892673822,
1297
+ "f":0.9906542056
1298
  },
1299
  "PronType":{
1300
+ "p":0.9884169884,
1301
+ "r":0.9889624724,
1302
+ "f":0.9886896552
1303
  },
1304
  "Case":{
1305
+ "p":0.9932793042,
1306
+ "r":0.9928867813,
1307
+ "f":0.993083004
1308
  },
1309
  "Degree":{
1310
+ "p":0.9472789116,
1311
+ "r":0.9267886855,
1312
+ "f":0.936921783
1313
  },
1314
  "Number":{
1315
+ "p":0.9949740325,
1316
+ "r":0.9953075247,
1317
+ "f":0.9951407507
1318
  },
1319
  "Mood":{
1320
+ "p":0.985619469,
1321
+ "r":0.987804878,
1322
+ "f":0.9867109635
1323
  },
1324
  "Person":{
1325
+ "p":0.9772542648,
1326
+ "r":0.9893092105,
1327
+ "f":0.9832447895
1328
  },
1329
  "Tense":{
1330
+ "p":0.9944751381,
1331
+ "r":0.9944751381,
1332
+ "f":0.9944751381
1333
  },
1334
  "VerbForm":{
1335
+ "p":0.9924686192,
1336
+ "r":0.9510825982,
1337
+ "f":0.9713349713
1338
  },
1339
  "Voice":{
1340
+ "p":0.9846938776,
1341
  "r":0.9867075665,
1342
+ "f":0.9856996936
1343
  },
1344
  "Number[psor]":{
1345
+ "p":0.99002849,
1346
+ "r":0.99002849,
1347
+ "f":0.99002849
1348
  },
1349
  "Person[psor]":{
1350
+ "p":0.9914529915,
1351
  "r":0.9928673324,
1352
+ "f":0.9921596579
1353
+ },
1354
+ "Number[psed]":{
1355
+ "p":0.8,
1356
+ "r":0.4444444444,
1357
+ "f":0.5714285714
1358
  },
1359
  "NumType":{
1360
+ "p":0.9366197183,
1361
+ "r":0.9731707317,
1362
+ "f":0.9545454545
1363
  },
1364
  "Reflex":{
1365
+ "p":0.875,
1366
  "r":0.875,
1367
+ "f":0.875
1368
  },
1369
  "Aspect":{
1370
  "p":0.0,
1371
  "r":0.0,
1372
  "f":0.0
1373
  },
 
 
 
 
 
1374
  "Poss":{
1375
+ "p":0.75,
1376
  "r":1.0,
1377
+ "f":0.8571428571
1378
  }
1379
  },
1380
+ "lemma_acc":0.9882307913,
1381
+ "bound_dep_las":0.8522205207,
1382
+ "bound_dep_uas":0.8953866769,
1383
+ "dep_uas":0.8952581463,
1384
+ "dep_las":0.8520574163,
1385
  "dep_las_per_type":{
1386
  "415":{
1387
+ "p":0.9417398244,
1388
+ "r":0.9394904459,
1389
+ "f":0.9406137904
1390
  },
1391
  "7411097074813287689":{
1392
+ "p":0.9087974173,
1393
+ "r":0.9206868357,
1394
+ "f":0.9147034931
1395
  },
1396
  "429":{
1397
+ "p":0.8983050847,
1398
+ "r":0.9109375,
1399
+ "f":0.9045771916
1400
  },
1401
  "15861261214731031920":{
1402
+ "p":0.7593582888,
1403
+ "r":0.6960784314,
1404
+ "f":0.726342711
1405
  },
1406
  "991268021520064439":{
1407
+ "p":0.8785357737,
1408
+ "r":0.8949152542,
1409
+ "f":0.8866498741
1410
  },
1411
  "435":{
1412
+ "p":0.8637931034,
1413
+ "r":0.901890189,
1414
+ "f":0.8824306473
1415
  },
1416
  "434":{
1417
+ "p":0.9456521739,
1418
+ "r":0.9775280899,
1419
+ "f":0.9613259669
1420
  },
1421
  "8206900633647566924":{
1422
+ "p":0.8150943396,
1423
+ "r":0.9621380846,
1424
+ "f":0.8825331971
1425
  },
1426
  "407":{
1427
+ "p":0.8069815195,
1428
+ "r":0.8273684211,
1429
+ "f":0.817047817
1430
  },
1431
  "410":{
1432
+ "p":0.7730496454,
1433
+ "r":0.68125,
1434
+ "f":0.7242524917
1435
  },
1436
  "445":{
1437
+ "p":0.8453747468,
1438
+ "r":0.8465179175,
1439
+ "f":0.8459459459
1440
  },
1441
  "400":{
1442
+ "p":0.8645833333,
1443
+ "r":0.8736842105,
1444
+ "f":0.8691099476
1445
  },
1446
  "17772752594865228322":{
1447
+ "p":0.9466019417,
1448
+ "r":0.9112149533,
1449
+ "f":0.9285714286
1450
  },
1451
  "403":{
1452
+ "p":0.606741573,
1453
+ "r":0.5744680851,
1454
+ "f":0.5901639344
1455
  },
1456
  "399":{
1457
+ "p":0.3868613139,
1458
+ "r":0.5408163265,
1459
+ "f":0.4510638298
1460
  },
1461
  "3143985677199705895":{
1462
+ "p":0.7626459144,
1463
+ "r":0.852173913,
1464
+ "f":0.8049281314
1465
  },
1466
  "9241468201421778905":{
1467
+ "p":0.3023255814,
1468
+ "r":0.3939393939,
1469
+ "f":0.3421052632
1470
  },
1471
  "423":{
1472
+ "p":0.9225806452,
1473
+ "r":0.9050632911,
1474
+ "f":0.9137380192
1475
  },
1476
  "13543738850102096385":{
1477
+ "p":0.9351851852,
1478
+ "r":0.9266055046,
1479
+ "f":0.930875576
1480
  },
1481
  "10901028881100056900":{
1482
+ "p":0.8928571429,
1483
+ "r":0.78125,
1484
+ "f":0.8333333333
1485
  },
1486
  "411":{
1487
+ "p":0.8823529412,
1488
  "r":0.7317073171,
1489
+ "f":0.8
1490
  },
1491
  "12549387360942434255":{
1492
+ "p":0.4571428571,
1493
+ "r":0.4,
1494
+ "f":0.4266666667
1495
  },
1496
  "303601073839818384":{
1497
+ "p":0.75,
1498
+ "r":0.375,
1499
+ "f":0.5
1500
  },
1501
  "8884235091647096537":{
1502
  "p":0.0,
 
1504
  "f":0.0
1505
  },
1506
  "2249809950233855422":{
1507
+ "p":0.3913043478,
1508
+ "r":0.28125,
1509
+ "f":0.3272727273
1510
  },
1511
  "422":{
1512
+ "p":0.375,
1513
+ "r":0.8,
1514
+ "f":0.5106382979
 
 
 
 
 
1515
  },
1516
  "8110129090154140942":{
1517
+ "p":0.9692307692,
1518
+ "r":0.9642857143,
1519
+ "f":0.9667519182
1520
  },
1521
  "412":{
1522
+ "p":0.6315789474,
1523
+ "r":0.3243243243,
1524
+ "f":0.4285714286
1525
  },
1526
  "436":{
1527
+ "p":0.3157894737,
1528
+ "r":0.0821917808,
1529
+ "f":0.1304347826
1530
  },
1531
  "450":{
1532
+ "p":0.96,
1533
+ "r":0.972972973,
1534
+ "f":0.966442953
1535
  },
1536
  "12837356684637874264":{
1537
+ "p":0.6551724138,
1538
+ "r":0.6129032258,
1539
+ "f":0.6333333333
 
 
 
 
 
1540
  },
1541
  "451":{
1542
+ "p":0.4807692308,
1543
+ "r":0.3472222222,
1544
+ "f":0.4032258065
1545
  },
1546
  "7349492218059511525":{
1547
+ "p":0.6666666667,
1548
+ "r":0.8,
1549
+ "f":0.7272727273
1550
  },
1551
  "426":{
1552
+ "p":0.6,
1553
+ "r":0.2727272727,
1554
+ "f":0.375
1555
  },
1556
  "405":{
1557
+ "p":0.9166666667,
1558
+ "r":0.9166666667,
1559
+ "f":0.9166666667
1560
  },
1561
  "17865338459503383721":{
1562
+ "p":0.3333333333,
1563
  "r":0.1666666667,
1564
+ "f":0.2222222222
1565
  },
1566
  "17311980334327143026":{
1567
  "p":0.0,
 
1578
  "r":0.0,
1579
  "f":0.0
1580
  },
1581
+ "3350290345017230236":{
1582
+ "p":0.0,
1583
+ "r":0.0,
1584
+ "f":0.0
1585
+ },
1586
  "10069665988847657778":{
1587
  "p":0.0,
1588
  "r":0.0,
1589
  "f":0.0
1590
  },
1591
  "17473201795025412735":{
1592
+ "p":0.1111111111,
1593
  "r":0.1666666667,
1594
+ "f":0.1333333333
1595
+ },
1596
+ "408":{
1597
+ "p":0.0,
1598
+ "r":0.0,
1599
+ "f":0.0
1600
  },
1601
  "6522094215780122214":{
1602
  "p":1.0,
 
1609
  "f":0.0
1610
  }
1611
  },
1612
+ "ents_p":0.9121680264,
1613
+ "ents_r":0.9238748242,
1614
+ "ents_f":0.9179841034,
1615
  "ents_per_type":{
1616
  "ORG":{
1617
+ "p":0.9296551724,
1618
+ "r":0.9374130737,
1619
+ "f":0.9335180055
1620
  },
1621
  "PER":{
1622
+ "p":0.9427235535,
1623
+ "r":0.9635603345,
1624
+ "f":0.953028065
1625
  },
1626
  "LOC":{
1627
+ "p":0.9402985075,
1628
+ "r":0.9296875,
1629
+ "f":0.9349628983
1630
  },
1631
  "MISC":{
1632
+ "p":0.745923913,
1633
  "r":0.7787234043,
1634
+ "f":0.7619708536
1635
  }
1636
  },
1637
+ "speed":3075.1123333576
1638
  },
1639
  "sources":[
1640
  {
 
1649
  "license":"CC BY-SA 4.0",
1650
  "author":"Eszter Simon, No\u00e9mi Vad\u00e1sz (Department of Language Technology and Applied Linguistics)"
1651
  },
 
 
 
 
 
 
1652
  {
1653
  "name":"Szeged NER Corpus",
1654
  "url":"https://rgai.inf.u-szeged.hu/node/130",
morphologizer/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1669c374c94ccfa4774c89cde15cfb3acdf4fa0a42dd5d991a40f9b45c3f6f0a
3
  size 3522673
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ad795837e3057eaaddb2d8004674ee0f5eceaea131a13c2606c67bcae66f151a
3
  size 3522673
ner/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5ea595d55219434c3cf1be3617618b79af0b7bd4128671b43f7f0cbaac20b3d1
3
  size 443884420
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d1db8479716f8dbd964226ba0cc3be73b92a20f9f2c91f71727d274999c2098f
3
  size 443884420
senter/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8f582a0d0c0fedeb0588bcbfcf56ddb4aa73126efb9f39233bb088fc9f64d7b0
3
  size 6792
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2d9560d873a7691947932096d2492fee2bef9bc8df690d048241056e70be77cd
3
  size 6792
tagger/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5f3aaec946ca56a88eb57e323209fcdf655a614585449d54561523dcb23d02fa
3
  size 52932
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4aacfb2a645012bc47a0c2eb3c48458f6d749500674d4c2e1dc937d0496bcee9
3
  size 52932
trainable_lemmatizer/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1cf14b4f1965a19970f0c21a320da12c04cd510e96c5beea7360f873de9c9744
3
  size 455959169
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e6710bf3538f17d19184489a163fd6adae772800da8c67f7965896b5f745c3fa
3
  size 455959169
trainable_lemmatizer/trees CHANGED
Binary files a/trainable_lemmatizer/trees and b/trainable_lemmatizer/trees differ
 
transformer/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:69c6dc9812f2080e1bf63506e972590b295335660867186711be369539fd3fa2
3
  size 443602220
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0a72b2247afcca07de20bc57c19afd053f46108798158e1b3192521b058c56ca
3
  size 443602220
vocab/strings.json CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9a5f80767d8b4723935240536536bfb543f502da5daa2fdb6eb6b83072758b28
3
- size 6399388
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b53dd9706ec7f856b63debedcc17bd3df3f7c8be9e0e9b72231f2990fac9b1f4
3
+ size 6387620