osanseviero committed
Commit 239866e · 1 parent: 70692c8
Update spaCy pipeline
- LICENSES_SOURCES +1 -1
- README.md +34 -28
- accuracy.json +336 -335
- attribute_ruler/patterns +0 -0
- config.cfg +29 -26
- meta.json +354 -346
- ner/model +0 -0
- parser/model +0 -0
- parser/moves +1 -1
- ro_core_news_lg-any-py3-none-any.whl +2 -2
- senter/cfg +1 -1
- senter/model +0 -0
- tagger/cfg +14 -6
- tagger/model +0 -0
- tok2vec/model +0 -0
- tokenizer +2 -2
- vocab/strings.json +2 -2
- vocab/vectors.cfg +3 -0
LICENSES_SOURCES
CHANGED
@@ -549,7 +549,7 @@ terms of this License.```
 
 
 
-# UD Romanian RRT v2.
+# UD Romanian RRT v2.8
 
 * Author: Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin
 * URL: https://github.com/UniversalDependencies/UD_Romanian-RRT
README.md
CHANGED
@@ -4,7 +4,7 @@ tags:
|
|
4 |
- token-classification
|
5 |
language:
|
6 |
- ro
|
7 |
-
license:
|
8 |
model-index:
|
9 |
- name: ro_core_news_lg
|
10 |
results:
|
@@ -14,47 +14,47 @@ model-index:
|
|
14 |
metrics:
|
15 |
- name: NER Precision
|
16 |
type: precision
|
17 |
-
value: 0.
|
18 |
- name: NER Recall
|
19 |
type: recall
|
20 |
-
value: 0.
|
21 |
- name: NER F Score
|
22 |
type: f_score
|
23 |
-
value: 0.
|
24 |
- task:
|
25 |
name: POS
|
26 |
type: token-classification
|
27 |
metrics:
|
28 |
- name: POS Accuracy
|
29 |
type: accuracy
|
30 |
-
value: 0.
|
31 |
- task:
|
32 |
name: SENTER
|
33 |
type: token-classification
|
34 |
metrics:
|
35 |
- name: SENTER Precision
|
36 |
type: precision
|
37 |
-
value: 0.
|
38 |
- name: SENTER Recall
|
39 |
type: recall
|
40 |
-
value: 0.
|
41 |
- name: SENTER F Score
|
42 |
type: f_score
|
43 |
-
value: 0.
|
44 |
- task:
|
45 |
name: UNLABELED_DEPENDENCIES
|
46 |
type: token-classification
|
47 |
metrics:
|
48 |
- name: Unlabeled Dependencies Accuracy
|
49 |
type: accuracy
|
50 |
-
value: 0.
|
51 |
- task:
|
52 |
name: LABELED_DEPENDENCIES
|
53 |
type: token-classification
|
54 |
metrics:
|
55 |
- name: Labeled Dependencies Accuracy
|
56 |
type: accuracy
|
57 |
-
value: 0.
|
58 |
---
|
59 |
### Details: https://spacy.io/models/ro#ro_core_news_lg
|
60 |
|
@@ -63,12 +63,12 @@ Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter
|
|
63 |
| Feature | Description |
|
64 |
| --- | --- |
|
65 |
| **Name** | `ro_core_news_lg` |
|
66 |
-
| **Version** | `3.
|
67 |
-
| **spaCy** | `>=3.
|
68 |
| **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
|
69 |
| **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
|
70 |
| **Vectors** | 500000 keys, 500000 unique vectors (300 dimensions) |
|
71 |
-
| **Sources** | [Lemmatization Lists](https://github.com/michmech/lemmatization-lists/) (Michal Měchura)<br />[UD Romanian RRT v2.
|
72 |
| **License** | `CC BY-SA 4.0` |
|
73 |
| **Author** | [Explosion](https://explosion.ai) |
|
74 |
|
@@ -76,12 +76,12 @@ Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter
|
|
76 |
|
77 |
<details>
|
78 |
|
79 |
-
<summary>View label scheme (
|
80 |
|
81 |
| Component | Labels |
|
82 |
| --- | --- |
|
83 |
-
| **`tagger`** | `ARROW`, `Af`, `Afcfp-n`, `Afcfson`, `Afcfsrn`, `Afcmpoy`, `Afcms-n`, `Afp`, `Afp-p-n`, `Afp-poy`, `Afpf--n`, `Afpfp-n`, `Afpfp-ny`, `Afpfpoy`, `Afpfpry`, `Afpfson`, `Afpfsoy`, `Afpfsrn`, `Afpfsry`, `Afpm--n`, `Afpmp-n`, `Afpmpoy`, `Afpmpry`, `Afpms-n`, `Afpmsoy`, `Afpmsry`, `Afsfp-n`, `Afsfsrn`, `BULLET`, `COLON`, `COMMA`, `Ccssp`, `Ccsspy`, `Crssp`, `Csssp`, `Cssspy`, `DASH`, `DBLQ`, `Dd3-po---e`, `Dd3-po---o`, `Dd3fpo`, `Dd3fpr`, `Dd3fpr---e`, `Dd3fpr---o`, `Dd3fpr--y`, `Dd3fso`, `Dd3fso---e`, `Dd3fsr`, `Dd3fsr---e`, `Dd3fsr---o`, `Dd3fsr--yo`, `Dd3mpo`, `Dd3mpr`, `Dd3mpr---e`, `Dd3mpr---o`, `Dd3mso---e`, `Dd3msr`, `Dd3msr---e`, `Dd3msr---o`, `Dh1ms`, `Dh3fp`, `Dh3fso`, `Dh3fsr`, `Dh3mp`, `Dh3ms`, `Di3`, `Di3-----y`, `Di3--r---e`, `Di3-po`, `Di3-po---e`, `Di3-sr`, `Di3-sr---e`, `Di3-sr--y`, `Di3fp`, `Di3fpr`, `Di3fpr---e`, `Di3fso`, `Di3fso---e`, `Di3fsr`, `Di3fsr---e`, `Di3mp`, `Di3mpr`, `Di3mpr---e`, `Di3ms`, `Di3ms----e`, `Di3mso---e`, `Di3msr`, `Di3msr---e`, `Ds1fp-p`, `Ds1fp-s`, `Ds1fsop`, `Ds1fsos`, `Ds1fsrp`, `Ds1fsrs`, `Ds1fsrs-y`, `Ds1mp-p`, `Ds1mp-s`, `Ds1ms-p`, `Ds1ms-s`, `Ds1msrs-y`, `Ds2---s`, `Ds2fp-p`, `Ds2fp-s`, `Ds2fsrp`, `Ds2fsrs`, `Ds2mp-p`, `Ds2mp-s`, `Ds2ms-p`, `Ds2ms-s`, `Ds3---p`, `Ds3---s`, `Ds3fp-s`, `Ds3fsos`, `Ds3fsrs`, `Ds3mp-s`, `Ds3ms-s`, `Dw3--r---e`, `Dw3-po---e`, `Dw3fpr`, `Dw3fso---e`, `Dw3fsr`, `Dw3mpr`, `Dw3mso---e`, `Dw3msr`, `Dz3fsr---e`, `Dz3mso---e`, `Dz3msr---e`, `EQUAL`, `EXCL`, `EXCLHELLIP`, `GE`, `GT`, `HELLIP`, `I`, `LCURL`, `LPAR`, `LSQR`, `LT`, `M`, `Mc`, `Mc-p-d`, `Mc-
|
84 |
-
| **`parser`** | `ROOT`, `acl`, `advcl`, `advcl:tcl`, `advmod`, `advmod:tmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `cc`, `cc:preconj`, `ccomp`, `ccomp:pmod`, `compound`, `conj`, `cop`, `csubj`, `csubj:pass`, `dep`, `det`, `expl`, `expl:impers`, `expl:pass`, `expl:poss`, `expl:pv`, `fixed`, `flat`, `goeswith`, `iobj`, `mark`, `nmod`, `nmod:
|
85 |
| **`senter`** | `I`, `S` |
|
86 |
| **`ner`** | `DATETIME`, `EVENT`, `FACILITY`, `GPE`, `LANGUAGE`, `LOC`, `MONEY`, `NAT_REL_POL`, `NUMERIC_VALUE`, `ORDINAL`, `ORGANIZATION`, `PERIOD`, `PERSON`, `PRODUCT`, `QUANTITY`, `WORK_OF_ART` |
|
87 |
|
@@ -92,15 +92,21 @@ Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter
|
|
92 |
| Type | Score |
|
93 |
| --- | --- |
|
94 |
| `TOKEN_ACC` | 99.90 |
|
95 |
-
| `
|
96 |
-
| `
|
97 |
-
| `
|
98 |
-
| `
|
99 |
-
| `
|
100 |
-
| `
|
101 |
-
| `
|
102 |
-
| `
|
103 |
-
| `
|
104 |
-
| `
|
105 |
-
| `
|
106 |
-
| `
|
4 |
- token-classification
|
5 |
language:
|
6 |
- ro
|
7 |
+
license: cc-by-sa-4.0
|
8 |
model-index:
|
9 |
- name: ro_core_news_lg
|
10 |
results:
|
|
|
14 |
metrics:
|
15 |
- name: NER Precision
|
16 |
type: precision
|
17 |
+
value: 0.7550713749
|
18 |
- name: NER Recall
|
19 |
type: recall
|
20 |
+
value: 0.7721859393
|
21 |
- name: NER F Score
|
22 |
type: f_score
|
23 |
+
value: 0.7635327635
|
24 |
- task:
|
25 |
name: POS
|
26 |
type: token-classification
|
27 |
metrics:
|
28 |
- name: POS Accuracy
|
29 |
type: accuracy
|
30 |
+
value: 0.9664291788
|
31 |
- task:
|
32 |
name: SENTER
|
33 |
type: token-classification
|
34 |
metrics:
|
35 |
- name: SENTER Precision
|
36 |
type: precision
|
37 |
+
value: 0.954787234
|
38 |
- name: SENTER Recall
|
39 |
type: recall
|
40 |
+
value: 0.954787234
|
41 |
- name: SENTER F Score
|
42 |
type: f_score
|
43 |
+
value: 0.954787234
|
44 |
- task:
|
45 |
name: UNLABELED_DEPENDENCIES
|
46 |
type: token-classification
|
47 |
metrics:
|
48 |
- name: Unlabeled Dependencies Accuracy
|
49 |
type: accuracy
|
50 |
+
value: 0.8897462438
|
51 |
- task:
|
52 |
name: LABELED_DEPENDENCIES
|
53 |
type: token-classification
|
54 |
metrics:
|
55 |
- name: Labeled Dependencies Accuracy
|
56 |
type: accuracy
|
57 |
+
value: 0.8897462438
|
58 |
---
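As a quick sanity check on the metric values added in this hunk, each F score should be the harmonic mean of its precision and recall. The sketch below assumes that standard F1 definition (the diff does not state how `f_score` was computed), and the helper name `f1` is hypothetical.

```python
def f1(precision, recall):
    """Harmonic mean of precision and recall (standard F1; an assumption here)."""
    if precision + recall == 0:
        # Matches the convention used for labels reported with p = r = f = 0.0.
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values copied from the model-index block in this diff.
ner = {"p": 0.7550713749, "r": 0.7721859393, "f": 0.7635327635}
senter = {"p": 0.954787234, "r": 0.954787234, "f": 0.954787234}

for name, metrics in (("NER", ner), ("SENTER", senter)):
    recomputed = f1(metrics["p"], metrics["r"])
    print(f"{name}: reported={metrics['f']:.6f} recomputed={recomputed:.6f}")
```

Under that assumption, both reported F scores agree with the recomputed values to the precision shown.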
|
59 |
### Details: https://spacy.io/models/ro#ro_core_news_lg
|
60 |
|
|
|
63 |
| Feature | Description |
|
64 |
| --- | --- |
|
65 |
| **Name** | `ro_core_news_lg` |
|
66 |
+
| **Version** | `3.2.0` |
|
67 |
+
| **spaCy** | `>=3.2.0,<3.3.0` |
|
68 |
| **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
|
69 |
| **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
|
70 |
| **Vectors** | 500000 keys, 500000 unique vectors (300 dimensions) |
|
71 |
+
| **Sources** | [Lemmatization Lists](https://github.com/michmech/lemmatization-lists/) (Michal Měchura)<br />[UD Romanian RRT v2.8](https://github.com/UniversalDependencies/UD_Romanian-RRT) (Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin)<br />[RONEC - the Romanian Named Entity Corpus (ca9ce460)](https://github.com/dumitrescustefan/ronec) (Dumitrescu, Stefan Daniel; Avram, Andrei-Marius; Morogan, Luciana; Toma; Stefan)<br />[Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)](https://spacy.io) (Explosion) |
|
72 |
| **License** | `CC BY-SA 4.0` |
|
73 |
| **Author** | [Explosion](https://explosion.ai) |
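The **spaCy** row above pins the pipeline to the range `>=3.2.0,<3.3.0`. The following is a minimal sketch of what that range means for an installed version. It is a simplified checker with hypothetical helper names (`parse_version`, `satisfies`) that only handles plain numeric versions; it is not how spaCy or pip evaluate the constraint, and a PEP 440-aware library such as `packaging` would be a better fit in practice.

```python
import operator

# Comparison operators supported by this simplified checker.
# Two-character symbols come first so ">=" is not mistaken for ">".
OPERATORS = [
    (">=", operator.ge),
    ("<=", operator.le),
    ("==", operator.eq),
    (">", operator.gt),
    ("<", operator.lt),
]


def parse_version(text):
    """Turn '3.2.0' into (3, 2, 0); only plain numeric releases are handled."""
    return tuple(int(part) for part in text.strip().split("."))


def satisfies(version, spec):
    """Return True if `version` meets every comma-separated clause in `spec`."""
    candidate = parse_version(version)
    for clause in spec.split(","):
        clause = clause.strip()
        for symbol, compare in OPERATORS:
            if clause.startswith(symbol):
                if not compare(candidate, parse_version(clause[len(symbol):])):
                    return False
                break
        else:
            raise ValueError(f"unsupported clause: {clause!r}")
    return True


SPACY_RANGE = ">=3.2.0,<3.3.0"  # copied from the README table in this diff
for installed in ("3.1.4", "3.2.0", "3.2.4", "3.3.0"):
    print(installed, satisfies(installed, SPACY_RANGE))
```

Only the 3.2.x versions pass, which is consistent with the pipeline version `3.2.0` listed in the same table.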
|
74 |
|
|
|
76 |
|
77 |
<details>
|
78 |
|
79 |
+
<summary>View label scheme (541 labels for 4 components)</summary>
|
80 |
|
81 |
| Component | Labels |
|
82 |
| --- | --- |
|
83 |
+
| **`tagger`** | `ARROW`, `Af`, `Afcfp-n`, `Afcfson`, `Afcfsrn`, `Afcmpoy`, `Afcms-n`, `Afp`, `Afp-p-n`, `Afp-poy`, `Afp-srn`, `Afpf--n`, `Afpfp-n`, `Afpfp-ny`, `Afpfpoy`, `Afpfpry`, `Afpfson`, `Afpfsoy`, `Afpfsrn`, `Afpfsry`, `Afpm--n`, `Afpmp-n`, `Afpmpoy`, `Afpmpry`, `Afpms-n`, `Afpmsoy`, `Afpmsry`, `Afsfp-n`, `Afsfsrn`, `BULLET`, `COLON`, `COMMA`, `Ccssp`, `Ccsspy`, `Crssp`, `Csssp`, `Cssspy`, `DASH`, `DBLQ`, `Dd3-po---e`, `Dd3-po---o`, `Dd3fpo`, `Dd3fpr`, `Dd3fpr---e`, `Dd3fpr---o`, `Dd3fpr--y`, `Dd3fso`, `Dd3fso---e`, `Dd3fsr`, `Dd3fsr---e`, `Dd3fsr---o`, `Dd3fsr--yo`, `Dd3mpo`, `Dd3mpr`, `Dd3mpr---e`, `Dd3mpr---o`, `Dd3mso---e`, `Dd3msr`, `Dd3msr---e`, `Dd3msr---o`, `Dh1ms`, `Dh3fp`, `Dh3fso`, `Dh3fsr`, `Dh3mp`, `Dh3ms`, `Di3`, `Di3-----y`, `Di3--r---e`, `Di3-po`, `Di3-po---e`, `Di3-sr`, `Di3-sr---e`, `Di3-sr--y`, `Di3fp`, `Di3fpr`, `Di3fpr---e`, `Di3fso`, `Di3fso---e`, `Di3fsr`, `Di3fsr---e`, `Di3mp`, `Di3mpr`, `Di3mpr---e`, `Di3ms`, `Di3ms----e`, `Di3mso---e`, `Di3msr`, `Di3msr---e`, `Ds1fp-p`, `Ds1fp-s`, `Ds1fsop`, `Ds1fsos`, `Ds1fsrp`, `Ds1fsrs`, `Ds1fsrs-y`, `Ds1mp-p`, `Ds1mp-s`, `Ds1ms-p`, `Ds1ms-s`, `Ds1msrs-y`, `Ds2---s`, `Ds2fp-p`, `Ds2fp-s`, `Ds2fsrp`, `Ds2fsrs`, `Ds2mp-p`, `Ds2mp-s`, `Ds2ms-p`, `Ds2ms-s`, `Ds3---p`, `Ds3---s`, `Ds3---sy`, `Ds3fp-s`, `Ds3fsos`, `Ds3fsrs`, `Ds3mp-s`, `Ds3ms-s`, `Dw3--r---e`, `Dw3-po---e`, `Dw3fpr`, `Dw3fso---e`, `Dw3fsr`, `Dw3mpr`, `Dw3mso---e`, `Dw3msr`, `Dz3fsr---e`, `Dz3mso---e`, `Dz3msr---e`, `EQUAL`, `EXCL`, `EXCLHELLIP`, `GE`, `GT`, `HELLIP`, `I`, `LCURL`, `LPAR`, `LSQR`, `LT`, `M`, `Mc-p-d`, `Mc-p-l`, `Mc-s-b`, `Mc-s-d`, `Mc-s-l`, `Mcfp-l`, `Mcfp-ln`, `Mcfprln`, `Mcfprly`, `Mcfsoln`, `Mcfsrl`, `Mcfsrln`, `Mcfsrly`, `Mcmp-l`, `Mcms-ln`, `Mcmsrl`, `Mcmsrln`, `Mcmsrly`, `Mffprln`, `Mffsrln`, `Mlfpo`, `Mlfpr`, `Mlmpr`, `Mo---l`, `Mo---ln`, `Mo-s-r`, `Mofp-ln`, `Mofpoly`, `Mofprly`, `Mofs-l`, `Mofsoln`, `Mofsoly`, `Mofsrln`, `Mofsrly`, `Mompoly`, `Momprly`, `Moms-l`, `Moms-ln`, `Momsoly`, `Momsrly`, `Nc`, 
`Nc---n`, `Ncf--n`, `Ncfp-n`, `Ncfpoy`, `Ncfpry`, `Ncfs-n`, `Ncfson`, `Ncfsoy`, `Ncfsrn`, `Ncfsry`, `Ncfsryy`, `Ncfsvy`, `Ncm--n`, `Ncmp-n`, `Ncmpoy`, `Ncmpry`, `Ncms-n`, `Ncms-ny`, `Ncms-y`, `Ncmsoy`, `Ncmsrn`, `Ncmsry`, `Ncmsryy`, `Ncmsvn`, `Ncmsvy`, `Np`, `Npfson`, `Npfsoy`, `Npfsrn`, `Npfsry`, `Npmpoy`, `Npmpry`, `Npms-n`, `Npmsoy`, `Npmsry`, `PERCENT`, `PERIOD`, `PLUS`, `PLUSMINUS`, `Pd3-po`, `Pd3fpr`, `Pd3fso`, `Pd3fsr`, `Pd3mpo`, `Pd3mpr`, `Pd3mpr--y`, `Pd3mso`, `Pd3msr`, `Pi3--r`, `Pi3-po`, `Pi3-so`, `Pi3-sr`, `Pi3fpr`, `Pi3fso`, `Pi3fsr`, `Pi3mpr`, `Pi3mso`, `Pi3msr`, `Pi3msr--y`, `Pp1-pa--------w`, `Pp1-pa--y-----w`, `Pp1-pd--------s`, `Pp1-pd--------w`, `Pp1-pd--y-----w`, `Pp1-pr--------s`, `Pp1-sa--------s`, `Pp1-sa--------w`, `Pp1-sa--y-----w`, `Pp1-sd--------s`, `Pp1-sd--------w`, `Pp1-sd--y-----w`, `Pp1-sn--------s`, `Pp2-----------s`, `Pp2-pa--------w`, `Pp2-pa--y-----w`, `Pp2-pd--------w`, `Pp2-pd--y-----w`, `Pp2-pr--------s`, `Pp2-sa--------s`, `Pp2-sa--------w`, `Pp2-sa--y-----w`, `Pp2-sd--------s`, `Pp2-sd--------w`, `Pp2-sd--y-----w`, `Pp2-sn--------s`, `Pp2-so--------s`, `Pp2-sr--------s`, `Pp3-p---------s`, `Pp3-pd--------w`, `Pp3-pd--y-----w`, `Pp3-po--------s`, `Pp3-sd--------w`, `Pp3-sd--y-----w`, `Pp3-so--------s`, `Pp3fpa--------w`, `Pp3fpa--y-----w`, `Pp3fpr--------s`, `Pp3fs---------s`, `Pp3fsa--------w`, `Pp3fsa--y-----w`, `Pp3fso--------s`, `Pp3fsr--------s`, `Pp3fsr--y-----s`, `Pp3mpa--------w`, `Pp3mpa--y-----w`, `Pp3mpr--------s`, `Pp3ms---------s`, `Pp3msa--------w`, `Pp3msa--y-----w`, `Pp3mso--------s`, `Pp3msr--------s`, `Pp3msr--y-----s`, `Ps1fp-s`, `Ps1fsrp`, `Ps1fsrs`, `Ps1mp-p`, `Ps1ms-p`, `Ps2fp-s`, `Ps2fsrp`, `Ps2fsrs`, `Ps3---p`, `Ps3---s`, `Ps3fp-s`, `Ps3fsrs`, `Ps3mp-s`, `Ps3ms-s`, `Pw3--r`, `Pw3-po`, `Pw3-so`, `Pw3fpr`, `Pw3fso`, `Pw3mpr`, `Pw3mso`, `Px3--a--------s`, `Px3--a--------w`, `Px3--a--y-----w`, `Px3--d--------w`, `Px3--d--y-----w`, `Pz3-sr`, `Pz3fsr`, `QUEST`, `QUOT`, `Qf`, `Qn`, `Qs`, `Qs-y`, `Qz`, `Qz-y`, 
`RCURL`, `RPAR`, `RSQR`, `Rc`, `Rgp`, `Rgpy`, `Rgs`, `Rp`, `Rw`, `Rw-y`, `Rz`, `SCOLON`, `SLASH`, `STAR`, `Sp`, `Spsa`, `Spsay`, `Spsd`, `Spsg`, `Td-po`, `Tdfpr`, `Tdfso`, `Tdfsr`, `Tdmpr`, `Tdmso`, `Tdmsr`, `Tf-so`, `Tffpoy`, `Tffpry`, `Tffs-y`, `Tfmpoy`, `Tfms-y`, `Tfmsoy`, `Tfmsry`, `Ti-po`, `Tifp-y`, `Tifso`, `Tifsr`, `Timso`, `Timsr`, `Tsfp`, `Tsfs`, `Tsmp`, `Tsms`, `UNDERSC`, `Va--1`, `Va--1-----y`, `Va--1p`, `Va--1s`, `Va--1s----y`, `Va--2p`, `Va--2p----y`, `Va--2s`, `Va--2s----y`, `Va--3`, `Va--3-----y`, `Va--3p`, `Va--3p----y`, `Va--3s`, `Va--3s----y`, `Vag`, `Vag-------y`, `Vaii1`, `Vaii2s`, `Vaii3p`, `Vaii3s`, `Vail3p`, `Vail3s`, `Vaip1p`, `Vaip1s`, `Vaip2p`, `Vaip2s`, `Vaip3p`, `Vaip3p----y`, `Vaip3s`, `Vaip3s----y`, `Vais3p`, `Vais3s`, `Vam-2s`, `Vanp`, `Vap--sm`, `Vasp1p`, `Vasp1s`, `Vasp2p`, `Vasp2s`, `Vasp3`, `Vmg`, `Vmg-------y`, `Vmii1`, `Vmii1-----y`, `Vmii2p`, `Vmii2s`, `Vmii3p`, `Vmii3p----y`, `Vmii3s`, `Vmii3s----y`, `Vmil1`, `Vmil1p`, `Vmil2s`, `Vmil3p`, `Vmil3p----y`, `Vmil3s`, `Vmil3s----y`, `Vmip1p`, `Vmip1p----y`, `Vmip1s`, `Vmip1s----y`, `Vmip2p`, `Vmip2s`, `Vmip2s----y`, `Vmip3`, `Vmip3-----y`, `Vmip3p`, `Vmip3s`, `Vmip3s----y`, `Vmis1p`, `Vmis1s`, `Vmis3p`, `Vmis3p----y`, `Vmis3s`, `Vmis3s----y`, `Vmm-2p`, `Vmm-2s`, `Vmnp`, `Vmnp------y`, `Vmp--pf`, `Vmp--pm`, `Vmp--sf`, `Vmp--sm`, `Vmp--sm---y`, `Vmsp1p`, `Vmsp2p`, `Vmsp2s`, `Vmsp3`, `Vmsp3-----y`, `X`, `Y`, `Ya`, `Yn`, `Ynfsoy`, `Ynfsry`, `Ynmsoy`, `Ynmsry`, `Yp`, `Yp,Yn`, `Yp-sr`, `Yr` |
|
84 |
+
| **`parser`** | `ROOT`, `acl`, `advcl`, `advcl:tcl`, `advmod`, `advmod:tmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `cc`, `cc:preconj`, `ccomp`, `ccomp:pmod`, `compound`, `conj`, `cop`, `csubj`, `csubj:pass`, `dep`, `det`, `expl`, `expl:impers`, `expl:pass`, `expl:poss`, `expl:pv`, `fixed`, `flat`, `goeswith`, `iobj`, `mark`, `nmod`, `nmod:tmod`, `nsubj`, `nsubj:pass`, `nummod`, `obj`, `obl`, `obl:agent`, `obl:pmod`, `orphan`, `parataxis`, `punct`, `vocative`, `xcomp` |
|
85 |
| **`senter`** | `I`, `S` |
|
86 |
| **`ner`** | `DATETIME`, `EVENT`, `FACILITY`, `GPE`, `LANGUAGE`, `LOC`, `MONEY`, `NAT_REL_POL`, `NUMERIC_VALUE`, `ORDINAL`, `ORGANIZATION`, `PERIOD`, `PERSON`, `PRODUCT`, `QUANTITY`, `WORK_OF_ART` |
|
87 |
|
|
|
92 |
| Type | Score |
|
93 |
| --- | --- |
|
94 |
| `TOKEN_ACC` | 99.90 |
|
95 |
+
| `TOKEN_P` | 99.67 |
|
96 |
+
| `TOKEN_R` | 99.57 |
|
97 |
+
| `TOKEN_F` | 99.59 |
|
98 |
+
| `TAG_ACC` | 96.64 |
|
99 |
+
| `SENTS_P` | 95.48 |
|
100 |
+
| `SENTS_R` | 95.48 |
|
101 |
+
| `SENTS_F` | 95.48 |
|
102 |
+
| `DEP_UAS` | 88.97 |
|
103 |
+
| `DEP_LAS` | 83.90 |
|
104 |
+
| `POS_ACC` | 94.06 |
|
105 |
+
| `MORPH_ACC` | 95.11 |
|
106 |
+
| `MORPH_MICRO_P` | 98.96 |
|
107 |
+
| `MORPH_MICRO_R` | 95.82 |
|
108 |
+
| `MORPH_MICRO_F` | 97.07 |
|
109 |
+
| `LEMMA_ACC` | 81.83 |
|
110 |
+
| `ENTS_P` | 75.51 |
|
111 |
+
| `ENTS_R` | 77.22 |
|
112 |
+
| `ENTS_F` | 76.35 |
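The scores in this README table appear to be the `accuracy.json` values from the same commit, scaled to percentages and rounded to two decimals. The sketch below checks that assumption on a handful of values copied from this diff; the helper `as_percent` and the dictionary names are hypothetical. It also compares the front-matter value for "Labeled Dependencies Accuracy" with `dep_uas` and `dep_las`.

```python
# Values copied from accuracy.json and the README table as shown in this diff.
accuracy = {
    "token_acc": 0.9990029326,
    "tag_acc": 0.9664291788,
    "dep_uas": 0.8897462438,
    "dep_las": 0.8389686971,
    "lemma_acc": 0.8183070924,
    "ents_f": 0.7635327635,
}
readme_table = {
    "TOKEN_ACC": "99.90",
    "TAG_ACC": "96.64",
    "DEP_UAS": "88.97",
    "DEP_LAS": "83.90",
    "LEMMA_ACC": "81.83",
    "ENTS_F": "76.35",
}


def as_percent(score):
    """Format a 0-1 score as a percentage string with two decimals."""
    return f"{score * 100:.2f}"


# Collect any key whose formatted value differs from the README table entry.
mismatches = {
    key: (as_percent(value), readme_table[key.upper()])
    for key, value in accuracy.items()
    if as_percent(value) != readme_table[key.upper()]
}
print("table mismatches:", mismatches)

# Front-matter value listed under "Labeled Dependencies Accuracy" in this diff.
front_matter_las = 0.8897462438
print("equals dep_uas:", front_matter_las == accuracy["dep_uas"])
print("equals dep_las:", front_matter_las == accuracy["dep_las"])
```

For the sampled keys the table and `accuracy.json` agree. The front-matter labeled-dependencies value, however, equals `dep_uas` rather than `dep_las`, so readers comparing LAS figures may prefer the `DEP_LAS` row (83.90).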
|
accuracy.json
CHANGED
@@ -1,447 +1,448 @@
|
|
1 |
{
|
2 |
"token_acc": 0.9990029326,
|
3 |
-
"
|
4 |
-
"
|
5 |
-
"
|
6 |
-
"
|
7 |
-
"
|
8 |
-
"
|
9 |
-
"
|
10 |
-
"
|
11 |
-
"
|
12 |
-
"
|
13 |
-
|
14 |
-
|
15 |
-
|
16 |
-
|
17 |
-
"AdpType": {
|
18 |
-
"p": 0.9970784641,
|
19 |
-
"r": 0.9941739492,
|
20 |
-
"f": 0.9956240884
|
21 |
},
|
22 |
-
"
|
23 |
-
"p": 0.
|
24 |
-
"r": 0.
|
25 |
-
"f": 0.
|
26 |
},
|
27 |
-
"
|
28 |
-
"p": 0.
|
29 |
-
"r": 0.
|
30 |
-
"f": 0.
|
31 |
},
|
32 |
-
"
|
33 |
-
"p": 0.
|
34 |
-
"r": 0.
|
35 |
-
"f": 0.
|
36 |
},
|
37 |
-
"
|
38 |
-
"p": 0.
|
39 |
-
"r": 0.
|
40 |
-
"f": 0.
|
41 |
},
|
42 |
-
"
|
43 |
-
"p": 0.
|
44 |
-
"r": 0.
|
45 |
-
"f": 0.
|
46 |
},
|
47 |
-
"
|
48 |
-
"p": 0.
|
49 |
-
"r": 0.
|
50 |
-
"f": 0.
|
51 |
},
|
52 |
-
"
|
53 |
-
"p": 0.
|
54 |
-
"r": 0.
|
55 |
-
"f": 0.
|
56 |
},
|
57 |
-
"
|
58 |
-
"p": 0.
|
59 |
-
"r": 0.
|
60 |
-
"f": 0.
|
61 |
},
|
62 |
-
"
|
63 |
-
"p": 0.
|
64 |
-
"r": 0.
|
65 |
-
"f": 0.
|
66 |
},
|
67 |
-
"
|
68 |
-
"p": 0.
|
69 |
-
"r": 0.
|
70 |
-
"f": 0.
|
71 |
},
|
72 |
-
"
|
73 |
-
"p": 0.
|
74 |
-
"r": 0.
|
75 |
-
"f": 0.
|
76 |
},
|
77 |
-
"
|
78 |
-
"p": 0.
|
79 |
-
"r": 0.
|
80 |
-
"f": 0.
|
81 |
},
|
82 |
-
"
|
83 |
-
"p": 0.
|
84 |
-
"r": 0.
|
85 |
-
"f": 0.
|
86 |
},
|
87 |
-
"
|
88 |
-
"p": 0.
|
89 |
-
"r": 0.
|
90 |
-
"f": 0.
|
91 |
},
|
92 |
-
"
|
93 |
-
"p": 0.
|
94 |
-
"r": 0.
|
95 |
-
"f": 0.
|
96 |
},
|
97 |
-
"
|
98 |
-
"p": 0.
|
99 |
-
"r": 0.
|
100 |
-
"f": 0.
|
101 |
},
|
102 |
-
"
|
103 |
-
"p": 0.
|
104 |
-
"r": 0.
|
105 |
-
"f": 0.
|
106 |
},
|
107 |
-
"
|
108 |
-
"p": 0.
|
109 |
-
"r": 0.
|
110 |
-
"f": 0.
|
111 |
},
|
112 |
-
"
|
113 |
-
"p": 0.
|
114 |
-
"r": 0.
|
115 |
-
"f": 0.
|
116 |
},
|
117 |
-
"
|
118 |
-
"p": 0.
|
119 |
-
"r": 0.
|
120 |
-
"f": 0.
|
121 |
},
|
122 |
-
"
|
123 |
-
"p": 0.
|
124 |
-
"r": 0.
|
125 |
-
"f": 0.
|
126 |
},
|
127 |
-
"
|
128 |
"p": 0.0,
|
129 |
"r": 0.0,
|
130 |
"f": 0.0
|
131 |
-
}
|
132 |
-
},
|
133 |
-
"dep_las_per_type": {
|
134 |
-
"case": {
|
135 |
-
"p": 0.9257307139,
|
136 |
-
"r": 0.9415204678,
|
137 |
-
"f": 0.9335588306
|
138 |
},
|
139 |
-
"
|
140 |
-
"p": 0.
|
141 |
-
"r": 0.
|
142 |
-
"f": 0.
|
143 |
-
},
|
144 |
-
"nmod:tmod": {
|
145 |
-
"p": 0.4,
|
146 |
-
"r": 0.0465116279,
|
147 |
-
"f": 0.0833333333
|
148 |
-
},
|
149 |
-
"amod": {
|
150 |
-
"p": 0.8639212175,
|
151 |
-
"r": 0.8756805808,
|
152 |
-
"f": 0.8697611537
|
153 |
-
},
|
154 |
-
"cc": {
|
155 |
-
"p": 0.8669354839,
|
156 |
-
"r": 0.89958159,
|
157 |
-
"f": 0.8829568789
|
158 |
-
},
|
159 |
-
"conj": {
|
160 |
-
"p": 0.5984962406,
|
161 |
-
"r": 0.6012084592,
|
162 |
-
"f": 0.5998492841
|
163 |
-
},
|
164 |
-
"nmod": {
|
165 |
-
"p": 0.7883565797,
|
166 |
-
"r": 0.8217446271,
|
167 |
-
"f": 0.8047044259
|
168 |
-
},
|
169 |
-
"mark": {
|
170 |
-
"p": 0.8857142857,
|
171 |
-
"r": 0.9056179775,
|
172 |
-
"f": 0.8955555556
|
173 |
-
},
|
174 |
-
"fixed": {
|
175 |
-
"p": 0.8689217759,
|
176 |
-
"r": 0.7172774869,
|
177 |
-
"f": 0.7858508604
|
178 |
-
},
|
179 |
-
"nsubj": {
|
180 |
-
"p": 0.8333333333,
|
181 |
-
"r": 0.7814485388,
|
182 |
-
"f": 0.806557377
|
183 |
},
|
184 |
-
"
|
185 |
"p": 0.0,
|
186 |
"r": 0.0,
|
187 |
"f": 0.0
|
188 |
},
|
189 |
-
"
|
190 |
-
"p": 0.
|
191 |
-
"r": 0.
|
192 |
-
"f": 0.
|
193 |
-
},
|
194 |
-
"nummod": {
|
195 |
-
"p": 0.8703703704,
|
196 |
-
"r": 0.8676923077,
|
197 |
-
"f": 0.8690292758
|
198 |
},
|
199 |
"flat": {
|
200 |
-
"p": 0.
|
201 |
-
"r": 0.
|
202 |
-
"f": 0.
|
203 |
},
|
204 |
-
"
|
205 |
-
"p": 0.
|
206 |
-
"r": 0.
|
207 |
-
"f": 0.
|
208 |
},
|
209 |
-
"
|
210 |
-
"p": 0.
|
211 |
-
"r": 0.
|
212 |
-
"f": 0.
|
213 |
},
|
214 |
-
"
|
215 |
-
"p": 0.
|
216 |
-
"r": 0.
|
217 |
-
"f": 0.
|
218 |
},
|
219 |
-
"
|
220 |
-
"p": 0.
|
221 |
-
"r": 0.
|
222 |
-
"f": 0.
|
223 |
},
|
224 |
"expl:pv": {
|
225 |
-
"p": 0.
|
226 |
-
"r": 0.
|
227 |
-
"f": 0.
|
228 |
-
},
|
229 |
-
"root": {
|
230 |
-
"p": 0.917222964,
|
231 |
-
"r": 0.9135638298,
|
232 |
-
"f": 0.9153897402
|
233 |
-
},
|
234 |
-
"advcl": {
|
235 |
-
"p": 0.5625,
|
236 |
-
"r": 0.5853658537,
|
237 |
-
"f": 0.5737051793
|
238 |
},
|
239 |
-
"
|
240 |
-
"p": 0.
|
241 |
-
"r": 0.
|
242 |
-
"f": 0.
|
243 |
-
},
|
244 |
-
"ccomp": {
|
245 |
-
"p": 0.7178217822,
|
246 |
-
"r": 0.8146067416,
|
247 |
-
"f": 0.7631578947
|
248 |
-
},
|
249 |
-
"goeswith": {
|
250 |
-
"p": 0.875,
|
251 |
-
"r": 0.5833333333,
|
252 |
-
"f": 0.7
|
253 |
},
|
254 |
-
"
|
255 |
-
"p": 0.
|
256 |
-
"r": 0.
|
257 |
-
"f": 0.
|
258 |
},
|
259 |
"expl:poss": {
|
260 |
-
"p": 0.
|
261 |
-
"r": 0.
|
262 |
-
"f": 0.
|
263 |
},
|
264 |
-
"
|
265 |
-
"p": 0.7647058824,
|
266 |
-
"r": 0.8024691358,
|
267 |
-
"f": 0.7831325301
|
268 |
-
},
|
269 |
-
"cc:preconj": {
|
270 |
"p": 0.0,
|
271 |
"r": 0.0,
|
272 |
"f": 0.0
|
273 |
},
|
274 |
-
"aux": {
|
275 |
-
"p": 0.9716713881,
|
276 |
-
"r": 0.9122340426,
|
277 |
-
"f": 0.9410150892
|
278 |
-
},
|
279 |
-
"expl": {
|
280 |
-
"p": 0.5294117647,
|
281 |
-
"r": 0.4186046512,
|
282 |
-
"f": 0.4675324675
|
283 |
-
},
|
284 |
-
"appos": {
|
285 |
-
"p": 0.4347826087,
|
286 |
-
"r": 0.396039604,
|
287 |
-
"f": 0.414507772
|
288 |
-
},
|
289 |
"xcomp": {
|
290 |
-
"p": 0.
|
291 |
-
"r": 0.
|
292 |
-
"f": 0.
|
293 |
-
},
|
294 |
-
"csubj": {
|
295 |
-
"p": 0.7966101695,
|
296 |
-
"r": 0.746031746,
|
297 |
-
"f": 0.7704918033
|
298 |
-
},
|
299 |
-
"nmod:agent": {
|
300 |
-
"p": 0.7285714286,
|
301 |
-
"r": 0.7846153846,
|
302 |
-
"f": 0.7555555556
|
303 |
-
},
|
304 |
-
"aux:pass": {
|
305 |
-
"p": 0.7769784173,
|
306 |
-
"r": 0.9,
|
307 |
-
"f": 0.833976834
|
308 |
},
|
309 |
-
"
|
310 |
"p": 0.0,
|
311 |
"r": 0.0,
|
312 |
"f": 0.0
|
313 |
},
|
314 |
-
"
|
315 |
-
"p": 0
|
316 |
-
"r": 0.
|
317 |
-
"f": 0.
|
318 |
},
|
319 |
-
"
|
320 |
"p": 0.0,
|
321 |
"r": 0.0,
|
322 |
"f": 0.0
|
323 |
},
|
324 |
-
"expl:pass": {
|
325 |
-
"p": 0.6734693878,
|
326 |
-
"r": 0.7252747253,
|
327 |
-
"f": 0.6984126984
|
328 |
-
},
|
329 |
-
"ccomp:pmod": {
|
330 |
-
"p": 0.4,
|
331 |
-
"r": 0.2666666667,
|
332 |
-
"f": 0.32
|
333 |
-
},
|
334 |
"compound": {
|
335 |
-
"p": 0.
|
336 |
-
"r": 0.
|
337 |
-
"f": 0.
|
338 |
},
|
339 |
-
"
|
340 |
"p": 0.0,
|
341 |
"r": 0.0,
|
342 |
"f": 0.0
|
343 |
},
|
344 |
-
"
|
345 |
-
"p": 0.3333333333,
|
346 |
-
"r": 0.1,
|
347 |
-
"f": 0.1538461538
|
348 |
-
},
|
349 |
-
"csubj:pass": {
|
350 |
"p": 0.25,
|
351 |
"r": 0.3333333333,
|
352 |
"f": 0.2857142857
|
353 |
},
|
354 |
-
"
|
355 |
"p": 0.0,
|
356 |
"r": 0.0,
|
357 |
"f": 0.0
|
358 |
},
|
359 |
-
"
|
360 |
"p": 0.0,
|
361 |
"r": 0.0,
|
362 |
"f": 0.0
|
363 |
}
|
364 |
},
|
365 |
"ents_per_type": {
|
366 |
"DATETIME": {
|
367 |
-
"p": 0.
|
368 |
-
"r": 0.
|
369 |
-
"f": 0.
|
370 |
},
|
371 |
"ORGANIZATION": {
|
372 |
-
"p": 0.
|
373 |
-
"r": 0.
|
374 |
-
"f": 0.
|
375 |
},
|
376 |
"FACILITY": {
|
377 |
-
"p": 0.
|
378 |
-
"r": 0.
|
379 |
-
"f": 0.
|
380 |
},
|
381 |
"NUMERIC_VALUE": {
|
382 |
-
"p": 0.
|
383 |
-
"r": 0.
|
384 |
-
"f": 0.
|
385 |
},
|
386 |
"ORDINAL": {
|
387 |
-
"p": 0.
|
388 |
"r": 0.8363636364,
|
389 |
-
"f": 0.
|
390 |
},
|
391 |
"EVENT": {
|
392 |
-
"p": 0.
|
393 |
-
"r": 0.
|
394 |
-
"f": 0.
|
395 |
},
|
396 |
"GPE": {
|
397 |
-
"p": 0.
|
398 |
-
"r": 0.
|
399 |
-
"f": 0.
|
400 |
},
|
401 |
"PERSON": {
|
402 |
-
"p": 0.
|
403 |
-
"r": 0.
|
404 |
-
"f": 0.
|
405 |
},
|
406 |
"NAT_REL_POL": {
|
407 |
-
"p": 0.
|
408 |
"r": 0.9066666667,
|
409 |
-
"f": 0.
|
410 |
},
|
411 |
"MONEY": {
|
412 |
-
"p": 0.
|
413 |
-
"r": 0.
|
414 |
-
"f": 0.
|
415 |
-
},
|
416 |
-
"PRODUCT": {
|
417 |
-
"p": 0.6260162602,
|
418 |
-
"r": 0.5620437956,
|
419 |
-
"f": 0.5923076923
|
420 |
},
|
421 |
"LOC": {
|
422 |
-
"p": 0.
|
423 |
-
"r": 0.
|
424 |
-
"f": 0.
|
425 |
},
|
426 |
"WORK_OF_ART": {
|
427 |
-
"p": 0.
|
428 |
-
"r": 0.
|
429 |
-
"f": 0.
|
430 |
},
|
431 |
"QUANTITY": {
|
432 |
-
"p": 0.
|
433 |
-
"r": 0
|
434 |
-
"f": 0.
|
435 |
-
},
|
436 |
-
"PERIOD": {
|
437 |
-
"p": 0.9428571429,
|
438 |
-
"r": 0.7857142857,
|
439 |
-
"f": 0.8571428571
|
440 |
},
|
441 |
"LANGUAGE": {
|
442 |
-
"p": 0.
|
443 |
-
"r": 0
|
444 |
-
"f": 0.
|
445 |
}
|
446 |
-
}
|
|
|
447 |
}
|
|
|
1 |
{
|
2 |
"token_acc": 0.9990029326,
|
3 |
+
"token_p": 0.9967350492,
|
4 |
+
"token_r": 0.9957244934,
|
5 |
+
"token_f": 0.9959492157,
|
6 |
+
"tag_acc": 0.9664291788,
|
7 |
+
"sents_p": 0.954787234,
|
8 |
+
"sents_r": 0.954787234,
|
9 |
+
"sents_f": 0.954787234,
|
10 |
+
"dep_uas": 0.8897462438,
|
11 |
+
"dep_las": 0.8389686971,
|
12 |
+
"dep_las_per_type": {
|
13 |
+
"root": {
|
14 |
+
"p": 0.8786231884,
|
15 |
+
"r": 0.9133709981,
|
16 |
+
"f": 0.8956602031
|
17 |
},
|
18 |
+
"mark": {
|
19 |
+
"p": 0.9288389513,
|
20 |
+
"r": 0.9358490566,
|
21 |
+
"f": 0.9323308271
|
22 |
},
|
23 |
+
"case": {
|
24 |
+
"p": 0.9638554217,
|
25 |
+
"r": 0.959880015,
|
26 |
+
"f": 0.9618636107
|
27 |
},
|
28 |
+
"nmod:tmod": {
|
29 |
+
"p": 0.6842105263,
|
30 |
+
"r": 0.1092436975,
|
31 |
+
"f": 0.1884057971
|
32 |
},
|
33 |
+
"amod": {
|
34 |
+
"p": 0.9172297297,
|
35 |
+
"r": 0.9250425894,
|
36 |
+
"f": 0.9211195929
|
37 |
},
|
38 |
+
"nsubj": {
|
39 |
+
"p": 0.8803986711,
|
40 |
+
"r": 0.8372827804,
|
41 |
+
"f": 0.8582995951
|
42 |
},
|
43 |
+
"nmod": {
|
44 |
+
"p": 0.8218838527,
|
45 |
+
"r": 0.8286326312,
|
46 |
+
"f": 0.8252444444
|
47 |
},
|
48 |
+
"aux": {
|
49 |
+
"p": 0.9867924528,
|
50 |
+
"r": 0.9561243144,
|
51 |
+
"f": 0.9712163417
|
52 |
},
|
53 |
+
"advcl": {
|
54 |
+
"p": 0.5862068966,
|
55 |
+
"r": 0.6390977444,
|
56 |
+
"f": 0.6115107914
|
57 |
},
|
58 |
+
"obj": {
|
59 |
+
"p": 0.8326180258,
|
60 |
+
"r": 0.896073903,
|
61 |
+
"f": 0.8631813126
|
62 |
},
|
63 |
+
"det": {
|
64 |
+
"p": 0.9575688073,
|
65 |
+
"r": 0.9456398641,
|
66 |
+
"f": 0.9515669516
|
67 |
},
|
68 |
+
"cc": {
|
69 |
+
"p": 0.9340425532,
|
70 |
+
"r": 0.9164926931,
|
71 |
+
"f": 0.9251844046
|
72 |
},
|
73 |
+
"conj": {
|
74 |
+
"p": 0.6115288221,
|
75 |
+
"r": 0.5654692932,
|
76 |
+
"f": 0.5875978326
|
77 |
},
|
78 |
+
"nummod": {
|
79 |
+
"p": 0.887675507,
|
80 |
+
"r": 0.8835403727,
|
81 |
+
"f": 0.8856031128
|
82 |
},
|
83 |
+
"acl": {
|
84 |
+
"p": 0.8063583815,
|
85 |
+
"r": 0.7209302326,
|
86 |
+
"f": 0.761255116
|
87 |
},
|
88 |
+
"advmod": {
|
89 |
+
"p": 0.8117048346,
|
90 |
+
"r": 0.8416886544,
|
91 |
+
"f": 0.8264248705
|
92 |
},
|
93 |
+
"obl": {
|
94 |
+
"p": 0.6821052632,
|
95 |
+
"r": 0.8223350254,
|
96 |
+
"f": 0.7456846951
|
97 |
},
|
98 |
+
"expl:pass": {
|
99 |
+
"p": 0.8085106383,
|
100 |
+
"r": 0.7037037037,
|
101 |
+
"f": 0.7524752475
|
102 |
},
|
103 |
+
"nsubj:pass": {
|
104 |
+
"p": 0.8,
|
105 |
+
"r": 0.756097561,
|
106 |
+
"f": 0.7774294671
|
107 |
},
|
108 |
+
"fixed": {
|
109 |
+
"p": 0.9,
|
110 |
+
"r": 0.8562367865,
|
111 |
+
"f": 0.8775731311
|
112 |
},
|
113 |
+
"appos": {
|
114 |
+
"p": 0.4956896552,
|
115 |
+
"r": 0.4389312977,
|
116 |
+
"f": 0.4655870445
|
117 |
},
|
118 |
+
"parataxis": {
|
119 |
+
"p": 0.1627906977,
|
120 |
+
"r": 0.2,
|
121 |
+
"f": 0.1794871795
|
122 |
},
|
123 |
+
"aux:pass": {
|
124 |
+
"p": 0.9125,
|
125 |
+
"r": 0.9733333333,
|
126 |
+
"f": 0.9419354839
|
127 |
+
},
|
128 |
+
"nmod:agent": {
|
129 |
"p": 0.0,
|
130 |
"r": 0.0,
|
131 |
"f": 0.0
|
132 |
},
|
133 |
+
"ccomp": {
|
134 |
+
"p": 0.8759689922,
|
135 |
+
"r": 0.8759689922,
|
136 |
+
"f": 0.8759689922
|
137 |
},
|
138 |
+
"nmod:pmod": {
|
139 |
"p": 0.0,
|
140 |
"r": 0.0,
|
141 |
"f": 0.0
|
142 |
},
|
143 |
+
"iobj": {
|
144 |
+
"p": 0.8157894737,
|
145 |
+
"r": 0.7654320988,
|
146 |
+
"f": 0.7898089172
|
147 |
},
|
148 |
"flat": {
|
149 |
+
"p": 0.7557251908,
|
150 |
+
"r": 0.7815789474,
|
151 |
+
"f": 0.7684346701
|
152 |
},
|
153 |
+
"cop": {
|
154 |
+
"p": 0.8524590164,
|
155 |
+
"r": 0.8387096774,
|
156 |
+
"f": 0.8455284553
|
157 |
},
|
158 |
+
"csubj": {
|
159 |
+
"p": 0.8235294118,
|
160 |
+
"r": 0.6666666667,
|
161 |
+
"f": 0.7368421053
|
162 |
},
|
163 |
+
"obl:agent": {
|
164 |
+
"p": 0.0,
|
165 |
+
"r": 0.0,
|
166 |
+
"f": 0.0
|
167 |
},
|
168 |
+
"dep": {
|
169 |
+
"p": 0.0,
|
170 |
+
"r": 0.0,
|
171 |
+
"f": 0.0
|
172 |
},
|
173 |
"expl:pv": {
|
174 |
+
"p": 0.7564102564,
|
175 |
+
"r": 0.8550724638,
|
176 |
+
"f": 0.8027210884
|
177 |
},
|
178 |
+
"expl": {
|
179 |
+
"p": 0.6875,
|
180 |
+
"r": 0.8148148148,
|
181 |
+
"f": 0.7457627119
|
182 |
},
|
183 |
+
"obl:pmod": {
|
184 |
+
"p": 0.0,
|
185 |
+
"r": 0.0,
|
186 |
+
"f": 0.0
|
187 |
},
|
188 |
"expl:poss": {
|
189 |
+
"p": 0.9655172414,
|
190 |
+
"r": 0.9032258065,
|
191 |
+
"f": 0.9333333333
|
192 |
},
|
193 |
+
"goeswith": {
|
194 |
"p": 0.0,
|
195 |
"r": 0.0,
|
196 |
"f": 0.0
|
197 |
},
|
198 |
"xcomp": {
|
199 |
+
"p": 0.5806451613,
|
200 |
+
"r": 0.6666666667,
|
201 |
+
"f": 0.6206896552
|
202 |
},
|
203 |
+
"orphan": {
|
204 |
"p": 0.0,
|
205 |
"r": 0.0,
|
206 |
"f": 0.0
|
207 |
},
|
208 |
+
"expl:impers": {
|
209 |
+
"p": 1.0,
|
210 |
+
"r": 0.3333333333,
|
211 |
+
"f": 0.5
|
212 |
},
|
213 |
+
"csubj:pass": {
|
214 |
"p": 0.0,
|
215 |
"r": 0.0,
|
216 |
"f": 0.0
|
217 |
},
|
218 |
"compound": {
|
219 |
+
"p": 0.5714285714,
|
220 |
+
"r": 0.5714285714,
|
221 |
+
"f": 0.5714285714
|
222 |
},
|
223 |
+
"list": {
|
224 |
"p": 0.0,
|
225 |
"r": 0.0,
|
226 |
"f": 0.0
|
227 |
},
|
228 |
+
"ccomp:pmod": {
|
229 |
"p": 0.25,
|
230 |
"r": 0.3333333333,
|
231 |
"f": 0.2857142857
|
232 |
},
|
233 |
+
"cc:preconj": {
|
234 |
"p": 0.0,
|
235 |
"r": 0.0,
|
236 |
"f": 0.0
|
237 |
+
}
|
238 |
+
},
|
239 |
+
"pos_acc": 0.9405873228,
|
240 |
+
"morph_acc": 0.9510657636,
|
241 |
+
"morph_micro_p": 0.9896160458,
|
242 |
+
"morph_micro_r": 0.9582489383,
|
243 |
+
"morph_micro_f": 0.9706797273,
|
244 |
+
"morph_per_feat": {
|
245 |
+
"Case": {
|
246 |
+
"p": 0.9938697318,
|
247 |
+
"r": 0.9896985883,
|
248 |
+
"f": 0.9917797744
|
249 |
+
},
|
250 |
+
"Gender": {
|
251 |
+
"p": 0.991821842,
|
252 |
+
"r": 0.9854981873,
|
253 |
+
"f": 0.9886499028
|
254 |
+
},
|
255 |
+
"Number": {
|
256 |
+
"p": 0.9894903379,
|
257 |
+
"r": 0.922363847,
|
258 |
+
"f": 0.9547486643
|
259 |
},
|
260 |
+
"Person": {
|
261 |
+
"p": 0.9911452184,
|
262 |
+
"r": 0.9893930466,
|
263 |
+
"f": 0.9902683574
|
264 |
+
},
|
265 |
+
"PronType": {
|
266 |
+
"p": 0.9965349965,
|
267 |
+
"r": 0.993780235,
|
268 |
+
"f": 0.9951557093
|
269 |
+
},
|
270 |
+
"Polarity": {
|
271 |
+
"p": 0.9918566775,
|
272 |
+
"r": 0.9983606557,
|
273 |
+
"f": 0.9950980392
|
274 |
+
},
|
275 |
+
"AdpType": {
|
276 |
+
"p": 0.998982706,
|
277 |
+
"r": 0.9969543147,
|
278 |
+
"f": 0.9979674797
|
279 |
+
},
|
280 |
+
"Definite": {
|
281 |
+
"p": 0.9886490807,
|
282 |
+
"r": 0.9815873016,
|
283 |
+
"f": 0.9851055356
|
284 |
+
},
|
285 |
+
"Degree": {
|
286 |
+
"p": 0.9582772544,
|
287 |
+
"r": 0.9563465413,
|
288 |
+
"f": 0.9573109244
|
289 |
+
},
|
290 |
+
"VerbForm": {
|
291 |
+
"p": 0.9774236388,
|
292 |
+
"r": 0.9787234043,
|
293 |
+
"f": 0.9780730897
|
294 |
+
},
|
295 |
+
"Abbr": {
|
296 |
+
"p": 0.9538461538,
|
297 |
+
"r": 0.8303571429,
|
298 |
+
"f": 0.8878281623
|
299 |
+
},
|
300 |
+
"Poss": {
|
301 |
+
"p": 1.0,
|
302 |
+
"r": 0.9927710843,
|
303 |
+
"f": 0.9963724305
|
304 |
+
},
|
305 |
+
"NumForm": {
|
306 |
+
"p": 0.9871794872,
|
307 |
+
"r": 0.3181818182,
|
308 |
+
"f": 0.48125
|
309 |
+
},
|
310 |
+
"NumType": {
|
311 |
+
"p": 0.9872881356,
|
312 |
+
"r": 0.3200549451,
|
313 |
+
"f": 0.4834024896
|
314 |
+
},
|
315 |
+
"Reflex": {
|
316 |
+
"p": 1.0,
|
317 |
+
"r": 1.0,
|
318 |
+
"f": 1.0
|
319 |
+
},
|
320 |
+
"Strength": {
|
321 |
+
"p": 0.9920318725,
|
322 |
+
"r": 0.9880952381,
|
323 |
+
"f": 0.9900596421
|
324 |
+
},
|
325 |
+
"Mood": {
|
326 |
+
"p": 0.972826087,
|
327 |
+
"r": 0.9853211009,
|
328 |
+
"f": 0.9790337284
|
329 |
+
},
|
330 |
+
"Tense": {
|
331 |
+
"p": 0.9725036179,
|
332 |
+
"r": 0.976744186,
|
333 |
+
"f": 0.9746192893
|
334 |
+
},
|
335 |
+
"Variant": {
|
336 |
+
"p": 0.9932885906,
|
337 |
+
"r": 0.9548387097,
|
338 |
+
"f": 0.9736842105
|
339 |
+
},
|
340 |
+
"Position": {
|
341 |
+
"p": 1.0,
|
342 |
+
"r": 0.9910714286,
|
343 |
+
"f": 0.9955156951
|
344 |
+
},
|
345 |
+
"Number[psor]": {
|
346 |
+
"p": 1.0,
|
347 |
+
"r": 0.9666666667,
|
348 |
+
"f": 0.9830508475
|
349 |
+
},
|
350 |
+
"PartType": {
|
351 |
+
"p": 1.0,
|
352 |
+
"r": 0.9459459459,
|
353 |
+
"f": 0.9722222222
|
354 |
+
},
|
355 |
+
"Foreign": {
|
356 |
"p": 0.0,
|
357 |
"r": 0.0,
|
358 |
"f": 0.0
|
359 |
}
|
360 |
},
|
361 |
+
"lemma_acc": 0.8183070924,
|
362 |
+
"ents_p": 0.7550713749,
|
363 |
+
"ents_r": 0.7721859393,
|
364 |
+
"ents_f": 0.7635327635,
|
365 |
"ents_per_type": {
|
366 |
"DATETIME": {
|
367 |
+
"p": 0.7818791946,
|
368 |
+
"r": 0.8118466899,
|
369 |
+
"f": 0.7965811966
|
370 |
},
|
371 |
"ORGANIZATION": {
|
372 |
+
"p": 0.7076923077,
|
373 |
+
"r": 0.7324840764,
|
374 |
+
"f": 0.7198748044
|
375 |
},
|
376 |
"FACILITY": {
|
377 |
+
"p": 0.5039370079,
|
378 |
+
"r": 0.4885496183,
|
379 |
+
"f": 0.496124031
|
380 |
+
},
|
381 |
+
"PRODUCT": {
|
382 |
+
"p": 0.5590551181,
|
383 |
+
"r": 0.5182481752,
|
384 |
+
"f": 0.5378787879
|
385 |
},
|
386 |
"NUMERIC_VALUE": {
|
387 |
+
"p": 0.8875502008,
|
388 |
+
"r": 0.936440678,
|
389 |
+
"f": 0.9113402062
|
390 |
},
|
391 |
"ORDINAL": {
|
392 |
+
"p": 0.8214285714,
|
393 |
"r": 0.8363636364,
|
394 |
+
"f": 0.8288288288
|
395 |
},
|
396 |
"EVENT": {
|
397 |
+
"p": 0.5151515152,
|
398 |
+
"r": 0.4594594595,
|
399 |
+
"f": 0.4857142857
|
400 |
},
|
401 |
"GPE": {
|
402 |
+
"p": 0.8636363636,
|
403 |
+
"r": 0.8735632184,
|
404 |
+
"f": 0.8685714286
|
405 |
},
|
406 |
"PERSON": {
|
407 |
+
"p": 0.7046153846,
|
408 |
+
"r": 0.7684563758,
|
409 |
+
"f": 0.735152488
|
410 |
},
|
411 |
"NAT_REL_POL": {
|
412 |
+
"p": 0.9315068493,
|
413 |
"r": 0.9066666667,
|
414 |
+
"f": 0.9189189189
|
415 |
},
|
416 |
"MONEY": {
|
417 |
+
"p": 0.9622641509,
|
418 |
+
"r": 0.8793103448,
|
419 |
+
"f": 0.9189189189
|
420 |
},
|
421 |
"LOC": {
|
422 |
+
"p": 0.4864864865,
|
423 |
+
"r": 0.4736842105,
|
424 |
+
"f": 0.48
|
425 |
},
|
426 |
"WORK_OF_ART": {
|
427 |
+
"p": 0.3571428571,
|
428 |
+
"r": 0.2631578947,
|
429 |
+
"f": 0.303030303
|
430 |
},
|
431 |
"QUANTITY": {
|
432 |
+
"p": 0.962962963,
|
433 |
+
"r": 1.0,
|
434 |
+
"f": 0.9811320755
|
435 |
},
|
436 |
"LANGUAGE": {
|
437 |
+
"p": 0.6666666667,
|
438 |
+
"r": 1.0,
|
439 |
+
"f": 0.8
|
440 |
+
},
|
441 |
+
"PERIOD": {
|
442 |
+
"p": 0.8648648649,
|
443 |
+
"r": 0.7619047619,
|
444 |
+
"f": 0.8101265823
|
445 |
}
|
446 |
+
},
|
447 |
+
"speed": 7699.716829035
|
448 |
}
|
attribute_ruler/patterns
CHANGED
Binary files a/attribute_ruler/patterns and b/attribute_ruler/patterns differ
|
|
config.cfg
CHANGED
@@ -1,10 +1,8 @@
|
|
1 |
[paths]
|
2 |
-
train =
|
3 |
-
dev =
|
4 |
-
vectors =
|
5 |
-
raw = null
|
6 |
init_tok2vec = null
|
7 |
-
vocab_data = null
|
8 |
|
9 |
[system]
|
10 |
gpu_allocator = null
|
@@ -24,6 +22,7 @@ tokenizer = {"@tokenizers":"spacy.Tokenizer.v1"}
|
|
24 |
|
25 |
[components.attribute_ruler]
|
26 |
factory = "attribute_ruler"
|
|
|
27 |
validate = false
|
28 |
|
29 |
[components.lemmatizer]
|
@@ -31,11 +30,13 @@ factory = "lemmatizer"
|
|
31 |
mode = "lookup"
|
32 |
model = null
|
33 |
overwrite = false
|
|
|
34 |
|
35 |
[components.ner]
|
36 |
factory = "ner"
|
37 |
incorrect_spans_key = null
|
38 |
moves = null
|
|
|
39 |
update_with_oracle_cut_size = 100
|
40 |
|
41 |
[components.ner.model]
|
@@ -53,8 +54,8 @@ nO = null
|
|
53 |
[components.ner.model.tok2vec.embed]
|
54 |
@architectures = "spacy.MultiHashEmbed.v2"
|
55 |
width = 96
|
56 |
-
attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
|
57 |
-
rows = [5000,2500,2500,2500]
|
58 |
include_static_vectors = true
|
59 |
|
60 |
[components.ner.model.tok2vec.encode]
|
@@ -69,6 +70,7 @@ factory = "parser"
|
|
69 |
learn_tokens = false
|
70 |
min_action_freq = 30
|
71 |
moves = null
|
|
|
72 |
update_with_oracle_cut_size = 100
|
73 |
|
74 |
[components.parser.model]
|
@@ -87,6 +89,8 @@ upstream = "tok2vec"
|
|
87 |
|
88 |
[components.senter]
|
89 |
factory = "senter"
|
|
|
|
|
90 |
|
91 |
[components.senter.model]
|
92 |
@architectures = "spacy.Tagger.v1"
|
@@ -98,8 +102,8 @@ nO = null
|
|
98 |
[components.senter.model.tok2vec.embed]
|
99 |
@architectures = "spacy.MultiHashEmbed.v2"
|
100 |
width = 16
|
101 |
-
attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
|
102 |
-
rows = [1000,500,500,500]
|
103 |
include_static_vectors = true
|
104 |
|
105 |
[components.senter.model.tok2vec.encode]
|
@@ -111,6 +115,8 @@ maxout_pieces = 2
|
|
111 |
|
112 |
[components.tagger]
|
113 |
factory = "tagger"
|
|
|
|
|
114 |
|
115 |
[components.tagger.model]
|
116 |
@architectures = "spacy.Tagger.v1"
|
@@ -130,8 +136,8 @@ factory = "tok2vec"
|
|
130 |
[components.tok2vec.model.embed]
|
131 |
@architectures = "spacy.MultiHashEmbed.v2"
|
132 |
width = ${components.tok2vec.model.encode:width}
|
133 |
-
attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
|
134 |
-
rows = [5000,2500,2500,2500]
|
135 |
include_static_vectors = true
|
136 |
|
137 |
[components.tok2vec.model.encode]
|
@@ -145,22 +151,19 @@ maxout_pieces = 3
|
|
145 |
|
146 |
[corpora.dev]
|
147 |
@readers = "spacy.Corpus.v1"
|
148 |
-
|
149 |
-
max_length = 0
|
150 |
-
path = ${paths:dev}
|
151 |
gold_preproc = false
|
|
|
|
|
152 |
augmenter = null
|
153 |
|
154 |
[corpora.train]
|
155 |
@readers = "spacy.Corpus.v1"
|
156 |
-
path = ${paths
|
157 |
-
max_length = 5000
|
158 |
gold_preproc = false
|
|
|
159 |
limit = 0
|
160 |
-
|
161 |
-
[corpora.train.augmenter]
|
162 |
-
@augmenters = "spacy.lower_case.v1"
|
163 |
-
level = 0.1
|
164 |
|
165 |
[training]
|
166 |
train_corpus = "corpora.train"
|
@@ -191,9 +194,8 @@ compound = 1.001
|
|
191 |
t = 0.0
|
192 |
|
193 |
[training.logger]
|
194 |
-
@loggers = "spacy.
|
195 |
-
|
196 |
-
remove_config_values = []
|
197 |
|
198 |
[training.optimizer]
|
199 |
@optimizers = "Adam.v1"
|
@@ -214,16 +216,17 @@ dep_las_per_type = null
|
|
214 |
sents_p = null
|
215 |
sents_r = null
|
216 |
sents_f = 0.02
|
217 |
-
lemma_acc = 0.
|
218 |
-
ents_f = 0.
|
219 |
ents_p = 0.0
|
220 |
ents_r = 0.0
|
221 |
ents_per_type = null
|
|
|
222 |
|
223 |
[pretraining]
|
224 |
|
225 |
[initialize]
|
226 |
-
vocab_data =
|
227 |
vectors = ${paths.vectors}
|
228 |
init_tok2vec = ${paths.init_tok2vec}
|
229 |
before_init = null
|
|
|
1 |
[paths]
|
2 |
+
train = null
|
3 |
+
dev = null
|
4 |
+
vectors = null
|
|
|
5 |
init_tok2vec = null
|
|
|
6 |
|
7 |
[system]
|
8 |
gpu_allocator = null
|
|
|
22 |
|
23 |
[components.attribute_ruler]
|
24 |
factory = "attribute_ruler"
|
25 |
+
scorer = {"@scorers":"spacy.attribute_ruler_scorer.v1"}
|
26 |
validate = false
|
27 |
|
28 |
[components.lemmatizer]
|
|
|
30 |
mode = "lookup"
|
31 |
model = null
|
32 |
overwrite = false
|
33 |
+
scorer = {"@scorers":"spacy.lemmatizer_scorer.v1"}
|
34 |
|
35 |
[components.ner]
|
36 |
factory = "ner"
|
37 |
incorrect_spans_key = null
|
38 |
moves = null
|
39 |
+
scorer = {"@scorers":"spacy.ner_scorer.v1"}
|
40 |
update_with_oracle_cut_size = 100
|
41 |
|
42 |
[components.ner.model]
|
|
|
54 |
[components.ner.model.tok2vec.embed]
|
55 |
@architectures = "spacy.MultiHashEmbed.v2"
|
56 |
width = 96
|
57 |
+
attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
|
58 |
+
rows = [5000,2500,2500,2500,100]
|
59 |
include_static_vectors = true
|
60 |
|
61 |
[components.ner.model.tok2vec.encode]
|
|
|
70 |
learn_tokens = false
|
71 |
min_action_freq = 30
|
72 |
moves = null
|
73 |
+
scorer = {"@scorers":"spacy.parser_scorer.v1"}
|
74 |
update_with_oracle_cut_size = 100
|
75 |
|
76 |
[components.parser.model]
|
|
|
89 |
|
90 |
[components.senter]
|
91 |
factory = "senter"
|
92 |
+
overwrite = false
|
93 |
+
scorer = {"@scorers":"spacy.senter_scorer.v1"}
|
94 |
|
95 |
[components.senter.model]
|
96 |
@architectures = "spacy.Tagger.v1"
|
|
|
102 |
[components.senter.model.tok2vec.embed]
|
103 |
@architectures = "spacy.MultiHashEmbed.v2"
|
104 |
width = 16
|
105 |
+
attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
|
106 |
+
rows = [1000,500,500,500,50]
|
107 |
include_static_vectors = true
|
108 |
|
109 |
[components.senter.model.tok2vec.encode]
|
|
|
115 |
|
116 |
[components.tagger]
|
117 |
factory = "tagger"
|
118 |
+
overwrite = false
|
119 |
+
scorer = {"@scorers":"spacy.tagger_scorer.v1"}
|
120 |
|
121 |
[components.tagger.model]
|
122 |
@architectures = "spacy.Tagger.v1"
|
|
|
136 |
[components.tok2vec.model.embed]
|
137 |
@architectures = "spacy.MultiHashEmbed.v2"
|
138 |
width = ${components.tok2vec.model.encode:width}
|
139 |
+
attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
|
140 |
+
rows = [5000,2500,2500,2500,100]
|
141 |
include_static_vectors = true
|
142 |
|
143 |
[components.tok2vec.model.encode]
|
|
|
151 |
|
152 |
[corpora.dev]
|
153 |
@readers = "spacy.Corpus.v1"
|
154 |
+
path = ${paths.dev}
|
|
|
|
|
155 |
gold_preproc = false
|
156 |
+
max_length = 0
|
157 |
+
limit = 0
|
158 |
augmenter = null
|
159 |
|
160 |
[corpora.train]
|
161 |
@readers = "spacy.Corpus.v1"
|
162 |
+
path = ${paths.train}
|
|
|
163 |
gold_preproc = false
|
164 |
+
max_length = 0
|
165 |
limit = 0
|
166 |
+
augmenter = null
|
|
|
|
|
|
|
167 |
|
168 |
[training]
|
169 |
train_corpus = "corpora.train"
|
|
|
194 |
t = 0.0
|
195 |
|
196 |
[training.logger]
|
197 |
+
@loggers = "spacy.ConsoleLogger.v1"
|
198 |
+
progress_bar = false
|
|
|
199 |
|
200 |
[training.optimizer]
|
201 |
@optimizers = "Adam.v1"
|
|
|
216 |
sents_p = null
|
217 |
sents_r = null
|
218 |
sents_f = 0.02
|
219 |
+
lemma_acc = 0.5
|
220 |
+
ents_f = 0.16
|
221 |
ents_p = 0.0
|
222 |
ents_r = 0.0
|
223 |
ents_per_type = null
|
224 |
+
speed = 0.0
|
225 |
|
226 |
[pretraining]
|
227 |
|
228 |
[initialize]
|
229 |
+
vocab_data = null
|
230 |
vectors = ${paths.vectors}
|
231 |
init_tok2vec = ${paths.init_tok2vec}
|
232 |
before_init = null
|
meta.json
CHANGED
@@ -1,14 +1,14 @@
|
|
1 |
{
|
2 |
"lang":"ro",
|
3 |
"name":"core_news_lg",
|
4 |
-
"version":"3.
|
5 |
"description":"Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
|
6 |
"author":"Explosion",
|
7 |
"email":"contact@explosion.ai",
|
8 |
"url":"https://explosion.ai",
|
9 |
"license":"CC BY-SA 4.0",
|
10 |
-
"spacy_version":">=3.
|
11 |
-
"spacy_git_version":"
|
12 |
"vectors":{
|
13 |
"width":300,
|
14 |
"vectors":500000,
|
@@ -30,6 +30,7 @@
|
|
30 |
"Afp",
|
31 |
"Afp-p-n",
|
32 |
"Afp-poy",
|
|
|
33 |
"Afpf--n",
|
34 |
"Afpfp-n",
|
35 |
"Afpfp-ny",
|
@@ -131,6 +132,7 @@
|
|
131 |
"Ds2ms-s",
|
132 |
"Ds3---p",
|
133 |
"Ds3---s",
|
|
|
134 |
"Ds3fp-s",
|
135 |
"Ds3fsos",
|
136 |
"Ds3fsrs",
|
@@ -159,18 +161,23 @@
|
|
159 |
"LSQR",
|
160 |
"LT",
|
161 |
"M",
|
162 |
-
"Mc",
|
163 |
"Mc-p-d",
|
164 |
"Mc-p-l",
|
|
|
|
|
|
|
165 |
"Mcfp-l",
|
166 |
"Mcfp-ln",
|
167 |
"Mcfprln",
|
168 |
"Mcfprly",
|
169 |
"Mcfsoln",
|
|
|
170 |
"Mcfsrln",
|
|
|
171 |
"Mcmp-l",
|
172 |
"Mcms-ln",
|
173 |
"Mcmsrl",
|
|
|
174 |
"Mcmsrly",
|
175 |
"Mffprln",
|
176 |
"Mffsrln",
|
@@ -243,7 +250,6 @@
|
|
243 |
"Pd3mpr--y",
|
244 |
"Pd3mso",
|
245 |
"Pd3msr",
|
246 |
-
"Pi3",
|
247 |
"Pi3--r",
|
248 |
"Pi3-po",
|
249 |
"Pi3-so",
|
@@ -289,6 +295,7 @@
|
|
289 |
"Pp3-po--------s",
|
290 |
"Pp3-sd--------w",
|
291 |
"Pp3-sd--y-----w",
|
|
|
292 |
"Pp3fpa--------w",
|
293 |
"Pp3fpa--y-----w",
|
294 |
"Pp3fpr--------s",
|
@@ -315,7 +322,6 @@
|
|
315 |
"Ps2fp-s",
|
316 |
"Ps2fsrp",
|
317 |
"Ps2fsrs",
|
318 |
-
"Ps2ms-s",
|
319 |
"Ps3---p",
|
320 |
"Ps3---s",
|
321 |
"Ps3fp-s",
|
@@ -348,7 +354,6 @@
|
|
348 |
"RPAR",
|
349 |
"RSQR",
|
350 |
"Rc",
|
351 |
-
"Rgc",
|
352 |
"Rgp",
|
353 |
"Rgpy",
|
354 |
"Rgs",
|
@@ -406,6 +411,7 @@
|
|
406 |
"Va--3s",
|
407 |
"Va--3s----y",
|
408 |
"Vag",
|
|
|
409 |
"Vaii1",
|
410 |
"Vaii2s",
|
411 |
"Vaii3p",
|
@@ -475,7 +481,7 @@
|
|
475 |
"Vmp--sm",
|
476 |
"Vmp--sm---y",
|
477 |
"Vmsp1p",
|
478 |
-
"
|
479 |
"Vmsp2s",
|
480 |
"Vmsp3",
|
481 |
"Vmsp3-----y",
|
@@ -488,6 +494,7 @@
|
|
488 |
"Ynmsoy",
|
489 |
"Ynmsry",
|
490 |
"Yp",
|
|
|
491 |
"Yp-sr",
|
492 |
"Yr"
|
493 |
],
|
@@ -525,14 +532,14 @@
|
|
525 |
"iobj",
|
526 |
"mark",
|
527 |
"nmod",
|
528 |
-
"nmod:agent",
|
529 |
-
"nmod:pmod",
|
530 |
"nmod:tmod",
|
531 |
"nsubj",
|
532 |
"nsubj:pass",
|
533 |
"nummod",
|
534 |
"obj",
|
535 |
"obl",
|
|
|
|
|
536 |
"orphan",
|
537 |
"parataxis",
|
538 |
"punct",
|
@@ -590,450 +597,451 @@
|
|
590 |
],
|
591 |
"performance":{
|
592 |
"token_acc":0.9990029326,
|
593 |
-
"
|
594 |
-
"
|
595 |
-
"
|
596 |
-
"
|
597 |
-
"
|
598 |
-
"
|
599 |
-
"
|
600 |
-
"
|
601 |
-
"
|
602 |
-
"
|
603 |
-
|
604 |
-
|
605 |
-
|
606 |
-
|
607 |
-
"AdpType":{
|
608 |
-
"p":0.9970784641,
|
609 |
-
"r":0.9941739492,
|
610 |
-
"f":0.9956240884
|
611 |
},
|
612 |
-
"
|
613 |
-
"p":0.
|
614 |
-
"r":0.
|
615 |
-
"f":0.
|
616 |
},
|
617 |
-
"
|
618 |
-
"p":0.
|
619 |
-
"r":0.
|
620 |
-
"f":0.
|
621 |
},
|
622 |
-
"
|
623 |
-
"p":0.
|
624 |
-
"r":0.
|
625 |
-
"f":0.
|
626 |
},
|
627 |
-
"
|
628 |
-
"p":0.
|
629 |
-
"r":0.
|
630 |
-
"f":0.
|
631 |
},
|
632 |
-
"
|
633 |
-
"p":0.
|
634 |
-
"r":0.
|
635 |
-
"f":0.
|
636 |
},
|
637 |
-
"
|
638 |
-
"p":0.
|
639 |
-
"r":0.
|
640 |
-
"f":0.
|
641 |
},
|
642 |
-
"
|
643 |
-
"p":0.
|
644 |
-
"r":0.
|
645 |
-
"f":0.
|
646 |
},
|
647 |
-
"
|
648 |
-
"p":0.
|
649 |
-
"r":0.
|
650 |
-
"f":0.
|
651 |
},
|
652 |
-
"
|
653 |
-
"p":0.
|
654 |
-
"r":0.
|
655 |
-
"f":0.
|
656 |
},
|
657 |
-
"
|
658 |
-
"p":0.
|
659 |
-
"r":0.
|
660 |
-
"f":0.
|
661 |
},
|
662 |
-
"
|
663 |
-
"p":0.
|
664 |
-
"r":0.
|
665 |
-
"f":0.
|
666 |
},
|
667 |
-
"
|
668 |
-
"p":0.
|
669 |
-
"r":0.
|
670 |
-
"f":0.
|
671 |
},
|
672 |
-
"
|
673 |
-
"p":0.
|
674 |
-
"r":0.
|
675 |
-
"f":0.
|
676 |
},
|
677 |
-
"
|
678 |
-
"p":0.
|
679 |
-
"r":0.
|
680 |
-
"f":0.
|
681 |
},
|
682 |
-
"
|
683 |
-
"p":0.
|
684 |
-
"r":0.
|
685 |
-
"f":0.
|
686 |
},
|
687 |
-
"
|
688 |
-
"p":0.
|
689 |
-
"r":0.
|
690 |
-
"f":0.
|
691 |
},
|
692 |
-
"
|
693 |
-
"p":0.
|
694 |
-
"r":0.
|
695 |
-
"f":0.
|
696 |
},
|
697 |
-
"
|
698 |
-
"p":0.
|
699 |
-
"r":0.
|
700 |
-
"f":0.
|
701 |
},
|
702 |
-
"
|
703 |
-
"p":0.
|
704 |
-
"r":0.
|
705 |
-
"f":0.
|
706 |
},
|
707 |
-
"
|
708 |
-
"p":0.
|
709 |
-
"r":0.
|
710 |
-
"f":0.
|
711 |
},
|
712 |
-
"
|
713 |
-
"p":0.
|
714 |
-
"r":0.
|
715 |
-
"f":0.
|
716 |
},
|
717 |
-
"
|
718 |
"p":0.0,
|
719 |
"r":0.0,
|
720 |
"f":0.0
|
721 |
-
}
|
722 |
-
},
|
723 |
-
"dep_las_per_type":{
|
724 |
-
"case":{
|
725 |
-
"p":0.9257307139,
|
726 |
-
"r":0.9415204678,
|
727 |
-
"f":0.9335588306
|
728 |
},
|
729 |
-
"
|
730 |
-
"p":0.
|
731 |
-
"r":0.
|
732 |
-
"f":0.
|
733 |
-
},
|
734 |
-
"nmod:tmod":{
|
735 |
-
"p":0.4,
|
736 |
-
"r":0.0465116279,
|
737 |
-
"f":0.0833333333
|
738 |
-
},
|
739 |
-
"amod":{
|
740 |
-
"p":0.8639212175,
|
741 |
-
"r":0.8756805808,
|
742 |
-
"f":0.8697611537
|
743 |
-
},
|
744 |
-
"cc":{
|
745 |
-
"p":0.8669354839,
|
746 |
-
"r":0.89958159,
|
747 |
-
"f":0.8829568789
|
748 |
-
},
|
749 |
-
"conj":{
|
750 |
-
"p":0.5984962406,
|
751 |
-
"r":0.6012084592,
|
752 |
-
"f":0.5998492841
|
753 |
-
},
|
754 |
-
"nmod":{
|
755 |
-
"p":0.7883565797,
|
756 |
-
"r":0.8217446271,
|
757 |
-
"f":0.8047044259
|
758 |
-
},
|
759 |
-
"mark":{
|
760 |
-
"p":0.8857142857,
|
761 |
-
"r":0.9056179775,
|
762 |
-
"f":0.8955555556
|
763 |
-
},
|
764 |
-
"fixed":{
|
765 |
-
"p":0.8689217759,
|
766 |
-
"r":0.7172774869,
|
767 |
-
"f":0.7858508604
|
768 |
-
},
|
769 |
-
"nsubj":{
|
770 |
-
"p":0.8333333333,
|
771 |
-
"r":0.7814485388,
|
772 |
-
"f":0.806557377
|
773 |
},
|
774 |
-
"
|
775 |
"p":0.0,
|
776 |
"r":0.0,
|
777 |
"f":0.0
|
778 |
},
|
779 |
-
"
|
780 |
-
"p":0.
|
781 |
-
"r":0.
|
782 |
-
"f":0.
|
783 |
-
},
|
784 |
-
"nummod":{
|
785 |
-
"p":0.8703703704,
|
786 |
-
"r":0.8676923077,
|
787 |
-
"f":0.8690292758
|
788 |
},
|
789 |
"flat":{
|
790 |
-
"p":0.
|
791 |
-
"r":0.
|
792 |
-
"f":0.
|
793 |
},
|
794 |
-
"
|
795 |
-
"p":0.
|
796 |
-
"r":0.
|
797 |
-
"f":0.
|
798 |
},
|
799 |
-
"
|
800 |
-
"p":0.
|
801 |
-
"r":0.
|
802 |
-
"f":0.
|
803 |
},
|
804 |
-
"
|
805 |
-
"p":0.
|
806 |
-
"r":0.
|
807 |
-
"f":0.
|
808 |
},
|
809 |
-
"
|
810 |
-
"p":0.
|
811 |
-
"r":0.
|
812 |
-
"f":0.
|
813 |
},
|
814 |
"expl:pv":{
|
815 |
-
"p":0.
|
816 |
-
"r":0.
|
817 |
-
"f":0.
|
818 |
-
},
|
819 |
-
"root":{
|
820 |
-
"p":0.917222964,
|
821 |
-
"r":0.9135638298,
|
822 |
-
"f":0.9153897402
|
823 |
-
},
|
824 |
-
"advcl":{
|
825 |
-
"p":0.5625,
|
826 |
-
"r":0.5853658537,
|
827 |
-
"f":0.5737051793
|
828 |
},
|
829 |
-
"
|
830 |
-
"p":0.
|
831 |
-
"r":0.
|
832 |
-
"f":0.
|
833 |
-
},
|
834 |
-
"ccomp":{
|
835 |
-
"p":0.7178217822,
|
836 |
-
"r":0.8146067416,
|
837 |
-
"f":0.7631578947
|
838 |
-
},
|
839 |
-
"goeswith":{
|
840 |
-
"p":0.875,
|
841 |
-
"r":0.5833333333,
|
842 |
-
"f":0.7
|
843 |
},
|
844 |
-
"
|
845 |
-
"p":0.
|
846 |
-
"r":0.
|
847 |
-
"f":0.
|
848 |
},
|
849 |
"expl:poss":{
|
850 |
-
"p":0.
|
851 |
-
"r":0.
|
852 |
-
"f":0.
|
853 |
},
|
854 |
-
"
|
855 |
-
"p":0.7647058824,
|
856 |
-
"r":0.8024691358,
|
857 |
-
"f":0.7831325301
|
858 |
-
},
|
859 |
-
"cc:preconj":{
|
860 |
"p":0.0,
|
861 |
"r":0.0,
|
862 |
"f":0.0
|
863 |
},
|
864 |
-
"aux":{
|
865 |
-
"p":0.9716713881,
|
866 |
-
"r":0.9122340426,
|
867 |
-
"f":0.9410150892
|
868 |
-
},
|
869 |
-
"expl":{
|
870 |
-
"p":0.5294117647,
|
871 |
-
"r":0.4186046512,
|
872 |
-
"f":0.4675324675
|
873 |
-
},
|
874 |
-
"appos":{
|
875 |
-
"p":0.4347826087,
|
876 |
-
"r":0.396039604,
|
877 |
-
"f":0.414507772
|
878 |
-
},
|
879 |
"xcomp":{
|
880 |
-
"p":0.
|
881 |
-
"r":0.
|
882 |
-
"f":0.
|
883 |
-
},
|
884 |
-
"csubj":{
|
885 |
-
"p":0.7966101695,
|
886 |
-
"r":0.746031746,
|
887 |
-
"f":0.7704918033
|
888 |
-
},
|
889 |
-
"nmod:agent":{
|
890 |
-
"p":0.7285714286,
|
891 |
-
"r":0.7846153846,
|
892 |
-
"f":0.7555555556
|
893 |
-
},
|
894 |
-
"aux:pass":{
|
895 |
-
"p":0.7769784173,
|
896 |
-
"r":0.9,
|
897 |
-
"f":0.833976834
|
898 |
},
|
899 |
-
"
|
900 |
"p":0.0,
|
901 |
"r":0.0,
|
902 |
"f":0.0
|
903 |
},
|
904 |
-
"
|
905 |
-
"p":0
|
906 |
-
"r":0.
|
907 |
-
"f":0.
|
908 |
},
|
909 |
-
"
|
910 |
"p":0.0,
|
911 |
"r":0.0,
|
912 |
"f":0.0
|
913 |
},
|
914 |
-
"expl:pass":{
|
915 |
-
"p":0.6734693878,
|
916 |
-
"r":0.7252747253,
|
917 |
-
"f":0.6984126984
|
918 |
-
},
|
919 |
-
"ccomp:pmod":{
|
920 |
-
"p":0.4,
|
921 |
-
"r":0.2666666667,
|
922 |
-
"f":0.32
|
923 |
-
},
|
924 |
"compound":{
|
925 |
-
"p":0.
|
926 |
-
"r":0.
|
927 |
-
"f":0.
|
928 |
},
|
929 |
-
"
|
930 |
"p":0.0,
|
931 |
"r":0.0,
|
932 |
"f":0.0
|
933 |
},
|
934 |
-
"
|
935 |
-
"p":0.3333333333,
|
936 |
-
"r":0.1,
|
937 |
-
"f":0.1538461538
|
938 |
-
},
|
939 |
-
"csubj:pass":{
|
940 |
"p":0.25,
|
941 |
"r":0.3333333333,
|
942 |
"f":0.2857142857
|
943 |
},
|
944 |
-
"
|
945 |
"p":0.0,
|
946 |
"r":0.0,
|
947 |
"f":0.0
|
948 |
},
|
949 |
-
"
|
950 |
"p":0.0,
|
951 |
"r":0.0,
|
952 |
"f":0.0
|
953 |
}
|
954 |
},
|
955 |
"ents_per_type":{
|
956 |
"DATETIME":{
|
957 |
-
"p":0.
|
958 |
-
"r":0.
|
959 |
-
"f":0.
|
960 |
},
|
961 |
"ORGANIZATION":{
|
962 |
-
"p":0.
|
963 |
-
"r":0.
|
964 |
-
"f":0.
|
965 |
},
|
966 |
"FACILITY":{
|
967 |
-
"p":0.
|
968 |
-
"r":0.
|
969 |
-
"f":0.
|
970 |
},
|
971 |
"NUMERIC_VALUE":{
|
972 |
-
"p":0.
|
973 |
-
"r":0.
|
974 |
-
"f":0.
|
975 |
},
|
976 |
"ORDINAL":{
|
977 |
-
"p":0.
|
978 |
"r":0.8363636364,
|
979 |
-
"f":0.
|
980 |
},
|
981 |
"EVENT":{
|
982 |
-
"p":0.
|
983 |
-
"r":0.
|
984 |
-
"f":0.
|
985 |
},
|
986 |
"GPE":{
|
987 |
-
"p":0.
|
988 |
-
"r":0.
|
989 |
-
"f":0.
|
990 |
},
|
991 |
"PERSON":{
|
992 |
-
"p":0.
|
993 |
-
"r":0.
|
994 |
-
"f":0.
|
995 |
},
|
996 |
"NAT_REL_POL":{
|
997 |
-
"p":0.
|
998 |
"r":0.9066666667,
|
999 |
-
"f":0.
|
1000 |
},
|
1001 |
"MONEY":{
|
1002 |
-
"p":0.
|
1003 |
-
"r":0.
|
1004 |
-
"f":0.
|
1005 |
-
},
|
1006 |
-
"PRODUCT":{
|
1007 |
-
"p":0.6260162602,
|
1008 |
-
"r":0.5620437956,
|
1009 |
-
"f":0.5923076923
|
1010 |
},
|
1011 |
"LOC":{
|
1012 |
-
"p":0.
|
1013 |
-
"r":0.
|
1014 |
-
"f":0.
|
1015 |
},
|
1016 |
"WORK_OF_ART":{
|
1017 |
-
"p":0.
|
1018 |
-
"r":0.
|
1019 |
-
"f":0.
|
1020 |
},
|
1021 |
"QUANTITY":{
|
1022 |
-
"p":0.
|
1023 |
-
"r":0
|
1024 |
-
"f":0.
|
1025 |
-
},
|
1026 |
-
"PERIOD":{
|
1027 |
-
"p":0.9428571429,
|
1028 |
-
"r":0.7857142857,
|
1029 |
-
"f":0.8571428571
|
1030 |
},
|
1031 |
"LANGUAGE":{
|
1032 |
-
"p":0.
|
1033 |
-
"r":0
|
1034 |
-
"f":0.
|
1035 |
}
|
1036 |
-
}
|
|
|
1037 |
},
|
1038 |
"sources":[
|
1039 |
{
|
@@ -1043,7 +1051,7 @@
|
|
1043 |
"author":"Michal M\u011bchura"
|
1044 |
},
|
1045 |
{
|
1046 |
-
"name":"UD Romanian RRT v2.
|
1047 |
"url":"https://github.com/UniversalDependencies/UD_Romanian-RRT",
|
1048 |
"license":"CC BY-SA 4.0",
|
1049 |
"author":"Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin"
|
|
|
1 |
{
|
2 |
"lang":"ro",
|
3 |
"name":"core_news_lg",
|
4 |
+
"version":"3.2.0",
|
5 |
"description":"Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
|
6 |
"author":"Explosion",
|
7 |
"email":"contact@explosion.ai",
|
8 |
"url":"https://explosion.ai",
|
9 |
"license":"CC BY-SA 4.0",
|
10 |
+
"spacy_version":">=3.2.0,<3.3.0",
|
11 |
+
"spacy_git_version":"bb26550e2",
|
12 |
"vectors":{
|
13 |
"width":300,
|
14 |
"vectors":500000,
|
|
|
30 |
"Afp",
|
31 |
"Afp-p-n",
|
32 |
"Afp-poy",
|
33 |
+
"Afp-srn",
|
34 |
"Afpf--n",
|
35 |
"Afpfp-n",
|
36 |
"Afpfp-ny",
|
|
|
132 |
"Ds2ms-s",
|
133 |
"Ds3---p",
|
134 |
"Ds3---s",
|
135 |
+
"Ds3---sy",
|
136 |
"Ds3fp-s",
|
137 |
"Ds3fsos",
|
138 |
"Ds3fsrs",
|
|
|
161 |
"LSQR",
|
162 |
"LT",
|
163 |
"M",
|
|
|
164 |
"Mc-p-d",
|
165 |
"Mc-p-l",
|
166 |
+
"Mc-s-b",
|
167 |
+
"Mc-s-d",
|
168 |
+
"Mc-s-l",
|
169 |
"Mcfp-l",
|
170 |
"Mcfp-ln",
|
171 |
"Mcfprln",
|
172 |
"Mcfprly",
|
173 |
"Mcfsoln",
|
174 |
+
"Mcfsrl",
|
175 |
"Mcfsrln",
|
176 |
+
"Mcfsrly",
|
177 |
"Mcmp-l",
|
178 |
"Mcms-ln",
|
179 |
"Mcmsrl",
|
180 |
+
"Mcmsrln",
|
181 |
"Mcmsrly",
|
182 |
"Mffprln",
|
183 |
"Mffsrln",
|
|
|
250 |
"Pd3mpr--y",
|
251 |
"Pd3mso",
|
252 |
"Pd3msr",
|
|
|
253 |
"Pi3--r",
|
254 |
"Pi3-po",
|
255 |
"Pi3-so",
|
|
|
295 |
"Pp3-po--------s",
|
296 |
"Pp3-sd--------w",
|
297 |
"Pp3-sd--y-----w",
|
298 |
+
"Pp3-so--------s",
|
299 |
"Pp3fpa--------w",
|
300 |
"Pp3fpa--y-----w",
|
301 |
"Pp3fpr--------s",
|
|
|
322 |
"Ps2fp-s",
|
323 |
"Ps2fsrp",
|
324 |
"Ps2fsrs",
|
|
|
325 |
"Ps3---p",
|
326 |
"Ps3---s",
|
327 |
"Ps3fp-s",
|
|
|
354 |
"RPAR",
|
355 |
"RSQR",
|
356 |
"Rc",
|
|
|
357 |
"Rgp",
|
358 |
"Rgpy",
|
359 |
"Rgs",
|
|
|
411 |
"Va--3s",
|
412 |
"Va--3s----y",
|
413 |
"Vag",
|
414 |
+
"Vag-------y",
|
415 |
"Vaii1",
|
416 |
"Vaii2s",
|
417 |
"Vaii3p",
|
|
|
481 |
"Vmp--sm",
|
482 |
"Vmp--sm---y",
|
483 |
"Vmsp1p",
|
484 |
+
"Vmsp2p",
|
485 |
"Vmsp2s",
|
486 |
"Vmsp3",
|
487 |
"Vmsp3-----y",
|
|
|
494 |
"Ynmsoy",
|
495 |
"Ynmsry",
|
496 |
"Yp",
|
497 |
+
"Yp,Yn",
|
498 |
"Yp-sr",
|
499 |
"Yr"
|
500 |
],
|
|
|
532 |
"iobj",
|
533 |
"mark",
|
534 |
"nmod",
|
|
|
|
|
535 |
"nmod:tmod",
|
536 |
"nsubj",
|
537 |
"nsubj:pass",
|
538 |
"nummod",
|
539 |
"obj",
|
540 |
"obl",
|
541 |
+
"obl:agent",
|
542 |
+
"obl:pmod",
|
543 |
"orphan",
|
544 |
"parataxis",
|
545 |
"punct",
|
|
|
597 |
],
|
598 |
"performance":{
|
599 |
"token_acc":0.9990029326,
|
600 |
+
"token_p":0.9967350492,
|
601 |
+
"token_r":0.9957244934,
|
602 |
+
"token_f":0.9959492157,
|
603 |
+
"tag_acc":0.9664291788,
|
604 |
+
"sents_p":0.954787234,
|
605 |
+
"sents_r":0.954787234,
|
606 |
+
"sents_f":0.954787234,
|
607 |
+
"dep_uas":0.8897462438,
|
608 |
+
"dep_las":0.8389686971,
|
609 |
+
"dep_las_per_type":{
|
610 |
+
"root":{
|
611 |
+
"p":0.8786231884,
|
612 |
+
"r":0.9133709981,
|
613 |
+
"f":0.8956602031
|
614 |
},
|
615 |
+
"mark":{
|
616 |
+
"p":0.9288389513,
|
617 |
+
"r":0.9358490566,
|
618 |
+
"f":0.9323308271
|
619 |
},
|
620 |
+
"case":{
|
621 |
+
"p":0.9638554217,
|
622 |
+
"r":0.959880015,
|
623 |
+
"f":0.9618636107
|
624 |
},
|
625 |
+
"nmod:tmod":{
|
626 |
+
"p":0.6842105263,
|
627 |
+
"r":0.1092436975,
|
628 |
+
"f":0.1884057971
|
629 |
},
|
630 |
+
"amod":{
|
631 |
+
"p":0.9172297297,
|
632 |
+
"r":0.9250425894,
|
633 |
+
"f":0.9211195929
|
634 |
},
|
635 |
+
"nsubj":{
|
636 |
+
"p":0.8803986711,
|
637 |
+
"r":0.8372827804,
|
638 |
+
"f":0.8582995951
|
639 |
},
|
640 |
+
"nmod":{
|
641 |
+
"p":0.8218838527,
|
642 |
+
"r":0.8286326312,
|
643 |
+
"f":0.8252444444
|
644 |
},
|
645 |
+
"aux":{
|
646 |
+
"p":0.9867924528,
|
647 |
+
"r":0.9561243144,
|
648 |
+
"f":0.9712163417
|
649 |
},
|
650 |
+
"advcl":{
|
651 |
+
"p":0.5862068966,
|
652 |
+
"r":0.6390977444,
|
653 |
+
"f":0.6115107914
|
654 |
},
|
655 |
+
"obj":{
|
656 |
+
"p":0.8326180258,
|
657 |
+
"r":0.896073903,
|
658 |
+
"f":0.8631813126
|
659 |
},
|
660 |
+
"det":{
|
661 |
+
"p":0.9575688073,
|
662 |
+
"r":0.9456398641,
|
663 |
+
"f":0.9515669516
|
664 |
},
|
665 |
+
"cc":{
|
666 |
+
"p":0.9340425532,
|
667 |
+
"r":0.9164926931,
|
668 |
+
"f":0.9251844046
|
669 |
},
|
670 |
+
"conj":{
|
671 |
+
"p":0.6115288221,
|
672 |
+
"r":0.5654692932,
|
673 |
+
"f":0.5875978326
|
674 |
},
|
675 |
+
"nummod":{
|
676 |
+
"p":0.887675507,
|
677 |
+
"r":0.8835403727,
|
678 |
+
"f":0.8856031128
|
679 |
},
|
680 |
+
"acl":{
|
681 |
+
"p":0.8063583815,
|
682 |
+
"r":0.7209302326,
|
683 |
+
"f":0.761255116
|
684 |
},
|
685 |
+
"advmod":{
|
686 |
+
"p":0.8117048346,
|
687 |
+
"r":0.8416886544,
|
688 |
+
"f":0.8264248705
|
689 |
},
|
690 |
+
"obl":{
|
691 |
+
"p":0.6821052632,
|
692 |
+
"r":0.8223350254,
|
693 |
+
"f":0.7456846951
|
694 |
},
|
695 |
+
"expl:pass":{
|
696 |
+
"p":0.8085106383,
|
697 |
+
"r":0.7037037037,
|
698 |
+
"f":0.7524752475
|
699 |
},
|
700 |
+
"nsubj:pass":{
|
701 |
+
"p":0.8,
|
702 |
+
"r":0.756097561,
|
703 |
+
"f":0.7774294671
|
704 |
},
|
705 |
+
"fixed":{
|
706 |
+
"p":0.9,
|
707 |
+
"r":0.8562367865,
|
708 |
+
"f":0.8775731311
|
709 |
},
|
710 |
+
"appos":{
|
711 |
+
"p":0.4956896552,
|
712 |
+
"r":0.4389312977,
|
713 |
+
"f":0.4655870445
|
714 |
},
|
715 |
+
"parataxis":{
|
716 |
+
"p":0.1627906977,
|
717 |
+
"r":0.2,
|
718 |
+
"f":0.1794871795
|
719 |
},
|
720 |
+
"aux:pass":{
|
721 |
+
"p":0.9125,
|
722 |
+
"r":0.9733333333,
|
723 |
+
"f":0.9419354839
|
724 |
+
},
|
725 |
+
"nmod:agent":{
|
726 |
"p":0.0,
|
727 |
"r":0.0,
|
728 |
"f":0.0
|
729 |
},
|
730 |
+
"ccomp":{
|
731 |
+
"p":0.8759689922,
|
732 |
+
"r":0.8759689922,
|
733 |
+
"f":0.8759689922
|
734 |
},
|
735 |
+
"nmod:pmod":{
|
736 |
"p":0.0,
|
737 |
"r":0.0,
|
738 |
"f":0.0
|
739 |
},
|
740 |
+
"iobj":{
|
741 |
+
"p":0.8157894737,
|
742 |
+
"r":0.7654320988,
|
743 |
+
"f":0.7898089172
|
744 |
},
|
745 |
"flat":{
|
746 |
+
"p":0.7557251908,
|
747 |
+
"r":0.7815789474,
|
748 |
+
"f":0.7684346701
|
749 |
},
|
750 |
+
"cop":{
|
751 |
+
"p":0.8524590164,
|
752 |
+
"r":0.8387096774,
|
753 |
+
"f":0.8455284553
|
754 |
},
|
755 |
+
"csubj":{
|
756 |
+
"p":0.8235294118,
|
757 |
+
"r":0.6666666667,
|
758 |
+
"f":0.7368421053
|
759 |
},
|
760 |
+
"obl:agent":{
|
761 |
+
"p":0.0,
|
762 |
+
"r":0.0,
|
763 |
+
"f":0.0
|
764 |
},
|
765 |
+
"dep":{
|
766 |
+
"p":0.0,
|
767 |
+
"r":0.0,
|
768 |
+
"f":0.0
|
769 |
},
|
770 |
"expl:pv":{
|
771 |
+
"p":0.7564102564,
|
772 |
+
"r":0.8550724638,
|
773 |
+
"f":0.8027210884
|
774 |
},
|
775 |
+
"expl":{
|
776 |
+
"p":0.6875,
|
777 |
+
"r":0.8148148148,
|
778 |
+
"f":0.7457627119
|
779 |
},
|
780 |
+
"obl:pmod":{
|
781 |
+
"p":0.0,
|
782 |
+
"r":0.0,
|
783 |
+
"f":0.0
|
784 |
},
|
785 |
"expl:poss":{
|
786 |
+
"p":0.9655172414,
|
787 |
+
"r":0.9032258065,
|
788 |
+
"f":0.9333333333
|
789 |
},
|
790 |
+
"goeswith":{
|
791 |
"p":0.0,
|
792 |
"r":0.0,
|
793 |
"f":0.0
|
794 |
},
|
795 |
"xcomp":{
|
796 |
+
"p":0.5806451613,
|
797 |
+
"r":0.6666666667,
|
798 |
+
"f":0.6206896552
|
799 |
},
|
800 |
+
"orphan":{
|
801 |
"p":0.0,
|
802 |
"r":0.0,
|
803 |
"f":0.0
|
804 |
},
|
805 |
+
"expl:impers":{
|
806 |
+
"p":1.0,
|
807 |
+
"r":0.3333333333,
|
808 |
+
"f":0.5
|
809 |
},
|
810 |
+
"csubj:pass":{
|
811 |
"p":0.0,
|
812 |
"r":0.0,
|
813 |
"f":0.0
|
814 |
},
|
815 |
"compound":{
|
816 |
+
"p":0.5714285714,
|
817 |
+
"r":0.5714285714,
|
818 |
+
"f":0.5714285714
|
819 |
},
|
820 |
+
"list":{
|
821 |
"p":0.0,
|
822 |
"r":0.0,
|
823 |
"f":0.0
|
824 |
},
|
825 |
+
"ccomp:pmod":{
|
826 |
"p":0.25,
|
827 |
"r":0.3333333333,
|
828 |
"f":0.2857142857
|
829 |
},
|
830 |
+
"cc:preconj":{
|
831 |
"p":0.0,
|
832 |
"r":0.0,
|
833 |
"f":0.0
|
834 |
+
}
|
835 |
+
},
|
836 |
+
"pos_acc":0.9405873228,
|
837 |
+
"morph_acc":0.9510657636,
|
838 |
+
"morph_micro_p":0.9896160458,
|
839 |
+
"morph_micro_r":0.9582489383,
|
840 |
+
"morph_micro_f":0.9706797273,
|
841 |
+
"morph_per_feat":{
|
842 |
+
"Case":{
|
843 |
+
"p":0.9938697318,
|
844 |
+
"r":0.9896985883,
|
845 |
+
"f":0.9917797744
|
846 |
+
},
|
847 |
+
"Gender":{
|
848 |
+
"p":0.991821842,
|
849 |
+
"r":0.9854981873,
|
850 |
+
"f":0.9886499028
|
851 |
+
},
|
852 |
+
"Number":{
|
853 |
+
"p":0.9894903379,
|
854 |
+
"r":0.922363847,
|
855 |
+
"f":0.9547486643
|
856 |
+
},
|
857 |
+
"Person":{
|
858 |
+
"p":0.9911452184,
|
859 |
+
"r":0.9893930466,
|
860 |
+
"f":0.9902683574
|
861 |
+
},
|
862 |
+
"PronType":{
|
863 |
|