osanseviero HF staff commited on
Commit
239866e
1 Parent(s): 70692c8

Update spaCy pipeline

Browse files
LICENSES_SOURCES CHANGED
@@ -549,7 +549,7 @@ terms of this License.```
549
 
550
 
551
 
552
- # UD Romanian RRT v2.5
553
 
554
  * Author: Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin
555
  * URL: https://github.com/UniversalDependencies/UD_Romanian-RRT
 
549
 
550
 
551
 
552
+ # UD Romanian RRT v2.8
553
 
554
  * Author: Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin
555
  * URL: https://github.com/UniversalDependencies/UD_Romanian-RRT
README.md CHANGED
@@ -4,7 +4,7 @@ tags:
4
  - token-classification
5
  language:
6
  - ro
7
- license: CC-BY-SA-4.0
8
  model-index:
9
  - name: ro_core_news_lg
10
  results:
@@ -14,47 +14,47 @@ model-index:
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
- value: 0.7588147037
18
  - name: NER Recall
19
  type: recall
20
- value: 0.7771801767
21
  - name: NER F Score
22
  type: f_score
23
- value: 0.7678876447
24
  - task:
25
  name: POS
26
  type: token-classification
27
  metrics:
28
  - name: POS Accuracy
29
  type: accuracy
30
- value: 0.975315026
31
  - task:
32
  name: SENTER
33
  type: token-classification
34
  metrics:
35
  - name: SENTER Precision
36
  type: precision
37
- value: 0.9533954727
38
  - name: SENTER Recall
39
  type: recall
40
- value: 0.9521276596
41
  - name: SENTER F Score
42
  type: f_score
43
- value: 0.9527611444
44
  - task:
45
  name: UNLABELED_DEPENDENCIES
46
  type: token-classification
47
  metrics:
48
  - name: Unlabeled Dependencies Accuracy
49
  type: accuracy
50
- value: 0.8904573687
51
  - task:
52
  name: LABELED_DEPENDENCIES
53
  type: token-classification
54
  metrics:
55
  - name: Labeled Dependencies Accuracy
56
  type: accuracy
57
- value: 0.8904573687
58
  ---
59
  ### Details: https://spacy.io/models/ro#ro_core_news_lg
60
 
@@ -63,12 +63,12 @@ Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter
63
  | Feature | Description |
64
  | --- | --- |
65
  | **Name** | `ro_core_news_lg` |
66
- | **Version** | `3.1.0` |
67
- | **spaCy** | `>=3.1.0,<3.2.0` |
68
  | **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
69
  | **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
70
  | **Vectors** | 500000 keys, 500000 unique vectors (300 dimensions) |
71
- | **Sources** | [Lemmatization Lists](https://github.com/michmech/lemmatization-lists/) (Michal Měchura)<br />[UD Romanian RRT v2.5](https://github.com/UniversalDependencies/UD_Romanian-RRT) (Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin)<br />[RONEC - the Romanian Named Entity Corpus (ca9ce460)](https://github.com/dumitrescustefan/ronec) (Dumitrescu, Stefan Daniel; Avram, Andrei-Marius; Morogan, Luciana; Toma; Stefan)<br />[Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)](https://spacy.io) (Explosion) |
72
  | **License** | `CC BY-SA 4.0` |
73
  | **Author** | [Explosion](https://explosion.ai) |
74
 
@@ -76,12 +76,12 @@ Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter
76
 
77
  <details>
78
 
79
- <summary>View label scheme (534 labels for 4 components)</summary>
80
 
81
  | Component | Labels |
82
  | --- | --- |
83
- | **`tagger`** | `ARROW`, `Af`, `Afcfp-n`, `Afcfson`, `Afcfsrn`, `Afcmpoy`, `Afcms-n`, `Afp`, `Afp-p-n`, `Afp-poy`, `Afpf--n`, `Afpfp-n`, `Afpfp-ny`, `Afpfpoy`, `Afpfpry`, `Afpfson`, `Afpfsoy`, `Afpfsrn`, `Afpfsry`, `Afpm--n`, `Afpmp-n`, `Afpmpoy`, `Afpmpry`, `Afpms-n`, `Afpmsoy`, `Afpmsry`, `Afsfp-n`, `Afsfsrn`, `BULLET`, `COLON`, `COMMA`, `Ccssp`, `Ccsspy`, `Crssp`, `Csssp`, `Cssspy`, `DASH`, `DBLQ`, `Dd3-po---e`, `Dd3-po---o`, `Dd3fpo`, `Dd3fpr`, `Dd3fpr---e`, `Dd3fpr---o`, `Dd3fpr--y`, `Dd3fso`, `Dd3fso---e`, `Dd3fsr`, `Dd3fsr---e`, `Dd3fsr---o`, `Dd3fsr--yo`, `Dd3mpo`, `Dd3mpr`, `Dd3mpr---e`, `Dd3mpr---o`, `Dd3mso---e`, `Dd3msr`, `Dd3msr---e`, `Dd3msr---o`, `Dh1ms`, `Dh3fp`, `Dh3fso`, `Dh3fsr`, `Dh3mp`, `Dh3ms`, `Di3`, `Di3-----y`, `Di3--r---e`, `Di3-po`, `Di3-po---e`, `Di3-sr`, `Di3-sr---e`, `Di3-sr--y`, `Di3fp`, `Di3fpr`, `Di3fpr---e`, `Di3fso`, `Di3fso---e`, `Di3fsr`, `Di3fsr---e`, `Di3mp`, `Di3mpr`, `Di3mpr---e`, `Di3ms`, `Di3ms----e`, `Di3mso---e`, `Di3msr`, `Di3msr---e`, `Ds1fp-p`, `Ds1fp-s`, `Ds1fsop`, `Ds1fsos`, `Ds1fsrp`, `Ds1fsrs`, `Ds1fsrs-y`, `Ds1mp-p`, `Ds1mp-s`, `Ds1ms-p`, `Ds1ms-s`, `Ds1msrs-y`, `Ds2---s`, `Ds2fp-p`, `Ds2fp-s`, `Ds2fsrp`, `Ds2fsrs`, `Ds2mp-p`, `Ds2mp-s`, `Ds2ms-p`, `Ds2ms-s`, `Ds3---p`, `Ds3---s`, `Ds3fp-s`, `Ds3fsos`, `Ds3fsrs`, `Ds3mp-s`, `Ds3ms-s`, `Dw3--r---e`, `Dw3-po---e`, `Dw3fpr`, `Dw3fso---e`, `Dw3fsr`, `Dw3mpr`, `Dw3mso---e`, `Dw3msr`, `Dz3fsr---e`, `Dz3mso---e`, `Dz3msr---e`, `EQUAL`, `EXCL`, `EXCLHELLIP`, `GE`, `GT`, `HELLIP`, `I`, `LCURL`, `LPAR`, `LSQR`, `LT`, `M`, `Mc`, `Mc-p-d`, `Mc-p-l`, `Mcfp-l`, `Mcfp-ln`, `Mcfprln`, `Mcfprly`, `Mcfsoln`, `Mcfsrln`, `Mcmp-l`, `Mcms-ln`, `Mcmsrl`, `Mcmsrly`, `Mffprln`, `Mffsrln`, `Mlfpo`, `Mlfpr`, `Mlmpr`, `Mo---l`, `Mo---ln`, `Mo-s-r`, `Mofp-ln`, `Mofpoly`, `Mofprly`, `Mofs-l`, `Mofsoln`, `Mofsoly`, `Mofsrln`, `Mofsrly`, `Mompoly`, `Momprly`, `Moms-l`, `Moms-ln`, `Momsoly`, `Momsrly`, `Nc`, `Nc---n`, `Ncf--n`, `Ncfp-n`, `Ncfpoy`, `Ncfpry`, `Ncfs-n`, `Ncfson`, `Ncfsoy`, `Ncfsrn`, `Ncfsry`, `Ncfsryy`, `Ncfsvy`, `Ncm--n`, `Ncmp-n`, `Ncmpoy`, `Ncmpry`, `Ncms-n`, `Ncms-ny`, `Ncms-y`, `Ncmsoy`, `Ncmsrn`, `Ncmsry`, `Ncmsryy`, `Ncmsvn`, `Ncmsvy`, `Np`, `Npfson`, `Npfsoy`, `Npfsrn`, `Npfsry`, `Npmpoy`, `Npmpry`, `Npms-n`, `Npmsoy`, `Npmsry`, `PERCENT`, `PERIOD`, `PLUS`, `PLUSMINUS`, `Pd3-po`, `Pd3fpr`, `Pd3fso`, `Pd3fsr`, `Pd3mpo`, `Pd3mpr`, `Pd3mpr--y`, `Pd3mso`, `Pd3msr`, `Pi3`, `Pi3--r`, `Pi3-po`, `Pi3-so`, `Pi3-sr`, `Pi3fpr`, `Pi3fso`, `Pi3fsr`, `Pi3mpr`, `Pi3mso`, `Pi3msr`, `Pi3msr--y`, `Pp1-pa--------w`, `Pp1-pa--y-----w`, `Pp1-pd--------s`, `Pp1-pd--------w`, `Pp1-pd--y-----w`, `Pp1-pr--------s`, `Pp1-sa--------s`, `Pp1-sa--------w`, `Pp1-sa--y-----w`, `Pp1-sd--------s`, `Pp1-sd--------w`, `Pp1-sd--y-----w`, `Pp1-sn--------s`, `Pp2-----------s`, `Pp2-pa--------w`, `Pp2-pa--y-----w`, `Pp2-pd--------w`, `Pp2-pd--y-----w`, `Pp2-pr--------s`, `Pp2-sa--------s`, `Pp2-sa--------w`, `Pp2-sa--y-----w`, `Pp2-sd--------s`, `Pp2-sd--------w`, `Pp2-sd--y-----w`, `Pp2-sn--------s`, `Pp2-so--------s`, `Pp2-sr--------s`, `Pp3-p---------s`, `Pp3-pd--------w`, `Pp3-pd--y-----w`, `Pp3-po--------s`, `Pp3-sd--------w`, `Pp3-sd--y-----w`, `Pp3fpa--------w`, `Pp3fpa--y-----w`, `Pp3fpr--------s`, `Pp3fs---------s`, `Pp3fsa--------w`, `Pp3fsa--y-----w`, `Pp3fso--------s`, `Pp3fsr--------s`, `Pp3fsr--y-----s`, `Pp3mpa--------w`, `Pp3mpa--y-----w`, `Pp3mpr--------s`, `Pp3ms---------s`, `Pp3msa--------w`, `Pp3msa--y-----w`, `Pp3mso--------s`, `Pp3msr--------s`, `Pp3msr--y-----s`, `Ps1fp-s`, `Ps1fsrp`, `Ps1fsrs`, `Ps1mp-p`, `Ps1ms-p`, `Ps2fp-s`, `Ps2fsrp`, `Ps2fsrs`, `Ps2ms-s`, `Ps3---p`, `Ps3---s`, `Ps3fp-s`, `Ps3fsrs`, `Ps3mp-s`, `Ps3ms-s`, `Pw3--r`, `Pw3-po`, `Pw3-so`, `Pw3fpr`, `Pw3fso`, `Pw3mpr`, `Pw3mso`, `Px3--a--------s`, `Px3--a--------w`, `Px3--a--y-----w`, `Px3--d--------w`, `Px3--d--y-----w`, `Pz3-sr`, `Pz3fsr`, `QUEST`, `QUOT`, `Qf`, `Qn`, `Qs`, `Qs-y`, `Qz`, `Qz-y`, `RCURL`, `RPAR`, `RSQR`, `Rc`, `Rgc`, `Rgp`, `Rgpy`, `Rgs`, `Rp`, `Rw`, `Rw-y`, `Rz`, `SCOLON`, `SLASH`, `STAR`, `Sp`, `Spsa`, `Spsay`, `Spsd`, `Spsg`, `Td-po`, `Tdfpr`, `Tdfso`, `Tdfsr`, `Tdmpr`, `Tdmso`, `Tdmsr`, `Tf-so`, `Tffpoy`, `Tffpry`, `Tffs-y`, `Tfmpoy`, `Tfms-y`, `Tfmsoy`, `Tfmsry`, `Ti-po`, `Tifp-y`, `Tifso`, `Tifsr`, `Timso`, `Timsr`, `Tsfp`, `Tsfs`, `Tsmp`, `Tsms`, `UNDERSC`, `Va--1`, `Va--1-----y`, `Va--1p`, `Va--1s`, `Va--1s----y`, `Va--2p`, `Va--2p----y`, `Va--2s`, `Va--2s----y`, `Va--3`, `Va--3-----y`, `Va--3p`, `Va--3p----y`, `Va--3s`, `Va--3s----y`, `Vag`, `Vaii1`, `Vaii2s`, `Vaii3p`, `Vaii3s`, `Vail3p`, `Vail3s`, `Vaip1p`, `Vaip1s`, `Vaip2p`, `Vaip2s`, `Vaip3p`, `Vaip3p----y`, `Vaip3s`, `Vaip3s----y`, `Vais3p`, `Vais3s`, `Vam-2s`, `Vanp`, `Vap--sm`, `Vasp1p`, `Vasp1s`, `Vasp2p`, `Vasp2s`, `Vasp3`, `Vmg`, `Vmg-------y`, `Vmii1`, `Vmii1-----y`, `Vmii2p`, `Vmii2s`, `Vmii3p`, `Vmii3p----y`, `Vmii3s`, `Vmii3s----y`, `Vmil1`, `Vmil1p`, `Vmil2s`, `Vmil3p`, `Vmil3p----y`, `Vmil3s`, `Vmil3s----y`, `Vmip1p`, `Vmip1p----y`, `Vmip1s`, `Vmip1s----y`, `Vmip2p`, `Vmip2s`, `Vmip2s----y`, `Vmip3`, `Vmip3-----y`, `Vmip3p`, `Vmip3s`, `Vmip3s----y`, `Vmis1p`, `Vmis1s`, `Vmis3p`, `Vmis3p----y`, `Vmis3s`, `Vmis3s----y`, `Vmm-2p`, `Vmm-2s`, `Vmnp`, `Vmnp------y`, `Vmp--pf`, `Vmp--pm`, `Vmp--sf`, `Vmp--sm`, `Vmp--sm---y`, `Vmsp1p`, `Vmsp1s`, `Vmsp2s`, `Vmsp3`, `Vmsp3-----y`, `X`, `Y`, `Ya`, `Yn`, `Ynfsoy`, `Ynfsry`, `Ynmsoy`, `Ynmsry`, `Yp`, `Yp-sr`, `Yr` |
84
- | **`parser`** | `ROOT`, `acl`, `advcl`, `advcl:tcl`, `advmod`, `advmod:tmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `cc`, `cc:preconj`, `ccomp`, `ccomp:pmod`, `compound`, `conj`, `cop`, `csubj`, `csubj:pass`, `dep`, `det`, `expl`, `expl:impers`, `expl:pass`, `expl:poss`, `expl:pv`, `fixed`, `flat`, `goeswith`, `iobj`, `mark`, `nmod`, `nmod:agent`, `nmod:pmod`, `nmod:tmod`, `nsubj`, `nsubj:pass`, `nummod`, `obj`, `obl`, `orphan`, `parataxis`, `punct`, `vocative`, `xcomp` |
85
  | **`senter`** | `I`, `S` |
86
  | **`ner`** | `DATETIME`, `EVENT`, `FACILITY`, `GPE`, `LANGUAGE`, `LOC`, `MONEY`, `NAT_REL_POL`, `NUMERIC_VALUE`, `ORDINAL`, `ORGANIZATION`, `PERIOD`, `PERSON`, `PRODUCT`, `QUANTITY`, `WORK_OF_ART` |
87
 
@@ -92,15 +92,21 @@ Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter
92
  | Type | Score |
93
  | --- | --- |
94
  | `TOKEN_ACC` | 99.90 |
95
- | `TAG_ACC` | 97.53 |
96
- | `POS_ACC` | 96.54 |
97
- | `MORPH_ACC` | 97.61 |
98
- | `LEMMA_ACC` | 81.87 |
99
- | `DEP_UAS` | 89.05 |
100
- | `DEP_LAS` | 84.67 |
101
- | `ENTS_P` | 75.88 |
102
- | `ENTS_R` | 77.72 |
103
- | `ENTS_F` | 76.79 |
104
- | `SENTS_P` | 95.34 |
105
- | `SENTS_R` | 95.21 |
106
- | `SENTS_F` | 95.28 |
 
 
 
 
 
 
 
4
  - token-classification
5
  language:
6
  - ro
7
+ license: cc-by-sa-4.0
8
  model-index:
9
  - name: ro_core_news_lg
10
  results:
 
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
+ value: 0.7550713749
18
  - name: NER Recall
19
  type: recall
20
+ value: 0.7721859393
21
  - name: NER F Score
22
  type: f_score
23
+ value: 0.7635327635
24
  - task:
25
  name: POS
26
  type: token-classification
27
  metrics:
28
  - name: POS Accuracy
29
  type: accuracy
30
+ value: 0.9664291788
31
  - task:
32
  name: SENTER
33
  type: token-classification
34
  metrics:
35
  - name: SENTER Precision
36
  type: precision
37
+ value: 0.954787234
38
  - name: SENTER Recall
39
  type: recall
40
+ value: 0.954787234
41
  - name: SENTER F Score
42
  type: f_score
43
+ value: 0.954787234
44
  - task:
45
  name: UNLABELED_DEPENDENCIES
46
  type: token-classification
47
  metrics:
48
  - name: Unlabeled Dependencies Accuracy
49
  type: accuracy
50
+ value: 0.8897462438
51
  - task:
52
  name: LABELED_DEPENDENCIES
53
  type: token-classification
54
  metrics:
55
  - name: Labeled Dependencies Accuracy
56
  type: accuracy
57
+ value: 0.8897462438
58
  ---
59
  ### Details: https://spacy.io/models/ro#ro_core_news_lg
60
 
 
63
  | Feature | Description |
64
  | --- | --- |
65
  | **Name** | `ro_core_news_lg` |
66
+ | **Version** | `3.2.0` |
67
+ | **spaCy** | `>=3.2.0,<3.3.0` |
68
  | **Default Pipeline** | `tok2vec`, `tagger`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
69
  | **Components** | `tok2vec`, `tagger`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
70
  | **Vectors** | 500000 keys, 500000 unique vectors (300 dimensions) |
71
+ | **Sources** | [Lemmatization Lists](https://github.com/michmech/lemmatization-lists/) (Michal Měchura)<br />[UD Romanian RRT v2.8](https://github.com/UniversalDependencies/UD_Romanian-RRT) (Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin)<br />[RONEC - the Romanian Named Entity Corpus (ca9ce460)](https://github.com/dumitrescustefan/ronec) (Dumitrescu, Stefan Daniel; Avram, Andrei-Marius; Morogan, Luciana; Toma; Stefan)<br />[Explosion fastText Vectors (cbow, OSCAR Common Crawl + Wikipedia)](https://spacy.io) (Explosion) |
72
  | **License** | `CC BY-SA 4.0` |
73
  | **Author** | [Explosion](https://explosion.ai) |
74
 
 
76
 
77
  <details>
78
 
79
+ <summary>View label scheme (541 labels for 4 components)</summary>
80
 
81
  | Component | Labels |
82
  | --- | --- |
83
+ | **`tagger`** | `ARROW`, `Af`, `Afcfp-n`, `Afcfson`, `Afcfsrn`, `Afcmpoy`, `Afcms-n`, `Afp`, `Afp-p-n`, `Afp-poy`, `Afp-srn`, `Afpf--n`, `Afpfp-n`, `Afpfp-ny`, `Afpfpoy`, `Afpfpry`, `Afpfson`, `Afpfsoy`, `Afpfsrn`, `Afpfsry`, `Afpm--n`, `Afpmp-n`, `Afpmpoy`, `Afpmpry`, `Afpms-n`, `Afpmsoy`, `Afpmsry`, `Afsfp-n`, `Afsfsrn`, `BULLET`, `COLON`, `COMMA`, `Ccssp`, `Ccsspy`, `Crssp`, `Csssp`, `Cssspy`, `DASH`, `DBLQ`, `Dd3-po---e`, `Dd3-po---o`, `Dd3fpo`, `Dd3fpr`, `Dd3fpr---e`, `Dd3fpr---o`, `Dd3fpr--y`, `Dd3fso`, `Dd3fso---e`, `Dd3fsr`, `Dd3fsr---e`, `Dd3fsr---o`, `Dd3fsr--yo`, `Dd3mpo`, `Dd3mpr`, `Dd3mpr---e`, `Dd3mpr---o`, `Dd3mso---e`, `Dd3msr`, `Dd3msr---e`, `Dd3msr---o`, `Dh1ms`, `Dh3fp`, `Dh3fso`, `Dh3fsr`, `Dh3mp`, `Dh3ms`, `Di3`, `Di3-----y`, `Di3--r---e`, `Di3-po`, `Di3-po---e`, `Di3-sr`, `Di3-sr---e`, `Di3-sr--y`, `Di3fp`, `Di3fpr`, `Di3fpr---e`, `Di3fso`, `Di3fso---e`, `Di3fsr`, `Di3fsr---e`, `Di3mp`, `Di3mpr`, `Di3mpr---e`, `Di3ms`, `Di3ms----e`, `Di3mso---e`, `Di3msr`, `Di3msr---e`, `Ds1fp-p`, `Ds1fp-s`, `Ds1fsop`, `Ds1fsos`, `Ds1fsrp`, `Ds1fsrs`, `Ds1fsrs-y`, `Ds1mp-p`, `Ds1mp-s`, `Ds1ms-p`, `Ds1ms-s`, `Ds1msrs-y`, `Ds2---s`, `Ds2fp-p`, `Ds2fp-s`, `Ds2fsrp`, `Ds2fsrs`, `Ds2mp-p`, `Ds2mp-s`, `Ds2ms-p`, `Ds2ms-s`, `Ds3---p`, `Ds3---s`, `Ds3---sy`, `Ds3fp-s`, `Ds3fsos`, `Ds3fsrs`, `Ds3mp-s`, `Ds3ms-s`, `Dw3--r---e`, `Dw3-po---e`, `Dw3fpr`, `Dw3fso---e`, `Dw3fsr`, `Dw3mpr`, `Dw3mso---e`, `Dw3msr`, `Dz3fsr---e`, `Dz3mso---e`, `Dz3msr---e`, `EQUAL`, `EXCL`, `EXCLHELLIP`, `GE`, `GT`, `HELLIP`, `I`, `LCURL`, `LPAR`, `LSQR`, `LT`, `M`, `Mc-p-d`, `Mc-p-l`, `Mc-s-b`, `Mc-s-d`, `Mc-s-l`, `Mcfp-l`, `Mcfp-ln`, `Mcfprln`, `Mcfprly`, `Mcfsoln`, `Mcfsrl`, `Mcfsrln`, `Mcfsrly`, `Mcmp-l`, `Mcms-ln`, `Mcmsrl`, `Mcmsrln`, `Mcmsrly`, `Mffprln`, `Mffsrln`, `Mlfpo`, `Mlfpr`, `Mlmpr`, `Mo---l`, `Mo---ln`, `Mo-s-r`, `Mofp-ln`, `Mofpoly`, `Mofprly`, `Mofs-l`, `Mofsoln`, `Mofsoly`, `Mofsrln`, `Mofsrly`, `Mompoly`, `Momprly`, `Moms-l`, `Moms-ln`, `Momsoly`, `Momsrly`, `Nc`, `Nc---n`, `Ncf--n`, `Ncfp-n`, `Ncfpoy`, `Ncfpry`, `Ncfs-n`, `Ncfson`, `Ncfsoy`, `Ncfsrn`, `Ncfsry`, `Ncfsryy`, `Ncfsvy`, `Ncm--n`, `Ncmp-n`, `Ncmpoy`, `Ncmpry`, `Ncms-n`, `Ncms-ny`, `Ncms-y`, `Ncmsoy`, `Ncmsrn`, `Ncmsry`, `Ncmsryy`, `Ncmsvn`, `Ncmsvy`, `Np`, `Npfson`, `Npfsoy`, `Npfsrn`, `Npfsry`, `Npmpoy`, `Npmpry`, `Npms-n`, `Npmsoy`, `Npmsry`, `PERCENT`, `PERIOD`, `PLUS`, `PLUSMINUS`, `Pd3-po`, `Pd3fpr`, `Pd3fso`, `Pd3fsr`, `Pd3mpo`, `Pd3mpr`, `Pd3mpr--y`, `Pd3mso`, `Pd3msr`, `Pi3--r`, `Pi3-po`, `Pi3-so`, `Pi3-sr`, `Pi3fpr`, `Pi3fso`, `Pi3fsr`, `Pi3mpr`, `Pi3mso`, `Pi3msr`, `Pi3msr--y`, `Pp1-pa--------w`, `Pp1-pa--y-----w`, `Pp1-pd--------s`, `Pp1-pd--------w`, `Pp1-pd--y-----w`, `Pp1-pr--------s`, `Pp1-sa--------s`, `Pp1-sa--------w`, `Pp1-sa--y-----w`, `Pp1-sd--------s`, `Pp1-sd--------w`, `Pp1-sd--y-----w`, `Pp1-sn--------s`, `Pp2-----------s`, `Pp2-pa--------w`, `Pp2-pa--y-----w`, `Pp2-pd--------w`, `Pp2-pd--y-----w`, `Pp2-pr--------s`, `Pp2-sa--------s`, `Pp2-sa--------w`, `Pp2-sa--y-----w`, `Pp2-sd--------s`, `Pp2-sd--------w`, `Pp2-sd--y-----w`, `Pp2-sn--------s`, `Pp2-so--------s`, `Pp2-sr--------s`, `Pp3-p---------s`, `Pp3-pd--------w`, `Pp3-pd--y-----w`, `Pp3-po--------s`, `Pp3-sd--------w`, `Pp3-sd--y-----w`, `Pp3-so--------s`, `Pp3fpa--------w`, `Pp3fpa--y-----w`, `Pp3fpr--------s`, `Pp3fs---------s`, `Pp3fsa--------w`, `Pp3fsa--y-----w`, `Pp3fso--------s`, `Pp3fsr--------s`, `Pp3fsr--y-----s`, `Pp3mpa--------w`, `Pp3mpa--y-----w`, `Pp3mpr--------s`, `Pp3ms---------s`, `Pp3msa--------w`, `Pp3msa--y-----w`, `Pp3mso--------s`, `Pp3msr--------s`, `Pp3msr--y-----s`, `Ps1fp-s`, `Ps1fsrp`, `Ps1fsrs`, `Ps1mp-p`, `Ps1ms-p`, `Ps2fp-s`, `Ps2fsrp`, `Ps2fsrs`, `Ps3---p`, `Ps3---s`, `Ps3fp-s`, `Ps3fsrs`, `Ps3mp-s`, `Ps3ms-s`, `Pw3--r`, `Pw3-po`, `Pw3-so`, `Pw3fpr`, `Pw3fso`, `Pw3mpr`, `Pw3mso`, `Px3--a--------s`, `Px3--a--------w`, `Px3--a--y-----w`, `Px3--d--------w`, `Px3--d--y-----w`, `Pz3-sr`, `Pz3fsr`, `QUEST`, `QUOT`, `Qf`, `Qn`, `Qs`, `Qs-y`, `Qz`, `Qz-y`, `RCURL`, `RPAR`, `RSQR`, `Rc`, `Rgp`, `Rgpy`, `Rgs`, `Rp`, `Rw`, `Rw-y`, `Rz`, `SCOLON`, `SLASH`, `STAR`, `Sp`, `Spsa`, `Spsay`, `Spsd`, `Spsg`, `Td-po`, `Tdfpr`, `Tdfso`, `Tdfsr`, `Tdmpr`, `Tdmso`, `Tdmsr`, `Tf-so`, `Tffpoy`, `Tffpry`, `Tffs-y`, `Tfmpoy`, `Tfms-y`, `Tfmsoy`, `Tfmsry`, `Ti-po`, `Tifp-y`, `Tifso`, `Tifsr`, `Timso`, `Timsr`, `Tsfp`, `Tsfs`, `Tsmp`, `Tsms`, `UNDERSC`, `Va--1`, `Va--1-----y`, `Va--1p`, `Va--1s`, `Va--1s----y`, `Va--2p`, `Va--2p----y`, `Va--2s`, `Va--2s----y`, `Va--3`, `Va--3-----y`, `Va--3p`, `Va--3p----y`, `Va--3s`, `Va--3s----y`, `Vag`, `Vag-------y`, `Vaii1`, `Vaii2s`, `Vaii3p`, `Vaii3s`, `Vail3p`, `Vail3s`, `Vaip1p`, `Vaip1s`, `Vaip2p`, `Vaip2s`, `Vaip3p`, `Vaip3p----y`, `Vaip3s`, `Vaip3s----y`, `Vais3p`, `Vais3s`, `Vam-2s`, `Vanp`, `Vap--sm`, `Vasp1p`, `Vasp1s`, `Vasp2p`, `Vasp2s`, `Vasp3`, `Vmg`, `Vmg-------y`, `Vmii1`, `Vmii1-----y`, `Vmii2p`, `Vmii2s`, `Vmii3p`, `Vmii3p----y`, `Vmii3s`, `Vmii3s----y`, `Vmil1`, `Vmil1p`, `Vmil2s`, `Vmil3p`, `Vmil3p----y`, `Vmil3s`, `Vmil3s----y`, `Vmip1p`, `Vmip1p----y`, `Vmip1s`, `Vmip1s----y`, `Vmip2p`, `Vmip2s`, `Vmip2s----y`, `Vmip3`, `Vmip3-----y`, `Vmip3p`, `Vmip3s`, `Vmip3s----y`, `Vmis1p`, `Vmis1s`, `Vmis3p`, `Vmis3p----y`, `Vmis3s`, `Vmis3s----y`, `Vmm-2p`, `Vmm-2s`, `Vmnp`, `Vmnp------y`, `Vmp--pf`, `Vmp--pm`, `Vmp--sf`, `Vmp--sm`, `Vmp--sm---y`, `Vmsp1p`, `Vmsp2p`, `Vmsp2s`, `Vmsp3`, `Vmsp3-----y`, `X`, `Y`, `Ya`, `Yn`, `Ynfsoy`, `Ynfsry`, `Ynmsoy`, `Ynmsry`, `Yp`, `Yp,Yn`, `Yp-sr`, `Yr` |
84
+ | **`parser`** | `ROOT`, `acl`, `advcl`, `advcl:tcl`, `advmod`, `advmod:tmod`, `amod`, `appos`, `aux`, `aux:pass`, `case`, `cc`, `cc:preconj`, `ccomp`, `ccomp:pmod`, `compound`, `conj`, `cop`, `csubj`, `csubj:pass`, `dep`, `det`, `expl`, `expl:impers`, `expl:pass`, `expl:poss`, `expl:pv`, `fixed`, `flat`, `goeswith`, `iobj`, `mark`, `nmod`, `nmod:tmod`, `nsubj`, `nsubj:pass`, `nummod`, `obj`, `obl`, `obl:agent`, `obl:pmod`, `orphan`, `parataxis`, `punct`, `vocative`, `xcomp` |
85
  | **`senter`** | `I`, `S` |
86
  | **`ner`** | `DATETIME`, `EVENT`, `FACILITY`, `GPE`, `LANGUAGE`, `LOC`, `MONEY`, `NAT_REL_POL`, `NUMERIC_VALUE`, `ORDINAL`, `ORGANIZATION`, `PERIOD`, `PERSON`, `PRODUCT`, `QUANTITY`, `WORK_OF_ART` |
87
 
 
92
  | Type | Score |
93
  | --- | --- |
94
  | `TOKEN_ACC` | 99.90 |
95
+ | `TOKEN_P` | 99.67 |
96
+ | `TOKEN_R` | 99.57 |
97
+ | `TOKEN_F` | 99.59 |
98
+ | `TAG_ACC` | 96.64 |
99
+ | `SENTS_P` | 95.48 |
100
+ | `SENTS_R` | 95.48 |
101
+ | `SENTS_F` | 95.48 |
102
+ | `DEP_UAS` | 88.97 |
103
+ | `DEP_LAS` | 83.90 |
104
+ | `POS_ACC` | 94.06 |
105
+ | `MORPH_ACC` | 95.11 |
106
+ | `MORPH_MICRO_P` | 98.96 |
107
+ | `MORPH_MICRO_R` | 95.82 |
108
+ | `MORPH_MICRO_F` | 97.07 |
109
+ | `LEMMA_ACC` | 81.83 |
110
+ | `ENTS_P` | 75.51 |
111
+ | `ENTS_R` | 77.22 |
112
+ | `ENTS_F` | 76.35 |
accuracy.json CHANGED
@@ -1,447 +1,448 @@
1
  {
2
  "token_acc": 0.9990029326,
3
- "tag_acc": 0.975315026,
4
- "pos_acc": 0.965353945,
5
- "morph_acc": 0.9760744713,
6
- "lemma_acc": 0.8186589263,
7
- "dep_uas": 0.8904573687,
8
- "dep_las": 0.8467281511,
9
- "ents_p": 0.7588147037,
10
- "ents_r": 0.7771801767,
11
- "ents_f": 0.7678876447,
12
- "sents_p": 0.9533954727,
13
- "sents_r": 0.9521276596,
14
- "sents_f": 0.9527611444,
15
- "speed": 10281.5880630945,
16
- "morph_per_feat": {
17
- "AdpType": {
18
- "p": 0.9970784641,
19
- "r": 0.9941739492,
20
- "f": 0.9956240884
21
  },
22
- "Case": {
23
- "p": 0.9896781203,
24
- "r": 0.9840648211,
25
- "f": 0.9868634886
26
  },
27
- "Variant": {
28
- "p": 0.9845559846,
29
- "r": 0.9205776173,
30
- "f": 0.9514925373
31
  },
32
- "Gender": {
33
- "p": 0.9840800225,
34
- "r": 0.9782913165,
35
- "f": 0.9811771316
36
  },
37
- "Number": {
38
- "p": 0.9833276236,
39
- "r": 0.9778560073,
40
- "f": 0.9805841827
41
  },
42
- "PronType": {
43
- "p": 0.9938366718,
44
- "r": 0.987244898,
45
- "f": 0.9905298183
46
  },
47
- "Definite": {
48
- "p": 0.9784728611,
49
- "r": 0.9725676664,
50
- "f": 0.9755113272
51
  },
52
- "Degree": {
53
- "p": 0.9530685921,
54
- "r": 0.9428571429,
55
- "f": 0.947935368
56
  },
57
- "Polarity": {
58
- "p": 0.9884467266,
59
- "r": 0.985915493,
60
- "f": 0.9871794872
61
  },
62
- "Mood": {
63
- "p": 0.9740072202,
64
- "r": 0.9635714286,
65
- "f": 0.9687612208
66
  },
67
- "Person": {
68
- "p": 0.9837338262,
69
- "r": 0.9718772827,
70
- "f": 0.9777696123
71
  },
72
- "Tense": {
73
- "p": 0.9730337079,
74
- "r": 0.9572586588,
75
- "f": 0.9650817236
76
  },
77
- "VerbForm": {
78
- "p": 0.9698996656,
79
- "r": 0.9593572779,
80
- "f": 0.9645996674
81
  },
82
- "NumForm": {
83
- "p": 0.9901960784,
84
- "r": 0.9853658537,
85
- "f": 0.9877750611
86
  },
87
- "NumType": {
88
- "p": 0.9927536232,
89
- "r": 0.9856115108,
90
- "f": 0.9891696751
91
  },
92
- "PartType": {
93
- "p": 0.9473684211,
94
- "r": 0.9,
95
- "f": 0.9230769231
96
  },
97
- "Strength": {
98
- "p": 0.9897959184,
99
- "r": 0.9797979798,
100
- "f": 0.9847715736
101
  },
102
- "Reflex": {
103
- "p": 0.9938461538,
104
- "r": 0.990797546,
105
- "f": 0.9923195084
106
  },
107
- "Poss": {
108
- "p": 0.9792387543,
109
- "r": 0.9895104895,
110
- "f": 0.9843478261
111
  },
112
- "Position": {
113
- "p": 0.9928057554,
114
- "r": 0.9517241379,
115
- "f": 0.9718309859
116
  },
117
- "Number[psor]": {
118
- "p": 0.9295774648,
119
- "r": 0.9565217391,
120
- "f": 0.9428571429
121
  },
122
- "Abbr": {
123
- "p": 0.9746835443,
124
- "r": 0.9058823529,
125
- "f": 0.9390243902
126
  },
127
- "Foreign": {
 
 
 
 
 
128
  "p": 0.0,
129
  "r": 0.0,
130
  "f": 0.0
131
- }
132
- },
133
- "dep_las_per_type": {
134
- "case": {
135
- "p": 0.9257307139,
136
- "r": 0.9415204678,
137
- "f": 0.9335588306
138
  },
139
- "det": {
140
- "p": 0.9473684211,
141
- "r": 0.9671052632,
142
- "f": 0.9571351058
143
- },
144
- "nmod:tmod": {
145
- "p": 0.4,
146
- "r": 0.0465116279,
147
- "f": 0.0833333333
148
- },
149
- "amod": {
150
- "p": 0.8639212175,
151
- "r": 0.8756805808,
152
- "f": 0.8697611537
153
- },
154
- "cc": {
155
- "p": 0.8669354839,
156
- "r": 0.89958159,
157
- "f": 0.8829568789
158
- },
159
- "conj": {
160
- "p": 0.5984962406,
161
- "r": 0.6012084592,
162
- "f": 0.5998492841
163
- },
164
- "nmod": {
165
- "p": 0.7883565797,
166
- "r": 0.8217446271,
167
- "f": 0.8047044259
168
- },
169
- "mark": {
170
- "p": 0.8857142857,
171
- "r": 0.9056179775,
172
- "f": 0.8955555556
173
- },
174
- "fixed": {
175
- "p": 0.8689217759,
176
- "r": 0.7172774869,
177
- "f": 0.7858508604
178
- },
179
- "nsubj": {
180
- "p": 0.8333333333,
181
- "r": 0.7814485388,
182
- "f": 0.806557377
183
  },
184
- "advcl:tcl": {
185
  "p": 0.0,
186
  "r": 0.0,
187
  "f": 0.0
188
  },
189
- "obj": {
190
- "p": 0.7794117647,
191
- "r": 0.8139931741,
192
- "f": 0.796327212
193
- },
194
- "nummod": {
195
- "p": 0.8703703704,
196
- "r": 0.8676923077,
197
- "f": 0.8690292758
198
  },
199
  "flat": {
200
- "p": 0.7441860465,
201
- "r": 0.6857142857,
202
- "f": 0.7137546468
203
  },
204
- "obl": {
205
- "p": 0.649068323,
206
- "r": 0.7116912599,
207
- "f": 0.6789388197
208
  },
209
- "nmod:pmod": {
210
- "p": 0.44,
211
- "r": 0.1692307692,
212
- "f": 0.2444444444
213
  },
214
- "acl": {
215
- "p": 0.7024793388,
216
- "r": 0.7264957265,
217
- "f": 0.7142857143
218
  },
219
- "advmod": {
220
- "p": 0.7860962567,
221
- "r": 0.7577319588,
222
- "f": 0.7716535433
223
  },
224
  "expl:pv": {
225
- "p": 0.7883597884,
226
- "r": 0.7967914439,
227
- "f": 0.7925531915
228
- },
229
- "root": {
230
- "p": 0.917222964,
231
- "r": 0.9135638298,
232
- "f": 0.9153897402
233
- },
234
- "advcl": {
235
- "p": 0.5625,
236
- "r": 0.5853658537,
237
- "f": 0.5737051793
238
  },
239
- "iobj": {
240
- "p": 0.7591240876,
241
- "r": 0.7027027027,
242
- "f": 0.7298245614
243
- },
244
- "ccomp": {
245
- "p": 0.7178217822,
246
- "r": 0.8146067416,
247
- "f": 0.7631578947
248
- },
249
- "goeswith": {
250
- "p": 0.875,
251
- "r": 0.5833333333,
252
- "f": 0.7
253
  },
254
- "parataxis": {
255
- "p": 0.7027027027,
256
- "r": 0.5954198473,
257
- "f": 0.6446280992
258
  },
259
  "expl:poss": {
260
- "p": 0.5909090909,
261
- "r": 0.6046511628,
262
- "f": 0.5977011494
263
  },
264
- "cop": {
265
- "p": 0.7647058824,
266
- "r": 0.8024691358,
267
- "f": 0.7831325301
268
- },
269
- "cc:preconj": {
270
  "p": 0.0,
271
  "r": 0.0,
272
  "f": 0.0
273
  },
274
- "aux": {
275
- "p": 0.9716713881,
276
- "r": 0.9122340426,
277
- "f": 0.9410150892
278
- },
279
- "expl": {
280
- "p": 0.5294117647,
281
- "r": 0.4186046512,
282
- "f": 0.4675324675
283
- },
284
- "appos": {
285
- "p": 0.4347826087,
286
- "r": 0.396039604,
287
- "f": 0.414507772
288
- },
289
  "xcomp": {
290
- "p": 0.5441176471,
291
- "r": 0.4512195122,
292
- "f": 0.4933333333
293
- },
294
- "csubj": {
295
- "p": 0.7966101695,
296
- "r": 0.746031746,
297
- "f": 0.7704918033
298
- },
299
- "nmod:agent": {
300
- "p": 0.7285714286,
301
- "r": 0.7846153846,
302
- "f": 0.7555555556
303
- },
304
- "aux:pass": {
305
- "p": 0.7769784173,
306
- "r": 0.9,
307
- "f": 0.833976834
308
  },
309
- "dep": {
310
  "p": 0.0,
311
  "r": 0.0,
312
  "f": 0.0
313
  },
314
- "nsubj:pass": {
315
- "p": 0.6111111111,
316
- "r": 0.6644295302,
317
- "f": 0.6366559486
318
  },
319
- "advmod:tmod": {
320
  "p": 0.0,
321
  "r": 0.0,
322
  "f": 0.0
323
  },
324
- "expl:pass": {
325
- "p": 0.6734693878,
326
- "r": 0.7252747253,
327
- "f": 0.6984126984
328
- },
329
- "ccomp:pmod": {
330
- "p": 0.4,
331
- "r": 0.2666666667,
332
- "f": 0.32
333
- },
334
  "compound": {
335
- "p": 0.25,
336
- "r": 0.3333333333,
337
- "f": 0.2857142857
338
  },
339
- "orphan": {
340
  "p": 0.0,
341
  "r": 0.0,
342
  "f": 0.0
343
  },
344
- "expl:impers": {
345
- "p": 0.3333333333,
346
- "r": 0.1,
347
- "f": 0.1538461538
348
- },
349
- "csubj:pass": {
350
  "p": 0.25,
351
  "r": 0.3333333333,
352
  "f": 0.2857142857
353
  },
354
- "vocative": {
355
  "p": 0.0,
356
  "r": 0.0,
357
  "f": 0.0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
358
  },
359
- "discourse": {
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
360
  "p": 0.0,
361
  "r": 0.0,
362
  "f": 0.0
363
  }
364
  },
 
 
 
 
365
  "ents_per_type": {
366
  "DATETIME": {
367
- "p": 0.7852348993,
368
- "r": 0.8153310105,
369
- "f": 0.8
370
  },
371
  "ORGANIZATION": {
372
- "p": 0.6873065015,
373
- "r": 0.7070063694,
374
- "f": 0.6970172684
375
  },
376
  "FACILITY": {
377
- "p": 0.5317460317,
378
- "r": 0.5114503817,
379
- "f": 0.5214007782
 
 
 
 
 
380
  },
381
  "NUMERIC_VALUE": {
382
- "p": 0.8978723404,
383
- "r": 0.8940677966,
384
- "f": 0.8959660297
385
  },
386
  "ORDINAL": {
387
- "p": 0.7931034483,
388
  "r": 0.8363636364,
389
- "f": 0.814159292
390
  },
391
  "EVENT": {
392
- "p": 0.5675675676,
393
- "r": 0.5675675676,
394
- "f": 0.5675675676
395
  },
396
  "GPE": {
397
- "p": 0.8351409978,
398
- "r": 0.8850574713,
399
- "f": 0.859375
400
  },
401
  "PERSON": {
402
- "p": 0.7360890302,
403
- "r": 0.7768456376,
404
- "f": 0.7559183673
405
  },
406
  "NAT_REL_POL": {
407
- "p": 0.925170068,
408
  "r": 0.9066666667,
409
- "f": 0.9158249158
410
  },
411
  "MONEY": {
412
- "p": 0.9411764706,
413
- "r": 0.8275862069,
414
- "f": 0.880733945
415
- },
416
- "PRODUCT": {
417
- "p": 0.6260162602,
418
- "r": 0.5620437956,
419
- "f": 0.5923076923
420
  },
421
  "LOC": {
422
- "p": 0.4886363636,
423
- "r": 0.5657894737,
424
- "f": 0.5243902439
425
  },
426
  "WORK_OF_ART": {
427
- "p": 0.4285714286,
428
- "r": 0.4736842105,
429
- "f": 0.45
430
  },
431
  "QUANTITY": {
432
- "p": 0.8620689655,
433
- "r": 0.9615384615,
434
- "f": 0.9090909091
435
- },
436
- "PERIOD": {
437
- "p": 0.9428571429,
438
- "r": 0.7857142857,
439
- "f": 0.8571428571
440
  },
441
  "LANGUAGE": {
442
- "p": 0.6,
443
- "r": 0.75,
444
- "f": 0.6666666667
 
 
 
 
 
445
  }
446
- }
 
447
  }
 
1
  {
2
  "token_acc": 0.9990029326,
3
+ "token_p": 0.9967350492,
4
+ "token_r": 0.9957244934,
5
+ "token_f": 0.9959492157,
6
+ "tag_acc": 0.9664291788,
7
+ "sents_p": 0.954787234,
8
+ "sents_r": 0.954787234,
9
+ "sents_f": 0.954787234,
10
+ "dep_uas": 0.8897462438,
11
+ "dep_las": 0.8389686971,
12
+ "dep_las_per_type": {
13
+ "root": {
14
+ "p": 0.8786231884,
15
+ "r": 0.9133709981,
16
+ "f": 0.8956602031
 
 
 
 
17
  },
18
+ "mark": {
19
+ "p": 0.9288389513,
20
+ "r": 0.9358490566,
21
+ "f": 0.9323308271
22
  },
23
+ "case": {
24
+ "p": 0.9638554217,
25
+ "r": 0.959880015,
26
+ "f": 0.9618636107
27
  },
28
+ "nmod:tmod": {
29
+ "p": 0.6842105263,
30
+ "r": 0.1092436975,
31
+ "f": 0.1884057971
32
  },
33
+ "amod": {
34
+ "p": 0.9172297297,
35
+ "r": 0.9250425894,
36
+ "f": 0.9211195929
37
  },
38
+ "nsubj": {
39
+ "p": 0.8803986711,
40
+ "r": 0.8372827804,
41
+ "f": 0.8582995951
42
  },
43
+ "nmod": {
44
+ "p": 0.8218838527,
45
+ "r": 0.8286326312,
46
+ "f": 0.8252444444
47
  },
48
+ "aux": {
49
+ "p": 0.9867924528,
50
+ "r": 0.9561243144,
51
+ "f": 0.9712163417
52
  },
53
+ "advcl": {
54
+ "p": 0.5862068966,
55
+ "r": 0.6390977444,
56
+ "f": 0.6115107914
57
  },
58
+ "obj": {
59
+ "p": 0.8326180258,
60
+ "r": 0.896073903,
61
+ "f": 0.8631813126
62
  },
63
+ "det": {
64
+ "p": 0.9575688073,
65
+ "r": 0.9456398641,
66
+ "f": 0.9515669516
67
  },
68
+ "cc": {
69
+ "p": 0.9340425532,
70
+ "r": 0.9164926931,
71
+ "f": 0.9251844046
72
  },
73
+ "conj": {
74
+ "p": 0.6115288221,
75
+ "r": 0.5654692932,
76
+ "f": 0.5875978326
77
  },
78
+ "nummod": {
79
+ "p": 0.887675507,
80
+ "r": 0.8835403727,
81
+ "f": 0.8856031128
82
  },
83
+ "acl": {
84
+ "p": 0.8063583815,
85
+ "r": 0.7209302326,
86
+ "f": 0.761255116
87
  },
88
+ "advmod": {
89
+ "p": 0.8117048346,
90
+ "r": 0.8416886544,
91
+ "f": 0.8264248705
92
  },
93
+ "obl": {
94
+ "p": 0.6821052632,
95
+ "r": 0.8223350254,
96
+ "f": 0.7456846951
97
  },
98
+ "expl:pass": {
99
+ "p": 0.8085106383,
100
+ "r": 0.7037037037,
101
+ "f": 0.7524752475
102
  },
103
+ "nsubj:pass": {
104
+ "p": 0.8,
105
+ "r": 0.756097561,
106
+ "f": 0.7774294671
107
  },
108
+ "fixed": {
109
+ "p": 0.9,
110
+ "r": 0.8562367865,
111
+ "f": 0.8775731311
112
  },
113
+ "appos": {
114
+ "p": 0.4956896552,
115
+ "r": 0.4389312977,
116
+ "f": 0.4655870445
117
  },
118
+ "parataxis": {
119
+ "p": 0.1627906977,
120
+ "r": 0.2,
121
+ "f": 0.1794871795
122
  },
123
+ "aux:pass": {
124
+ "p": 0.9125,
125
+ "r": 0.9733333333,
126
+ "f": 0.9419354839
127
+ },
128
+ "nmod:agent": {
129
  "p": 0.0,
130
  "r": 0.0,
131
  "f": 0.0
 
 
 
 
 
 
 
132
  },
133
+ "ccomp": {
134
+ "p": 0.8759689922,
135
+ "r": 0.8759689922,
136
+ "f": 0.8759689922
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
137
  },
138
+ "nmod:pmod": {
139
  "p": 0.0,
140
  "r": 0.0,
141
  "f": 0.0
142
  },
143
+ "iobj": {
144
+ "p": 0.8157894737,
145
+ "r": 0.7654320988,
146
+ "f": 0.7898089172
 
 
 
 
 
147
  },
148
  "flat": {
149
+ "p": 0.7557251908,
150
+ "r": 0.7815789474,
151
+ "f": 0.7684346701
152
  },
153
+ "cop": {
154
+ "p": 0.8524590164,
155
+ "r": 0.8387096774,
156
+ "f": 0.8455284553
157
  },
158
+ "csubj": {
159
+ "p": 0.8235294118,
160
+ "r": 0.6666666667,
161
+ "f": 0.7368421053
162
  },
163
+ "obl:agent": {
164
+ "p": 0.0,
165
+ "r": 0.0,
166
+ "f": 0.0
167
  },
168
+ "dep": {
169
+ "p": 0.0,
170
+ "r": 0.0,
171
+ "f": 0.0
172
  },
173
  "expl:pv": {
174
+ "p": 0.7564102564,
175
+ "r": 0.8550724638,
176
+ "f": 0.8027210884
 
 
 
 
 
 
 
 
 
 
177
  },
178
+ "expl": {
179
+ "p": 0.6875,
180
+ "r": 0.8148148148,
181
+ "f": 0.7457627119
 
 
 
 
 
 
 
 
 
 
182
  },
183
+ "obl:pmod": {
184
+ "p": 0.0,
185
+ "r": 0.0,
186
+ "f": 0.0
187
  },
188
  "expl:poss": {
189
+ "p": 0.9655172414,
190
+ "r": 0.9032258065,
191
+ "f": 0.9333333333
192
  },
193
+ "goeswith": {
 
 
 
 
 
194
  "p": 0.0,
195
  "r": 0.0,
196
  "f": 0.0
197
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
198
  "xcomp": {
199
+ "p": 0.5806451613,
200
+ "r": 0.6666666667,
201
+ "f": 0.6206896552
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
202
  },
203
+ "orphan": {
204
  "p": 0.0,
205
  "r": 0.0,
206
  "f": 0.0
207
  },
208
+ "expl:impers": {
209
+ "p": 1.0,
210
+ "r": 0.3333333333,
211
+ "f": 0.5
212
  },
213
+ "csubj:pass": {
214
  "p": 0.0,
215
  "r": 0.0,
216
  "f": 0.0
217
  },
 
 
 
 
 
 
 
 
 
 
218
  "compound": {
219
+ "p": 0.5714285714,
220
+ "r": 0.5714285714,
221
+ "f": 0.5714285714
222
  },
223
+ "list": {
224
  "p": 0.0,
225
  "r": 0.0,
226
  "f": 0.0
227
  },
228
+ "ccomp:pmod": {
 
 
 
 
 
229
  "p": 0.25,
230
  "r": 0.3333333333,
231
  "f": 0.2857142857
232
  },
233
+ "cc:preconj": {
234
  "p": 0.0,
235
  "r": 0.0,
236
  "f": 0.0
237
+ }
238
+ },
239
+ "pos_acc": 0.9405873228,
240
+ "morph_acc": 0.9510657636,
241
+ "morph_micro_p": 0.9896160458,
242
+ "morph_micro_r": 0.9582489383,
243
+ "morph_micro_f": 0.9706797273,
244
+ "morph_per_feat": {
245
+ "Case": {
246
+ "p": 0.9938697318,
247
+ "r": 0.9896985883,
248
+ "f": 0.9917797744
249
+ },
250
+ "Gender": {
251
+ "p": 0.991821842,
252
+ "r": 0.9854981873,
253
+ "f": 0.9886499028
254
+ },
255
+ "Number": {
256
+ "p": 0.9894903379,
257
+ "r": 0.922363847,
258
+ "f": 0.9547486643
259
  },
260
+ "Person": {
261
+ "p": 0.9911452184,
262
+ "r": 0.9893930466,
263
+ "f": 0.9902683574
264
+ },
265
+ "PronType": {
266
+ "p": 0.9965349965,
267
+ "r": 0.993780235,
268
+ "f": 0.9951557093
269
+ },
270
+ "Polarity": {
271
+ "p": 0.9918566775,
272
+ "r": 0.9983606557,
273
+ "f": 0.9950980392
274
+ },
275
+ "AdpType": {
276
+ "p": 0.998982706,
277
+ "r": 0.9969543147,
278
+ "f": 0.9979674797
279
+ },
280
+ "Definite": {
281
+ "p": 0.9886490807,
282
+ "r": 0.9815873016,
283
+ "f": 0.9851055356
284
+ },
285
+ "Degree": {
286
+ "p": 0.9582772544,
287
+ "r": 0.9563465413,
288
+ "f": 0.9573109244
289
+ },
290
+ "VerbForm": {
291
+ "p": 0.9774236388,
292
+ "r": 0.9787234043,
293
+ "f": 0.9780730897
294
+ },
295
+ "Abbr": {
296
+ "p": 0.9538461538,
297
+ "r": 0.8303571429,
298
+ "f": 0.8878281623
299
+ },
300
+ "Poss": {
301
+ "p": 1.0,
302
+ "r": 0.9927710843,
303
+ "f": 0.9963724305
304
+ },
305
+ "NumForm": {
306
+ "p": 0.9871794872,
307
+ "r": 0.3181818182,
308
+ "f": 0.48125
309
+ },
310
+ "NumType": {
311
+ "p": 0.9872881356,
312
+ "r": 0.3200549451,
313
+ "f": 0.4834024896
314
+ },
315
+ "Reflex": {
316
+ "p": 1.0,
317
+ "r": 1.0,
318
+ "f": 1.0
319
+ },
320
+ "Strength": {
321
+ "p": 0.9920318725,
322
+ "r": 0.9880952381,
323
+ "f": 0.9900596421
324
+ },
325
+ "Mood": {
326
+ "p": 0.972826087,
327
+ "r": 0.9853211009,
328
+ "f": 0.9790337284
329
+ },
330
+ "Tense": {
331
+ "p": 0.9725036179,
332
+ "r": 0.976744186,
333
+ "f": 0.9746192893
334
+ },
335
+ "Variant": {
336
+ "p": 0.9932885906,
337
+ "r": 0.9548387097,
338
+ "f": 0.9736842105
339
+ },
340
+ "Position": {
341
+ "p": 1.0,
342
+ "r": 0.9910714286,
343
+ "f": 0.9955156951
344
+ },
345
+ "Number[psor]": {
346
+ "p": 1.0,
347
+ "r": 0.9666666667,
348
+ "f": 0.9830508475
349
+ },
350
+ "PartType": {
351
+ "p": 1.0,
352
+ "r": 0.9459459459,
353
+ "f": 0.9722222222
354
+ },
355
+ "Foreign": {
356
  "p": 0.0,
357
  "r": 0.0,
358
  "f": 0.0
359
  }
360
  },
361
+ "lemma_acc": 0.8183070924,
362
+ "ents_p": 0.7550713749,
363
+ "ents_r": 0.7721859393,
364
+ "ents_f": 0.7635327635,
365
  "ents_per_type": {
366
  "DATETIME": {
367
+ "p": 0.7818791946,
368
+ "r": 0.8118466899,
369
+ "f": 0.7965811966
370
  },
371
  "ORGANIZATION": {
372
+ "p": 0.7076923077,
373
+ "r": 0.7324840764,
374
+ "f": 0.7198748044
375
  },
376
  "FACILITY": {
377
+ "p": 0.5039370079,
378
+ "r": 0.4885496183,
379
+ "f": 0.496124031
380
+ },
381
+ "PRODUCT": {
382
+ "p": 0.5590551181,
383
+ "r": 0.5182481752,
384
+ "f": 0.5378787879
385
  },
386
  "NUMERIC_VALUE": {
387
+ "p": 0.8875502008,
388
+ "r": 0.936440678,
389
+ "f": 0.9113402062
390
  },
391
  "ORDINAL": {
392
+ "p": 0.8214285714,
393
  "r": 0.8363636364,
394
+ "f": 0.8288288288
395
  },
396
  "EVENT": {
397
+ "p": 0.5151515152,
398
+ "r": 0.4594594595,
399
+ "f": 0.4857142857
400
  },
401
  "GPE": {
402
+ "p": 0.8636363636,
403
+ "r": 0.8735632184,
404
+ "f": 0.8685714286
405
  },
406
  "PERSON": {
407
+ "p": 0.7046153846,
408
+ "r": 0.7684563758,
409
+ "f": 0.735152488
410
  },
411
  "NAT_REL_POL": {
412
+ "p": 0.9315068493,
413
  "r": 0.9066666667,
414
+ "f": 0.9189189189
415
  },
416
  "MONEY": {
417
+ "p": 0.9622641509,
418
+ "r": 0.8793103448,
419
+ "f": 0.9189189189
 
 
 
 
 
420
  },
421
  "LOC": {
422
+ "p": 0.4864864865,
423
+ "r": 0.4736842105,
424
+ "f": 0.48
425
  },
426
  "WORK_OF_ART": {
427
+ "p": 0.3571428571,
428
+ "r": 0.2631578947,
429
+ "f": 0.303030303
430
  },
431
  "QUANTITY": {
432
+ "p": 0.962962963,
433
+ "r": 1.0,
434
+ "f": 0.9811320755
 
 
 
 
 
435
  },
436
  "LANGUAGE": {
437
+ "p": 0.6666666667,
438
+ "r": 1.0,
439
+ "f": 0.8
440
+ },
441
+ "PERIOD": {
442
+ "p": 0.8648648649,
443
+ "r": 0.7619047619,
444
+ "f": 0.8101265823
445
  }
446
+ },
447
+ "speed": 7699.716829035
448
  }
attribute_ruler/patterns CHANGED
Binary files a/attribute_ruler/patterns and b/attribute_ruler/patterns differ
 
config.cfg CHANGED
@@ -1,10 +1,8 @@
1
  [paths]
2
- train = "corpus/ro-dep-mixed/train.spacy"
3
- dev = "corpus/ro-dep-mixed/dev.spacy"
4
- vectors = "corpus/ro_vectors"
5
- raw = null
6
  init_tok2vec = null
7
- vocab_data = null
8
 
9
  [system]
10
  gpu_allocator = null
@@ -24,6 +22,7 @@ tokenizer = {"@tokenizers":"spacy.Tokenizer.v1"}
24
 
25
  [components.attribute_ruler]
26
  factory = "attribute_ruler"
 
27
  validate = false
28
 
29
  [components.lemmatizer]
@@ -31,11 +30,13 @@ factory = "lemmatizer"
31
  mode = "lookup"
32
  model = null
33
  overwrite = false
 
34
 
35
  [components.ner]
36
  factory = "ner"
37
  incorrect_spans_key = null
38
  moves = null
 
39
  update_with_oracle_cut_size = 100
40
 
41
  [components.ner.model]
@@ -53,8 +54,8 @@ nO = null
53
  [components.ner.model.tok2vec.embed]
54
  @architectures = "spacy.MultiHashEmbed.v2"
55
  width = 96
56
- attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
57
- rows = [5000,2500,2500,2500]
58
  include_static_vectors = true
59
 
60
  [components.ner.model.tok2vec.encode]
@@ -69,6 +70,7 @@ factory = "parser"
69
  learn_tokens = false
70
  min_action_freq = 30
71
  moves = null
 
72
  update_with_oracle_cut_size = 100
73
 
74
  [components.parser.model]
@@ -87,6 +89,8 @@ upstream = "tok2vec"
87
 
88
  [components.senter]
89
  factory = "senter"
 
 
90
 
91
  [components.senter.model]
92
  @architectures = "spacy.Tagger.v1"
@@ -98,8 +102,8 @@ nO = null
98
  [components.senter.model.tok2vec.embed]
99
  @architectures = "spacy.MultiHashEmbed.v2"
100
  width = 16
101
- attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
102
- rows = [1000,500,500,500]
103
  include_static_vectors = true
104
 
105
  [components.senter.model.tok2vec.encode]
@@ -111,6 +115,8 @@ maxout_pieces = 2
111
 
112
  [components.tagger]
113
  factory = "tagger"
 
 
114
 
115
  [components.tagger.model]
116
  @architectures = "spacy.Tagger.v1"
@@ -130,8 +136,8 @@ factory = "tok2vec"
130
  [components.tok2vec.model.embed]
131
  @architectures = "spacy.MultiHashEmbed.v2"
132
  width = ${components.tok2vec.model.encode:width}
133
- attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
134
- rows = [5000,2500,2500,2500]
135
  include_static_vectors = true
136
 
137
  [components.tok2vec.model.encode]
@@ -145,22 +151,19 @@ maxout_pieces = 3
145
 
146
  [corpora.dev]
147
  @readers = "spacy.Corpus.v1"
148
- limit = 0
149
- max_length = 0
150
- path = ${paths:dev}
151
  gold_preproc = false
 
 
152
  augmenter = null
153
 
154
  [corpora.train]
155
  @readers = "spacy.Corpus.v1"
156
- path = ${paths:train}
157
- max_length = 5000
158
  gold_preproc = false
 
159
  limit = 0
160
-
161
- [corpora.train.augmenter]
162
- @augmenters = "spacy.lower_case.v1"
163
- level = 0.1
164
 
165
  [training]
166
  train_corpus = "corpora.train"
@@ -191,9 +194,8 @@ compound = 1.001
191
  t = 0.0
192
 
193
  [training.logger]
194
- @loggers = "spacy.WandbLogger.v1"
195
- project_name = "spacy-v3.0.0a2"
196
- remove_config_values = []
197
 
198
  [training.optimizer]
199
  @optimizers = "Adam.v1"
@@ -214,16 +216,17 @@ dep_las_per_type = null
214
  sents_p = null
215
  sents_r = null
216
  sents_f = 0.02
217
- lemma_acc = 0.33
218
- ents_f = 0.33
219
  ents_p = 0.0
220
  ents_r = 0.0
221
  ents_per_type = null
 
222
 
223
  [pretraining]
224
 
225
  [initialize]
226
- vocab_data = ${paths.vocab_data}
227
  vectors = ${paths.vectors}
228
  init_tok2vec = ${paths.init_tok2vec}
229
  before_init = null
 
1
  [paths]
2
+ train = null
3
+ dev = null
4
+ vectors = null
 
5
  init_tok2vec = null
 
6
 
7
  [system]
8
  gpu_allocator = null
 
22
 
23
  [components.attribute_ruler]
24
  factory = "attribute_ruler"
25
+ scorer = {"@scorers":"spacy.attribute_ruler_scorer.v1"}
26
  validate = false
27
 
28
  [components.lemmatizer]
 
30
  mode = "lookup"
31
  model = null
32
  overwrite = false
33
+ scorer = {"@scorers":"spacy.lemmatizer_scorer.v1"}
34
 
35
  [components.ner]
36
  factory = "ner"
37
  incorrect_spans_key = null
38
  moves = null
39
+ scorer = {"@scorers":"spacy.ner_scorer.v1"}
40
  update_with_oracle_cut_size = 100
41
 
42
  [components.ner.model]
 
54
  [components.ner.model.tok2vec.embed]
55
  @architectures = "spacy.MultiHashEmbed.v2"
56
  width = 96
57
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
58
+ rows = [5000,2500,2500,2500,100]
59
  include_static_vectors = true
60
 
61
  [components.ner.model.tok2vec.encode]
 
70
  learn_tokens = false
71
  min_action_freq = 30
72
  moves = null
73
+ scorer = {"@scorers":"spacy.parser_scorer.v1"}
74
  update_with_oracle_cut_size = 100
75
 
76
  [components.parser.model]
 
89
 
90
  [components.senter]
91
  factory = "senter"
92
+ overwrite = false
93
+ scorer = {"@scorers":"spacy.senter_scorer.v1"}
94
 
95
  [components.senter.model]
96
  @architectures = "spacy.Tagger.v1"
 
102
  [components.senter.model.tok2vec.embed]
103
  @architectures = "spacy.MultiHashEmbed.v2"
104
  width = 16
105
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
106
+ rows = [1000,500,500,500,50]
107
  include_static_vectors = true
108
 
109
  [components.senter.model.tok2vec.encode]
 
115
 
116
  [components.tagger]
117
  factory = "tagger"
118
+ overwrite = false
119
+ scorer = {"@scorers":"spacy.tagger_scorer.v1"}
120
 
121
  [components.tagger.model]
122
  @architectures = "spacy.Tagger.v1"
 
136
  [components.tok2vec.model.embed]
137
  @architectures = "spacy.MultiHashEmbed.v2"
138
  width = ${components.tok2vec.model.encode:width}
139
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
140
+ rows = [5000,2500,2500,2500,100]
141
  include_static_vectors = true
142
 
143
  [components.tok2vec.model.encode]
 
151
 
152
  [corpora.dev]
153
  @readers = "spacy.Corpus.v1"
154
+ path = ${paths.dev}
 
 
155
  gold_preproc = false
156
+ max_length = 0
157
+ limit = 0
158
  augmenter = null
159
 
160
  [corpora.train]
161
  @readers = "spacy.Corpus.v1"
162
+ path = ${paths.train}
 
163
  gold_preproc = false
164
+ max_length = 0
165
  limit = 0
166
+ augmenter = null
 
 
 
167
 
168
  [training]
169
  train_corpus = "corpora.train"
 
194
  t = 0.0
195
 
196
  [training.logger]
197
+ @loggers = "spacy.ConsoleLogger.v1"
198
+ progress_bar = false
 
199
 
200
  [training.optimizer]
201
  @optimizers = "Adam.v1"
 
216
  sents_p = null
217
  sents_r = null
218
  sents_f = 0.02
219
+ lemma_acc = 0.5
220
+ ents_f = 0.16
221
  ents_p = 0.0
222
  ents_r = 0.0
223
  ents_per_type = null
224
+ speed = 0.0
225
 
226
  [pretraining]
227
 
228
  [initialize]
229
+ vocab_data = null
230
  vectors = ${paths.vectors}
231
  init_tok2vec = ${paths.init_tok2vec}
232
  before_init = null
meta.json CHANGED
@@ -1,14 +1,14 @@
1
  {
2
  "lang":"ro",
3
  "name":"core_news_lg",
4
- "version":"3.1.0",
5
  "description":"Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"CC BY-SA 4.0",
10
- "spacy_version":">=3.1.0,<3.2.0",
11
- "spacy_git_version":"caba63b74",
12
  "vectors":{
13
  "width":300,
14
  "vectors":500000,
@@ -30,6 +30,7 @@
30
  "Afp",
31
  "Afp-p-n",
32
  "Afp-poy",
 
33
  "Afpf--n",
34
  "Afpfp-n",
35
  "Afpfp-ny",
@@ -131,6 +132,7 @@
131
  "Ds2ms-s",
132
  "Ds3---p",
133
  "Ds3---s",
 
134
  "Ds3fp-s",
135
  "Ds3fsos",
136
  "Ds3fsrs",
@@ -159,18 +161,23 @@
159
  "LSQR",
160
  "LT",
161
  "M",
162
- "Mc",
163
  "Mc-p-d",
164
  "Mc-p-l",
 
 
 
165
  "Mcfp-l",
166
  "Mcfp-ln",
167
  "Mcfprln",
168
  "Mcfprly",
169
  "Mcfsoln",
 
170
  "Mcfsrln",
 
171
  "Mcmp-l",
172
  "Mcms-ln",
173
  "Mcmsrl",
 
174
  "Mcmsrly",
175
  "Mffprln",
176
  "Mffsrln",
@@ -243,7 +250,6 @@
243
  "Pd3mpr--y",
244
  "Pd3mso",
245
  "Pd3msr",
246
- "Pi3",
247
  "Pi3--r",
248
  "Pi3-po",
249
  "Pi3-so",
@@ -289,6 +295,7 @@
289
  "Pp3-po--------s",
290
  "Pp3-sd--------w",
291
  "Pp3-sd--y-----w",
 
292
  "Pp3fpa--------w",
293
  "Pp3fpa--y-----w",
294
  "Pp3fpr--------s",
@@ -315,7 +322,6 @@
315
  "Ps2fp-s",
316
  "Ps2fsrp",
317
  "Ps2fsrs",
318
- "Ps2ms-s",
319
  "Ps3---p",
320
  "Ps3---s",
321
  "Ps3fp-s",
@@ -348,7 +354,6 @@
348
  "RPAR",
349
  "RSQR",
350
  "Rc",
351
- "Rgc",
352
  "Rgp",
353
  "Rgpy",
354
  "Rgs",
@@ -406,6 +411,7 @@
406
  "Va--3s",
407
  "Va--3s----y",
408
  "Vag",
 
409
  "Vaii1",
410
  "Vaii2s",
411
  "Vaii3p",
@@ -475,7 +481,7 @@
475
  "Vmp--sm",
476
  "Vmp--sm---y",
477
  "Vmsp1p",
478
- "Vmsp1s",
479
  "Vmsp2s",
480
  "Vmsp3",
481
  "Vmsp3-----y",
@@ -488,6 +494,7 @@
488
  "Ynmsoy",
489
  "Ynmsry",
490
  "Yp",
 
491
  "Yp-sr",
492
  "Yr"
493
  ],
@@ -525,14 +532,14 @@
525
  "iobj",
526
  "mark",
527
  "nmod",
528
- "nmod:agent",
529
- "nmod:pmod",
530
  "nmod:tmod",
531
  "nsubj",
532
  "nsubj:pass",
533
  "nummod",
534
  "obj",
535
  "obl",
 
 
536
  "orphan",
537
  "parataxis",
538
  "punct",
@@ -590,450 +597,451 @@
590
  ],
591
  "performance":{
592
  "token_acc":0.9990029326,
593
- "tag_acc":0.975315026,
594
- "pos_acc":0.965353945,
595
- "morph_acc":0.9760744713,
596
- "lemma_acc":0.8186589263,
597
- "dep_uas":0.8904573687,
598
- "dep_las":0.8467281511,
599
- "ents_p":0.7588147037,
600
- "ents_r":0.7771801767,
601
- "ents_f":0.7678876447,
602
- "sents_p":0.9533954727,
603
- "sents_r":0.9521276596,
604
- "sents_f":0.9527611444,
605
- "speed":10281.5880630945,
606
- "morph_per_feat":{
607
- "AdpType":{
608
- "p":0.9970784641,
609
- "r":0.9941739492,
610
- "f":0.9956240884
611
  },
612
- "Case":{
613
- "p":0.9896781203,
614
- "r":0.9840648211,
615
- "f":0.9868634886
616
  },
617
- "Variant":{
618
- "p":0.9845559846,
619
- "r":0.9205776173,
620
- "f":0.9514925373
621
  },
622
- "Gender":{
623
- "p":0.9840800225,
624
- "r":0.9782913165,
625
- "f":0.9811771316
626
  },
627
- "Number":{
628
- "p":0.9833276236,
629
- "r":0.9778560073,
630
- "f":0.9805841827
631
  },
632
- "PronType":{
633
- "p":0.9938366718,
634
- "r":0.987244898,
635
- "f":0.9905298183
636
  },
637
- "Definite":{
638
- "p":0.9784728611,
639
- "r":0.9725676664,
640
- "f":0.9755113272
641
  },
642
- "Degree":{
643
- "p":0.9530685921,
644
- "r":0.9428571429,
645
- "f":0.947935368
646
  },
647
- "Polarity":{
648
- "p":0.9884467266,
649
- "r":0.985915493,
650
- "f":0.9871794872
651
  },
652
- "Mood":{
653
- "p":0.9740072202,
654
- "r":0.9635714286,
655
- "f":0.9687612208
656
  },
657
- "Person":{
658
- "p":0.9837338262,
659
- "r":0.9718772827,
660
- "f":0.9777696123
661
  },
662
- "Tense":{
663
- "p":0.9730337079,
664
- "r":0.9572586588,
665
- "f":0.9650817236
666
  },
667
- "VerbForm":{
668
- "p":0.9698996656,
669
- "r":0.9593572779,
670
- "f":0.9645996674
671
  },
672
- "NumForm":{
673
- "p":0.9901960784,
674
- "r":0.9853658537,
675
- "f":0.9877750611
676
  },
677
- "NumType":{
678
- "p":0.9927536232,
679
- "r":0.9856115108,
680
- "f":0.9891696751
681
  },
682
- "PartType":{
683
- "p":0.9473684211,
684
- "r":0.9,
685
- "f":0.9230769231
686
  },
687
- "Strength":{
688
- "p":0.9897959184,
689
- "r":0.9797979798,
690
- "f":0.9847715736
691
  },
692
- "Reflex":{
693
- "p":0.9938461538,
694
- "r":0.990797546,
695
- "f":0.9923195084
696
  },
697
- "Poss":{
698
- "p":0.9792387543,
699
- "r":0.9895104895,
700
- "f":0.9843478261
701
  },
702
- "Position":{
703
- "p":0.9928057554,
704
- "r":0.9517241379,
705
- "f":0.9718309859
706
  },
707
- "Number[psor]":{
708
- "p":0.9295774648,
709
- "r":0.9565217391,
710
- "f":0.9428571429
711
  },
712
- "Abbr":{
713
- "p":0.9746835443,
714
- "r":0.9058823529,
715
- "f":0.9390243902
716
  },
717
- "Foreign":{
 
 
 
 
 
718
  "p":0.0,
719
  "r":0.0,
720
  "f":0.0
721
- }
722
- },
723
- "dep_las_per_type":{
724
- "case":{
725
- "p":0.9257307139,
726
- "r":0.9415204678,
727
- "f":0.9335588306
728
  },
729
- "det":{
730
- "p":0.9473684211,
731
- "r":0.9671052632,
732
- "f":0.9571351058
733
- },
734
- "nmod:tmod":{
735
- "p":0.4,
736
- "r":0.0465116279,
737
- "f":0.0833333333
738
- },
739
- "amod":{
740
- "p":0.8639212175,
741
- "r":0.8756805808,
742
- "f":0.8697611537
743
- },
744
- "cc":{
745
- "p":0.8669354839,
746
- "r":0.89958159,
747
- "f":0.8829568789
748
- },
749
- "conj":{
750
- "p":0.5984962406,
751
- "r":0.6012084592,
752
- "f":0.5998492841
753
- },
754
- "nmod":{
755
- "p":0.7883565797,
756
- "r":0.8217446271,
757
- "f":0.8047044259
758
- },
759
- "mark":{
760
- "p":0.8857142857,
761
- "r":0.9056179775,
762
- "f":0.8955555556
763
- },
764
- "fixed":{
765
- "p":0.8689217759,
766
- "r":0.7172774869,
767
- "f":0.7858508604
768
- },
769
- "nsubj":{
770
- "p":0.8333333333,
771
- "r":0.7814485388,
772
- "f":0.806557377
773
  },
774
- "advcl:tcl":{
775
  "p":0.0,
776
  "r":0.0,
777
  "f":0.0
778
  },
779
- "obj":{
780
- "p":0.7794117647,
781
- "r":0.8139931741,
782
- "f":0.796327212
783
- },
784
- "nummod":{
785
- "p":0.8703703704,
786
- "r":0.8676923077,
787
- "f":0.8690292758
788
  },
789
  "flat":{
790
- "p":0.7441860465,
791
- "r":0.6857142857,
792
- "f":0.7137546468
793
  },
794
- "obl":{
795
- "p":0.649068323,
796
- "r":0.7116912599,
797
- "f":0.6789388197
798
  },
799
- "nmod:pmod":{
800
- "p":0.44,
801
- "r":0.1692307692,
802
- "f":0.2444444444
803
  },
804
- "acl":{
805
- "p":0.7024793388,
806
- "r":0.7264957265,
807
- "f":0.7142857143
808
  },
809
- "advmod":{
810
- "p":0.7860962567,
811
- "r":0.7577319588,
812
- "f":0.7716535433
813
  },
814
  "expl:pv":{
815
- "p":0.7883597884,
816
- "r":0.7967914439,
817
- "f":0.7925531915
818
- },
819
- "root":{
820
- "p":0.917222964,
821
- "r":0.9135638298,
822
- "f":0.9153897402
823
- },
824
- "advcl":{
825
- "p":0.5625,
826
- "r":0.5853658537,
827
- "f":0.5737051793
828
  },
829
- "iobj":{
830
- "p":0.7591240876,
831
- "r":0.7027027027,
832
- "f":0.7298245614
833
- },
834
- "ccomp":{
835
- "p":0.7178217822,
836
- "r":0.8146067416,
837
- "f":0.7631578947
838
- },
839
- "goeswith":{
840
- "p":0.875,
841
- "r":0.5833333333,
842
- "f":0.7
843
  },
844
- "parataxis":{
845
- "p":0.7027027027,
846
- "r":0.5954198473,
847
- "f":0.6446280992
848
  },
849
  "expl:poss":{
850
- "p":0.5909090909,
851
- "r":0.6046511628,
852
- "f":0.5977011494
853
  },
854
- "cop":{
855
- "p":0.7647058824,
856
- "r":0.8024691358,
857
- "f":0.7831325301
858
- },
859
- "cc:preconj":{
860
  "p":0.0,
861
  "r":0.0,
862
  "f":0.0
863
  },
864
- "aux":{
865
- "p":0.9716713881,
866
- "r":0.9122340426,
867
- "f":0.9410150892
868
- },
869
- "expl":{
870
- "p":0.5294117647,
871
- "r":0.4186046512,
872
- "f":0.4675324675
873
- },
874
- "appos":{
875
- "p":0.4347826087,
876
- "r":0.396039604,
877
- "f":0.414507772
878
- },
879
  "xcomp":{
880
- "p":0.5441176471,
881
- "r":0.4512195122,
882
- "f":0.4933333333
883
- },
884
- "csubj":{
885
- "p":0.7966101695,
886
- "r":0.746031746,
887
- "f":0.7704918033
888
- },
889
- "nmod:agent":{
890
- "p":0.7285714286,
891
- "r":0.7846153846,
892
- "f":0.7555555556
893
- },
894
- "aux:pass":{
895
- "p":0.7769784173,
896
- "r":0.9,
897
- "f":0.833976834
898
  },
899
- "dep":{
900
  "p":0.0,
901
  "r":0.0,
902
  "f":0.0
903
  },
904
- "nsubj:pass":{
905
- "p":0.6111111111,
906
- "r":0.6644295302,
907
- "f":0.6366559486
908
  },
909
- "advmod:tmod":{
910
  "p":0.0,
911
  "r":0.0,
912
  "f":0.0
913
  },
914
- "expl:pass":{
915
- "p":0.6734693878,
916
- "r":0.7252747253,
917
- "f":0.6984126984
918
- },
919
- "ccomp:pmod":{
920
- "p":0.4,
921
- "r":0.2666666667,
922
- "f":0.32
923
- },
924
  "compound":{
925
- "p":0.25,
926
- "r":0.3333333333,
927
- "f":0.2857142857
928
  },
929
- "orphan":{
930
  "p":0.0,
931
  "r":0.0,
932
  "f":0.0
933
  },
934
- "expl:impers":{
935
- "p":0.3333333333,
936
- "r":0.1,
937
- "f":0.1538461538
938
- },
939
- "csubj:pass":{
940
  "p":0.25,
941
  "r":0.3333333333,
942
  "f":0.2857142857
943
  },
944
- "vocative":{
945
  "p":0.0,
946
  "r":0.0,
947
  "f":0.0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
948
  },
949
- "discourse":{
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
950
  "p":0.0,
951
  "r":0.0,
952
  "f":0.0
953
  }
954
  },
 
 
 
 
955
  "ents_per_type":{
956
  "DATETIME":{
957
- "p":0.7852348993,
958
- "r":0.8153310105,
959
- "f":0.8
960
  },
961
  "ORGANIZATION":{
962
- "p":0.6873065015,
963
- "r":0.7070063694,
964
- "f":0.6970172684
965
  },
966
  "FACILITY":{
967
- "p":0.5317460317,
968
- "r":0.5114503817,
969
- "f":0.5214007782
 
 
 
 
 
970
  },
971
  "NUMERIC_VALUE":{
972
- "p":0.8978723404,
973
- "r":0.8940677966,
974
- "f":0.8959660297
975
  },
976
  "ORDINAL":{
977
- "p":0.7931034483,
978
  "r":0.8363636364,
979
- "f":0.814159292
980
  },
981
  "EVENT":{
982
- "p":0.5675675676,
983
- "r":0.5675675676,
984
- "f":0.5675675676
985
  },
986
  "GPE":{
987
- "p":0.8351409978,
988
- "r":0.8850574713,
989
- "f":0.859375
990
  },
991
  "PERSON":{
992
- "p":0.7360890302,
993
- "r":0.7768456376,
994
- "f":0.7559183673
995
  },
996
  "NAT_REL_POL":{
997
- "p":0.925170068,
998
  "r":0.9066666667,
999
- "f":0.9158249158
1000
  },
1001
  "MONEY":{
1002
- "p":0.9411764706,
1003
- "r":0.8275862069,
1004
- "f":0.880733945
1005
- },
1006
- "PRODUCT":{
1007
- "p":0.6260162602,
1008
- "r":0.5620437956,
1009
- "f":0.5923076923
1010
  },
1011
  "LOC":{
1012
- "p":0.4886363636,
1013
- "r":0.5657894737,
1014
- "f":0.5243902439
1015
  },
1016
  "WORK_OF_ART":{
1017
- "p":0.4285714286,
1018
- "r":0.4736842105,
1019
- "f":0.45
1020
  },
1021
  "QUANTITY":{
1022
- "p":0.8620689655,
1023
- "r":0.9615384615,
1024
- "f":0.9090909091
1025
- },
1026
- "PERIOD":{
1027
- "p":0.9428571429,
1028
- "r":0.7857142857,
1029
- "f":0.8571428571
1030
  },
1031
  "LANGUAGE":{
1032
- "p":0.6,
1033
- "r":0.75,
1034
- "f":0.6666666667
 
 
 
 
 
1035
  }
1036
- }
 
1037
  },
1038
  "sources":[
1039
  {
@@ -1043,7 +1051,7 @@
1043
  "author":"Michal M\u011bchura"
1044
  },
1045
  {
1046
- "name":"UD Romanian RRT v2.5",
1047
  "url":"https://github.com/UniversalDependencies/UD_Romanian-RRT",
1048
  "license":"CC BY-SA 4.0",
1049
  "author":"Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin"
 
1
  {
2
  "lang":"ro",
3
  "name":"core_news_lg",
4
+ "version":"3.2.0",
5
  "description":"Romanian pipeline optimized for CPU. Components: tok2vec, tagger, parser, senter, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"CC BY-SA 4.0",
10
+ "spacy_version":">=3.2.0,<3.3.0",
11
+ "spacy_git_version":"bb26550e2",
12
  "vectors":{
13
  "width":300,
14
  "vectors":500000,
 
30
  "Afp",
31
  "Afp-p-n",
32
  "Afp-poy",
33
+ "Afp-srn",
34
  "Afpf--n",
35
  "Afpfp-n",
36
  "Afpfp-ny",
 
132
  "Ds2ms-s",
133
  "Ds3---p",
134
  "Ds3---s",
135
+ "Ds3---sy",
136
  "Ds3fp-s",
137
  "Ds3fsos",
138
  "Ds3fsrs",
 
161
  "LSQR",
162
  "LT",
163
  "M",
 
164
  "Mc-p-d",
165
  "Mc-p-l",
166
+ "Mc-s-b",
167
+ "Mc-s-d",
168
+ "Mc-s-l",
169
  "Mcfp-l",
170
  "Mcfp-ln",
171
  "Mcfprln",
172
  "Mcfprly",
173
  "Mcfsoln",
174
+ "Mcfsrl",
175
  "Mcfsrln",
176
+ "Mcfsrly",
177
  "Mcmp-l",
178
  "Mcms-ln",
179
  "Mcmsrl",
180
+ "Mcmsrln",
181
  "Mcmsrly",
182
  "Mffprln",
183
  "Mffsrln",
 
250
  "Pd3mpr--y",
251
  "Pd3mso",
252
  "Pd3msr",
 
253
  "Pi3--r",
254
  "Pi3-po",
255
  "Pi3-so",
 
295
  "Pp3-po--------s",
296
  "Pp3-sd--------w",
297
  "Pp3-sd--y-----w",
298
+ "Pp3-so--------s",
299
  "Pp3fpa--------w",
300
  "Pp3fpa--y-----w",
301
  "Pp3fpr--------s",
 
322
  "Ps2fp-s",
323
  "Ps2fsrp",
324
  "Ps2fsrs",
 
325
  "Ps3---p",
326
  "Ps3---s",
327
  "Ps3fp-s",
 
354
  "RPAR",
355
  "RSQR",
356
  "Rc",
 
357
  "Rgp",
358
  "Rgpy",
359
  "Rgs",
 
411
  "Va--3s",
412
  "Va--3s----y",
413
  "Vag",
414
+ "Vag-------y",
415
  "Vaii1",
416
  "Vaii2s",
417
  "Vaii3p",
 
481
  "Vmp--sm",
482
  "Vmp--sm---y",
483
  "Vmsp1p",
484
+ "Vmsp2p",
485
  "Vmsp2s",
486
  "Vmsp3",
487
  "Vmsp3-----y",
 
494
  "Ynmsoy",
495
  "Ynmsry",
496
  "Yp",
497
+ "Yp,Yn",
498
  "Yp-sr",
499
  "Yr"
500
  ],
 
532
  "iobj",
533
  "mark",
534
  "nmod",
 
 
535
  "nmod:tmod",
536
  "nsubj",
537
  "nsubj:pass",
538
  "nummod",
539
  "obj",
540
  "obl",
541
+ "obl:agent",
542
+ "obl:pmod",
543
  "orphan",
544
  "parataxis",
545
  "punct",
 
597
  ],
598
  "performance":{
599
  "token_acc":0.9990029326,
600
+ "token_p":0.9967350492,
601
+ "token_r":0.9957244934,
602
+ "token_f":0.9959492157,
603
+ "tag_acc":0.9664291788,
604
+ "sents_p":0.954787234,
605
+ "sents_r":0.954787234,
606
+ "sents_f":0.954787234,
607
+ "dep_uas":0.8897462438,
608
+ "dep_las":0.8389686971,
609
+ "dep_las_per_type":{
610
+ "root":{
611
+ "p":0.8786231884,
612
+ "r":0.9133709981,
613
+ "f":0.8956602031
 
 
 
 
614
  },
615
+ "mark":{
616
+ "p":0.9288389513,
617
+ "r":0.9358490566,
618
+ "f":0.9323308271
619
  },
620
+ "case":{
621
+ "p":0.9638554217,
622
+ "r":0.959880015,
623
+ "f":0.9618636107
624
  },
625
+ "nmod:tmod":{
626
+ "p":0.6842105263,
627
+ "r":0.1092436975,
628
+ "f":0.1884057971
629
  },
630
+ "amod":{
631
+ "p":0.9172297297,
632
+ "r":0.9250425894,
633
+ "f":0.9211195929
634
  },
635
+ "nsubj":{
636
+ "p":0.8803986711,
637
+ "r":0.8372827804,
638
+ "f":0.8582995951
639
  },
640
+ "nmod":{
641
+ "p":0.8218838527,
642
+ "r":0.8286326312,
643
+ "f":0.8252444444
644
  },
645
+ "aux":{
646
+ "p":0.9867924528,
647
+ "r":0.9561243144,
648
+ "f":0.9712163417
649
  },
650
+ "advcl":{
651
+ "p":0.5862068966,
652
+ "r":0.6390977444,
653
+ "f":0.6115107914
654
  },
655
+ "obj":{
656
+ "p":0.8326180258,
657
+ "r":0.896073903,
658
+ "f":0.8631813126
659
  },
660
+ "det":{
661
+ "p":0.9575688073,
662
+ "r":0.9456398641,
663
+ "f":0.9515669516
664
  },
665
+ "cc":{
666
+ "p":0.9340425532,
667
+ "r":0.9164926931,
668
+ "f":0.9251844046
669
  },
670
+ "conj":{
671
+ "p":0.6115288221,
672
+ "r":0.5654692932,
673
+ "f":0.5875978326
674
  },
675
+ "nummod":{
676
+ "p":0.887675507,
677
+ "r":0.8835403727,
678
+ "f":0.8856031128
679
  },
680
+ "acl":{
681
+ "p":0.8063583815,
682
+ "r":0.7209302326,
683
+ "f":0.761255116
684
  },
685
+ "advmod":{
686
+ "p":0.8117048346,
687
+ "r":0.8416886544,
688
+ "f":0.8264248705
689
  },
690
+ "obl":{
691
+ "p":0.6821052632,
692
+ "r":0.8223350254,
693
+ "f":0.7456846951
694
  },
695
+ "expl:pass":{
696
+ "p":0.8085106383,
697
+ "r":0.7037037037,
698
+ "f":0.7524752475
699
  },
700
+ "nsubj:pass":{
701
+ "p":0.8,
702
+ "r":0.756097561,
703
+ "f":0.7774294671
704
  },
705
+ "fixed":{
706
+ "p":0.9,
707
+ "r":0.8562367865,
708
+ "f":0.8775731311
709
  },
710
+ "appos":{
711
+ "p":0.4956896552,
712
+ "r":0.4389312977,
713
+ "f":0.4655870445
714
  },
715
+ "parataxis":{
716
+ "p":0.1627906977,
717
+ "r":0.2,
718
+ "f":0.1794871795
719
  },
720
+ "aux:pass":{
721
+ "p":0.9125,
722
+ "r":0.9733333333,
723
+ "f":0.9419354839
724
+ },
725
+ "nmod:agent":{
726
  "p":0.0,
727
  "r":0.0,
728
  "f":0.0
 
 
 
 
 
 
 
729
  },
730
+ "ccomp":{
731
+ "p":0.8759689922,
732
+ "r":0.8759689922,
733
+ "f":0.8759689922
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
734
  },
735
+ "nmod:pmod":{
736
  "p":0.0,
737
  "r":0.0,
738
  "f":0.0
739
  },
740
+ "iobj":{
741
+ "p":0.8157894737,
742
+ "r":0.7654320988,
743
+ "f":0.7898089172
 
 
 
 
 
744
  },
745
  "flat":{
746
+ "p":0.7557251908,
747
+ "r":0.7815789474,
748
+ "f":0.7684346701
749
  },
750
+ "cop":{
751
+ "p":0.8524590164,
752
+ "r":0.8387096774,
753
+ "f":0.8455284553
754
  },
755
+ "csubj":{
756
+ "p":0.8235294118,
757
+ "r":0.6666666667,
758
+ "f":0.7368421053
759
  },
760
+ "obl:agent":{
761
+ "p":0.0,
762
+ "r":0.0,
763
+ "f":0.0
764
  },
765
+ "dep":{
766
+ "p":0.0,
767
+ "r":0.0,
768
+ "f":0.0
769
  },
770
  "expl:pv":{
771
+ "p":0.7564102564,
772
+ "r":0.8550724638,
773
+ "f":0.8027210884
 
 
 
 
 
 
 
 
 
 
774
  },
775
+ "expl":{
776
+ "p":0.6875,
777
+ "r":0.8148148148,
778
+ "f":0.7457627119
 
 
 
 
 
 
 
 
 
 
779
  },
780
+ "obl:pmod":{
781
+ "p":0.0,
782
+ "r":0.0,
783
+ "f":0.0
784
  },
785
  "expl:poss":{
786
+ "p":0.9655172414,
787
+ "r":0.9032258065,
788
+ "f":0.9333333333
789
  },
790
+ "goeswith":{
 
 
 
 
 
791
  "p":0.0,
792
  "r":0.0,
793
  "f":0.0
794
  },
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
795
  "xcomp":{
796
+ "p":0.5806451613,
797
+ "r":0.6666666667,
798
+ "f":0.6206896552
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
799
  },
800
+ "orphan":{
801
  "p":0.0,
802
  "r":0.0,
803
  "f":0.0
804
  },
805
+ "expl:impers":{
806
+ "p":1.0,
807
+ "r":0.3333333333,
808
+ "f":0.5
809
  },
810
+ "csubj:pass":{
811
  "p":0.0,
812
  "r":0.0,
813
  "f":0.0
814
  },
 
 
 
 
 
 
 
 
 
 
815
  "compound":{
816
+ "p":0.5714285714,
817
+ "r":0.5714285714,
818
+ "f":0.5714285714
819
  },
820
+ "list":{
821
  "p":0.0,
822
  "r":0.0,
823
  "f":0.0
824
  },
825
+ "ccomp:pmod":{
 
 
 
 
 
826
  "p":0.25,
827
  "r":0.3333333333,
828
  "f":0.2857142857
829
  },
830
+ "cc:preconj":{
831
  "p":0.0,
832
  "r":0.0,
833
  "f":0.0
834
+ }
835
+ },
836
+ "pos_acc":0.9405873228,
837
+ "morph_acc":0.9510657636,
838
+ "morph_micro_p":0.9896160458,
839
+ "morph_micro_r":0.9582489383,
840
+ "morph_micro_f":0.9706797273,
841
+ "morph_per_feat":{
842
+ "Case":{
843
+ "p":0.9938697318,
844
+ "r":0.9896985883,
845
+ "f":0.9917797744
846
+ },
847
+ "Gender":{
848
+ "p":0.991821842,
849
+ "r":0.9854981873,
850
+ "f":0.9886499028
851
+ },
852
+ "Number":{
853
+ "p":0.9894903379,
854
+ "r":0.922363847,
855
+ "f":0.9547486643
856
+ },
857
+ "Person":{
858
+ "p":0.9911452184,
859
+ "r":0.9893930466,
860
+ "f":0.9902683574
861
+ },
862
+ "PronType":{
863
+ "p":0.9965349965,
864
+ "r":0.993780235,
865
+ "f":0.9951557093
866
+ },
867
+ "Polarity":{
868
+ "p":0.9918566775,
869
+ "r":0.9983606557,
870
+ "f":0.9950980392
871
+ },
872
+ "AdpType":{
873
+ "p":0.998982706,
874
+ "r":0.9969543147,
875
+ "f":0.9979674797
876
+ },
877
+ "Definite":{
878
+ "p":0.9886490807,
879
+ "r":0.9815873016,
880
+ "f":0.9851055356
881
+ },
882
+ "Degree":{
883
+ "p":0.9582772544,
884
+ "r":0.9563465413,
885
+ "f":0.9573109244
886
+ },
887
+ "VerbForm":{
888
+ "p":0.9774236388,
889
+ "r":0.9787234043,
890
+ "f":0.9780730897
891
+ },
892
+ "Abbr":{
893
+ "p":0.9538461538,
894
+ "r":0.8303571429,
895
+ "f":0.8878281623
896
+ },
897
+ "Poss":{
898
+ "p":1.0,
899
+ "r":0.9927710843,
900
+ "f":0.9963724305
901
+ },
902
+ "NumForm":{
903
+ "p":0.9871794872,
904
+ "r":0.3181818182,
905
+ "f":0.48125
906
+ },
907
+ "NumType":{
908
+ "p":0.9872881356,
909
+ "r":0.3200549451,
910
+ "f":0.4834024896
911
+ },
912
+ "Reflex":{
913
+ "p":1.0,
914
+ "r":1.0,
915
+ "f":1.0
916
+ },
917
+ "Strength":{
918
+ "p":0.9920318725,
919
+ "r":0.9880952381,
920
+ "f":0.9900596421
921
+ },
922
+ "Mood":{
923
+ "p":0.972826087,
924
+ "r":0.9853211009,
925
+ "f":0.9790337284
926
  },
927
+ "Tense":{
928
+ "p":0.9725036179,
929
+ "r":0.976744186,
930
+ "f":0.9746192893
931
+ },
932
+ "Variant":{
933
+ "p":0.9932885906,
934
+ "r":0.9548387097,
935
+ "f":0.9736842105
936
+ },
937
+ "Position":{
938
+ "p":1.0,
939
+ "r":0.9910714286,
940
+ "f":0.9955156951
941
+ },
942
+ "Number[psor]":{
943
+ "p":1.0,
944
+ "r":0.9666666667,
945
+ "f":0.9830508475
946
+ },
947
+ "PartType":{
948
+ "p":1.0,
949
+ "r":0.9459459459,
950
+ "f":0.9722222222
951
+ },
952
+ "Foreign":{
953
  "p":0.0,
954
  "r":0.0,
955
  "f":0.0
956
  }
957
  },
958
+ "lemma_acc":0.8183070924,
959
+ "ents_p":0.7550713749,
960
+ "ents_r":0.7721859393,
961
+ "ents_f":0.7635327635,
962
  "ents_per_type":{
963
  "DATETIME":{
964
+ "p":0.7818791946,
965
+ "r":0.8118466899,
966
+ "f":0.7965811966
967
  },
968
  "ORGANIZATION":{
969
+ "p":0.7076923077,
970
+ "r":0.7324840764,
971
+ "f":0.7198748044
972
  },
973
  "FACILITY":{
974
+ "p":0.5039370079,
975
+ "r":0.4885496183,
976
+ "f":0.496124031
977
+ },
978
+ "PRODUCT":{
979
+ "p":0.5590551181,
980
+ "r":0.5182481752,
981
+ "f":0.5378787879
982
  },
983
  "NUMERIC_VALUE":{
984
+ "p":0.8875502008,
985
+ "r":0.936440678,
986
+ "f":0.9113402062
987
  },
988
  "ORDINAL":{
989
+ "p":0.8214285714,
990
  "r":0.8363636364,
991
+ "f":0.8288288288
992
  },
993
  "EVENT":{
994
+ "p":0.5151515152,
995
+ "r":0.4594594595,
996
+ "f":0.4857142857
997
  },
998
  "GPE":{
999
+ "p":0.8636363636,
1000
+ "r":0.8735632184,
1001
+ "f":0.8685714286
1002
  },
1003
  "PERSON":{
1004
+ "p":0.7046153846,
1005
+ "r":0.7684563758,
1006
+ "f":0.735152488
1007
  },
1008
  "NAT_REL_POL":{
1009
+ "p":0.9315068493,
1010
  "r":0.9066666667,
1011
+ "f":0.9189189189
1012
  },
1013
  "MONEY":{
1014
+ "p":0.9622641509,
1015
+ "r":0.8793103448,
1016
+ "f":0.9189189189
 
 
 
 
 
1017
  },
1018
  "LOC":{
1019
+ "p":0.4864864865,
1020
+ "r":0.4736842105,
1021
+ "f":0.48
1022
  },
1023
  "WORK_OF_ART":{
1024
+ "p":0.3571428571,
1025
+ "r":0.2631578947,
1026
+ "f":0.303030303
1027
  },
1028
  "QUANTITY":{
1029
+ "p":0.962962963,
1030
+ "r":1.0,
1031
+ "f":0.9811320755
 
 
 
 
 
1032
  },
1033
  "LANGUAGE":{
1034
+ "p":0.6666666667,
1035
+ "r":1.0,
1036
+ "f":0.8
1037
+ },
1038
+ "PERIOD":{
1039
+ "p":0.8648648649,
1040
+ "r":0.7619047619,
1041
+ "f":0.8101265823
1042
  }
1043
+ },
1044
+ "speed":7699.716829035
1045
  },
1046
  "sources":[
1047
  {
 
1051
  "author":"Michal M\u011bchura"
1052
  },
1053
  {
1054
+ "name":"UD Romanian RRT v2.8",
1055
  "url":"https://github.com/UniversalDependencies/UD_Romanian-RRT",
1056
  "license":"CC BY-SA 4.0",
1057
  "author":"Barbu Mititelu, Verginica; Irimia, Elena; Perez, Cenel-Augusto; Ion, Radu; Simionescu, Radu; Popel, Martin"
ner/model CHANGED
Binary files a/ner/model and b/ner/model differ
 
parser/model CHANGED
Binary files a/parser/model and b/parser/model differ
 
parser/moves CHANGED
@@ -1 +1 @@
1
- ��moves�{"0":{"":85972},"1":{"":90580},"2":{"case":22318,"punct":9077,"det":9009,"nsubj":7125,"advmod":6350,"cc":5364,"mark":5291,"aux":4018,"obl":2015,"nummod":1880,"expl:pv":1798,"cop":1706,"amod":1376,"aux:pass":1369,"nsubj:pass":963,"expl:pass":909,"parataxis":877,"obj":866,"advcl":710,"iobj":567,"expl:poss":464,"expl":390,"nmod":204,"nsubj||csubj":154,"nmod:tmod":152,"expl:impers":102,"xcomp":97,"advmod:tmod":85,"nmod:pmod":74,"cc:preconj":63,"csubj":58,"nsubj:pass||csubj":57,"obj||ccomp":44,"orphan":32,"advcl:tcl":30,"dep":0},"3":{"nmod":16696,"punct":14423,"amod":9673,"obl":7745,"conj":7281,"fixed":5595,"obj":5457,"acl":4102,"advmod":2145,"advcl":2043,"ccomp":1929,"nummod":1646,"nsubj":1278,"nmod:pmod":1208,"flat":1160,"det":1031,"appos":915,"xcomp":886,"iobj":804,"nmod:agent":718,"csubj":626,"nsubj:pass":546,"case":442,"parataxis":426,"nmod:tmod":286,"goeswith":245,"ccomp:pmod":174,"cc":124,"cop":100,"expl:pv":86,"expl":55,"advcl:tcl":52,"compound":50,"csubj:pass":49,"expl:poss":36,"vocative":31,"dep":0},"4":{"ROOT":8021}}�cfg��neg_key�
 
1
+ ��moves� {"0":{"":86134},"1":{"":90421},"2":{"case":22293,"punct":9078,"det":9035,"nsubj":7080,"advmod":6417,"mark":5380,"cc":5367,"aux":4002,"obl":2028,"nummod":1887,"expl:pv":1796,"cop":1712,"aux:pass":1372,"amod":1370,"nsubj:pass":1013,"expl:pass":910,"parataxis":878,"obj":868,"advcl":713,"iobj":564,"expl:poss":469,"expl":393,"nmod":203,"nsubj||csubj":155,"nmod:tmod":153,"expl:impers":102,"xcomp":97,"advmod:tmod":84,"obl:pmod":74,"cc:preconj":63,"csubj":59,"nsubj:pass||csubj":57,"obj||ccomp":45,"orphan":32,"advcl:tcl":30,"dep":0},"3":{"nmod":16696,"punct":14500,"amod":9699,"obl":7775,"conj":7286,"fixed":5485,"obj":5462,"acl":4105,"advmod":2099,"advcl":2049,"ccomp":1932,"nummod":1667,"nsubj":1280,"obl:pmod":1208,"flat":1167,"det":1035,"appos":915,"xcomp":891,"iobj":803,"obl:agent":719,"csubj":632,"nsubj:pass":554,"parataxis":435,"case":434,"nmod:tmod":283,"ccomp:pmod":178,"cc":123,"cop":100,"expl:pv":86,"goeswith":72,"expl":55,"compound":52,"advcl:tcl":52,"csubj:pass":49,"expl:poss":35,"vocative":31,"dep":0},"4":{"ROOT":8021}}�cfg��neg_key�
ro_core_news_lg-any-py3-none-any.whl CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:750b5b5b0dad8fb1b0afc41dff5e52640545d643bee77be5c16b40d364a049c7
3
- size 571621040
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9d6e1a278aaa42bc7109e82f03fc9c7fee59b0d71d0d759f783c1057815553bb
3
+ size 572291519
senter/cfg CHANGED
@@ -1,3 +1,3 @@
1
  {
2
-
3
  }
 
1
  {
2
+ "overwrite":false
3
  }
senter/model CHANGED
Binary files a/senter/model and b/senter/model differ
 
tagger/cfg CHANGED
@@ -10,6 +10,7 @@
10
  "Afp",
11
  "Afp-p-n",
12
  "Afp-poy",
 
13
  "Afpf--n",
14
  "Afpfp-n",
15
  "Afpfp-ny",
@@ -111,6 +112,7 @@
111
  "Ds2ms-s",
112
  "Ds3---p",
113
  "Ds3---s",
 
114
  "Ds3fp-s",
115
  "Ds3fsos",
116
  "Ds3fsrs",
@@ -139,18 +141,23 @@
139
  "LSQR",
140
  "LT",
141
  "M",
142
- "Mc",
143
  "Mc-p-d",
144
  "Mc-p-l",
 
 
 
145
  "Mcfp-l",
146
  "Mcfp-ln",
147
  "Mcfprln",
148
  "Mcfprly",
149
  "Mcfsoln",
 
150
  "Mcfsrln",
 
151
  "Mcmp-l",
152
  "Mcms-ln",
153
  "Mcmsrl",
 
154
  "Mcmsrly",
155
  "Mffprln",
156
  "Mffsrln",
@@ -223,7 +230,6 @@
223
  "Pd3mpr--y",
224
  "Pd3mso",
225
  "Pd3msr",
226
- "Pi3",
227
  "Pi3--r",
228
  "Pi3-po",
229
  "Pi3-so",
@@ -269,6 +275,7 @@
269
  "Pp3-po--------s",
270
  "Pp3-sd--------w",
271
  "Pp3-sd--y-----w",
 
272
  "Pp3fpa--------w",
273
  "Pp3fpa--y-----w",
274
  "Pp3fpr--------s",
@@ -295,7 +302,6 @@
295
  "Ps2fp-s",
296
  "Ps2fsrp",
297
  "Ps2fsrs",
298
- "Ps2ms-s",
299
  "Ps3---p",
300
  "Ps3---s",
301
  "Ps3fp-s",
@@ -328,7 +334,6 @@
328
  "RPAR",
329
  "RSQR",
330
  "Rc",
331
- "Rgc",
332
  "Rgp",
333
  "Rgpy",
334
  "Rgs",
@@ -386,6 +391,7 @@
386
  "Va--3s",
387
  "Va--3s----y",
388
  "Vag",
 
389
  "Vaii1",
390
  "Vaii2s",
391
  "Vaii3p",
@@ -455,7 +461,7 @@
455
  "Vmp--sm",
456
  "Vmp--sm---y",
457
  "Vmsp1p",
458
- "Vmsp1s",
459
  "Vmsp2s",
460
  "Vmsp3",
461
  "Vmsp3-----y",
@@ -468,7 +474,9 @@
468
  "Ynmsoy",
469
  "Ynmsry",
470
  "Yp",
 
471
  "Yp-sr",
472
  "Yr"
473
- ]
 
474
  }
 
10
  "Afp",
11
  "Afp-p-n",
12
  "Afp-poy",
13
+ "Afp-srn",
14
  "Afpf--n",
15
  "Afpfp-n",
16
  "Afpfp-ny",
 
112
  "Ds2ms-s",
113
  "Ds3---p",
114
  "Ds3---s",
115
+ "Ds3---sy",
116
  "Ds3fp-s",
117
  "Ds3fsos",
118
  "Ds3fsrs",
 
141
  "LSQR",
142
  "LT",
143
  "M",
 
144
  "Mc-p-d",
145
  "Mc-p-l",
146
+ "Mc-s-b",
147
+ "Mc-s-d",
148
+ "Mc-s-l",
149
  "Mcfp-l",
150
  "Mcfp-ln",
151
  "Mcfprln",
152
  "Mcfprly",
153
  "Mcfsoln",
154
+ "Mcfsrl",
155
  "Mcfsrln",
156
+ "Mcfsrly",
157
  "Mcmp-l",
158
  "Mcms-ln",
159
  "Mcmsrl",
160
+ "Mcmsrln",
161
  "Mcmsrly",
162
  "Mffprln",
163
  "Mffsrln",
 
230
  "Pd3mpr--y",
231
  "Pd3mso",
232
  "Pd3msr",
 
233
  "Pi3--r",
234
  "Pi3-po",
235
  "Pi3-so",
 
275
  "Pp3-po--------s",
276
  "Pp3-sd--------w",
277
  "Pp3-sd--y-----w",
278
+ "Pp3-so--------s",
279
  "Pp3fpa--------w",
280
  "Pp3fpa--y-----w",
281
  "Pp3fpr--------s",
 
302
  "Ps2fp-s",
303
  "Ps2fsrp",
304
  "Ps2fsrs",
 
305
  "Ps3---p",
306
  "Ps3---s",
307
  "Ps3fp-s",
 
334
  "RPAR",
335
  "RSQR",
336
  "Rc",
 
337
  "Rgp",
338
  "Rgpy",
339
  "Rgs",
 
391
  "Va--3s",
392
  "Va--3s----y",
393
  "Vag",
394
+ "Vag-------y",
395
  "Vaii1",
396
  "Vaii2s",
397
  "Vaii3p",
 
461
  "Vmp--sm",
462
  "Vmp--sm---y",
463
  "Vmsp1p",
464
+ "Vmsp2p",
465
  "Vmsp2s",
466
  "Vmsp3",
467
  "Vmsp3-----y",
 
474
  "Ynmsoy",
475
  "Ynmsry",
476
  "Yp",
477
+ "Yp,Yn",
478
  "Yp-sr",
479
  "Yr"
480
+ ],
481
+ "overwrite":false
482
  }
tagger/model CHANGED
Binary files a/tagger/model and b/tagger/model differ
 
tok2vec/model CHANGED
Binary files a/tok2vec/model and b/tok2vec/model differ
 
tokenizer CHANGED
@@ -1,3 +1,3 @@
1
- ��prefix_search�
2
  ��A�
3
- � ��A� �'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�._.��A�._.�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�1-A��A�1-A�1-UL��A�1-UL�1-Ul��A�1-Ul�1-a��A�1-a�1-ul��A�1-ul�10-A��A�10-A�10-LEA��A�10-LEA�10-Lea��A�10-Lea�10-a��A�10-a�10-lea��A�10-lea�11-A��A�11-A�11-LEA��A�11-LEA�11-Lea��A�11-Lea�11-a��A�11-a�11-lea��A�11-lea�12-A��A�12-A�12-LEA��A�12-LEA�12-Lea��A�12-Lea�12-a��A�12-a�12-lea��A�12-lea�2-A��A�2-A�2-LEA��A�2-LEA�2-Lea��A�2-Lea�2-a��A�2-a�2-lea��A�2-lea�3-A��A�3-A�3-LEA��A�3-LEA�3-Lea��A�3-Lea�3-a��A�3-a�3-lea��A�3-lea�4-A��A�4-A�4-LEA��A�4-LEA�4-Lea��A�4-Lea�4-a��A�4-a�4-lea��A�4-lea�5-A��A�5-A�5-LEA��A�5-LEA�5-Lea��A�5-Lea�5-a��A�5-a�5-lea��A�5-lea�6-A��A�6-A�6-LEA��A�6-LEA�6-Lea��A�6-Lea�6-a��A�6-a�6-lea��A�6-lea�7-A��A�7-A�7-LEA��A�7-LEA�7-Lea��A�7-Lea�7-a��A�7-a�7-lea��A�7-lea�8)��A�8)�8-)��A�8-)�8-A��A�8-A�8-D��A�8-D�8-LEA��A�8-LEA�8-Lea��A�8-Lea�8-a��A�8-a�8-lea��A�8-lea�8D��A�8D�9-A��A�9-A�9-LEA��A�9-LEA�9-Lea��A�9-Lea�9-a��A�9-a�9-lea��A�9-lea�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=[��A�=[�=]��A�=]�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o��A�>:o�><(((*>��A�><(((*>�@_@��A�@_@�A.C.��A�A.C.�A.F.��A�A.F.�A.M.��A�A.M.�A.R.��A�A.R.�AL.��A�AL.�ALIN.��A�ALIN.�ART.��A�ART.�AUG.��A�AUG.�Al.��A�Al.�Alin.��A�Alin.�Art.��A�Art.�Aug.��A�Aug.�BD.��A�BD.�Bd.��A�Bd.�C++��A�C++�D-L��A�D-L�D-LUI��A�D-LUI�D-Lui��A�D-Lui�D-NEI��A�D-NEI�D-Nei��A�D-Nei�D-VOASTRA��A�D-VOASTRA�D-VOASTRĂ��A�D-VOASTRĂ�D-Voastra��A�D-Voastra�D-Voastră��A�D-Voastră�D.P.D.V.��A�D.P.D.V.�DEM.��A�DEM.�DPDV��A�DPDV�DR.��A�DR.�DVS.��A�DVS.�Dem.��A�Dem.�Dpdv��A�Dpdv�Dr.��A�Dr.�Dvs.��A�Dvs.�ETC.��A�ETC.�EX.��A�EX.�Etc.��A�Etc.�Ex.��A�Ex.�FIG.��A�FIG.�FR.��A�FR.�Fig.��A�Fig.�Fr.��A�Fr.�GH.��A�GH.�GR.��A�GR.�Gh.��A�Gh.�Gr.��A�Gr.�IAN.��A�IAN.�ING.��A�ING.�INGR.��A�INGR.�INTR-ADEVAR��A�INTR-ADEVAR�INTR-ADEVĂR��A�INTR-ADEVĂR�Ian.��A�Ian.�Ing.��A�Ing.�Ingr.��A�Ingr.�Intr-Adevar��A�Intr-Adevar�Intr-Adevăr��A�Intr-Adevăr�LIT.��A�LIT.�LT.��A�LT.�Lit.��A�Lit.�Lt.��A�Lt.�NR.��A�NR.�Nr.��A�Nr.�O.O��A�O.O�O.o��A�O.o�OBS.��A�OBS.�O_O��A�O_O�O_o��A�O_o�Obs.��A�Obs.�P.A.��A�P.A.�P.M.��A�P.M.�PCT.��A�PCT.�PREP.��A�PREP.�PROF.��A�PROF.�Pct.��A�Pct.�Prep.��A�Prep.�Prof.��A�Prof.�ROM.��A�ROM.�Rom.��A�Rom.�S.A.��A�S.A.�S.A.M.D.��A�S.A.M.D.�SAMD.��A�SAMD.�SF.��A�SF.�ST.��A�ST.�STR.��A�STR.�Samd.��A�Samd.�Sf.��A�Sf.�St.��A�St.�Str.��A�Str.�TEL.��A�TEL.�Tel.��A�Tel.�UNIV.��A�UNIV.�Univ.��A�Univ.�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�[=��A�[=�\")��A�\")�\n��A�\n�\t��A�\t�]=��A�]=�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�a.��A�a.�a.c.��A�a.c.�a.f.��A�a.f.�a.m.��A�a.m.�a.r.��A�a.r.�al.��A�al.�alin.��A�alin.�art.��A�art.�aug.��A�aug.�b.��A�b.�bd.��A�bd.�c.��A�c.�d-l��A�d-l�d-lui��A�d-lui�d-nei��A�d-nei�d-voastra��A�d-voastra�d-voastră��A�d-voastră�d.��A�d.�d.p.d.v.��A�d.p.d.v.�dem.��A�dem.�dpdv��A�dpdv�dr.��A�dr.�dvs.��A�dvs.�e.��A�e.�etc.��A�etc.�ex.��A�ex.�f.��A�f.�fig.��A�fig.�fr.��A�fr.�g.��A�g.�gh.��A�gh.�gr.��A�gr.�h.��A�h.�i.��A�i.�ian.��A�ian.�ing.��A�ing.�ingr.��A�ingr.�intr-adevar��A�intr-adevar�intr-adevăr��A�intr-adevăr�j.��A�j.�k.��A�k.�l.��A�l.�lit.��A�lit.�lt.��A�lt.�m.��A�m.�n.��A�n.�nr.��A�nr.�o.��A�o.�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�obs.��A�obs.�p.��A�p.�p.a.��A�p.a.�p.m.��A�p.m.�pct.��A�pct.�prep.��A�prep.�prof.��A�prof.�q.��A�q.�r.��A�r.�rom.��A�rom.�s.��A�s.�s.a.��A�s.a.�s.a.m.d.��A�s.a.m.d.�samd.��A�samd.�sf.��A�sf.�st.��A�st.�str.��A�str.�t.��A�t.�tel.��A�tel.�u.��A�u.�univ.��A�univ.�v.��A�v.�v.v��A�v.v�v_v��A�v_v�w.��A�w.�x.��A�x.�xD��A�xD�xDD��A�xDD�y.��A�y.�z.��A�z.� ��A� C� �¯\(ツ)/¯��A�¯\(ツ)/¯�ÎNGR.��A�ÎNGR.�ÎNTR-ADEVAR��A�ÎNTR-ADEVAR�ÎNTR-ADEVĂR��A�ÎNTR-ADEVĂR�Îngr.��A�Îngr.�Într-Adevar��A�Într-Adevar�Într-Adevăr��A�Într-Adevăr�ä.��A�ä.�îngr.��A�îngr.�într-adevar��A�într-adevar�într-adevăr��A�într-adevăr�ö.��A�ö.�ü.��A�ü.�Ş.A.��A�Ş.A.�Ş.A.M.D.��A�Ş.A.M.D.�ŞAMD.��A�ŞAMD.�ŞT.��A�ŞT.�Şamd.��A�Şamd.�Şt.��A�Şt.�ş.a.��A�ş.a.�ş.a.m.d.��A�ş.a.m.d.�şamd.��A�şamd.�şt.��A�şt.�Ș.A.��A�Ș.A.�Ș.A.M.D.��A�Ș.A.M.D.�ȘAMD.��A�ȘAMD.�ȘT.��A�ȘT.�Șamd.��A�Șamd.�Șt.��A�Șt.�ș.a.��A�ș.a.�ș.a.m.d.��A�ș.a.m.d.�șamd.��A�șamd.�șt.��A�șt.�ಠ_ಠ��A�ಠ_ಠ�ಠ︵ಠ��A�ಠ︵ಠ�—��A�—�’��A�’�’’��A�’’
 
1
+ ��prefix_search�
2
  ��A�
3
+ � ��A� �'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�._.��A�._.�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�1-A��A�1-A�1-UL��A�1-UL�1-Ul��A�1-Ul�1-a��A�1-a�1-ul��A�1-ul�10-A��A�10-A�10-LEA��A�10-LEA�10-Lea��A�10-Lea�10-a��A�10-a�10-lea��A�10-lea�11-A��A�11-A�11-LEA��A�11-LEA�11-Lea��A�11-Lea�11-a��A�11-a�11-lea��A�11-lea�12-A��A�12-A�12-LEA��A�12-LEA�12-Lea��A�12-Lea�12-a��A�12-a�12-lea��A�12-lea�2-A��A�2-A�2-LEA��A�2-LEA�2-Lea��A�2-Lea�2-a��A�2-a�2-lea��A�2-lea�3-A��A�3-A�3-LEA��A�3-LEA�3-Lea��A�3-Lea�3-a��A�3-a�3-lea��A�3-lea�4-A��A�4-A�4-LEA��A�4-LEA�4-Lea��A�4-Lea�4-a��A�4-a�4-lea��A�4-lea�5-A��A�5-A�5-LEA��A�5-LEA�5-Lea��A�5-Lea�5-a��A�5-a�5-lea��A�5-lea�6-A��A�6-A�6-LEA��A�6-LEA�6-Lea��A�6-Lea�6-a��A�6-a�6-lea��A�6-lea�7-A��A�7-A�7-LEA��A�7-LEA�7-Lea��A�7-Lea�7-a��A�7-a�7-lea��A�7-lea�8)��A�8)�8-)��A�8-)�8-A��A�8-A�8-D��A�8-D�8-LEA��A�8-LEA�8-Lea��A�8-Lea�8-a��A�8-a�8-lea��A�8-lea�8D��A�8D�9-A��A�9-A�9-LEA��A�9-LEA�9-Lea��A�9-Lea�9-a��A�9-a�9-lea��A�9-lea�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=[��A�=[�=]��A�=]�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o��A�>:o�><(((*>��A�><(((*>�@_@��A�@_@�A.C.��A�A.C.�A.F.��A�A.F.�A.M.��A�A.M.�A.R.��A�A.R.�AL.��A�AL.�ALIN.��A�ALIN.�ART.��A�ART.�AUG.��A�AUG.�Al.��A�Al.�Alin.��A�Alin.�Art.��A�Art.�Aug.��A�Aug.�BD.��A�BD.�Bd.��A�Bd.�C++��A�C++�D-L��A�D-L�D-LUI��A�D-LUI�D-Lui��A�D-Lui�D-NEI��A�D-NEI�D-Nei��A�D-Nei�D-VOASTRA��A�D-VOASTRA�D-VOASTRĂ��A�D-VOASTRĂ�D-Voastra��A�D-Voastra�D-Voastră��A�D-Voastră�D.P.D.V.��A�D.P.D.V.�DEM.��A�DEM.�DPDV��A�DPDV�DR.��A�DR.�DVS.��A�DVS.�Dem.��A�Dem.�Dpdv��A�Dpdv�Dr.��A�Dr.�Dvs.��A�Dvs.�ETC.��A�ETC.�EX.��A�EX.�Etc.��A�Etc.�Ex.��A�Ex.�FIG.��A�FIG.�FR.��A�FR.�Fig.��A�Fig.�Fr.��A�Fr.�GH.��A�GH.�GR.��A�GR.�Gh.��A�Gh.�Gr.��A�Gr.�IAN.��A�IAN.�ING.��A�ING.�INGR.��A�INGR.�INTR-ADEVAR��A�INTR-ADEVAR�INTR-ADEVĂR��A�INTR-ADEVĂR�Ian.��A�Ian.�Ing.��A�Ing.�Ingr.��A�Ingr.�Intr-Adevar��A�Intr-Adevar�Intr-Adevăr��A�Intr-Adevăr�LIT.��A�LIT.�LT.��A�LT.�Lit.��A�Lit.�Lt.��A�Lt.�NR.��A�NR.�Nr.��A�Nr.�O.O��A�O.O�O.o��A�O.o�OBS.��A�OBS.�O_O��A�O_O�O_o��A�O_o�Obs.��A�Obs.�P.A.��A�P.A.�P.M.��A�P.M.�PCT.��A�PCT.�PREP.��A�PREP.�PROF.��A�PROF.�Pct.��A�Pct.�Prep.��A�Prep.�Prof.��A�Prof.�ROM.��A�ROM.�Rom.��A�Rom.�S.A.��A�S.A.�S.A.M.D.��A�S.A.M.D.�SAMD.��A�SAMD.�SF.��A�SF.�ST.��A�ST.�STR.��A�STR.�Samd.��A�Samd.�Sf.��A�Sf.�St.��A�St.�Str.��A�Str.�TEL.��A�TEL.�Tel.��A�Tel.�UNIV.��A�UNIV.�Univ.��A�Univ.�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�[=��A�[=�\")��A�\")�\n��A�\n�\t��A�\t�]=��A�]=�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�a.��A�a.�a.c.��A�a.c.�a.f.��A�a.f.�a.m.��A�a.m.�a.r.��A�a.r.�al.��A�al.�alin.��A�alin.�art.��A�art.�aug.��A�aug.�b.��A�b.�bd.��A�bd.�c.��A�c.�d-l��A�d-l�d-lui��A�d-lui�d-nei��A�d-nei�d-voastra��A�d-voastra�d-voastră��A�d-voastră�d.��A�d.�d.p.d.v.��A�d.p.d.v.�dem.��A�dem.�dpdv��A�dpdv�dr.��A�dr.�dvs.��A�dvs.�e.��A�e.�etc.��A�etc.�ex.��A�ex.�f.��A�f.�fig.��A�fig.�fr.��A�fr.�g.��A�g.�gh.��A�gh.�gr.��A�gr.�h.��A�h.�i.��A�i.�ian.��A�ian.�ing.��A�ing.�ingr.��A�ingr.�intr-adevar��A�intr-adevar�intr-adevăr��A�intr-adevăr�j.��A�j.�k.��A�k.�l.��A�l.�lit.��A�lit.�lt.��A�lt.�m.��A�m.�n.��A�n.�nr.��A�nr.�o.��A�o.�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�obs.��A�obs.�p.��A�p.�p.a.��A�p.a.�p.m.��A�p.m.�pct.��A�pct.�prep.��A�prep.�prof.��A�prof.�q.��A�q.�r.��A�r.�rom.��A�rom.�s.��A�s.�s.a.��A�s.a.�s.a.m.d.��A�s.a.m.d.�samd.��A�samd.�sf.��A�sf.�st.��A�st.�str.��A�str.�t.��A�t.�tel.��A�tel.�u.��A�u.�univ.��A�univ.�v.��A�v.�v.v��A�v.v�v_v��A�v_v�w.��A�w.�x.��A�x.�xD��A�xD�xDD��A�xDD�y.��A�y.�z.��A�z.� ��A� C� �¯\(ツ)/¯��A�¯\(ツ)/¯�°C.��A�°�A�C�A�.�°F.��A�°�A�F�A�.�°K.��A�°�A�K�A�.�°c.��A�°�A�c�A�.�°f.��A�°�A�f�A�.�°k.��A�°�A�k�A�.�ÎNGR.��A�ÎNGR.�ÎNTR-ADEVAR��A�ÎNTR-ADEVAR�ÎNTR-ADEVĂR��A�ÎNTR-ADEVĂR�Îngr.��A�Îngr.�Într-Adevar��A�Într-Adevar�Într-Adevăr��A�Într-Adevăr�ä.��A�ä.�îngr.��A�îngr.�într-adevar��A�într-adevar�într-adevăr��A�într-adevăr�ö.��A�ö.�ü.��A�ü.�Ş.A.��A�Ş.A.�Ş.A.M.D.��A�Ş.A.M.D.�ŞAMD.��A�ŞAMD.�ŞT.��A�ŞT.�Şamd.��A�Şamd.�Şt.��A�Şt.�ş.a.��A�ş.a.�ş.a.m.d.��A�ş.a.m.d.�şamd.��A�şamd.�şt.��A�şt.�Ș.A.��A�Ș.A.�Ș.A.M.D.��A�Ș.A.M.D.�ȘAMD.��A�ȘAMD.�ȘT.��A�ȘT.�Șamd.��A�Șamd.�Șt.��A�Șt.�ș.a.��A�ș.a.�ș.a.m.d.��A�ș.a.m.d.�șamd.��A�șamd.�șt.��A�șt.�ಠ_ಠ��A�ಠ_ಠ�ಠ︵ಠ��A�ಠ︵ಠ�—��A�—�’��A�’�’’��A�’’
vocab/strings.json CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0442198e6d05377364bc6e0ce4f78c69ae3b1d2ee6feb4c1265384ca182a1dbb
3
- size 8420995
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4534edb1d1b8e8017538d692a57054e6179b5b351805c50502b2f0ef77b79ec7
3
+ size 10070837
vocab/vectors.cfg ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ {
2
+ "mode":"default"
3
+ }